%%{init: {'theme': 'neutral', 'themeVariables': { 'fontFamily': 'monospace', "fontSize":"16px"}}}%% flowchart TD Root(<code>/</code>) bin(<code>/bin</code>) etc(<code>/etc</code>) home(<code>/home</code>) usr(<code>/usr</code>) var(<code>/var</code>) tmp(<code>/tmp</code>) Root --> bin Root --> etc Root --> home Root --> usr Root --> var Root --> tmp
Directories
When presented with a new map, the most important thing to find is your location on it (it’s hard to know where you’re going without knowing where you are). In Linux, directories are more than just containers for files; they are a critical part of the hierarchical file system, which is organized in a tree-like structure starting from the root directory (/
).
Root Directory
The root directory is the topmost directory in the Linux file system hierarchy. All other directories and files are nested within it. It is represented by a single forward slash /
and contains critical system directories like /bin
, /etc
, /home
, and /var
.
Subdirectories
Beneath the root directory, the Linux file system is organized into various subdirectories, each serving specific purposes:
/bin
– Contains essential user command binaries
/etc
– Stores system-wide configuration files.
/home
– Houses personal directories for each user.
/var
– Contains variable data like logs and spool files.
In Linux systems, the ~
represents the user’s ‘home’ directory.
File paths
A file path is a character string specifying the unique location of a file or directory within the hierarchical file system. File paths can be absolute (starting from the root (/
) directory) or relative (starting from the current (.
) directory).
Manage
In the Linux world, file and directory management is a fundamental skill. This chapter introduces some common commands that will allow you to create, copy, move, remove, and link files and directories.
mkdir
mkdir
(Make Directory) builds a new folder wherever you tell it to, like making a new folder in data for inputs (data/in
) or outputs (data/out
)
mkdir data/in
mkdir data/out
Confirm with tree -d
(the -d
is for directories):
tree data -d
# data
# ├── in
# └── out
#
# 3 directories
cp
cp
duplicates files or folders. The cp
command is used to copy files or directories from one location to another. Imagine having a file (binary_data.tsv
) on your root (.
) directory that you want to copy to the /data/in
folder; you could use cp
to make a duplicate.
cp binary_data.tsv data/in/binary_data.tsv
Confirm with tree
tree data/in
# data/in
# └── binary_data.tsv
#
# 1 directory, 1 file
mv
mv
, short for move, moves files or directories from one location to another. We’ll use it to move data/binary_data.tsv
to data/out/binary_data.tsv
:
# move file
mv data/in/binary_data.tsv data/out/binary_data.tsv
Confirm move with tree
:1
tree data -P *.tsv
# data
# ├── in
# └── out
# └── binary_data.tsv
#
# 3 directories, 1 file
It can also be used for renaming files.
# rename file
mv data/out/binary_data.tsv data/out/bin_dat.tsv
# confirm rename
tree data/out
# data/out
# └── bin_dat.tsv
#
# 1 directory, 1 file
mv
is especially useful for organizing files and directories that are in the wrong place.
rm
The rm
command stands for remove and is used to delete files or directories.
# remove doc folder
rm data/out
# rm: data/out: is a directory
By default, it won’t remove a directory without the -R
or -r
option.
# add option
rm -R data/out
Recap
Given the principle that everything is a file in Linux, directories are treated as special types of files that contain references to other files and directories, facilitating the organization and management of data.
Understanding how to navigate, manipulate, and manage directories is essential for effective use of the Linux operating system.
The
-P *.tsv
option fortree
tells it to look indata
for files or folders with a.tsv
extension. We’ll cover wildcards and patterns in the Symbols & Patterns chapter.↩︎As quoted in Bioinformatics Data Skills: Reproducible and Robust Research with Open Source Tools (2015) by Vince Buffalo.↩︎