I want to iterate over a list of files. This list is the result of a find
command, so I came up with:
getlist() {
for f in $(find . -iname "foo*")
do
echo "File found: $f"
# do something useful
done
}
It's fine except if a file has spaces in its name:
$ ls
foo_bar_baz.txt
foo bar baz.txt
$ getlist
File found: foo_bar_baz.txt
File found: foo
File found: bar
File found: baz.txt
What can I do to avoid the split on spaces?
I really like for loops and array iteration, so I figure I will add this answer to the mix...
I also liked marchelbling's stupid file example. :)
Inside the test directory:
This adds each file listing line into a bash array named
arr
with any trailing newline removed.Let's say we want to give these files better names...
${!arr[@]} expands to 0 1 2 so "${arr[$i]}" is the ith element of the array. The quotes around the variables are important to preserve the spaces.
The result is three renamed files:
find
has an-exec
argument that loops over the find results and executes an arbitrary command. For example:Here
{}
represents the found files, and wrapping it in""
allows for the resultant shell command to deal with spaces in the file name.In many cases you can replace that last
\;
(which starts a new command) with a\+
, which will put multiple files in the one command (not necessarily all of them at once though, seeman find
for more details).There are several workable ways to accomplish this.
If you wanted to stick closely to your original version it could be done this way:
This will still fail if file names have literal newlines in them, but spaces will not break it.
However, messing with IFS isn't necessary. Here's my preferred way to do this:
If you find the
< <(command)
syntax unfamiliar you should read about process substitution. The advantage of this overfor file in $(find ...)
is that files with spaces, newlines and other characters are correctly handled. This works becausefind
with-print0
will use anull
(aka\0
) as the terminator for each file name and, unlike newline, null is not a legal character in a file name.The advantage to this over the nearly-equivalent version
Is that any variable assignment in the body of the while loop is preserved. That is, if you pipe to
while
as above then the body of thewhile
is in a subshell which may not be what you want.The advantage of the process substitution version over
find ... -print0 | xargs -0
is minimal: Thexargs
version is fine if all you need is to print a line or perform a single operation on the file, but if you need to perform multiple steps the loop version is easier.EDIT: Here's a nice test script so you can get an idea of the difference between different attempts at solving this problem
Since you aren't doing any other type of filtering with
find
, you can use the following as ofbash
4.0:The
**/
will match zero or more directories, so the full pattern will matchfoo*
in the current directory or any subdirectories.There is also a very simple solution: rely on bash globbing
Note that I am not sure this behavior is the default one but I don't see any special setting in my shopt so I would go and say that it should be "safe" (tested on osx and ubuntu).