Unix shell file copy flattening folder structure

2019-03-08 09:51发布

On the UNIX bash shell (specifically Mac OS X Leopard) what would be the simplest way to copy every file having a specific extension from a folder hierarchy (including subdirectories) to the same destination folder (without subfolders)?

Obviously there is the problem of having duplicates in the source hierarchy. I wouldn't mind if they are overwritten.

Example: I need to copy every .txt file in the following hierarchy

/foo/a.txt
/foo/x.jpg
/foo/bar/a.txt
/foo/bar/c.jpg
/foo/bar/b.txt

To a folder named 'dest' and get:

/dest/a.txt
/dest/b.txt

标签: unix shell
5条回答
▲ chillily
2楼-- · 2019-03-08 10:27

The answers above don't allow for name collisions as the asker didn't mind files being over-written.

I do mind files being over-written so came up with a different approach. Replacing each / in the path with - keep the hierarchy in the names, and puts all the files in one flat folder.

We use find to get the list of all files, then awk to create a mv command with the original filename and the modified filename then pass those to bash to be executed.

find ./from -type f | awk '{ str=$0; sub(/\.\//, "", str); gsub(/\//, "-", str); print "mv " $0 " ./to/" str }' | bash

where ./from and ./to are directories to mv from and to.

查看更多
成全新的幸福
3楼-- · 2019-03-08 10:28

The only problem with Magnus' solution is that it forks off a new "cp" process for every file, which is not terribly efficient especially if there is a large number of files.

On Linux (or other systems with GNU coreutils) you can do:

find . -name "*.xml" -print0 | xargs -0 echo cp -t a

(The -0 allows it to work when your filenames have weird characters -- like spaces -- in them.)

Unfortunately I think Macs come with BSD-style tools. Anyone know a "standard" equivalent to the "-t" switch?

查看更多
孤傲高冷的网名
4楼-- · 2019-03-08 10:32

As far as the man page for cp on a FreeBSD box goes, there's no need for a -t switch. cp will assume the last argument on the command line to be the target directory if more than two names are passed.

查看更多
Ridiculous、
5楼-- · 2019-03-08 10:39

In bash:

find /foo -iname '*.txt' -exec cp \{\} /dest/ \;

find will find all the files under the path /foo matching the wildcard *.txt, case insensitively (That's what -iname means). For each file, find will execute cp {} /dest/, with the found file in place of {}.

查看更多
迷人小祖宗
6楼-- · 2019-03-08 10:48

If you really want to run just one command, why not cons one up and run it? Like so:

$ find /foo  -name '*.txt' | xargs echo | sed -e 's/^/cp /' -e 's|$| /dest|' | bash -sx

But that won't matter too much performance-wise unless you do this a lot or have a ton of files. Be careful of name collusions, however. I noticed in testing that GNU cp at least warns of collisions:

cp: will not overwrite just-created `/dest/tubguide.tex' with `./texmf/tex/plain/tugboat/tubguide.tex'

I think the cleanest is:

$ find /foo  -name '*.txt' | xargs -i cp {} /dest

Less syntax to remember than the -exec option.

查看更多
登录 后发表回答