How to delete the first column ( which is in fact

2019-04-04 22:53发布

问题:

I have data file with many thousands columns and rows. I want to delete the first column which is in fact the row counter. I used this command in linux:

cut -d " " -f 2- input.txt > output.txt

but nothing changed in my output. Does anybody knows why it does not work and what should I do?

This is what my input file looks like:

col1 col2 col3 col4 ...
     1 0 0 0 1
     2 0 1 0 1
     3 0 1 0 0
     4 0 0 0 0 
     5 0 1 1 1 
     6 1 1 1 0
     7 1 0 0 0 
     8 0 0 0 0
     9 1 0 0 0
     10 1 1 1 1
     11 0 0 0 1
    .
    .
    .

I want my output look like this:

col1 col2 col3 col4 ...
0 0 0 1
0 1 0 1
0 1 0 0
0 0 0 0 
0 1 1 1 
1 1 1 0
1 0 0 0 
0 0 0 0
1 0 0 0
1 1 1 1
0 0 0 1
.
.
.

I also tried the sed command:

 sed '1d' input.file > output.file

But it deletes the first row not the first column.

Could anybody guide me?

回答1:

@Karafka I had CSV files so I added the "," separator (you can replace with yours

cut -d"," -f2- input.csv  > output.csv

Then, I used a loop to go over all files inside the directory

# files are in the directory tmp/
for f in tmp/*
do
    name=`basename $f`
    echo "processing file : $name"
    #kepp all column excep the first one of each csv file 

    cut -d"," -f2- $f > new/$name
    #files using the same names are stored in directory new/  
done


回答2:

idiomatic use of cut will be

cut -f2- input > output

if you delimiter is tab ("\t").

Or, simply with awk magic (will work for both space and tab delimiter)

 awk '{$1=""}1' input | awk '{$1=$1}1' > output

first awk will delete field 1, but leaves a delimiter, second awk removes the delimiter. Default output delimiter will be space, if you want to change to tab, add -vOFS="\t" to the second awk.

UPDATED

Based on your updated input the problem is the initial spaces that cut treats as multiple columns. One way to address is to remove them first before feeding to cut

sed 's/^ *//' input | cut -d" " -f2- > output

or use the awk alternative above which will work in this case as well.



回答3:

You can use cut command with --complement option:

cut -f1 -d" " --complement input.file > output.file

This will output all columns except the first one.



标签: linux bash shell