How to split file on first empty line in a portabl

2019-02-16 12:30发布

I want to split a file containg HTTP response into two files: one containing only HTTP headers, and one containg the body of a message. For this I need to split a file into two on first empty line (or for UNIX tools on first line containing only CR = '\r' character) using a shell script.

How to do this in a portable way (for example using sed, but without GNU extensions)? One can assume that empty line would not be first line in a file. Empty line can got to either, none or both of files; it doesn't matter to me.

4条回答
【Aperson】
2楼-- · 2019-02-16 13:09

You can extract the first part of your file (HTTP headers) with:

awk '{if($0=="")exit;print}' myFile

and the second part (HTTP body) with:

awk '{if(body)print;if($0=="")body=1}' myFile
查看更多
对你真心纯属浪费
3楼-- · 2019-02-16 13:13

You can use csplit:

echo "a
b
c

d
e
f" | csplit -s - '/^$/'

Or

csplit -s filename '/^$/'

(assuming the contents of "filename" are the same as the output of the echo) would create, in this case, two files named "xx00" and "xx01". The prefix can be changed from "xx" to "outfile", for example, with -f outfile and the number of digits in the filename could be changed to 3 with -n 3. You can use a more complex regex if you need to deal with Macintosh line endings.

To split a file at each empty line, you can use:

csplit -s filename '/^$/' '{*}'

The pattern '{*}' causes the preceding pattern to be repeated as many times as possible.

查看更多
闹够了就滚
4楼-- · 2019-02-16 13:14

Given the awk script

BEGIN { fout="headers" }
/^$/ { fout="body" }
{ print $0 > fout }

awk -f foo.awk < httpfile will write out the two files headers and body for you.

查看更多
我只想做你的唯一
5楼-- · 2019-02-16 13:27
$ cat test.txt
a
b
c

d
e
f
$ sed '/^$/q' test.txt 
a
b
c

$ sed '1,/^$/d' test.txt 
d
e
f

Change the /^$/ to /^\s*$/ if you expect there may be whitespace on the blank line.

查看更多
登录 后发表回答