sed or awk to delete a block

2019-09-15 08:24发布

my input file has blocks like

[abc]  
para1=123  
para2=456  
para3=111  

[pqr]  
para1=333    
para2=765    
para3=1345    

[xyz]    
para1=888    
para2=236    
para3=964    

[pqr]    
para1=tyu    
para2=ghj     
para3=ghjk     

[xyz]    
para1=qwe    
para2=asd    
para3=zxc

Now I need to delete the block which is duplicate using sed or awk. Have to delete the block which we get first from the top of the file. Ex: in above case, we have get the output like

[abc]  
para1=123  
para2=456  
para3=111  

[pqr]    
para1=tyu    
para2=ghj     
para3=ghjk     

[xyz]    
para1=qwe    
para2=asd    
para3=zxc

标签： bash shell awk sed

3条回答

劳资没心，怎么记你

2楼-- · 2019-09-15 08:26

This keeps the last instance of each block not the first

 tac file | awk -F"\n" '!x[$NF]++' RS= ORS="\n\n"  |  tac

Slight problem with this method is that as the field separator is a newline the lines have to have the same amount of whitespace after the text as it is counted as the field.
Otherwise should work perfectly :)

 tac file | awk '!x[$(NF-1)]++' RS= ORS="\n\n"  |  tac

This also works :)

0人赞添加讨论(0) 举报

相关推荐>>

3楼-- · 2019-09-15 08:32

$ cat tst.awk
BEGIN{ RS=""; ORS="\n\n" }
!seen[$1]++ { keys[++numKeys] = $1 }
{ rec[$1] = $0 }
END {
    for (k=1; k<=numKeys; k++) {
        print rec[keys[k]]
    }
}

$ awk -f tst.awk file
[abc]
para1=123
para2=456
para3=111

[pqr]
para1=tyu
para2=ghj
para3=ghjk

[xyz]
para1=qwe
para2=asd
para3=zxc

0人赞添加讨论(0) 举报

Bombasti

4楼-- · 2019-09-15 08:53

I do get this from using awk (not sure if you did forget the abc block)

awk '!a[$1]++' RS= ORS="\n\n" file
[abc]
para1=123
para2=456
para3=111

[pqr]
para1=333
para2=765
para3=1345

[xyz]
para1=888
para2=236
para3=964

0人赞添加讨论(0) 举报

sed or awk to delete a block

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间