How do I sum all numbers from output of jq

I have this command that I would like to sum all the numbers from the output.

The command looks like this

$(hadoop fs -ls -R /reports/dt=2018-08-27 | grep _stats.json | awk '{print $NF}' | xargs hadoop fs -cat | jq '.duration')

So it's going to list all the folders in /reports/dt=2018-08-27 and get only _stats.json and pass that through jq from hadoop -cat and get only .duration from the json. Which in the end I get the result like this.

1211789 1211789 373585 495379 1211789

But I would like the command to sum all those numbers together to become 4504331

标签： jq

6条回答

手持菜刀，她持情操

2楼-- · 2020-04-08 14:36

You can just use add now.

jq '.duration | add'

0人赞添加讨论(0) 举报

小情绪 Triste *

3楼-- · 2020-04-08 14:37

Use a for loop.

total=0
for num in $(hadoop fs -ls -R /reports/dt=2018-08-27 | grep _stats.json | awk '{print $NF}' | xargs hadoop fs -cat | jq '.duration')
do
    ((total += num))
done
echo $total

0人赞添加讨论(0) 举报

SAY GOODBYE

4楼-- · 2020-04-08 14:39

Another option (and one that works even if not all your durations are integers) is to make your jq code do the work:

sample_data='{"duration": 1211789}
{"duration": 1211789}
{"duration": 373585}
{"duration": 495379}
{"duration": 1211789}'

jq -n '[inputs | .duration] | reduce .[] as $num (0; .+$num)' <<<"$sample_data"

...properly emits as output:

Replace the <<<"$sample_data" with a pipeline on stdin as desired.

0人赞添加讨论(0) 举报

Deceive 欺骗

5楼-- · 2020-04-08 14:39

awk to the rescue!

$ ... | awk '{sum+=$0} END{print sum}'

4504331

0人赞添加讨论(0) 举报

手持菜刀，她持情操

6楼-- · 2020-04-08 14:41

For clarity and generality, it might be worthwhile defining sigma(s) to add a stream of numbers:

... | jq -n '
  def sigma(s): reduce s as $x(0;.+$x); 
  sigma(inputs | .duration)'

0人赞添加讨论(0) 举报

Root（大扎）

7楼-- · 2020-04-08 14:42

jq '[.duration] | add'

will do

0人赞添加讨论(0) 举报

How do I sum all numbers from output of jq

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间