How can I find the disk space consumed by directories matching a particular pattern using an hdfs command, without listing all the files under those directories?
How can the command hdfs dfs -du -h
be combined efficiently with other commands to get the disk space consumed for a particular directory pattern?
For example, hdfs dfs -du -h /data/timestamp=201901*
should give me the disk space consumed by directories matching that pattern, i.e. for the month of January 2019.
hdfs dfs -du -h /data/timestamp=201901*
Configured Capacity: 12827080276992 (1 TB)
Present Capacity: 12729266364289 (0.5 TB)
DFS Remaining: 1671299095456 (0.5 TB)
DFS Used: 11057967268833 (10.06 TB)
DFS Used%: 46.07%
Under replicated blocks: 56
Blocks with corrupt replicas: 0
Missing blocks: 0
Missing blocks (with replication factor 1): 0
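One possible approach, as a minimal sketch: run du with -s so that each directory matching the pattern is summarised on one line rather than listed file by file, drop -h so the sizes stay in plain bytes, and add up the first column with awk. The helper names sum_du_bytes and du_pattern_total are made up for illustration. The sketch assumes the first column of the du output is the logical size in bytes (some Hadoop versions print a second column with the space consumed including replicas, which is ignored here).

```shell
#!/bin/sh
# sum_du_bytes (hypothetical helper): reads `hdfs dfs -du -s` style output
# on stdin and prints the sum of the first column (size in bytes).
sum_du_bytes() {
  awk '{ total += $1 } END { printf "%.0f\n", total }'
}

# du_pattern_total (hypothetical helper): total bytes for every path that
# matches the given HDFS glob. The pattern is quoted so that HDFS, not the
# local shell, expands the wildcard.
du_pattern_total() {
  hdfs dfs -du -s "$1" | sum_du_bytes
}

# Example call (requires a working hdfs client, so it is left commented out):
# du_pattern_total '/data/timestamp=201901*'
```

If GNU coreutils is available, the byte total can be piped through numfmt --to=iec to get a human-readable figure. Summing the output of -du -h directly is awkward because the unit suffixes differ from line to line.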