<property>
<name>mapreduce.fileoutputcommitter.marksuccessfuljobs</name>
<value>false</value>
</property>
I want to override the above property to true. The property needs to be false for the rest of the jobs on the cluster, but I need, in my oozie workflow, hadoop to create _SUCCESS file in the output directory after the completion of job. Its a hive action in the workflow which writes output. Please help.
Hive unfortunately overrides this capability by setting it's own NullOutputComitter:
see
Looks like you'll have to create the flag manually. We have filed HIVE-3700 for this.
You can add 'dfs' command to your hive script, like
dfs -touchz '$table_base_path'/dt='${partition}'/_SUCCESS
https://archive.cloudera.com/cdh4/cdh/4/hive/language_manual/cli.html
I ran into the same issue and ended up using a shell action to create the flag.
Here's a full example: http://nathan.vertile.com/blog/2014/09/02/oozie-data-pipeline-done-flag/