I already created a bulk load in hbase in a kerberos cluster with a driver class similar to this (working):
public static void main(String[] args) {
try {
int response = ToolRunner.run(HBaseConfiguration.create(), new HBaseBulkLoadDriver(), args);
if(response == 0) {
System.out.println("Job is successfully completed...");
} else {
System.out.println("Job failed...");
}
} catch(Exception exception) {
exception.printStackTrace();
}
}
@Override
public int run(String[] args) throws Exception {
int result=0;
final String inputPath = args[0];
final String outputPath = args[1];
final String keytab = args[2];
Configuration configuration = getConf();
configuration.set("data.seperator", DATA_SEPERATOR);
configuration.set("hbase.table.name",TABLE_NAME);
// configuration.set("INTRO",COLUMN_FAMILY_INTRO);
configuration.set("hbase.zookeeper.quorum","zk_quorum");
configuration.set("hbase.zookeeper.property.clientPort","2181");
configuration.set("hbase.master","master:port");
configuration.set("hadoop.security.authentication", "Kerberos");
configuration.set("hbase.security.authentication", "kerberos");
//configuration.set("COLUMN_FAMILY_2",COLUMN_FAMILY_2);
Job job = new Job(configuration);
// job configuration
job.setJarByClass(HBaseBulkLoadDriver.class);
job.setJobName("Bulk Loading HBase Table:"+TABLE_NAME);
job.setInputFormatClass(TextInputFormat.class);
job.setMapOutputKeyClass(ImmutableBytesWritable.class);
//mapper class
job.setMapperClass(HBaseBulkLoadMapper.class);
FileInputFormat.addInputPaths(job,inputPath);
FileSystem.getLocal(getConf()).delete(new Path(outputPath), true);
FileOutputFormat.setOutputPath(job, new Path(outputPath));
job.setMapOutputValueClass(Put.class);
HFileOutputFormat.configureIncrementalLoad(job, new HTable(configuration,TABLE_NAME));
job.waitForCompletion(true);
System.out.println("Output written to folder :" + outputPath);
System.out.println("To proceed loading files user: hbase:hbase must own recursivly the folder!");
System.out.println("Is hbase user owing the folder?press Y to load the data , press N and job will fail");
String IsHbaseOwnerOftheFolder = System.console().readLine();
if (job.isSuccessful() && IsHbaseOwnerOftheFolder.equals("Y")) {
HBaseBulkLoad.doBulkLoad(outputPath, keytab, TABLE_NAME);
} else {
result = -1;
}
return result;
}
Now I would like to implement the secure bulk load but seem that this must be implemented using coprocessor framework (hbase 1.0.0) can someone give me a complete example of how to use the securebulkloadHFiles method ? Thanks for the help
I will answer myself to my question :
For this answer to work a table in hbase must already exist moreover HFile have to be already generated for the import