Actually I am trying my level best to integrate with R, but I got this error.
packages ‘rmr’, ‘rJava‘, ‘RJSONIO‘, ‘rhdfs’, ‘rhbase’, ‘plyrmr’ are not available (for R version 3.1.3)
Steps to integrate Hadoop with R:
Installed R, and Hadoop in ubuntu.
Add these three lines in ~/.bashrc file.
*export HADOOP_PREFIX=/Users/hadoop/hadoop-1.1.2
export HADOOP_CMD=/Users/hadoop/hadoop-1.1.2/bin/hadoop
export HADOOP_STREAMING=/Users/hadoop/hadoop-1.1.2/contrib/streaming/hadoop-streaming-1.1.2.jar*
Installed R packages by using this command
install.packages(c("rJava", "RJSONIO", "rmr", "rhdfs", "rhbase", "plyrmr").
But i got above error. What is the main problem how to integrate R and Hadoop. I have followed this link to integrate.
With
devtools
you can install directly from Github:can you try to install these packages by specifying the reposetory:
Download packages rhdfs, rhbase, rmr2 and plyrmr from https://github.com/RevolutionAnalytics/RHadoop/wiki and install them as below :
You can download the packages from CRAN's website and install them without connecting to the repository.
For example, to download 'rJava' package, you can visit this link: http://cran.r-project.org/web/packages/rJava/index.html
From there, download the 'Package source', and install them manually like this:
You can follow the same pattern for the other mentioned packages. The rhdfs, rhbase and rmr packages are located at https://github.com/RevolutionAnalytics/RHadoop/wiki/Downloads