How to install RHadoop packages (Rmr, Rhdfs, Rhbas

2019-01-24 11:33发布

问题:

Actually I am trying my level best to integrate with R, but I got this error.

packages ‘rmr’, ‘rJava‘, ‘RJSONIO‘, ‘rhdfs’, ‘rhbase’, ‘plyrmr’ are not available (for R version 3.1.3)

Steps to integrate Hadoop with R:

Installed R, and Hadoop in ubuntu.

Add these three lines in ~/.bashrc file.

*export HADOOP_PREFIX=/Users/hadoop/hadoop-1.1.2

export HADOOP_CMD=/Users/hadoop/hadoop-1.1.2/bin/hadoop

export HADOOP_STREAMING=/Users/hadoop/hadoop-1.1.2/contrib/streaming/hadoop-streaming-1.1.2.jar*

Installed R packages by using this command

install.packages(c("rJava", "RJSONIO", "rmr", "rhdfs", "rhbase", "plyrmr").

But i got above error. What is the main problem how to integrate R and Hadoop. I have followed this link to integrate.

回答1:

Download packages rhdfs, rhbase, rmr2 and plyrmr from https://github.com/RevolutionAnalytics/RHadoop/wiki and install them as below :

install.packages("<path>/rhdfs_1.0.8.tar.gz", repos=NULL, type="source")
install.packages("<path>/rmr2_2.2.2.tar.gz", repos=NULL, type="source")
install.packages("<path>plyrmr_0.2.0.tar.gz", repos=NULL, type="source")
install.packages("<path>/rhbase_1.2.0.tar.gz", repos=NULL, type="source")


回答2:

With devtools you can install directly from Github:

install.packages('devtools')
devtools::install_github(c('RevolutionAnalytics/rmr2/pkg', 'RevolutionAnalytics/plyrmr/pkg'))


回答3:

can you try to install these packages by specifying the reposetory:

install.packages(c("rJava", "RJSONIO", "rmr", "rhdfs", "rhbase", "plyrmr"), repos="http://cran.r-project.org/")


回答4:

You can download the packages from CRAN's website and install them without connecting to the repository.

For example, to download 'rJava' package, you can visit this link: http://cran.r-project.org/web/packages/rJava/index.html

From there, download the 'Package source', and install them manually like this:

install.packages('path to downloads/rJava_0.9-6.tar.gz',repos=NULL)

You can follow the same pattern for the other mentioned packages. The rhdfs, rhbase and rmr packages are located at https://github.com/RevolutionAnalytics/RHadoop/wiki/Downloads



标签: r hadoop rhadoop