240
收录了3920篇文章 ·4311个问题 · 0人关注
0

Installing of SparkR

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IhavethelastversionofR-3.2.1.NowIwanttoinstallSparkRonR.AfterIexecute: >install.packages("SparkR") Igotback: Instal......

0

java.io.IOException: Could not locate executable n

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: I'mnotabletorunasimplesparkjobinScalaIDE(Mavensparkproject)installedonWindows7 Sparkcoredependencyhasbeenadded. val......

0

Renaming column names of a DataFrame in Spark Scal

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Iamtryingtoconvertalltheheaders/columnnamesofaDataFrameinSpark-Scala.asofnowIcomeupwithfollowingcodewhichonlyrepl......

0

How to zip two (or more) DataFrame in Spark

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IhavetwoDataFrameaandb. aislike Column1|Column2 abc|123 cde|23 bislike Column1 1 2 Iwan......

0

Use collect_list and collect_set in Spark SQL

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Accordingtothedocs,thecollect_setandcollect_listfunctionsshouldbeavailableinSparkSQL.However,Icannotgetittowork.I'mrun......

0

Pyspark filter dataframe by columns of another dat

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: NotsurewhyI'mhavingadifficulttimewiththis,itseemssosimpleconsideringit'sfairlyeasytodoinRorpandas.Iwantedtoavoid......

0

Defining a UDF that accepts an Array of objects in

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: WhenworkingwithSpark'sDataFrames,UserDefinedFunctions(UDFs)arerequiredformappingdataincolumns.UDFsrequirethatargumenttype......

0

Save ML model for future usage

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IwasapplyingsomeMachineLearningalgorithmslikeLinearRegression,LogisticRegression,andNaiveBayestosomedata,butIwastryingt......

0

Best Practice to launch Spark Applications via Web

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IwanttoexposemySparkapplicationstotheuserswithawebapplication. Basically,theusercandecidewhichactionhewantstorunand......

0

How to Define Custom partitioner for Spark RDDs of

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IamnewtoSpark.Ihavealargedatasetofelements[RDD]andIwanttodivideitintotwoexactlyequalsizedpartitionsmaintainingorder......

0

How to create an empty DataFrame with a specified

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IwanttocreateonDataFramewithaspecifiedschemainScala.IhavetriedtouseJSONread(Imeanreadingemptyfile)butIdon'tthinkt......

0

Overwrite specific partitions in spark dataframe w

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Iwanttooverwritespecificpartitionsinsteadofallinspark.Iamtryingthefollowingcommand: df.write.orc('maprfs:///hdfs-base-path',......

0

Apache Spark Moving Average

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IhaveahugefileinHDFShavingTimeSeriesdatapoints(YahooStockprices). IwanttofindthemovingaverageoftheTimeSerieshowdo......

0

Spark: what's the best strategy for joining a

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IhavetwoRDD'sthatIwanttojoinandtheylooklikethis: valrdd1:RDD[(T,U)] valrdd2:RDD[((T,W),V)] Ithappenstobethecasethat......

0

How to set up Spark on Windows?

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IamtryingtosetupApacheSparkonWindows. Aftersearchingabit,IunderstandthatthestandalonemodeiswhatIwant. Whichbinariesdo......

0

Requirements for converting Spark dataframe to Pan

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: I'mrunningSparkonHadoop'sYARN.Howdoesthisconversionwork?Doesacollect()takeplacebeforetheconversion? AlsoIneedtoinsta......

0

Spark Scala: How to convert Dataframe[vector] to D

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IjustusedStandardScalertonormalizemyfeaturesforaMLapplication.Afterselectingthescaledfeatures,Iwanttoconvertthisbackt......

0

Spark losing println() on stdout

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Ihavethefollowingcode: valblueCount=sc.accumulator[Long](0) valoutput=input.map{data=> for(value<-data.getValues()){ ......

0

Explode (transpose?) multiple columns in Spark SQL

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IamusingSparkSQL(ImentionthatitisinSparkincasethataffectstheSQLsyntax-I'mnotfamiliarenoughtobesureyet)andIhave......

0

Is groupByKey ever preferred over reduceByKey

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IalwaysusereduceByKeywhenIneedtogroupdatainRDDs,becauseitperformsamapsidereducebeforeshufflingdata,whichoftenmeansth......