240
收录了3920篇文章 ·4311个问题 · 0人关注
0

pyspark; check if an element is in collect_list [d

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Thisquestionalread...

0

Custom aggregation on PySpark dataframes

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IhaveaPySparkDataFramewithonecolumnasonehotencodedvectors.Iwanttoaggregatethedifferentonehotencoded......

0

spark.default.parallelism for Parallelize RDD defa

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Sparkstandaloneclusterwithamasterand2workernodes4cpucoreoneachworker.Total8coresforallworkers. When......

0

How to calculate lag difference in Spark Structure

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IamwritingaSparkStructuredStreamingprogram.Ineedtocreateanadditionalcolumnwiththelagdifference. Torep......

0

how to handle the Exception in spark map() functio

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IwanttoignoreExceptioninmap()function,forexample: rdd.map(_.toInt) whererddisaRDD[String]. butifitme......

0

Pyspark on yarn-cluster mode

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Isthereanywaytorunpysparkscriptswithyarn-clustermodewithoutusingthespark-submitscript?Ineeditinthis......

0

SPARK, ML, Tuning, CrossValidator: access the metr

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: InordertobuildaNaiveBayesmulticlassclassifier,IamusingaCrossValidatortoselectthebestparametersinmypip......

0

How to transform RDD, Dataframe or Dataset straigh

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Isthereanyway(oranyplans)tobeabletoturnSparkdistributedcollections(RDDs,DataframeorDatasets)directlyi......

0

java.lang.NoSuchMethodError Jackson databind and S

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Iamtryingtorunspark-submitwithSpark1.1.0andJackson2.4.4.IhavescalacodewhichusesJacksontode-serialize......

0

Spark SQL is not converting timezone correctly [du

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Thisquestionalread...

0

How do I run the Spark decision tree with a catego

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IhaveafeaturesetwithacorrespondingcategoricalFeaturesInfo:Map[Int,Int].However,forthelifeofmeIcannotfig......

0

Zeppelin: How to restart sparkContext in zeppelin

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IamusingIsolatedmodeofzeppelinssparkinterpreter,withthismodeitwillstartanewjobforeachnotebookinspar......

0

How to convert org.apache.spark.rdd.RDD[Array[Doub

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IamtryingtoimplementKMeansusingApacheSpark. valdata=sc.textFile(irisDatasetString) valparsedData=data.map(......

0

When to use Kryo serialization in Spark?

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IamalreadycompressingRDDsusingconf.set("spark.rdd.compress","true")andpersist(MEMORY_AND_DISK_SER).WillusingKr......

0

sbt assembly shading to create fat jar to run on s

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: I'musingsbtassemblytocreateafatjarwhichcanrunonspark.Havedependenciesongrpc-netty.Guavaversiononspar......

0

Spark Indefinite Waiting with “Asked to send map o

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Myjobsoftenhangwiththiskindofmessage: 14/09/0100:32:18INFOspark.MapOutputTrackerMasterActor:Askedtosendma......

0

Timeout Exception in Apache-Spark during program E

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IamrunningaBashScriptinMAC.ThisscriptcallsasparkmethodwritteninScalalanguageforalargenumberoftime......

0

how to select all columns that starts with a commo

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IhaveadataframeinSpark1.6andwanttoselectjustsomecolumnsoutofit.Thecolumnnamesarelike: colA,colB,c......

0

how to introduce the schema in a Row in Spark?

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IntheRowJavaAPIthereisarow.schema(),howeverthereisnotarow.set(StructTypeschema). AlsoItriedtoRowFact......

0

How to append an element to an array column of a S

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: SupposeIhavethefollowingDataFrame: scala>valdf1=Seq("a","b").toDF("id").withColumn("nums",array(lit(1...