240
收录了3920篇文章 ·4311个问题 · 0人关注
0

java.lang.IllegalArgumentException at org.apache.x

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IStartedgettingthefollowingerroranytimeItrytocollectmyrdd\'s.IthappenedafterIinstalledjava10.1SoofcourseItookitout......

0

How to run independent transformations in parallel

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Iamtryingtorun2functionsdoingcompletelyindependenttransformationsonasingleRDDinparallelusingPySpark.Whataresomemethods......

0

Unpivot in spark-sql/pyspark

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IhaveaproblemstatementathandwhereinIwanttounpivottableinspark-sql/pyspark.IhavegonethroughthedocumentationandIcouldse......

0

SPARK SQL replacement for mysql GROUP_CONCAT aggre

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Ihaveatableoftwostringtypecolumns(username,friend)andforeachusername,Iwanttocollectallofit\'sfriendsononerow,concat......

0

Encoder error while trying to map dataframe row to

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: WhenImtryingtodothesamethinginmycodeasmentionedbelow dataframe.map(row=>{ valrow1=row.getAs[String](1) valmake=if......

0

How do I skip a header from CSV files in Spark?

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: SupposeIgivethreefilespathstoaSparkcontexttoreadandeachfilehasaschemainthefirstrow.Howcanweskipschemalinesfromhe......

0

Spark - load CSV file as DataFrame?

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IwouldliketoreadaCSVinsparkandconvertitasDataFrameandstoreitinHDFSwithdf.registerTempTable(\"table_name\") Ihavetried......

0

How to define partitioning of DataFrame?

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: I\'vestartedusingSparkSQLandDataFramesinSpark1.4.0.I\'mwantingtodefineacustompartitioneronDataFrames,inScala,butnotse......

0

Spark 2.0 Dataset vs DataFrame

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: startingoutwithspark2.0.1Igotsomequestions.Ireadalotofdocumentationbutsofarcouldnotfindsufficientanswers: Whatisthe......

0

How to define and use a User-Defined Aggregate Fun

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: IknowhowtowriteaUDFinSparkSQL: defbelowThreshold(power:Int):Boolean={ returnpower<-40 } sqlContext.udf.regis......

0

Difference between DataFrame (in Spark 2.0 i.e Dat

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: I\'mjustwonderingwhatisthedifferencebetweenanRDDandDataFrame(Spark2.0.0DataFrameisameretypealiasforDataset[Row])inApac......

0

How to stop INFO messages displaying on spark cons

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: I\'dliketostopvariousmessagesthatarecomingonsparkshell. Itriedtoeditthelog4j.propertiesfileinordertostopthesemessage.......

0

How to serve a Spark MLlib model?

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: I\'mevaluatingtoolsforproductionMLbasedapplicationsandoneofouroptionsisSparkMLlib,butIhavesomequestionsabouthowtoser......

0

DataFrame / Dataset groupBy behaviour/optimization

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: SupposewehaveDataFramedfconsistingofthefollowingcolumns: Name,Surname,Size,Width,Length,Weigh Nowwewanttoperformaco......

0

How do I split an RDD into two or more RDDs?

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: I\'mlookingforawaytosplitanRDDintotwoormoreRDDs.TheclosestI\'veseenisScalaSpark:SplitcollectionintoseveralRDD?which......

0

How to convert rdd object to dataframe in spark

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: HowcanIconvertanRDD(org.apache.spark.rdd.RDD[org.apache.spark.sql.Row])toaDataframeorg.apache.spark.sql.DataFrame.Iconvertedada......

0

SparkSQL: apply aggregate functions to a list of c

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Isthereawaytoapplyanaggregatefunctiontoall(oralistof)columnsofadataframe,whendoingagroupBy?Inotherwords,istherea......

0

Concatenate columns in Apache Spark DataFrame

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: HowdoweconcatenatetwocolumnsinanApacheSparkDataFrame? IsthereanyfunctioninSparkSQLwhichwecanuse? 回答1: WithrawSQL......

0

Write single CSV file using spark-csv

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Iamusinghttps://github.com/databricks/spark-csv,IamtryingtowriteasingleCSV,butnotableto,itismakingafolder. NeedaScala......

0

Spark java.lang.OutOfMemoryError: Java heap space

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):问题: Mycluster:1master,11slaves,eachnodehas6GBmemory. Mysettings: spark.executor.memory=4g,Dspark.akka.frameSize=512 Hereisthe......