I would like to know how to configure stopwordsremover with french language in spark 1.6.3.
I'm currently using pyspark.
Thanks for your help.
Best regards,
I would like to know how to configure stopwordsremover with french language in spark 1.6.3.
I'm currently using pyspark.
Thanks for your help.
Best regards,
Based on Python Spark 1.6.3 docs, pyspark.ml.feature.StopWordsRemover does not have a language parameter. However you can always provide your own list of stopwords via the "stopWords" parameter.
Take a look at the nltk package
I use it for portuguese words:
Hope it helps