I know that Spark ML pipelines can be exported to PMML using the JPMML-SparkML library. I am just struggling to find out how I could do it from R using sparklyr.
I am aware of open github issue, where two ideas were raised:
using Scala API, something like:
model <- ml_kmeans(<...>) sparkapi::invoke(model$.model, "toPMML", "./myModelPMML.xml")
leverage https://github.com/jpmml/jpmml-converter and the https://github.com/jpmml/jpmml-sparkml
However I could not find any follow ups on that tips. Has anyone tried any of these solutions?
Here is the related github issue to this question.
You can use the
sparklyr2pmml
package to export Sparklyr/Apache Spark ML pipelines to PMML.