I'm writing a spark application and would like to use algorithms in MLlib. In the API doc I found two different classes for the same algorithm.
For example, there is one LogisticRegression in org.apache.spark.ml.classification also a LogisticRegressionwithSGD in org.apache.spark.mllib.classification.
The only difference I can find is that the one in org.apache.spark.ml is inherited from Estimator and was able to be used in cross validation. I was quite confused that they are placed in different packages. Is there anyone know the reason for it? Thanks!
The spark mllib guide says:
and
I think the doc explains it very well.
It's JIRA ticket
And From Design Doc: