Wrappers for Spark’s MLlib machine learning library in SparkR have been slow to arrive. However, the future looks bright.
The imminent 2.0 release will bring k-means support to SparkR and the 2.1 release is scheduled to include wrappers for the following machine learning stalwarts
- Alternating Least Squares (ALS)
- Decision Trees
- Gaussian Mixture Models
- Isotonic Regression
- Latent Dirichlet Allocation (LDA)
- Multilayer Perceptron Classifiers
- Random Forests