Mini Project: PySpark
Mikiko Bazeley edited this page Feb 12, 2019 · 4 revisions
- http://spark.apache.org/docs/2.0.0/api/python/_modules/pyspark/ml/tuning.html
- https://spark.apache.org/docs/2.2.0/api/java/index.html?org/apache/spark/ml/classification/GBTClassifier.html
- https://wesslen.github.io/page2/
- https://medium.com/@fxzero/how-to-predict-user-churn-using-pyspark-fe25f6de1d7a
- https://wesslen.github.io/twitter/predicting_twitter_profile_location_with_pyspark/
- https://easyrdatascience.files.wordpress.com/2018/07/pyspark-6-introducing-ml-package.pdf
- https://www.kaggle.com/tekrei/apache-spark-gbtclassifier-with-cv
- https://towardsdatascience.com/hyperparameters-part-ii-random-search-on-spark-77667e68b606
- https://stackoverflow.com/questions/39529012/pyspark-get-all-parameters-of-models-created-with-paramgridbuilder
- https://spark.apache.org/docs/2.2.0/api/python/_modules/pyspark/mllib/evaluation.html
- https://spark.apache.org/docs/2.2.0/mllib-evaluation-metrics.html
- https://spark.apache.org/docs/2.1.3/api/java/org/apache/spark/ml/classification/GBTClassifier.html
Explanation of Stages:
- https://docs.databricks.com/spark/latest/mllib/binary-classification-mllib-pipelines.html
- https://spark.apache.org/docs/2.1.1/ml-pipeline.html
- https://dataplatform.cloud.ibm.com/analytics/notebooks/5e4963d9-faea-455d-a7db-ff6302d1d8f5/view?access_token=5d23d36be72dea35ebbde9b4b5f4a16d0053ee898f1ab2ab73cf1301ce9322be
- https://stackoverflow.com/questions/37021964/pyspark-model-interpretation-from-pipeline-model/38259379
- https://stackoverflow.com/questions/38664620/any-way-to-access-methods-from-individual-stages-in-pyspark-pipelinemodel
- Question 8 help: https://dscareercommunity.springboard.com/t/y7rdp0