USC DSCI 553 - Foundations & Applications of Data Mining - Spring 2024 - Prof. Wei-Min Shen
Updated Sep 16, 2024 - Python
This assignment focuses on implementing the SON Algorithm using the Spark Framework to identify frequent itemsets within large datasets.
Data mining to find frequent itemsets using the SON algorithm
Finding frequent itemsets over millions of shopping baskets, implemented with the SON algorithm on Spark in Python on AWS.
Finding restaurant tuples that appear in review data from Yelp.com
Implementation of the A-Priori and SON algorithms on Spark.
Implemented the SON Algorithm using the Apache Spark Framework to find frequent itemsets. Used the A-Priori Algorithm to process each chunk of the data.
Simple Projects in Data Mining
Implementation of a Data Mining Algorithm on Spark with Python 3
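Several of the descriptions above mention running A-Priori on each chunk of the data as part of SON. A minimal sketch of that two-pass idea follows, in plain Python rather than Spark so it runs anywhere. It is not taken from any of the listed repositories: the function names `apriori` and `son`, the `num_chunks` parameter, and the sample baskets are all hypothetical, and the in-memory chunks only stand in for Spark partitions.

```python
from collections import Counter
from itertools import combinations


def apriori(baskets, min_count):
    """Return itemsets (sorted tuples) found in at least min_count baskets."""
    baskets = [frozenset(b) for b in baskets]
    # Level 1: count single items.
    counts = Counter(item for b in baskets for item in b)
    current = {(item,) for item, n in counts.items() if n >= min_count}
    frequent = set(current)
    k = 2
    while current:
        # Build size-k candidates from items that are still frequent and keep
        # only those whose (k-1)-subsets are all frequent (monotonicity).
        items = sorted({i for s in current for i in s})
        candidates = [
            c for c in combinations(items, k)
            if all(sub in current for sub in combinations(c, k - 1))
        ]
        counts = Counter(
            c for b in baskets for c in candidates if b.issuperset(c)
        )
        current = {c for c, n in counts.items() if n >= min_count}
        frequent |= current
        k += 1
    return frequent


def son(baskets, support, num_chunks=2):
    """Two-pass SON sketch; chunks play the role of Spark partitions."""
    chunks = [baskets[i::num_chunks] for i in range(num_chunks)]

    # Pass 1: run A-Priori per chunk with the threshold scaled to chunk size.
    # Any globally frequent itemset must be locally frequent in some chunk,
    # so the union of local results has no false negatives.
    candidates = set()
    for chunk in chunks:
        if not chunk:
            continue
        # Integer ceiling of support * len(chunk) / len(baskets).
        local_min = -(-support * len(chunk) // len(baskets))
        candidates |= apriori(chunk, local_min)

    # Pass 2: count every candidate over the full data to drop false positives.
    totals = Counter()
    for chunk in chunks:
        for basket in chunk:
            items = set(basket)
            totals.update(c for c in candidates if items.issuperset(c))
    return sorted(c for c, n in totals.items() if n >= support)


# Hypothetical sample data, for illustration only.
baskets = [
    ["milk", "bread"],
    ["milk", "bread", "eggs"],
    ["bread", "eggs"],
    ["milk", "eggs"],
    ["milk", "bread", "eggs"],
    ["bread"],
]
result = son(baskets, support=3)
print(result)
```

In a Spark version, pass 1 and pass 2 would typically each be a per-partition map followed by a reduce; the loops over `chunks` here are a stand-in for that, assuming the data fits in memory.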