Instacart Market Basket Analysis

This repo shows a set of Jupyter Notebooks that I used to tackle the Instacart Masket Basket Analysis challenge. The dataset for this competition is a relational set of files describing customers' orders over time. The goal of the competition is to predict which products will be in a user's next order. The dataset is anonymized and contains a sample of over 3 million grocery orders from more than 200,000 Instacart users. For each user, Instacart provides between 4 and 100 of their orders, with the sequence of products purchased in each order. Instacart also provides the week and hour of day the order was placed, and a relative measure of time between orders.

Here are the different notebooks:

Data Exploration: Exploring the raw datasets.
Customer Segmentation: Segmenting the customers with Principal Component Analysis and K-Means Clustering.
Association Rule Mining: Applying the Apriori algorithm to mine association rules between orders and customers.

A 3-part series of accompanied Medium blog posts have been written up and can be viewed here:

Environment

Saturn Cloud

Requirements

Dependencies

Choose the latest versions of any of the dependencies below:

License

MIT. See the LICENSE file for the copyright notice.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.ipynb_checkpoints		.ipynb_checkpoints
data		data
notebooks		notebooks
.gitattributes		.gitattributes
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Instacart Market Basket Analysis

Environment

Requirements

Dependencies

License

About

Releases

Packages

Languages

khanhnamle1994/instacart-orders

Folders and files

Latest commit

History

Repository files navigation

Instacart Market Basket Analysis

Environment

Requirements

Dependencies

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages