abtest-mlops

Table of content

abtest-mlops

Overview

A/B testing allows comparing two or more versions of a given service against each other to find out which variation performs better.

This repository contains an implementation of AB testing for the Classical, Sequential, and ML approaches. I have used data collected by an Advertising company running an online ad for a client to increase brand awareness. To increase its market competitiveness, the advertising company provides a further service that quantifies the increase in brand awareness as a result of the ads it shows to online users.

The main objective is to design a reliable hypothesis testing algorithm to test if the ads that the advertising company runs resulted in a significant lift in brand awareness. Through this, we will explore Classical, Sequential, and ML approaches to A/B testing,

Requirements

Python 3.5 and above, Pip and MYSQL

Install

git clone https://github.com/eandualem/abtest-mlops
cd abtest-mlops
pip install -r requirements.txt

Features

Data Exploration

The notebook for Data Exploration is inside the notebooks folder in the file classical-ab-testing.ipynb.

Classical A/B Testing

The notebook for Classical A/B Testing is inside the notebooks folder in the file classical-ab-testing.ipynb.

Sequential A/B Testing

The notebook for Sequential A/B Testing is inside the notebooks folder in the file sequential-ab-testing.ipynb.

ML A/B Testing

The notebook for ML A/B Testing is inside the notebooks folder in the file ml-ab-testing.ipynb.

Scripts

create_dataset_versions: simple script for creating different versions of the data AdSmartABdata.csv
create_dataset: simple script for creating train, test split of AdSmartABdata.csv
create_features: simple script for creating features for train and test data
train_model: class trains a model using 5-fold cross validation and returns the best model
train_logistic_model: simple script for training logistic regression using TrainModel class
train_decision_trees_model: simple script for training decision tree using TrainModel class
train_xgboost_model: simple script for training xgboost using TrainModel class
evaluate_model: class for calculates evaluation metrics for a give model using actual data
evaluate_logistic_model: simple script for evaluating logistic model using EvaluateModel class
evaluate_decision_trees: simple script for evaluating decision tree model using EvaluateModel class
evaluate_xgboost_model: simple script for evaluating xgboost model using EvaluateModel class
df_helper: helper class for reading csv and saving csv files

Test

There is a test file for df_helper inside the tests folder.

Travis CI

The file .travis.yml contains the configuration for Travis.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.dvc		.dvc
.github/workflows		.github/workflows
.vscode		.vscode
data		data
features		features
mlruns/0		mlruns/0
models		models
notebooks		notebooks
scripts		scripts
tests		tests
.dvcignore		.dvcignore
.flake8		.flake8
.gitignore		.gitignore
Dockerfile		Dockerfile
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
log.py		log.py
requirements.txt		requirements.txt
setup.py		setup.py
travis.yml		travis.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

abtest-mlops

Overview

Requirements

Install

Features

Data Exploration

Classical A/B Testing

Sequential A/B Testing

ML A/B Testing

Scripts

Test

Travis CI

About

Releases

Packages

Languages

eandualem/abtest-mlops

Folders and files

Latest commit

History

Repository files navigation

abtest-mlops

Overview

Requirements

Install

Features

Data Exploration

Classical A/B Testing

Sequential A/B Testing

ML A/B Testing

Scripts

Test

Travis CI

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages