Skip to content

A machine learning project to predict breast cancer using logistic regression. This project includes data preprocessing, feature scaling, model training, and evaluation, based on a guided project from Coursera.

License

Notifications You must be signed in to change notification settings

harshita2234/Breast-Cancer-Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Breast Cancer Prediction

This project uses machine learning to predict breast cancer based on features of cell nuclei present in the breast cancer dataset. It is based on a guided project from Coursera.

Project Description

The goal of this project is to build a logistic regression classifier that can accurately predict whether a tumor is benign or malignant based on certain features. The dataset used is the Breast Cancer Wisconsin (Diagnostic) Dataset.

Dataset

The dataset contains 569 instances of tumors, with 30 features each. The features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image.

Project Structure

  • breast_cancer_prediction.ipynb: Jupyter notebook containing the code for the project.

Requirements

The project requires the following Python packages:

  • pandas
  • numpy
  • scikit-learn
  • matplotlib (optional, for visualizations)

You can install these using pip:

pip install pandas numpy scikit-learn matplotlib

Usage

  1. Clone the repository:
git clone https://github.com/harshita2234/Breast-Cancer-Prediction.git
  1. Navigate to the project directory:
cd Breast-Cancer-Prediction
  1. Open the Jupyter notebook:
jupyter notebook breast_cancer_prediction.ipynb
  1. Run the cells in the notebook to see the analysis and results.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • Coursera for the guided project
  • Wisconsin Diagnostic Breast Cancer (WDBC) dataset

About

A machine learning project to predict breast cancer using logistic regression. This project includes data preprocessing, feature scaling, model training, and evaluation, based on a guided project from Coursera.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published