The Student Data Processing Project is a Python-based data processing and analysis project designed to handle student data and perform various operations on it. This README.md file provides an overview of the project, its purpose, and instructions on how to use it.
The purpose of this project is to:
- Parse an Excel file containing student data.
- Generate unique email addresses for students.
- Remove special characters from email addresses.
- Perform data analysis, including categorizing students by gender and identifying similarities in student names.
- Create backup files on Google Drive for data security.
- Python version 3.11.5 installed on your system.
- Clone the repository on your local machine.
git clone https://github.com/mikemwai/codelabs.git
- Navigate to the project directory and create a virtual environment on your local machine through the command line:
py -m venv myenv
- Activate your virtual environment:
- On Windows:
myenv\Scripts\activate
- On Mac:
source myenv/bin/activate
- Install project dependencies on your virtual environment:
pip install -r requirements.txt
- Run
main.py
to process the data. - Check the generated files in the
output
folder in the root directory.
If you'd like to contribute to this project:
- Please fork the repository
- Create a new branch for your changes
- Submit a pull request
If you have any issues with the project, feel free to open up an issue.
This project is licensed under the MIT License - see the LICENSE file for details.