This project explores a dataset of car specifications and performs various data cleaning, analysis, and visualization tasks.
The goal of this project is to gain insights into the car dataset and demonstrate fundamental data manipulation techniques using Python and Pandas.
-
Data Loading: The dataset is loaded from a raw CSV file hosted on GitHub.
-
Data Cleaning:
- Null values in the 'Cylinders' column are replaced with the mean value.
-
Value Count Analysis:
-
Filtering:
- Records with 'Origin' as 'Asia' or 'Europe' are extracted.
-
Removing Unwanted Records:
- Records with 'Weight' greater than 4000 are removed.
-
Column Manipulation:
- All values in the 'MPG_City' column are increased by 3 using a lambda function.
- Pandas
- Matplotlib
- Make sure you have the necessary libraries installed.
- Replace the placeholder URL with the actual raw URL of your dataset.
- Run the notebook cells sequentially.
- Explore more advanced data analysis techniques, such as regression or clustering.
- Create interactive visualizations.
- Incorporate additional data sources for a richer analysis.
To add the image here, follow these steps:
- Save the bar chart as a PNG file. You can do this by right-clicking on the chart in your notebook and selecting "Save Image As...".
- Upload the PNG file to your GitHub repository.
- Reference the image in the README using Markdown syntax: