Welcome to the GitHub repository for the project "Billion-Dollar Dreams: India's Startup Saga." This repository contains the datasets, code, and visualizations used in the analysis of India's dynamic startup landscape.
This project aims to analyze the Indian startup ecosystem by exploring trends, sectoral dominance, and the impact of funding on startup success. The analysis uses real-world data to provide insights into the growth patterns, challenges, and opportunities within the startup landscape.
India has emerged as a global startup hub, with rapid growth driven by technology, innovation, and government initiatives. This project provides a comprehensive analysis of the ecosystem, covering aspects such as geographical distribution, valuation trends, sectoral performance, and the role of funding.
For a more detailed discussion and narrative on the findings of this analysis, please read the accompanying Medium article. This article offers a comprehensive overview of the insights derived from the data.
The datasets used in this project are available in the datasets folder. These include:
- Indian Startups Dataset: Detailed information on Indian startups, including location, sector, funding rounds, and valuations.
Please refer to the datasets/
folder for the raw data files used in the analysis.
Visual representations play a crucial role in understanding data-driven insights. The visualizations folder contains all the charts, graphs, and maps used in the project. These include:
- Startup Distribution Map: A geographical visualization of startup concentrations across major Indian cities.
- Valuation Growth Graph: A line chart showing the increase in startup valuations over time.
- Sectoral Dominance Chart: A breakdown of startups by sector, highlighting the leading industries.
Please refer to the visualizations/
folder for accessing the charts that are the outputs in the analysis.
The following tools and libraries were used to conduct the analysis and create visualizations:
- Python: Core programming language used for data manipulation and analysis.
- Pandas: For data cleaning and manipulation.
- NumPy: For numerical operations.
- Matplotlib & Seaborn: For data visualization.
- Jupyter Notebook: For interactive data exploration and analysis.
This analysis provides key insights into the Indian startup ecosystem, including:
- Geographical Concentration: Identification of major startup hubs.
- Valuation Trends: Understanding the growth trajectory of startup valuations.
- Sectoral Performance: Insights into which sectors are leading the market.
- Funding Dynamics: Exploration of the relationship between funding rounds and startup success.
If you have any questions or want to discuss this project further, feel free to reach out:
- Name: Nirmal Sai Swaroop J
- Email: nirmalsai22@gmail.com
- GitHub: Profile
Thank you for exploring this project! Your feedback and contributions are highly valued.