A small analysis of a Kaggle dataset of startups from 2015 to 2021, done in Jupyter notebooks.
It covers basic cleaning and visualization of the data.
Partial schema and data standardization was performed with a mix of Excel and Python techniques.
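As an illustration of the Python side of the standardization step, here is a minimal pandas sketch. The column names, sample values, and the `standardize` helper are hypothetical and are not taken from the notebooks; the actual dataset schema may differ.

```python
import pandas as pd

# Hypothetical raw rows; column names and values are illustrative only.
raw = pd.DataFrame({
    "Startup Name ": ["Acme", "acme ", "Beta Labs"],
    "Amount in USD": ["1,000,000", "undisclosed", "250000"],
    "Date": ["01/02/2015", "15/07/2016", "not a date"],
})


def standardize(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy with snake_case columns and typed values (assumed schema)."""
    out = df.copy()
    # Normalize column names: trim, lowercase, spaces to underscores.
    out.columns = (
        out.columns.str.strip().str.lower().str.replace(r"\s+", "_", regex=True)
    )
    # Trim and title-case names so "acme " and "Acme" count as one startup.
    out["startup_name"] = out["startup_name"].str.strip().str.title()
    # Drop thousands separators; values that are not numbers become NaN.
    out["amount_in_usd"] = pd.to_numeric(
        out["amount_in_usd"].str.replace(",", "", regex=False), errors="coerce"
    )
    # Parse day/month/year dates; values that do not match become NaT.
    out["date"] = pd.to_datetime(out["date"], format="%d/%m/%Y", errors="coerce")
    return out


clean = standardize(raw)
print(clean)
```

Coercing unparseable values to missing rather than dropping rows keeps the record count intact, so the gaps can be reviewed later instead of silently disappearing.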
Questions for which trends were found:
- % of Funding Across Years from 2015 - 2021
- No. of Funding Rounds in Different Industries
- Breakup of Verticals within Consumer Internet Industry
- Most Funded Startups
- Startups with the Most Funding Rounds
- Funding with respect to Location
- Most Active Investors
- Distribution of Most Common Funding Stages
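Several of the questions above reduce to group-by aggregations. The sketch below shows how a few of them could be computed with pandas, assuming one row per funding round. The column names (`startup`, `year`, `industry`, `amount_usd`) and the sample rows are hypothetical and do not come from the dataset.

```python
import pandas as pd

# Hypothetical cleaned data: one row per funding round.
df = pd.DataFrame({
    "startup": ["A", "A", "B", "C"],
    "year": [2015, 2016, 2016, 2016],
    "industry": [
        "Consumer Internet", "Consumer Internet",
        "Technology", "Consumer Internet",
    ],
    "amount_usd": [100.0, 300.0, 500.0, 100.0],
})

# Share of total funding per year, in percent.
pct_by_year = df.groupby("year")["amount_usd"].sum() / df["amount_usd"].sum() * 100

# Number of funding rounds per industry.
rounds_by_industry = df["industry"].value_counts()

# Total funding per startup, largest first.
most_funded = df.groupby("startup")["amount_usd"].sum().sort_values(ascending=False)

# Number of funding rounds per startup, largest first.
rounds_by_startup = df.groupby("startup").size().sort_values(ascending=False)

print(pct_by_year, rounds_by_industry, most_funded, rounds_by_startup, sep="\n\n")
```

Each resulting Series can be passed to `.plot(kind="bar")` for the corresponding chart, provided matplotlib is installed.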
Automating re-standardisation of the dataset schema, with a focus on data accuracy, to reduce data bias as far as possible.