Web scraping tools for eVTOL aircraft data extraction maintaining a dataframe
- About
- Use Cases
- Installation
- File Descriptions
- Licensing, Authors, and Acknowledgements
- View the notebooks
This repository contains tools for scraping eVTOL aircraft data from two websites:
Each set of scraper functions, contained in individual packages, follows an identical process architecture show below.
Source data for:
- Market research
- Competitive intelligence
- BeautifulSoup
- Standard libraries across Python 3* Anaconda distribution.
-
Jupyter Notebook for analysis:
- evtolnews_scrape.ipynb
- transportup_scrape.ipynb
-
Python files for custom functions
- transportup_funcs.py
- evtolnews_scrapefuncs.py
-
.csv files for read and writing to directory and results dataframes
-
tu_directory_{date}.csv --> dataframe of aircraft links (to scrape)
-
tu_results_df_{date}.csv --> dataframe of scraped and cleaned aircraft data
-
evtolnews_directory_{date}.csv --> dataframe of aircraft links (to scrape)
-
evtolnews_results_df_{date}.csv --> dataframe of scraped and cleaned aircraft data
-
All code is open for any and all usage.
View the evtol.news scraper at Jupyter's NBviewer site, click here.
View the TransportUP scraper at Jupyter's NBviewer site, click here.