The project forecasts company's stocks price taking into account its historical performance, sentimental analysis on Edgar reports, legal proceeding sections and sentimental anaylsis on global news concerning the companies.
This repo presents code in 4 module given below:
3 Mins Working Demo https://www.youtube.com/watch?v=5HZyUnGaipw
- Pandas
- Numpy
- Beautiful Soup
- Keras
- Scikit-Learn
- PostgreSQL
- SQLAlchemy
- Matplotlib
- AWScli
- NLTK
- Gensim
- Boto3
- Paramiko
- Dash
filegen.py
- creates a CSV file which contains all the information on edgar reports including 10-K and 10-Qbulkdl.py
- downloads all the Edgar reports provided in the cik master csv fileclean_rawfile.py
- cleans all the edgar reports provided in the csv file by removing all the html tagssentiment_score.py
- creates a csv file by running sentiments analysis on all the clean files created from edgar reportssimilarity_legalproceedings.ipynb
- extracts the legal proceedings section from Edgar reports and calculate the similarity index from all its previous years files
prediction_meanprice_sentiment.py
- builds and trains LSTM models for 35 companies mentioned in the tickerprediction_90_days.py
- forecasts 90 days values from the models createdmutualfund_data.ipynb
- creates data(csv file) for the top mutual fundsmutual_funds.py
- builds and trains LSTM for the top mutuals funds
lambda_function.py
-
sentiment_analysis_news.ipynb
-bigquery.py
-
Remaining files in the assets and pages are related to the web UI created for this project
python3 index.py
- to run the application