This repository does the following:
- Downloads all official documents from the Fed's website and converts them into text
- Analyzes word frequencies and trends
To set up the environment:
- Navigate to a command prompt
- Make a virtual environment using Python's `venv` module
- Activate it, then install all required packages with `pip install -r requirements.txt`, as shown below
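For example, on macOS or Linux the setup commands look like this (on Windows, activate with `venv\Scripts\activate` instead):

```sh
python -m venv venv               # create a virtual environment in ./venv
source venv/bin/activate          # activate it
pip install -r requirements.txt   # install the required packages
```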
Perform the following steps in the order shown (illustrative sketches of several steps follow the list):
- Run `setup.py` to configure the `.env` file, which should contain names for the different directories
- Download the PDFs using `fomcscraper.py`
- Convert the PDFs into text (stored in pickle files) and eliminate all punctuation and stop words (i.e. common English words) using `data_processing.py`
- Aggregate word frequencies by word and by date with `word_frequency.py`
- Cache the document types as a Python dictionary by running `generate_doctype.py`
- Separate results by document type (e.g. Beige Books, Blue Books) using `split_doctype.py`
- Create CSV files from the Counter objects with `counter_csv.py`
- Generate n-grams of words using `generate_ngrams.py`
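The sketches below illustrate several of the steps above; all file names, variable names, and URLs in them are hypothetical placeholders rather than the scripts' actual code. For the first step, reading the directory names that `setup.py` writes into `.env` might look like this (assuming the `python-dotenv` package):

```python
import os

from dotenv import load_dotenv  # assumes python-dotenv is installed

load_dotenv()  # read key=value pairs from .env into the environment

# Hypothetical variable names -- the real keys are defined by setup.py
PDF_DIR = os.getenv("PDF_DIR", "pdfs")
TEXT_DIR = os.getenv("TEXT_DIR", "text_pickles")
RESULTS_DIR = os.getenv("RESULTS_DIR", "results")
```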
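Downloading a single PDF, as `fomcscraper.py` does for every document it finds, could be sketched with the `requests` package and a hypothetical URL:

```python
from pathlib import Path

import requests  # assumed to be available

# Hypothetical example URL -- the scraper builds its own list from the Fed's site
url = "https://www.federalreserve.gov/monetarypolicy/files/fomcminutes20200129.pdf"
out_dir = Path("pdfs")
out_dir.mkdir(exist_ok=True)

response = requests.get(url, timeout=30)
response.raise_for_status()
(out_dir / url.rsplit("/", 1)[-1]).write_bytes(response.content)  # save the PDF bytes
```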
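A minimal sketch of the kind of cleaning `data_processing.py` performs, using NLTK's English stop-word list as a stand-in for whatever list the script actually uses:

```python
import pickle
import string
from pathlib import Path

from nltk.corpus import stopwords  # requires a one-time nltk.download("stopwords")

STOP_WORDS = set(stopwords.words("english"))

def clean_text(raw_text):
    """Strip punctuation, lowercase, and drop common English stop words."""
    no_punct = raw_text.translate(str.maketrans("", "", string.punctuation))
    return [word for word in no_punct.lower().split() if word not in STOP_WORDS]

# Hypothetical file names for illustration
tokens = clean_text(Path("fomc_minutes.txt").read_text())
with open("fomc_minutes.pkl", "wb") as f:
    pickle.dump(tokens, f)  # store the cleaned tokens in a pickle file
```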
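Aggregating counts by word and by date, as `word_frequency.py` does, can be sketched with `collections.Counter` and made-up input:

```python
from collections import Counter, defaultdict

# Hypothetical input: cleaned tokens keyed by document date
tokens_by_date = {
    "2020-01-29": ["inflation", "rates", "inflation"],
    "2020-03-15": ["rates", "liquidity"],
}

overall_counts = Counter()
counts_by_date = defaultdict(Counter)

for date, tokens in tokens_by_date.items():
    overall_counts.update(tokens)       # aggregate across all documents
    counts_by_date[date].update(tokens) # aggregate per date

print(overall_counts.most_common(2))   # [('inflation', 2), ('rates', 2)]
print(counts_by_date["2020-01-29"])    # Counter({'inflation': 2, 'rates': 1})
```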
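Turning a `Counter` into a CSV file needs only the standard library; a sketch with a hypothetical output name:

```python
import csv
from collections import Counter

counts = Counter({"inflation": 42, "rates": 37, "liquidity": 12})

with open("word_counts.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["word", "count"])
    writer.writerows(counts.most_common())  # rows sorted by descending count
```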
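Finally, n-grams can be generated by zipping shifted copies of the token list; a minimal sketch:

```python
def ngrams(tokens, n):
    """Return the consecutive n-token windows of the list, in order."""
    return list(zip(*(tokens[i:] for i in range(n))))

tokens = ["raise", "the", "federal", "funds", "rate"]
print(ngrams(tokens, 2))
# [('raise', 'the'), ('the', 'federal'), ('federal', 'funds'), ('funds', 'rate')]
```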