EMNLP Assignment 1

code structure

Note that the data directory (named "data") must be placed in the same directory as the .sh and .py files, and it must contain four files: sst_train.csv, sst_test.csv, yelp_train.csv, and yelp_test.csv.
The structure of the working directory should be:

-main.py
-model.py
-config.py
-cleandata.py
-utils.py
-run.sh
-data
|----sst_train.csv
|----sst_test.csv
|----yelp_train.csv
|----yelp_test.csv
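
Before running anything, it can help to confirm that this layout is in place. The following is a minimal sketch and is not part of the repository: the helper name `missing_data_files` is hypothetical, and the only thing it assumes is the four file names listed above.

```python
from pathlib import Path

# File names taken from the layout described above.
REQUIRED_FILES = ("sst_train.csv", "sst_test.csv", "yelp_train.csv", "yelp_test.csv")


def missing_data_files(root="."):
    """Return the required CSV files that are absent from <root>/data."""
    data_dir = Path(root) / "data"
    return [name for name in REQUIRED_FILES if not (data_dir / name).is_file()]


if __name__ == "__main__":
    missing = missing_data_files()
    if missing:
        print("Missing from data/:", ", ".join(missing))
    else:
        print("All four data files are present.")
```

Run it from the directory that holds the .py and .sh files, so that `data` is resolved relative to the same place the scripts expect.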

main.py contains the main routine: loading the data, pre-processing it, training the model, and evaluating it.

model.py contains the DIY model and the helper functions it relies on.
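
This README does not describe the model's internals. The repository name suggests a Naive Bayes classifier, so purely as an illustration, here is a minimal sketch of multinomial Naive Bayes with additive smoothing. The class name, the method names, and the reading of alpha as the smoothing constant are all assumptions; this is not the code in model.py.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesSketch:
    """Multinomial Naive Bayes with add-alpha smoothing (illustrative only)."""

    def __init__(self, alpha=1.0):
        # alpha must be positive, otherwise unseen words give log(0).
        self.alpha = alpha

    def fit(self, docs, labels):
        # docs: list of token lists; labels: one class label per document.
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for tokens, label in zip(docs, labels):
            self.word_counts[label].update(tokens)
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def _log_score(self, tokens, label):
        # log P(label) + sum over tokens of log P(token | label), smoothed.
        total = sum(self.word_counts[label].values())
        denom = total + self.alpha * len(self.vocab)
        score = math.log(self.class_counts[label] / sum(self.class_counts.values()))
        for tok in tokens:
            if tok in self.vocab:  # skip words never seen in training
                score += math.log((self.word_counts[label][tok] + self.alpha) / denom)
        return score

    def predict(self, tokens):
        # Pick the class with the highest log-posterior score.
        return max(self.class_counts, key=lambda c: self._log_score(tokens, c))
```

Working in log space avoids numerical underflow when many small probabilities are multiplied, which is the usual reason for this design.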

config.py parses command-line options with argparse.ArgumentParser. Users can set --dataset and --alpha from the shell.
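
As a rough sketch of what such option parsing might look like, the snippet below defines the two options named above. The function name `get_args`, the `choices` list, the type and default of `--alpha`, and the help strings are assumptions for illustration. The only detail taken from this README is that the program runs on SST by default.

```python
import argparse


def get_args(argv=None):
    """Parse the two options the README mentions; defaults are assumptions."""
    parser = argparse.ArgumentParser(description="EMNLP Assignment 1")
    # The README states that the program runs on SST by default.
    parser.add_argument("--dataset", choices=["sst", "yelp"], default="sst",
                        help="dataset to train and evaluate on")
    # Assumed meaning and default: a smoothing strength of 1.0.
    parser.add_argument("--alpha", type=float, default=1.0,
                        help="smoothing strength (assumed meaning)")
    return parser.parse_args(argv)


if __name__ == "__main__":
    args = get_args()
    print(args.dataset, args.alpha)
```

Passing `argv` explicitly, for example `get_args(["--dataset", "yelp", "--alpha", "0.5"])`, makes the parser easy to exercise without touching the real command line.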

cleandata.py contains the data pre-processing functions.

utils.py defines metrics and other basic functions.

Dependencies

Python 3.8.9 (64-bit)

NLTK 3.5

numpy 1.20.2

Run it now!

Run the program through the shell script by typing one of the following commands, choosing either sst or yelp as the argument.

If you are a macOS user:

bash run_mac.sh {sst, yelp}

Otherwise:

bash run.sh {sst, yelp}

The argument to the .sh file (the chosen dataset) is passed on to the program. By default, it runs on SST-5.

Rubywong123/NaiveBayes_classifier
