Author: Zhecheng Li && Professor: Ndapa Nakashole
Install the dependencies with:
pip install -r requirements.txt
All datasets are already in the GitHub repo.
- If you want to train with traditional attention and mean embedding output, use:
python main.py --run "encoder_classic_mean"
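The mean embedding output averages the encoder's token representations into a single sequence vector. Below is a minimal sketch of masked mean pooling, assuming a PyTorch encoder that returns hidden states of shape (batch, seq_len, d_model); the tensor names are illustrative, not the repo's actual variables.

```python
import torch

def mean_pool(token_embeddings: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    """Average encoder outputs over real (non-padding) tokens.

    token_embeddings: (batch, seq_len, d_model) hidden states from the encoder
    attention_mask:   (batch, seq_len) with 1 for real tokens, 0 for padding
    """
    mask = attention_mask.unsqueeze(-1).float()      # (batch, seq_len, 1)
    summed = (token_embeddings * mask).sum(dim=1)    # (batch, d_model)
    counts = mask.sum(dim=1).clamp(min=1e-9)         # avoid division by zero
    return summed / counts                           # (batch, d_model)
```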
- If you want to train with sliding window attention and mean embedding output, use:
python main.py --run "encoder_window_attention"
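Sliding window attention restricts each token to attend only to neighbors within a fixed window. A minimal sketch of how such a mask can be built, assuming PyTorch; the window size and mask convention are illustrative, not the repo's defaults.

```python
import torch

def sliding_window_mask(seq_len: int, window: int) -> torch.Tensor:
    """Boolean mask of shape (seq_len, seq_len): True where attention is allowed,
    i.e. query i may attend to key j only when |i - j| <= window."""
    idx = torch.arange(seq_len)
    return (idx.unsqueeze(0) - idx.unsqueeze(1)).abs() <= window

# Typical use: block disallowed positions before the softmax.
scores = torch.randn(1, 8, 128, 128)   # (batch, heads, q_len, k_len)
scores = scores.masked_fill(~sliding_window_mask(128, window=16), float("-inf"))
```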
-
If you want to train with
alibi relative positional embedding
andmean embedding output
, use:python main.py --run "encoder_alibi"
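ALiBi replaces learned positional embeddings with a head-specific linear penalty on query-key distance that is added to the attention scores. A minimal sketch, assuming PyTorch and a number of heads that is a power of two; the symmetric (bidirectional) distance used here for an encoder is an assumption about what the repo does.

```python
import torch

def alibi_bias(seq_len: int, num_heads: int) -> torch.Tensor:
    """Return a (num_heads, seq_len, seq_len) bias added to attention scores.

    Head h gets slope 2**(-8*(h+1)/num_heads); the bias is -slope * |i - j|,
    so more distant tokens are penalized more strongly.
    """
    slopes = torch.tensor([2 ** (-8 * (h + 1) / num_heads) for h in range(num_heads)])
    pos = torch.arange(seq_len)
    distance = (pos.unsqueeze(0) - pos.unsqueeze(1)).abs()   # (seq_len, seq_len)
    return -slopes.view(-1, 1, 1) * distance                 # broadcast over heads

# scores: (batch, heads, seq_len, seq_len); the bias is added before the softmax.
scores = torch.randn(2, 8, 64, 64) + alibi_bias(64, 8)
```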
- If you want to train with disentangled attention patterns (as in DeBERTa) and mean embedding output, use:
python main.py --run "encoder_deberta"
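Disentangled attention splits the attention score into content-to-content, content-to-position, and position-to-content terms computed from separate content and relative-position projections. A simplified single-head sketch, assuming PyTorch; the bucketing, projections, and scaling here are illustrative and will differ from the repo's implementation.

```python
import torch
import torch.nn as nn

class DisentangledAttention(nn.Module):
    def __init__(self, d_model: int, max_rel: int = 32):
        super().__init__()
        self.max_rel = max_rel
        self.q_c = nn.Linear(d_model, d_model)   # content query projection
        self.k_c = nn.Linear(d_model, d_model)   # content key projection
        self.q_r = nn.Linear(d_model, d_model)   # projection of relative-position embeddings (query side)
        self.k_r = nn.Linear(d_model, d_model)   # projection of relative-position embeddings (key side)
        self.rel_emb = nn.Embedding(2 * max_rel + 1, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model)
        seq_len = x.size(1)
        pos = torch.arange(seq_len, device=x.device)
        rel = (pos.unsqueeze(0) - pos.unsqueeze(1)).clamp(-self.max_rel, self.max_rel) + self.max_rel
        r = self.rel_emb(rel)                                 # (seq_len, seq_len, d_model)

        qc, kc = self.q_c(x), self.k_c(x)
        c2c = torch.einsum("bid,bjd->bij", qc, kc)            # content-to-content
        c2p = torch.einsum("bid,ijd->bij", qc, self.k_r(r))   # content-to-position
        p2c = torch.einsum("bjd,ijd->bij", kc, self.q_r(r))   # position-to-content
        scores = (c2c + c2p + p2c) / (3 * x.size(-1)) ** 0.5
        return torch.softmax(scores, dim=-1) @ x              # values = x here, for brevity
```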
- If you want to train with an extra [CLS] token to represent the final embedding output, use:
python main.py --run "encoder_cls_token"
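In this variant a learnable [CLS] embedding is prepended to the sequence and its final hidden state serves as the sequence representation. A minimal sketch, assuming PyTorch; `encoder` is a placeholder for the repo's encoder stack, not its actual module name.

```python
import torch
import torch.nn as nn

class ClsPooler(nn.Module):
    def __init__(self, encoder: nn.Module, d_model: int):
        super().__init__()
        self.encoder = encoder
        self.cls = nn.Parameter(torch.randn(1, 1, d_model))   # learnable [CLS] embedding

    def forward(self, token_embeddings: torch.Tensor) -> torch.Tensor:
        # token_embeddings: (batch, seq_len, d_model)
        cls = self.cls.expand(token_embeddings.size(0), -1, -1)
        hidden = self.encoder(torch.cat([cls, token_embeddings], dim=1))
        return hidden[:, 0]                                    # final state at the [CLS] position
```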
You can change the parameters in main.py, but you should be able to get around 86-87% accuracy with the default values.
- If you want to train the traditional decoder-only model for text generation, use:
python main.py --run "decoder"
You can also change the parameters in main.py, but you should be able to get a loss of around 4.8 with the default values.
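Decoder-only training combines a causal attention mask with a next-token cross-entropy objective. A minimal sketch, assuming PyTorch; `logits` and `tokens` are placeholders, not the repo's actual interfaces.

```python
import torch
import torch.nn.functional as F

def causal_mask(seq_len: int) -> torch.Tensor:
    """True where attention is allowed: position i may attend only to positions <= i."""
    return torch.tril(torch.ones(seq_len, seq_len, dtype=torch.bool))

def next_token_loss(logits: torch.Tensor, tokens: torch.Tensor) -> torch.Tensor:
    """logits: (batch, seq_len, vocab); tokens: (batch, seq_len).
    Each position predicts the *next* token, so targets are shifted by one."""
    return F.cross_entropy(
        logits[:, :-1].reshape(-1, logits.size(-1)),   # predictions for positions 0..T-2
        tokens[:, 1:].reshape(-1),                     # targets are positions 1..T-1
    )
```

For reference, a cross-entropy loss of about 4.8 corresponds to a perplexity of roughly exp(4.8) ≈ 120.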
You are welcome to discuss any issues you encounter while running the code in this repository. Feel free to open an issue or contact me directly at zhl186@ucsd.edu.