The dataset folder was downloaded from here https://www.kaggle.com/datasets/venky73/spam-mails-dataset?resource=download.
The final result achieves about 95% accuracy.
Resources:
- A pdf about naive bayes spam filter: https://courses.cs.washington.edu/courses/cse312/18sp/lectures/naive-bayes/naivebayesnotes.pdf
- A wikipedia article https://en.wikipedia.org/wiki/Naive_Bayes_spam_filtering