A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language
-
Updated
Feb 4, 2024 - Jupyter Notebook
A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language
Bengali/Bangla Fake Review Detection Dataset
This study addresses the gap in translating Bangla regional dialects into standard Bangla by creating a large-scale multilingual benchmark dataset of 32,500 sentences in Bangla, Banglish, and English, representing five regional Bangla dialects such as Sylheti, Chittagong, Mymensingh, Noakhali, and Barishal.
Add a description, image, and links to the bangla-bert-base topic page so that developers can more easily learn about it.
To associate your repository with the bangla-bert-base topic, visit your repo's landing page and select "manage topics."