Skip to content
This repository has been archived by the owner on May 10, 2023. It is now read-only.

[ur] Urdu sentences added in Roman Script #446

Closed
omer-beg opened this issue May 28, 2021 · 2 comments
Closed

[ur] Urdu sentences added in Roman Script #446

omer-beg opened this issue May 28, 2021 · 2 comments

Comments

@omer-beg
Copy link

Someone has copied a sentiment analysis dataset in Roman Urdu and added it here. The source is: https://github.com/Smat26/Roman-Urdu-Dataset/blob/master/Dataset/Roman%20Urdu%20DataSet.csv

The problem is that this dataset does not contain Urdu letters. Please delete this from the repository as it is becoming a hurdle in reviewing sentences and launching Urdu. For example:

Tips ka tour lgaty hain kisi din lol,Neutral,

Source: https://github.com/Smat26/Roman-Urdu-Dataset/blob/master/Dataset/Roman%20Urdu%20DataSet.csv
👍👎
Hahhah aa jou kar lo experience 😂😂,Neutral,

Source: https://github.com/Smat26/Roman-Urdu-Dataset/blob/master/Dataset/Roman%20Urdu%20DataSet.csv
👍👎
apni baat karo bh😂😂😂,Neutral,

Source: https://github.com/Smat26/Roman-Urdu-Dataset/blob/master/Dataset/Roman%20Urdu%20DataSet.csv
👍👎
"or phr at the end ""kher choro hmen kia"" ",Neutral,

Source: https://github.com/Smat26/Roman-Urdu-Dataset/blob/master/Dataset/Roman%20Urdu%20DataSet.csv
👍👎

@MichaelKohler
Copy link
Member

It also seems those are not really Public Domain sentences. I will removed them.

MichaelKohler pushed a commit that referenced this issue May 29, 2021
## [2.4.3](v2.4.2...v2.4.3) (2021-05-29)

### Bug Fixes

* remove Urdu sentences in different script (fixes [#446](#446)) ([163d108](163d108))
@MichaelKohler
Copy link
Member

🎉 This issue has been resolved in version 2.4.3 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants