'big5' encoding not detected #4299
Comments
We rely on
Something strange is going on then...
I had something similar recently, so I decided to take a look. The encoding that is selected for this response seems to be coming from The detected character set by One solution would be to remove the assumption in
This is covered by #2086. However, I should note that there is no way that Requests will ever look into the HTML: only that it will use chardet.
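Since Requests will not inspect the document itself, the caller can do so. The following is a minimal sketch, not part of Requests: the helper names `charset_from_content_type` and `declared_encoding` are hypothetical, and it assumes the feed carries an XML declaration with an `encoding` attribute on its first line. It uses only the standard library.

```python
import re
from email.message import Message

# Matches an XML declaration such as <?xml version="1.0" encoding="big5"?>
XML_DECL_RE = re.compile(
    rb'^\s*<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z0-9._-]+)["\']'
)


def charset_from_content_type(content_type):
    """Return the charset parameter of a Content-Type value, or None."""
    msg = Message()
    msg["Content-Type"] = content_type
    return msg.get_content_charset()


def declared_encoding(content_type, body):
    """Hypothetical helper: prefer the header charset, then the XML declaration."""
    charset = charset_from_content_type(content_type) if content_type else None
    if charset:
        return charset
    # Only look at the start of the body; the declaration must come first.
    match = XML_DECL_RE.match(body[:200])
    if match:
        return match.group(1).decode("ascii").lower()
    return None
```

With a Requests response `r`, one possible use is `r.encoding = declared_encoding(r.headers.get("Content-Type"), r.content) or r.encoding` before reading `r.text`.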
See this feed: http://www.digitimes.com.tw/tech/rss/xml/xmlrss_10_0.xml
The top line of the feed is:
But the detected encoding is iso-8859-1. It should be big5.
Expected Result
r.encoding should be big5.
Actual Result
r.encoding is iso-8859-1.
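A possible caller-side workaround, sketched under the assumption that the server sends a `text/*` Content-Type without a `charset` parameter (in which case Requests falls back to ISO-8859-1): override `r.encoding` with `r.apparent_encoding`, the value guessed from the body. The function name `prefer_detected_encoding` is hypothetical, and the `SimpleNamespace` object only stands in for a real response so the snippet runs without network access.

```python
from types import SimpleNamespace


def prefer_detected_encoding(response):
    """Use the detected encoding when the header declares no charset."""
    content_type = response.headers.get("Content-Type", "")
    if "charset=" not in content_type.lower():
        # No explicit charset: trust the detector over the header default.
        response.encoding = response.apparent_encoding
    return response.encoding


# Stand-in for a response like the one described in this issue (assumed values).
fake = SimpleNamespace(
    headers={"Content-Type": "text/xml"},
    encoding="ISO-8859-1",
    apparent_encoding="Big5",
)
chosen = prefer_detected_encoding(fake)
```

With a real response, call `prefer_detected_encoding(r)` before accessing `r.text`. Detection is a guess, so an explicit `r.encoding = "big5"` is more reliable when the encoding is known in advance.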
Reproduction Steps
System Information