Be aware of encodings different to ASCII #46

leandro-lucarella-sociomantic · 2013-10-17T16:57:30Z

#45 exposed one problem when handling text encoded with something different from ASCII. The problem goes much deeper than that, and to properly support any encoding across the whole program, every usage of a str/unicode object must be revised.

The text was updated successfully, but these errors were encountered:

pjz · 2014-07-23T16:05:49Z

Having done this a bit before, you likely want to start by adding:

from __future__ import unicode_literals

which will make all literals be unicode without having to put a u in front of them. Then you just have to fix the places where you actually want to be messing with bytes directly.

For more future compatibility with python 3 you probably also want to add:

from __future__ import absolute_import
from __future__ import division
from __future__ import print_function

And then fix all the bugs :)

http://stackoverflow.com/questions/5937251/writing-python-2-7-code-that-is-as-close-to-python-3-x-syntax-as-possible has other tips if you run into particular issues.

leandro-lucarella-sociomantic · 2014-07-23T17:38:06Z

Thanks for the tips, I know the current unicode situation is horrible. Eventually we will need to take care of that, in Python 2.x is very hard to have "unicode-correctness", at least in my experience it was always a mess. Luckily Python 3.x took care of that :)

mihails-strasuns-sociomantic mentioned this issue Oct 17, 2013

UnicodeEncodeError thrown in pull list #45

Closed

leandro-lucarella-sociomantic modified the milestones: v0.11, Future Dec 17, 2015

leandro-lucarella-sociomantic added the prio-high label Dec 17, 2015

leandro-lucarella-sociomantic mentioned this issue Dec 17, 2015

Verbose cloning causes UnicodeDecodeError in debugf(). #167

Closed

leandro-lucarella-sociomantic modified the milestone: v1.0.0 Dec 29, 2016

leandro-lucarella-sociomantic mentioned this issue Sep 12, 2017

Port to Python3 #224

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Be aware of encodings different to ASCII #46

Be aware of encodings different to ASCII #46

leandro-lucarella-sociomantic commented Oct 17, 2013

pjz commented Jul 23, 2014

leandro-lucarella-sociomantic commented Jul 23, 2014 •

edited

Loading

Be aware of encodings different to ASCII #46

Be aware of encodings different to ASCII #46

Comments

leandro-lucarella-sociomantic commented Oct 17, 2013

pjz commented Jul 23, 2014

leandro-lucarella-sociomantic commented Jul 23, 2014 • edited Loading

leandro-lucarella-sociomantic commented Jul 23, 2014 •

edited

Loading