Context Navigation

Changes between Version 2 and Version 3 of UnicodeEncodeError

-              v2
+              v3
 }}}
+== ... but I was decoding? == #decode
+A more subtle and confusing way to trigger this error is when trying to ''decode'' an `unicode` string. Wait... decoding a sequence of unicode characters? Does that even make sense? Well, normally not, but Python interprets that as a shortcut for decoding the `str` object obtained from that unicode string encoded using the default encoding. So we have the following equivalence:
+{{{
+u"string".decode(enc) == str(u"string").decode(enc)
+}}}
+That could be called a `u"cadeau empoisonné"` ;-)
+Of course, if `u"string"` can't be first encoded the naive way in order to produce that temporary `str` object, it will trigger the same error we saw above:
+{{{
+>>> u'chaîne de caractères'.decode('utf-8')
+Traceback (most recent call last):
+  File "<stdin>", line 1, in ?
+UnicodeDecodeError: 'ascii' codec can't decode byte 0xee
+                    in position 3: ordinal not in range(128)
+}}}
+In practice, this happens when an API designed to handle a `str` object suddenly receive an `unicode` object. It's "normal" to call `s.decode(...)` if `s` is a `str` object, but this will fail with the above confusing error if `s` is actually an `unicode` object containing characters not present in the ASCII character set.
 ----
 See also: TracDev/UnicodeGuidelines, UnicodeDecodeError