Angry Bits

Words on bytes and bits

Unicode in Python: Common Pitfalls

Most people don't get Unicode right. It is not their fault, I believe the biggest trouble here is that people underestimate the complexity of Unicode.

I've been working with text manipulation tasks using Python for a long time. During the last ten years I've been collecting thousands ...