By dotCore
via research.swtch.com
Submitted: Jan 02 2013 / 01:32
UTF-8 is a way to encode Unicode code points—integer values from 0 through 10FFFF—into a byte stream, and it is far simpler than many people realize. The easiest way to make it confusing or complicated is to treat it as a black box, never looking inside. So let's start by looking inside.
Add your comment