url-encoding in browser address bar -
when put non-alpha-numeric symbols in browser address bar, got url-encoded. example, http://ru2.php.net/manual-lookup.php?pattern=привет turns http://ru2.php.net/manual-lookup.php?pattern=%ef%f0%e8%e2%e5%f2.
the question is: 2 percent-prefixed hex digits mean?
they bytes of windows 1251 encoding of cyrillic. since there 6 of them, can't utf-8, since takes 12 bytes of utf-8 6 chars of cyrillic.
the code chart cp1251 can found here: http://en.wikipedia.org/wiki/windows-1251.
just 20 hex space, each of cyrillic characters has numeric value, expressible 2 hex digits.
Comments
Post a Comment