UTF-8 Encoding // UTF-8 is a compromise character encoding: it is as compact as ASCII when the file is plain English text, but it can also represent any Unicode character.
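
To make the "as compact as ASCII" claim concrete, here is a minimal Python sketch that counts the bytes UTF-8 uses for a few strings. The sample strings and the helper name `utf8_len` are illustrative assumptions, not taken from the linked page.

```python
# Minimal sketch: how many bytes UTF-8 needs for different kinds of text.
# The sample strings and the helper name utf8_len are illustrative only.

def utf8_len(text: str) -> int:
    """Return the number of bytes in the UTF-8 encoding of text."""
    return len(text.encode("utf-8"))

samples = {
    "English": "hello",        # ASCII letters: 1 byte each
    "Russian": "привет",       # Cyrillic letters: 2 bytes each
    "Japanese": "こんにちは",    # hiragana: 3 bytes each
    "Emoji": "😀",             # outside the Basic Multilingual Plane: 4 bytes
}

for name, text in samples.items():
    print(f"{name}: {len(text)} characters, {utf8_len(text)} bytes")

# Pure ASCII text produces identical bytes in ASCII and in UTF-8.
ascii_is_unchanged = "hello".encode("utf-8") == "hello".encode("ascii")
```

Plain English text costs nothing extra, while other scripts use two to four bytes per character.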

The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) - Joel on Software

Text encodings ASCII (Windows 1251, CP866, KOI8-R) and Unicode (UTF 8, 16, 32): how to fix the problem of garbled characters ("krakozyabry", i.e. mojibake) | creating, promoting, and earning money from a website
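
A hedged sketch of how such garbled text typically arises and how it can sometimes be undone, assuming the text was written as UTF-8 and then wrongly read as Windows-1251. The sample word and variable names are illustrative and not taken from the linked article.

```python
# Minimal sketch of mojibake: UTF-8 bytes misread as Windows-1251.
# The sample word and variable names are illustrative assumptions.

original = "Привет"

# Step 1: the text is stored as UTF-8 bytes.
raw = original.encode("utf-8")

# Step 2: a program wrongly decodes those bytes as Windows-1251,
# producing a run of unrelated Cyrillic letters and symbols.
garbled = raw.decode("cp1251")
print(garbled)

# Step 3: if no bytes were lost, reversing the wrong decode and
# decoding as UTF-8 restores the original text.
repaired = garbled.encode("cp1251").decode("utf-8")
print(repaired)
```

The repair only works when the wrong decoding step was lossless. If the misreading replaced or dropped bytes, the original cannot be recovered this way.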

Convert your text files from any encoding to any other. The screenshot shows Japanese, English and Thai text encoded as UTF-8 Unicode. Conversion to the legacy Thai code page would lose the Japanese characters.
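
The loss described above can be sketched in Python, assuming `cp874` (Python's codec for the Windows Thai code page) stands in for the "legacy Thai code page". The sample string and the helper name `can_encode` are illustrative, not part of the linked tool.

```python
# Minimal sketch: converting mixed-script text to a legacy Thai code page.
# cp874 is assumed here as the legacy Thai encoding; the sample text and
# the helper name can_encode are illustrative only.

def can_encode(text: str, encoding: str) -> bool:
    """Return True if every character of text exists in the given encoding."""
    try:
        text.encode(encoding)
        return True
    except UnicodeEncodeError:
        return False

text = "Hello สวัสดี こんにちは"  # English, Thai, Japanese

# UTF-8 can represent all three scripts and round-trips without loss.
utf8_roundtrip = text.encode("utf-8").decode("utf-8")

# The Thai code page has no Japanese characters, so a strict conversion fails.
fits_in_thai = can_encode(text, "cp874")

# With errors="replace", each unencodable character becomes "?".
lossy = text.encode("cp874", errors="replace").decode("cp874")
print(lossy)
```

Checking with something like `can_encode` before converting makes the data loss visible instead of silently writing question marks.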

Characters, Symbols and the Unicode Miracle - Computerphile // How did we get to UTF-8 as the web standard? Tom Scott capsulizes the "Unicode Miracle" in a few minutes.