UTF-8
UTF-8 stands for Unicode Transformation Format, 8-bit Encoding. UTF-8 is an encoding of Unicode which is more efficient for storage of English (ASCII), whereas other language data is expanded and can be represented by up to four bytes.
For example, English (ASCII) characters use one byte for each character, accented European characters use two bytes, and Asian languages use three bytes for each character.