However, with the introduction of Unicode, Romanization is now becoming less necessary. |
This means that a Unicode document can contain any number of characters from any number of languages, without having to worry about clashes between them. |
Oriya's one of the more obscure of the Indic scripts in Unicode. |
However, for various reasons, Unicode sometimes provides a separate code point for a digraph, encoded as a single character. |
Gentium is a Unicode typeface that contains Roman, Greek and Cyrillic characters, including many characters seldom seen in even the most ambitious typefaces. |
As there are two bytes for every letter, we assume we are dealing with a multibyte representation of text, most likely a Unicode encoding. |