This section describes how to work with non-Unicode characters in the Discovery Framework.
Because the Discovery Framework is Java-based, it can only read Unicode or Latin-1 characters. For files in any other encoding, you can work around this limitation by converting the native-encoded file to ASCII, in which each non-ASCII character is written as a Unicode escape sequence. A converter such as native2ascii does this; it ships with JDK 8 and earlier releases and was removed from the JDK in version 9.
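The following is a minimal sketch of the kind of conversion native2ascii performs, for environments where the tool is not available. It is not part of the Discovery Framework; the class name NativeToAscii, the method name escape, and the command-line argument order are all hypothetical, chosen only for illustration. It reads a file in a given source encoding and writes every character outside 7-bit ASCII as a backslash-u escape with four hex digits.

```java
import java.io.IOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper, not part of the Discovery Framework or the JDK.
class NativeToAscii {

    // Returns a copy of text in which every character above 0x7F is
    // replaced by a backslash, the letter u, and four hex digits.
    // Characters are processed as UTF-16 code units, so a supplementary
    // character becomes two escapes (one per surrogate).
    static String escape(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c > 0x7F) {
                out.append(String.format("\\u%04x", (int) c));
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    // Assumed argument order: <source-encoding> <input-file> <output-file>
    public static void main(String[] args) throws IOException {
        if (args.length != 3) {
            System.err.println("usage: NativeToAscii <encoding> <input> <output>");
            return;
        }
        Charset source = Charset.forName(args[0]);
        // Decode the native file using its real encoding.
        String text = new String(Files.readAllBytes(Paths.get(args[1])), source);
        // After escaping, the content is pure ASCII.
        Files.write(Paths.get(args[2]), escape(text).getBytes(StandardCharsets.US_ASCII));
    }
}
```

Where native2ascii itself is installed, the equivalent call takes the form native2ascii -encoding UTF-8 input.properties output.properties, where both file names are placeholders and the -encoding value should name the actual encoding of the source file.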
Keep in mind the following guidelines:
More information about working with non-Unicode characters can be found on the Liferay Portal Website.