Hi, the Eclipse text encoding setting has nothing to do with this; you do not need to change it.
The CAS Visual Debugger (CVD) can handle UTF-8 fine, so what you tried should work. Make sure the file you saved is really UTF-8, and not something else. Also, make sure to set the code page in CVD to UTF-8 *before* you load the file.

Another thing you can do to make sure it's not a CVD problem is to copy the text from Notepad and paste it straight into the CVD text window. Note that you can edit and save files in CVD.

There's also a possibility that you don't have an appropriate font installed that Java can use to render the Telugu. I don't think that's the case though, since you're seeing the text fine in Notepad.

If all this doesn't help, you can attach the file, either to an email or a Jira issue, and I'll take a look.

--Thilo

chandra sekhar wrote:
Respected Sir,

I am developing an annotator which finds the "nouns" present in a text document written in my local language (Telugu, a language spoken by 45 crore people of India). When I open the text file containing the Telugu text to be analyzed in the CAS Visual Debugger, I get all squares. I stored the text in UTF-8 format using Notepad.

These are the steps I did (I am using Eclipse version 3.3.1.1):

1) Window -> Preferences -> General -> Workspace, then under Text file encoding I selected UTF-8.
2) In the CAS Visual Debugger I used File -> Code Page and set it to UTF-8.

Even after these two steps I am unable to display the text; I still get squares. When I checked Help -> About Eclipse SDK -> Configuration Details, I found file.encoding=Cp1252.

Please help me in this regard.

------------------------------------------------------------
please visit : http://www.parivaar.org/
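
P.S. On the "make sure the file you saved is really UTF-8" point in the reply: one way to check this outside of CVD and Eclipse is a small standalone Java program. The following is only a rough sketch, not part of UIMA or CVD. The class name `Utf8Check` and its two helper methods are made up for illustration, and the file path is whatever you pass on the command line. It decodes the file strictly as UTF-8 and reports whether any malformed byte sequence is found. It also reports a leading byte order mark (BOM), on the assumption that the editor may have written one when saving as UTF-8.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper class for illustration; not part of UIMA or CVD.
class Utf8Check {

    /** Returns true if the bytes decode as UTF-8 without any malformed sequence. */
    public static boolean isValidUtf8(byte[] bytes) {
        // REPORT makes the decoder throw instead of silently substituting U+FFFD.
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    /** Returns true if the bytes start with the UTF-8 byte order mark EF BB BF. */
    public static boolean hasUtf8Bom(byte[] bytes) {
        return bytes.length >= 3
                && (bytes[0] & 0xFF) == 0xEF
                && (bytes[1] & 0xFF) == 0xBB
                && (bytes[2] & 0xFF) == 0xBF;
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: java Utf8Check <path-to-text-file>");
            return;
        }
        // Read raw bytes so that the platform default (file.encoding) plays no role.
        byte[] bytes = Files.readAllBytes(Paths.get(args[0]));
        System.out.println("valid UTF-8: " + isValidUtf8(bytes));
        System.out.println("starts with UTF-8 BOM: " + hasUtf8Bom(bytes));
    }
}
```

If this reports that the file is not valid UTF-8, it was probably saved in some other encoding (for example a UTF-16 variant or a Windows code page), and re-saving it as UTF-8 would be the first thing to try. Note that a pass only shows the bytes are well-formed UTF-8; it cannot prove the text was intended as UTF-8. Because the program reads raw bytes, the file.encoding=Cp1252 setting does not affect the result.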
