Hi,

The Eclipse text encoding setting has nothing
to do with this, so you do not need to change it.

The CAS Visual Debugger (CVD) handles UTF-8 fine, so
what you tried should work.  Make sure the file
you saved is really UTF-8, and not something else.
Also, make sure to set the code page in CVD to
UTF-8 *before* you load the file.
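
One rough way to check that the file really is UTF-8 is to decode it
strictly and look at its first bytes.  The sketch below is only an
illustration: the class name Utf8Check and its methods are made up for
this mail and are not part of UIMA or CVD, and it assumes Java 7 or
newer.  As far as I know, Notepad's "Unicode" save option writes UTF-16
with a byte order mark, which is what the last check looks for.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper, not part of UIMA or CVD.
class Utf8Check {

    // True if the bytes decode as strict UTF-8; malformed input is
    // reported as an error instead of being silently replaced.
    public static boolean isValidUtf8(byte[] bytes) {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    // True if the bytes start with the UTF-8 byte order mark EF BB BF.
    public static boolean hasUtf8Bom(byte[] bytes) {
        return bytes.length >= 3
                && (bytes[0] & 0xFF) == 0xEF
                && (bytes[1] & 0xFF) == 0xBB
                && (bytes[2] & 0xFF) == 0xBF;
    }

    // True if the bytes start with a UTF-16 byte order mark (FF FE or FE FF).
    public static boolean hasUtf16Bom(byte[] bytes) {
        if (bytes.length < 2) {
            return false;
        }
        int b0 = bytes[0] & 0xFF;
        int b1 = bytes[1] & 0xFF;
        return (b0 == 0xFF && b1 == 0xFE) || (b0 == 0xFE && b1 == 0xFF);
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java Utf8Check <file>");
            return;
        }
        byte[] bytes = Files.readAllBytes(Paths.get(args[0]));
        System.out.println("valid UTF-8: " + isValidUtf8(bytes));
        System.out.println("UTF-8 BOM:   " + hasUtf8Bom(bytes));
        System.out.println("UTF-16 BOM:  " + hasUtf16Bom(bytes));
    }
}
```

If the file is not valid UTF-8, or it starts with a UTF-16 byte order
mark, saving it again from Notepad with the UTF-8 encoding option is
the first thing to try.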

Another thing you can do to make sure it's not
a CVD problem is to copy the text from Notepad
and just paste it into the CVD text window.  Note
that you can edit and save files in CVD.

Also, there's a possibility that you don't have
an appropriate font installed that Java can use
to render the Telugu.  I don't think that's the
case though, since you're seeing the text ok in
Notepad.
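
To rule out the font question, you could ask Java directly which
installed fonts have glyphs for Telugu.  This is a minimal sketch using
the standard java.awt.Font API; the class name TeluguFontCheck and the
sample string are made up for illustration, and it does not tell you
which font CVD itself picks for its text window.

```java
import java.awt.Font;
import java.awt.GraphicsEnvironment;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of UIMA or CVD.
class TeluguFontCheck {

    // The word "Telugu" in Telugu script, written with Unicode escapes
    // so that the encoding of this source file does not matter.
    public static final String SAMPLE = "\u0C24\u0C46\u0C32\u0C41\u0C17\u0C41";

    // Returns the names of the given fonts that have a glyph for every
    // character in text.
    public static List<String> fontsThatCanDisplay(Font[] fonts, String text) {
        List<String> names = new ArrayList<>();
        for (Font font : fonts) {
            // canDisplayUpTo returns -1 when the whole string is displayable.
            if (font.canDisplayUpTo(text) == -1) {
                names.add(font.getFontName());
            }
        }
        return names;
    }

    public static void main(String[] args) {
        Font[] installed =
                GraphicsEnvironment.getLocalGraphicsEnvironment().getAllFonts();
        List<String> usable = fontsThatCanDisplay(installed, SAMPLE);
        if (usable.isEmpty()) {
            System.out.println("No installed font can display the Telugu sample.");
        } else {
            System.out.println("Fonts that can display Telugu:");
            for (String name : usable) {
                System.out.println("  " + name);
            }
        }
    }
}
```

An empty result would suggest that Java cannot find a Telugu font, even
if Notepad can show the text.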

If all this doesn't help, you can attach the file,
either to an email or a Jira issue, and I'll take
a look.

--Thilo

chandra sekhar wrote:
Respected Sir,

I am developing an annotator that finds the nouns in a text document
written in my local language (Telugu, a language spoken by 45 crore
people of India). When I open the text file containing the Telugu text
to be analyzed in the CAS Visual Debugger, I get all squares. I saved
the text in UTF-8 format in Notepad.

These are the steps I did (I am using Eclipse version 3.3.1.1):

1) Window -> Preferences -> General -> Workspace, then under Text file
encoding I selected UTF-8.
2) In the CAS Visual Debugger, I used File -> Code Page and set it to
UTF-8.

Even after these two steps I am unable to display the text; I still
get squares.

When I checked Help -> About Eclipse SDK -> Configuration Details, I found
file.encoding=Cp1252.


Please help me in this regard.





------------------------------------------------------------------------

please visit : http://www.parivaar.org/
