Thanks,

Yes, I want to add support for indexing Chinese documents. I have reviewed
the String class, and I think we should also modify the StringMatch class.
In Chinese documents, words are NOT separated by spaces, so handling
individual double-byte characters is not enough; we MUST implement a
mechanism to detect Chinese words.
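
To make the word-detection point concrete, here is a rough sketch of one
well-known approach, dictionary-based forward maximum matching. It is only
an illustration and not ht://Dig code: the names splitChars and segment,
the dictionary, and the assumption that the text is EUC-CN (GB2312)
encoded are all mine.

```cpp
#include <cstddef>
#include <set>
#include <string>
#include <vector>

// Assumption: text is EUC-CN (GB2312). A byte with the high bit set starts
// a two-byte character; any other byte is a one-byte character.
std::vector<std::string> splitChars(const std::string &text)
{
    std::vector<std::string> chars;
    for (std::size_t i = 0; i < text.size();) {
        unsigned char c = static_cast<unsigned char>(text[i]);
        std::size_t len = (c >= 0x80 && i + 1 < text.size()) ? 2 : 1;
        chars.push_back(text.substr(i, len));
        i += len;
    }
    return chars;
}

// Forward maximum matching: at each position take the longest dictionary
// entry of at most maxLen characters; if nothing matches, emit a single
// character as its own word.
std::vector<std::string> segment(const std::string &text,
                                 const std::set<std::string> &dict,
                                 std::size_t maxLen)
{
    std::vector<std::string> chars = splitChars(text);
    std::vector<std::string> words;
    std::size_t pos = 0;
    while (pos < chars.size()) {
        std::size_t take = 1;
        std::string best = chars[pos];
        std::string candidate;
        for (std::size_t n = 1; n <= maxLen && pos + n <= chars.size(); ++n) {
            candidate += chars[pos + n - 1];
            if (dict.count(candidate)) {
                best = candidate;
                take = n;
            }
        }
        words.push_back(best);
        pos += take;
    }
    return words;
}
```

With a dictionary holding one two-character word, a three-character input
comes back as that word followed by the leftover character. A real
implementation would need a proper word list and some handling of
ambiguous splits, which this sketch ignores.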

------------------------------------------------
Jin Bo
Ph.D. Student of Automatic Control
East China University of Science and Technology
130 Mei Long Road
Shanghai 200237, PRC
------------------------------------------------
Email: [EMAIL PROTECTED], [EMAIL PROTECTED]
------------------------------------------------




Please respond to Geoff Hutchison <[EMAIL PROTECTED]>

To:      [name unreadable]/ITEC
cc:      [EMAIL PROTECTED]
Subject: Re: htdig: Chinese Support








> I'm interested in adding Chinese language support to ht://Dig. If anyone
> has ideas or advice, please contact me.
Great!
One step would be to translate all the HTML files in the installdir/
directory. In fact, I'd like to get a repository of translations of the
ht://Dig files.
The next step is to migrate error messages to GNU gettext. This will
require a little work (going through all of the code and getting
acquainted with gettext) but would be most appreciated. This would set the
stage for translations into any number of languages.
The final step is to add support for indexing Chinese documents (I think
this is what you *really* want to do). First, we'll need to change the
String class in htlib to support double-byte characters. Then we'll need
to change all the code to use double-byte characters by default (since
regular characters will just ignore the second byte).
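
To illustrate the double-byte idea, here is a small sketch that widens a
byte string into one 16-bit unit per character, so that ASCII characters
simply leave the high byte at zero. It assumes EUC-CN (GB2312) input,
where both bytes of a two-byte character fall in the range 0xA1 to 0xFE.
The WideChar type and the widen function are hypothetical names, not part
of the htlib String class.

```cpp
#include <cstddef>
#include <string>
#include <vector>

typedef unsigned short WideChar;  // hypothetical 16-bit character unit

// Widen an EUC-CN byte string into one 16-bit unit per character.
std::vector<WideChar> widen(const std::string &bytes)
{
    std::vector<WideChar> out;
    for (std::size_t i = 0; i < bytes.size(); ++i) {
        unsigned char lead = static_cast<unsigned char>(bytes[i]);
        // A lead byte in 0xA1..0xFE with a byte after it starts a
        // two-byte character.
        bool twoByte = lead >= 0xA1 && lead <= 0xFE && i + 1 < bytes.size();
        if (twoByte) {
            unsigned char trail = static_cast<unsigned char>(bytes[++i]);
            out.push_back(static_cast<WideChar>((lead << 8) | trail));
        } else {
            out.push_back(lead);  // single-byte character: high byte stays zero
        }
    }
    return out;
}
```

After widening, the length of the vector is the number of characters
rather than the number of bytes, which is the property the rest of the
code would rely on. The sketch does not validate the trail byte.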
I think that's about it, but I'm not an expert at supporting
internationalization. ;-)
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.


