Re: Persian letters various code pages

2005-11-21 Thread Jon D.

--- <[EMAIL PROTECTED]> wrote:


> I need to know
> the various code pages 
> for both Persian and Arabic letters based on UTF8
> standard and UNICODE.
> 
> Could you please help me in this regard to let me
> have the various codes?


Character maps (.txt and .pdf):
http://students.cs.byu.edu/~jonsafar/orthography.txt
or
http://students.cs.byu.edu/~jonsafar/persian_charmaps.pdf


To romanize Persian texts:
download this: 
http://students.cs.byu.edu/~jonsafar/perstem.pl
then type:
perl perstem.pl --nostem --input utf8 < myinput.txt >
myoutput.txt

Hope this helps,
-Jon D.





__ 
Yahoo! FareChase: Search multiple travel sites in one click.
http://farechase.yahoo.com
___
PersianComputing mailing list
PersianComputing@lists.sharif.edu
http://lists.sharif.edu/mailman/listinfo/persiancomputing


Persian letters various code pages

2005-11-20 Thread Masood Ghayoomi

Dear Members,

I have a question about various code pages for Persian letters. As you might 
know, there are some difficulties to work with the Persian texts because of 
various code pages for the letters. The codes are mixed with Arabic ones and 
it has made it so difficult to make a search on a corpus that is made from 
on-line texts and even counting words and sorting them in a database.
Right now we have such a difficulty. I need to know the various code pages 
for both Persian and Arabic letters based on UTF8 standard and UNICODE.


Could you please help me in this regard to let me have the various codes?

Many thanks in advance for your help,
Regards,
Masood.

_
Don't just search. Find. Check out the new MSN Search! 
http://search.msn.com/


___
PersianComputing mailing list
PersianComputing@lists.sharif.edu
http://lists.sharif.edu/mailman/listinfo/persiancomputing