My Excel macro isnt extremely fast, but it
works. Once your text has been converted into numeric strings, sorting itself is
fast. I sorted a +200,000 word dictionary in a language that requires sorting
by syllables (Tibetan). It had to be done in several steps, due to Excel
spreadsheet limitations, but the result was flawless.
Peter
From: Said Marjan
Zazai [mailto:[EMAIL PROTECTED]
Sent: Monday, February 23, 2004
4:07 PM
To: [EMAIL PROTECTED]; [EMAIL PROTECTED]
Cc: ABDUL MAJEED
Subject: Re: [paktype] Sorting
Problem with Pashto
Thanks Paul and Peter
I'll forward you the correct sorting order for Pashto ASAP.
But Peter, wont it take too much of a computer memory for
the string conversion for 24,000 words dictionary?
- Original Message -
From: Linguasoft
To: [EMAIL PROTECTED]
; [EMAIL PROTECTED]
Sent: Monday, February
23, 2004 7:17 PM
Subject: RE: [paktype]
Sorting Problem with Pashto
Dear Said Marjan Zazai,
As a workaround for automatic sorting in Office, try
assigning a two- or three-digit numeric code to each Unicode used in Pashto,
then convert all your Pashto strings in numeric strings and sort them. A bit
clumsy, but it works! (We do the same for other languages for which automatic
sorting isnt yet supported.)
BTW, please also send me the correct sorting for Pashto and
if you want, I send you a simple Excel macro in return that will do the sorting
for your dictionary.
Best regards,
Peter E. Hauer
Linguasoft
Vienna, Austria
From: Said Marjan
Zazai [mailto:[EMAIL PROTECTED]
Sent: Monday, February 23, 2004
2:27 PM
To: [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]
Subject: [paktype] Sorting Problem
with Pashto
I am working on a Dictionary project for Pashto language.
Was trying to enter English into Pashto dictionary into Microsoft
Access and while sorting the Pashto column on ascending order, I
came to across a bug (thats what i'll call it).
The image below is a screen short of Access Database sorted
on the right column which is Pashto Meaning of the English word on the left
side. As you can see that in Pashto the letter TTeh (Teh with circle) comes
after Teh and before wow but in this list TTeh, TZEEM, and TSEH comes after wow
whichcomes at the end almost.
Could anyone tell me whats the problem here and how can we
fix it?
Thank you
Said Marjan Zazai,
Afghan Tech.
Kabul, Afghanistan.
Homepage:
www.paktype.org
Homepage:
www.paktype.org
Homepage:
www.paktype.org
Yahoo! Groups Links
To visit your group
on the web, go to:
http://groups.yahoo.com/group/paktype/
To unsubscribe from
this group, send an email to:
[EMAIL PROTECTED]
Your use of Yahoo!
Groups is subject to the Yahoo!
Terms of Service.
image001.gif___
PersianComputing mailing list
[EMAIL PROTECTED]
http://lists.sharif.edu/mailman/listinfo/persiancomputing