Hello Kais

1) I support creating pritable version
2) I suggest arranging verses in the natural Arabic order (i.e., from right to 
left, with each line containing say 3/4 words) rather than the current vertical 
order
3) It is likely that people print black and white, and hence all these colour 
codings loose their significance, try thinking alternative style for printable 
version (like e.g, bold face, italic, overline, underline, parenthesis, etc.)
4) Agree with Eric in need to compact presentation through abbreviated tags, 
and keep an index page for reference
5) Division by Juz should be ok with 30 parts
6) a disclaimer should be included about the accuracy and that it is still 
ongoing validation.
7) along with english word-by-word meaning, it would be better to include one 
full translation (e.g. Sahih intl.) for each verse.


best regards,

Abdul-Baquee M. Sharaf
PhD Student
Language Technologies Group
School of Computing
University of Leeds
UK
________________________________________
From: comp-quran-requ...@comp.leeds.ac.uk [comp-quran-requ...@comp.leeds.ac.uk] 
On Behalf Of Kais Dukes [dukes.k...@googlemail.com]
Sent: 06 February 2010 10:30
To: comp-quran@comp.leeds.ac.uk
Subject: Advice Required - Printed Version of the Quranic Arabic Corpus for     
distribution

Hello comp-quran members,

I am writing to you all, to get some advice on a printed version of
the Quranic Arabic Corpus. We have been receiving a lot of requests
lately for something that users can download for offline use. Because
I beleive that the accuracy of the grammar is getting quite
reasonable, I am now considering this. In any case, we can always
update what we produce as the grammar improves.

What I had in mind, was a set of PDF files that could be downloaded.
For example, perhaps 30 PDFs (1 per juz of the Quran, see:
http://en.wikipedia.org/wiki/Juz'). My question is - I would be keen
to find out from members of the mailing list if they think that this
is a good idea? And if so, what would the best format be? I was
thinking of starting with the word-by-word grammar (not the syntactic
treebank). Perhaps starting with the information here:

http://corpus.quran.com/wordbyword.jsp

The problem is that if we displayed those images at the same
resolution, with the same information, that comes out to about 7
Quranic words per printed A4 page. Given that there are 77,430 Arabic
words in the Quran (according to our counting of whitespace) that
would give 11,061 pages in total - or 368 pages per juz (i.e. 368
pages for each of the 30 PDF files). That doesn't sound very
reasonable to me. If we shrink the images and text by 50% that would
give about 5,530 pages in total. Do you think that perhaps we should
just display part-of-speech tags to save space? Or perhaps Quranic
researchers and students perfer the whole grammar written out in
textual form?

I'm open to any suggestions on how to best display this information in
printed form for a set of PDFs. Any suggestions are more than welcome.
Feel free to reply directly to the mailing list, just hit "reply all":

comp-quran@comp.leeds.ac.uk

Looking forward to hearing from you.

Kind Regards,

-- Kais Dukes

Language Research Group
School of Computing
University of Leeds

http://corpus.quran.com - The Quranic Arabic Corpus
comp-quran@comp.leeds.ac.uk - Computional Quranic Arabic discussion list



Reply via email to