Hello Kais 1) I support creating pritable version 2) I suggest arranging verses in the natural Arabic order (i.e., from right to left, with each line containing say 3/4 words) rather than the current vertical order 3) It is likely that people print black and white, and hence all these colour codings loose their significance, try thinking alternative style for printable version (like e.g, bold face, italic, overline, underline, parenthesis, etc.) 4) Agree with Eric in need to compact presentation through abbreviated tags, and keep an index page for reference 5) Division by Juz should be ok with 30 parts 6) a disclaimer should be included about the accuracy and that it is still ongoing validation. 7) along with english word-by-word meaning, it would be better to include one full translation (e.g. Sahih intl.) for each verse.
best regards, Abdul-Baquee M. Sharaf PhD Student Language Technologies Group School of Computing University of Leeds UK ________________________________________ From: comp-quran-requ...@comp.leeds.ac.uk [comp-quran-requ...@comp.leeds.ac.uk] On Behalf Of Kais Dukes [dukes.k...@googlemail.com] Sent: 06 February 2010 10:30 To: comp-quran@comp.leeds.ac.uk Subject: Advice Required - Printed Version of the Quranic Arabic Corpus for distribution Hello comp-quran members, I am writing to you all, to get some advice on a printed version of the Quranic Arabic Corpus. We have been receiving a lot of requests lately for something that users can download for offline use. Because I beleive that the accuracy of the grammar is getting quite reasonable, I am now considering this. In any case, we can always update what we produce as the grammar improves. What I had in mind, was a set of PDF files that could be downloaded. For example, perhaps 30 PDFs (1 per juz of the Quran, see: http://en.wikipedia.org/wiki/Juz'). My question is - I would be keen to find out from members of the mailing list if they think that this is a good idea? And if so, what would the best format be? I was thinking of starting with the word-by-word grammar (not the syntactic treebank). Perhaps starting with the information here: http://corpus.quran.com/wordbyword.jsp The problem is that if we displayed those images at the same resolution, with the same information, that comes out to about 7 Quranic words per printed A4 page. Given that there are 77,430 Arabic words in the Quran (according to our counting of whitespace) that would give 11,061 pages in total - or 368 pages per juz (i.e. 368 pages for each of the 30 PDF files). That doesn't sound very reasonable to me. If we shrink the images and text by 50% that would give about 5,530 pages in total. Do you think that perhaps we should just display part-of-speech tags to save space? Or perhaps Quranic researchers and students perfer the whole grammar written out in textual form? I'm open to any suggestions on how to best display this information in printed form for a set of PDFs. Any suggestions are more than welcome. Feel free to reply directly to the mailing list, just hit "reply all": comp-quran@comp.leeds.ac.uk Looking forward to hearing from you. Kind Regards, -- Kais Dukes Language Research Group School of Computing University of Leeds http://corpus.quran.com - The Quranic Arabic Corpus comp-quran@comp.leeds.ac.uk - Computional Quranic Arabic discussion list