Assalamu Alaykum Saleem and Boumediene,

It's great to hear from both Qatar and the United Arab Emirates University. Thank you both for your related emails.

I am afraid that there is no software download for the Quranic Arabic Corpus at present, and there is also no treebank download just yet. However, interested researchers do already download and use the tagged morphological data (in XML format) from the website's download page: http://corpus.quran.com/download. You can download a data file which contains, for each word in the Quran, the Arabic text, interlinear word-by-word English translation, part-of-speech tagging, and morphological features.
With regard to the treebank, this is still being annotated and so is not yet fully available for download. Once the entire Quran has been annotated syntactically using dependency graphs, we do hope to make this data available off-line. I would expect this to be the case later in the year.

As for a standalone software download, I am afraid that this is not currently one of the project's main aims. There could one day be a software-related download for the Quranic Arabic Corpus, but right now the focus is online annotation - i.e. concentrating on the existing tasks of highly accurate morphological tagging, followed by full coverage of syntactic analysis (dependency graphs) for the entire Quran. You may also find these questions (and others) covered online in the project's FAQ section: http://corpus.quran.com/faq.jsp

I am afraid that for the time being, the only way to access the treebank and the graphical display of Quranic morphology is online. However, many people have got in touch asking about downloadable software, so there is obviously a demand for desktop software for this new annotated Quranic data. I can imagine that this may happen sometime in the future, although possibly as a separate project that builds on the data being produced through collaborative online annotation.

Boumediene, I would be quite interested to know more about your opinion on how Arabic grammar is currently being represented using dependency graphs. To the best of my knowledge, this is the first time dependency graphs have been applied to the Quran on this scale, and also the first time that full traditional Arabic grammar and syntax (i'rab) is being applied to construct a machine-readable Arabic treebank. Because of this, I am keen to collect feedback on how you feel the current approach could be improved.
Kind Regards,

--
Kais Dukes
Language Research Group
School of Computing
University of Leeds

http://corpus.quran.com - The Quranic Arabic Corpus
[email protected] - Computational Quranic Arabic discussion list

> -------------------------------------------
> First message:
> From: SALEEM Kalathil[SMTP:[email protected]]
> Sent: Tuesday, January 19, 2010 8:12:58 AM
>
> Assalaamualaikum,
>
> Hope this finds you all well and fine by the mercy of allah.
>
> Take this opportunity to appreciate you for your website works on Quran., may
> allah bless you in this world and hereaftet to the way to jannah.
>
> I would like to get the software of quran word by word for learning purpose.
> . Please advise the charges for the software cds and courier charges which i
> can sent you either by any online or by westernunion transfers.
>
> Kind urgent replies will be appreciated.
>
> Thanks and regards,
>
> Saleem Kalathil
> P.O. Box 12309, Doha,Qatar.
> Salata Jadeed, Mamoora-Qatar.
> Tel: +974 5417823
> -------------------------------------------
> Second message:
> From: Boumediene Belkhouche[SMTP:[email protected]]
> Sent: Tuesday, January 19, 2010 5:24:28 AM
>
> AA,
> Dear Kais,
>
> I am impressed by the quality and extent of your work on the Qur'an. One
> aspect that I am interested is your approach to representing the grammar. I
> would like to request from you a copy of the entire syntactic treebank. I
> know it is on the website, but browsing it one page at a time is not
> convenient for me.
>
> Jazakum Allahu khayran.
>
> wassalam,
> Boumediene
