Hello Ossama, Thank you for your response. When you said "seperate the files into seperate ayas", did you mean "seperate suras"? That is already done. The XML file is seperated into suras here: http://cvs.arabeyes.org/viewcvs/projects/quran/data/ar/text/ So I'm that the XML data in the above folder is the most recently updated data. I have two questions/suggestions about this data:
1) Can we get rid of the <searchtext> element in the Quran XML data and instead use a smarter search algortihm that removes special characters and diacritics before searching the Quran text? I think that maintaing Arabic text data that has the special characters and diacritics manually removed is more error-prone than using a smarter search algorithm. By the way, this algorithm doesn't really have to be that smart, it could simply remove the characters that should not be searched from both the Quran text and the search keyword and match them against each other. Of course even smarter algorithms that provide grammatical context aware searching would be nicer. 2) What are the copyright conditions on this Arabic Quran XML data? The reason I am asking is that I personally would like to contribute to the fixing of this Arabic Quran XML data, and I also know a few other friends who would like to contribute as well, but we want to make sure that the copyright will not restrict anyone to download this data for free without restrictions and make additional changes to it, similar to a typical open-source license. Why is this important? Because not all Quran manuscripts are exactly the same and they have slight differences in the orthography (spelling) of certain words and the option should be given to whomever downloads these files to change the spelling of such words to whichever style they prefer and use it as such. Thanks, Mete --- Ossama Khayat <[EMAIL PROTECTED]> wrote: > Hello, > As far as I remember, I asked Mohammad Yousef to > separate the files into separate Ayas so we could > help > in adding the special characters for the Quran. > Since then, we haven't heard any news from him. > > regards, > Ossama Khayat > > --- Mete Kural <[EMAIL PROTECTED]> wrote: > > Hello again, > > Just wanted to ask again if anyone knows about the > > Arabic Quran XML files status as I have asked > below. > > Thank you, > > Mete > > > > --- Mete Kural <[EMAIL PROTECTED]> wrote: > > > Salaamun Aleykum, > > > > > > I noticed that there is a new folder in the > Quran > > > CVS > > > for Arabic Quran's XML data: > > > > > > http://cvs.arabeyes.org/viewcvs/projects/quran/data/ar/text/ > > > > > > This is different than the Arabic data found in > > > here: > > > > > > http://cvs.arabeyes.org/viewcvs/projects/quran/libquran/data/xml/ > > > > > > In the README file for the Quran project, it > says: > > > "Data files (texts, audio, tafsirs) are > > distributed > > > separately. See > > > http://www.arabeyes.org/projects/quran" > > > > > > So what is the status of the Arabic Quran XML > > data? > > > Which folder contains the most recent data? And > > what > > > are the new copyright conditions on the data? > > > > > > Thanks, > > > Mete > > > > > > > > > > > > > > > > > > _______________________________________________ > > > General mailing list > > > [EMAIL PROTECTED] > > > > http://lists.arabeyes.org/mailman/listinfo/general > > > > _______________________________________________ > > General mailing list > > [EMAIL PROTECTED] > > http://lists.arabeyes.org/mailman/listinfo/general > > > __________________________________ > Do you Yahoo!? > The New Yahoo! Shopping - with improved product > search > http://shopping.yahoo.com > _______________________________________________ > General mailing list > [EMAIL PROTECTED] > http://lists.arabeyes.org/mailman/listinfo/general _______________________________________________ General mailing list [EMAIL PROTECTED] http://lists.arabeyes.org/mailman/listinfo/general

