Re: [AI] White Paper: OCR Softwares for Indian languages
Hi Prashant,

Good work. It seems that Yogesh is a proprietary font, and if it is not a Unicode-compliant font then SAFA will not be able to read it, because SAFA can work only with Unicode-compliant fonts. Is C-DAC still working on the development of Chitrankan?

Thanks,
Saurabh

On Mon, Aug 4, 2008 at 1:38 PM, manoj gupta [EMAIL PROTECTED] wrote:
Dear Prateek, Freedom Scientific has recently launched JAWS version 9.1, which can also read out all the material in Hindi. regards, Manoj Gupta

Join Access India convention: For updates on it visit: http://accessindia.org.in/harish/convention.htm
Registration is now open!
To unsubscribe send a message to [EMAIL PROTECTED] with the subject unsubscribe.
To change your subscription to digest mode or make any other changes, please visit the list home page at http://accessindia.org.in/mailman/listinfo/accessindia_accessindia.org.in
Re: [AI] White Paper: OCR Softwares for Indian languages
hey folks,

has any one of you used chitrankan? is it possible for us to read the hindi text in any way? actually, there is no speech output available in the software. sadly, i could not even find any trick to read the same text with safa. has any of you found a way? is there any other software available to read scanned hindi material?

regards,
prateek agarwal.
cell: 09928341197
e-mails: [EMAIL PROTECTED] [EMAIL PROTECTED]
---
you can visit my website for lots of stuff related to the visually impaired and others. please go to www.prateekagarwal.webs.com
Re: [AI] White Paper: OCR Softwares for Indian languages
Dear Prateek,

Freedom Scientific has recently launched JAWS version 9.1, which can also read out all the material in Hindi.

regards,
Manoj Gupta

- Original Message -
From: prateek aggarwal [EMAIL PROTECTED]
To: accessindia@accessindia.org.in
Sent: Monday, August 04, 2008 5:19 PM
Subject: Re: [AI] White Paper: OCR Softwares for Indian languages

hey folks, has any one of you used chitrankan? is it possible for us to read the hindi text in any way? actually, there is no speech output available in the software. sadly, i could not even find any trick to read the same text with safa. has any of you found a way? is there any other software available to read scanned hindi material? regards, prateek agarwal.
Re: [AI] White Paper: OCR Softwares for Indian languages
Prashant,

FineReader version 9 also has a font training module. Have you tried using this module to facilitate Indian language optical character recognition?
Re: [AI] White Paper: OCR Softwares for Indian languages
I appreciate the efforts of Mr. Prashant and the XRCVC team for this comprehensive presentation. Good luck to the team.

Dr. Kalpana

- Original Message -
From: Prashant Naik [EMAIL PROTECTED]
To: accessindia@accessindia.org.in
Sent: Sunday, August 03, 2008 1:12 PM
Subject: [AI] White Paper: OCR Softwares for Indian languages

Dear Access India Members,

During the Daisy Forum of India meeting held in Mumbai on 11th and 12th April 2008, I was given the responsibility of finding information on the status of OCR software for Indian languages. Here I present the findings of my research. I have prepared a White Paper on it, which I posted in PDF format on the Daisy Forum of India's mailing list 3 days back. For the benefit and awareness of others, I am pasting its content below this message. This will also help those who had posted queries on AI regarding this.

White Paper: OCR Softwares for Indian languages
Date: July 31st, 2008

Introduction :
OCR software is available for English and other foreign languages, but what is the status of OCR software availability for Indian languages? During the Daisy Forum of India meeting held in Mumbai on 11th and 12th April 2008, I was given the responsibility of finding information on this. Here I present the findings of my research.

Definitions :
OCR: - Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic translation of images of handwritten, typewritten or printed text (usually captured by a scanner) into machine-editable text.
OCR Software: - OCR software converts paper documents into electronic data, so that you can handle the information (electronic text) in your computer system.
Indian Languages: - The Indian Constitution recognizes Hindi in Devanāgarī script as the official language of the central government of India. The Constitution of India recognizes 22 languages, spoken in different parts of the country.
(Source for all definitions: Wikipedia)

Findings :
Research on the web highlighted one workshop / seminar organized by the Rediff Centre for Indian Language Content Management on the theme "Brainstorming Workshop on OCR for Indian Languages", held on 16-17 March 2007 at Hotel Regalis, Mysore.
Reference link: http://www.isim.ac.in/RCILCM/index.htm
Further queries on Access India (mailing group for the blind) and contact with NAB Karnataka to get more information on this theme did not turn up anything significant.

Visit by Mr. Venki, rediff.com Technical Head :
During a meeting with Mr. Venki at the XRCVC in June 2008, some more information about the conference was secured, because Mr. Venki himself was one of the members of the organizing team from rediff. He made the following observations. Overall the conference was good. Speakers shared new ideas on developing Indian OCR. However, on following up further with regard to this conference, it seems no significant progress has been made since.

Chennai Print Access Seminar Findings
Our XRCVC team member Neha learned about many technological developments at the Print Access conference held in Chennai on April 19th, 2008. She shared a lot of information, contacts and links, e.g. Acharya website (http://acharya.iitm.ac.in), TTS translator in 22 languages, Ravi TTS for Telugu, C-DAC software such as Mantra, Shruti Drishti and Shrut Lekhan, and a very important lead on Indian OCR software developed by C-DAC Pune.

Visit to C-DAC Pune :
On May 14th and 15th, the XRCVC team visited C-DAC Pune. The visit was very fruitful.
CHITRANKAN is a fully developed, off-the-shelf software product for Hindi (Devanagari), developed by the GIST Development Team, C-DAC, Pune, Maharashtra. They demonstrated the product. The result was very good. CHITRANKAN is commercially used by 2-3 organizations in Pune.

Other C-DAC resources :
OCR software in Hindi called CHITRANKAN, in Marathi called CHITRAKSHARIKA and in Malayalam called NAYANA.

About NAYANA :
Source: http://www.malayalamresourcecentre.org/Mrc/products/nayana.html
NAYANA is a product that enables the user to convert printed Malayalam documents to editable computer files. This system is very simple to use and requires no prior expertise.

FEATURES
- NAYANA processes all types of printed Malayalam documents.
- Supports TIFF and BMP image formats.
- Supports document images with resolution 300 dpi and above.
- Detection and correction of document skew of -5° to +5°.
- The output document can be stored in both ISCII and ISFOC form.
- The output document can be saved as TXT, RTF, HTML or ACI file formats.
- User friendly interface.
- Recognition speed of 50 char/sec.
- Conversion of printed documents to editable text.
- Optical Character