Re: [AI] White Paper: OCR Softwares for Indian languages

2008-08-05 Thread saurabh malav
Hi Prashant,

Good work.
It seems that Yogesh is a proprietary font, and if it is not a
Unicode-compliant font then SAFA will not be able to read it, because SAFA
works only with Unicode-compliant fonts.
Is C-DAC still working on the development of Chitrankan?
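
As a rough illustration of the Unicode point above: text saved in a Unicode-compliant Hindi font uses code points from the Devanagari block (U+0900 to U+097F), whereas legacy proprietary fonts typically store the same glyphs at Latin code points, so a screen reader sees only Latin letters. The sketch below is only a heuristic under that assumption; the function names and the 0.5 threshold are hypothetical and are not part of SAFA or Chitrankan.

```python
def devanagari_ratio(text: str) -> float:
    """Return the fraction of non-whitespace characters that lie in the
    Unicode Devanagari block (U+0900 to U+097F)."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    hits = sum(1 for c in chars if 0x0900 <= ord(c) <= 0x097F)
    return hits / len(chars)


def looks_like_unicode_hindi(text: str, threshold: float = 0.5) -> bool:
    """Heuristic only: True when at least `threshold` of the characters
    are Devanagari code points. The threshold is an assumed value."""
    return devanagari_ratio(text) >= threshold


if __name__ == "__main__":
    # Unicode Hindi text: every non-space character is in the Devanagari block.
    print(looks_like_unicode_hindi("नमस्ते दुनिया"))
    # Legacy-font style output: Latin code points that only look like Hindi
    # when displayed in the proprietary font.
    print(looks_like_unicode_hindi("ueLrs nqfu;k"))
```

If the OCR output fails a check like this, it would first need conversion to Unicode before a Unicode-only screen reader could speak it.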

Thanks,
Saurabh

On Mon, Aug 4, 2008 at 1:38 PM, manoj gupta [EMAIL PROTECTED] wrote:

 Dear Prateek,
Freedom Scientific has recently launched JAWS version 9.1, which can
 also read out all the material in Hindi.
regards,
Manoj Gupta
 - Original Message -
 From: prateek aggarwal [EMAIL PROTECTED]
 To: accessindia@accessindia.org.in
  Sent: Monday, August 04, 2008 5:19 PM
 Subject: Re: [AI] White Paper: OCR Softwares for Indian languages


  hey folks,
  has any one of you used chitrankan?
  is it possible for us to read the hindi text in any way?
  actually, there is no speech output available in the software. sadly, i
  could not even find any trick to read the same text with safa.
  has any of you found a way?
  is there any other software available to read hindi scanned
  material?
  regards,
  prateek agarwal.
  cell: 09928341197
  e-mails:
  [EMAIL PROTECTED]
  [EMAIL PROTECTED]
  ---
  you can visit my website for lots of stuff related to visually
  impaired and others.
  please go on to
  www.prateekagarwal.webs.com
 
  Join Access India convention: For updates on it visit:
 http://accessindia.org.in/harish/convention.htm
  Registration is now open!
 
  To unsubscribe send a message to [EMAIL PROTECTED]
 with the subject unsubscribe.
 
  To change your subscription to digest mode or make any other changes,
 please visit the list home page at
 
 http://accessindia.org.in/mailman/listinfo/accessindia_accessindia.org.in
 




Re: [AI] White Paper: OCR Softwares for Indian languages

2008-08-04 Thread prateek aggarwal
hey folks,
has any one of you used chitrankan?
is it possible for us to read the hindi text in any way?
actually, there is no speech output available in the software. sadly, i
could not even find any trick to read the same text with safa.
has any of you found a way?
is there any other software available to read hindi scanned material?
regards,
prateek agarwal.
cell: 09928341197
e-mails:
[EMAIL PROTECTED]
[EMAIL PROTECTED]
---
you can visit my website for lots of stuff related to visually
impaired and others.
please go on to
www.prateekagarwal.webs.com



Re: [AI] White Paper: OCR Softwares for Indian languages

2008-08-04 Thread manoj gupta
Dear Prateek,
Freedom Scientific has recently launched JAWS version 9.1, which can
also read out all the material in Hindi.
regards,
Manoj Gupta





Re: [AI] White Paper: OCR Softwares for Indian languages

2008-08-03 Thread Pranav Lal
Prashant,
FineReader version 9 also has a font training module. Have you tried using
this module to facilitate Indian language optical character recognition?




Re: [AI] White Paper: OCR Softwares for Indian languages

2008-08-03 Thread Kalpana Kharade
I appreciate the efforts of Mr. Prashant and the x.r.c.v.i. team for this
comprehensive presentation. Good luck to the team.
Dr. Kalpana
- Original Message - 
From: Prashant Naik [EMAIL PROTECTED]
To: accessindia@accessindia.org.in
Sent: Sunday, August 03, 2008 1:12 PM
Subject: [AI] White Paper: OCR Softwares for Indian languages


 Dear Access India Members,

 During the Daisy Forum of India meeting held in Mumbai on 11th and 12th
 April 2008, I was given the responsibility of finding information on the
 status of OCR software for Indian languages. Here I present the findings
 of my research. I have prepared a White Paper on the subject, which I
 posted in PDF format on the Daisy Forum of India's mailing list 3 days
 ago. For the benefit and awareness of others, I am pasting its content
 below this message. This will also help those who had posted queries on
 AI regarding this.



 White Paper: OCR Softwares for Indian languages

 Date: July 31st, 2008

 Introduction :

 OCR software is available for English and other foreign languages, but
 what is the status of OCR software availability for Indian languages?

 During the Daisy Forum of India meeting held in Mumbai on 11th and 12th
 April 2008, I was given the responsibility of finding information on
 this. Here I present the findings of my research.



 Definitions :

 OCR: - Optical character recognition, usually abbreviated to OCR, is the
 mechanical or electronic translation of images of handwritten,
 typewritten or printed text (usually captured by a scanner) into
 machine-editable text.

 OCR Software: - OCR software converts paper documents into electronic
 data, so that you can handle the information (electronic text) in your
 computer system.

 Indian Languages: - The Indian Constitution recognizes Hindi in
 Devanāgarī script as the official language of the central government of
 India. The Constitution of India recognizes 22 languages, spoken in
 different parts of the country.

 (Source for all definitions: Wikipedia)



 Findings :

 Research on the web highlighted one workshop / seminar organized by the
 Rediff Centre for Indian Language Content Management: the "Brainstorming
 Workshop on OCR for Indian Languages", held on 16-17 March 2007 at Hotel
 Regalis, Mysore.

 Reference link: http://www.isim.ac.in/RCILCM/index.htm

 Further queries on Access India (a mailing group for the blind) and
 contact with NAB Karnataka for more information on this theme did not
 turn up anything significant.



 Visit by Mr. Venki, rediff.com Technical Head :

 During a meeting with Mr. Venki at the XRCVC in June 2008, some more
 information about the conference was secured, because Mr. Venki himself
 was one of the members of the organizing team from rediff.

 He observed that, overall, the conference was good and that speakers had
 shared new ideas on developing Indian OCR.

 However, on following up further, it seems that no significant progress
 has been made since the conference.



 Chennai Print Access Seminar Findings

 Our XRCVC team member Neha learned about many technological developments
 at the Print Access conference held in Chennai on April 19th, 2008. She
 shared a lot of information, contacts and links, e.g. the Acharya website
 (http://acharya.iitm.ac.in), a TTS translator in 22 languages, Ravi TTS
 for Telugu, C-DAC software such as Mantra, Shruti Drishti and Shrut
 Lekhan, and a very important lead on Indian OCR software developed by
 C-DAC Pune.



 Visit to C-DAC Pune :

 On May 14th and 15th, the XRCVC team visited C-DAC Pune. The visit was
 very fruitful. We found a fully developed, off-the-shelf OCR product for
 Hindi (Devanagari) named CHITRANKAN, developed by the GIST Development
 Team, C-DAC, Pune, Maharashtra. They demonstrated the product, and the
 result was very good. CHITRANKAN is used commercially by 2-3
 organizations in Pune.

 Other C-DAC resources :

 OCR software for Hindi called CHITRANKAN, for Marathi called
 CHITRAKSHARIKA and for Malayalam called NAYANA.



 About NAYANA :

 Source: http://www.malayalamresourcecentre.org/Mrc/products/nayana.html

 NAYANA is a product that enables the user to convert printed Malayalam
 documents to editable computer files. This system is very simple to use
 and requires no prior expertise.

 FEATURES

 - NAYANA processes all types of printed Malayalam documents.
 - Supports TIFF and BMP image formats.
 - Supports document images with a resolution of 300 dpi and above.
 - Detection and correction of document skew of -5° to +5°.
 - The output document can be stored in both ISCII and ISFOC form.
 - The output document can be saved in TXT, RTF, HTML or ACI file formats.
 - User-friendly interface.
 - Recognition speed of 50 characters/sec.
 - Conversion of printed documents to editable text.
 - Optical Character