On Apr 30, 9:45 pm, Rajesh Pandey <[email protected]> wrote: > Hi Falke, > Here is a sample Image. I have more images that are used for testing but > they are copyrighted so I can't send them here in public but I can email > them individually. >
OK, this looks like "standard" devanagari... not much different from Hindi. Hmm... my accuracy was pretty bad (see text below). But I believe the resolution has a lot to do with it. This looks like either 300 or 150 dpi. I would try scanning at 600dpi. Also, it just occurred to me: Even if the fonts are similar, you'd have to create a separate, Nepali dictionary, to use that feature optimally. --------- my results ------------ आदृछे रुत्साश्या षाड़च्चे षाणाहर-अष्ट सक्शच्चा चप्तादृब्ब रु जुम्लद्धआन्न ग्राणा हो श्यच्चे च्चाश्लो बुब्लदृव्रच्चे ड्डणयंणा क्या ख्याग्लाइं झ फ्लाफ्ला छा ल्यश्चाढ नुप्लव्रच्चे क्लान्नाश्ले म्नहाँव्रच्चा ब्लॉ व्रच्चाग्लाई च्चाझ्यादृव्रच्चे छा ख्वा स्थाश्या ०१२- व्रब्लणाझं' क्षट्टक्लक्षट्टदृम्लन्न व्रहानं णादृछंश्ले आज़ धसां छादृब्ल व्रच्चाश्या श्लोह्म णाड़न्ना ह्माह्मरूक्रव्रश्ले छा णश्लो दृह्माव्रदृ व्लाझप्लन्नाई दृरूयुब्वच्चों शुदृच्चादृ व्रच्चानुच्चे ड्डेम्भह्मतुहगृ आख्या व्रच्चाणा ग्रह; ५ म्नसादृव्रच्चा रूज ठशाझसल्शई रुवप्लाप्रा सिध्यम्लनुच्चे ब्लणुव्रअ ब्रच्चाणा च्चिदृ ५ णाइम्लश्या हैनं हो बीते न्नणा, यं' णादृछ क्या ख्याश्लो व्रच्चाह्म श्चाठा आणा दृहेछा ------ end my results -------- Again, this is 3.02 -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

