Sir,
When I try to extract the existing eng.traineddata, the following error appears:
$ combine_tessdata -u tesseract-ocr/tessdata/eng.traineddata /home/temp/eng.
Extracting tessdata components from tesseract-ocr/tessdata/eng.traineddata
Error openning /home/temp/eng.unicharset
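
One possible cause, offered as an assumption rather than a diagnosis: combine_tessdata -u writes each extracted component to a path that begins with the given prefix, so the directory part of that prefix (/home/temp here) has to exist and be writable. A small sketch for checking this before running the command; check_output_prefix is a hypothetical helper name, not part of Tesseract.

```python
import os


def check_output_prefix(prefix):
    """Return a list of problems with the directory part of an output prefix.

    An empty list means the directory exists and is writable.
    """
    directory = os.path.dirname(prefix) or "."
    problems = []
    if not os.path.isdir(directory):
        problems.append("directory does not exist: " + directory)
    elif not os.access(directory, os.W_OK):
        problems.append("directory is not writable: " + directory)
    return problems


if __name__ == "__main__":
    # The prefix used in the failing command above.
    for problem in check_output_prefix("/home/temp/eng."):
        print(problem)
```

If the check reports a problem, creating the directory (or choosing a prefix under the home directory) and rerunning the command would be the next thing to try.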




On Sat, Jun 1, 2013 at 11:35 AM, mamata nayak <mamata2...@gmail.com> wrote:

> Sir,
> Please help me.
> The character set of my language consists of about 500 characters.
> I have divided these into subsets, i.e. about 10 .tif files, generated a
> box file for each, and edited those separately using the Qt editor. I then
> use the following command:
>
> $ cat >> LohitOriya.tr C.e0.tr
>
> to append each .tr file to the previously generated LohitOriya.tr
> file, and
>
> $ unicharset_extractor A.3.box B.e0.box C.e0.box
>
> to generate the unicharset file.
>
> Please respond as early as possible.
>
> Eagerly waiting
> $unicharset_extractor
>
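
Because the roughly 500 characters are spread over about ten box files, it may help to confirm that every character appears in at least one box file before running unicharset_extractor. A minimal sketch, assuming the Tesseract 3.x box layout of `<symbol> <left> <bottom> <right> <top> <page>` separated by single spaces; both function names are made up for illustration.

```python
def symbols_in_box_lines(lines):
    """Collect the distinct symbols named in Tesseract box-file lines.

    Each line is expected to look like:
    <symbol> <left> <bottom> <right> <top> <page>
    Lines that do not have six fields are skipped.
    """
    symbols = set()
    for line in lines:
        fields = line.rstrip("\n").rsplit(" ", 5)
        if len(fields) == 6 and fields[0]:
            symbols.add(fields[0])
    return symbols


def symbols_in_box_files(paths):
    """Union of the symbols found across several UTF-8 box files."""
    found = set()
    for path in paths:
        with open(path, encoding="utf-8") as handle:
            found |= symbols_in_box_lines(handle)
    return found
```

Calling `symbols_in_box_files(["A.3.box", "B.e0.box", "C.e0.box"])` and comparing the size of the result with the expected character count would show whether any character is missing from the training data.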
>
> On Tue, May 21, 2013 at 3:38 PM, Shree Devi Kumar <shreesh...@gmail.com> wrote:
>
>> Mamata,
>> Please see https://code.google.com/p/tesseract-ocr/downloads/list for
>> the available language data files for Tesseract 3.02. If Odia is
>> similar to Bengali, you can use the Bengali traineddata to bootstrap
>> Odia.
>>
>> Shree
>>
>> Shree Devi Kumar
>> ____________________________________________________________
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
>>
>> On Tue, May 21, 2013 at 2:26 PM, mamata nayak <mamata2...@gmail.com> wrote:
>>
>>> Sir,
>>> Can you please tell me the current list of Indian languages for which
>>> the tesseract-ocr engine has been trained?
>>>
>>> Thank you
>>>
>>>
>>> On Sun, May 12, 2013 at 12:23 PM, Shree Devi Kumar <shreesh...@gmail.com> wrote:
>>>
>>>> Are you training the Odia language?
>>>>
>>>> Have you seen
>>>> http://tdil-dc.in/tdildcMain/articles/374232Odia%20Script%20Grammar_Ver1.0.pdf
>>>> ?
>>>>
>>>>
>>>> Shree Devi Kumar
>>>> ____________________________________________________________
>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>>
>>>>
>>>> On Sat, May 11, 2013 at 9:01 PM, mamata nayak <mamata2...@gmail.com> wrote:
>>>>
>>>>> Thank you sir.
>>>>> I was able to recognize a set of characters of my language.
>>>>> However, a single character among them, ଫୀ, is recognized as a
>>>>> different character pair at each of the 5 places it occurs in the
>>>>> training image: କ୍ଷୀଛୀ, ନୀନୀ, ଯୀଛୀ, ପୀଛୀ, ବୀନୀ.
>>>>> I then used a unicharambigs file with the following contents:
>>>>> v1
>>>>> 2    କ୍ଷୀ ଛୀ    1    ଫୀ    1
>>>>> 2    ନୀ ନୀ    1    ଫୀ    1
>>>>> 2    ଯୀ ଛୀ    1    ଫୀ    1
>>>>> 2    ପୀ ଛୀ    1    ଫୀ    1
>>>>> 2    ବୀ ନୀ    1    ଫୀ    1
>>>>> But the problem is that when these pairs of characters are recognized,
>>>>> they are replaced with ଫୀ.
>>>>> Please look into my problem and give a suggestion.
>>>>> Thanking you
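
For reference when reading the file above: assuming the layout described in the Tesseract 3 training wiki, each unicharambigs line has five tab-separated fields (source count, source symbols, target count, target symbols, type), where type 1 marks a mandatory replacement and 0 an optional one. With type 1 on every line, replacing each listed pair with ଫୀ wherever it occurs would be the expected behaviour. A minimal parsing sketch under that assumption; parse_unicharambigs is a hypothetical helper, not part of Tesseract.

```python
def parse_unicharambigs(text):
    """Parse v1 unicharambigs text into a list of rule dictionaries.

    Assumes tab-separated fields: source count, source symbols
    (space-separated), target count, target symbols, type (1 or 0).
    """
    lines = [line for line in text.splitlines() if line.strip()]
    if not lines or lines[0].strip() != "v1":
        raise ValueError("expected a 'v1' header line")
    rules = []
    for line in lines[1:]:
        src_count, src, dst_count, dst, kind = line.split("\t")
        source = src.split(" ")
        target = dst.split(" ")
        if len(source) != int(src_count) or len(target) != int(dst_count):
            raise ValueError("count does not match symbols: " + line)
        rules.append({
            "source": source,
            "target": target,
            "mandatory": kind.strip() == "1",
        })
    return rules
```

Listing the rules whose "mandatory" flag is set makes it easy to see which substitutions are forced; changing the last field of a line to 0 is one thing that could be tried if a pair should only be treated as a possible alternative.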
>>>>>
>>>>>
>>>>> On Wed, May 8, 2013 at 5:47 PM, Quan Nguyen <nguyen...@gmail.com> wrote:
>>>>>
>>>>>> You would need to run the tesseract command to generate the box file
>>>>>> for your image, e.g.:
>>>>>>
>>>>>> tesseract eng.timesitalic.exp0.tif eng.timesitalic.exp0 batch.nochop makebox
>>>>>>
>>>>>>
>>>>>> Check the Tesseract Training Wiki for more details.
>>>>>>
>>>>>> http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3
>>>>>>
>>>>>> Once you have the TIFF/Box pair, you can open it in jTessBoxEditor.
>>>>>>
>>>>>>
>>>>>>
>>>>>> On Wednesday, May 8, 2013 12:29:43 AM UTC-5, mama wrote:
>>>>>>
>>>>>>> Good Morning Sir,
>>>>>>> Thanks for your reply.
>>>>>>> Now my problem is that for some sets of characters of my language,
>>>>>>> jTessBoxEditor can open the corresponding tif file and generate its
>>>>>>> box file, but for others it cannot generate the box coordinates.
>>>>>>> Please see the file I have attached, sir.
>>>>>>>
>>>>>>>
>>>>>>> On Sat, May 4, 2013 at 7:38 PM, Quan Nguyen <nguy...@gmail.com> wrote:
>>>>>>>
>>>>>>>> What Ubuntu and Java versions are installed on your machine? You
>>>>>>>> probably have a headless Java, i.e., one without graphics libraries.
>>>>>>>> Can you use Oracle Java 7, which is the version I tested with? Thanks.
>>>>>>>>
>>>>>>>>
>>>>>>>> http://askubuntu.com/questions/55848/how-do-i-install-oracle-java-jdk-7
>>>>>>>>
>>>>>>>> On Saturday, May 4, 2013 8:10:33 AM UTC-5, mama wrote:
>>>>>>>>
>>>>>>>>> Sir,
>>>>>>>>> After giving this command at the command prompt, the output is as
>>>>>>>>> follows:
>>>>>>>>> java -Xms128m -Xmx512m -jar jTessBoxEditor.jar
>>>>>>>>> 4 May, 2013 6:21:23 PM java.util.prefs.FileSystemPreferences$2 run
>>>>>>>>> INFO: Created user preferences directory.
>>>>>>>>> Exception in thread "AWT-EventQueue-0" java.awt.HeadlessException
>>>>>>>>>     at java.awt.GraphicsEnvironment.checkHeadless(GraphicsEnvironment.java:173)
>>>>>>>>>     at java.awt.Window.<init>(Window.java:546)
>>>>>>>>>     at java.awt.Frame.<init>(Frame.java:419)
>>>>>>>>>     at java.awt.Frame.<init>(Frame.java:384)
>>>>>>>>>     at javax.swing.JFrame.<init>(JFrame.java:174)
>>>>>>>>>     at net.sourceforge.tessboxeditor.Gui.<init>(Unknown Source)
>>>>>>>>>     at net.sourceforge.tessboxeditor.GuiWithMRU.<init>(Unknown Source)
>>>>>>>>>     at net.sourceforge.tessboxeditor.GuiWithEdit.<init>(Unknown Source)
>>>>>>>>>     at net.sourceforge.tessboxeditor.GuiWithSpinner.<init>(Unknown Source)
>>>>>>>>>     at net.sourceforge.tessboxeditor.GuiWithFont.<init>(Unknown Source)
>>>>>>>>>     at net.sourceforge.tessboxeditor.GuiWithLaF.<init>(Unknown Source)
>>>>>>>>>     at net.sourceforge.tessboxeditor.GuiWithTools.<init>(Unknown Source)
>>>>>>>>>     at net.sourceforge.tessboxeditor.GuiWithTools$2.run(Unknown Source)
>>>>>>>>>     at java.awt.event.InvocationEvent.dispatch(InvocationEvent.java:226)
>>>>>>>>>     at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:673)
>>>>>>>>>     at java.awt.EventQueue.access$300(EventQueue.java:96)
>>>>>>>>>     at java.awt.EventQueue$2.run(EventQueue.java:634)
>>>>>>>>>     at java.awt.EventQueue$2.run(EventQueue.java:632)
>>>>>>>>>     at java.security.AccessController.doPrivileged(Native Method)
>>>>>>>>>     at java.security.AccessControlContext$1.doIntersectionPrivilege(AccessControlContext.java:105)
>>>>>>>>>     at java.awt.EventQueue.dispatchEvent(EventQueue.java:643)
>>>>>>>>>     at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:275)
>>>>>>>>>     at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:200)
>>>>>>>>>     at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:190)
>>>>>>>>>     at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:185)
>>>>>>>>>     at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:177)
>>>>>>>>>     at java.awt.EventDispatchThread.run(EventDispatchThread.java:138)
>>>>>>>>>
>>>>>>>>> However, I could not figure out how to open the window.
>>>>>>>>>
>>>>>>>>> Please reply to me.
>>>>>>>>> Thank you
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On Wed, May 1, 2013 at 3:32 AM, Quan Nguyen <nguy...@gmail.com> wrote:
>>>>>>>>>
>>>>>>>>>> Version 0.9 Release:
>>>>>>>>>>
>>>>>>>>>> - Enhance Generate TIFF/Box functionality to allow for combining
>>>>>>>>>> prepending symbols in addition to appending
>>>>>>>>>> - Fix a bug that failed to persist changes to table in edit mode
>>>>>>>>>> - Find function now supports partial matches
>>>>>>>>>> - Fix a problem with table not scrolling along when row header
>>>>>>>>>> has focus and scrolling
>>>>>>>>>>
>>>>>>>>>> http://sourceforge.net/projects/vietocr/files/jTessBoxEditor/
>>>>>>>>>>
>>>>>>>>>> --
>>>>>>>>>> --
>>>>>>>>>> You received this message because you are subscribed to the Google
>>>>>>>>>> Groups "tesseract-ocr" group.
>>>>>>>>>> To post to this group, send email to tesser...@googlegroups.com
>>>>>>>>>>
>>>>>>>>>> To unsubscribe from this group, send email to
>>>>>>>>>> tesseract-oc...@googlegroups.com
>>>>>>>>>>
>>>>>>>>>> For more options, visit this group at
>>>>>>>>>> http://groups.google.com/group/tesseract-ocr?hl=en
>>>>>>>>>>
>>>>>>>>>> ---
>>>>>>>>>> You received this message because you are subscribed to a topic
>>>>>>>>>> in the Google Groups "tesseract-ocr" group.
>>>>>>>>>> To unsubscribe from this topic, visit
>>>>>>>>>> https://groups.google.com/d/topic/tesseract-ocr/QQ8wC59YKUI/unsubscribe?hl=en
>>>>>>>>>> .
>>>>>>>>>> To unsubscribe from this group and all its topics, send an email
>>>>>>>>>> to tesseract-oc...@googlegroups.com.
>>>>>>>>>>
>>>>>>>>>> For more options, visit https://groups.google.com/groups/opt_out.
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>
>>>>>>>
>>>>>
>>>>
>>>
>>
>
>


