I believe ImageMagick can do it, perhaps 'mogrify' with the '-transparent' option combined with something else. Basically, you need to give tesseract a binary image that contains only the text you want recognized. Tesseract cannot do that selection internally.

--Sven
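The preprocessing idea above (hand tesseract a binary image holding only the highlighted text) can be sketched in Python. This is only an illustration under stated assumptions, not an ImageMagick recipe: the function names `is_close`, `isolate_highlighted_text` and `to_pbm` are hypothetical, the highlight is assumed to be one solid colour, and the image is a plain nested list of RGB tuples so the example stays self-contained. A real screenshot would first have to be loaded with an imaging library of your choice.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]
Image = List[List[Pixel]]


def is_close(pixel: Pixel, target: Pixel, tolerance: int) -> bool:
    """True if every RGB channel is within `tolerance` of the target colour."""
    return all(abs(p - t) <= tolerance for p, t in zip(pixel, target))


def isolate_highlighted_text(image: Image, highlight: Pixel,
                             tolerance: int = 40) -> List[List[int]]:
    """Return a binary image (1 = black text, 0 = white background).

    Assumption: on each scanline, everything between the leftmost and
    rightmost highlight-coloured pixel that is NOT highlight-coloured
    is text. Pixels outside the highlight are dropped (set to white).
    """
    height = len(image)
    width = len(image[0]) if height else 0
    binary = [[0] * width for _ in range(height)]
    for y, row in enumerate(image):
        cols = [x for x, px in enumerate(row)
                if is_close(px, highlight, tolerance)]
        if not cols:
            continue  # no highlight on this scanline
        for x in range(min(cols), max(cols) + 1):
            if not is_close(row[x], highlight, tolerance):
                binary[y][x] = 1
    return binary


def to_pbm(binary: List[List[int]]) -> str:
    """Serialise the binary image as plain PBM (P1), where 1 means black."""
    height = len(binary)
    width = len(binary[0]) if height else 0
    lines = ["P1", f"{width} {height}"]
    lines += [" ".join(str(v) for v in row) for row in binary]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    W, K = (255, 255, 255), (0, 0, 0)
    B = (51, 153, 255)  # assumed highlight colour; sample it from your screenshot
    demo = [
        [W, K, W, K, W],  # normal line: dark text on white, not highlighted
        [B, B, W, B, B],  # highlighted line: light text on the blue highlight
    ]
    print(to_pbm(isolate_highlighted_text(demo, B)))
```

The per-scanline span is a deliberate simplification. Anti-aliased text edges or a gradient highlight would need a larger tolerance or a proper colour-distance measure. The resulting binary image would then be saved in a format your tesseract build accepts and passed to it as usual.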
On Wed, Mar 20, 2013 at 10:20 AM, Waldemar Pross <[email protected]> wrote:

> Can you see the example image in my original post? The "Id eam..."
> screenshot?
> Here is another one. See attachment.
>
> On Wednesday, 20 March 2013 12:25:44 UTC, sventech wrote:
>>
>> Show us an example image
>>
>> On Tuesday, March 19, 2013, Waldemar Pross wrote:
>>
>>> Let's imagine the following scenario (it is not very realistic, but it
>>> will make clear what I mean):
>>> Somebody has computer-generated text (not handwritten) and highlights
>>> part of it with, let's say, blue color.
>>> He takes a screenshot.
>>>
>>> Now, can tesseract-ocr apply OCR to just this highlighted piece?
>>>
>>> Below is an example.
>>>
>>> Would tesseract be able to recognise "vel justo quaestio definitionem
>>> te, viderer perpetua vim cu. Recteque
>>> omittantur id duo, amet utamur incorrupte has an"?
>>> (Of course it would be English or another real language.)
>>>
>>> Or do I need to cut off the unnecessary pieces somewhere else?
>>>
>>> Thank you!
>>>
>>> <https://lh5.googleusercontent.com/-eVhBboIJWJ8/UUjYa_Y9VSI/AAAAAAAABkk/Bilg20nyTu8/s1600/2013-03-19_2126.png>
>>>
>>> --
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To post to this group, send email to [email protected]
>>> To unsubscribe from this group, send email to
>>> tesseract-ocr+unsubscribe@googlegroups.com
>>> For more options, visit this group at
>>> http://groups.google.com/group/tesseract-ocr?hl=en
>>>
>>> ---
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesseract-ocr+unsubscribe@googlegroups.com.
>>> For more options, visit https://groups.google.com/groups/opt_out.

