Thanks Joe and Kevin for replying.

Just to close the loop on this issue: I was incorrectly expecting
ExtractImageMetadata to identify text in images. As per Kevin's suggestion,
we are now exploring Tesseract for this.
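
For anyone finding this thread later, here is a minimal sketch of the OCR step
on its own. It assumes the tesseract command-line tool is installed and on the
PATH. The helper names build_tesseract_command and ocr_image are hypothetical,
made up for illustration. This is not part of NiFi or of the processor Kevin
linked; it only shows what the OCR call itself might look like.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, lang="eng"):
    """Build the argument list for the tesseract CLI (hypothetical helper).

    Passing "stdout" as the output base asks tesseract to print the
    recognized text instead of writing it to a .txt file.
    """
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr_image(image_path, lang="eng"):
    """Run tesseract on one image and return the recognized text.

    Assumes the tesseract executable is installed; raises otherwise.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    result = subprocess.run(
        build_tesseract_command(image_path, lang),
        capture_output=True,  # collect stdout/stderr instead of printing
        text=True,            # decode output as str
        check=True,           # raise CalledProcessError on non-zero exit
    )
    return result.stdout
```

Calling ocr_image("page.jpg") would then return whatever text tesseract
recognizes in that file. ExtractImageMetadata, by contrast, only reads
metadata such as dimensions and timestamps.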

Regards
Mayank



On Thu, Sep 28, 2017 at 4:23 PM, Kevin Doran <[email protected]>
wrote:

> Hi Mayank,
>
> To clarify, are you attempting to extract image metadata (e.g., timestamp,
> width, height) or to convert a photo/graphic of text into a text string using
> optical character recognition (OCR)?
>
> If the former (metadata), Joe has you on the correct track for digging
> deeper. If OCR, this has been done by others with NiFi using custom
> processors; the keyword search is "NiFi OCR". I would recommend giving
> Jeremy Dyer's Tesseract Processor [1] a look. Here is a guide Jeremy
> published [2].
>
> Cheers,
> Kevin
>
> [1] https://github.com/jdye64/nifi-addons/tree/master/Processors/nifi-tesseract
> [2] https://community.hortonworks.com/articles/28380/nifi-ocr-using-apache-nifi-to-read-childrens-books.html
>
> On 9/28/17, 16:14, "Joe Witt" <[email protected]> wrote:
>
>     Mayank,
>
>     When you tried it what happened?  Did you look at the flow file
>     attributes after the extraction?
>
>     Can you share a jpeg you're using that is not working as you'd expect?
>
>     Thanks
>     Joe
>
>     On Thu, Sep 28, 2017 at 4:12 PM, mayank rathi <[email protected]> wrote:
>     > Hello All,
>     >
>     > Can NiFi extract text from jpg images? If yes, then which processor
>     > do I need to use? I tried the ExtractImageMetadata processor and it
>     > did not help.
>     >
>     > Thanks in advance.
>     >
>
>
>
>


