Hi Mayank,

To clarify, are you attempting to extract image metadata (e.g., timestamp, 
width, height) or convert a photo/graphic of text into a text string using 
optical character recognition (OCR). 

If the former (metadata), Joe has you on the correct track for digging deeper. 
If OCR, has been done by others with NiFi using custom processors. Keyword 
search is "NiFi OCR". I would recommend giving Jeremy Dyer's Tesseract 
Processor [1] a look. Here is a guide Jeremy published [2].

Cheers, 
Kevin

[1] https://github.com/jdye64/nifi-addons/tree/master/Processors/nifi-tesseract 
[2] 
https://community.hortonworks.com/articles/28380/nifi-ocr-using-apache-nifi-to-read-childrens-books.html
 

On 9/28/17, 16:14, "Joe Witt" <[email protected]> wrote:

    Mayank,
    
    When you tried it what happened?  Did you look at the flow file
    attributes after the extraction?
    
    Can you share a jpeg you're using that is not working as you'd expect?
    
    Thanks
    Joe
    
    On Thu, Sep 28, 2017 at 4:12 PM, mayank rathi <[email protected]> 
wrote:
    > Hello All,
    >
    > Can NiFi extract text from jpg images? If Yes, then which processor do I
    > need to use? I tried ExtractImageMetadata processor and it did not help.
    >
    > Thanks in advance.
    >
    


Reply via email to