Hi Guys

The link to the initial code is available in JIRA. At this stage the focus is on preparing a solid initial PR, and then we can all improve the Tika-related code :-)

Cheers, Sergey
On 24/05/17 11:41, Sergey Beryozkin wrote:
Hi Tim, All,

I thought I'd start a dedicated thread.

I added some initial comments to [1], and I'm now quite close to creating the initial PR.

Thanks, Sergey

[1] https://issues.apache.org/jira/browse/BEAM-2328
On 23/05/17 17:42, Allison, Timothy B. wrote:
Another idea... if you have any interest, it would be great to get Apache Beam set up on our Rackspace VM (with Spark?) and use it for our regression tests.

-----Original Message-----
From: Sergey Beryozkin [mailto:[email protected]]
Sent: Friday, May 19, 2017 4:21 PM
To: [email protected]
Subject: Re: Extracting Text from embedded images in PDF docs

Hi Tim

Sure, once I get an initial PR ready I'll send an update explaining what I did as a start, and we can discuss it further.



--
Sergey Beryozkin

Talend Community Coders
http://coders.talend.com/
