[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-07-13 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16085343#comment-16085343 ] Sergey Beryozkin commented on BEAM-2328: [~talli...@mitre.org] Hi Tim - the PR has been updated to

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-07-03 Thread JIRA
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16072549#comment-16072549 ] Jean-Baptiste Onofré commented on BEAM-2328: I'm still reviewing the PR. It's short to include

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-16 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16051839#comment-16051839 ] ASF GitHub Bot commented on BEAM-2328: -- GitHub user sberyozkin opened a pull request:

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-16 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16051834#comment-16051834 ] Sergey Beryozkin commented on BEAM-2328: HI All, The initial cleanup of the 'tikaio' branch is now

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-14 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049057#comment-16049057 ] Sergey Beryozkin commented on BEAM-2328: Hi JB, All, I'm now ready to create the initial PR. As I

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-02 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16034412#comment-16034412 ] Sergey Beryozkin commented on BEAM-2328: Hi JB, Tim re org.json dependencies, FYI, at the moment

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032904#comment-16032904 ] Sergey Beryozkin commented on BEAM-2328: Sorry, Tika already reports the characters... > Introduce

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032881#comment-16032881 ] Sergey Beryozkin commented on BEAM-2328: Hi JB, Tim Yes, TikaReader returns Strings, but as JB

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread JIRA
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032843#comment-16032843 ] Jean-Baptiste Onofré commented on BEAM-2328: Thanks [~talli...@mitre.org] for the update about

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032842#comment-16032842 ] Tim Allison commented on BEAM-2328: --- I've only taken a quick look at the patch. Looks great to me! The

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032838#comment-16032838 ] Tim Allison commented on BEAM-2328: --- bq. The last thing I'd like to investigate for a start is to check

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032833#comment-16032833 ] Tim Allison commented on BEAM-2328: --- Y. We're in the process of removing {{org.json}}. TIKA-1804. Ugh.

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread JIRA
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032829#comment-16032829 ] Jean-Baptiste Onofré commented on BEAM-2328: By the way, Tika should also remove the

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread JIRA
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032828#comment-16032828 ] Jean-Baptiste Onofré commented on BEAM-2328: Awesome ! I'm starting the review. > Introduce

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032825#comment-16032825 ] Sergey Beryozkin commented on BEAM-2328: I've added some TikaReader and TikaSource tests. Tika

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-05-25 Thread JIRA
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16024987#comment-16024987 ] Jean-Baptiste Onofré commented on BEAM-2328: Thanks [~sergey_beryozkin] ! I will do first round

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-05-25 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16024981#comment-16024981 ] Sergey Beryozkin commented on BEAM-2328: The initial code is here:

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-05-24 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16023826#comment-16023826 ] Sergey Beryozkin commented on BEAM-2328: Sorry for a bit of a noise, I spotted in the docs that the