[
https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16023233#comment-16023233
]
Stephen Sisk commented on BEAM-2328:
------------------------------------
Hi! We're excited you're working on this. Would you mind adding it to the list of
in-progress IO transforms? You just need to add it here:
https://github.com/apache/beam-site/blob/asf-site/src/documentation/io/built-in.md
> Introduce Apache Tika Input component
> -------------------------------------
>
> Key: BEAM-2328
> URL: https://issues.apache.org/jira/browse/BEAM-2328
> Project: Beam
> Issue Type: New Feature
> Components: sdk-ideas, sdk-java-extensions
> Reporter: Sergey Beryozkin
> Assignee: Sergey Beryozkin
> Fix For: 2.1.0
>
>
> Apache Tika is a popular project that offers extensive support for parsing
> a wide variety of file formats. It is used in many projects, including Lucene and
> Elasticsearch.
> Supporting a Tika Input (Read) at the Beam level would be of major interest
> to many users.
> A PR will follow.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)