Thank you, David!  I heartily second this recommendation: please do not
reinvent the wheel!

On Tue, Nov 26, 2019 at 6:13 AM David Pilato <[email protected]> wrote:

> You could have a look at FSCrawler project BTW which supports indexing
> local files and files over ssh.
>
> https://fscrawler.readthedocs.io/en/latest/
>
> It uses Tika behind the scene.
>
> HTH
> Le 26 nov. 2019 à 12:07 +0100, Tim Allison <[email protected]>, a écrit
> :
>
> You won't be able to parse the files without reading the bytes from the
> remote server...so you have to transfer the bytes somehow.  Once you do
> that and parse the files, then you can send what you want over to Elastic.
>
> Let me know if I misunderstood the question.
>
> Cheers,
>
>       Tim
>
> On Mon, Nov 25, 2019 at 3:49 PM Cyrus Cheng <[email protected]>
> wrote:
>
>> Hi, I'm currently developing a project. I would like to use Tika to parse
>> files that are stored on a remote server from a local server, then ingest
>> them into an elastic cluster without transferring the files over to the
>> local server at all. Is this possible? Thanks in advance.
>>
>

Reply via email to