Thank you, David! I heartily second this recommendation: please do not reinvent the wheel!
On Tue, Nov 26, 2019 at 6:13 AM David Pilato <[email protected]> wrote:

> You could have a look at the FSCrawler project BTW, which supports indexing
> local files and files over ssh.
>
> https://fscrawler.readthedocs.io/en/latest/
>
> It uses Tika behind the scenes.
>
> HTH
>
> On 26 Nov 2019 at 12:07 +0100, Tim Allison <[email protected]> wrote:
>
> You won't be able to parse the files without reading the bytes from the
> remote server...so you have to transfer the bytes somehow. Once you do
> that and parse the files, then you can send what you want over to Elastic.
>
> Let me know if I misunderstood the question.
>
> Cheers,
>
> Tim
>
> On Mon, Nov 25, 2019 at 3:49 PM Cyrus Cheng <[email protected]>
> wrote:
>
>> Hi, I'm currently developing a project. I would like to use Tika to parse
>> files that are stored on a remote server from a local server, then ingest
>> them into an Elastic cluster without transferring the files over to the
>> local server at all. Is this possible? Thanks in advance.
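
For anyone who still wants to see the shape of the "transfer the bytes, parse, then send to Elastic" flow described in the thread, here is a minimal sketch in Python. It is not how FSCrawler does it. It assumes a Tika Server listening on localhost:9998 and an Elasticsearch node on localhost:9200; the index name "docs", the field names "source" and "content", and the helper names are all made up for illustration.

```python
import json
import urllib.request

# Assumed addresses; adjust to your own setup.
TIKA_URL = "http://localhost:9998/tika"
ES_URL = "http://localhost:9200"


def build_tika_request(data, tika_url=TIKA_URL):
    """Build a PUT request asking Tika Server for plain text extracted from raw bytes."""
    return urllib.request.Request(
        tika_url,
        data=data,
        method="PUT",
        headers={"Accept": "text/plain"},
    )


def build_index_request(text, source, index="docs", es_url=ES_URL):
    """Build a POST request that indexes the extracted text as one document.

    The "source" and "content" field names are hypothetical.
    """
    body = json.dumps({"source": source, "content": text}).encode("utf-8")
    return urllib.request.Request(
        f"{es_url}/{index}/_doc",
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


def parse_and_index(stream, source):
    """Read bytes from any file-like object, parse them with Tika, index the text."""
    data = stream.read()  # the bytes cross the network here, but are never written to disk
    with urllib.request.urlopen(build_tika_request(data)) as resp:
        text = resp.read().decode("utf-8")
    with urllib.request.urlopen(build_index_request(text, source)) as resp:
        return json.load(resp)
```

The `stream` argument can be any object with a `read()` method, for example a file handle opened over SFTP. The bytes still have to travel from the remote server to wherever Tika runs, but nothing needs to be saved locally. This skips error handling, large files, and metadata, all of which FSCrawler already takes care of.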
