Hi Dan, The JDBC connector pulls content from a column in a database table. It does not fetch the contents of any URLs.
The use case you are describing sounds most similar to the RSS connector. Karl On Wed, Nov 26, 2014 at 5:27 PM, Dan Davis <[email protected]> wrote: > > I've seen the online user documentation, but one key piece of information > is missing for me. What does the database connector do? Does it visit > each URL, retrieve the document, so that Tika can work on it? > That I think is the key to refine my understanding of whether ManifoldCF > can address my needs. > > Some of the things I need to ingest however are similar, but based on XML > files which were in turn output via database queries. So, does anyone > have an XML repository connector that can break-up a file into virtual > "documents" by using Xpath expressions? > > Thanks, > > Daniel Davis, Systems/Applications Architect (Contractor), > > Office of Computer and Communications Systems, National Library of > Medicine, NIH >
