Please ask for help on the solr-user list. This is the dev list for Solr internals. Thanks.
On Fri, May 19, 2017 at 3:51 PM William Nelis <[email protected]> wrote:

> Hello.
>
> I am new to Solr and have a question about incremental indexing. We have a
> source text file that contains millions of rows. Each row is saved as a
> document in Solr. There is one field in each row that is a unique
> identifier.
>
> Unfortunately, this source text file can change. We need to check it every
> hour for changes. If rows are removed, we must remove them from Solr. If
> rows are added, we must add them to Solr.
>
> We do not want to drop all records and re-load them. Instead we would like
> to diff for the changes. What is the recommended way of doing this? Can we
> just get all values Solr stores for the unique identifier field and do the
> diff external to Solr? Does Solr provide functionality that will allow us
> to do the incremental changes even though the source file itself is not
> incremental?
>
> An example of the file format (obviously this is not a real file):
>
> AAQX This is the first document 213.32
> AAZT This is the second document 243.23
> ABGT This is the third document 321.43
> ...
>
> The first column is the unique identifier (there are far more columns, but
> this has been simplified).
>
> Thank you for your help.

-- 
Lucene/Solr Search Committer, Consultant, Developer, Author, Speaker
LinkedIn: http://linkedin.com/in/davidwsmiley | Book: http://www.solrenterprisesearchserver.com
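
For illustration only, the external diff described in the quoted question could look roughly like the sketch below. It is not a recommendation from this thread. It assumes the unique identifier is the first whitespace-separated column of each row, as in the sample file, and that the set of identifiers currently in the index has already been retrieved somehow (that step is not shown). The names parse_ids and diff_ids are hypothetical helpers, not part of Solr or any client library.

```python
from typing import Iterable, Set, Tuple


def parse_ids(lines: Iterable[str]) -> Set[str]:
    """Collect the first whitespace-separated column of each non-empty row.

    Assumption: the unique identifier is the first column, as in the
    sample file from the question.
    """
    ids = set()
    for line in lines:
        parts = line.split(None, 1)  # split on any run of whitespace, once
        if parts:  # skip blank lines
            ids.add(parts[0])
    return ids


def diff_ids(source_ids: Set[str], indexed_ids: Set[str]) -> Tuple[Set[str], Set[str]]:
    """Return (ids to add to the index, ids to delete from the index)."""
    to_add = source_ids - indexed_ids      # in the file, not yet indexed
    to_delete = indexed_ids - source_ids   # indexed, no longer in the file
    return to_add, to_delete


if __name__ == "__main__":
    # Rows taken from the example in the question.
    rows = [
        "AAQX This is the first document 213.32",
        "AAZT This is the second document 243.23",
        "ABGT This is the third document 321.43",
    ]
    # Hypothetical snapshot of identifiers currently in the index.
    indexed = {"AAQX", "ZZZZ"}

    to_add, to_delete = diff_ids(parse_ids(rows), indexed)
    print("add:", sorted(to_add))
    print("delete:", sorted(to_delete))
```

This only covers added and removed rows, which is what the question asks about. Rows whose identifier is unchanged but whose other columns differ would not be detected by an identifier-only diff.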
