Re: Importing data to Solr

2012-07-20 Thread Lance Norskog
> My data is in an enormous text file that is parsed in python, You mean it is in Python s-expressions? I don't think there is a parser in DIH for that. On Thu, Jul 19, 2012 at 9:27 AM, Erick Erickson wrote: > First, turn off all your soft commit stuff, that won't help in your situation. > If yo

Re: Importing data to Solr

2012-07-19 Thread Erick Erickson
First, turn off all your soft commit stuff, that won't help in your situation. If you do leave autocommit on, make it a really high number (let's say 1,000,000 to start). You won't have to make 300M calls, you can batch, say, 1,000 docs into each request. DIH supports a bunch of different data so

Re: Importing data to Solr

2012-07-19 Thread Michael Della Bitta
Hi Jonatan, Ideally you'd use a Solr API client that allowed batched updates, so you'd be sending documents 100 at a time, say. Alternatively, if you're good with Java, you could build an index by using the EmbeddedSolrServer class in the same process as the code you use to parse the documents. Bu

Importing data to Solr

2012-07-19 Thread Jonatan Fournier
Hello, I was wondering if there's other ways to import data in Solr than posting xml/json/csv to the server URL (e.g. locally building the index). Is the DataImporter only for database? My data is in an enormous text file that is parsed in python, I get clean json/xml out of it if I want, but the