Re: Fastest way to use solrj

2010-01-27 Thread Noble Paul നോബിള്‍ नोब्ळ्
how many fields are there in each doc? the binary format just reduces overhead. it does not touch/compress the payload 2010/1/27 Tim Terlegård tim.terleg...@gmail.com: I have 3 millon documents, each having 5000 chars. The xml file is about 15GB. The binary file is also about 15GB. I was a

Re: Fastest way to use solrj

2010-01-27 Thread Tim Terlegård
I have 6 fields. The text field is the biggest, it contains almost all of the 5000 chars. /Tim 2010/1/27 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@corp.aol.com: how many fields are there in each doc? the binary format just reduces overhead. it does not touch/compress the payload 2010/1/27 Tim

Re: Fastest way to use solrj

2010-01-27 Thread Noble Paul നോബിള്‍ नोब्ळ्
The binary format just reduces overhead. in your case , all the data is in the big text field which is not compressed. But overall, the parsing is a lot faster for the binary format. So you see a perf boost 2010/1/27 Tim Terlegård tim.terleg...@gmail.com: I have 6 fields. The text field is the

Re: Fastest way to use solrj

2010-01-26 Thread Noble Paul നോബിള്‍ नोब्ळ्
if you write only a few docs you may not observe much difference in size. if you write large no:of docs you may observe a big difference. 2010/1/27 Tim Terlegård tim.terleg...@gmail.com: I got the binary format to work perfectly now. Performance is better than with xml. Thanks! Although, it

Re: Fastest way to use solrj

2010-01-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
2010/1/20 Tim Terlegård tim.terleg...@gmail.com: BinaryRequestWriter does not read from a file and post it Is there any other way or is this use case not supported? I tried this: $ curl host/solr/update/javabin -F stream.file=/tmp/data.bin $ curl host/solr/update -F stream.body=' commit /'

Re: Fastest way to use solrj

2010-01-20 Thread Tim Terlegård
Yes, it worked! Thank you very much. But do I need to use curl or can I use CommonsHttpSolrServer or StreamingUpdateSolrServer? If I can't use BinaryWriter then I don't know how to do this. /Tim 2010/1/20 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@corp.aol.com: 2010/1/20 Tim Terlegård

Re: Fastest way to use solrj

2010-01-19 Thread Noble Paul നോബിള്‍ नोब्ळ्
2010/1/19 Tim Terlegård tim.terleg...@gmail.com: There are a few ways to use solrj. I just learned that I can use the javabin format to get some performance gain. But when I try the binary format nothing is added to the index. This is how I try to use this:    server = new

Re: Fastest way to use solrj

2010-01-19 Thread Tim Terlegård
2010/1/19 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@corp.aol.com: 2010/1/19 Tim Terlegård tim.terleg...@gmail.com:    server = new CommonsHttpSolrServer(http://localhost:8983/solr;)    server.setRequestWriter(new BinaryRequestWriter())    request = new UpdateRequest()    

Fastest way to use solrj

2010-01-18 Thread Tim Terlegård
There are a few ways to use solrj. I just learned that I can use the javabin format to get some performance gain. But when I try the binary format nothing is added to the index. This is how I try to use this: server = new CommonsHttpSolrServer(http://localhost:8983/solr;)