Re: Multiple passes with WordDelimiterFilterFactory

2010-08-29 Thread Shawn Heisey
It's metadata for a collection of 45 million documents that is mostly photos, with some videos and text. The data is imported from a MySQL database and split among six large shards (each nearly 13GB) and a small shard with data added in the last week. That works out to between 300,000 and

anybody using solr with Cassandra?

2010-08-29 Thread Siju George
Hi, Is anybody using Solr with Cassandra? Are there any Gotcha's? Thanks --Siju

Re: Multiple passes with WordDelimiterFilterFactory

2010-08-29 Thread Shawn Heisey
On 8/28/2010 7:59 PM, Shawn Heisey wrote: The only drop in term quality that I noticed was that possessive words (apostrophe-s) no longer have the original preserved. I haven't yet decided whether that's a problem. I finally did notice another drop in term quality from the dual pass -

Re: Multiple passes with WordDelimiterFilterFactory

2010-08-29 Thread Erick Erickson
Look at the tokenizer/filter chain that makes up your analyzers, and see: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters for other tokenizer/analyzer/filter options. You're on the right track looking at the various choices provided, and I suspect you'll find what you need... Be a

Re: ExternalFileField best practices

2010-08-29 Thread simon
The extended dismax parser (see SOLR-1553) may do what you are looking for From its feature list.. 'Supports the boost parameter.. like the dismax bf param, but multiplies the function query instead of adding it in' On Sun, Aug 29, 2010 at 12:27 AM, Andy angelf...@yahoo.com wrote: But isn't

Re: Multiple passes with WordDelimiterFilterFactory

2010-08-29 Thread Shawn Heisey
Thank you for taking the time to help. The way I've got the word delimiter index filter set up with only one pass, wolf-biederman will result in wolf, biederman, wolfbiederman, and wolf-biederman. With two passes, the last one is not present. One pass changes gremlin's to gremlin and

Re: Search Results optimization

2010-08-29 Thread Hasnain
also my request handler looks like this requestHandler name=mb_artists class=solr.SearchHandler lst name=defaults str name=defTypedismax/str str name=qfname ^2.4/str str name=tie0.1/str /lst /requestHandler I really need some help on this, again, what I want is...if I search for swingline red

Re: Multiple passes with WordDelimiterFilterFactory

2010-08-29 Thread Erick Erickson
There's nothing built into SOLR that I know of that'll deal with auto-detecting multiple languages and doing the right thing. I know there's been discussion of that, searching the users' list might help... You may have to write your own analyzer that tries to do this, but I have no clue how you'd

Updating document without removing fields

2010-08-29 Thread Max Lynch
Hi, I have a master solr server and two slaves. On each of the slaves I have programs running that read the slave index, do some processing on each document, add a few new fields, and commit the changes back to the master. The problem I'm running into right now is one slave will update one

Re: Updating document without removing fields

2010-08-29 Thread Lance Norskog
No. Document creation is all-or-nothing, fields are not updateable. I think you have to filter all of your field changes through a join server. That is, all field updates could go to a database and the master would read document updates from that database. Or, you could have one updater feed