RE: Facet sorting seems weird

2013-07-15 Thread David Quarterman
Hi Henrik, Try setting up a copyfield in your schema and set the copied field to use something like 'text_ws' which implements LowerCaseFilterFactory. Then sort on the copyfield. Regards, DQ -Original Message- From: Henrik Ossipoff Hansen [mailto:h...@entertainment-trading.com]

RE: Commit different database rows to solr with same id value?

2013-07-10 Thread David Quarterman
Hi Jason, Assuming you're using DIH, why not build a new, unique id within the query to use as the 'doc_id' for SOLR? We do something like this in one of our collections. In MySQL, try this (don't know what it would be for any other db but there must be equivalents): select

SOLR 4.0 frequent admin problem

2013-07-04 Thread David Quarterman
Hi, About once a week the admin system comes up with SolrCore Initialization Failures. There's nothing in the logs and SOLR continues to work in the application it's supporting and in the 'direct access' mode (i.e. http://123.465.789.100:8080/solr/collection1/select?q=bingo:*). The cure is to

RE: SOLR 4.0 frequent admin problem

2013-07-04 Thread David Quarterman
SOLR-118, seems an old issue... On 4 Jul 2013 06:43, David Quarterman da...@corexe.com wrote: Hi, About once a week the admin system comes up with SolrCore Initialization Failures. There's nothing in the logs and SOLR continues to work in the application it's supporting and in the 'direct

RE: Newbie SolR - Need advice

2013-07-03 Thread David Quarterman
Hi Fabio, Sandeep is right - it'll take time. SOLR isn't straightforward when you first start out but the tutorial is the best first step. You can then adapt the various config files in the tutorial to adapt to your situation. I'd recommend a simple approach to get the hang of it and just

RE: Newbie SolR - Need advice

2013-07-02 Thread David Quarterman
Hi Fabio, Like Jack says, try the tutorial. But to answer your question, SOLR isn't a bolt on to SQLServer or any other DB. It's a fantastically fast indexing/searching tool. You'll need to use the DataImportHandler (see the tutorial) to import your data from the DB into the indices that SOLR

RE: Newbie SolR - Need advice

2013-07-02 Thread David Quarterman
: David Quarterman [via Lucene] ml-node+s472066n4074772...@n3.nabble.com Date: 02/07/2013 16:57 (GMT+00:00) To: fabio1605 fabio.to...@btinternet.com Subject: RE: Newbie SolR - Need advice Hi Fabio, Like Jack says, try the tutorial. But to answer your question, SOLR isn't a bolt

RE: Building a central index with Lucene + Solr

2013-03-05 Thread David Quarterman
Hi Alvaro, I agree with Otis Alexandre (esp. Windows + PHP!). However, there are plenty of people using Solr PHP out there very successfully. There's another good package at http://code.google.com/p/solr-php-client/ which is easy to implement and has some example usage. Regards, DQ

RE: Edismax odd results

2013-02-22 Thread David Quarterman
Hi Erick, Funnily enough, I cracked it about 5 minutes before your email arrived! Problem was using WhiteSpaceTokenizer instead of Standard AND had the LowerCaseFilter after the PorterStemmingFilter. Getting them in the right order has solved all the problems and we get all our engineer boots,

RE: If we Open Source our platform, would it be interesting to you?

2013-02-21 Thread David Quarterman
Hi Marcelo, Looked through your site and the framework looks very powerful as an aggregator. We do a lot of data aggregation from many different sources in many different formats (XML, JSON, text, CSV, etc) using RDBMS as the main repository for eventual SOLR indexing. A 'one-stop-shop' for

RE: Edismax odd results

2013-02-20 Thread David Quarterman
AM, David Quarterman wrote: This is definitely driving us mad now! Changed to PorterStemming and there's very little difference. If we add fq=engineer, we get 0 results. Add fq=engineer* and we get the 90 in the system. Try with fq=ankle* and we get 2. Correct. Try with fq=harness* and we

RE: Edismax odd results

2013-02-20 Thread David Quarterman
: Re: Edismax odd results When you get back to this tomorrow, also try and paste the parsed query bits you get back when you append debug=all. Sometimes it's surprising what the parsed query _really_ looks like Best Erick On Tue, Feb 19, 2013 at 3:13 PM, David Quarterman da...@corexe.com

RE: Edismax odd results

2013-02-20 Thread David Quarterman
about Best Erick On Wed, Feb 20, 2013 at 7:13 AM, David Quarterman da...@corexe.com wrote: Hi Erick, Debug=all posted on http://justpaste.it/davidqhogdebug. Can't see anything obvious myselfbut then I'm not an expert! Regards, DQ -Original Message- From: Erick Erickson

Edismax odd results

2013-02-19 Thread David Quarterman
Hi all, We have an index of boots which contains harness boots, engineer boots , ankle boots, etc. An edismax search on the index for 'harness boots' brings back 2,175 boots with 'harness' results at the top. 'Searching 'engineer boots' brings back everything but 'engineer boots', same for

RE: Edismax odd results

2013-02-19 Thread David Quarterman
to comparing the scoring. -- Jack Krupansky -Original Message- From: David Quarterman Sent: Tuesday, February 19, 2013 6:21 AM To: solr-user@lucene.apache.org Subject: Edismax odd results Hi all, We have an index of boots which contains harness boots, engineer boots , ankle boots, etc

RE: Edismax odd results

2013-02-19 Thread David Quarterman
level of INFO) should have a record of what parameters were actually received when the query was made. Thanks, Shawn On 2/19/2013 9:14 AM, David Quarterman wrote: Hi Jack, Here's q test query we've been using: select?q=+engineer+bootsdefType=edismaxfl=prodnameqf=prodnameplural spf2

RE: Edismax odd results

2013-02-19 Thread David Quarterman
Hi Shawn/Jack, The log shows the query going in okay, nothing gets stripped out so we're still at a loss to understand this. Could it be theta Snowball stemming is too invasive? Regards, DQ -Original Message- From: David Quarterman [mailto:da...@corexe.com] Sent: 19 February 2013 16

RE: Edismax odd results

2013-02-19 Thread David Quarterman
reduces 'engineer' to 'engin' so I'd have expected a lot more results. Anyone got any ideas? Regards, DQ -Original Message- From: David Quarterman [mailto:da...@corexe.com] Sent: 19 February 2013 17:09 To: solr-user@lucene.apache.org Subject: RE: Edismax odd results Hi Shawn/Jack

Re: Edismax odd results

2013-02-19 Thread David Quarterman
Hi Shawn, Now finished for the day but will post the schema tomorrow. Thanks for the help (and Jack too). Regards, DQ P.S. did reindex after changing schema and the analyzer/query stuff matches precisely!! Shawn Heisey s...@elyograg.org wrote: On 2/19/2013 11:16 AM, David Quarterman wrote

RE: Feature design question: use autocomple?te to search on 2 different fields, and return 2 different data groups

2012-11-01 Thread David Quarterman
We had a similar requirement and found the best solution (unfortunately) was to spend a small amount of money. Have a look at Sematext's site (www.sematext.com). Their Autocomplete is awesome and we have a fantastic looking AC now on our development site, grouped by category, product brand with

RE: Feature design question: use autocomple?te to search on 2 different fields, and return 2 different data groups

2012-11-01 Thread David Quarterman
Fernando, Pretty much the problem we came up against. We had a basic AC running using SpellChecker a while ago but it was the grouping that floored us and sent us elsewhere. Again, multiple queries seemed like the only possible answer but in an AC scenario, even with SOLR's speed, probably too

SOLR 4.0 Beta documents being duplicated

2012-10-05 Thread David Quarterman
Hi, We've been using V4.x of SOLR since last November without too much trouble. Our MySQL database is refreshed daily and a full import is run automatically after the refresh and generally produces around 86,000 products, obviously on unique doc_id's. So, we upgraded to 4.0 Beta a few days

RE: SOLR 4.0 Beta documents being duplicated

2012-10-05 Thread David Quarterman
44/solr/ There will be an RC2 sometime, a couple of problems have been found, but using RC1 should minimize any update to the official 4.0 plus have a lot of improvements over BETA... Best Erick On Fri, Oct 5, 2012 at 10:25 AM, David Quarterman da...@corexe.com wrote: Hi, We've been using V4.x