analysis tool vs. reality

2010-08-04 Thread Justin Lolofie
Erik: Yes, I did re-index if that means adding the document again.
Here are the exact steps I took:

1. analysis.jsp ABC12 does NOT match title ABC12 (however, ABC or 12 does)
2. changed schema.xml WordDelimeterFilterFactory catenate-all
3. restarted tomcat
4. deleted the document with title ABC12
5. added the document with title ABC12
6. query ABC12 does NOT result in the document with title ABC12
7. analysis.jsp ABC12 DOES match that document now

Is there any way to see, given an ID, how something is indexed internally?

Lance: I understand the index/query sections of analysis.jsp. However,
it operates on text that you enter into the form, not on actual index
data. Since all my documents have a unique ID, I'd like to supply an
ID and a query, and get back the same index/query sections- using
whats actually in the index.


-- Forwarded message --
From: Erik Hatcher erik.hatc...@gmail.com
To: solr-user@lucene.apache.org
Date: Tue, 3 Aug 2010 22:43:17 -0400
Subject: Re: analysis tool vs. reality
Did you reindex after changing the schema?


On Aug 3, 2010, at 7:35 PM, Justin Lolofie wrote:

Hi Erik, thank you for replying. So, turning on debugQuery shows
information about how the query is processed- is there a way to see
how things are stored internally in the index?

My query is ABC12. There is a document who's title field is
ABC12. However, I can only get it to match if I search for ABC or
12. This was also true in the analysis tool up until recently.
However, I changed schema.xml and turned on catenate-all in
WordDelimterFilterFactory for title fieldtype. Now, in the analysis
tool ABC12 matches ABC12. However, when doing an actual query, it
does not match.

Thank you for any help,
Justin


-- Forwarded message --
From: Erik Hatcher erik.hatc...@gmail.com
To: solr-user@lucene.apache.org
Date: Tue, 3 Aug 2010 16:50:06 -0400
Subject: Re: analysis tool vs. reality
The analysis tool is merely that, but during querying there is also a
query parser involved.  Adding debugQuery=true to your request will
give you the parsed query in the response offering insight into what
might be going on.   Could be lots of things, like not querying the
fields you think you are to a misunderstanding about some text not
being analyzed (like wildcard clauses).

 Erik

On Aug 3, 2010, at 4:43 PM, Justin Lolofie wrote:

  Hello,

  I have found the analysis tool in the admin page to be very useful in
  understanding my schema. I've made changes to my schema so that a
  particular case I'm looking at matches properly. I restarted solr,
  deleted the document from the index, and added it again. But still,
  when I do a query, the document does not get returned in the results.

  Does anyone have any tips for debugging this sort of issue? What is
  different between what I see in analysis tool and new documents added
  to the index?

  Thanks,
  Justin


analysis tool vs. reality

2010-08-04 Thread Justin Lolofie
Wow, I got to work this morning and my query results now include the
'ABC12' document. I'm not sure what that means. Either I made a
mistake in the process I described in the last email (I dont think
this is the case) or there is some kind of caching of query results
going on that doesnt get flushed on a restart of tomcat.




Erik: Yes, I did re-index if that means adding the document again.
Here are the exact steps I took:

1. analysis.jsp ABC12 does NOT match title ABC12 (however, ABC or 12 does)
2. changed schema.xml WordDelimeterFilterFactory catenate-all
3. restarted tomcat
4. deleted the document with title ABC12
5. added the document with title ABC12
6. query ABC12 does NOT result in the document with title ABC12
7. analysis.jsp ABC12 DOES match that document now

Is there any way to see, given an ID, how something is indexed internally?

Lance: I understand the index/query sections of analysis.jsp. However,
it operates on text that you enter into the form, not on actual index
data. Since all my documents have a unique ID, I'd like to supply an
ID and a query, and get back the same index/query sections- using
whats actually in the index.


-- Forwarded message --
From: Erik Hatcher erik.hatc...@gmail.com
To: solr-user@lucene.apache.org
Date: Tue, 3 Aug 2010 22:43:17 -0400
Subject: Re: analysis tool vs. reality
Did you reindex after changing the schema?


On Aug 3, 2010, at 7:35 PM, Justin Lolofie wrote:

Hi Erik, thank you for replying. So, turning on debugQuery shows
information about how the query is processed- is there a way to see
how things are stored internally in the index?

My query is ABC12. There is a document who's title field is
ABC12. However, I can only get it to match if I search for ABC or
12. This was also true in the analysis tool up until recently.
However, I changed schema.xml and turned on catenate-all in
WordDelimterFilterFactory for title fieldtype. Now, in the analysis
tool ABC12 matches ABC12. However, when doing an actual query, it
does not match.

Thank you for any help,
Justin


-- Forwarded message --
From: Erik Hatcher erik.hatc...@gmail.com
To: solr-user@lucene.apache.org
Date: Tue, 3 Aug 2010 16:50:06 -0400
Subject: Re: analysis tool vs. reality
The analysis tool is merely that, but during querying there is also a
query parser involved.  Adding debugQuery=true to your request will
give you the parsed query in the response offering insight into what
might be going on.   Could be lots of things, like not querying the
fields you think you are to a misunderstanding about some text not
being analyzed (like wildcard clauses).

 Erik

On Aug 3, 2010, at 4:43 PM, Justin Lolofie wrote:

  Hello,

  I have found the analysis tool in the admin page to be very useful in
  understanding my schema. I've made changes to my schema so that a
  particular case I'm looking at matches properly. I restarted solr,
  deleted the document from the index, and added it again. But still,
  when I do a query, the document does not get returned in the results.

  Does anyone have any tips for debugging this sort of issue? What is
  different between what I see in analysis tool and new documents added
  to the index?

  Thanks,
  Justin


analysis tool vs. reality

2010-08-03 Thread Justin Lolofie
Hello,

I have found the analysis tool in the admin page to be very useful in
understanding my schema. I've made changes to my schema so that a
particular case I'm looking at matches properly. I restarted solr,
deleted the document from the index, and added it again. But still,
when I do a query, the document does not get returned in the results.

Does anyone have any tips for debugging this sort of issue? What is
different between what I see in analysis tool and new documents added
to the index?

Thanks,
Justin


analysis tool vs. reality

2010-08-03 Thread Justin Lolofie
Hi Erik, thank you for replying. So, turning on debugQuery shows
information about how the query is processed- is there a way to see
how things are stored internally in the index?

My query is ABC12. There is a document who's title field is
ABC12. However, I can only get it to match if I search for ABC or
12. This was also true in the analysis tool up until recently.
However, I changed schema.xml and turned on catenate-all in
WordDelimterFilterFactory for title fieldtype. Now, in the analysis
tool ABC12 matches ABC12. However, when doing an actual query, it
does not match.

Thank you for any help,
Justin


-- Forwarded message --
From: Erik Hatcher erik.hatc...@gmail.com
To: solr-user@lucene.apache.org
Date: Tue, 3 Aug 2010 16:50:06 -0400
Subject: Re: analysis tool vs. reality
The analysis tool is merely that, but during querying there is also a
query parser involved.  Adding debugQuery=true to your request will
give you the parsed query in the response offering insight into what
might be going on.   Could be lots of things, like not querying the
fields you think you are to a misunderstanding about some text not
being analyzed (like wildcard clauses).

   Erik

On Aug 3, 2010, at 4:43 PM, Justin Lolofie wrote:

Hello,

I have found the analysis tool in the admin page to be very useful in
understanding my schema. I've made changes to my schema so that a
particular case I'm looking at matches properly. I restarted solr,
deleted the document from the index, and added it again. But still,
when I do a query, the document does not get returned in the results.

Does anyone have any tips for debugging this sort of issue? What is
different between what I see in analysis tool and new documents added
to the index?

Thanks,
Justin


Re: Dismax query response field number

2010-07-22 Thread Justin Lolofie
scrapy what version of solr are you using?

I'd like to do fq=city:Paris but it doesnt seem to work for me (solr
1.4) and the docs seem to suggest its a feature that is coming but not
there yet? Or maybe I misunderstood?


On Thu, Jul 22, 2010 at 6:00 AM,  scr...@asia.com wrote:

  Thanks,

 That was the problem!




 select?q=motoqt=dismax fq =city:Paris











 -Original Message-
 From: Chantal Ackermann chantal.ackerm...@btelligent.de
 To: solr-user@lucene.apache.org solr-user@lucene.apache.org
 Sent: Thu, Jul 22, 2010 12:47 pm
 Subject: Re: Dismax query response field number


 is this a typo in your query or in your e-mail?

 you have the q parameter twice.
 use fq for query inputs that mention a field explicitly when using
 dismax.

 So it should be:
 select?q=motoqt=dismax fq =city:Paris

 (the whitespace is only for visualization)


 chantal


 On Thu, 2010-07-22 at 11:03 +0200, scr...@asia.com wrote:
 Yes i've data... maybe my query is wrong?

 select?q=motoqt=dismaxq=city:Paris

 Field city is not showing?








 -Original Message-
 From: Grijesh.singh pintu.grij...@gmail.com
 To: solr-user@lucene.apache.org
 Sent: Thu, Jul 22, 2010 10:07 am
 Subject: Re: Dismax query response field number



 Do u have data in that field also,Solr returns field which have data only.








boosting particular field values

2010-07-21 Thread Justin Lolofie
I'm using dismax request handler, solr 1.4.

I would like to boost the weight of certain fields according to their
values... this appears to work:

bq=category:electronics^5.5

However, I think this boosting only affects sorting the results that
have already matched? So if I only get 10 rows back, I might not get
any records back that are category electronics. If I get 100 rows, I
can see that bq is working. However, I only want to get 10 rows.

How does one affect the kinds of results that are matched to begin
with? bq is the wrong thing to use, right?

Thanks for any help,
Justin


Re: boosting particular field values

2010-07-21 Thread Justin Lolofie
I might have misunderstood, but I think I cant do string literals in
function queries, right?

myfield:something^3.0

I tried it anyway using solr 1.4, doesnt seem to work.

On Wed, Jul 21, 2010 at 1:48 PM, Markus Jelsma markus.jel...@buyways.nl wrote:
 function queries match all documents


 http://wiki.apache.org/solr/FunctionQuery#Using_FunctionQuery


 -Original message-
 From: Justin Lolofie jta...@gmail.com
 Sent: Wed 21-07-2010 20:24
 To: solr-user@lucene.apache.org;
 Subject: boosting particular field values

 I'm using dismax request handler, solr 1.4.

 I would like to boost the weight of certain fields according to their
 values... this appears to work:

 bq=category:electronics^5.5

 However, I think this boosting only affects sorting the results that
 have already matched? So if I only get 10 rows back, I might not get
 any records back that are category electronics. If I get 100 rows, I
 can see that bq is working. However, I only want to get 10 rows.

 How does one affect the kinds of results that are matched to begin
 with? bq is the wrong thing to use, right?

 Thanks for any help,
 Justin



setting up clustering

2010-07-14 Thread Justin Lolofie
I'm trying to enable clustering in solr 1.4. I'm following these instructions:

http://wiki.apache.org/solr/ClusteringComponent

However, `ant get-libraries` fails for me. Before it tries to download
the 4 jar files, it tries to compile lucene? Is this necessary?

Has anyone gotten clustering working properly?

My next attempt was to just copy contrib/clustering/lib/*.jar and
contrib/clustering/lib/downloads/*.jar to WEB-INF/lib and enable
clustering in solrconfig.xml, but this doesnt work either and I cant
tell from the error log whether it just couldnt find the jar files or
if there is some other problem:

SEVERE: org.apache.solr.common.SolrException: Error loading class
'org.apache.solr.handler.clustering.ClusteringComponent'