Re: Error when indexing XML files

2009-10-13 Thread Fergus McMenemie
>Hi, > >I am trying to index XML files using SolrJ. The original XML file contains >nested elements. For example, the following is the snippet of the XML file. > > >  SOMETHING >  SOME_OTHER_THING >  > >I have added the elements "name" and "facility" in Schema.xml file to make >these e

Re: using DataImportHandler with ExtractRequestHandler ?

2009-10-13 Thread abhay kumar
Thanks Steven for the quick reply .. On Wed, Oct 14, 2009 at 1:56 AM, Steven A Rowe wrote: > See http://issues.apache.org/jira/browse/SOLR-1358 > > Steve > > > -Original Message- > > From: abhay kumar [mailto:abhay...@gmail.com] > > Sent: Tuesday, October 13, 2009 8:59 AM > > To: solr-us

Re: Dismax: Impossible to search for a _phrase_ in tokenized and untokenized fields at the same time

2009-10-13 Thread Alex Baranov
Thank you for the explanation. Let's say product_name_un is not untokenized, but it is tokenized with: and the user enters "blue car, big wheels". I want search to be applied to both fields: product_name and product_name_un with a greater boost factor for product_name_un. So that if there are

Re: Customizing solr search: SpanQueries (revisited)

2009-10-13 Thread seanoc5
I'm fairly sure I did a custom (Hit)Collector in lucene-java, but all I can find at the moment are my retro implementations (w/o collectors). I won't bore (or scare?) you with the details, but I follow some of what you're suggesting. I have been able to get straight SpanQueries to work in my cus

Re: StreamingUpdateSolrServer - indexing process stops in a couple of hours

2009-10-13 Thread Lance Norskog
Which Java release is this? There are known thread-blocking problems in Java 1.5. Also, what sockets are used during this time? Try 'netstat -s | fgrep 8983' (or your Solr URL port #) and watch the active, TIME_WAIT, CLOSE_WAIT sockets build up. This may give a hint. On Tue, Oct 13, 2009 at 8:47

Re: EmbeddedSolrServer - default index name

2009-10-13 Thread Lance Norskog
Solr wants to keep various data directories like the spellchecking index, not just the main index. The solr.data.dir option gives the location of the data/ directory, which defaults under solr/. This line in solrconfig.xml uses the property: <dataDir>${solr.data.dir:./solr/data}</dataDir> This starts the example
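
The ${solr.data.dir:./solr/data} expression quoted above reads as "use the solr.data.dir property if it is set, otherwise fall back to ./solr/data". The following is only a plain-Java sketch of that lookup rule, not Solr's own substitution code; the class name and the /tmp/output override are hypothetical.

```java
public class DataDirDefault {
    // Mirrors the ${name:default} rule: return the system property
    // if it is set, otherwise the default that follows the colon.
    public static String resolveDataDir() {
        return System.getProperty("solr.data.dir", "./solr/data");
    }

    public static void main(String[] args) {
        // Without -Dsolr.data.dir=... the default applies.
        System.out.println("data dir: " + resolveDataDir());

        // Hypothetical override, equivalent to passing -Dsolr.data.dir=/tmp/output.
        System.setProperty("solr.data.dir", "/tmp/output");
        System.out.println("data dir: " + resolveDataDir());
    }
}
```

The same override is usually given on the command line as a -D option when starting the JVM.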

RE: Lucene Merge Threads

2009-10-13 Thread Giovanni Fernandez-Kincade
I'm still getting the error after getting the latest from trunk and building it. This is what I added to the solrconfig.xml: 5 Any other ideas? Thanks, Gio. SEVERE: org.apache.solr.common.SolrException: Error loading class ' 5 ' at or

Re: Using mincount with date facet in Solr 1.4

2009-10-13 Thread Lance Norskog
Deleting a document leaves behind the terms, which end up with a 0-length list of documents. Facets include these terms. After you delete the documents, run an optimize and these "orphan" terms will disappear. That is why you find '1996:0' in your index. However, I don't know why you would get a '

Re: Is negative boost possible?

2009-10-13 Thread Andrzej Bialecki
Yonik Seeley wrote: On Mon, Oct 12, 2009 at 12:03 PM, Andrzej Bialecki wrote: Solr never discarded non-positive hits, and now Lucene 2.9 no longer does either. Hmm ... The code that I pasted in my previous email uses Searcher.search(Query, int), which in turn uses search(Query, Filter, int), a

Re: solr IOException

2009-10-13 Thread Yonik Seeley
Jetty has a maximum request size for HTTP-GET... can you use POST instead? -Yonik http://www.lucidimagination.com On Tue, Oct 13, 2009 at 4:33 PM, Elaine Li wrote: > Hi, > > In my query, i have around 80 boolean clauses. I don't know if it is > because the number of boolean clauses are too big,
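
A rough sketch of the POST suggestion, assuming Java 11+ and its java.net.http classes rather than SolrJ. The URL, the id field and the 80 generated clauses are placeholders, not taken from the original post. The request is only built here, not sent.

```java
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpRequest;
import java.nio.charset.StandardCharsets;

public class SolrPostQuery {
    // Form-encode the query so it travels in the request body, not the URL.
    public static String formBody(String query) {
        return "q=" + URLEncoder.encode(query, StandardCharsets.UTF_8) + "&rows=10";
    }

    // Build (but do not send) a POST request to an assumed local Solr instance.
    public static HttpRequest buildRequest(String query) {
        return HttpRequest.newBuilder(URI.create("http://localhost:8983/solr/select"))
                .header("Content-Type", "application/x-www-form-urlencoded")
                .POST(HttpRequest.BodyPublishers.ofString(formBody(query)))
                .build();
    }

    public static void main(String[] args) {
        // Hypothetical query with 80 boolean clauses on a made-up "id" field.
        StringBuilder q = new StringBuilder();
        for (int i = 1; i <= 80; i++) {
            if (i > 1) q.append(" OR ");
            q.append("id:").append(i);
        }
        HttpRequest request = buildRequest(q.toString());
        System.out.println(request.method() + " " + request.uri());
        System.out.println("body length: " + formBody(q.toString()).length());
    }
}
```

Sending it would take an HttpClient and a running server; the point is only that the long query string moves out of the request line.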

Error when indexing XML files

2009-10-13 Thread Chaitali Gupta
Hi, I am trying to index XML files using SolrJ. The original XML file contains nested elements. For example, the following is the snippet of the XML file.   SOMETHING   SOME_OTHER_THING   I have added the elements "name" and "facility" in Schema.xml file to make these elements inde

solr IOException

2009-10-13 Thread Elaine Li
Hi, In my query, I have around 80 boolean clauses. I don't know whether I ran into this problem because the number of boolean clauses is too big. My Solr config file actually sets the max number to 1024. Can anyone help? _header=[1515632954,1939520811,m=3653,g=4096,p=4096,c=4096]={saui

RE: using DataImportHandler with ExtractRequestHandler ?

2009-10-13 Thread Steven A Rowe
See http://issues.apache.org/jira/browse/SOLR-1358 Steve > -Original Message- > From: abhay kumar [mailto:abhay...@gmail.com] > Sent: Tuesday, October 13, 2009 8:59 AM > To: solr-user@lucene.apache.org; solr-user- > sc.1251278899.kmoigkhhnpcnaplolgcb- > abhayait=gmail@lucene.apache.or

Re: Boosting of words

2009-10-13 Thread Nicholas Clark
Bhaskar, Read this page, specifically how to query data. http://lucene.apache.org/solr/tutorial.html#Querying+Data It sounds like you are very new to Solr, so I would also suggest reading the wiki. http://wiki.apache.org/solr/ -Nick On Mon, Oct 12, 2009 at 10:02 PM, bhaskar chandrasekar wro

Re: Customizing solr search: SpanQueries (revisited)

2009-10-13 Thread Chris Hostetter
: (e.g. defType=fooSpanQuery), along with token positions. I have this working : in straight lucene, so my challenge is to implement it half-intelligently in : solr. At the moment, I can't figure out where and how to customize the : 'inner' search process. the first step is to really make sense o

Re: how can I use debugQuery if I have extended QParserPlugin?

2009-10-13 Thread gdeconto
my apologies, you are correct; I put the stack trace in an edit of the post and not in the original post. re version info: Solr Specification Version: 1.3.0.2009.07.08.08.05.45 Solr Implementation Version: nightly exported - yonik - 2009-07-08 08:05:45 NOTE: I have some more info on this NPE

Re: Letters with accent in query

2009-10-13 Thread Chris Hostetter
: I'm querying with an accented keyword such as "café" but the debug info : shows that it is only searching for "caf". I'm using the ISOLatin1Accent ... : http://localhost:8983/solr/select?q=%E9&debugQuery=true : : Params return shows this: : : ...that's a pretty good tip off that you
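
The reply is cut off here, so the diagnosis itself is not shown. As background only, this small sketch shows that the accented letter has two different URL-encoded forms depending on the charset: the %E9 in the URL above is the ISO-8859-1 form, while UTF-8 produces %C3%A9. Which one the servlet container expects is something to check in its configuration; the class and method names are made up for illustration.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class AccentEncoding {
    // Percent-encode using single-byte ISO-8859-1 (Latin-1).
    public static String latin1(String s) {
        return URLEncoder.encode(s, StandardCharsets.ISO_8859_1);
    }

    // Percent-encode using UTF-8, where the accented letter takes two bytes.
    public static String utf8(String s) {
        return URLEncoder.encode(s, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String keyword = "caf\u00e9"; // "cafe" with an acute accent on the e
        System.out.println("ISO-8859-1: " + latin1(keyword));
        System.out.println("UTF-8:      " + utf8(keyword));
    }
}
```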

Re: Solr over DRBD

2009-10-13 Thread Pieter Steyn
> Ypu should set a hearbeat and have the virtual IP setup for the active > instance. > So in haresources you can set like this: > node1  IPaddr::10.2.0.11 drbddisk::r0 > Filesystem::/dev/drbd0::/cluster/Solr::ext3::defaults,noatime  httpd Thanks, I already have this as the server hosts apache an

Re: Tips on speeding up indexing needed...

2009-10-13 Thread William Pierce
Oops, my bad! I didn't realize that by changing the subject line I was still "part" of the thread whose subject I changed! Sorry, folks! Thanks, Hoss, for pointing this out! - Bill -- From: "Chris Hostetter" Sent: Tuesday, October 13, 2009 11:

RE: Solr over DRBD

2009-10-13 Thread Francis Yakin
You should set a heartbeat and have the virtual IP set up for the active instance. So in haresources you can set it like this: node1 IPaddr::10.2.0.11 drbddisk::r0 Filesystem::/dev/drbd0::/cluster/Solr::ext3::defaults,noatime httpd Are you running an active/active cluster or active/passive? Francis

Re: Tips on speeding up indexing needed...

2009-10-13 Thread Chris Hostetter
: References: <4acb30d2.2010...@umich.edu> : <69de18140910070109m27e50d2sc82a7c7bdd683...@mail.gmail.com> : <4acc95a3.5000...@umich.edu> : : <4acfc943.4040...@umich.edu> : In-Reply-To: <4acfc943.4040...@umich.edu> : Subject: Tips on speeding up indexing needed... http://people.ap

Re: Dynamically compute document scores...

2009-10-13 Thread Chris Hostetter
: References: <4acb30d2.2010...@umich.edu> : <69de18140910070109m27e50d2sc82a7c7bdd683...@mail.gmail.com> : <4acc95a3.5000...@umich.edu> : : <4acfc943.4040...@umich.edu> : : <87c998320910101958r589c3f3dw2feedb16b8c3f...@mail.gmail.com> : : In-Reply-To: : Subject: D

Re: Dismax: Impossible to search for a _phrase_ in tokenized and untokenized fields at the same time

2009-10-13 Thread Chris Hostetter
: It seems to me that there is no way how I can use dismax handler for : searching in both tokenized and untokenized fields while I'm searching for a : phrase. The typical way of dealing with this is to put the untokenized fields in the "pf" param and have copyFields that create variants of thos

Re: SolrJ - open existing index in EmbeddedSolrServer - query mode

2009-10-13 Thread Chris Hostetter
: This code is called passing an existing index, say /tmp/output : But this code, instead of opening an existing index and read it and make : queries against it, it just creates a new one, : in /tmp/output/index SolrCore doesn't take in the exact path of the index -- it takes in a "dataDir" ...

Re: multi-word synonyms and analysis.jsp vs real field analysis (query, index)

2009-10-13 Thread Chris Hostetter
: The cause of my problem should be the query parsing, but I don't know, : if there is any solution for it. I need a possibility that works like : the analysis/query parsing within /admin/analysis.jsp view. The behavior you are describing is very well documented on the wiki... http://wiki.apache.

Re: how can I use debugQuery if I have extended QParserPlugin?

2009-10-13 Thread Chris Hostetter
: My original post ( : http://www.nabble.com/how-can-I-use-debugQuery-if-I-have-extended-QParserPlugin--tt25789546.html : ) has the stack trace. =^D Actually, no... your original post didn't inc

Re: Using mincount with date facet in Solr 1.4

2009-10-13 Thread rperkowska
Hi, in my ATs I'm observing a similar thing happening from time to time - in a search that I perform I can see facet results that contain '0' both as keys and values. But there is no possibility that zeros are valid values. When I do a normal search, I don't see any indexed fields with zeros. Wh

Re: Passing request to another handler

2009-10-13 Thread Andrzej Bialecki
Chris Hostetter wrote: : What's the canonical way to pass an update request to another handler? I'm : implementing a handler that has to dispatch its result to different update : handlers based on its internal processing. I've always written my delegating RequestHandlers so that they take in the

RE: Lucene Merge Threads

2009-10-13 Thread Giovanni Fernandez-Kincade
Will do. Thanks! -Original Message- From: Shalin Shekhar Mangar [mailto:shalinman...@gmail.com] Sent: Tuesday, October 13, 2009 11:48 AM To: solr-user@lucene.apache.org Subject: Re: Lucene Merge Threads On Tue, Oct 13, 2009 at 8:19 PM, Giovanni Fernandez-Kincade < gfernandez-kinc...@capi

Re: Passing request to another handler

2009-10-13 Thread Chris Hostetter
: What's the canonical way to pass an update request to another handler? I'm : implementing a handler that has to dispatch its result to different update : handlers based on its internal processing. I've always written my delegating RequestHandlers so that they take in the names (or paths) of th

Re: search by some functionality

2009-10-13 Thread Chris Hostetter
Actually, I forgot that the new frange query parser can let you filter on the output of a function... http://www.lucidimagination.com/blog/tag/frange/ : : Maybe I'm missing something, but function queries aren't involved in : : determining whether a document matches or not, only its score. How

Re: Dynamically compute document scores...

2009-10-13 Thread Avlesh Singh
Options - 1. Can you pre-compute your "business logic" score at index time? If yes, then this value can be stored in some field and you can use function queries to use this data plus the score to return a value which you can sort upon. 2. Take a look at - http://lucene.apache.org/j

Re: Lucene Merge Threads

2009-10-13 Thread Shalin Shekhar Mangar
On Tue, Oct 13, 2009 at 8:19 PM, Giovanni Fernandez-Kincade < gfernandez-kinc...@capitaliq.com> wrote: > Here's the version information from the admin page: > > Solr Specification Version: 1.3.0.2009.07.28.18.51.06 > Solr Implementation Version: 1.4-dev ${svnversion} - gkincade - 2009-07-28 > 18:5

StreamingUpdateSolrServer - indexing process stops in a couple of hours

2009-10-13 Thread Dadasheva, Olga
Hi, I am indexing documents using StreamingUpdateSolrServer. My 'setup' code is almost a copy of the junit test of the Solr trunk. try { StreamingUpdateSolrServer streamingServer = new StreamingUpdateSolrServer( url, 2, 5 ) { @Overri

Dynamically compute document scores...

2009-10-13 Thread William Pierce
Folks: During query time, I want to dynamically compute a document score as follows: a) Take the SOLR score for the document -- call it S. b) Look up the "business logic" score for this document. Call it L. c) Compute a new score T = func(S, L) d) Return the documents sorted by T. I h
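
Steps a) to d) can be pictured with a small in-memory sketch. This is plain Java over a list and not Solr's scoring machinery. The Doc class, the sample values, and the choice of a simple product for func(S, L) are all assumptions for illustration, since the post does not say what func is.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class CombinedScore {
    // Hypothetical holder for one hit: S is the Solr score, L the business score.
    public static final class Doc {
        final String id;
        final double solrScore;
        final double businessScore;

        public Doc(String id, double solrScore, double businessScore) {
            this.id = id;
            this.solrScore = solrScore;
            this.businessScore = businessScore;
        }
    }

    // Placeholder for func(S, L); a product is assumed here.
    public static double func(double s, double l) {
        return s * l;
    }

    // Return document ids ordered by T = func(S, L), highest first.
    public static List<String> rank(List<Doc> docs) {
        List<Doc> sorted = new ArrayList<>(docs);
        sorted.sort(Comparator.comparingDouble(
                (Doc d) -> func(d.solrScore, d.businessScore)).reversed());
        List<String> ids = new ArrayList<>();
        for (Doc d : sorted) {
            ids.add(d.id);
        }
        return ids;
    }

    public static void main(String[] args) {
        List<Doc> docs = new ArrayList<>();
        docs.add(new Doc("a", 2.0, 0.5));
        docs.add(new Doc("b", 1.0, 3.0));
        docs.add(new Doc("c", 1.5, 1.0));
        System.out.println(rank(docs));
    }
}
```

Sorting on the client like this only works over hits that have already been fetched, which is why the replies in this thread point toward computing the value inside Solr instead.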

RE: Lucene Merge Threads

2009-10-13 Thread Giovanni Fernandez-Kincade
Here's the version information from the admin page: Solr Specification Version: 1.3.0.2009.07.28.18.51.06 Solr Implementation Version: 1.4-dev ${svnversion} - gkincade - 2009-07-28 18:51:06 Lucene Specification Version: 2.9-dev Lucene Implementation Version: 2.9-dev 794238 - 2009-07-15 18:05:08

using DataImportHandler with ExtractRequestHandler ?

2009-10-13 Thread abhay kumar
Hi, We are using solr-1.4 for our search module. We have a long schema (35 fields) in which some field values come from a database and one field value comes from different file formats. We are able to index different file formats using Solr Cell ExtractRequestHandler. Data from database

Re: doing searches from within an UpdateRequestProcessor

2009-10-13 Thread Bill Au
Thanks for the info. Just want to be sure that I am on the right track before I go too deep. Bill 2009/10/12 Noble Paul നോബിള്‍ नोब्ळ् > A custom UpdateRequestProcessor is the solution. You can access the > searcher in a UpdateRequestProcessor. > > On Tue, Oct 13, 2009 at 4:20 AM, Bill Au wro

RE: XSLT Response for multivalue fields

2009-10-13 Thread Radha C.
Hi Brian, You can try this XPath in your XSL template, Best, Radha.C -Original Message- From: blholmes [mailto:blhol...@sffind.com] Sent: Tuesday, October 13, 2009 6:30 AM To: solr-user@lucene.apache.org Subject: XSLT Response

EmbeddedSolrServer - default index name

2009-10-13 Thread poeta simbolista
Hi, I'd like to know how I can get rid of the name of the index. By default it is "index", so if I specify a working directory (see code below), the index is opened in "indexdir"/index. How can I just specify the index folder? Is there a way to change the convention - my index is not called "index" s

Re: Why did occur difference in using q and fq ?

2009-10-13 Thread FUJIKAKE Takayoshi
I was reading that URL -> http://wiki.apache.org/solr/FilterQueryGuidance In "Stray bits" paragraph, "Memory consumption per filter field value is not a great concern here as the filterCache typically (perhaps always) stores only bit vectors, each bit representing a boolean to signal whether or no

Re: Why did occur difference in using q and fq ?

2009-10-13 Thread FUJIKAKE Takayoshi
I tried the query below later. http://solr:8983/solr/select/?q=*:*+AND+foo_num:0&start=0&rows=10 Does it mean the same as "?q=*:*&fq=foo_num:0"? 2009/10/13 Chantal Ackermann : > "+" is the url encoded space " " > use "+%2B" to get the string " +" (url encoded). > > e.g. > http://solr:8983/solr/select/?q=*
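
The quoted point about "+" can be checked with the standard java.net classes. This is a small sketch with made-up class and method names. It decodes the two query strings from this thread and encodes the intended query, showing that a literal plus sign has to be sent as %2B.

```java
import java.net.URLDecoder;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class PlusEncoding {
    // What the server sees after URL-decoding a raw q parameter value.
    public static String decode(String raw) {
        return URLDecoder.decode(raw, StandardCharsets.UTF_8);
    }

    // How a query string should be encoded before it is put in the URL.
    public static String encode(String query) {
        return URLEncoder.encode(query, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // "+" decodes to a space, so the required-clause operator is lost.
        System.out.println("[" + decode("*:*+foo_num:0") + "]");
        // "%2B" decodes to a literal plus sign.
        System.out.println("[" + decode("*:*+%2Bfoo_num:0") + "]");
        // Encoding the intended query yields the %2B form.
        System.out.println(encode("*:* +foo_num:0"));
    }
}
```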

Re: Why did occur difference in using q and fq ?

2009-10-13 Thread Chantal Ackermann
"+" is the url encoded space " " use "+%2B" to get the string " +" (url encoded). e.g. http://solr:8983/solr/select/?q=*:*+%2Bfoo_num:0&start=0&rows=10 FUJIKAKE Takayoshi schrieb: Hi, I try to using Solr filtering by fq below query. http://solr:8983/solr/select/?q=*:*&fq=foo_num:0&start=0&rows

Why did occur difference in using q and fq ?

2009-10-13 Thread FUJIKAKE Takayoshi
Hi, I am trying to use Solr filtering with fq in the query below. http://solr:8983/solr/select/?q=*:*&fq=foo_num:0&start=0&rows=10 This response included 'numFound="106"'. Meanwhile, filtering with q in the query below: http://solr:8983/solr/select/?q=*:*+foo_num:0&start=0&rows=10 This result response included