date:20091013

Re: Error when indexing XML files

2009-10-13 Thread Fergus McMenemie

>Hi, > >I am trying to index XML files using SolrJ. The original XML file contains >nested elements. For example, the following is the snippet of the XML file. > > > SOMETHING > SOME_OTHER_THING > > >I have added the elements "name" and "facility" in Schema.xml file to make >these e

Re: using DataImportHandler with ExtractRequestHandler ?

2009-10-13 Thread abhay kumar

Thanks Steven for the quick reply .. On Wed, Oct 14, 2009 at 1:56 AM, Steven A Rowe wrote: > See http://issues.apache.org/jira/browse/SOLR-1358 > > Steve > > > -Original Message- > > From: abhay kumar [mailto:abhay...@gmail.com] > > Sent: Tuesday, October 13, 2009 8:59 AM > > To: solr-us

Re: Dismax: Impossible to search for a _phrase_ in tokenized and untokenized fields at the same time

2009-10-13 Thread Alex Baranov

Thank you for the explanation. Let's say product_name_un is not untokenized, but it is tokenized with: and the user enters "blue car, big wheels". I want search to be applied to both fields: product_name and product_name_un with a greater boost factor for product_name_un. So that if there are

Re: Customizing solr search: SpanQueries (revisited)

2009-10-13 Thread seanoc5

I'm fairly sure I did a custom (Hit)Collector in lucene-java, but all I can find at the moment are my retro implementations (w/o collectors). I won't bore (or scare?) you with the details, but I follow some of what you're suggesting. I have been able to get straight SpanQueries to work in my cus

Re: StreamingUpdateSolrServer - indexing process stops in a couple of hours

2009-10-13 Thread Lance Norskog

Which Java release is this? There are known thread-blocking problems in Java 1.5. Also, what sockets are used during this time? Try 'netstat -s | fgrep 8983' (or your Solr URL port #) and watch the active, TIME_WAIT, CLOSE_WAIT sockets build up. This may give a hint. On Tue, Oct 13, 2009 at 8:47

Re: EmbeddedSolrServer - default index name

2009-10-13 Thread Lance Norskog

Solr wants to keep various data directories like the spellchecking index, not just the main index. The solr.data.dir option gives the location of the data/ directory, which defaults to a location under solr/. This line in solrconfig.xml uses the property: ${solr.data.dir:./solr/data} This starts the example
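
As a rough illustration of the ${name:default} form quoted above, the following Python sketch mimics how such a placeholder could resolve: the property value is used when it is set, otherwise the text after the colon. The function resolve_placeholder and the props dictionary are hypothetical and only approximate the behaviour described in the message; this is not Solr's own parser.

```python
import re

# Matches ${name} or ${name:default}; a simplified stand-in, not Solr's parser.
_PLACEHOLDER = re.compile(r"\$\{([^:}]+)(?::([^}]*))?\}")


def resolve_placeholder(text, props):
    """Replace each ${name:default} in text using props, falling back to the default."""

    def substitute(match):
        name, default = match.group(1), match.group(2)
        if name in props:
            return props[name]
        if default is not None:
            return default
        # No value and no default: report the missing property name.
        raise KeyError(name)

    return _PLACEHOLDER.sub(substitute, text)


# With no property set, the default after the colon is used.
print(resolve_placeholder("${solr.data.dir:./solr/data}", {}))
# With the property set (here to an assumed path), it overrides the default.
print(resolve_placeholder("${solr.data.dir:./solr/data}", {"solr.data.dir": "/tmp/output"}))
```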

RE: Lucene Merge Threads

2009-10-13 Thread Giovanni Fernandez-Kincade

I'm still getting the error after getting the latest from trunk and building it. This is what I added to the solrconfig.xml: 5 Any other ideas? Thanks, Gio. SEVERE: org.apache.solr.common.SolrException: Error loading class ' 5 ' at or

Re: Using mincount with date facet in Solr 1.4

2009-10-13 Thread Lance Norskog

Deleting a document leaves behind the terms, which end up with a 0-length list of documents. Facets include these terms. After you delete the documents, run an optimize and these "orphan" terms will disappear. That is why you find '1996:0' in your index. However, I don't know why you would get a '
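
Until an optimize has removed the orphan terms, a client could also hide zero-count entries itself. The Python sketch below assumes the facet counts have already been parsed into a plain dictionary; the sample values and the helper name drop_empty_facets are made up for illustration.

```python
def drop_empty_facets(counts):
    """Return only the facet entries whose count is greater than zero."""
    return {term: n for term, n in counts.items() if n > 0}


# Made-up sample data: "1996" stands for an orphan term left behind by deletes.
facets = {"1996": 0, "1997": 12, "1998": 3}
print(drop_empty_facets(facets))
```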

Re: Is negative boost possible?

2009-10-13 Thread Andrzej Bialecki

Yonik Seeley wrote: On Mon, Oct 12, 2009 at 12:03 PM, Andrzej Bialecki wrote: Solr never discarded non-positive hits, and now Lucene 2.9 no longer does either. Hmm ... The code that I pasted in my previous email uses Searcher.search(Query, int), which in turn uses search(Query, Filter, int), a

Re: solr IOException

2009-10-13 Thread Yonik Seeley

Jetty has a maximum request size for HTTP-GET... can you use POST instead? -Yonik http://www.lucidimagination.com On Tue, Oct 13, 2009 at 4:33 PM, Elaine Li wrote: > Hi, > > In my query, I have around 80 boolean clauses. I don't know if it is > because the number of boolean clauses is too big,
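
A minimal sketch of the POST suggestion, using only the Python standard library: the query parameters travel in the request body instead of the URL, so the length of the query string no longer matters for the request line. The base URL, the id field and the helper name build_select_post are assumptions for illustration; the request is only built here, not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_select_post(base_url, params):
    """Build a form-encoded POST request carrying the query parameters in the body."""
    body = urlencode(params).encode("utf-8")
    return Request(
        base_url,
        data=body,  # supplying data makes urllib issue a POST
        headers={"Content-Type": "application/x-www-form-urlencoded; charset=UTF-8"},
    )


# 80 boolean clauses on a hypothetical "id" field.
clauses = " OR ".join(f"id:{i}" for i in range(80))
request = build_select_post("http://localhost:8983/solr/select", {"q": clauses, "rows": 10})
print(request.get_method(), len(request.data))
```

Passing the request to urllib.request.urlopen would send it; that step is left out so the sketch runs without a server.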

Error when indexing XML files

2009-10-13 Thread Chaitali Gupta

Hi, I am trying to index XML files using SolrJ. The original XML file contains nested elements. For example, the following is the snippet of the XML file. SOMETHING SOME_OTHER_THING I have added the elements "name" and "facility" in Schema.xml file to make these elements inde

solr IOException

2009-10-13 Thread Elaine Li

Hi, In my query, I have around 80 boolean clauses. I don't know whether I ran into this problem because the number of boolean clauses is too large. My Solr config file actually sets the maximum number to 1024. Can anyone help? _header=[1515632954,1939520811,m=3653,g=4096,p=4096,c=4096]={saui

RE: using DataImportHandler with ExtractRequestHandler ?

2009-10-13 Thread Steven A Rowe

See http://issues.apache.org/jira/browse/SOLR-1358 Steve > -Original Message- > From: abhay kumar [mailto:abhay...@gmail.com] > Sent: Tuesday, October 13, 2009 8:59 AM > To: solr-user@lucene.apache.org; solr-user- > sc.1251278899.kmoigkhhnpcnaplolgcb- > abhayait=gmail@lucene.apache.or

Re: Boosting of words

2009-10-13 Thread Nicholas Clark

Bhaskar, Read this page, specifically how to query data. http://lucene.apache.org/solr/tutorial.html#Querying+Data It sounds like you are very new to Solr, so I would also suggest reading the wiki. http://wiki.apache.org/solr/ -Nick On Mon, Oct 12, 2009 at 10:02 PM, bhaskar chandrasekar wro

Re: Customizing solr search: SpanQueries (revisited)

2009-10-13 Thread Chris Hostetter

: (e.g. defType=fooSpanQuery), along with token positions. I have this working : in straight lucene, so my challenge is to implement it half-intelligently in : solr. At the moment, I can't figure out where and how to customize the : 'inner' search process. the first step is to really make sense o

Re: how can I use debugQuery if I have extended QParserPlugin?

2009-10-13 Thread gdeconto

my apologies, you are correct; I put the stack trace in an edit of the post and not in the original post. re version info: Solr Specification Version: 1.3.0.2009.07.08.08.05.45 Solr Implementation Version: nightly exported - yonik - 2009-07-08 08:05:45 NOTE: I have some more info on this NPE

Re: Letters with accent in query

2009-10-13 Thread Chris Hostetter

: I'm querying with an accented keyword such as "café" but the debug info : shows that it is only searching for "caf". I'm using the ISOLatin1Accent ... : http://localhost:8983/solr/select?q=%E9&debugQuery=true : : Params return shows this: : : ...that's a pretty good tip off that you

Re: Solr over DRBD

2009-10-13 Thread Pieter Steyn

> You should set a heartbeat and have the virtual IP setup for the active > instance. > So in haresources you can set like this: > node1 IPaddr::10.2.0.11 drbddisk::r0 > Filesystem::/dev/drbd0::/cluster/Solr::ext3::defaults,noatime httpd Thanks, I already have this as the server hosts apache an

Re: Tips on speeding up indexing needed...

2009-10-13 Thread William Pierce

Oops, my bad! I didn't realize that by changing the subject line I was still "part" of the thread whose subject I changed! Sorry folks! Thanks, Hoss, for pointing this out! - Bill -- From: "Chris Hostetter" Sent: Tuesday, October 13, 2009 11:

RE: Solr over DRBD

2009-10-13 Thread Francis Yakin

You should set a heartbeat and have the virtual IP set up for the active instance. So in haresources you can set it like this: node1 IPaddr::10.2.0.11 drbddisk::r0 Filesystem::/dev/drbd0::/cluster/Solr::ext3::defaults,noatime httpd Are you running an active/active cluster or active/passive? Francis

Re: Tips on speeding up indexing needed...

2009-10-13 Thread Chris Hostetter

: References: <4acb30d2.2010...@umich.edu> : <69de18140910070109m27e50d2sc82a7c7bdd683...@mail.gmail.com> : <4acc95a3.5000...@umich.edu> : : <4acfc943.4040...@umich.edu> : In-Reply-To: <4acfc943.4040...@umich.edu> : Subject: Tips on speeding up indexing needed... http://people.ap

Re: Dynamically compute document scores...

2009-10-13 Thread Chris Hostetter

: References: <4acb30d2.2010...@umich.edu> : <69de18140910070109m27e50d2sc82a7c7bdd683...@mail.gmail.com> : <4acc95a3.5000...@umich.edu> : : <4acfc943.4040...@umich.edu> : : <87c998320910101958r589c3f3dw2feedb16b8c3f...@mail.gmail.com> : : In-Reply-To: : Subject: D

Re: Dismax: Impossible to search for a _phrase_ in tokenized and untokenized fields at the same time

2009-10-13 Thread Chris Hostetter

: It seems to me that there is no way I can use the dismax handler for : searching in both tokenized and untokenized fields while I'm searching for a : phrase. The typical way of dealing with this is to put the untokenized fields in the "pf" param and have copyFields that create variants of thos
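
One way to picture the suggestion is as a set of request parameters: the tokenized field goes in qf and the untokenized copy goes in pf with a higher boost. The Python sketch below only builds the query string. The field names are taken from the earlier message in this thread, while the boost value and the use of defType=dismax are assumptions for illustration.

```python
from urllib.parse import urlencode

# Assumed setup: product_name is tokenized, product_name_un is its untokenized copy.
params = {
    "defType": "dismax",
    "q": "blue car, big wheels",
    "qf": "product_name",         # per-term matching on the tokenized field
    "pf": "product_name_un^10",   # phrase boost on the untokenized copy (boost value is made up)
}

query_string = urlencode(params)
print(query_string)
```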

Re: SolrJ - open existing index in EmbeddedSolrServer - query mode

2009-10-13 Thread Chris Hostetter

: This code is called passing an existing index, say /tmp/output : But this code, instead of opening an existing index and read it and make : queries against it, it just creates a new one, : in /tmp/output/index SolrCore doesn't take in the exact path of the index -- it takes in a "dataDir" ...

Re: multi-word synonyms and analysis.jsp vs real field analysis (query, index)

2009-10-13 Thread Chris Hostetter

: The cause of my problem should be the query parsing, but I don't know, : if there is any solution for it. I need a possibility that works like : the analysis/query parsing within /admin/analysis.jsp view. The behavior you are describing is very well documented on the wiki... http://wiki.apache.

Re: how can I use debugQuery if I have extended QParserPlugin?

2009-10-13 Thread Chris Hostetter

: My original post ( : http://www.nabble.com/how-can-I-use-debugQuery-if-I-have-extended-QParserPlugin--tt25789546.html : ) has the stack trace. =^D Actually, no... your original post didn't inc

Re: Using mincount with date facet in Solr 1.4

2009-10-13 Thread rperkowska

Hi, in my ATs I'm observing a similar thing happening from time to time - in a search that I perform I can see facet results that contain '0' both as keys and values. But there is no possibility that zeros are valid values. When I do a normal search, I don't see any indexed fields with zeros. Wh

Re: Passing request to another handler

2009-10-13 Thread Andrzej Bialecki

Chris Hostetter wrote: : What's the canonical way to pass an update request to another handler? I'm : implementing a handler that has to dispatch its result to different update : handlers based on its internal processing. I've always written my delegating RequestHandlers so that they take in the

RE: Lucene Merge Threads

2009-10-13 Thread Giovanni Fernandez-Kincade

Will do. Thanks! -Original Message- From: Shalin Shekhar Mangar [mailto:shalinman...@gmail.com] Sent: Tuesday, October 13, 2009 11:48 AM To: solr-user@lucene.apache.org Subject: Re: Lucene Merge Threads On Tue, Oct 13, 2009 at 8:19 PM, Giovanni Fernandez-Kincade < gfernandez-kinc...@capi

Re: Passing request to another handler

2009-10-13 Thread Chris Hostetter

: What's the canonical way to pass an update request to another handler? I'm : implementing a handler that has to dispatch its result to different update : handlers based on its internal processing. I've always written my delegating RequestHandlers so that they take in the names (or paths) of th

Re: search by some functionality

2009-10-13 Thread Chris Hostetter

Actually, I forgot that the new frange query parser can let you filter on the output of a function... http://www.lucidimagination.com/blog/tag/frange/ : : Maybe I'm missing something, but function queries aren't involved in : : determining whether a document matches or not, only its score. How

Re: Dynamically compute document scores...

2009-10-13 Thread Avlesh Singh

Options - 1. Can you pre-compute your "business logic" score at index time? If yes, then this value can be stored in some field and you can use function queries to use this data plus the score to return a value which you can sort upon. 2. Take a look at - http://lucene.apache.org/j

Re: Lucene Merge Threads

2009-10-13 Thread Shalin Shekhar Mangar

On Tue, Oct 13, 2009 at 8:19 PM, Giovanni Fernandez-Kincade < gfernandez-kinc...@capitaliq.com> wrote: > Here's the version information from the admin page: > > Solr Specification Version: 1.3.0.2009.07.28.18.51.06 > Solr Implementation Version: 1.4-dev ${svnversion} - gkincade - 2009-07-28 > 18:5

StreamingUpdateSolrServer - indexing process stops in a couple of hours

2009-10-13 Thread Dadasheva, Olga

Hi, I am indexing documents using StreamingUpdateSolrServer. My 'setup' code is almost a copy of the junit test of the Solr trunk. try { StreamingUpdateSolrServer streamingServer = new StreamingUpdateSolrServer( url, 2, 5 ) { @Overri

Dynamically compute document scores...

2009-10-13 Thread William Pierce

Folks: During query time, I want to dynamically compute a document score as follows: a) Take the SOLR score for the document -- call it S. b) Lookup the "business logic" score for this document. Call it L. c) Compute a new score T = func(S, L) d) Return the documents sorted by T. I h
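
Steps a) to d) can be sketched on the client side as follows. This is only an illustration in Python under stated assumptions: the combining function (a weighted sum), its weights, and the names combine and rerank are hypothetical, and the hits are assumed to arrive as (document id, Solr score) pairs.

```python
def combine(solr_score, business_score):
    """Hypothetical func(S, L): a weighted sum chosen only for illustration."""
    return 0.7 * solr_score + 0.3 * business_score


def rerank(hits, business_scores):
    """Sort (doc_id, solr_score) pairs by the combined score T, highest first."""
    scored = [
        (combine(score, business_scores.get(doc_id, 0.0)), doc_id)
        for doc_id, score in hits
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc_id for _, doc_id in scored]


# Made-up sample data: S from the search engine, L from a business-logic lookup.
hits = [("a", 1.0), ("b", 0.8), ("c", 0.5)]
business = {"a": 0.0, "b": 1.0, "c": 2.0}
print(rerank(hits, business))
```

Re-ranking on the client only reorders the documents that were fetched, so it works best when the fetched page is large enough to contain every document that could end up near the top.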

RE: Lucene Merge Threads

2009-10-13 Thread Giovanni Fernandez-Kincade

Here's the version information from the admin page: Solr Specification Version: 1.3.0.2009.07.28.18.51.06 Solr Implementation Version: 1.4-dev ${svnversion} - gkincade - 2009-07-28 18:51:06 Lucene Specification Version: 2.9-dev Lucene Implementation Version: 2.9-dev 794238 - 2009-07-15 18:05:08

using DataImportHandler with ExtractRequestHandler ?

2009-10-13 Thread abhay kumar

Hi, We are using solr-1.4 for our search module. We have a long schema (35 fields); some field values come from a database and one field's value comes from different file formats. We are able to index different file formats using Solr Cell ExtractRequestHandler . Data from database

Re: doing searches from within an UpdateRequestProcessor

2009-10-13 Thread Bill Au

Thanks for the info. Just want to be sure that I am on the right track before I go too deep. Bill 2009/10/12 Noble Paul നോബിള്‍ नोब्ळ् > A custom UpdateRequestProcessor is the solution. You can access the > searcher in a UpdateRequestProcessor. > > On Tue, Oct 13, 2009 at 4:20 AM, Bill Au wro

RE: XSLT Response for multivalue fields

2009-10-13 Thread Radha C.

Hi Brain, You can try this XPath in your XSL template, Best, Radha.C -Original Message- From: blholmes [mailto:blhol...@sffind.com] Sent: Tuesday, October 13, 2009 6:30 AM To: solr-user@lucene.apache.org Subject: XSLT Response

EmbeddedSolrServer - default index name

2009-10-13 Thread poeta simbolista

Hi, I'd like to know how I can get rid of the name of the index. By default it is "index", so if I specify a working directory (see code below), the index is opened in "indexdir"/index. How can I just specify the index folder? Is there a way to change the convention - my index is not called "index" s

Re: Why did occur difference in using q and fq ?

2009-10-13 Thread FUJIKAKE Takayoshi

I was reading that URL -> http://wiki.apache.org/solr/FilterQueryGuidance In "Stray bits" paragraph, "Memory consumption per filter field value is not a great concern here as the filterCache typically (perhaps always) stores only bit vectors, each bit representing a boolean to signal whether or no

Re: Why did occur difference in using q and fq ?

2009-10-13 Thread FUJIKAKE Takayoshi

I later tried the query below. http://solr:8983/solr/select/?q=*:*+AND+foo_num:0&start=0&rows=10 Does it mean the same as "?q=*:*&fq=foo_num:0"? 2009/10/13 Chantal Ackermann : > "+" is the url encoded space " " > use "+%2B" to get the string " +" (url encoded). > > e.g. > http://solr:8983/solr/select/?q=*
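
The encoding point quoted above can be checked with Python's standard library: in a query string a raw "+" decodes to a space, while "%2B" decodes to a literal "+". The helper name encode_query is hypothetical, and "*" and ":" are kept unescaped only so the output matches the URLs in this thread.

```python
from urllib.parse import quote_plus, unquote_plus


def encode_query(q):
    """Form-encode a query value, leaving '*' and ':' readable as in the thread's URLs."""
    return quote_plus(q, safe="*:")


# A raw "+" in the URL reaches the query parser as a space.
print(unquote_plus("*:*+foo_num:0"))
# "%2B" reaches the query parser as a literal "+" (the "required" operator).
print(unquote_plus("*:*+%2Bfoo_num:0"))
# Encoding the intended query string gives the URL form suggested in the reply.
print(encode_query("*:* +foo_num:0"))
```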

Re: Why did occur difference in using q and fq ?

2009-10-13 Thread Chantal Ackermann

"+" is the url encoded space " " use "+%2B" to get the string " +" (url encoded). e.g. http://solr:8983/solr/select/?q=*:*+%2Bfoo_num:0&start=0&rows=10 FUJIKAKE Takayoshi schrieb: Hi, I try to using Solr filtering by fq below query. http://solr:8983/solr/select/?q=*:*&fq=foo_num:0&start=0&rows

Why did occur difference in using q and fq ?

2009-10-13 Thread FUJIKAKE Takayoshi

Hi, I am trying to filter in Solr using fq with the query below. http://solr:8983/solr/select/?q=*:*&fq=foo_num:0&start=0&rows=10 This result response included 'numFound="106"' . Meanwhile, filtering with q using the query below: http://solr:8983/solr/select/?q=*:*+foo_num:0&start=0&rows=10 This result response included

44 matches
