Field tokenizer question

2009-03-18 Thread Ashish P
I have created a field, The pattern is "_" (Underscore) When I do field analysis using solr admin, it shows it correctly. Have a look at attached image. e.g. cric_info http://www.nabble.com/file/p22594575/field%2Banal

Problem with Facet Date Query

2009-03-18 Thread dabboo
Hi, I want to use date field with facet query. This is my query: q=productPublicationDate_product_dt:[*%20TO%20NOW]&facet=true&facet.field=productPublicationDate_product_dt:[*%20TO%20NOW]&qt=dismaxrequest This is exception, I am facing after running this query. - org.apache.solr.common

Re: Question about incremental index update

2009-03-18 Thread Shalin Shekhar Mangar
On Thu, Mar 19, 2009 at 2:14 AM, Huang, Zijian(Victor) < zijian.hu...@etrade.com> wrote: > >I mean the document ID in Slor xml doc format. Inside the Solr wiki, > it tells me that I can update a particular doc by its ID if I assigned > one previously. I am thinking if using the url as the doc

Re: More replication questions

2009-03-18 Thread Noble Paul നോബിള്‍ नोब्ळ्
it depends on a few things. 1) no:of docs added 2) is the index optimized 3) autowarming if the no:of docs added are few and the index is not optimized , the replication will be will be done in milliseconds (the changed files will be small). If there is no autoWarming , there should be no delay in

Re: Get delta-query to work

2009-03-18 Thread Noble Paul നോബിള്‍ नोब्ळ्
I am assuming that you are using a recent version of DIH. I see some discrepency in the queries SELECT Sub0.SUBID ... is the deltaQuery and the join is done using Sub0.SUBID =${dataimporter.delta.SUBID}" in deltaImportQuery try making the first query as SELECT Sub0.SUBID as SUBID or making the

RE: More replication questions

2009-03-18 Thread Vauthrin, Laurent
Thanks for the responses. If we used a poll interval of one second (for 1.4), wouldn't we still have to wait for the replication to finish? In that case, couldn't it take minutes (depending on index size) to get that data on the slave? Or would there be a lot less data to pull down because of

multicore solrconfig issues

2009-03-18 Thread Audrey Foo
Hi I am using most recent drupal apachesolr module with solr 1.4 nightly build * solrconfig.xml ==> http://cvs.drupal.org/viewvc.py/drupal/contributions/modules/apachesolr/solrconfig.xml?revision=1.1.2.15&view=markup&pathrev=DRUPAL-6--1-0-BETA5 * schema.xml ==> http://cvs.drupal.org/viewvc.py/dru

Re: spellchecker: returning results even with misspelt words

2009-03-18 Thread Grant Ingersoll
Unfortunately, collate doesn't verify that the collated result actually results in hits. So, it is likely that each term returns results, but that doesn't mean the collation does. We probably should add to the SpellCheckComponent to have an option to check to see if the collation is going

which parameter fire optimize

2009-03-18 Thread sunnyfr
Hi I've in my log optimize=true after a commit but I didnt allow it in my solrconfig ??? /data/solr/video/bin/snapshooter /data/solr/video/bin -c true Do you have an idea where it comes from?? Thanks a lot, -- View this message in context: htt

Re: Null pointer exception on use of ImportDataHandler (useSolrAddSchema="true")

2009-03-18 Thread Sam Keen
that worked perfectly Shalin. thanks so much for your help! sam keen On Wed, Mar 18, 2009 at 1:15 PM, Shalin Shekhar Mangar wrote: > On Thu, Mar 19, 2009 at 1:29 AM, Sam Keen wrote: > >> >> What I am now attempting to do is leverage 'useSolrAddSchema="true"' . >> I have a URL the responds wit

Re: optimize after a commit don't know why?

2009-03-18 Thread sunnyfr
Maybe I miss something in solrconfig.xml ??? sunnyfr wrote: > > Hi > > I've a little problem with optimization which is very interesting but > juste one time per day otherwise replication take ages to bring back index > hard link. > > So my cron is every 30mn : > /solr/user/dataimport?command

Re: spellchecker: returning results even with misspelt words

2009-03-18 Thread Ingo Renner
Am 18.03.2009 um 21:27 schrieb Narayanan, Karthikeyan: Shyam, I tried using spellcheck.collate=true, it doesn't return results with correct word. Do I need to make any other settings?. doesn't work here either Ingo -- Ingo Renner TYPO3 Core Developer, Release Manager TYPO3 4.

Re: NPE in MultiSegmentReader$MultiTermDocs.doc

2009-03-18 Thread Comron Sattari
Because I need to filter on (possibly) more than 1024 terms and using a query to do it just wouldn't work. Comron Sattari On Wed, Mar 18, 2009 at 1:30 PM, David Smiley @MITRE.org wrote: > > Although I'm not answering your question (others have), why are you even > doing this at all with Solr wh

RE: Question about incremental index update

2009-03-18 Thread Huang, Zijian(Victor)
Hi, Otis: so does Solr already has some kind of libraries build-in, which it can automatically detect the different within two set of crawled documents and update the index to the newer one? I mean the document ID in Slor xml doc format. Inside the Solr wiki, it tells me that I can update a

Re: NPE in MultiSegmentReader$MultiTermDocs.doc

2009-03-18 Thread David Smiley @MITRE.org
Although I'm not answering your question (others have), why are you even doing this at all with Solr when you could take advantage of Solr's filter queries (fq param)? ~ David Smiley Comron Sattari-3 wrote: > > I've recently upgraded to Solr 1.3 using Lucene 2.4. One of the reasons I > upgraded

RE: spellchecker: returning results even with misspelt words

2009-03-18 Thread Narayanan, Karthikeyan
Shyam, I tried using spellcheck.collate=true, it doesn't return results with correct word. Do I need to make any other settings?. Thanks. Karthik -Original Message- From: Shyamsunder Reddy [mailto:sjh...@yahoo.

Re: Null pointer exception on use of ImportDataHandler (useSolrAddSchema="true")

2009-03-18 Thread Shalin Shekhar Mangar
On Thu, Mar 19, 2009 at 1:29 AM, Sam Keen wrote: > > What I am now attempting to do is leverage 'useSolrAddSchema="true"' . > I have a URL the responds with a well formatted solr add xml (I'm able > to add it by POSTing). But when I try to add it using > http://localhost:8983/solr/dataimport?com

Null pointer exception on use of ImportDataHandler (useSolrAddSchema="true")

2009-03-18 Thread Sam Keen
I'm attempting to use and XML/HTTP datasource [http://wiki.apache.org/solr/DataImportHandler#head-13ffe3a5e6ac22f08e063ad3315f5e7dda279bd4] I went through the RSS example in apache-solr-1.3.0/example/example-DIH and that all worked for me. What I am now attempting to do is leverage 'useSolrAddSche

Re: Solr multiple indexes

2009-03-18 Thread Otis Gospodnetic
Giovanni, It sounds like you are after a JOIN between two indices a la RDBMS JOIN? It's not possible with Solr, unless you want to do separate queries and manually join. If you are talking about merging multiple indices of the same type into a single index, that's a different story and doabl

Re: Question about incremental index update

2009-03-18 Thread Otis Gospodnetic
Victor, Daily updates (or hourly or more frequent) are not going to be a problem. I don't follow your question about document ID and using URL. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: "Huang, Zijian(Victor)" > To: solr-user@luc

Re: NPE in MultiSegmentReader$MultiTermDocs.doc

2009-03-18 Thread Comron Sattari
I can try, for now I just decided to use Lucene's TermsFilter which does the job perfectly. If I have some spare time I'll put together a unit test to show the problem. Thanks. On Wed, Mar 18, 2009 at 12:23 PM, Grant Ingersoll wrote: > Can you isolate this down to just a simple unit test? > > >

Re: NPE in MultiSegmentReader$MultiTermDocs.doc

2009-03-18 Thread Grant Ingersoll
Can you isolate this down to just a simple unit test? On Mar 17, 2009, at 6:52 PM, Comron Sattari wrote: I've recently upgraded to Solr 1.3 using Lucene 2.4. One of the reasons I upgraded was because of the nicer SearchComponent architecture that let me add a needed feature to the default re

Question about incremental index update

2009-03-18 Thread Huang, Zijian(Victor)
Hi: Is it easy to do daily incremental index update in Solr assuming the index is around 1G? In terms of giving a document an ID to facilitate index update, is it using the URL a good way to do so? Thanks Victor

Re: Solr SpellCheker configuration for multiple fields same time

2009-03-18 Thread Shalin Shekhar Mangar
Yes, approach #2 will certainly be useful. I'll open an issue. On Wed, Mar 18, 2009 at 6:20 PM, Grant Ingersoll wrote: > Hmm, I don't think there is currently a solution for this. #1 is not > viable for the reasons you mentioned and #2 is not supported by the current > code. That being said, I

Re: NPE creating EmbeddedSolrServer

2009-03-18 Thread Alexandre Rafalovitch
To reply to my own message. The following worked starting from scratch (example): SolrConfig solrConfig = new SolrConfig( "D:\\Projects\\FutureTerm\\apache-solr-1.3.0\\futu

Re: Compound word search (maybe DisMaxQueryPaser problem)

2009-03-18 Thread Chris Hostetter
: Many thanks for your explanation. That really helped me a lot in understanding : DisMax - and finally I realized that DisMax is not at all what I need. : Actually I do not want results where "blue" is in one field and "tooth" in : another (imagine you search for a notebook with blue tooth and ge

Solr multiple indexes

2009-03-18 Thread Giovanni De Stefano
Hello all, here I am with another question :-) I have to index the content of two different tables on an Oracle DB. When it comes to only one table, everything is fine: one datasource, one document, one entity in data-config, one uniqueKey in schema.xml etc. It works great. But now I have on th

Get delta-query to work

2009-03-18 Thread Rui Pereira
I have the following root entity: I get results when running the deltaQuery manually, but Solr doesn't import anything!!! What am I doing wrong?! Thanks in advance, Rui Pereira

Re: optimize an index as fast as possible

2009-03-18 Thread Marc Sturlese
Thanks Mark, going to try now... markrmiller wrote: > > Hmm - > > Have you tested search speed (without optimizing) using a merge factor > of 2? If the speed is acceptable (should be much faster than MF:10), try > a merge factor of 3. Using a merge factor of 2 or 3 and never optimizing > sho

Re: optimize an index as fast as possible

2009-03-18 Thread Mark Miller
Hmm - Have you tested search speed (without optimizing) using a merge factor of 2? If the speed is acceptable (should be much faster than MF:10), try a merge factor of 3. Using a merge factor of 2 or 3 and never optimizing should keep searches relatively fast, but also leave a lot of the index

Re: Solr SpellCheker configuration for multiple fields same time

2009-03-18 Thread Grant Ingersoll
Hmm, I don't think there is currently a solution for this. #1 is not viable for the reasons you mentioned and #2 is not supported by the current code. That being said, I think it wouldn't be too hard to for someone to work up a patch for this. Essentially, we need the ability to add in p

Re: Problem encoding ':' char in a solr query

2009-03-18 Thread Erik Hatcher
With SolrJ, you can use ClientUtils.escapeQueryChars(str) Erik On Mar 18, 2009, at 7:51 AM, Toby Cole wrote: You'll need to escape the colon with a backslash, e.g. fileAbsolutePath:file\:///Volumes/spare/ts/ford/schema/data/news/ fdw2008/jn71796.xml see the lucene query parser synt

Re: Problem encoding ':' char in a solr query

2009-03-18 Thread Toby Cole
You'll need to escape the colon with a backslash, e.g. fileAbsolutePath:file\:///Volumes/spare/ts/ford/schema/data/news/ fdw2008/jn71796.xml see the lucene query parser syntax page: http://lucene.apache.org/java/2_3_2/queryparsersyntax.html#Escaping%20Special%20Characters Toby. On 1

solrj : probleme with utf-8 content

2009-03-18 Thread Walid ABDELKABIR
when executing this code I got in my index the field "includes" with this value : "? ? ?" : --- String content ="eaiou with circumflexes: êâîôû"; SolrInputDocument doc = new SolrInputDocument(); doc.addField( "id", "123", 1.0f ); doc.addField( "inclu

Problem encoding ':' char in a solr query

2009-03-18 Thread Fergus McMenemie
Hello I have a solr field:- which an unrelated query reveals is populated with:- file:///Volumes/spare/ts/ford/schema/data/news/fdw2008/jn71796.xml however when I try and query for that exact document explicitly:- http://localhost:8080/apache-solr-1.4-dev/select?q=fileAbsolutePath:fil

Special character indexing

2009-03-18 Thread Gargate, Siddharth
Hi all, I am trying to index words containing special characters like 'Räikkönen'. Using EmbeddedSolrServer indexing is working fine, but if I use CommonHttpSolrServer then it is indexing garbage values. I am using Solr 1.4 and set URLEcoding as UTF-8 in tomcat. Is this a known issue or am I doi

Re: Solr: delta-import, help needed

2009-03-18 Thread Giovanni De Stefano
Hello Paul, thank you for your feedback. I will ask to add an expiration date to the DB and run a process that updates the index accordingly. Cheers, Giovanni On 3/18/09, Noble Paul നോബിള്‍ नोब्ळ् wrote: > > it is not possible to query details from Solr and find out deleted > items using DIH >

optimize after a commit don't know why?

2009-03-18 Thread sunnyfr
Hi I've a little problem with optimization which is very interesting but juste one time per day otherwise replication take ages to bring back index hard link. So my cron is every 30mn : /solr/user/dataimport?command=delta-import&optimize=false&commit=false otherwise i've cron for optimizing ever

Re: Compound word search (maybe DisMaxQueryPaser problem)

2009-03-18 Thread Tobias Dittrich
Many thanks for your explanation. That really helped me a lot in understanding DisMax - and finally I realized that DisMax is not at all what I need. Actually I do not want results where "blue" is in one field and "tooth" in another (imagine you search for a notebook with blue tooth and get so

Re: Index Creation Exception in solr

2009-03-18 Thread Toby Cole
If you're using a recent 1.4-snapshot you should be able to do a rollback: https://issues.apache.org/jira/browse/SOLR-670 Otherwise, if you have unique IDs in your index, you can just post new documents over the top of the old ones then commit. Toby. On 18 Mar 2009, at 10:19, dabboo wrote:

Re: Index Creation Exception in solr

2009-03-18 Thread dabboo
But if I already have some indexes in the index folder then these old indexes will also get deleted. Is there any way to roll back the operation. Shalin Shekhar Mangar wrote: > > On Wed, Mar 18, 2009 at 3:15 PM, dabboo wrote: > >> >> Hi, >> >> I am creating indexes in Solr and facing an unu

Re: Solr: delta-import, help needed

2009-03-18 Thread Noble Paul നോബിള്‍ नोब्ळ्
it is not possible to query details from Solr and find out deleted items using DIH you must maintain a deleted rows ids in the db or just flag them as deleted. --Noble On Wed, Mar 18, 2009 at 2:46 PM, Giovanni De Stefano wrote: > Hello Paul, > > thank you for your reply. > > The UPDATE in fac

Re: Index Creation Exception in solr

2009-03-18 Thread Shalin Shekhar Mangar
On Wed, Mar 18, 2009 at 3:15 PM, dabboo wrote: > > Hi, > > I am creating indexes in Solr and facing an unusual issue. > > I am creating 5 indexes and xml file of 4th index is malformed. So, while > creating indexes it properly submits index #1, 2 & 3 and throws exception > after submission of ind

Index Creation Exception in solr

2009-03-18 Thread dabboo
Hi, I am creating indexes in Solr and facing an unusual issue. I am creating 5 indexes and xml file of 4th index is malformed. So, while creating indexes it properly submits index #1, 2 & 3 and throws exception after submission of index 4. Now, if I look for index #1,2 & 3, it doesnt show up,

Re: Solr: delta-import, help needed

2009-03-18 Thread Giovanni De Stefano
Hello Paul, thank you for your reply. The UPDATE in fact works fine: I only had to update the CREATION_TIME on the DB :-) Regarding the deletedPkQuery, I understand it has to return the primary keys that should be removed from the index (because they have been removed from the DB) but I don't ha