Re: Is this DIH entity forEach expression OK? ... yes

2009-02-13 Thread Fergus McMenemie
Hello, I am having bother with forEach. I have XML source documents containing many embedded images within mediaBlock elements. Each image has a an associated caption. I want to implement a separate image search function which searches the captions and brings back the associated image. entity

Re: spellcheck.onlyMorePopular

2009-02-13 Thread Marcus Stratmann
Grant Ingersoll wrote: I believe the reason is b/c when onlyMP is false, if the word itself is already in the index, it short circuits out. When onlyMP is true, it checks to see if there are more frequently occurring variations. This would mean that onlyMorePopular=false isn't useful at all.

abbreviation problem

2009-02-13 Thread 李学健
hi, all to abbreviation, for example, 'US', how can i get results containing 'United States' in solr or lucene? in solr, synonyms filter, it seems only to handle one-word to one-word. but in abbreviation queries, words should be expanded. any body has a goods solution to that ? --steven.li

commit error which kill my dataimport.properties file

2009-02-13 Thread sunnyfr
Hi, Last night I've got an error during the importation and I don't get what does that mean and it even kill my dataimport.properties (empty file), so nothing was write in this file then the delta-import, started to import from the very start I guess. Thanks a lot for your help, I wish you

Re: commit error which kill my dataimport.properties file

2009-02-13 Thread sunnyfr
It's actually the space, sorry. But yes my snapshot looks huge around 3G every 20mn, so should I clean them up more often like every 4hours?? sunnyfr wrote: Hi, Last night I've got an error during the importation and I don't get what does that mean and it even kill my

Re: spellcheck.onlyMorePopular

2009-02-13 Thread Marcus Stratmann
Shalin Shekhar Mangar wrote: The end goal is to give spelling suggestions. Even if it gave less frequently occurring spelling suggestions, what would you do with it? To give you an example: We have an index for computer games. One title is gran turismo. The word gran is less frequent in the

Re: several snapshot ...

2009-02-13 Thread sunnyfr
Hi Hoss, Thanks a lot for your clear answer. It's very clear. Thanks hossman wrote: : I would like to get how is a snapshot really. It's obviously a hard link to : the files. : But it just contain the last update ?? the nature of lucene indexes is that files are never modified --

Re: abbreviation problem

2009-02-13 Thread Koji Sekiguchi
李学健 wrote: hi, all to abbreviation, for example, 'US', how can i get results containing 'United States' in solr or lucene? in solr, synonyms filter, it seems only to handle one-word to one-word. but in abbreviation queries, words should be expanded. SynonymFilter should support one word

Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Fergus McMenemie
Hello, templatetransformer behaves rather ungracefully if one of the replacement fields is missing. I am parsing a single XML document into multiple separate solr documents. It turns out that none of the source documents fields can be used to create a uniqueKey alone. I need to combine two,

Re: spellcheck.onlyMorePopular

2009-02-13 Thread Shalin Shekhar Mangar
On Fri, Feb 13, 2009 at 2:51 PM, Marcus Stratmann stratm...@gmx.de wrote: Shalin Shekhar Mangar wrote: The end goal is to give spelling suggestions. Even if it gave less frequently occurring spelling suggestions, what would you do with it? To give you an example: We have an index for

Re: Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
the intent was to not to make a partial string if some of the variable are missing probably we can enhance TemplateTransformer by using an extra attribute on the field field column=id template=${jc.fileAbsolutePath}${x.vurl} ignoreMissingVariables=true/ then it can just resolve with whatever

facet count on partial results

2009-02-13 Thread Karl Wettin
Hi Solr, I pass a rather large amount of OR clauses to Solr, ending up with lots and lots of results. It's however only the results above a certain score threadshold that is interesting for me, thus I'd like to only get facet count of the results within the threadshold. How can I do

Re: spellcheck.onlyMorePopular

2009-02-13 Thread Marcus Stratmann
Shalin Shekhar Mangar wrote: And to come back to my last question: There seems to be no case in which onlyMorePopular=false makes sense (provided Grant's assumption is correct). Do you see one? Here's a use-case -- you provide a mis-spelled word and you want the closest suggestion by edit

Re: spellcheck.onlyMorePopular

2009-02-13 Thread Shalin Shekhar Mangar
On Fri, Feb 13, 2009 at 5:05 PM, Marcus Stratmann stratm...@gmx.de wrote: Hm, when I try searching for grand using onlyMorePopular=false I do not get any results. Same when trying gran. It seems that there will be no results at all when using onlyMorePopular=false. When onlyMorePopular is

Trouble with solr IndexbasedSpellChecker and FilebasedSpellChecker

2009-02-13 Thread Kartik Desikan
Hi folks, I'm using solr 1.3 Here is the relevant section from my solrconfig.xml searchComponent name=spellcheck class=solr.SpellCheckComponent !-- str name=queryAnalyzerFieldTypetextSpell/str -- lst name=spellchecker str name=namedefault/str

Get # of docs pending commit

2009-02-13 Thread Jacob Singh
Hi, Is there a way to retrieve the # of documents which are pending commit (when using autocommit)? Thanks, Jacob -- +1 510 277-0891 (o) +91 33 7458 (m) web: http://pajamadesign.com Skype: pajamadesign Yahoo: jacobsingh AIM: jacobsingh gTalk: jacobsi...@gmail.com

Re: Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Fergus McMenemie
Hello, templatetransformer behaves rather ungracefully if one of the replacement fields is missing. Looking at TemplateString.java I see that left to itself fillTokens would replace a missing variable with . It is an extra check in TemplateTransformer that is throwing the warning and stopping

Re: Get # of docs pending commit

2009-02-13 Thread Koji Sekiguchi
Jacob, Regardless of you are using autocommit or manul commit, look at Admin statistics Update Handlers status docsPending. Koji Jacob Singh wrote: Hi, Is there a way to retrieve the # of documents which are pending commit (when using autocommit)? Thanks, Jacob

delete snapshot??

2009-02-13 Thread sunnyfr
root 26834 16.2 0.0 19412 824 ?S16:05 0:08 rsync -Wa --delete rsync://##.##.##.##:18180/solr/snapshot.20090213160051/ /data/solr/books/data/snapshot.20090213160051-wip Hi obviously it can't delete them because the adress is bad it shouldnt be :

Re: spellcheck.onlyMorePopular

2009-02-13 Thread Marcus Stratmann
Shalin Shekhar Mangar wrote: If onlyMorePopular=true, then the algorithm finds tokens which have greater frequency than the searched term. Among these terms, the one which is closest (by edit distance) is returned. Okay, this is a bit weird, but I think I got it now. Let me try to explain it

Re: Get # of docs pending commit

2009-02-13 Thread Jacob Singh
Hi Koji, Thanks, but I'm trying to get it via a web service, not via the admin interface. Best, Jacob On Fri, Feb 13, 2009 at 8:20 PM, Koji Sekiguchi k...@r.email.ne.jp wrote: Jacob, Regardless of you are using autocommit or manul commit, look at Admin statistics Update Handlers status

Re: Get # of docs pending commit

2009-02-13 Thread Shalin Shekhar Mangar
Jacob, the output of stats.jsp is an XML which you can consume in your program. It is transformed to html using XSL. On Fri, Feb 13, 2009 at 9:09 PM, Jacob Singh jacobsi...@gmail.com wrote: Hi Koji, Thanks, but I'm trying to get it via a web service, not via the admin interface. Best,

Re: Get # of docs pending commit

2009-02-13 Thread Jacob Singh
*Jacob Singh feels dumb* Thanks! On Fri, Feb 13, 2009 at 9:14 PM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: Jacob, the output of stats.jsp is an XML which you can consume in your program. It is transformed to html using XSL. On Fri, Feb 13, 2009 at 9:09 PM, Jacob Singh

Re: spellcheck.onlyMorePopular

2009-02-13 Thread Walter Underwood
Fuzzy search should match grand turismo to gran turismo without using spelling suggestions. At Netflix, the first hit for the query grand turismo is the movie Gran Torino and we use fuzzy with Solr. wunder On 2/13/09 3:35 AM, Marcus Stratmann stratm...@gmx.de wrote: Shalin Shekhar Mangar

Re: Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Fergus McMenemie
Paul, Following up your usenet sussgetion: field column=id template=${jc.fileAbsolutePath}${x.vurl} ignoreMissingVariables=true/ and to add more to what I was thinking... if the field is undefined in the input document, but the schema.xml does allow a default value, then TemplateTransformer

Re: spellcheck.onlyMorePopular

2009-02-13 Thread Shalin Shekhar Mangar
On Fri, Feb 13, 2009 at 8:46 PM, Marcus Stratmann stratm...@gmx.de wrote: Okay, this is a bit weird, but I think I got it now. Let me try to explain it using my example. When I search for gran (frequency 10) I get the suggestion grand (frequency 17) when using onlyMorePopular=true. When I

Re: Get # of docs pending commit

2009-02-13 Thread Erik Hatcher
Jacob - note that the results from stats.jsp come back in XML format - which could be used programmatically from a client. Unfortunately the JSP pages don't follow the wt (writer type) parameter that standard request handlers use, but at least it's structured data and not HTML to be

Re: Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Erik Hatcher
What about having the template transformer support ${field:default} syntax? I'm assuming it doesn't support that currently right? The replace stuff in the config files does though. Erik On Feb 13, 2009, at 8:17 AM, Fergus McMenemie wrote: Paul, Following up your usenet

Wildcard query case problem

2009-02-13 Thread Alexander Ramos Jardim
Hey guys, I getting problems making wildcard query in the form nameSort:Arlin*. If I do such a query, I get 0 results, but when I do nameSort:arlin* I get 310 results from my index. Are wildcard queries case sensitive? This is the searched field config. fieldType name=string_lc

Re: Wildcard query case problem

2009-02-13 Thread Marc Sturlese
Are you using the same analyzer to queue and index? zayhen wrote: Hey guys, I getting problems making wildcard query in the form nameSort:Arlin*. If I do such a query, I get 0 results, but when I do nameSort:arlin* I get 310 results from my index. Are wildcard queries case sensitive?

Re: Wildcard query case problem

2009-02-13 Thread Erick Erickson
From a post in the archives: Wildcard searches are case-sensitive in Solr. I faced the same issue and handled converting the query string to lower case in my code itself. The filters and analyzers are not applicable for wildcard queries. The searchable mail archive is wonderful G. Best Erick

Re: Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Fergus McMenemie
Hmmm. Just gave that a go! No luck But how many layers of defaults do we need? Rgds Fergus What about having the template transformer support ${field:default} syntax? I'm assuming it doesn't support that currently right? The replace stuff in the config files does though. Erik

Re: delete snapshot??

2009-02-13 Thread Bill Au
The --delete option of the rsync command deletes extraneous files from the destination directory. It does not delete Solr snapshots. To do that you can use the snapcleaner on the master and/or slave. Bill On Fri, Feb 13, 2009 at 10:15 AM, sunnyfr johanna...@gmail.com wrote: root 26834

Re: Wildcard query case problem

2009-02-13 Thread Alexander Ramos Jardim
Thanks for pointing this out to me Erick. 2009/2/13 Erick Erickson erickerick...@gmail.com From a post in the archives: Wildcard searches are case-sensitive in Solr. I faced the same issue and handled converting the query string to lower case in my code itself. The filters and analyzers

Re: SolrJ API and XMLResponseParser

2009-02-13 Thread Amit Nithian
Hi Noble, According to the wiki, the following should work: server.setParser(new XMLResponseParser()); However, I don't see that method. The only place I see that method even being declared is in the SolrRequest class but then wiring that up with the SolrServer and getting results wasn't overly

Re: Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Fri, Feb 13, 2009 at 10:17 AM, Fergus McMenemie fer...@twig.me.uk wrote: Paul, Following up your usenet sussgetion: field column=id template=${jc.fileAbsolutePath}${x.vurl} ignoreMissingVariables=true/ and to add more to what I was thinking... if the field is undefined in the input

Re: Problem using DIH templatetransformer to create uniqueKey

2009-02-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Fri, Feb 13, 2009 at 11:04 AM, Erik Hatcher e...@ehatchersolutions.com wrote: What about having the template transformer support ${field:default} syntax? this is the only usecase for this. this can be easily achieved with a custom Transformer I'm assuming it doesn't support that currently

Re: SolrJ API and XMLResponseParser

2009-02-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Fri, Feb 13, 2009 at 9:18 PM, Amit Nithian anith...@gmail.com wrote: Hi Noble, According to the wiki, the following should work: server.setParser(new XMLResponseParser()); I guess it may be a typo. pls referto the javadocs for CommonsHttpSolrServer However, I don't see that method. The