Re: Highlighting exact phrases with Solr

2009-10-20 Thread Antonio Calò
Hi Koji, many thanks for your suggestion. Sorry for the delay in my feedback. I've tried setting hl.usePhraseHighlighter=true, but it is still not working. Here is my setup: <highlighting> <!-- Configure the standard fragmenter --> <!-- This could most likely be commented out in the default case -->

Re: [DIH] URLDataSource and fetching a link

2009-10-20 Thread Grant Ingersoll
Finally getting back to this... On Sep 17, 2009, at 12:28 AM, Noble Paul നോബിള്‍ नोब्ळ् wrote: 2009/9/17 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@corp.aol.com: it is possible to have a sub entity which has XPathEntityProcessor which can use the link as the url This may not be a good

Solr and bitwise comparison

2009-10-20 Thread sahid
Hello, I have a problem :) I am trying to write a Solr query similar to SELECT COUNT(*) FROM InscriptionNew WHERE choices 17 0; in MySQL. Is it possible? Do you have any ideas? Thanks -- ~sahid

Re: [DIH] URLDataSource and fetching a link

2009-10-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
<entity name="nytSportsFeed" pk="link" url="http://feeds1.nytimes.com/nyt/rss/Sports" processor="XPathEntityProcessor" forEach="/rss/channel | /rss/channel/item"

Specify increment gap with PatternTokenizerFactory

2009-10-20 Thread Julien Nioche
Hi, Is there a way to specify an increment gap between tokens with the PatternTokenizerFactory or do I need to customise it? For instance if I split on commas in *Books, Online Shopping, Book Store* I want to be able to put a 100 position gap between say books and online shopping. There is of
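
To make the requested behaviour concrete, here is a minimal Python sketch of the position arithmetic only. It is not PatternTokenizerFactory or any Solr API; the function name, the lowercasing, and the `gap` parameter are assumptions made for illustration. Positions advance by 1 within a comma-separated value and jump by an extra 100 when a comma is crossed.

```python
import re

def tokenize_with_gap(text, gap=100):
    """Return (token, position) pairs for a comma-separated string.

    Hypothetical helper: positions rise by 1 inside a value and by an
    additional `gap` each time a comma boundary is crossed.
    """
    tokens = []
    position = -1
    for value in re.split(r"\s*,\s*", text.strip()):
        words = value.lower().split()
        if not words:
            continue  # skip empty values such as a trailing comma
        if tokens:
            position += gap  # extra gap between comma-separated values
        for word in words:
            position += 1
            tokens.append((word, position))
    return tokens

result = tokenize_with_gap("Books, Online Shopping, Book Store")
for token, pos in result:
    print(pos, token)
```

With a gap this large, a phrase query with a small slop cannot match across two different values, which is the usual reason for wanting such a gap.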

Re: Highlighting exact phrases with Solr

2009-10-20 Thread Koji Sekiguchi
Antonio, Put the parameter into the <requestHandler/> element, rather than <highlighting/>. If you are using the standard request handler, set it like this: <requestHandler name="standard" class="solr.SearchHandler" default="true"> <lst name="defaults"> <str name="echoParams">explicit</str> <bool

Re: Boost with wildcard.

2009-10-20 Thread Jay Ess
AHMET ARSLAN wrote: I do not know how to solve your problem without writing custom code. Aaargh. So now I stand between sticking my head into the source OR writing my own engine (which is not a big deal because my requirements aren't that big, but it will anyway set me back a week or so).

Re: multi-word synonyms and analysis.jsp vs real field analysis (query, index)

2009-10-20 Thread Patrick Jungermann
Thanks Hoss, after your hints, which partially confirmed my own thinking, I ran some tests with the FieldQParser. At the beginning I had some problems, but finally I was able to solve the problem of multi-word synonyms at query time in a way that is suitable for us - and possibly

RE: Solr commits before documents are added

2009-10-20 Thread Feak, Todd
Any chance you are indexing to a Master, then synching to a Slave and you aren't seeing those last 20 on the Slave? There is an issue with synching between Master and Slave that we've experienced. If the last commit is very small (20 sounds possible!) it can occur in the same clock second on

Re: Boost with wildcard.

2009-10-20 Thread Yonik Seeley
On Mon, Oct 19, 2009 at 10:32 AM, Jay Ess li...@netrogenic.com wrote: The boost (index time) does not work when I am searching for a word with a wildcard appended to the end. I stumbled onto this feature and it's pretty much a show stopper for me. I am implementing a live search feature where

RE: Solr commits before documents are added

2009-10-20 Thread SharmilaR
I am not using master slave setup. To give more information, I use EmbeddedSolrServer. - Sharmila Feak, Todd wrote: Any chance you are indexing to a Master, then synching to a Slave and you aren't seeing those last 20 on the Slave? There is an issue with synching between Master and

max words/tokens

2009-10-20 Thread Joe Calderon
I have a pretty basic question: is there an existing analyzer that limits the number of words/tokens indexed from a field? Let's say I only wanted to index the top 25 words... thx much --joe

Slow Phrase Queries

2009-10-20 Thread DHast
Hello, I have recently installed Solr as an alternative to our homemade Lucene search servers, and while in most respects the performance is better, I notice that phrase searches are incredibly slow compared to plain Lucene, primarily when using facets. Example: City of New York, Matter of

Re: Slow Phrase Queries

2009-10-20 Thread Tom Burton-West
You might try a couple of tests in the Solr admin interface to make sure the query is being processed the same way in both Solr and raw Lucene. 1) use the analysis panel to determine if the Solr filter chain is doing something unexpected compared to your Lucene filter chain 2) try running a debug

Re: Slow Phrase Queries

2009-10-20 Thread Yonik Seeley
Solr just uses a stock lucene phrase query. What version of Lucene and Solr are you comparing? Do the queries match the same number of documents? -Yonik http://www.lucidimagination.com On Tue, Oct 20, 2009 at 2:18 PM, DHast hastings.recurs...@gmail.com wrote: Hello, I have recently installed

Re: max words/tokens

2009-10-20 Thread Yonik Seeley
On Tue, Oct 20, 2009 at 1:53 PM, Joe Calderon calderon@gmail.com wrote: I have a pretty basic question: is there an existing analyzer that limits the number of words/tokens indexed from a field? Let's say I only wanted to index the top 25 words... It would be really easy to write one, but no
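
As a rough illustration of how small such a limiter can be, here is a Python sketch of the idea only. It is not a Solr or Lucene filter; `limit_tokens` and the sample token stream are hypothetical, and "top 25 words" is read here as "the first 25 tokens of the field", which is an assumption.

```python
from itertools import islice

def limit_tokens(tokens, max_tokens=25):
    """Keep only the first `max_tokens` items of a token stream.

    Hypothetical helper: works on any iterable and stops consuming
    the stream as soon as the limit is reached.
    """
    return list(islice(tokens, max_tokens))

# A made-up stream of 100 tokens standing in for an analyzed field.
words = ("w%d" % i for i in range(100))
kept = limit_tokens(words, 25)
print(len(kept), kept[0], kept[-1])
```

Stopping early matters for large fields: the rest of the stream is never tokenized once the limit has been hit.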

Hierarchical Facet Sorting

2009-10-20 Thread Nasseam Elkarra
We have facet.sort=true; however, our hierarchical fields are being sorted alphabetically instead of by count. Here are the field type and field definitions: <fieldType name="hierarchy" class="solr.HierarchicalFacetField" omitNorms="true" positionIncrementGap="0" indexed="true" stored="false" delimiter="//"

Re: Slow Phrase Queries

2009-10-20 Thread DHast
Ah, it turns out it was one of my 6 facets, the author. In the data pool there are over 1.9 million documents and about 800,000 authors; removing that facet worked, since the field was untokenized and slow considering how many values there were. Solr is definitely faster, and as fast and or

Re: Hierarchical Facet Sorting

2009-10-20 Thread Yonik Seeley
What version of Solr are you using? I just tried this with the latest 1.4-dev version, and it works fine. http://localhost:8983/solr/select?q=*:*&facet=true&facet.field=cat&facet.sort=true Note that facet.sort=true/false has been deprecated in Solr 1.4
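
For anyone scripting this test, one way to assemble the same request is sketched below in Python. The host, port, and the field name cat come from the message above; the variable names are illustrative, and nothing is actually sent to a server.

```python
from urllib.parse import urlencode

# Parameters taken from the request quoted above, in the same order.
params = [
    ("q", "*:*"),
    ("facet", "true"),
    ("facet.field", "cat"),
    ("facet.sort", "true"),
]

# safe="*:" keeps the match-all query readable instead of percent-encoding it.
url = "http://localhost:8983/solr/select?" + urlencode(params, safe="*:")
print(url)
```

Building the query string from a list of pairs keeps the `&` separators explicit and makes it easy to add a second `facet.field` entry later.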

Re: Wordnet dictionary integration with Solr - help

2009-10-20 Thread Nilya
Did anybody get a chance to look at this? I looked at the previous posts on WordNet, but they didn't help much. Cheers, Nilya wrote: I have been trying to integrate the WordNet dictionary with Solr. I used the link below to generate indexes using the prolog package from WordNet.

Re: max words/tokens

2009-10-20 Thread Joe Calderon
Cool, np, I just didn't want to duplicate code if that already existed. On Tue, Oct 20, 2009 at 12:49 PM, Yonik Seeley yo...@lucidimagination.com wrote: On Tue, Oct 20, 2009 at 1:53 PM, Joe Calderon calderon@gmail.com wrote: I have a pretty basic question: is there an existing analyzer that

question about text field and WordDelimiterFilter in example schema.xml

2009-10-20 Thread Bill Au
I have a question regarding the use of the WordDelimiterFilter in the text field in the example schema.xml. The parameters are set differently for the indexing and querying. Namely, catenateWords and catenateNumbers are set differently. Shouldn't the same analysis be done at both index and

Re: Wordnet dictionary integration with Solr - help

2009-10-20 Thread Robert Muir
hi, one thing of interest, in lucene's contrib (in the memory package... kinda a confusing place), there is a wordnet synonym filter. if you look at its test you can get an idea of how it works: TestSynonymTokenFilter, it reads a testSynonyms.txt (a mini wordnet file for testing purposes) On

Re: question about text field and WordDelimiterFilter in example schema.xml

2009-10-20 Thread Yonik Seeley
On Tue, Oct 20, 2009 at 6:37 PM, Bill Au bill.w...@gmail.com wrote: I have a question regarding the use of the WordDelimiterFilter in the text field in the example schema.xml.  The parameters are set differently for the indexing and querying.  Namely, catenateWords and catenateNumbers are set

question about merging indexes

2009-10-20 Thread didier deshommes
Hi there, I need some help with the mergeindex command. I have 2 cores A and B that I want to merge into a new index RES. A has 100 docs and B 10 docs. All of B's docs are from A, except that one attribute is changed. The goal is to bring the updated attributes from B into A. When I issue the