Re: Ebay Kleinanzeigen and Auto Suggest

2011-04-27 Thread Eric Grobler
Thanks for the links Otis, I will have a look. Regards Ericz On Tue, Apr 26, 2011 at 10:06 PM, Otis Gospodnetic otis_gospodne...@yahoo.com wrote: Hi Eric, Before using the terms component, allow me to point out: * http://sematext.com/products/autocomplete/index.html (used on

Re: Ebay Kleinanzeigen and Auto Suggest

2011-04-27 Thread Eric Grobler
Hi Otis, The new Solr 3.1 Suggester also does not support filter queries. Is anyone using shingles with faceting on large data? Regards Ericz On Tue, Apr 26, 2011 at 10:06 PM, Otis Gospodnetic otis_gospodne...@yahoo.com wrote: Hi Eric, Before using the terms component, allow me to point

Re: WhitespaceTokenizer and scoring(field length)

2011-04-27 Thread roySolr
I thought it was something simple. Here is my configuration: <fieldType name="searchType" class="solr.TextField" positionIncrementGap="100"> <analyzer> <charFilter class="solr.HTMLStripCharFilterFactory"/> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter

Re: WhitespaceTokenizer and scoring(field length)

2011-04-27 Thread Ahmet Arslan
Lucene/solr's length normalization is not discriminative for very short documents. See Jay's excellent explanation for more details. http://search-lucene.com/m/uGKRu1pvRjw/ - Original Message - From: roySolr royrutten1...@gmail.com To: solr-user@lucene.apache.org Cc: Sent:

Re: Query regarding solr plugin.

2011-04-27 Thread rajini maski
Erick, Thank you. I could fix the problem. Started from scratch considering your advice and have been successful. Thanks a lot. Rajani Maski On Tue, Apr 26, 2011 at 5:28 PM, Erick Erickson erickerick...@gmail.com wrote: Sorry, but there's too much here to debug remotely. I strongly advise you

Re: Suggester or spellcheck return stored fields

2011-04-27 Thread Erick Erickson
Hmmm, I don't quite get how this would work. There is really no association between terms from one field and terms from another. As I read your message, you have a PM from user1 with a term in some field of interest, call it suggestion. A PM from user2 also has the term suggestion. Now you want

Re: How to Update Value of One Field of a Document in Index?

2011-04-27 Thread Erick Erickson
(2) isn't viable. Updating a multiValued field is the same as any other field, a delete followed by an add of the entire document. (1) could work. The problem here is how you need to search. If you need to search your notes it would be separate from the document. In other words, you

Re: fieldCache only on stats page

2011-04-27 Thread Erick Erickson
There's nothing special you need to do to be able to view the various stats from admin/stats.jsp. If another look doesn't show them, could you post a screenshot? And please include the version of Solr you're using, I checked with 1.4.1. Best Erick On Wed, Apr 27, 2011 at 1:44 AM, Solr Beginner

Facing problem with white space in synonyms

2011-04-27 Thread rajini maski
Query related to Solr SynonymFilterFactory. I am using Solr 1.4.1. I have datatype field textSynonym: <fieldType name="textSynonym" class="solr.TextField" positionIncrementGap="100"> <analyzer> <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>

Re: WhitespaceTokenizer and scoring(field length)

2011-04-27 Thread roySolr
Thanks!! It's clear now, sometimes the lengthNorm is the same. See the table below:
# of terms   lengthNorm
1            1.0
2            .625
3            .5
4            .5
5            .4375
6            .375
7            .375
8            .3125
9            .3125
10           .3125
Is it
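
The repeated values in the table can be reproduced with a small sketch. It assumes the default length norm is 1/sqrt(number of terms) and that the norm is then stored in a single byte, which keeps only the two highest mantissa bits of the float. The function names are made up for illustration, and the bit mask is a simplified model of the one-byte encoding for this value range, not Lucene's actual code.

```python
import math
import struct


def quantize_norm(value: float) -> float:
    """Keep only sign, exponent and the two highest mantissa bits of a float32."""
    # Reinterpret the value as a 32-bit IEEE 754 float and get its raw bits.
    bits = struct.unpack(">I", struct.pack(">f", value))[0]
    # Clear the 21 lowest mantissa bits (float32 has 23 mantissa bits).
    bits &= 0xFFFFFFFF ^ ((1 << 21) - 1)
    return struct.unpack(">f", struct.pack(">I", bits))[0]


def length_norm(num_terms: int) -> float:
    """Assumed default: 1/sqrt(num_terms), then truncated as above."""
    return quantize_norm(1.0 / math.sqrt(num_terms))


if __name__ == "__main__":
    for n in range(1, 11):
        print(n, length_norm(n))
```

Under these assumptions, fields of 3 and 4 terms, 6 and 7 terms, or 8 to 10 terms end up with the same stored norm, which is why length normalization barely discriminates between very short fields.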

Re: fieldCache only on stats page

2011-04-27 Thread Solr Beginner
Solr version: Solr Specification Version: 3.1.0 Solr Implementation Version: 3.1.0 1085815 - grantingersoll - 2011-03-26 18:00:07 Lucene Specification Version: 3.1.0 Lucene Implementation Version: 3.1.0 1085809 - 2011-03-26 18:06:58 Current Time: Wed Apr 27 14:28:34 CEST 2011 Server Start

Re: org.apache.solr.common.SolrException: Error loading class 'org.apache.solr.handler.dataimport.DataImportHandler'

2011-04-27 Thread vrpar...@gmail.com
Thanks Stefan, I did the same and the DataImportHandler error is gone. Then I got an slf4j error, did the same, and that error is gone too. After that I got a requestHandler error, but now I have some configuration problem; I will try to fix it. -- View this message in context:

RE: How to Update Value of One Field of a Document in Index?

2011-04-27 Thread Steven A Rowe
There's the limited join patch, see: https://issues.apache.org/jira/browse/SOLR-2272 that hasn't been applied yet Correction: Yonik committed this feature in r1096978.

it's possible to get an individual facet, not modified by one of the query elements?

2011-04-27 Thread enbrol
Hi everyone, I'm programming a filter system for a listing. Each filter operates on an object that generates the query to solr at the end of execution. After executing the query, it is returned to each filter to display (or not) the results of the facets. Some of these filters are multiple

Re: it's possible to get an individual facet, not modified by one of the query elements?

2011-04-27 Thread Stefan Matheis
Have a look: http://wiki.apache.org/solr/SimpleFacetParameters#Multi-Select_Faceting_and_LocalParams On Wed, Apr 27, 2011 at 4:10 PM, enbrol nicos@gmail.com wrote: Hi everyone, I'm programming a filter system for a listing. Each filter operates on an object that generates the query to solr
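
As a rough sketch of the multi-select faceting approach described on the linked wiki page: the filter query is given a tag, and the facet on the same field excludes that tag, so the facet counts are not narrowed by the user's own selection. The field name `doctype`, the tag name and the helper function are hypothetical, chosen only for illustration.

```python
from urllib.parse import urlencode, parse_qs


def multi_select_facet_params(query, field, selected_values):
    """Build request parameters with a tagged fq and a facet that excludes the tag."""
    tag = f"{field}_tag"  # hypothetical tag name
    params = [("q", query), ("facet", "true")]
    if selected_values:
        clause = " OR ".join(f'"{v}"' for v in selected_values)
        # Tag the filter so it can be excluded when computing the facet.
        params.append(("fq", f"{{!tag={tag}}}{field}:({clause})"))
    # Exclude the tagged filter for this facet only.
    params.append(("facet.field", f"{{!ex={tag}}}{field}"))
    return params


if __name__ == "__main__":
    print(urlencode(multi_select_facet_params("*:*", "doctype", ["pdf"])))
```

Each filter in the listing would get its own tag, so every facet can ignore only the filter that belongs to it.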

Re: Field Length and Highlight

2011-04-27 Thread Alejandro Delgadillo
Thanks man... That did it. On 4/26/11 6:00 PM, Koji Sekiguchi k...@r.email.ne.jp wrote: (11/04/27 7:35), Alejandro Delgadillo wrote: Hi, I've been using solr with Coldfusion9, I've made a couple of adjustment to it in order to fulfill my needs of my client, I'm using solr as a document

Re: Facing problem with white space in synonyms

2011-04-27 Thread lboutros
Could you try to escape white spaces like this: Hind\ claw Ludovic. - Jouve France. -- View this message in context: http://lucene.472066.n3.nabble.com/Facing-problem-with-white-space-in-synonyms-tp2870193p2870552.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: how to abort a running optimize

2011-04-27 Thread stockii
what do you mean with segments-number ? - --- System One Server, 12 GB RAM, 2 Solr Instances, 7 Cores, 1 Core with 31 Million Documents other Cores 100.000 - Solr1 for Search-Requests - commit every Minute - 5GB Xmx -

CommonGrams indexing very slow!

2011-04-27 Thread Salman Akram
All, We have created index with CommonGrams and the final size is around 370GB. Everything is working fine but now when we add more documents into index it takes forever (almost 12 hours)...seems to change all the segments file in a commit. The same commit used to take few mins with normal

Re: How to Update Value of One Field of a Document in Index?

2011-04-27 Thread Erick Erickson
OK, thanks. On Wed, Apr 27, 2011 at 9:29 AM, Steven A Rowe sar...@syr.edu wrote: There's the limited join patch, see: https://issues.apache.org/jira/browse/SOLR-2272 that hasn't been applied yet Correction: Yonik committed this feature in r1096978.

Re: CommonGrams indexing very slow!

2011-04-27 Thread Erick Erickson
Are you by any chance optimizing? Best Erick On Wed, Apr 27, 2011 at 11:04 AM, Salman Akram salman.ak...@northbaysolutions.net wrote: All, We have created index with CommonGrams and the final size is around 370GB. Everything is working fine but now when we add more documents into index it

Re: CommonGrams indexing very slow!

2011-04-27 Thread Salman Akram
No way. It just does this while committing. Also before this when we merged multiple small indexes without optimization - as it was done in past it again took around 12 hours and made around 20 CFS files (it never happened before) On Wed, Apr 27, 2011 at 8:21 PM, Erick Erickson

Re: WhitespaceTokenizer and scoring(field length)

2011-04-27 Thread Jonathan Rochkind
You can turn off norms for the field. It doesn't make any sense to talk about changing the length norm. The length norm is based on the size of the field for the particular document, to implement the TF/IDF style relevance algorithm. But you can turn off norms for the field if you don't

ClientUtil.escapeQueryChars and Solr Search

2011-04-27 Thread Sethi, Parampreet
Hi All, I am using ClientUtil.escapeQueryChars function to escape query characters, before I make call to Solr (using Solr 1.3) server through Solrj client in Java. The issue is any query with space, is getting escaped. For example chicken tikka is escaped to chicken\ tikka. When the escaped

RE: CommonGrams indexing very slow!

2011-04-27 Thread Burton-West, Tom
Hi Salman, Sounds like somehow you are triggering merges or optimizes. What is your mergeFactor? Have you turned on the IndexWriter log? In solrconfig.xml: <infoStream file="${solr.indexwriter.log.dir}">true</infoStream> In our case we feed the directory name as a Java property in our java

Search across related/correlated multivalue fields in Solr

2011-04-27 Thread ronotica
The nature of my project is such that search is needed and specifically search across related entities. We want to perform several queries involving a correlation between two or more properties of a given entity in a collection. To put things in context, here is a snippet of the domain: Student

RE: SynonymFilterFactory case changes

2011-04-27 Thread Robert Petersen
Yes I did, but that's cool because it is useful to make the final determination explicit here on the group for the benefit of other users. :) Thanks Robi -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Tuesday, April 26, 2011 5:10 PM To:

Re: CommonGrams indexing very slow!

2011-04-27 Thread Salman Akram
Thanks for the response. We got it resolved! We made small indexes in bulk using SOLR with Standard File Format and then merged it with a Lucene app which for some reason made it CFS. Now when we started adding real time documents using SOLR (with Compound File Format set to false) it was

Re: Search across related/correlated multivalue fields in Solr

2011-04-27 Thread Jonathan Rochkind
There is no great way. One approach would be to 'de-normalize' at index time, to actually have a field that looks like this: institution_year: 2010.OHIO_ST ; 2007.YALE Then, with some code on client side, you could more easily facet and search how you want. It still doesn't (I don't
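
The de-normalization idea can be sketched as follows, assuming each student has a list of (year, institution) pairs and that `institution_year` is a multivalued field. The helper names and the document shape are hypothetical; only the `YEAR.INSTITUTION` token format comes from the message above.

```python
def institution_year_values(enrollments):
    """Return one 'YEAR.INSTITUTION' token per (year, institution) pair."""
    return [f"{year}.{institution}" for year, institution in enrollments]


def build_document(student_id, enrollments):
    """Document to index: the pair is kept together in a single field value."""
    return {
        "id": student_id,
        "institution_year": institution_year_values(enrollments),
    }


def correlated_filter(year, institution):
    """Filter clause matching only documents where year and institution co-occur."""
    return f'institution_year:"{year}.{institution}"'


if __name__ == "__main__":
    print(build_document("student-1", [(2010, "OHIO_ST"), (2007, "YALE")]))
    print(correlated_filter(2010, "OHIO_ST"))
```

Because the year and the institution live in one token, a query for 2010 and YALE does not match a student who attended OHIO_ST in 2010 and YALE in 2007, which two separate multivalued fields cannot guarantee.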

Re: Search across related/correlated multivalue fields in Solr

2011-04-27 Thread Renaud Delbru
Hi, you might want to look at the SIREn plugin [1,2], which allows you to index and query 1:N relationships such as yours, in a tabular data format [3]. [1] http://siren.sindice.com/ [2] https://github.com/rdelbru/SIREn [3]

Re: Search across related/correlated multivalue fields in Solr

2011-04-27 Thread ronotica
Thanks Jonathan. We thought this through and have done quite a bit of digging on this and saw there was no easy way around this. It was our last attempt to post to the forum and see if there was some killer feature we were missing somehow. We thought of the concatenated fields route, but quickly

Re: Search across related/correlated multivalue fields in Solr

2011-04-27 Thread Jason Rutherglen
Renaud, Can you provide a brief synopsis of how your system works? Jason On Wed, Apr 27, 2011 at 11:17 AM, Renaud Delbru renaud.del...@deri.org wrote: Hi, you might want to look at the SIREn plugin [1,2], which allows you to index and query 1:N relationships such as yours, in a tabular data

Re: Search across related/correlated multivalue fields in Solr

2011-04-27 Thread ronotica
Thanks Renaud - I'll look into that asap. -- View this message in context: http://lucene.472066.n3.nabble.com/Search-across-related-correlated-multivalue-fields-in-Solr-tp2871176p2871395.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Search across related/correlated multivalue fields in Solr

2011-04-27 Thread Renaud Delbru
Hi Jason, On 27/04/11 19:25, Jason Rutherglen wrote: Renaud, Can you provide a brief synopsis of how your system works? SIREn provides a new field type for Solr. In this particular SIREn field, the data is not a piece of text, but is organised in a table. Then, SIREn provides query objects

Re: Search across related/correlated multivalue fields in Solr

2011-04-27 Thread Renaud Delbru
On 27/04/11 19:37, Renaud Delbru wrote: Hi Jason, On 27/04/11 19:25, Jason Rutherglen wrote: Renaud, Can you provide a brief synopsis of how your system works? SIREn provides a new field type for Solr. In this particular SIREn field, the data is not a piece of text, but is organised in a

RE: CommonGrams indexing very slow!

2011-04-27 Thread Burton-West, Tom
Hi Salman, We had a similar problem with the IndexMergeTool in Lucene contrib. I seem to remember having to hack the IndexMergeTool code so that it wouldn't create the CFF automatically. Let me know if you need it and I'll dig up the modified code. Tom. -Original Message- From:

Re: Search across related/correlated multivalue fields in Solr

2011-04-27 Thread Walter Underwood
This kind of thing is really easy in an XML database. That is an XPath expression, not even a search. MarkLogic implements it with search engine technology, but you don't have to care about that. wunder Walter Underwood, MarkLogic On Apr 27, 2011, at 11:43 AM, Renaud Delbru wrote: On

Re: Performance with search terms starting and ending with wildcards

2011-04-27 Thread Ueland
Hi! Thanks for the reply. We decided to give ngrams another try. After much tweaking/tuning, both the size and the speed were more than good enough for our needs. So it looks like ngrams was the solution for us after all :) Best regards Tor Henning Ueland -- View this message in

Replication Fails with Unreachable error when master host is responding.

2011-04-27 Thread Jed Glazner
Hello All, I'm having a very strange problem that I just can't figure out. The slave is not able to replicate from the master, even though the master is reachable from the slave machine. I can telnet to the port it's running on, I can use text based browsers to navigate the master from the

Re: Search across related/correlated multivalue fields in Solr

2011-04-27 Thread Renaud Delbru
On 27/04/11 19:50, Walter Underwood wrote: This kind of thing is really easy in an XML database. That is an XPath expression, not even a search. Indeed, in fact SIREn is based on a XML IR technique, i.e., a simplified node-based indexing scheme. -- Renaud Delbru

Re: it's possible to get an individual facet, not modified by one of the query elements?

2011-04-27 Thread enbrol
Thank you very much, that's exactly what I want to do. But now I have a new problem. On the server we have Solr 1.4.1 installed, so we can't use a tag in the main query. Is there any way to use the fq without caching it? The server will be upgraded in July/August, so this is just a temporary

SolrQuery#setStart(Integer) ???

2011-04-27 Thread Leonardo Souza
Hi Guys, We have an index with more than 3 million documents, and we use the pagination feature through the SolrQuery#setStart and SolrQuery#setRows methods. Some queries can return a huge number of documents and I'm worried about the integer parameter of the setStart method, this parameter should be a

Re: WhitespaceTokenizer and scoring(field length)

2011-04-27 Thread Ahmet Arslan
Is it possible to change the lengthNorm? Yes you can customize it and plug it into solr. DefaultSimilarity and SweetSpotSimilarity can be starting point. http://lucene.apache.org/java/3_0_3/api/core/org/apache/lucene/search/Similarity.html#lengthNorm%28java.lang.String,%20int%29 -- View

Re: Question on Batch process

2011-04-27 Thread Otis Gospodnetic
Hi Charles, Yes, the threads I was referring to are in the context of the client/indexer, so one of the params for StreamingUpdateSolrServer. post.sh/jar are just there because they are handy. Don't use them for production. It's impossible to tell how long indexing of 100M documents may

Re: WhitespaceTokenizer and scoring(field length)

2011-04-27 Thread Otis Gospodnetic
In Solr's schema.xml you can use omitNorms=true to turn norms off on field-by-field basis. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Jonathan Rochkind rochk...@jhu.edu To:

Re: WhitespaceTokenizer and scoring(field length)

2011-04-27 Thread Otis Gospodnetic
Yes, it is possible to implement your own Lucene Similarity in which you can override the length norm. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: roySolr royrutten1...@gmail.com

Re: SolrQuery#setStart(Integer) ???

2011-04-27 Thread Erick Erickson
Well, the java native int format is 32 bits, so unless you're returning over 2 billion documents, you should be OK. But you'll run into other issues long before you get to that range. Best Erick On Wed, Apr 27, 2011 at 5:25 PM, Leonardo Souza leonardo...@gmail.com wrote: Hi Guys, We have an
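
To put numbers on this, a small arithmetic sketch: a signed 32-bit integer tops out at 2^31 - 1, far above the start offset of the last page of a 3-million-document result set. The function name and the page size of 10 are assumptions for illustration only.

```python
JAVA_INT_MAX = 2**31 - 1  # largest value of a signed 32-bit integer


def start_offset(page: int, rows: int) -> int:
    """Zero-based start offset for a page, checked against the 32-bit limit."""
    start = page * rows
    if start > JAVA_INT_MAX:
        raise OverflowError(f"start offset {start} does not fit in a 32-bit int")
    return start


if __name__ == "__main__":
    # Last page of 3,000,000 hits at 10 rows per page.
    print(start_offset(299_999, 10), "of", JAVA_INT_MAX)
```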

Embedded Solr Optimize under Windows

2011-04-27 Thread Greg Pendlebury
Hi All, Just quick query of no particular importance to me, but we did observe this problem: http://code.google.com/p/solr-geonames/wiki/DeveloperInstall It's worth noting that the build has also been run on Mac and Solaris now, and the Solr index is about half the size. We suspect the

Re: Question on Batch process

2011-04-27 Thread Charles Wardell
Thank you for your response. I did not make the StreamingUpdate application yet, but I did change the other settings that you mentioned. It gave me a huge boost in indexing speed. (I am still using post.sh but hope to change that soon). One thing I noticed is the indexing speed was incredibly

Indexing odt files

2011-04-27 Thread Derek Croxton
Requesting help for someone way outside of his comfort zone. :) I'm trying to use solr to index several hundred OpenDocument files. I downloaded and installed the example site and got it to work on the same files. I modified post.sh to change the mime type to vnd.oasis.opendocument.text (and

Re: Indexing odt files

2011-04-27 Thread Grijesh
Hi Derek, Simple Post Tool is only for posting xml docs. If you want to index OpenDocument files then you have to use ExtractingRequestHandler (AKA solr cell). - Thanx: Grijesh www.gettinhahead.co.in -- View this message in context: