Re: Migration from Solr 1.2 to Solr 1.4

2011-02-17 Thread Stijn Vanhoorelbeke
Hi, I recently ran into the same issues; I'm updating my Solr 1.4 to the latest nightly build (to get ZooKeeper functionality). I copied the solr_home dir, but with no success (the config files were not accepted by the new build due to a version mismatch). Then I copied only the

Re: Separating Index Reader and Writer

2011-02-17 Thread Em
Push again. Regards Em wrote: Just wanted to push that topic. Regards Em wrote: Hi Peter, I must jump in this discussion: From a logical point of view what you are saying makes only sense if both instances do not run on the same machine or at least not on the same drive.

My Plan to Scale Solr

2011-02-17 Thread Bing Li
Dear all, I started to learn how to use Solr three months ago. My experience is still limited. Now I crawl Web pages with my crawler and send the data to a single Solr server. It runs fine. Since the number of potential users is large, I decided to scale Solr. After configuring replication, a single

Re: Replication and newSearcher registerd poll interval

2011-02-17 Thread dan sutton
Hi, Keeping the thread alive, any thought on only doing replication if there is no warming currently going on? Cheers, Dan On Thu, Feb 10, 2011 at 11:09 AM, dan sutton danbsut...@gmail.com wrote: Hi, If the replication window is too small to allow a new searcher to warm and close the

fine tuning the solr search

2011-02-17 Thread Churchill Nanje Mambe
Hi I would love to know how to do this with solr say a user inputs Account manager files, I wish that solr puts priority on the documents it finds as follows 1) documents containing account manager files gets a greater score 2) then documents with account manager come next 3) then documents

Re: Replication and newSearcher registerd poll interval

2011-02-17 Thread Markus Jelsma
If you set maxWarmingSearchers to 1 then you cannot issue an overlapping commit. Slaves won't poll for a new index version while replication is in progress. It works well in my environment where there is a high update/commit frequency, about a thousand documents per minute. The system even

Re: My Plan to Scale Solr

2011-02-17 Thread Markus Jelsma
Hi Bing Li, On Thursday 17 February 2011 10:32:11 Bing Li wrote: Dear all, I started to learn how to use Solr three months ago. My experiences are still limited. Now I crawl Web pages with my crawler and send the data to a single Solr server. It runs fine. Since the potential users

Re: My Plan to Scale Solr

2011-02-17 Thread Stijn Vanhoorelbeke
Hi, I'm currently looking at SolrCloud. I've managed to set up a scalable cluster with ZooKeeper. ( see the examples in http://wiki.apache.org/solr/SolrCloud for a quick understanding ) This way, all different shards / replicas are stored in a centralised configuration. Moreover the ZooKeeper

Re: Validate Query Syntax of Solr Request Before Sending

2011-02-17 Thread Markus Jelsma
Uh, how about the LuceneQParser? It does some checks and can return appropriate error messages. On Thursday 17 February 2011 06:44:16 csj wrote: Hi, I wonder if it is possible to let the user build up a Solr Query and have it validated by some java API before sending it to Solr. Is there

is solr dynamic calculation??

2011-02-17 Thread satya swaroop
Hi All, I have a question: does Solr show results by calculating the score dynamically, or is it precalculating and supplying it? For example, if a query is made on q=solr in my index, I get 25 result documents. What is it calculating? I am very keen to

Re: fine tuning the solr search

2011-02-17 Thread Markus Jelsma
Have a read: http://lucene.apache.org/java/2_9_1/scoring.html On Thursday 17 February 2011 12:50:08 Churchill Nanje Mambe wrote: Hi I would love to know how to do this with solr say a user inputs Account manager files, I wish that solr puts priority on the documents it finds as follows

Re: is solr dynamic calculation??

2011-02-17 Thread Markus Jelsma
Both; you should also read about scoring: http://lucene.apache.org/java/2_9_1/scoring.html On Thursday 17 February 2011 13:39:05 satya swaroop wrote: Hi All, I have a question: does Solr show results by calculating the score dynamically, or is it precalculating and

Re: How to use XML parser in DIH for a database?

2011-02-17 Thread Estrada Groups
If you're using a DIH for SQL Server, you can configure it however you want. Here is a snippet of my code. Note the Driver you need to grab from sourcenet. dataConfig dataSource type=JdbcDataSource name=bleh driver=net.sourceforge.jtds.jdbc.Driver

Re: How to use XML parser in DIH for a database?

2011-02-17 Thread Estrada Groups
I was also going to say: why even worry about using XPath when you can write a SQL query to get your data out? That's what I did, and it seems much simpler and cuts out a step. Adam Sent from my iPhone On Feb 16, 2011, at 10:21 PM, Bill Bell billnb...@gmail.com wrote: Does anyone have an example

Building queries for SolR

2011-02-17 Thread Vincent Cautaerts
Hi, I'm porting/upgrading a project from Lucene to Solr. In Lucene, I was using the user-provided Lucene query string, and I did complete it to implement access restriction, based on fields saved in the index: Query userQ=... // query from user Query restrictQ=.. // implement restrictions Query

Re: Building queries for SolR

2011-02-17 Thread Erik Hatcher
Vincent, Look at Solr's fq (filter query) capability. You'll likely want to put your restricting query in an fq parameter from your search client. If your restricting query is a simple TermQuery, have a look at the various built-in query parsers in Solr. On trunk you can do this: fq={!term
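
The fq suggestion above can be sketched from the client side as follows. This is only an illustration, assuming the restriction can be written as a query string; the function name and the field names (title, group) are hypothetical and do not come from the thread. The user's query goes in q and the access restriction in a separate fq parameter, so the restriction never has to be spliced into the user's query text.

```python
from urllib.parse import urlencode, parse_qs


def build_select_query(user_query, restrict_query):
    """Return a URL query string with the user query in q and the
    access restriction in fq (filter query).

    Both arguments are plain Solr query strings; the field names used
    in the example call below are made up for illustration.
    """
    params = [
        ("q", user_query),       # what the user typed
        ("fq", restrict_query),  # restriction added by the application
    ]
    # urlencode percent-encodes characters such as ':' and spaces.
    return urlencode(params)


query_string = build_select_query("title:solr", "group:public")
print(query_string)
```

The resulting string would be appended to the select handler URL of whatever Solr instance is in use.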

Re: last item in results page is always the same

2011-02-17 Thread Paul
Thanks, going to update now. This is a system that is currently deployed. Should I just update to 1.4.1, or should I go straight to 3.0? Does 1.4 => 3.0 require reindexing? On Wed, Feb 16, 2011 at 5:37 PM, Yonik Seeley yo...@lucidimagination.com wrote: On Wed, Feb 16, 2011 at 5:08 PM, Paul

Re: last item in results page is always the same

2011-02-17 Thread Markus Jelsma
It's fixed in 1.4.1. https://issues.apache.org/jira/browse/SOLR-1777 On Thursday 17 February 2011 16:04:18 Paul wrote: Thanks, going to update now. This is a system that is currently deployed. Should I just update to 1.4.1, or should I go straight to 3.0? Does 1.4 => 3.0 require reindexing?

Re: last item in results page is always the same

2011-02-17 Thread Erik Hatcher
Paul - go with 1.4.1 in this case. Keep tabs on the upcoming 3.1 release (of both Lucene and Solr) and consider that in a month or so. Erik On Feb 17, 2011, at 10:04 , Paul wrote: Thanks, going to update now. This is a system that is currently deployed. Should I just update to

Re: last item in results page is always the same

2011-02-17 Thread Yonik Seeley
On Thu, Feb 17, 2011 at 10:04 AM, Paul p...@nines.org wrote: Thanks, going to update now. This is a system that is currently deployed. Should I just update to 1.4.1, or should I go straight to 3.0? Does 1.4 => 3.0 require reindexing? There is no 3.0 - that release happened before the

RE: Solr multi cores or not

2011-02-17 Thread Thumuluri, Sai
We have 3 applications and they need to have different relevancy models, synonyms, stop words etc. App A - content size - 20 GB - MySQL and Drupal based app App B - # of documents ~ 400K; index size ~ 25 GB - primarily a portal with links to different applications, data sources include crawl

Re: My Plan to Scale Solr

2011-02-17 Thread Dennis Gearon
What's an 'LSA' Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better idea to learn from others’ mistakes, so you do not have to make them yourself. from

Re: My Plan to Scale Solr

2011-02-17 Thread Walter Underwood
http://lmgtfy.com/?q=SLA wunder On Feb 17, 2011, at 11:04 AM, Dennis Gearon wrote: What's an 'LSA' Dennis Gearon

GET or POST for large queries?

2011-02-17 Thread mrw
We are running into some issues with large queries. Initially, they were ostensibly header buffer overruns, because increasing Jetty's headerBufferSize value to 65536 resolved them. This seems like a kludge, but it does solve the problem for 95% of our users. However, we do have queries that

Re: GET or POST for large queries?

2011-02-17 Thread Erik Hatcher
Yes, you may use POST to make search requests to Solr. Erik On Feb 17, 2011, at 14:27 , mrw wrote: We are running into some issues with large queries. Initially, they were ostensibly header buffer overruns, because increasing Jetty's headerBufferSize value to 65536 resolved them.
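
A minimal sketch of the POST approach, assuming a standard form-encoded request body. The URL and the example query are hypothetical and not taken from the thread. The parameters that would otherwise go in the URL are moved into the body, so the query length is no longer bounded by the header buffer.

```python
from urllib.parse import urlencode, parse_qs
from urllib.request import Request


def build_post_search(solr_url, query):
    """Build (but do not send) a POST search request.

    The q parameter travels in the form-encoded body rather than in
    the URL. solr_url is whatever select handler URL is in use.
    """
    body = urlencode({"q": query}).encode("utf-8")
    return Request(
        solr_url,
        data=body,  # a non-None body makes urllib issue a POST
        headers={"Content-Type": "application/x-www-form-urlencoded"},
    )


# Hypothetical URL and query, for illustration only.
request = build_post_search(
    "http://localhost:8983/solr/select", "id:(1 OR 2 OR 3)"
)
print(request.get_method())
```

Passing the request to urllib.request.urlopen would send it. Note that this only addresses the size of the request, not any limit Solr applies to the query itself.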

Re: GET or POST for large queries?

2011-02-17 Thread mrw
Yeah, I tried switching to POST. It seems to be handling the size, but apparently Solr has a limit on the number of boolean comparisons -- I'm now getting "too many boolean clauses" errors emanating from org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:108). :)

Re: GET or POST for large queries?

2011-02-17 Thread Jonathan Rochkind
Yes, I think it's 1024 by default. I think you can raise it in your config. But your performance may suffer. Best would be to try and find a better way to do what you want without using thousands of clauses. This might require some custom Java plugins to Solr though. On 2/17/2011 3:52 PM,

Re: GET or POST for large queries?

2011-02-17 Thread Dennis Gearon
Probably you could do it, and solving a problem in business supersedes 'rightness' concerns, much to the dismay of geeks and 'those who like rightness and say the word Neemph!'. What is not right about this is that POST, PUT, and DELETE are assumed to make changes to the URL's backend. GET is

solr.KeepWordsFilterFactory confusion

2011-02-17 Thread Robert Haschart
I have a Solr index where certain facet fields should only contain one or more items from a limited list of values. To enforce this restriction at index time I have been looking at using a KeepWordFilterFactory. It seems it ought to work as I have it implemented, and actually seems to work

Re: solr.KeepWordsFilterFactory confusion

2011-02-17 Thread Ahmet Arslan
I've added a new field type in schema.xml: <fieldType name="formatFacet" class="solr.StrField" sortMissingLast="true" omitNorms="true"> <analyzer type="index"> <tokenizer class="solr.KeywordTokenizerFactory"/> <filter class="solr.KeepWordFilterFactory" words="format_facet.txt"

Re: SolrCloud - Example C not working

2011-02-17 Thread Yonik Seeley
FYI, this should be fixed in the (very) latest trunk. -Yonik http://lucidimagination.com

Date Math

2011-02-17 Thread Andreas Kemkes
The SolrQuerySyntax Wiki page refers to DateMathParser for examples. When I tried -1DAY, I got: org.apache.lucene.queryParser.ParseException: Cannot parse 'last_modified:-DAY': Encountered - - at line 1, column 14. Was expecting one of: ( ... * ... QUOTED ... TERM ...
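
A hedged sketch of the form that date math expressions usually take in a query: they are anchored on NOW (for example NOW-1DAY) and used inside a range on a date field, rather than written as a bare -1DAY, which the query parser does not accept as a term. The helper name and the simplified validation pattern below are assumptions made for illustration; the pattern is much looser than the real DateMathParser.

```python
import re

# Simplified check: NOW followed by any number of "/UNIT" roundings or
# "+N UNIT" / "-N UNIT" offsets (written without spaces, e.g. NOW/DAY-1DAY).
# This is an illustrative approximation, not the DateMathParser grammar.
_DATE_MATH = re.compile(r"^NOW(/[A-Z]+|[+-]\d+[A-Z]+)*$")


def date_range(field, start, end="NOW"):
    """Return a range query such as last_modified:[NOW-1DAY TO NOW]."""
    for expr in (start, end):
        if not _DATE_MATH.match(expr):
            raise ValueError("date math must be anchored on NOW: %r" % expr)
    return "%s:[%s TO %s]" % (field, start, end)


print(date_range("last_modified", "NOW-1DAY"))
```

Calling date_range("last_modified", "-1DAY") raises ValueError in this sketch, mirroring the fact that the unanchored form is rejected.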

Index Design Question

2011-02-17 Thread Andreas Kemkes
We are indexing documents with several associated fields for search and display, some of which may change with a much higher frequency than the document content. As per my understanding, we have to resubmit the entire gamut of fields with every update. If the reindexing of the documents

Re: TermVector query using Solr Tutorial

2011-02-17 Thread Chris Hostetter
: I am searching the keyword 25, in the field : : <field name="features">30" TFT active matrix LCD, 2560 x 1600, .25mm : dot pitch, 700:1 contrast</field> : : I want to know the character position of matched keyword in the : corresponding field. : : usb or cabl is not what I want. your search is

Re: Index Design Question

2011-02-17 Thread kenf_nc
Some options to reduce performance implications are: replication... index your documents in one solr instance, and query in a different one. that way the users of the query side will not be as adversely impacted by frequent changes. You have better control over when change occurs. separate

Re: is solr dynamic calculation??

2011-02-17 Thread satya swaroop
Hi Markus, As far as I have gone through the scoring of Solr, the scoring is done during searching, using the boost values that were given during indexing. I have a question now: if I search for the keyword java, then 1) if the term java occurs in 50,000 documents in the index, then do

Re: My Plan to Scale Solr

2011-02-17 Thread Lance Norskog
Or even better, search with 'LSA'. On Thu, Feb 17, 2011 at 9:22 AM, Walter Underwood wun...@wunderwood.org wrote: http://lmgtfy.com/?q=SLA wunder On Feb 17, 2011, at 11:04 AM, Dennis Gearon wrote: What's an 'LSA' Dennis Gearon

Re: My Plan to Scale Solr

2011-02-17 Thread Grijesh
It's just a joke? - Thanx: Grijesh http://lucidimagination.com