Re: Can Master push data to slave

2011-08-15 Thread Pawan Darira
Regarding point b, i mean that when Slave server does a replication from Master, it creates a lock-file in it's index directory. How to avoid that? On Tue, Aug 9, 2011 at 2:56 AM, Markus Jelsma markus.jel...@openindex.iowrote: Hi, Hi I am using Solr 1.4. and doing a replication process

filtering non english text from my results

2011-08-15 Thread Omri Cohen
Hi All, I am looking for a solution to filter out text which contains non english words. Where my goal is to present my english speaking users with results in their language. any ideas? thanks Omri

Re: sorting issue with solr 3.3

2011-08-15 Thread Bernd Fehling
I have created an issue with test attached. https://issues.apache.org/jira/browse/SOLR-2713 Will try to figure out whats going wrong. Regards Bernd http://www.base-search.net/ Am 13.08.2011 16:20, schrieb Bernd Fehling: The issue was located in a 31 million docs index and i have already

Re: Unbuffered entity enclosing request can not be repeated Invalid chunk header

2011-08-15 Thread Markus Jelsma
Hi, Hi Markus, thanks for your answer. I'm using Solr. 4.0 and jetty now and observe the behavior and my error logs next week. tomcat can be a reason, we will see, i'll report. I'm indexing WITHOUT batches, one doc after another. But i would try out the batch indexing as well as retry

Re: Nutch related issue: URL Ignore

2011-08-15 Thread Markus Jelsma
The Solr list is not the appropriate list to ask. Please try the Nutch user mailing list. hi i am using nutch 1.2. in my crawl-urlfilter.txt, i am specifying URLs to be skipped. i am giving some patterns that need to be skipped but it is not working e.g.

Migration from Autonomy IDOL to SOLR

2011-08-15 Thread Arcadius Ahouansou
Hello. We have a couple of application running on half a dozen Autonomy IDOL servers. Currently, all feature we need are supported by Solr. We have done some internal testing and realized that SOLR would do a better job. So, we are investigation all possibilities for a smooth migration from

Re: strip html from data

2011-08-15 Thread Merlin Morgenstern
2011/8/11 Ahmet Arslan iori...@yahoo.com Is there a way to strip the html tags completly and not index them? If not, how to I retrieve the results without html tags? How do you push documents to solr? You need to strip html tags before the analysis chain. For example, if you are using

Invalid Date String for highlighting any date field match

2011-08-15 Thread baronDodd
I must be missing something.. It appears to me with solr 3.2 and 3.3 if you highlight on a date field (e.g by searching on *:*) the application blows up with: ERROR org.apache.solr.core.SolrCore - org.apache.solr.common.SolrException: Invalid Date String:'1306406051000' at

A strange Exception in Solr 1.4

2011-08-15 Thread weiwei fu
java.lang.NullPointerException HI. I meet a NullPointerException in Solr 1.4 . The params is params={q=s_id:112511+AND+b_id:332133defType=lucene} status=500 QTime=1} 2011-08-15 10:31:24,968 ERROR [org.apache.solr.core.SolrCore] - java.lang.NullPointerException at

RE: filtering non english text from my results

2011-08-15 Thread Jaeger, Jay - DOT
1. Find a dictionary with the English words you find acceptable 2. Use the KeepWordFilterFactory (doc in the AnalyzerTTokenizersTokenFilters Wiki page). -Original Message- From: Omri Cohen [mailto:omri...@gmail.com] Sent: Monday, August 15, 2011 1:23 AM To:

Re: parsing many documents takes too long

2011-08-15 Thread Erik Hatcher
Sounds like you aren't using SolrJ, which will return a Java object back to you natively. Give that a try and let us know how it fairs against the jaxb method. Erik On Aug 12, 2011, at 02:58 , Tri Nguyen wrote: Hi, My results from solr returns about 982 documents and I use jaxb

RE: ideas for indexing large amount of pdf docs

2011-08-15 Thread Jaeger, Jay - DOT
Note on i: Solr replication provides pretty good clustering support out-of-the-box, including replication of multiple cores. Read the Wiki on replication (Google +solr +replication if you don't know where it is). In my experience, the problem with indexing PDFs is it takes a lot of CPU on

Re: Exception DirectSolrSpellChecker when using spellcheck.q

2011-08-15 Thread Robert Muir
what subversion revision are you using? I think you just need to svn up, as from the line number I can tell its before I fixed this bug in trunk :) On Fri, Aug 12, 2011 at 11:36 AM, O. Klein kl...@octoweb.nl wrote: Spellchecker works fine, but when using spellcheck.q it gives following

Solr + Arabic Search

2011-08-15 Thread Rohit
I am trying to search Arabic keyword in solr, but am just unable to do so. I have successfully indexed Arabic but the search doesn’t seem to be working, Search URL: http://localhost:8080/solr/tw/select/?q=%D8%AA%D8%A3%D8%AC%D9%8A%D8%B1%20%D8%A7%D9%84%D8%A7%D9%87%D9%84%D9%8A The

SolrJ and ContentStreams

2011-08-15 Thread Marcus Paradies
Hi I'm considering to use SolrJ to run queries in a MLT fashion against my Solr server. I saw that there is already an open bug filed in Jira (https://issues.apache.org/jira/browse/SOLR-1085). My questions is: Is it possible to use content streams to pass a data stream to the MLT handler in

Re: Solr + Arabic Search

2011-08-15 Thread Ahmet Arslan
I am trying to search Arabic keyword in solr, but am just unable to do so. I have successfully indexed Arabic but the search doesn’t seem to be working, Could it be URI encoding of your servlet container? http://wiki.apache.org/solr/SolrTomcat#URI_Charset_Config Does 'match all docs query'

Minimum score filter

2011-08-15 Thread Donald J. Organ IV
Is there a way to set a minimum score requirement so that matches below a given score are not return/included in facet counts.

RE: Solr + Arabic Search

2011-08-15 Thread Rohit
Thanks Ahmet, this was the problem I guess. Regards, Rohit Mobile: +91-9901768202 About Me: http://about.me/rohitg -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com] Sent: 15 August 2011 22:20 To: solr-user@lucene.apache.org Subject: Re: Solr + Arabic Search I am trying

Re: Migration from Autonomy IDOL to SOLR

2011-08-15 Thread Alexei Martchenko
This might be a longshot but... Adobe is deprecating Verity in Coldfusion engine. Version 9 has both databases but I believe CF10 will only have Solr bundled. Idol is the-new-verity since autonomy acquired verity. Although Adobe wraps solr to work like old verity, there might be some info on

Re: Minimum score filter

2011-08-15 Thread simon
The absolute value of a relevance score doesn't have a lot of meaning and the range of scores can vary a lot depending on any boost you may apply. Even if you normalize them (say on a 1-100 scale where 100 is the max relevance) you can't really draw any valid conclusions from those values. It

Re: Minimum score filter

2011-08-15 Thread Donald J. Organ IV
OK I am doing a search using the following fields name^2.0 code^1.8 cat_search^1.5 description^0.8 I am searching for: free range dog nips I am getting back 2 documents the first is the document I am looking for, and contains those works in the name field, as the name field is Free

Re: Tomcat7 with Solr closes at fixed hours, every time another hour

2011-08-15 Thread Chris Hostetter
: : I'm having a Solr running within Tomcat7 and Tomcat is closing at : fixed hours, everytime is a different hour. catalina.log doesn't show : anything other than a clean tomcat shutdown (no exception or : anything). I would really apreciate some advice on how to debug this. : Tomcat doesn't

Re: Why is boost not always listed in explain when debug is on?

2011-08-15 Thread Chris Hostetter
: using Solr Specification Version: 4.0.0.2011.08.09.11.02.13 : : While trying understand scoring I noticed that boost is intermittently : displayed in the explain. For example, using edismax and the query string is Hmmm... that output is strange. it's not just the boost that's missing, all

Re: Migration from Autonomy IDOL to SOLR

2011-08-15 Thread Arcadius Ahouansou
Hi Alexei. I had a quick look and it seems that Adobe provides their CF tag as a wrapper around the verity/solr API, therefore, the application code is not poluated with client specific API. This makes app migration easier. Thanks for the input. Arcadius. On Mon, Aug 15, 2011 at 6:46 PM,

Indexing from a database via SolrJ

2011-08-15 Thread Shawn Heisey
Is there a simple way to get all the fields from a jdbc resultset into a bunch of SolrJ documents, which I will then send to be indexed in Solr? I would like to avoid the looping required to copy the data one field at a time. Copying it one document at a time would be acceptable, but it

Re: defType argument weirdness

2011-08-15 Thread Chris Hostetter
: Huh, I'm still not completely following. I'm sure it makes sense if you : understand the underlying implemetnation, but I don't understand how 'type' : and 'defType' don't mean exactly the same thing, just need to be expressed : differently in different location. ... : prefixing def to

Product data schema question

2011-08-15 Thread Steve Cerny
I'm working on an online eCommerce project and am having difficulties building the core / index schema. Here is the way we organize our product information in a normalized database. A product model has many SKUs (called colorways) A SKU has many sizes (called variants) A SKU size has associated

hl.useFastVectorHighlighter, fragmentsBuilder and HighlightingParameters

2011-08-15 Thread Alexei Martchenko
I'm having some trouble trying to upgrade my old hightligher from highlightingfragmenterformatter format (1.4 version, default config in the solr website) to the new Fast Vector highlighter. I'm using SOLR 3.3.0 with luceneMatchVersionLUCENE_33/luceneMatchVersion in config In my solrconfig.xml i

Re: Indexing from a database via SolrJ

2011-08-15 Thread Arcadius Ahouansou
Hi Shawn. Unles you are doing complex pre-processing before indexing, you may want to have a look at: http://wiki.apache.org/solr/DataImportHandler#Usage_with_RDBMS That should take care of it without any coding. You may need to periodically do a HTTP GET to trigger the import. Arcadius. On

Score

2011-08-15 Thread Bill Bell
How do I change the score to scale it between 0 and 100 irregardless of the score? q.alt=*:*bq=lang:SpanishdefType=dismax Bill Bell Sent from mobile

Re: Score

2011-08-15 Thread Shashi Kant
https://wiki.apache.org/lucene-java/ScoresAsPercentages On Mon, Aug 15, 2011 at 8:13 PM, Bill Bell billnb...@gmail.com wrote: How do I change the score to scale it between 0 and 100 irregardless of the score? q.alt=*:*bq=lang:SpanishdefType=dismax Bill Bell Sent from mobile

Re: Indexing from a database via SolrJ

2011-08-15 Thread Shawn Heisey
On 8/15/2011 5:55 PM, Arcadius Ahouansou wrote: Hi Shawn. Unles you are doing complex pre-processing before indexing, you may want to have a look at: http://wiki.apache.org/solr/DataImportHandler#Usage_with_RDBMS That should take care of it without any coding. You may need to periodically do