Re: when to use Fieldnorm ??

2013-10-15 Thread Karan jindal
Thanks, Shawn, for the quick insight. I will look into this further and share my experience. Thanks, Karan Jindal On Tue, Oct 15, 2013 at 12:10 AM, Shawn Heisey s...@elyograg.org wrote: On 10/14/2013 3:05 AM, Karan jindal wrote: Is there a standard way of checking to know whether

Personalization

2013-10-15 Thread Pawel
Hi, I'd like to prepare some personalized search. Let's say there is a user vector which describes a long-term profile of the user. There are some values which rank parameters used in queries, and there is a function which uses the parameters to compute a dynamic ranking (parameters are just index fields

Re: ClusteringComponent under Tomcat 7

2013-10-15 Thread ravi koshal
Lieberman, Ariel Ariel.Lieberman at verint.com writes: Hi, I'm trying to run Solr 4.3 (and 4.4) with -Dsolr.clustering.enabled=true I've copied all relevant jars to the ./lib directory under the instance. With Jetty it runs OK! But under Tomcat I receive the error (exception)

Re: Concurent indexing

2013-10-15 Thread maephisto
Thanks for the tip! I must mention that I am using Solr 4.4.0 and this problem only appears when I'm doing the indexing in the SolrCloud configuration deployed on standalone Jetty 9.0.6. When I do the same operations on a modified example in Solr 4.4.0 with embedded Jetty, indexing to a simple

Span Not Queries

2013-10-15 Thread Ankit Kumar
I need to add Span Not queries in Solr. There's a parser, the Surround Query Parser; I went through this ( http://lucene.472066.n3.nabble.com/Surround-query-parser-not-working-td4075066.html) and discovered that the surround query parser does not analyze text. Does DisMaxQueryParser support SpanNot queries

Re: Please any idea? Highlighting exact phrases with solr

2013-10-15 Thread Silvia Suárez
Dear All and Bryan, Thanks a lot for your help. After reindexing the data, I got it working! :) Thanks again for your time, Sil

Debugging update request

2013-10-15 Thread maephisto
As a followup to another thread, where I described how my SolrCloud sometimes just stops accepting updates http://lucene.472066.n3.nabble.com/Concurent-indexing-td4095409.html I have a question, is there a way to debug or analyze the update request? Verbose output or anything else? It happens

Check the collection status ?

2013-10-15 Thread xinwu
Hi all, I want to know how to check the collection status using SolrJ in SolrCloud. PS: I create one independent collection every day; the name looks like collection-20131015. By the way, where can I find the SolrJ API documentation? Thank you. -- View this message

Re: Cores with lot of folders with prefix index.XXXXXXX

2013-10-15 Thread primoz . skale
I have a question for the developers of Solr regarding the issue of left-over index folders when replication fails. Could this issue be resolved quickly if, when replication starts, Solr creates a flag file in the index. folder, and when replication ends (and commits), this file is deleted? In this
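The flag-file idea proposed in this message can be sketched roughly as follows. This is only an illustration of the proposal in plain Java, not Solr's replication code: the class name, the marker file name `replication.inprogress`, and the temporary directory used in `demo()` are all assumptions made for the example.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

final class ReplicationMarkerSketch {
    // Hypothetical marker file name; not a name that Solr itself uses.
    static final String MARKER = "replication.inprogress";

    // Would be called when a replication run starts writing into indexDir.
    static void begin(Path indexDir) {
        try {
            Files.createDirectories(indexDir);
            Files.createFile(indexDir.resolve(MARKER));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Would be called only after replication has finished and committed.
    static void finish(Path indexDir) {
        try {
            Files.deleteIfExists(indexDir.resolve(MARKER));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // A folder that still holds the marker was left behind by a failed run.
    static boolean isLeftOver(Path indexDir) {
        return Files.exists(indexDir.resolve(MARKER));
    }

    // Walks through one successful run in a temporary folder.
    static String demo() {
        try {
            Path dir = Files.createTempDirectory("index.");
            begin(dir);
            boolean during = isLeftOver(dir);
            finish(dir);
            boolean after = isLeftOver(dir);
            Files.delete(dir);
            return during + "," + after;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

Under this scheme, a cleanup pass at startup could treat any index folder for which `isLeftOver` returns true as safe to delete, since a completed replication would have removed the marker.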

SPLITSHARD not working in SOLR-4.4.0

2013-10-15 Thread RadhaJayalakshmi
Hi All, For POC purposes, I just brought up a Tomcat-Solr cluster with a Zookeeper ensemble of 3 nodes. In one of my collections, I have only one shard, with two replicas. I just want to split this shard in two, so that each split shard will have two replicas (including the master

Re: SPLITSHARD not working in SOLR-4.4.0

2013-10-15 Thread Yago Riveiro
What is the size of the index? If the index is too big, the SPLITSHARD operation will time out and will not finish -- Yago Riveiro Sent with Sparrow (http://www.sparrowmailapp.com/?sig) On Tuesday, October 15, 2013 at 10:36 AM, RadhaJayalakshmi wrote: Hi All, For POC

Re: Concurent indexing

2013-10-15 Thread Chris Geeringh
I can confirm I am seeing the same issue with Tomcat - cluster split over 4 nodes. Is this fix in a 4.6 snapshot? On 15 October 2013 08:28, maephisto my_sky...@yahoo.com wrote: Thanks for the tip! I must mention that I am using Solr 4.4.0 and this problem only appears when i'm doing the

Re: Concurent indexing

2013-10-15 Thread maephisto
Hi Chris! Could you describe your problem, how similar is it to mine? Also, on which version of Solr are you encountering it? -- View this message in context: http://lucene.472066.n3.nabble.com/Concurent-indexing-tp4095409p4095630.html Sent from the Solr - User mailing list archive at

Re: Profiling Solr Lucene for query

2013-10-15 Thread Manuel Le Normand
I tried my last proposition: editing the clusterstate.json to add a dummy frontend shard seems to work. I made sure the ranges were not overlapping. Doesn't that resolve the SolrCloud issue described above?

[Indexing XML files in Solr with DataImportHandler]

2013-10-15 Thread kujta1
Hello, I have problems with indexing the XML file format. My solrconfig, data-config and solr files are here: http://stackoverflow.com/questions/19337979/indexing-xml-files-in-solr-with-dataimporthandler Can somebody help me understand why this is not working? Thank you. -- View this message in context:

Fuzzy Logic

2013-10-15 Thread kumar
Hi, I have a situation where results have to be displayed even if the user has spelled the query wrongly. I am using my query in the following way: http://localhost:2012/solr/sulekha/NewRentals/suggest?s=royal+enifld+bikes1=royal~0.7+enifld~0.7+bike~0.7 First it has to check with s; if exact

Solr 4.4 with log4j and multiple indexes on tomcat 6

2013-10-15 Thread Russell Taylor
Hi, My problem is that all my indexes log to one log file, but I want each index to log to its own log file. I'm using Solr 4.4 and I've copied jcl-over-slf4j-1.6.6.jar, jul-to-slf4j-1.6.6.jar, log4j-1.2.16.jar, slf4j-api-1.6.6.jar and slf4j-log4j12-1.6.6.jar into my Tomcat's lib/ directory.

Re: Solr 4.4 with log4j and multiple indexes on tomcat 6

2013-10-15 Thread Otis Gospodnetic
Hi Russ, It's not really indexes that log, but Solr running in Tomcat, so I don't think there's a way... Otis Solr ElasticSearch Support http://sematext.com/ On Oct 15, 2013 7:14 AM, Russell Taylor russell.tay...@interactivedata.com wrote: Hi, My problem is that all my indexes log to one

Re: Concurent indexing

2013-10-15 Thread michael.boom
Here are some of Solr's last words (log content before it stopped accepting updates); maybe someone can help me interpret them. http://pastebin.com/mv7fH62H -- View this message in context: http://lucene.472066.n3.nabble.com/Concurent-indexing-tp4095409p4095642.html Sent from the Solr - User

Solr 4.4 : using SolrCloud, on reconnection to zookeeper, core sometimes goes down, never coming back alive

2013-10-15 Thread Andre Bois-Crettez
Hello all, We had this problem twice in 4 days, only in one of our 14 servers (2 shards 7 replicas) in Solr 4.4 : after successful re-connection to Zookeeper (triggered by Connection expired - starting a new one), sometimes the core stays down without coming back, and we have to restart the

Re: when to use Fieldnorm ??

2013-10-15 Thread Upayavira
I would say, if index size is not an issue, there's merit in indexing a field twice, once with these turned off, once with them turned on. That gives you the chance to choose at query time without major re-engineering efforts for your indexer code. Upayavira On Tue, Oct 15, 2013, at 07:08 AM,

Fuzzy Logic Implementation

2013-10-15 Thread kumar
Hi, I have a situation where results have to be displayed even if the user has spelled the query wrongly. I am using my query in the following way: http://localhost:2012/solr/suggest?s=royal+enifld+bikes1=royal~0.7+enifld~0.7+bike~0.7 First it has to check with s; if exact results are not found

field title_ngram was indexed without position data; cannot run PhraseQuery

2013-10-15 Thread MC
Hello, Could someone explain (or perhaps provide a documentation link for) what the following error means: field title_ngram was indexed without position data; cannot run PhraseQuery I'll do some more searching online; I was just wondering if anyone has encountered this error before, and

Re: Replace NULL with 0 while Indexing

2013-10-15 Thread keshari.prerna
Thank you everyone, I think COALESCE(duration, 0) and ISNULL will solve my problem. -- View this message in context: http://lucene.472066.n3.nabble.com/Replace-NULL-with-0-while-Indexing-tp4095059p4095678.html Sent from the Solr - User mailing list archive at Nabble.com.

Regarding Solr Cloud issue...

2013-10-15 Thread Chris
Hi, I am using Solr 4.4 in cloud mode. While creating shards I see that the last shard has a range of null. I am not sure if this is a bug. I am stuck with a null value for the range in clusterstate.json (attached below) "shard5":{ "range":null, "state":"active", "replicas":{"core_node1":{ "state":"active",

Solr postFilter

2013-10-15 Thread Muhzin
Hi, I am developing a post filter implementation in Solr 4.2.1. The following is a gist of my implementation: https://gist.github.com/rmuhzin/6991544 I deployed my plugin in the library folder and updated the solrconfig.xml: <queryParser name="lastLoginParser" class="com.m4marry.PostQParserPlugin"

Re: Regarding Solr Cloud issue...

2013-10-15 Thread Shalin Shekhar Mangar
How did you create these shards? Can you tell us how to reproduce the issue? Any shard in a collection with compositeId router should never have null ranges. On Tue, Oct 15, 2013 at 7:07 PM, Chris christu...@gmail.com wrote: Hi, I am using solr 4.4 as cloud. while creating shards i see that

Re: Check the collection status ?

2013-10-15 Thread Shawn Heisey
On 10/15/2013 2:55 AM, xinwu wrote: I want to know how to check the collection status using SolrJ in SolrCloud. PS: I create one independent collection every day; the name looks like collection-20131015. By the way, where can I find the SolrJ API documentation

Re: Regarding Solr Cloud issue...

2013-10-15 Thread Chris
Hi Shalin, Thank you for your quick reply. I appreciate all the help. I started the SolrCloud servers first, with 5 nodes. Then I issued a command like the one below to create the shards -

Re: field title_ngram was indexed without position data; cannot run PhraseQuery

2013-10-15 Thread Jason Hellman
If you consider what n-grams do this should make sense to you. Consider the following piece of data: White iPod If the field is fed through a bigram filter (n-gram with size of 2) the resulting token stream would appear as such: wh hi it te ip po od The usual use of n-grams is to match
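The bigram token stream described in this message can be reproduced with a small sketch. It is plain Java rather than Lucene's own n-gram filter, and it assumes (as the quoted output implies) that the text is lowercased and split on whitespace before the bigrams are cut; the class and method names are made up for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

final class BigramSketch {
    // Lowercases the text, splits it on whitespace, and emits every
    // two-character window of each word, in order.
    static List<String> bigrams(String text) {
        List<String> tokens = new ArrayList<>();
        for (String word : text.toLowerCase(Locale.ROOT).trim().split("\\s+")) {
            for (int i = 0; i + 2 <= word.length(); i++) {
                tokens.add(word.substring(i, i + 2));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Prints: wh hi it te ip po od
        System.out.println(String.join(" ", bigrams("White iPod")));
    }
}
```

Each source word turns into several adjacent tokens, which is why a query parser that auto-generates phrase queries needs position data on such a field.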

Re: Solr 4.4 with log4j and multiple indexes on tomcat 6

2013-10-15 Thread Shawn Heisey
On 10/15/2013 5:13 AM, Russell Taylor wrote: My problem is that all my indexes log to one log file but I want each index to log to their own log file. I'm using solr 4.4 and I've copied jcl-over-slf4j-1.6.6.jar, jul-to-slf4j-1.6.6.jar, log4j-1.2.16.jar, slf4j-api-1.6.6.jar and

loading SolrInfoMBeanHandler is slow?

2013-10-15 Thread Shinichiro Abe
Hi, On my Mac OS X, when starting Solr OOTB, I always have to wait 30 sec for it to finish loading. It seems that loading SolrInfoMBeanHandler is slow in Solr 4.x (also 4.5). Does anyone have the same problem? log: DEBUG - 2013-10-16 00:51:14.144;

Re: field title_ngram was indexed without position data; cannot run PhraseQuery

2013-10-15 Thread Jack Krupansky
Show us the field and field type from your schema. Likely you are omitting position info for the field, and the field type has autoGeneratePhraseQueries=true - the ngram analyzer generates a sequence of terms for a single source term and then the query parser generates a PhraseQuery for that

Re: loading SolrInfoMBeanHandler is slow?

2013-10-15 Thread Stefan Matheis
Shinichiro, Perhaps I don't see it, but nowhere in your log is there anything related to this handler? For me it looks like this: Oct 15, 2013 4:36:47 PM org.apache.solr.core.SolrCore execute INFO: [collection1] webapp=/solr path=/admin/mbeans params={stats=true&wt=json} status=0 QTime=3 Stefan

Re: Span Not Queries

2013-10-15 Thread Jack Krupansky
Nope. But the LucidWorks Search product query parser does support SpanNot if you use their BEFORE, AFTER, and NEAR span operators. See: http://docs.lucidworks.com/display/lweug/Proximity+Operations For example: George BEFORE:2 Bush NOT H to match George anything Bush, but not George H. W.

Re: loading SolrInfoMBeanHandler is slow?

2013-10-15 Thread Shinichiro Abe
Thanks Stefan, It occurs when booting Solr, before opening the Admin GUI, so a handler such as /admin/mbeans is not called, although I checked the log details... Shinichiro On 2013/10/16, at 1:38, Stefan Matheis wrote: Shinichiro, Perhaps I don't see it, but nowhere in your log is something

Re: loading SolrInfoMBeanHandler is slow?

2013-10-15 Thread Chris Hostetter
: In my Mac OSX when starting Solr as OOTB, : I always have to wait 30 sec for completely loading. : It seems that loading SolrInfoMBeanHandler is slow in Solr 4.x (also 4.5). Are you certain the problem is coming from SolrInfoMBeanHandler and not one of the other admin handler related classes?

Re: Span Not Queries

2013-10-15 Thread Ankit Kumar
I have a business use case in which I need to use Span Not and other ordered proximity queries, and they can be nested to any level: a Boolean inside an ordered query, or an ordered query inside a Boolean. Currently I am thinking of changing the QueryParser.jj file to identify the SpanNot query and

Re: SolrDocumentList - bitwise operation

2013-10-15 Thread Erick Erickson
Why do you think a bitset would help? Bitsets have a bit set on for every document that matches based on the _internal_ Lucene document ID, it has nothing to do with the uniqueKey you have defined. Nor does it have anything to do with the foreign key relationship. So either I don't understand the

Re: fq caching question

2013-10-15 Thread Erick Erickson
Tim: Do watch out for using NOW in filter queries, see: http://searchhub.org/2012/02/23/date-math-now-and-filter-queries/ Best, Erick On Tue, Oct 15, 2013 at 5:36 AM, Tim Vaillancourt t...@elementspace.com wrote: Thanks Koji! Cheers, Tim On 14/10/13 03:56 PM, Koji Sekiguchi wrote: Hi

Re: loading SolrInfoMBeanHandler is slow?

2013-10-15 Thread Shinichiro Abe
Thanks Hoss, My system seems to have had a bad DNS setting. I used Google Public DNS instead, and then Solr started up quickly. I'll try to verify the DNS configuration properly. SolrInfoMBeanHandler was not slow, but SystemInfoHandler.init() before SolrInfoMBeanHandler was slow, I think. So

Re: Debugging update request

2013-10-15 Thread Erick Erickson
What version of Solr? There's a known issue with distributed deadlock (since fixed) where under heavy load from multiple clients to multiple Solr nodes, you can get into a situation where it will stop accepting updates, but _will_ continue to serve queries. If this describes your situation, dump

Switching indexes

2013-10-15 Thread Christopher Gross
In Solr 3.x, whenever I'd reindex content, I'd fill up one instance, copy the whole data directory over to the second (or third) instance and then restart that Tomcat to get the indexes lined up. With Solr 4.1, I'm guessing that I can't go and do that without taking down all of my nodes and

Re: Switching indexes

2013-10-15 Thread Shawn Heisey
On 10/15/2013 12:36 PM, Christopher Gross wrote: In Solr 3.x, whenever I'd reindex content, I'd fill up one instance, copy the whole data directory over to the second (or third) instance and then restart that Tomcat to get the indexes lined up. With Solr 4.1, I'm guessing that I can't go and do

Re: Adding field to solr dynamically

2013-10-15 Thread Mysurf Mail
Thanks. On Sun, Oct 13, 2013 at 4:18 PM, Jack Krupansky j...@basetechnology.com wrote: Either simply use a dynamic field, or use the Schema API to add a static field: https://cwiki.apache.org/confluence/display/solr/Schema+API

Re: Switching indexes

2013-10-15 Thread Christopher Gross
Thanks Shawn. I have 3 Solr nodes (and 5 ZK nodes). For #1, would I have to do that on all of them? For #2, I'm not getting the auto-replication between node 1 and nodes 2 & 3 for my new index. I have 2 indexes -- just call them index and indexbk (bk being the backup containing the full data

Re: Fuzzy Logic Implementation

2013-10-15 Thread Developer
Use the spell check component with collation. Example: http://localhost:8983/solr/spell?q=delll ultrashar&spellcheck=true&spellcheck.extendedResults=true&spellcheck.collate=true http://wiki.apache.org/solr/SpellCheckComponent -- View this message in context:

Re: Switching indexes

2013-10-15 Thread Shawn Heisey
On 10/15/2013 2:17 PM, Christopher Gross wrote: I have 3 Solr nodes (and 5 ZK nodes). For #1, would I have to do that on all of them? For #2, I'm not getting the auto-replication between node 1 and nodes 2 & 3 for my new index. I have 2 indexes -- just call them index and indexbk (bk being the

Re: DIH - stream file with solrEntityProcessor

2013-10-15 Thread Josh Lincoln
Ultimately I just temporarily increased the memory to handle this data set, but that won't always be practical. I did try the CSV export/import and it worked well in this case. I hadn't considered it at first. I am wary that the escaping and splitting may be problematic with some data sets, so

Re: Check the collection status ?

2013-10-15 Thread xinwu
I really, really appreciate your reply. Thank you. -- View this message in context: http://lucene.472066.n3.nabble.com/Check-the-collection-status-tp4095620p4095756.html Sent from the Solr - User mailing list archive at Nabble.com.

RegexReplaceProcessorFactory replacement string support for match groups

2013-10-15 Thread Ken Krugler
Hi Hoss, In RegexReplaceProcessorFactory, this line means that you can't use match groups in the replacement string: replacement = Matcher.quoteReplacement(replacementParam.toString()); What's the reasoning behind this? Or am I missing something here, and groups can be used? It's making
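The effect of `Matcher.quoteReplacement` that this message asks about can be seen in isolation with a short sketch. It uses only `java.util.regex` and `String.replaceAll`, not Solr's update processor; the pattern, the input, and the class name are assumptions chosen for the illustration.

```java
import java.util.regex.Matcher;

final class QuoteReplacementSketch {
    // Unquoted replacement: $1 and $2 refer to the captured groups.
    static String swapWithGroups(String input) {
        return input.replaceAll("(\\w+)-(\\w+)", "$2-$1");
    }

    // Quoted replacement: the dollar signs are escaped, so the
    // replacement text is inserted literally.
    static String swapQuoted(String input) {
        return input.replaceAll("(\\w+)-(\\w+)", Matcher.quoteReplacement("$2-$1"));
    }

    public static void main(String[] args) {
        System.out.println(swapWithGroups("foo-bar")); // prints bar-foo
        System.out.println(swapQuoted("foo-bar"));     // prints $2-$1
    }
}
```

Quoting is the safe default when the replacement is meant as literal text that may contain `$` or `\`, at the cost of making group references unavailable.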

DIH

2013-10-15 Thread Bill Bell
We have a custom Field processor in DIH and we are not CPU bound on one core... How do we thread it ?? We need to use more cores The box has 32 cores and 1 is 100% CPU bound. Ideas ? Bill Bell Sent from mobile

Re: DIH

2013-10-15 Thread Bill Bell
We are NOW CPU bound Thoughts ??? Bill Bell Sent from mobile On Oct 15, 2013, at 8:49 PM, Bill Bell billnb...@gmail.com wrote: We have a custom Field processor in DIH and we are not CPU bound on one core... How do we thread it ?? We need to use more cores The box has 32 cores

Re: RegexReplaceProcessorFactory replacement string support for match groups

2013-10-15 Thread Jack Krupansky
I filed a Jira on this as well back in May: https://issues.apache.org/jira/browse/SOLR-4864 -- Jack Krupansky -Original Message- From: Ken Krugler Sent: Tuesday, October 15, 2013 10:34 PM To: solr-user@lucene.apache.org Cc: Chris Hostetter Subject: RegexReplaceProcessorFactory

Re: Fuzzy Logic Implementation

2013-10-15 Thread kumar
Hi, I have 4 different scenarios to get the results as per my requirement. I am using custom request handlers in the following way: <requestHandler name="/suggest" class="com.abc.solr.handler.MySearchHandler"> <arr name="components"> <str>query</str> <str>debug</str> </arr> <lst name="defaults"> <str

Re: SPLITSHARD not working in SOLR-4.4.0

2013-10-15 Thread Shalin Shekhar Mangar
That is strange. Do you have a uniqueKey defined in the schema? Is it marked as required? On Tue, Oct 15, 2013 at 3:06 PM, RadhaJayalakshmi rlakshminaraya...@inautix.co.in wrote: Hi All, For POC purposes, I just brought up a Tomcat-Solr cluster with a Zookeeper ensemble of 3 nodes. In one of my

Re: prepareCommit vs Commit

2013-10-15 Thread Shalin Shekhar Mangar
The IndexWriter javadocs describe it pretty well: http://lucene.apache.org/core/4_5_0/core/org/apache/lucene/index/IndexWriter.html#prepareCommit() My limited understanding is that new segments are opened on indexing requests if prepareCommit has already been called and/or the segments are ready

Re: Cores with lot of folders with prefix index.XXXXXXX

2013-10-15 Thread Shalin Shekhar Mangar
I think that's an acceptable strategy. Can you put up a patch? On Tue, Oct 15, 2013 at 2:32 PM, primoz.sk...@policija.si wrote: I have a question for the developers of Solr regarding the issue of left-over index folders when replication fails. Could this issue be resolved quickly if, when

Re: [Indexing XML files in Solr with DataImportHandler]

2013-10-15 Thread Shalin Shekhar Mangar
What is not working? Are you seeing any exceptions in the logs? On Tue, Oct 15, 2013 at 3:53 PM, kujta1 kujtim.rahm...@gmail.com wrote: Hello, I have problems with indexing the XML file format. My solrconfig, data-config and solr files are here

Re: DIH

2013-10-15 Thread Otis Gospodnetic
Hi Push docs from the outside with multithreaded indexer instead? Otis Solr ElasticSearch Support http://sematext.com/ On Oct 15, 2013 10:49 PM, Bill Bell billnb...@gmail.com wrote: We have a custom Field processor in DIH and we are not CPU bound on one core... How do we thread it ?? We need

Re: SPLITSHARD not working in SOLR-4.4.0

2013-10-15 Thread RadhaJayalakshmi
Thanks for the response! Yes, I have defined a unique key in the schema. Still, it is throwing the same error. Is SPLITSHARD a new feature that is still under development in Solr 4.4? Has anyone been able to split shards using SPLITSHARD successfully? -- View this message in context:

Re: Regarding Solr Cloud issue...

2013-10-15 Thread Shalin Shekhar Mangar
I'm sorry I am not able to reproduce this issue. I started 5 solr-4.4 instances. I copied example directory into example1, example2, example3 and example4 cd example; java -Dbootstrap_confdir=./solr/collection1/conf -Dcollection.configName=myconf -DzkRun -DnumShards=1 -jar start.jar cd example1;

Re: SPLITSHARD not working in SOLR-4.4.0

2013-10-15 Thread Shalin Shekhar Mangar
Just to be clear, you had a required uniqueKey defined in the schema before you indexed any document, is that correct? It is possible to have a NPE in that line if there is an empty segment or if there are documents but no fields! I'm curious to understand how you ended up with an index like