Re: External File Field eating memory

2014-07-03 Thread Shalin Shekhar Mangar
This might be related: https://issues.apache.org/jira/browse/SOLR-3514 On Sat, Jun 28, 2014 at 5:34 PM, Kamal Kishore Aggarwal kkroyal@gmail.com wrote: Hi Team, I have recently implemented EFF in Solr. There are about 1.5 lakh (unsorted) values in the external file. After this

Error in replication eff files

2014-07-03 Thread Kamal Kishore Aggarwal
Dear Team, I am working on the external file field, but I do not know how to configure replication of the EFF files. This is the master replication configuration: core/conf/solrconfig.xml <requestHandler name="/replication" class="solr.ReplicationHandler"> <lst name="master"> <str

Re: External File Field eating memory

2014-07-03 Thread Kamal Kishore Aggarwal
Thanks for your guidance, Alexandre Rafalovitch. I am looking into this seriously. Another question: I am facing an error when replicating the EFF file. This is the master replication configuration: core/conf/solrconfig.xml <requestHandler name="/replication" class="solr.ReplicationHandler"> <lst

I am on LinkedIn and I recommend this site to you as well.

2014-07-03 Thread benjelloun
Hello Alexandre, I would like you to join my network to share news and stay in touch. anass Accept: https://www.linkedin.com/e/v2?e=z6nfen-hx5qhjgq-5ra=preRegInvitetracking=eml-guest-invite-ctaek=invite_guestinvitationID=5890360658662281216sharedKey=eBz8oxMp You

I would like to invite you to join my professional network on LinkedIn.

2014-07-03 Thread benjelloun
Hello Erick, I would like you to join my network to share news and stay in touch. anass Accept: https://www.linkedin.com/e/v2?e=-u5fqpj-hx5qhjg5-70a=preRegInvitetracking=eml-guest-invite-ctaek=invite_guestinvitationID=5890360658087665665sharedKey=CJxCbD_4 You

I use LinkedIn and you should use it too.

2014-07-03 Thread benjelloun
Hello Steve, I would like you to join my network to share news and stay in touch. anass Accept: https://www.linkedin.com/e/v2?e=-k0w1w5-hx5qhjfn-3ja=preRegInvitetracking=eml-guest-invite-ctaek=invite_guestinvitationID=5890360657726947328sharedKey=v4df5Q_O You

Join my network on LinkedIn

2014-07-03 Thread benjelloun
Hello Alexandre, I would like you to join my network to share news and stay in touch. anass Accept: https://www.linkedin.com/e/v2?e=1p1dkj-hx5qhjh3-5ka=preRegInvitetracking=eml-guest-invite-ctaek=invite_guestinvitationID=5890360658481917952sharedKey=iwQz292V You

I would like to invite you to join my professional network on LinkedIn.

2014-07-03 Thread benjelloun
Hello Walter, I would like you to join my network to share news and stay in touch. anass Accept: https://www.linkedin.com/e/v2?e=f7uf5h-hx5qhjgx-4ca=preRegInvitetracking=eml-guest-invite-ctaek=invite_guestinvitationID=5890360658842640384sharedKey=ln36HRyT You

I would like to invite you to join my professional network on LinkedIn.

2014-07-03 Thread benjelloun
Hello Alexandre, I would like you to join my network to share news and stay in touch. anass Accept: https://www.linkedin.com/e/v2?e=ylp2ee-hx5qhjfy-2pa=preRegInvitetracking=eml-guest-invite-ctaek=invite_guestinvitationID=5890360657898921984sharedKey=aNAb4H5v You

View my LinkedIn profile

2014-07-03 Thread benjelloun
LinkedIn Daniel, I would like to invite you to join my online professional network on LinkedIn. anass anass benjelloun Intern at Document numérique, Dijon area, France. Please confirm that you know anass benjelloun:

Re: External File Field eating memory

2014-07-03 Thread Kamal Kishore Aggarwal
With the above replication configuration, the EFF file is replicated to core/conf/data/external_eff_views (a new data directory is created inside the conf directory), but it is not replicated to core/data/external_eff_views on the slave. Please help. On Thu, Jul 3, 2014 at 12:21 PM, Kamal

Re: OCR - Saving multi-term position

2014-07-03 Thread Charlie Hull
On 02/07/2014 15:19, Manuel Le Normand wrote: Hello, Many of our indexed documents are scanned and OCR'ed documents. Unfortunately we were not able to improve much the OCR quality (less than 80% word accuracy) for various reasons, a fact which badly hurts the retrieval quality. As we use an

Re: Customise score

2014-07-03 Thread rachun
Hi Jack, I just found out that the sort doesn't work right for the next page. The response shows numFound:23850. So I tried this: .../select?q=MacBook&sort=sum(base_score,query($q,0))+desc&wt=json&indent=true&start=100 The sum of base_score and the document sort on the next page should
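
The request in this message can be rebuilt programmatically so that the parameter separators and the function-query characters are encoded correctly. The following is only a sketch: the base URL is a hypothetical placeholder, and the parameter values are the ones quoted in the message.

```python
from urllib.parse import urlencode


def build_select_url(base_url, query, start=0):
    """Build a /select URL that sorts by base_score plus the query score.

    base_url is a hypothetical core URL; the parameters mirror the
    request quoted in the thread.
    """
    params = {
        "q": query,
        # Function sort: base_score field plus the score of the main query.
        "sort": "sum(base_score,query($q,0)) desc",
        "wt": "json",
        "indent": "true",
        # Offset of the first result on the requested page.
        "start": start,
    }
    return base_url + "/select?" + urlencode(params)


if __name__ == "__main__":
    # Hypothetical host and core name, for illustration only.
    print(build_select_url("http://localhost:8983/solr/collection1", "MacBook", start=100))
```

Letting urlencode handle the escaping avoids hand-writing the "+" and "$" characters in the sort expression.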

Dynamic field doesnt work

2014-07-03 Thread Anton Gavazuk
Hi dear SOLR community, I'm having a problem with a dynamic field in my schema: when I add any document with a key matching the dynamic field definition, I get back an error which says something like unknown field 'test_exattrs'. The schema contains this definition: dynamic_field

Re: Dynamic field doesnt work

2014-07-03 Thread Alexandre Rafalovitch
I would say something is misspelt somewhere. Put a dynamic field called '*' and see what you end up getting in the schema. Obviously, on a separate test instance. Regards, Alex. Personal website: http://www.outerthoughts.com/ Current project: http://www.solr-start.com/ - Accelerating your Solr

Query with many search items

2014-07-03 Thread Matt Stunfield
Hi, I'm new to Lucene/SOLR and I'm researching if SOLR would fit to our case requirements. I would be very happy if You could help me :). Environment: We have a database storing some (mostly) text information. There are elements containing multiple sections of information. Each section is stored

Re: Query with many search items

2014-07-03 Thread Alexandre Rafalovitch
Luwak may be relevant here (https://github.com/flaxsearch/luwak)? Or it may help to describe difference from Luwak's solution to further fine-tune your requirement. Regards, Alex. Personal website: http://www.outerthoughts.com/ Current project: http://www.solr-start.com/ - Accelerating your

Re: Dynamic field doesnt work

2014-07-03 Thread Jack Krupansky
s/dynamic_field/dynamicField/ -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Thursday, July 3, 2014 5:45 AM To: solr-user@lucene.apache.org Subject: Re: Dynamic field doesnt work I would say something is misspelt somewhere. Put a dynamic field called '*' and
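
To see why the misspelled element produces an "unknown field" error, here is a toy model in Python. It is not Solr's schema loader: it assumes that only elements named dynamicField are collected, and it approximates Solr's pattern matching with fnmatch. The pattern "*_exattrs" and the type "string" are hypothetical, since the original definition is cut off in the archive.

```python
import xml.etree.ElementTree as ET
from fnmatch import fnmatchcase

# Hypothetical schema fragments; only the element name differs.
BROKEN = '<schema><fields><dynamic_field name="*_exattrs" type="string"/></fields></schema>'
FIXED = '<schema><fields><dynamicField name="*_exattrs" type="string"/></fields></schema>'


def dynamic_patterns(schema_xml):
    """Collect the name patterns of all correctly spelled dynamicField elements."""
    root = ET.fromstring(schema_xml)
    return [element.get("name") for element in root.iter("dynamicField")]


def matches_dynamic_field(field_name, schema_xml):
    """Return True if field_name matches any dynamicField pattern (glob approximation)."""
    return any(fnmatchcase(field_name, pattern) for pattern in dynamic_patterns(schema_xml))


if __name__ == "__main__":
    print(matches_dynamic_field("test_exattrs", BROKEN))
    print(matches_dynamic_field("test_exattrs", FIXED))
```

In this model the misspelled element is simply never seen, so no pattern exists for 'test_exattrs' to match.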

Re: Query with many search items

2014-07-03 Thread Toke Eskildsen
On Thu, 2014-07-03 at 13:10 +0200, Matt Stunfield wrote: [Sections of text 200 words, dictionary of terms phrases] Task: find if terms occur in single section. Found terms must be distinguished. In query result there is (highlighted?) found terms positions. I agree with Alexandre that this

Question about solrcloud recovery process

2014-07-03 Thread Peter Keegan
I bring up a new Solr node with no index and watch the index being replicated from the leader. The index size is 12G and the replication takes about 6 minutes, according to the replica log (from 'Starting recovery process' to 'Finished recovery process'). However, shortly after the replication

Re: Dynamic field doesnt work

2014-07-03 Thread Alexandre Rafalovitch
So, I was right about misspelling, but totally missed the one staring me in the face :-) Hopefully, that was all. Personal website: http://www.outerthoughts.com/ Current project: http://www.solr-start.com/ - Accelerating your Solr proficiency On Thu, Jul 3, 2014 at 7:23 PM, Jack Krupansky

Re: Dynamic field doesnt work

2014-07-03 Thread Anton Gavazuk
Jack, thank you very much! So obvious - damn, I've mixed my codebase with the Solr schema XML :) 2014-07-03 14:23 GMT+02:00 Jack Krupansky j...@basetechnology.com: s/dynamic_field/dynamicField/

Re: Question about solrcloud recovery process

2014-07-03 Thread Mark Miller
I don’t know offhand about the num docs issue - are you doing NRT? As far as being able to query the replica, I’m not sure anyone ever got to making that fail if you directly query a node that is not active. It certainly came up, but I have no memory of anyone tackling it. Of course in many

Listening on SolrCloud events

2014-07-03 Thread Ugo Matrangolo
Hi, I would like to be informed as soon as a cluster event happens, such as a node dropping and/or starting a recovery process. What is the best way (if any) to listen for SolrCloud events? Ugo

Re: Question about solrcloud recovery process

2014-07-03 Thread Peter Keegan
No, we're not doing NRT. The search clients aren't using CloudSolrServer and they are behind an AWS load balancer, which calls the Solr ping handler (implemented with ClusterStateAwarePingRequestHandler) to determine when the node is active. This ping handler also responds during the index copy,

Re: Streaming large updates with SolrJ

2014-07-03 Thread Joe Gresock
Thanks Hoss, that's a good explanation. I don't have much experience with the non-sugar parts of the API, so this was a good summary. I suppose I can at least help out the client heap size this way. On Wed, Jul 2, 2014 at 10:14 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : Now that

Re: Question about solrcloud recovery process

2014-07-03 Thread wrdrvr
Try querying the recovering core with distrib=false, you should get the count of docs in it. Most likely, since the replica is recovering it is forwarding all queries to the active replica, this can be verified in the core logs. -- View this message in context:
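
A minimal sketch of the check suggested here: build a count-only query with distrib=false so the core answers from its own index, then read numFound from the JSON response. The core URL and the sample response body are hypothetical; the request itself is not sent, so the sketch runs without a Solr instance.

```python
import json
from urllib.parse import urlencode


def local_count_url(core_url):
    """Build a count-only query that is not forwarded to other replicas."""
    params = {
        "q": "*:*",          # match every document
        "rows": 0,           # only the count is needed, not the documents
        "wt": "json",
        "distrib": "false",  # answer from this core only
    }
    return core_url + "/select?" + urlencode(params)


def num_found(response_body):
    """Extract numFound from a Solr JSON response body."""
    return json.loads(response_body)["response"]["numFound"]


if __name__ == "__main__":
    # Hypothetical core URL and response body, for illustration only.
    print(local_count_url("http://localhost:8983/solr/collection1_shard1_replica2"))
    print(num_found('{"response": {"numFound": 42, "start": 0, "docs": []}}'))
```

Comparing this count against the same query on the leader shows how far the recovering replica has caught up.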

Field for 'species' data?

2014-07-03 Thread Dan Bolser
Hi, Does anyone on the list have experience with hierarchical facets, specifically for species data? I have a variety of 'messy' species names that I'd like to tidy up at analysis time and use as the basis for taxonomically guided hierarchical facets at query time. I was wondering if there's

Re: Solr Map Reduce Indexer Tool GoLive to SolrCloud with index on local file system

2014-07-03 Thread Tom Chen
Hi, In the GoLive stage, the MRIT sends the MERGEINDEXES requests to Solr instances. The request has an indexDir parameter with an HDFS path to the index generated on HDFS, as shown in the MRIT log: 2014-07-02 15:03:55,123 DEBUG org.apache.http.impl.conn.DefaultClientConnection: Sending request:

Re: Listening on SolrCloud events

2014-07-03 Thread Shawn Heisey
On 7/3/2014 7:49 AM, Ugo Matrangolo wrote: I would like to be informed as soon as a cluster event happens, such as a node dropping and/or starting a recovery process. What is the best way (if any) to listen for SolrCloud events? I don't know how it's done, but if you are using SolrJ and

Re: MLT weird behaviour in Solrcloud

2014-07-03 Thread shamik
Anyone ? -- View this message in context: http://lucene.472066.n3.nabble.com/MLT-weird-behaviour-in-Solrcloud-tp4145066p4145502.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Question about solrcloud recovery process

2014-07-03 Thread Peter Keegan
Aha, you are right, wrdrvr! The query is forwarded to any of the active shards (I saw the query alternate between both of mine). Nice feature. Also, looking at 'ClusterStateAwarePingRequestHandler' (which I downloaded from www.manning.com/SolrinAction), it is checking zookeeper to see if the

Re: Confusion about location of + and - ?

2014-07-03 Thread Shawn Heisey
On 7/1/2014 2:06 PM, Brett Hoerner wrote: Also, does anyone have the Solr or Lucene bug # for this? It's not actually a bug. It's an inherent truth -- you can't subtract from nothing. You must tell Lucene what to start with before you can subtract the 123456789 value. The fact that a bare
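
The "start with something before you subtract" point can be sketched as a small client-side helper that adds a match-all clause to a query made up only of negative clauses. This is a naive illustration under stated assumptions: it splits on whitespace, so it does not understand quoted phrases, parentheses, or the NOT keyword, and the field name id is hypothetical.

```python
def ensure_positive_base(query):
    """Prefix a purely negative query with *:* so there is a set to subtract from.

    Naive whitespace tokenization; quoted phrases, parentheses and the
    NOT keyword are not handled.
    """
    clauses = query.split()
    if clauses and all(clause.startswith("-") for clause in clauses):
        return "*:* " + query
    return query


if __name__ == "__main__":
    # Hypothetical field name, for illustration only.
    print(ensure_positive_base("-id:123456789"))
    print(ensure_positive_base("title:solr -id:123456789"))
```

Queries that already contain a positive clause are returned unchanged.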

Re: Restriction on type of uniqueKey field?

2014-07-03 Thread Shawn Heisey
On 7/1/2014 8:43 AM, Koji Sekiguchi wrote: In addition, KeywordTokenizer can seemingly be used, but it should be avoided for the unique key field. One of my customers used it and got an OOM during long-term indexing. As it was difficult to find the problem, I'd like to share my

Re: SolrCloud copy the index to another cluster.

2014-07-03 Thread heaven
Hi, sorry for the delay. Yes, we thought to simply copy the index over but this sounds risky and time consuming. Our index is too big to copy it over the internet quickly. We decided to re-index our data and then switch and re-index again. It's a pity there's no way to do this like with mysql :)

java.net.SocketException: Connection reset

2014-07-03 Thread heaven
Hi, trying DigitalOcean for Solr, everything seems well, except sometimes I see these errors: java.net.SocketException: Connection reset at java.net.SocketInputStream.read(SocketInputStream.java:196) at java.net.SocketInputStream.read(SocketInputStream.java:122) at

RE: Disable Regular Expression Support

2014-07-03 Thread Markus Jelsma
Hi, you can escape the surrounding slashes in your front-end. Markus -Original message- From: Markus Schuch markus_sch...@web.de Sent: Thursday 3rd July 2014 20:53 To: solr-user@lucene.apache.org Subject: Disable Regular Expression Support Hi Solr Community, we are migrating from
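
One way to do the front-end escaping suggested here is to put a backslash before every forward slash that is not already escaped, so the query parser treats it as a literal character rather than a regex delimiter. This is a sketch only: the function name is hypothetical, and an escaped backslash directly followed by a slash is not handled.

```python
import re

# A forward slash that is not already preceded by a backslash.
_UNESCAPED_SLASH = re.compile(r"(?<!\\)/")


def escape_regex_slashes(user_query):
    """Escape unescaped forward slashes in user input before sending it to Solr."""
    return _UNESCAPED_SLASH.sub(lambda match: "\\/", user_query)


if __name__ == "__main__":
    print(escape_regex_slashes("/foo/"))
    print(escape_regex_slashes("path:/usr/bin"))
```

Applying the function twice gives the same result as applying it once, because already escaped slashes are skipped.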

Disable Regular Expression Support

2014-07-03 Thread Markus Schuch
Hi Solr Community, we are migrating from Solr 1.4 to 4.3 and found that Solr 4.x introduced regular expression support in the query parser. Is it possible to disable this feature to get back to the 1.4 behavior of the query parser? Many thanks in advance, Markus Schuch

Re: Disable Regular Expression Support

2014-07-03 Thread Jack Krupansky
Unfortunately, not - the syntax is hard-wired into the grammar. Feel free to file a Jira though. I would be in favor of having a query parser config option to disable features like regex and leading wildcard as well. -- Jack Krupansky -Original Message- From: Markus Schuch Sent:

Re: Strategy for removing an active shard from zookeeper

2014-07-03 Thread Jeff Wartes
To expand on that, the Collections API DELETEREPLICA command is available in Solr >= 4.6, but will not have the ability to wipe the disk until Solr 4.10. Note that whether or not it deletes anything from disk, DELETEREPLICA will remove that replica from your cluster state in ZK, so even in 4.10,

Re: Solr Map Reduce Indexer Tool GoLive to SolrCloud with index on local file system

2014-07-03 Thread Erick Erickson
Hmmm, interesting, I actually hadn't thought of doing it that way. I don't know the internals well enough to comment on it but I do know someone who does. I'll check with them Erick On Thu, Jul 3, 2014 at 9:18 AM, Tom Chen tomchen1...@gmail.com wrote: Hi, In the GoLive stage, the MRIT

Re: java.net.SocketException: Connection reset

2014-07-03 Thread Michael Della Bitta
What's the %system load on your nodes? What servlet container are you using? Are you writing a single document per update, or in batches? How many clients are attached to your cloud? Michael Della Bitta Applications Developer o: +1 646 532 3062 appinions inc. “The Science of Influence

Re: Confusion about location of + and - ?

2014-07-03 Thread Jack Krupansky
I gave the relevant Solr Jiras on a previous fork of this thread. Here is the most relevant current Lucene Jira: https://issues.apache.org/jira/browse/LUCENE-3451 Remove special handling of pure negative Filters in BooleanFilter, disallow pure negative queries in BooleanQuery That patch is

Re: Listening on SolrCloud events

2014-07-03 Thread Jeff Wartes
If you're using SolrJ, CloudSolrServer exposes the information you need directly, although you'd have to poll it for changes. Specifically, this code path will get you a snapshot of the clusterstate: http://lucene.apache.org/solr/4_5_0/solr-solrj/org/apache/solr/client/solrj
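
The polling approach described here comes down to comparing successive snapshots. The sketch below shows only that comparison step in Python, not the SolrJ call: the snapshot shape (replica name mapped to a state string) and the replica names are assumptions made for illustration.

```python
def diff_states(previous, current):
    """Return (replica, old_state, new_state) for every replica whose state changed.

    Both arguments are hypothetical snapshots mapping replica name to state;
    a replica missing from one snapshot is reported with None on that side.
    """
    changes = []
    for name in sorted(set(previous) | set(current)):
        before = previous.get(name)
        after = current.get(name)
        if before != after:
            changes.append((name, before, after))
    return changes


if __name__ == "__main__":
    earlier = {"core_node1": "active", "core_node2": "active"}
    later = {"core_node1": "active", "core_node2": "recovering", "core_node3": "down"}
    for replica, old, new in diff_states(earlier, later):
        print(replica, old, "->", new)
```

A poller would call this on each new snapshot and treat every returned tuple as one cluster event.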

Re: java.net.SocketException: Connection reset

2014-07-03 Thread heaven
Hello, usually the load is not high at all: http://lucene.472066.n3.nabble.com/file/n4145550/Screenshot_795.png We're using the bundled Jetty, writing in batches of 50-100 documents, and only using soft and auto commits. About clients, we have 4 processes and each could run up to 5 threads. At

Passing Solr data directory to DataImportHandler?

2014-07-03 Thread Brian Panulla
I'm trying to use the new propertyWriter configuration directive for DataImportHandler ( https://wiki.apache.org/solr/DataImportHandler#Configuring_The_Property_Writer). I'd like to tell Solr to write the dataimport.properties to the data directory for the corresponding Core on our multicore

How to get related facets using Solr query ?

2014-07-03 Thread Shamik Bandopadhyay
Hi, I'm trying to construct a facet query to organize related facets in the response. Let me illustrate with a sample. Let's say I have the following documents indexed in Solr. 1. Doc A -- Facet:AB Facet:MNO 2. Doc B -- Facet:CD

Re: Field for 'species' data?

2014-07-03 Thread Gora Mohanty
On 3 July 2014 21:40, Dan Bolser dbol...@ebi.ac.uk wrote: Hi, Does anyone on the list have experience with hierarchical facets, specifically for species data? [...] Maybe not specifically for species data, but hierarchical faceting works pretty well with Solr. Please see