Re: Issue in 5.5.3 with lucene localParams with type

2016-11-28 Thread William Bell
Bump... Thoughts? It seems that {!lucene type=} would just override the lucene qp - but in 5.5.3 something changed. On Mon, Nov 28, 2016 at 1:15 PM, William Bell wrote: > In Solr 5.4.1 this used to work: > > fl={!lucene%20type=payloadQueryParser

Re: Starting SolrCloud

2016-11-28 Thread Erick Erickson
You need to find the solr.log file and examine it. What this usually means is that something's wrong with, say, your Solr configs. You should see a more informative message in the Solr log, usually it's a stack trace. You say that your start "seems to complete successfully". That implies that you

Starting SolrCloud

2016-11-28 Thread James Muerle
Hello, I am very new to Solr, and I'm excited to get it up and running on amazon ec2 for some prototypical testing. So, I've installed solr (and java) on one ec2 instance, and I've installed zookeeper on another. After starting the zookeeper server on the default port of 2181, I run this on the

Re: Break up a supplier's documents (products) from dominating search result.

2016-11-28 Thread Alexandre Rafalovitch
You can use expand and it will provide several documents per group (but in a different data structure in the response). Then it is up to you how to sequence or interleave the results in your UI. You do need to deal with edge-cases like what happens if you say 3 products per group, but then one

Re: Break up a supplier's documents (products) from dominating search result.

2016-11-28 Thread Derek Poh
Is there a way where we do not have to change the page UI? This is the search page for your reference.

Zookeeper connection lost in 5.5.3

2016-11-28 Thread Yago Riveiro
Hi, I upgraded my cluster to 5.5.3 and now I'm having a lot of this warnings. Unable to read /collections/collectionX/leader_initiated_recovery/shard9/core_node12 due to: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for

Re: How to enable JMX to monitor Jetty

2016-11-28 Thread Yago Riveiro
Hi, Rallavagu, the jetty-jmx.xml file is the basic file of the github repository or something custom? I modified the file modules/http.mod and I can't see jetty stuff ... - Best regards -- View this message in context:

Re: Break up a supplier's documents (products) from dominating search result.

2016-11-28 Thread Walter Underwood
We used something like field collapsing, but it wasn’t with Solr or Lucene. They had not been invented at the time. This was a feature of the Ultraseek engine from Infoseek, probably in 1997 or 1998. With field collapsing, you provide a link to show more results from that source. wunder Walter

Re: Break up a supplier's documents (products) from dominating search result.

2016-11-28 Thread Derek Poh
Hi Walter You used field collapsing for your case as well? For my case the search result page is listing of products. There is a option to select the number of products to display per page. Let's say 40 products per page is selected. A search result has 100 matching products but these

Re: Solr 6.3.0 SQL question

2016-11-28 Thread Damien Kamerman
Aggregated selects only work with lower-case collection names (and no dashes). (Bug in StatsStream I think) I assume 'SOLR-9077 Streaming expressions should support collection alias' which is fixed in 6.4 is a work around. On 29 November 2016 at 08:29, Kevin Risden

Re: upconfig in zookeeper doesn't relfect changes

2016-11-28 Thread Sadheera Vithanage
Please ignore this, It worked. On Tue, Nov 29, 2016 at 11:41 AM, Sadheera Vithanage wrote: > Hi All, > > I am trying to edit the solrconfig.xml for my solrcloud setup, which is in > the zookeeper as a configuration. > > Below are the steps I am following. > >

upconfig in zookeeper doesn't relfect changes

2016-11-28 Thread Sadheera Vithanage
Hi All, I am trying to edit the solrconfig.xml for my solrcloud setup, which is in the zookeeper as a configuration. Below are the steps I am following. /opt/solr/server/scripts/cloud-scripts/zkcli.sh -cmd downconfig -confdir /var/solr/data/dir_name -confname MyConfig -z 100.100.100.102 Update

Re: Break up a supplier's documents (products) from dominating search result.

2016-11-28 Thread Walter Underwood
We had a similar feature in the Ultraseek search engine. One of our customers was a magazine publisher, and they wanted the best hit from each magazine on the first page. I expect that field collapsing would work for this. wunder Walter Underwood wun...@wunderwood.org

Re: Break up a supplier's documents (products) from dominating search result.

2016-11-28 Thread Derek Poh
Alex Hope I understand what you meant by positive business requirements. With a few supplier's products dominating the first page of a search result, the sales will not be able to convince prospectiveor existing clients to sign up. They would like the results tofeature other supplier's

Re: Index time sorting and per index mergePolicyFactory

2016-11-28 Thread Erick Erickson
Wait, on the page you referenced there's this which appears to be exactly what you want: timestamp desc inner org.apache.solr.index.TieredMergePolicyFactory 10 10 And since this is in solrconfig.xml which is defined per core you can specify whatever you want for each core. Also see

Re: Index time sorting and per index mergePolicyFactory

2016-11-28 Thread Dorian Hoxha
bump after 11 days On Thu, Nov 17, 2016 at 10:25 AM, Dorian Hoxha wrote: > Hi, > > I know this is done in lucene, but I don't see it in solr (by searching + > docs on collections). > > I see https://cwiki.apache.org/confluence/display/solr/ > IndexConfig+in+SolrConfig

Re: stream, features and train

2016-11-28 Thread Joe Obernberger
Thank you Joel - that was it; or rather a miss-understanding of how this works on my end! -Joe On 11/26/2016 10:17 PM, Joel Bernstein wrote: Hi, It looks like the outcome field my not be correct or it may have missing values. You'll need to populate this field for all records in the

Re: Solr 6.3.0 SQL question

2016-11-28 Thread Kevin Risden
Is there a longer error/stack trace in your Solr server logs? I wonder if the real error is being masked. Kevin Risden On Mon, Nov 28, 2016 at 3:24 PM, Joe Obernberger < joseph.obernber...@gmail.com> wrote: > I'm running this query: > > curl --data-urlencode 'stmt=SELECT avg(TextSize) from

Solr 6.3.0 SQL question

2016-11-28 Thread Joe Obernberger
I'm running this query: curl --data-urlencode 'stmt=SELECT avg(TextSize) from UNCLASS' http://cordelia:9100/solr/UNCLASS/sql?aggregationMode=map_reduce The error that I get back is: {"result-set":{"docs":[ {"EXCEPTION":"org.apache.solr.common.SolrException: Collection not found:

Re: ClassicIndexSchemaFactory with Solr 6.3

2016-11-28 Thread Cassandra Targett
I'm not seeing how the documentation is wrong here. It says: "When a is not explicitly declared in a solrconfig.xml file, Solr implicitly uses a ManagedIndexSchemaFactory" IOW, managed schema is the default, and you may not find a schemaFactory definition in your file. When a schemaFactory

Issue in 5.5.3 with lucene localParams with type

2016-11-28 Thread William Bell
In Solr 5.4.1 this used to work: fl={!lucene%20type=payloadQueryParser v='hosp_quality_spec_boost:PS628'} 24.227154, The only way I can get payloads to work is: fl={!payloadQueryParser v='hosp_quality_spec_boost:PS628'} 0.125, But the right values only come back in #2. It should be .125.

Re: Scheduling Data Import Handler (DIH) Delta Imports

2016-11-28 Thread Walter Underwood
First, try to do it with something like Apache Camel. That moves the whole database import process outside of Solr where it can be more easily controlled. http://camel.apache.org/ wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog)

Re: Scheduling Data Import Handler (DIH) Delta Imports

2016-11-28 Thread Jamie Jackson
One last bump before I get crackin'... On Mon, Nov 21, 2016 at 11:54 AM, Jamie Jackson wrote: > Hi Folks, > > I have DIH cores that are being indexed by my Lucee application. That > works, but I'd like to make some improvements: > > >- Make a standalone scheduler

Re: Search opening hours

2016-11-28 Thread David Smiley
Lets say you wanted to do ranges over some integer. Simply convert those integers to dates, such as java.time.Instant.ofEpochSecond(myInteger).toString(). It's more efficient to convert to seconds (as in this example) as a base instead milliseconds because the internal date oriented tree has

Re: Failure when trying to full sync, out of space ? Doesn't delete old segments before full sync?

2016-11-28 Thread Walter Underwood
> On Nov 28, 2016, at 9:38 AM, Shawn Heisey wrote: > > […] Typically > a merge or optimize will only require double the space, but there are > certain worst-case scenarios where it can require triple. I do not know > what causes the worst-case situation. Worst case: 1.

Re: Failure when trying to full sync, out of space ? Doesn't delete old segments before full sync?

2016-11-28 Thread Michael Joyner
On 11/28/2016 12:26 PM, Erick Erickson wrote: Well, such checks could be put in, but they don't get past the basic problem. And all this masks your real problem; you didn't have enough disk space to optimize in the first place. Even during regular indexing w/o optimizing, Lucene segment

Re: Failure when trying to full sync, out of space ? Doesn't delete old segments before full sync?

2016-11-28 Thread Shawn Heisey
On 11/28/2016 9:39 AM, Michael Joyner wrote: > I'm running out of spacing when trying to restart nodes to get a > cluster back up fully operational where a node ran out of space during > an optimize. > > It appears to be trying to do a full sync from another node, but > doesn't take care to check

Re: Failure when trying to full sync, out of space ? Doesn't delete old segments before full sync?

2016-11-28 Thread Michael Joyner
We've being trying to run at 40% estimated usage when optimized, but are doing a large amount of index updates ... 40% usage in this scenario seems to be too high... On 11/28/2016 12:26 PM, Erick Erickson wrote: Well, such checks could be put in, but they don't get past the basic problem.

Re: Failure when trying to full sync, out of space ? Doesn't delete old segments before full sync?

2016-11-28 Thread Erick Erickson
Well, such checks could be put in, but they don't get past the basic problem. bq: If the segments are out of date and we are pulling from another node before coming "online" why aren't the old segments deleted? because you run the risk of losing _all_ your data and having nothing at all. The

Re: Solr 6 Performance Suggestions

2016-11-28 Thread Walter Underwood
We had some serious slowness at startup before I set Xms to be the same as Xmx. We run with an 8G heap. We have multiple collections but don’t use faceting. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Nov 28, 2016, at 8:40 AM, Max Bridgewater

Re: initiate solr could collection

2016-11-28 Thread Novin Novin
Apologies for that didn't described properly. Thanks for the help. I will look into this. On Mon, 28 Nov 2016 at 16:33 Erick Erickson wrote: > Please state the full problem rather than make us pull things out in > dribs and drabs. > > Have you looked at the bin/solr

Re: Solr 6 Performance Suggestions

2016-11-28 Thread Max Bridgewater
Thanks again Folks. I tried each suggestion and none made any difference. I am setting up a lab for performance monitoring using App Dynamics. Hopefully I am able to figure out something. On Mon, Nov 28, 2016 at 11:20 AM, Erick Erickson wrote: > bq: If you know the

Failure when trying to full sync, out of space ? Doesn't delete old segments before full sync?

2016-11-28 Thread Michael Joyner
Hello all, I'm running out of spacing when trying to restart nodes to get a cluster back up fully operational where a node ran out of space during an optimize. It appears to be trying to do a full sync from another node, but doesn't take care to check available space before starting

Re: initiate solr could collection

2016-11-28 Thread Erick Erickson
Please state the full problem rather than make us pull things out in dribs and drabs. Have you looked at the bin/solr script options? Particularly the create_collection option? On Mon, Nov 28, 2016 at 8:24 AM, Novin Novin wrote: > Thanks for this Erick, -e brings me to

Re: initiate solr could collection

2016-11-28 Thread Novin Novin
Thanks for this Erick, -e brings me to prompt. I can't use it because I am using script to setup solr cloud. I required something where I can define shard and replica also. Best, Novin On Mon, 28 Nov 2016 at 16:14 Erick Erickson wrote: > try > > bin/solr start -e cloud

Re: Solr 6 Performance Suggestions

2016-11-28 Thread Erick Erickson
bq: If you know the maximum size you ever will need, setting Xmx is good. Not quite sure what you're getting at here. I pretty much guarantee that a production system will eat up the default heap size, so not setting Xmx will cause OOM errors pretty soon. Or did you mean Xms? As far as setting

Re: initiate solr could collection

2016-11-28 Thread Erick Erickson
try bin/solr start -e cloud -z ZK_NODE That'll guide you through creating a collection, assuming you can get by with one of the stock configuration sets. Best, Erick On Mon, Nov 28, 2016 at 8:11 AM, Novin Novin wrote: > Hi Guys, > > Does solr has any way to create

initiate solr could collection

2016-11-28 Thread Novin Novin
Hi Guys, Does solr has any way to create collection when solr cloud is getting started first time? Best, Novin

Using atomic update in Solr get an error

2016-11-28 Thread giladv
I'm getting the following error in 5.2.1: RunUpdateProcessor has received an AddUpdateCommand containing a document that appears to still contain Atomic document update operations, most likely because DistributedUpdateProcessorFactory was explicitly disabled from this updateRequestProcessorChain

3rd party integrations (was: The state of Solr 5. Is it in maintenance mode only?)

2016-11-28 Thread Alexandre Rafalovitch
On 29 November 2016 at 00:24, Shawn Heisey wrote: > Third-party integrations (Solr support in other software) tend to be > VERY slow to upgrade. Some of them are still shipping configs designed > for Solr 3.x, which won't work in 5.x and later. Some are still > shipping

Re: Break up a supplier's documents (products) from dominating search result.

2016-11-28 Thread Alexandre Rafalovitch
Is it technically possible to expose it in Solr? Because there was also http://stackoverflow.com/questions/40831474/randomize-result-set-between-the-brands-in-solr/40835382#40835382 . Seems a popular request (or I misread different things in the same way). Regards, Alex.

Re: Break up a supplier's documents (products) from dominating search result.

2016-11-28 Thread Shalin Shekhar Mangar
There is a related work done in Lucene land which hasn't been exposed in Solr yet. It is called DiversifiedTopDocsCollector. See https://issues.apache.org/jira/browse/LUCENE-6066 On Mon, Nov 28, 2016 at 2:39 PM, Derek Poh wrote: > Hi > > We have a business requirement to

Re: The state of Solr 5. Is it in maintenance mode only?

2016-11-28 Thread Shawn Heisey
On 11/28/2016 6:11 AM, Jaroslaw Rozanski wrote: > As for adoption levels, it was my subjective feel reading this list. Do > we have community survey on that subject? That would be really > interesting to see. That's really hard for me to say. Users tend to not what version they are running

Re: The state of Solr 5. Is it in maintenance mode only?

2016-11-28 Thread Jaroslaw Rozanski
Hi, Thanks for elaborate response. Missed the link to duplicate JIRA. Makes sense. On the 5.x front I wasn't expecting 5.6 release now that we have 6.x but was simply surprised to see fix for 4.x and not for 5.x. As for adoption levels, it was my subjective feel reading this list. Do we have

Re: SOl6.3 Alchemy Annotator Not Working

2016-11-28 Thread Shawn Heisey
On 11/28/2016 12:50 AM, soumitra80 wrote: > This issue has been resolved. Please close this Unless you opened an issue, there wasn't ever one open. I did not see an issue number, so if there's something to close, I'm not aware of it. The class you mentioned in your original post,

Re: The state of Solr 5. Is it in maintenance mode only?

2016-11-28 Thread Shawn Heisey
On 11/28/2016 4:29 AM, Jaroslaw Rozanski wrote: > Recently I have noticed that couple of Solr issues have been > resolved/added only for Solr 4.x and Solr 6.x branch. For example > https://issues.apache.org/jira/browse/SOLR-2242. Has Solr 5.x branch > been moved to maintenance mode only? The 5

The state of Solr 5. Is it in maintenance mode only?

2016-11-28 Thread Jaroslaw Rozanski
Hi, Recently I have noticed that couple of Solr issues have been resolved/added only for Solr 4.x and Solr 6.x branch. For example https://issues.apache.org/jira/browse/SOLR-2242. Has Solr 5.x branch been moved to maintenance mode only? The 5 wasn't around for long before 6 came about so I

Re: Solr 6 Performance Suggestions

2016-11-28 Thread Florian Gleixner
Am 28.11.2016 um 00:00 schrieb Shawn Heisey: On 11/27/2016 12:51 PM, Florian Gleixner wrote: On 22.11.2016 14:54, Max Bridgewater wrote: test cases were exactly the same, the machines where exactly the same and heap settings exactly the same (Xms24g, Xmx24g). Requests were sent with Setting

Re: Break up a supplier's documents (products) from dominating search result.

2016-11-28 Thread Alexandre Rafalovitch
You have described your _negative_ business requirements, but not the _positive_ ones. So, it is hard to see what they want to happen. It is easy enough to promote or demote a particular filter matches. But you want to partially limit them. On a first page? What about on the second? I suspect you

Break up a supplier's documents (products) from dominating search result.

2016-11-28 Thread Derek Poh
Hi We have a business requirement to breakupa supplier's products from dominating search resultso as to allow othersuppliers' products in the search result to have exposure. Business users are open to implementing this for the first page of the search resultif it is not possible to apply

Re: AW: AW: Resync after restart

2016-11-28 Thread Arkadi Colson
We do. Indexing is always running. Fix version is 6.3 so can I assume that the issue is fixed in 6.3? We are running 6.3 right now so or the fix is not in 6.3 or another issue is causing the full resync. BR Arkadi On 25-11-16 18:23, Pushkar Raste wrote: Did you index any documents while

Custom EntityProcessor for DataImportHandler related Issue.

2016-11-28 Thread anupambumba
Hi All, I am facing some issue with the Custom EntityProcessor for DataImportHandler related Issue. *My Requirement:* My Requirement is to process a main file along with it's associated chunk files( Child Files) placed in a folder. The file related information are part of JSON file placed in