Re: mm being ignored by edismax

2016-10-06 Thread Alexandre Rafalovitch
I think it is the change in the OR and AND treatment that had been confusing a number of people. There were discussions before on the mailing list about it, for example http://search-lucene.com/m/eHNlzBMAHdfxcv1 Regards, Alex. Solr Example reading group is starting November 2016, join us

RE: Migrating to Solr 6.1.0 from 5.5.0

2016-10-06 Thread M, Arjun (Nokia - IN/Bangalore)
Thanks David.. I found the solution. Below the information related to that. “Solr supports polygons via JTS Topology Suite, which does not come with Solr.It's a JAR file that you need to put on Solr's classpath (but not via the standard solrconfig.xml mechanisms). If you intend to use those

mm being ignored by edismax

2016-10-06 Thread Nick Hall
Hello, I'm working on upgrading a Solr installation from 4.0 to 6.2.1 and have everything mostly working but have hit a snag. I kept the schema basically the same, just made some minor changes to allow it to work with the new version, but one of my queries is working differently with the new

Re: solr 5 leaving tomcat, will I be the only one fearing about this?

2016-10-06 Thread Alexandre Rafalovitch
Treat Solr as a blackbox standalone database. Your MySQL is running standalone, right? And try to go to Solr 6, if you can. 5 is not latest anymore and there had been lots of scaling improvements in 6. Regards, Alex On 7 Oct 2016 5:02 AM, "Renee Sun" wrote: > need

Re: [Solr-5-4-1] Why SolrCloud leader is putting all replicas in recovery at the same time ?

2016-10-06 Thread Pushkar Raste
A couple of questions/suggestions - This normally happens after leader election, when new leader gets elected, it will force all the nodes to sync with itself. Check logs to see when this happens, if leader was changed. If that is true then you will have to investigate why leader change takes

Re: newSearcher autowarming queries in solrconfig.xml run but does not appear to warm cache

2016-10-06 Thread Dalton Gooding
Erick, Thanks for the response. After I run the initial query and get a long response time, if I change the query to remove or add additional query statements, I find the speed is good. If I run the modified query after a new searcher has registered, the response is slow but after the modified

Count on Multivalued field using facet

2016-10-06 Thread Aswath Srinivasan (TMS)
Hello, I'm having a result set something like this, and query like below. The facet count for Line field is 1(1). That is, value Line's value 1 has numBucket = 1. However, I need to count the number of occurrence of each of the values in the LINE field. Is there a way to do this? Expecting

Re: Streaming api and multiValued fields

2016-10-06 Thread Joel Bernstein
Currently the joins in the Streaming API don't support joining on multi-value fields. It will be difficult to support merge joins on multi-value fields but hash joins would be possible in the future. Also the gatherNodes graph expression will support multi-value fields in the future. Joel

Re: newSearcher autowarming queries in solrconfig.xml run but does not appear to warm cache

2016-10-06 Thread Erick Erickson
Submitting the exact same query twice will return results from the queryResultCache. I'm not entirely sure that the firstSearcher events get put into the cache. So if you change the query even slighty my guess is that you'll see response times very close to your original ones of over a second.

solr 5 leaving tomcat, will I be the only one fearing about this?

2016-10-06 Thread Renee Sun
need some general advises please... our infra is built with multiple webapps with tomcat ... the scale layer is archived on top of those webapps which work hand-in-hand with solr admin APIs / shard queries / commit or optimize / core management etc etc. While I have not get a chance to actually

newSearcher autowarming queries in solrconfig.xml run but does not appear to warm cache

2016-10-06 Thread Dalton Gooding
After setting a number of newSearcher and firstSearcher queries, I can see in the console logs that the queries are run, but when I run the same query against the new searcher (using CuRL), I get a slow response time for the first run.  Config:                   DataType_s:Product           

Re: Queries to help warm up (mmap)

2016-10-06 Thread Pushkar Raste
One of the tricks I had read somewhere was to cat all files in the index directory and OS will have file in the disk cache. On Thu, Oct 6, 2016 at 11:55 AM, Rallavagu wrote: > Looking for clues/recommendations to help warm up during startup. Not > necessarily Solr caches

Streaming api and multiValued fields

2016-10-06 Thread chriseldredge
Is there any documentation on the support (or lack thereof) for using join, hashJoin and other operations to combine streams on multiValued fields? I have a core with posts that can be written about multiple companies, and another core with info about those companies: { 'id': 'post-1234',

Re: Average of Averages in Solr

2016-10-06 Thread Susheel Kumar
Please look into streaming expressions. I think that is what you are looking for. https://cwiki.apache.org/confluence/display/solr/Streaming+Expressions Thanks, Susheel On Thu, Oct 6, 2016 at 11:56 AM, John Bickerstaff wrote: > This may help? Note the "Bloomberg

Re: Problem with Password Decryption in Data Import Handler

2016-10-06 Thread Jamie Jackson
It happens to be ten characters. On Thu, Oct 6, 2016 at 12:44 PM, Alexandre Rafalovitch wrote: > How long is the encryption key (file content)? Because the code I am > looking at seems to expect it to be at most 100 characters. > > Regards, >Alex. > > Newsletter and

Re: Problem with Password Decryption in Data Import Handler

2016-10-06 Thread Jamie Jackson
I tried it both ways yesterday--with a newline and without. On Thu, Oct 6, 2016 at 12:26 PM, Kevin Risden wrote: > I haven't tried this but is it possible there is a new line at the end in > the file? > > If you did something like echo "" > file.txt then there would be

Re: Writing Solr Custom Components

2016-10-06 Thread John Bickerstaff
Thank you Otis! On Thu, Oct 6, 2016 at 10:28 AM, Otis Gospodnetić < otis.gospodne...@gmail.com> wrote: > John, if it helps, here are a few examples of custom Solr SearchComponents: > > https://github.com/sematext/query-segmenter > https://github.com/sematext/solr-researcher >

Re: Writing Solr Custom Components

2016-10-06 Thread Otis Gospodnetić
John, if it helps, here are a few examples of custom Solr SearchComponents: https://github.com/sematext/query-segmenter https://github.com/sematext/solr-researcher https://github.com/sematext/solr-autocomplete I hope this helps. Otis -- Monitoring - Log Management - Alerting - Anomaly Detection

Re: seperate core from engine

2016-10-06 Thread Shawn Heisey
On 10/6/2016 11:07 AM, KRIS MUSSHORN wrote: > Currently Solr ( 5.4.1 ) and its core data are all in one location. > How would i set up Solr so that the core data could be stored > somewhere else? Pointers to helpful instructions are appreciated If you use the service installation script on a

Re: seperate core from engine

2016-10-06 Thread Alexandre Rafalovitch
You have solr home property (solr.solr.home) to point to where all your collections/cores are and then you can set various directory locations per core in the core.properties file. Regards, Alex. Newsletter and resources for Solr beginners and intermediates: http://www.solr-start.com/

seperate core from engine

2016-10-06 Thread KRIS MUSSHORN
Currently Solr ( 5.4.1 ) and its core data are all in one location. How would i set up Solr so that the core data could be stored somewhere else? Pointers to helpful instructions are appreciated TIA Kris

Re: Problem with Password Decryption in Data Import Handler

2016-10-06 Thread Alexandre Rafalovitch
How long is the encryption key (file content)? Because the code I am looking at seems to expect it to be at most 100 characters. Regards, Alex. Newsletter and resources for Solr beginners and intermediates: http://www.solr-start.com/ On 6 October 2016 at 23:26, Kevin Risden

Re: Problem with Password Decryption in Data Import Handler

2016-10-06 Thread Kevin Risden
I haven't tried this but is it possible there is a new line at the end in the file? If you did something like echo "" > file.txt then there would be a new line. Use echo -n "" > file.txt Also you should be able to check how many characters are in the file. Kevin Risden On Wed, Oct 5, 2016 at

Re: SOLR Sizing

2016-10-06 Thread Walter Underwood
The square-root rule comes from a short paper draft (unpublished) that I can’t find right now. But this paper gets the same result: http://nflrc.hawaii.edu/rfl/April2005/chujo/chujo.html Perfect OCR would follow this rule, but even great

Re: Upgrading to SolrCloud

2016-10-06 Thread Shawn Heisey
On 10/6/2016 9:02 AM, Steven White wrote: > We currently have a component that uses SolrJ and Solr REST API to admin > Solr (adding new fields, changing handlers, etc. to customize Solr's > schema) based on customer's DB schema before we start indexing. > > If we switch over to SolrCloud: > > 1)

Re: Upgrading to SolrCloud

2016-10-06 Thread Jan Høydahl
> 6. okt. 2016 kl. 17.02 skrev Steven White : > If we switch over to SolrCloud: > > 1) Will our existing usage of SolrJ and REST API still work as-is? Yes, probably > 2) Not all of our customers need that high availability of Solr. For > those, single server and single

Re: Queries to help warm up (mmap)

2016-10-06 Thread Walter Underwood
I use the schema browser to find the 20 most common words. I use those, assuming that they’ll be the most common in queries. Those are static warming queries in solrconfig.xml. This works fairly well for book or movie titles. Not so well for free text. You could do the same thing with query

Upgrading to SolrCloud (take 2)

2016-10-06 Thread Steven White
(sorry if this a second post, the first one 1 posted 1 hour ago has yet to make it to the mailing list!!) Hi everyone, Currently, we are on Solr 5.2 and use 1 core and none of the cloud features. We are planning to upgrade to Solr 6.2 and utilize SolrCloud not because our data need to scale

Re: Average of Averages in Solr

2016-10-06 Thread John Bickerstaff
This may help? Note the "Bloomberg Analytics" at the bottom of the post... https://dzone.com/articles/solr-not-just-for-text-anymore Quote from article: - *Bloomberg Analytics Component for Solr*: Bloomberg Financial Services uses Solr extensively, and found the existing statistical

Queries to help warm up (mmap)

2016-10-06 Thread Rallavagu
Looking for clues/recommendations to help warm up during startup. Not necessarily Solr caches but mmap as well. I have used some like "q=:[* TO *]" for various fields and it seems to help with mmap population around 40-50%. Is there anything else that could help achieve 90% or more? Thanks.

Re: Average of Averages in Solr

2016-10-06 Thread Shawn Heisey
On 10/6/2016 12:04 AM, Lewin Joy (TMS) wrote: > There is a requirement to take an average on "Amount" field against > each "code" field. And then calculate the averages on this averages. > Since my "code" field has a very huge cardinality, which could be > around 200,000 or even in millions ; It

Re: JSON Facet "allBuckets" behavior

2016-10-06 Thread prosens
Yonik, Here is the requirement: Get sum of size field for all the documents which has a duplicate in the index. Duplicate is decided based on a string field. So, we are looking for something like this. { "Statistics": { "type": "terms", "field": "filename",

Re: SOLR Sizing

2016-10-06 Thread Erick Erickson
OCR _without errors_ wouldn't break it. That comment assumed that the OCR was dirty I thought. Honest, I once was trying to index an OCR'd image of a "family tree" that was a stylized tree where the most remote ancestor was labeled in vertical text on the trunk, and descendants at various angles

Re: SOLR Sizing

2016-10-06 Thread Rick Leir
I am curious to know where the square-root assumption is from, and why OCR (without errors) would break it. TIA cheers - - Rick On 2016-10-04 10:51 AM, Walter Underwood wrote: No, we don’t have OCR’ed text. But if you do, it breaks the assumption that vocabulary size is the square root of

Upgrading to SolrCloud

2016-10-06 Thread Steven White
Hi everyone, Currently, we are on Solr 5.2 and use 1 core and none of the cloud features. We are planning to upgrade to Solr 6.2 and utilize SolrCloud not because our data need to scale (single core with no cloud is doing just fine on our index of 2 million records and about 15 gb index size)

Re: QuerySenderListener

2016-10-06 Thread Erick Erickson
Hmm, that JIRA looks like exactly what's going on. I suspect the reason it's not generating that much interest is that restarting Solr should be a rare enough event that opening two searchers isn't causing enough difficulty for someone to break loose the time to create a patch. The patch attached

Rollback solrcloud

2016-10-06 Thread Pablo Anzorena
Hey, I was trying to make a rollback under solrcloud and foundd that it's not supported https://issues.apache.org/jira/browse/SOLR-4895 (I have solr6.1.0) So my question is, how can I simulate a rollback? Actually what I'm doing is: 1. prepareCommit 2. add documents 3. try to commit

Re: [Solr-5-4-1] Why SolrCloud leader is putting all replicas in recovery at the same time ?

2016-10-06 Thread Erick Erickson
There is no information here at all that would us to say anything meaningful. You might review: http://wiki.apache.org/solr/UsingMailingLists What do the logs say? Are there any exceptions? What happens on your system that's unusual if anything? In short, what have you tried to do to diagnose the

Best practice for Fuzzy Search combined with Phrase Queries

2016-10-06 Thread Markus Lang
Hi, I am interested in best practices on how to handle phrase queries where only a part of the phrase may match and / or the user made some typos. Are there any papers on when to use only a part of the query phrase or how many words of the phrase should rather be corrected before skipping them?

[Solr-5-4-1] Why SolrCloud leader is putting all replicas in recovery at the same time ?

2016-10-06 Thread Gerald Reinhart
Hello everyone, Our Solr Cloud works very well for several months without any significant changes: the traffic to serve is stable, no major release deployed... But randomly, the Solr Cloud leader puts all the replicas in recovery at the same time for no obvious reason. Hence, we

Re: running solr 6.x in Eclipse for debugging

2016-10-06 Thread John Bickerstaff
Thank you very much Eric - I'll try that tomorrow. On Wed, Oct 5, 2016 at 7:57 PM, Erick Erickson wrote: > John: > > The simple answer is "cheat" > > It takes a little fiddling, but here's what I do in IntelliJ: > > 1> use IntelliJ to create an "artifact" that's

Average of Averages in Solr

2016-10-06 Thread Lewin Joy (TMS)
•• PROTECTED 関係者外秘 Hi, I have a big collection with around 100 million records. There is a requirement to take an average on "Amount" field against each "code" field. And then calculate the averages on this averages. Since my "code" field has a very huge cardinality, which could be around