Re: The most efficient way to get un-inverted view of the index?

2016-08-17 Thread Roman Chyla
in case this helps someone, here is a solution (probably very efficient already, but i didn't profile it); it can deal with DocValues and with FieldCache (the old 'stored' values) private void unInvertedTheDamnThing( SolrIndexSearcher searcher, List fields, KVSetter setter)

Re: Using Solr invariants to set facet method?

2016-08-17 Thread ruby
Thanks for your reply. I was not seeing the param being added in return results. but after adding echoParams=true, I see that facet method is being added. -- View this message in context: http://lucene.472066.n3.nabble.com/Using-Solr-invariants-to-set-facet-method-tp4292142p4292149.html Sent

Re: Error During Indexing - org.apache.solr.common.SolrException; org.apache.solr.common.SolrException: early EOF

2016-08-17 Thread Erick Erickson
>From my testing program, there's nothing standard here. As the blog points out, since I was indexing fairly simple documents you should _not_ be expecting to see those indexing rates. The point of the article was just to show the _relative_ changes when I sent batches. Best, Erick On Wed, Aug

Re: Using Solr invariants to set facet method?

2016-08-17 Thread Erick Erickson
Setting the facet method to enum will have consequences for the filterCache, especially if you allow faceting on high-cardinality fields so for that specific example I'd be cautious. Best, Erick On Wed, Aug 17, 2016 at 3:01 PM, Alexandre Rafalovitch wrote: > That's what it

Re: Using Solr invariants to set facet method?

2016-08-17 Thread Alexandre Rafalovitch
That's what it is there for. Are you seeing any issues? You can confirm whether it works or not by adding echoParams=all to the query (or in the defaults/invariants). Regards, Alex Newsletter and resources for Solr beginners and intermediates: http://www.solr-start.com/ On 18 August

Using Solr invariants to set facet method?

2016-08-17 Thread ruby
Is it possible to use the invariants in Solr config to set facet.method to override what user is sending? enum -- View this message in context: http://lucene.472066.n3.nabble.com/Using-Solr-invariants-to-set-facet-method-tp4292142.html Sent from the Solr - User mailing list

Re: Error During Indexing - org.apache.solr.common.SolrException; org.apache.solr.common.SolrException: early EOF

2016-08-17 Thread Jaspal Sawhney
Erick Going through the article which you shared. Where are you getting the Docs/second value? Thanks On 8/17/16, 4:37 PM, "Jaspal Sawhney" wrote: >Erick >Thanks - My batch size was 30 and thread size also 30. >Thanks > >On 8/17/16, 3:48 PM, "Erick Erickson"

Re: Error During Indexing - org.apache.solr.common.SolrException; org.apache.solr.common.SolrException: early EOF

2016-08-17 Thread Jaspal Sawhney
Erick Thanks - My batch size was 30 and thread size also 30. Thanks On 8/17/16, 3:48 PM, "Erick Erickson" wrote: >What this probably indicates is that the size of the packets you send >to Solr is large enough that it exceeds the transport protocol's >limit. This is

Unit testing HttpPost With an Embedded Solr Server

2016-08-17 Thread Jennifer Coston
Hello, I have written a data service to send an HttpPost command to post JSON to Solr. The code is working, but now I want to switch to using an embedded Solr server for just the unit tests. The problem is that the embedded Solr server doesn't seem to be starting an embedded server with a

Re: Error During Indexing - org.apache.solr.common.SolrException; org.apache.solr.common.SolrException: early EOF

2016-08-17 Thread Erick Erickson
What this probably indicates is that the size of the packets you send to Solr is large enough that it exceeds the transport protocol's limit. This is reinforced by your statement that reducing the batch size fixes the problem even though it increases indexing time. So the place I'd be looking is

Re: Error During Indexing - org.apache.solr.common.SolrException; org.apache.solr.common.SolrException: early EOF

2016-08-17 Thread Jaspal Sawhney
Bump ! On 8/16/16, 10:53 PM, "Jaspal Sawhney" wrote: >Hello >We are running solr 4.6 in master-slave configuration where in our master >is used entirely for indexing. No search traffic comes to master ever. >Off late we have started to get the early EOF error on the solr

Re: The most efficient way to get un-inverted view of the index?

2016-08-17 Thread Roman Chyla
Joel, thanks, but which of them? I've counted at least 4, if not more, different ways of how to get DocValues. Are there many functionally equal approaches just because devs can't agree on using one api? Or is there a deeper reason? Btw, the FieldCache is still there - both in lucene (to be

Re: [Ext] Influence ranking based on document committed date

2016-08-17 Thread Stefan Matheis
Erick already gave you the solution, additional to that there’s a wiki page that might contain a few more things about relevancy: https://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_change_the_score_of_a_document_based_on_the_.2Avalue.2A_of_a_field_.28say.2C_.22popularity.22.29 -Stefan On

Re: Increasing filterCache size and Java Heap size

2016-08-17 Thread Zheng Lin Edwin Yeo
Hi Toke, Thanks for the explanation. I will prefer the memory-based limit too. At first I got confused with that too, thinking that the setting of 2000 means 2GB. Regards, Edwin On 17 August 2016 at 17:40, Toke Eskildsen wrote: > On Wed, 2016-08-17 at 11:02 +0800,

Re: [Ext] Influence ranking based on document committed date

2016-08-17 Thread Erick Erickson
Try: recip(rord(creationDate),1,1000,1000) See: https://wiki.apache.org/solr/FunctionQuery You can play with the magic numbers to influence how this scales your docs. Best, Erick On Wed, Aug 17, 2016 at 7:11 AM, Jay Parashar wrote: > This is correct: " I index it and feed

Re: Modified stat of index

2016-08-17 Thread Scott Derrick
thanks that works perfectly! Scott Original Message Subject: Re: Modified stat of index From: Alexandre Rafalovitch To: solr-user Date: 08/16/2016 04:17 PM I believe you can get that via Luke REST API:

Re: Use function in condition

2016-08-17 Thread Emir Arnautovic
Hi Nabil, You can use frange queries, e.g. you can use fq={!frange l=100}sum(field1,field2) to filter doc with sum greater than 100. Regards, Emir On 17.08.2016 16:26, nabil Kouici wrote: Hi, Is it possible to use functions (function query

Re: index size increses dramatically

2016-08-17 Thread Jan Høydahl
Hi It is quite normal that index size can be close to double during background merge of segments. If you have a lot of deletions and/or reindexed docs then the same document may also exist in multiple segments, taking up space temporarily until a merge or optimize. If this slows down your

Re: Tagging and excluding Filters with BlockJoin Queries and BlockJoin Faceting

2016-08-17 Thread Stefan Moises
Hi Mikhail, thanks for the info ... what is the advantage of using the JSON FACET API compared to the standard BlockJoinQuery features? Is there already anybody working on the tagging/exclusion feature or is there any timeframe for it? There wasn't any discussion yet in SOLR-8998 about

Re: What does refCount denotes in solr admin

2016-08-17 Thread kshitij tyagi
any update?? On Wed, Aug 17, 2016 at 12:47 PM, kshitij tyagi wrote: > Hi, > > I need to understand what is refcount in stats section of solr admin. > > I am seeing refcount: 2 on my solr cores and on one of the core i am > seeing refcount:171. > > The core with

Use function in condition

2016-08-17 Thread nabil Kouici
Hi, Is it possible to use functions (function query https://cwiki.apache.org/confluence/display/solr/Function+Queries) in q or fq parameters to build a complex search expression.  For exemple, take only documents that sum(field1,field2)> 100. Another exemple:  if(test,value1,value2):vallue3

RE: [Ext] Influence ranking based on document committed date

2016-08-17 Thread Jay Parashar
This is correct: " I index it and feed it the timestamp at index time". You can sort desc on that field (can be a TrieDateField) -Original Message- From: Steven White [mailto:swhite4...@gmail.com] Sent: Wednesday, August 17, 2016 9:01 AM To: solr-user@lucene.apache.org Subject: [Ext]

Influence ranking based on document committed date

2016-08-17 Thread Steven White
Hi everyone Let's say I search for the word "Olympic" and I get a hit on 10 documents that have similar content (let us assume the content is at least 80% identical) how can I have Solr rank them so that the ones with most recently updated doc gets ranked higher? Is this something I have to do

index size increses dramatically

2016-08-17 Thread kshitij tyagi
Hi, Suddenly my index size just doubles and indexing just slows down poorly. After sometime it reduces back to normal and indexing starts working. Can someone help me out in finding why index size doubles abnormally??

Re: Tagging and excluding Filters with BlockJoin Queries and BlockJoin Faceting

2016-08-17 Thread Mikhail Khludnev
Stefan, child.facet.field never intend to support exclusions. My preference is to implement it under json.facet that's discussed under https://issues.apache.org/jira/browse/SOLR-8998. On Wed, Aug 17, 2016 at 3:52 PM, Stefan Moises wrote: > Hey girls and guys, > > for a long

Tagging and excluding Filters with BlockJoin Queries and BlockJoin Faceting

2016-08-17 Thread Stefan Moises
Hey girls and guys, for a long time we have been using our own BlockJoin Implementation, because for our Shop Systems a lot of requirements that we had were not implemented in solr. As we now had a deeper look into how far the standard has come, we saw that BlockJoin and faceting on children

Re: Creating a SolrJ Data Service to send JSON to Solr

2016-08-17 Thread Jennifer Coston
Thank you Alex and Anshum! I will look into both of these. Jennifer From: Anshum Gupta To: solr-user@lucene.apache.org Date: 08/16/2016 08:17 PM Subject:Re: Creating a SolrJ Data Service to send JSON to Solr I would also suggest sending the JSON

Re: Increasing filterCache size and Java Heap size

2016-08-17 Thread Toke Eskildsen
On Wed, 2016-08-17 at 11:02 +0800, Zheng Lin Edwin Yeo wrote: > Would like to check, do I need to increase my Java Heap size for > Solr, if I plan to increase my filterCache size in solrconfig.xml? > > I'm using Solr 6.1.0 It _seems_ that you can specify a limit in megabytes when using LRUCache

What does refCount denotes in solr admin

2016-08-17 Thread kshitij tyagi
Hi, I need to understand what is refcount in stats section of solr admin. I am seeing refcount: 2 on my solr cores and on one of the core i am seeing refcount:171. The core with refcount with higher number is having very slow indexing speed?

RE: solr-6.1.0 - Using different client and server certificates for authentication doesn't work

2016-08-17 Thread Kostas
This is what helped me: https://gist.github.com/jankronquist/6412839 -Original Message- From: Kostas [mailto:k...@dataverse.gr] Sent: Tuesday, July 26, 2016 3:22 PM To: solr-user@lucene.apache.org Subject: solr-6.1.0 - Using different client and server certificates for authentication

Re: Increasing filterCache size and Java Heap size

2016-08-17 Thread Zheng Lin Edwin Yeo
Hi Erick, Thanks for your reply. But do we have to set the Java Heap size based on all the collections available (if I were to increase the filterCache size for all my collections)? I come across this from StackOverFlow,

Re: Indexing (posting document) taking a lot of time

2016-08-17 Thread kshitij tyagi
I am posting json using curl. On Wed, Aug 17, 2016 at 4:41 AM, Alexandre Rafalovitch wrote: > What format are those documents? Solr XML? Custom JSON? > > Or are you sending PDF/binary documents to Solr's extract handler and > asking it to do the extraction of the useful

Re: Multiple rollups/facets in one streaming aggregation?

2016-08-17 Thread Radu Gheorghe
Thanks a lot, Joel, for your very fast and informative reply! We'll chew on this and add a Jira if we're going on this route. -- Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch Support * http://sematext.com/ On Tue, Aug 16, 2016 at 8:29 PM, Joel Bernstein

Re: Increasing filterCache size and Java Heap size

2016-08-17 Thread Erick Erickson
Yes. Each entry is roughly 1K + maxdoc/8 bytes. The maxdoc/8 is the bitmap that holds the result set and the 1K is just overhead for the text of the query itself and cache overhead. Usually it's safe to ignore since the maxdoc/8 usually dominates by a wide margin. Best, Erick On Tue, Aug 16,