Re: Is anybody using Config API/configoverlay.json, useParams/params.json, and/or initParams?

2016-02-26 Thread Alexandre Rafalovitch
Thanks Erik, I know the examples use all of that. It was quite a surprise to discover the films and files examples. I even felt a need to write the blog post explaining where ALL of the examples and Solr homes hide in the distribution:

Re: Newest docs added

2016-02-26 Thread Shawn Heisey
On 2/26/2016 3:53 PM, Toke Eskildsen wrote: > MarkG wrote: >> Is there a way anyone can recommend to identify newly added docs to a Solr >> index. Ie: I have some new docs. I update the index with the new doc and >> this happens on a regular basis, say every 4 weeks. I want

Re: Newest docs added

2016-02-26 Thread Toke Eskildsen
MarkG wrote: > Is there a way anyone can recommend to identify newly added docs to a Solr > index. Ie: I have some new docs. I update the index with the new doc and > this happens on a regular basis, say every 4 weeks. I want to be able to > distinguish the docs that are new

Newest docs added

2016-02-26 Thread MarkG
Is there a way anyone can recommend to identify newly added docs to a Solr index. Ie: I have some new docs. I update the index with the new doc and this happens on a regular basis, say every 4 weeks. I want to be able to distinguish the docs that are new given in a certain number of days, say 10
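One common way to approach this, sketched here under assumptions that do not come from the thread: each document carries a date field populated at index time, and a range filter with Solr date math selects the recent ones. The field name `timestamp` and the helper `recent_docs_filter` are hypothetical.

```python
def recent_docs_filter(field: str, days: int) -> str:
    """Build a Solr range filter matching docs whose date field falls in the last `days` days."""
    if days <= 0:
        raise ValueError("days must be positive")
    # NOW-<n>DAYS is Solr date math; Solr evaluates it at query time.
    return f"{field}:[NOW-{days}DAYS TO NOW]"


# "timestamp" is a hypothetical field name, not one taken from the thread.
fq = recent_docs_filter("timestamp", 10)
print(fq)
```

The resulting string would be passed as an `fq` (or `q`) parameter on a select request; whether this fits depends on the schema already having such a field.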

Re: Disable phrase search in edismax?

2016-02-26 Thread Ahmet Arslan
Hi, If you don't set (phrase fields) pf* parameters, phrase creation is automatically disabled, no? Ahmet On Friday, February 26, 2016 11:51 PM, Walter Underwood wrote: I’m creating a query from MLT terms, then sending it to edismax. The neighboring words in the

Re: Thread Usage

2016-02-26 Thread Toke Eskildsen
Azazel K wrote: [Toke: 1 shard instead of multiple?] > The nodes were unstable when we had single shard setup. > It used to run OOM frequently. Fair enough. [Toke: Use a queue instead of 1000+ concurrent requests?] > There are 16CPU on each node. Requests are

Disable phrase search in edismax?

2016-02-26 Thread Walter Underwood
I’m creating a query from MLT terms, then sending it to edismax. The neighboring words in the query are not meaningful phrases. Is there a way to turn off phrase creation and search for one query? Or should I separate them all with “OR”? wunder Walter Underwood wun...@wunderwood.org

Re: Is anybody using Config API/configoverlay.json, useParams/params.json, and/or initParams?

2016-02-26 Thread Erik Hatcher
data_driven /browse does. And example/files builds upon that a lot more. I did it that way to personally explore the configset feature. Erik > On Feb 26, 2016, at 16:12, Alexandre Rafalovitch wrote: > > Hi, > > I am creating an explanation of solrconfig.xml for the

Re: Thread Usage

2016-02-26 Thread Azazel K
> There is a non-trivial overhead for sharding: Using a single shard increases > throughput. Have you tried with 1 shard to see if the latency is acceptable > for that? The nodes were unstable when we had single shard setup. It used to run OOM frequently. Ops team setup a cronjob to clear

Is anybody using Config API/configoverlay.json, useParams/params.json, and/or initParams?

2016-02-26 Thread Alexandre Rafalovitch
Hi, I am creating an explanation of solrconfig.xml for the beginners and want to know whether anybody is actually using overrides and initParams in the wild. Sometimes, features exist for edge cases, but may not be worth spending much attention on in the beginner docs. Any feedback (on the list

Re: Shard State vs Replica State

2016-02-26 Thread Jeff Wartes
I believe the shard state is a reflection of whether that shard is still in use by the collection, and has nothing to do with the state of the replicas. I think doing a split-shard operation would create two new shards, and mark the old one as inactive, for example. On 2/26/16, 8:50 AM,

Escaping characters in a nested query

2016-02-26 Thread Jamie Johnson
When using nested queries of the form q=_query_:"my_awesome:query", what needs to be escaped in the query portion? Just using the admin UI the following works _query_:"+field\\:with\\:special" _query_:"+field\\:with\\~special" _query_:"+field\\:with\\" but the same doesn't work for quotes, i.e.

Re: Thread Usage

2016-02-26 Thread Toke Eskildsen
Azazel K wrote: > We have solr cluster with 2 shards running 2 nodes on each shard. > They are beefy physical boxes with index size of 162 GB , RAM of > about 96 GB and around 153M documents. There is a non-trivial overhead for sharding: Using a single shard increases

[ANNOUNCE] YCSB 0.7.0 Release

2016-02-26 Thread Kevin Risden
On behalf of the development community, I am pleased to announce the release of YCSB 0.7.0. Highlights: * GemFire binding replaced with Apache Geode (incubating) binding * Apache Solr binding was added * OrientDB binding improvements * HBase Kerberos support and use single connection * Accumulo

Re: Query time de-boost

2016-02-26 Thread shamik
Thanks Walter, I've tried this earlier and it works. But the problem in my case is that I'm boosting on a few Source parameters as well. My ideal "bq" should look like this: *bq=Source:simplecontent^10 Source:Help^20 (*:* -ContentGroup-local:("Developer"))^99* But this is not going to work. I'm

Thread Usage

2016-02-26 Thread Azazel K
Hi, We have solr cluster with 2 shards running 2 nodes on each shard. They are beefy physical boxes with index size of 162 GB , RAM of about 96 GB and around 153M documents. Two times this week we have seen the thread usage spike from the usual 1000 to 4000 on all nodes at the same time

Re: Solr does not receive any documents by nutch

2016-02-26 Thread Shawn Heisey
On 2/26/2016 9:22 AM, Merlin Morgenstern wrote: > during the nutch run there are no activities inside the logfile as it seems. Are you looking at the actual *logfile*, or the "Logging" tab in the admin UI? The Logging tab will only show you entries that are at least WARN severity. Most of what

Shard State vs Replica State

2016-02-26 Thread Dennis Gove
In clusterstate.json (or just state.json in new versions) I'm seeing the following "shard1":{ "range":"8000-d554", "state":"active", "replicas":{ "core_node7":{ "core":"people_shard1_replica3", "base_url":"http://192.168.2.32:8983/solr",

Re: Solr does not receive any documents by nutch

2016-02-26 Thread Shawn Heisey
On 2/26/2016 9:22 AM, Merlin Morgenstern wrote: > during the nutch run there are no activities inside the logfile as it seems. > However the logfile from the admin interface shows the following: > > 2/26/2016, 5:20:04 PM WARN null SolrConfig Couldn't add files from >

Re: Solr does not receive any documents by nutch

2016-02-26 Thread Merlin Morgenstern
during the nutch run there are no activities inside the logfile as it seems. However the logfile from the admin interface shows the following: 2/26/2016, 5:20:04 PM WARN null SolrConfig Couldn't add files from /usr/local/Cellar/solr/5.4.1/contrib/extraction/lib filtered by .*\.jar to classpath:

Re: Solr does not receive any documents by nutch

2016-02-26 Thread Shawn Heisey
On 2/26/2016 8:34 AM, Merlin Morgenstern wrote: > Unfortunately no documents get added to solr and no error log entries show > up. It seems as it would be working, but the documents are not there. Is there anything happening in the Solr logfile at all during the nutch run? I'm talking about

Solr does not receive any documents by nutch

2016-02-26 Thread Merlin Morgenstern
I have nutch 1.11 installed together with solr 4.10.4 AND solr 5.4.1 on OS X 10.11. Nutch and Solr seem to work as nutch starts to index and solr shows the admin interface together with the configured core. Unfortunately no documents get added to solr and no error log entries show up. It seems

Re: Deleting by query

2016-02-26 Thread Marc Burt
Thanks Jan, That worked. Kind Regards, Marc On 02/26/2016 01:43 PM, Jan Høydahl wrote: Hi Try this instead /solr/de/update?stream.body=<delete><query>last_seen:[* TO 2016-02-24T00:00:00Z]</query></delete>&commit=true …that is if you have streaming enabled in solrconfig. Else do a POST instead. Note that I put a commit=true at

Re: Solr | index | Lock Type

2016-02-26 Thread Shawn Heisey
On 2/26/2016 7:48 AM, Prateek Jain J wrote: > WARN - 2016-02-26 05:49:29.191; org.apache.solr.core.SolrCore; [cm_history] > WARNING: Solr index directory '/foo/solr/cm_history/data/index/' is locked. > Unlocking... > WARN - 2016-02-26 05:49:29.680; org.apache.solr.rest.ManagedResource; No >

Re: Query time de-boost

2016-02-26 Thread Jack Krupansky
Could you share your actual numbers and test case? IOW, the document score without ^0.01 and with ^0.01. Again, to repeat, the specific boost factor may be positive, but the effect of a fractional boost is to reduce, not add, to the score, so that a score of 0.5 boosted by 0.1 would become 0.05.
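The arithmetic described above can be checked with a few lines of Python. This is only an illustration of a multiplicative boost, not Solr's scoring code; `apply_boost` is a hypothetical helper introduced for the example.

```python
import math


def apply_boost(score: float, boost: float) -> float:
    """Multiplicative boost: the boosted score is score * boost."""
    return score * boost


boosted = apply_boost(0.5, 0.1)

# A positive boost below 1.0 lowers the score instead of raising it.
assert math.isclose(boosted, 0.05)
assert 0 < boosted < 0.5
```

The boost factor stays positive throughout; it is the value below 1.0 that makes the product smaller than the original score.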

Solr | index | Lock Type

2016-02-26 Thread Prateek Jain J
Hi All, We are seeing an issue with solr where solr is failing to initialize the cores with the following errors. We have gone through the solr documentation on these errors and it's mentioned that this could happen when solr is running in clustered mode. But in our case solr is deployed in

Update command not working

2016-02-26 Thread Mike Thomsen
I posted this to http://localhost:8983/solr/default-collection/update and it treated it like I was adding a whole document, not a partial update: { "id": "0be0daa1-a6ee-46d0-ba05-717a9c6ae283", "tags": { "add": [ "news article" ] } } In the logs, I found this: 2016-02-26

Re: Deleting by query

2016-02-26 Thread Jan Høydahl
Hi Try this instead /solr/de/update?stream.body=<delete><query>last_seen:[* TO 2016-02-24T00:00:00Z]</query></delete>&commit=true …that is if you have streaming enabled in solrconfig. Else do a POST instead. Note that I put a commit=true at the end, so you will see the changes immediately. -- Jan Høydahl, search solution architect
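For the "do a POST instead" route, a minimal sketch of what the request could look like, assuming the JSON form of Solr's delete-by-query command sent to the update handler with `Content-Type: application/json`. The helper `build_delete_request` is hypothetical; the core name `de` and the query come from the message above. The sketch only builds the path and body and does not send anything.

```python
import json
from urllib.parse import urlencode


def build_delete_request(core: str, query: str, commit: bool = True):
    """Return (path, body) for a JSON delete-by-query POST to the update handler."""
    path = f"/solr/{core}/update"
    if commit:
        # commit=true makes the deletion visible immediately.
        path += "?" + urlencode({"commit": "true"})
    body = json.dumps({"delete": {"query": query}})
    return path, body


path, body = build_delete_request("de", "last_seen:[* TO 2016-02-24T00:00:00Z]")
print(path)
print(body)
```

Running the same query against `/select` first, as in the original question, is a cheap way to confirm which documents the delete would remove.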

Re: Is it different? q=(field1:value1 OR field2:value2) and q=field1:value1 OR field2:value2

2016-02-26 Thread John Blythe
not that i'm aware of. i think you could also simply have q=field1:value field2:value in which the OR is implied -- *John Blythe* Product Manager & Lead Developer 251.605.3071 | j...@curvolabs.com www.curvolabs.com 58 Adams Ave Evansville, IN 47713 On Fri, Feb 26, 2016 at 8:31 AM, Aurélien

Re: Is it different? q=(field1:value1 OR field2:value2) and q=field1:value1 OR field2:value2

2016-02-26 Thread Aurélien MAZOYER
Hi, I think both queries are rewritten to the same query. You can use the debugQuery=on parameter to see how the query is rewritten and then compare whether you get the same result for each query. Regards, Aurélien On 26/02/2016 14:27, vitaly bulgakov wrote: Is there a difference when
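A small sketch of how the two request URLs might be built for that comparison. The core name `mycore` and the helper `debug_url` are hypothetical; `debugQuery=on` is the parameter named above.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def debug_url(core: str, q: str) -> str:
    """Build a select URL with debugQuery=on so the parsed query appears in the response."""
    return f"/solr/{core}/select?" + urlencode({"q": q, "debugQuery": "on"})


# "mycore" is a placeholder core name.
with_brackets = debug_url("mycore", "(field1:value1 OR field2:value2)")
without_brackets = debug_url("mycore", "field1:value1 OR field2:value2")

for url in (with_brackets, without_brackets):
    print(url)
```

Comparing the parsed query reported in the debug section of each response should show whether the brackets change anything for a given query parser.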

Is it different? q=(field1:value1 OR field2:value2) and q=field1:value1 OR field2:value2

2016-02-26 Thread vitaly bulgakov
Is there a difference when we put query in brackets?

Deleting by query

2016-02-26 Thread Marc Burt
Hi, I'm trying to delete by query using the following: /solr/de/update?last_seen:[* TO 2016-02-24T00:00:00.00Z] /solr/de/select?q=last_seen:[* TO 2016-02-24T00:00:00.00Z] returns the correct documents to be deleted. Last time I attempted this using the above I somehow managed to delete

Re: Query time de-boost

2016-02-26 Thread Emir Arnautovic
Hi Jack, I just checked on 5.5 and 0.1 is positive boost. Regards, Emir On 26.02.2016 01:11, Jack Krupansky wrote: 0.1 is a fractional boost - all intra-query boosts are multiplicative, not additive, so term^0.1 reduces the term by 90%. -- Jack Krupansky On Wed, Feb 24, 2016 at 11:29 AM,