Re: minimal solrconfig example

2017-03-01 Thread Aravind Durvasula
Hi David, What is the solr version you are using? To get started, it's better to use the config file that comes out of the box. Thanks, Aravind -- View this message in context: http://lucene.472066.n3.nabble.com/minimal-solrconfig-example-tp4322977p4322978.html Sent from the Solr - User

minimal solrconfig example

2017-03-01 Thread David Michael Gang
Hi all, I want to create my first solr collection I found an example of solrconfig here. https://github.com/apache/lucene-solr/blob/master/solr/example/files/conf/solrconfig.xml This is a file of more than thousand lines. As i understand this file shows all possible configurations possible. What

Re: SOLR JOIN

2017-03-01 Thread Zheng Lin Edwin Yeo
If you're using Solr 6, you can use streaming expressions: https://cwiki.apache.org/confluence/display/solr/Streaming+Expressions Regards, Edwin On 1 March 2017 at 18:27, vrindavda wrote: > Hi Nitin, > > You can use Streaming Expressions >

Re: using HttpSolrServer with PoolingHttpClientConnectionManager

2017-03-01 Thread Renee Sun
Thank you Shawn! this is very helpful. Renee -- View this message in context: http://lucene.472066.n3.nabble.com/using-HttpSolrServer-with-PoolingHttpClientConnectionManager-tp4322905p4322972.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: bin/solr -a doesn't work?

2017-03-01 Thread Zheng Lin Edwin Yeo
Hi Markus, Maybe you can post the script or error message here, so we can have a better understanding of the situation. Regards, Edwin On 1 March 2017 at 19:53, Markus Jelsma wrote: > Hello, > > Because we upload large files to Zookeeper, i tried: > > bin/solr

RE: Excessive Wire logging while indexing. Blank output from tika parser

2017-03-01 Thread Phil Scadden
Belay that. I found out why parser was just returning empty data - I didn’t have the right artefact in maven. In case anyone else trips on this: org.apache.tika tika-core 1.12 org.apache.tika tika-parsers

OR condition between !frange and normal query

2017-03-01 Thread Zheng Lin Edwin Yeo
Hi, Would like to check, how can we do an OR condition between !frange and normal query? For example, I want to have the following condition in my query: ({!frange l=1}ms(startDate_dt,endDate_dt) OR (startDate:[2000-01-01T00:00:00Z TO *] AND endDate:[2016-12-31T23:59:59Z])) How can we put it

Re: JOIN query

2017-03-01 Thread Zheng Lin Edwin Yeo
Hi Nitin, Probably you can look at the Streaming Expressions here: https://cwiki.apache.org/confluence/display/solr/Streaming+Expressions This is available from Solr 6. Regards, Edwin On 28 February 2017 at 16:45, Nitin Kumar wrote: > Hi, > > Can we use join query

Re: OOM

2017-03-01 Thread Rick Leir
Thanks Shawn, of course it must be the -Xmx. It is interesting that we do not see the OOM until restarting. On March 1, 2017 8:18:11 PM EST, Shawn Heisey wrote: >On 2/27/2017 4:57 PM, Rick Leir wrote: >> We get an OOM after stopping then starting Solr (with a tiny index).

RE: Excessive Wire logging while indexing.

2017-03-01 Thread Phil Scadden
>Another side issue: Using the extracting handler for handling rich documents >is discouraged. Tika (which is what is used by the extracting >handler) is pretty amazing software, but it has a habit of crashing or >consuming all the heap memory when it encounters a document that it doesn't

Re: Updating 100 documents in one request

2017-03-01 Thread Rick Leir
JSONL is a very simple storage format, and that makes it attractive. You just append lines to a file, with each line being the JSON representation of a document. Much simpler than NoSQL databases. See jsonlines.org http://jsonlines.org/ Cheers -- Rick On March 1, 2017 2:11:49 PM EST, kshitij

Re: Implicit routing, delete on specific shard

2017-03-01 Thread Damien Kamerman
I assume with the implicit router you would do something like curl " http://127.0.0.1:8983/solr/collection1_20170220_replica1/update?commit= false" On 28 February 2017 at 22:39, philippa griggs wrote: > Hello, > > > Solr 5.4.1 using Solr Cloud, multiple cores

Re: OOM

2017-03-01 Thread Shawn Heisey
On 2/27/2017 4:57 PM, Rick Leir wrote: > We get an OOM after stopping then starting Solr (with a tiny index). Is > there something I could check quickly before I break out the Eclipse > debugger? Maybe Marple could tell me about problems in the index? There are exactly two ways of dealing with

Re: About editing managed-schema by hand

2017-03-01 Thread Shawn Heisey
2/27/2017 4:46 AM, Issei Nishigata wrote: > Thank you for your reply. If I was to say which one, I'd maybe be > talking about the concept for Solr. I understand we should use > "ClassicSchemaFactory" when we want to hand-edit, but why are there > two files, schema.xml and managed-schema, in spite

RE: Excessive Wire logging while indexing.

2017-03-01 Thread Phil Scadden
The logging is coming from application which is running in Tomcat. Solr itself is running in the embedded Jetty. And yes, another look at the log4j and I see that rootlogger is set to DEBUG. I've changed that/ >On the Solr server side, the 6.4.x versions have a bug that causes extremely >high

Re: maxwarmingSearchers and memory leak

2017-03-01 Thread Shawn Heisey
On 2/26/2017 6:40 AM, SOLR4189 wrote: > Shawn, you are right. > * OS vendor and version > CentosOS 6.5 > > * Java vendor and version > OpenJDK version 1.8.0_20 > OpenJDK 64-bit Server VM (build 25.20-b23) > > * Servlet container used to start Solr. > Catalina(tomcat7) > > * Total amount of

Re: Upgrading solr 5.2 to solr 5.5

2017-03-01 Thread Shawn Heisey
On 2/28/2017 8:45 AM, skasab2s wrote: > we are trying to upgrade solr 5.2 to solr 5.5. > > In solr 5.2 we had an own, customised version of solr.war and we put it > under > > solr-5.2.1/server/webapps/ (we overwrote the default solr.war), > > so on the solr startup, the war was extracted to >

Re: using HttpSolrServer with PoolingHttpClientConnectionManager

2017-03-01 Thread Shawn Heisey
On 3/1/2017 3:13 PM, Renee Sun wrote: > first of all I apologize for the length of this message ... there are few > questions I would appreciate your help please: > > 1. originally I wanted to use solrj in my application layer (webapp deployed > with tomcat), to query the solr server(s) with

Re: Excessive Wire logging while indexing.

2017-03-01 Thread Shawn Heisey
On 3/1/2017 4:41 PM, Phil Scadden wrote: > Using Solr 6.4.1 on windows. Installed and trial POST on my directories > worked okay. However, now trying to create an index from code running on > tomcat on the same machine as SOLR server with my own schema. Indexing of PDF > is very slow.

Excessive Wire logging while indexing.

2017-03-01 Thread Phil Scadden
Using Solr 6.4.1 on windows. Installed and trial POST on my directories worked okay. However, now trying to create an index from code running on tomcat on the same machine as SOLR server with my own schema. Indexing of PDF is very slow. Investigating that find my tomcat output full of wire

using HttpSolrServer with PoolingHttpClientConnectionManager

2017-03-01 Thread Renee Sun
first of all I apologize for the length of this message ... there are few questions I would appreciate your help please: 1. originally I wanted to use solrj in my application layer (webapp deployed with tomcat), to query the solr server(s) with multi-cores, non-cloud setup. Since I need send

Re: Distributed Search: Wrong count?

2017-03-01 Thread Kelly, Frank
Quick extra clarification – the documents in question we are searching for are child documents we are searching direct (no parent/child in the query) -Frank From: Frank J Kelly > Reply-To:

Re: Updating 100 documents in one request

2017-03-01 Thread kshitij tyagi
thanks everyone for your inputs, we are using solr 5.1 as of now. @rick/walter Can you please explain or provide link for entire set of loaded documents is saved as JSONL in S3 would be helpfull. Regards, Kshitij On Wed, Mar 1, 2017 at 10:06 PM, Walter Underwood wrote:

Distributed Search: Wrong count?

2017-03-01 Thread Kelly, Frank
Environment: SolrCloud 5.3 Collection has 12.3m docs split across 3 shards and 3 replicas In the query below I get one document ID returned but a numFound of 365 { "responseHeader":{ "status":0, "QTime":47, "params":{ "q":"haUserId: AND haAccountType:google AND

Re: Using parameter values in a sort

2017-03-01 Thread Erik Hatcher
FYI - I recalled, and located, a solr-user thread from 2015 with subject “Parameter Substitution” with this same issue. Tricky issue - overloaded `${` usage and interpretation time. Erik > On Mar 1, 2017, at 1:41 PM, Erik Hatcher wrote: > > I just found a

Re: Using parameter values in a sort

2017-03-01 Thread Erik Hatcher
I just found a workaround, check out this trick: ${sort_field:${sort_field}} desc when the core is loaded, it looks for a system property “sort_field”, doesn’t find it, and defaults the value to ${sort_field} and voila: /browse?q=*:*=xml_field=id Erik > On Mar 1, 2017,

Re: Using parameter values in a sort

2017-03-01 Thread Erik Hatcher
Walter - Apologies for not trying this sooner first-hand. I’m used to passing in all the params (even the dynamic ${…} ones) in the request these days, not so much putting them into request handler definitions. I finally tried it with a default (master/trunk) with modifying the /browse

Re: Stored value for highlighting from different field?

2017-03-01 Thread Caruana, Matthew
Many of our field values are large, so we want to use the unified highlighter for its performance benefits. Development also seems to be focussed on that highlighter. > On 1 Mar 2017, at 19:07, Rick Leir wrote: > > Matthew, Is TVH term vector highlighter an option? Just a

Re: Stored value for highlighting from different field?

2017-03-01 Thread Rick Leir
Matthew, Is TVH term vector highlighter an option? Just a suggestion. Cheers -- Rick -- Sent from my Android device with K-9 Mail. Please excuse my brevity.

Stored value for highlighting from different field?

2017-03-01 Thread Caruana, Matthew
We’re currently using copyField directives in our schema to copy the same text to different fields that use different analysers. For example, assuming the original field contained in the document payload sent to the update handler is called “tika_output", it is copied to “text”,

Re: Updating 100 documents in one request

2017-03-01 Thread Walter Underwood
That is exactly what we do. The entire set of loaded documents is saved as JSONL in S3. Very handy for loading up a prod index in test for diagnosis or benchmarking. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Mar 1, 2017, at 8:14 AM, Rick

Re: Updating 100 documents in one request

2017-03-01 Thread Rick Leir
And perhaps put the crawl results in JSONL, so when you get a 404 you can use yesterdays document in a pinch. Cheers -- Rick On March 1, 2017 10:20:21 AM EST, Walter Underwood wrote: >Since I always need to know which document was bad, I back off to >batches of one

solr to solrcloud

2017-03-01 Thread Walker, Darren
Our out of the box solr 5.4.1 installation cannot handle the 50gb analytics index anymore. We are using sitecore 8.1 and planning to go to 8.2 but when we tried went to 8.2 we rebuild the indexes but the sight was very unresponsive and was missing items and was too slow. We ended up giving that

Delta Import JDBC connection frame size larger than max length

2017-03-01 Thread Liu, Daphne
Hello Solr experts, Is there a place in Solr (Delta Import Datasource?) where I can adjust the JDBC connection frame size to 256 mb ? I have adjusted the settings in Cassandra but I'm still getting this error. NonTransientConnectionException:

Re: Updating 100 documents in one request

2017-03-01 Thread Walter Underwood
Since I always need to know which document was bad, I back off to batches of one document when there is a failure. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Mar 1, 2017, at 6:25 AM, Erick Erickson wrote: > > What

Re: Updating 100 documents in one request

2017-03-01 Thread Erick Erickson
What version of Solr? This was a pretty long-standing issue that was fixed in Solr 6.1, see: https://issues.apache.org/jira/browse/SOLR-445 Otherwise you really have to write your code to re-transmit sub-packets, perhaps even one at a time when a packet fails. Best, Erick On Wed, Mar 1, 2017 at

Re: Susbcribe

2017-03-01 Thread Steve Rowe
Hi Pankaj, To subscribe, send an email to . More info here: . -- Steve www.lucidworks.com

looking for a way to get structured nested response from solr

2017-03-01 Thread ankur bansal
Hi, I am new SOLR user, I am trying to use SOLR in a application where we have 1 core with 1 config file having multiple queries. We have root entities and many sub entities as well. Currently I am getting a response of something like this "response":{"numFound":1,"start":0,"docs":[ {

bin/solr -a doesn't work?

2017-03-01 Thread Markus Jelsma
Hello, Because we upload large files to Zookeeper, i tried: bin/solr restart -c -m 1500m -a "-Djute.maxbuffer=0xF2" But the script keeps hanging, and no Solr is started. The -a parameter doesn't seem to work. I am missing something very obvious? Thanks, Markus

Updating 100 documents in one request

2017-03-01 Thread kshitij tyagi
Hi Team, I am facing an issue when I am updating more than 1 document on solr. 1. If any 1 document gives 400 error them my other documents are also not updated. How can I approach to solve this? I need my other documents to be indexed which are not giving 400 error. Help appreciated!

Re: SOLR JOIN

2017-03-01 Thread vrindavda
Hi Nitin, You can use Streaming Expressions for Joins in SolrCloud only (For Collections Not Core). Again this can affect you performance, I would suggest to copy fields from one collection to another any seamlessly use

Re: Having clause in Faceting

2017-03-01 Thread Zheng Lin Edwin Yeo
Hi, Is it possible to do this Having Clause in Solr Facet or JSON Facet query? Regards, Edwin On 28 February 2017 at 19:36, Zheng Lin Edwin Yeo wrote: > Hi, > > Would like to check, how can we put the having clause (having sum(fieldC) > > 1000) in the following example