Multiple Queries using spark and solr

2017-03-25 Thread Matt Magnusson
Hello: I'm interested in querying Solr as a Spark RDD. Has anyone used the Lucidworks spark-solr API (https://github.com/lucidworks/spark-solr) to issue multiple queries? I'd like it to execute multiple queries and combine the top n results into one Spark RDD for further analysis. The

Re: ComplexPhrase WildCard Case Sensitivy problem

2017-03-25 Thread Eyyub Çil
Hi, my fieldType definition is below. When I search without ComplexPhraseQueryParser, the number of results is the same for "60 YAŞIND*" and "60 yaşınd*". { "responseHeader":{ "zkConnected":true, "status":0, "QTime":5, "params":{ "q":"60 YAŞIND*", "indent":"on", "rows":"0", "wt":"json",

Re: Solr Delete By Id Out of memory issue

2017-03-25 Thread Rohit Kanchan
I think we figured out the issue. When we were converting delete by query in a Solr handler, we were not making a deep copy of the BytesRef. We were keeping a reference to the same object, which was causing the old deletes (LinkedHashMap) to add more than 1K entries. But I think it is still not clearing those 1K

Re: ComplexPhrase WildCard Case Sensitivy problem

2017-03-25 Thread Erick Erickson
Mikhail: I just checked and I don't think CPQP is passing the terms through the lowercase filter when it's a phrase. I used the text_tr fieldType in the techproducts schema and get these relevant bits: q=eoe:YASIN* parsedquery: "eoe:yasın*" q={!complexphrase}eoe:YASIN* parsedquery: "eoe:yasın*"
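To illustrate why the Turkish lowercase filter matters for a term like YASIN*, here is a minimal Python sketch of Turkish-style lowercasing. It is not Solr code: the helper name `turkish_lower` is hypothetical, and it only mimics the dotted/dotless "i" mapping that a Turkish-aware lowercase step applies.

```python
def turkish_lower(text: str) -> str:
    """Lowercase text using the Turkish mapping for dotted and dotless i."""
    # Map the two Turkish-specific capitals first. Plain str.lower() would
    # turn "I" into "i", and "İ" into "i" followed by a combining dot.
    return text.replace("I", "ı").replace("İ", "i").lower()


if __name__ == "__main__":
    print(turkish_lower("YASIN*"))   # Turkish-aware lowercasing
    print("YASIN*".lower())          # locale-independent lowercasing
```

If a wildcard term skips this step at query time, or gets the locale-independent version instead, it will not match tokens that were indexed with the Turkish mapping.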

Re: ComplexPhrase WildCard Case Sensitivy problem

2017-03-25 Thread Mikhail Khludnev
Hello, it sounds similar to https://lucidworks.com/2011/11/29/whats-with-lowercasing-wildcard-multiterm-queries-in-solr/ (yep, I see TurkishLowerCaseFilterFactory there). What's your analyzer for this field? Does it work for plain wildcard queries? I'm asking because CPQP adds an oddity sometimes.

Exception while integrating UIMA with Solr

2017-03-25 Thread aruninfo100
Hi, I am trying to integrate UIMA with Solr. I have created the PEAR file (packaged and installed): opennlp.uima.OpenNlpTextAnalyzer_pear.xml. I am using the OpenNLP UIMA annotators for document analysis. The PEAR (XML) file is located at path: D:\temp\opennlp.uima.OpenNlpTextAnalyzer *folder

ComplexPhrase WildCard Case Sensitivy problem

2017-03-25 Thread Eyyub Çil
Hello, I have a problem with ComplexPhrase and TurkishLowerCaseFilterFactory. When I search like {!complexphrase}SContent_tinx:"6* YAŞINDA", {!complexphrase}SContent_tinx:"6* yaşında", the results are correct. { "responseHeader":{ "zkConnected":true, "status":0, "QTime":104, "params":{

Re: Exception in export handler when using fq collapse that only returns one result

2017-03-25 Thread Joel Bernstein
I would consider this a bug. Collapse has never really been tested with export. But this would certainly speed up the unique Streaming Expression. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Mar 24, 2017 at 7:46 AM, David Svånå wrote: > Hello, > > Summary: when

Re: Difference between hashJoin and innerJoin in Streaming Expression

2017-03-25 Thread Zheng Lin Edwin Yeo
Hi Joel, Thanks for the information. Regards, Edwin On 25 March 2017 at 10:15, Joel Bernstein wrote: > The innerJoin is a merge join and the hashJoin is a hash join. > > The merge join can support joins of unlimited size and never runs out of > memory. But it requires
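The difference between the two join strategies can be sketched in plain Python. This is a generic illustration of hash joins and merge joins, not the Solr Streaming Expression implementation. The function names, the `key` parameter, and the assumption that `merge_join` receives both inputs already sorted on the join key are all illustrative.

```python
from collections import defaultdict
from itertools import groupby
from operator import itemgetter


def hash_join(left, right, key):
    """Load all of `right` into an in-memory table, then stream `left` past it."""
    table = defaultdict(list)
    for row in right:
        table[row[key]].append(row)
    for row in left:
        for match in table.get(row[key], []):
            yield {**match, **row}


def merge_join(left, right, key):
    """Walk two inputs sorted on `key`, buffering only one key group at a time."""
    get = itemgetter(key)
    lgroups = groupby(left, key=get)
    rgroups = groupby(right, key=get)
    lk, lrows = next(lgroups, (None, None))
    rk, rrows = next(rgroups, (None, None))
    while lrows is not None and rrows is not None:
        if lk < rk:
            lk, lrows = next(lgroups, (None, None))
        elif lk > rk:
            rk, rrows = next(rgroups, (None, None))
        else:
            # Keys match: buffer the right-hand group and pair it with each left row.
            rbuf = list(rrows)
            for lrow in lrows:
                for rrow in rbuf:
                    yield {**rrow, **lrow}
            lk, lrows = next(lgroups, (None, None))
            rk, rrows = next(rgroups, (None, None))


if __name__ == "__main__":
    left = [{"id": 1, "a": "x"}, {"id": 2, "a": "y"}, {"id": 4, "a": "z"}]
    right = [{"id": 2, "b": "p"}, {"id": 3, "b": "q"}, {"id": 4, "b": "r"}]
    print(list(hash_join(left, right, "id")))
    print(list(merge_join(left, right, "id")))
```

In this sketch the hash join's memory use grows with the size of the side it loads into the table, while the merge join holds only the rows for the current key.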