Re: Getting error while excuting full import

2017-04-18 Thread ankur.168
Thanks for enlightening, Shawn :) I thought DIH does parallel db request for all the entities defined in a document. I do believe that DIH is easier to use that's why I am trying to find a way to use this in my current system. But as I explained above since I have so many sub entities,each

Re: extract multi-features for one solr feature extractor in solr learning to rank

2017-04-18 Thread Jianxiong Dong
Hi, Michael, Thank for very valuable feedbacks. > You can pass in different params in the > features.json config for each feature, even though they use the same > feature class. I used this idea to extract some features in this paper

Re: Security.json file caused Solr to stop working

2017-04-18 Thread Zheng Lin Edwin Yeo
This was due to a missing comma in the JSON file. { "authentication":{ "blockUnknown": false, "class":"solr.BasicAuthPlugin", "credentials":{"solr":"IV0EHq1OnNrj6gvRCwvFwTrZ1+z1oBbnQdiVC3otuq0= Ndd7LKvVBAaZIF0QAVi1ekCfAJXr1GGfLtRUXhgrF8c="} }, "authorization":{

Re: prefix facet performance

2017-04-18 Thread Maria Muslea
Hmmm, not sure. Probably in the range of 100K-500K. Before writing the email I was just looking at: http://yonik.com/facet-performance/ Wow, using facet.method=enum makes a big difference. I will read on it to understand what it does. Thank you so much. Maria On Tue, Apr 18, 2017 at 5:21 PM,

Re: prefix facet performance

2017-04-18 Thread Yonik Seeley
How many unique values in the index? You could try facet.method=enum -Yonik On Tue, Apr 18, 2017 at 8:16 PM, Maria Muslea wrote: > Hi, > > I have ~40K documents in SOLR (not many) and a multivalued facet field that > contains at least 2K values per document. > > The

prefix facet performance

2017-04-18 Thread Maria Muslea
Hi, I have ~40K documents in SOLR (not many) and a multivalued facet field that contains at least 2K values per document. The values of the facet field look like: A/B, A/C, A/D, C/E, M/F, etc, and I use facet.prefix. q=*:*=0=true=concept=A/ with "concept" defined as: This generates the

Re: AnalyzingInfixSuggester performance

2017-04-18 Thread Michael McCandless
It also indexes edge ngrams for short sequences (e.g. a*, b*, etc.) and switches to ordinary PrefixQuery for longer sequences, and does some work to at search time to do the "infixing". But yeah otherwise that's it. If your ranking at lookup isn't exactly matching the weight, but "roughly" has

Re: Getting error while excuting full import

2017-04-18 Thread Mikhail Khludnev
Ok. I've checked AbstractSqlEntityProcessorTestCase. Please make the next attempt with where="PROPERTY_ID=propertiesList.PROPERTY_ID" On Tue, Apr 18, 2017 at 4:35 PM, ankur.168 wrote: > Yes, both column names are same. But if we just use property_id=property_id > in

Re: AnalyzingInfixSuggester performance

2017-04-18 Thread OTH
I see. I had actually overlooked the fact that Suggester provides a 'weightField', and I could possibly use that in my case instead of the regular Solr index with bq. So if I understand then - the main advantage of using the AnalyzingInfixSuggester instead of a regular Solr index (since both are

Re: How to change stateFomat to 2

2017-04-18 Thread Erick Erickson
There should be no need to set CLUSTERPROP more than once, it's a characteristic of your entire, well, cluster. See clusterprops.json in your admin UI>>tree view. Best, Erick On Tue, Apr 18, 2017 at 10:21 AM, Manohar Sripada wrote: > Thanks Erick! > state.json exists for

Re: SolrJ and Streaming

2017-04-18 Thread Joe Obernberger
Thank you Joel; exactly what I needed! Just had to change it to use CloudSolrStream instead. Much appreciated! -Joe On 4/18/2017 3:21 PM, Joel Bernstein wrote: Are you trying to send a streaming expression using SolrJ? If you are you can send the expression with the SolrStream. for

Re: SolrJ and Streaming

2017-04-18 Thread Joel Bernstein
paramsLoc in my last email should be params Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Apr 18, 2017 at 3:21 PM, Joel Bernstein wrote: > Are you trying to send a streaming expression using SolrJ? > > If you are you can send the expression with the SolrStream. for

Re: SolrJ and Streaming

2017-04-18 Thread Joel Bernstein
Are you trying to send a streaming expression using SolrJ? If you are you can send the expression with the SolrStream. for example: params = new ModifiableSolrParams(); params.set("expr", expr); params.set("qt", "/stream"); SolrStream stream = new SolrStream(url, paramsLoc); try {

SolrJ and Streaming

2017-04-18 Thread Joe Obernberger
Hi All - any examples of using solrJ and streaming expressions available? Like calling UpdateStream from solrJ? Thank you! -Joe

Re: How to change stateFomat to 2

2017-04-18 Thread Manohar Sripada
Thanks Erick! state.json exists for each collection in the "tree" view of admin UI. So, that format is set to 2. I will call the CLUSTERPROP collections API too and set legacyCloud=false whenever I create a collection. Thanks On Tue, Apr 18, 2017 at 8:50 PM, Erick Erickson

Re: extract multi-features for one solr feature extractor in solr learning to rank

2017-04-18 Thread Michael Nilsson
Hi Jianxiong, What you say is true. If you want 100 different feature values extracted, you need to specify 100 different features in the features.json config so that there is a direct mapping of features in and features out. However, you more than likely need to only implement 1 feature class

Re: How to change stateFomat to 2

2017-04-18 Thread Erick Erickson
clusterstate.json will exist, it just should be empty if you're using state format 2. Note: if you have "state.json" files under each collections in ZK (see the "tree" view in the admin UI), then you _are_ in the format 2 world. However, for Solr 5.x, there'a an obscure property "legacyCloud"

Re: Innerjoin streaming expressions - Invalid JoinStream error

2017-04-18 Thread Joel Bernstein
Interesting, that inverting the on clause worked. Something is not working as designed. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Apr 18, 2017 at 11:09 AM, Dominique Bejean < dominique.bej...@eolya.fr> wrote: > Done > https://issues.apache.org/jira/browse/SOLR-10512 > > Regards. > >

Re: Innerjoin streaming expressions - Invalid JoinStream error

2017-04-18 Thread Dominique Bejean
Done https://issues.apache.org/jira/browse/SOLR-10512 Regards. Dominique Le mar. 18 avr. 2017 à 14:51, Joel Bernstein a écrit : > I looked through the test cases I don't think we have this covered exactly > as it's written. Can you log a jira for this? > > Joel

How to change stateFomat to 2

2017-04-18 Thread Manohar Sripada
After deleting a collection through Collection API, the data is not getting deleted from clusterstate.json. Based on this discussion , it seems clusterstate.json shouldn't be there for Solr 5.x (I am

Re: Running Solr6 on Tomcat7

2017-04-18 Thread Shawn Heisey
On 4/18/2017 7:40 AM, rgummadi wrote: > Is anyone successfull in running Solr6 on Tomcat7. If so can you give > me some pointers on how you did this? Running in this way is officially unsupported. You can most likely do it, but we strongly recommend that you don't. You are on your own when it

Re: Innerjoin streaming expressions - Invalid JoinStream error

2017-04-18 Thread Dominique Bejean
Furthermore, it looks like due to the fact "all incoming stream comparators (sort) must be a superset of this stream's equalitor" condition, it isn't possible to sort the stream on a other field such as for instance in my example pubyear_s (books collection) or review_dt (reviews collection).

Re: Filter Facet Query

2017-04-18 Thread Furkan KAMACI
Hi Alex, I found the reason, thanks for the help. Facet shows all possible values including 0. Could you help on my last question: I have facet results like: "", 9 "research",6 "development",3 I want to filter empty string from my facet "" (I don't want to add it to fq, just filter from

Running Solr6 on Tomcat7

2017-04-18 Thread rgummadi
Is anyone successfull in running Solr6 on Tomcat7. If so can you give me some pointers on how you did this? -- View this message in context: http://lucene.472066.n3.nabble.com/Running-Solr6-on-Tomcat7-tp4330500.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Moving solr home

2017-04-18 Thread Shawn Heisey
On 4/18/2017 7:07 AM, tedsolr wrote: > Looks like the issues are self inflicted. I have custom start/stop > scripts that actually specify the solr home directory as a param to > the start command (start -c -s ...). This was overriding my include > variable. As for the magical solr.xml file, that's

Re: Getting error while excuting full import

2017-04-18 Thread Shawn Heisey
On 4/18/2017 12:58 AM, ankur.168 wrote: > Hi Erick, > > Thanks for replying, As you suggest I can use solrJ to map RDBMS fetched > data and index/search it later on. but DIH gives multi db connection for > full import and other benefits. > Does solrJ supports this or we need to put efforts to make

Re: Getting error while excuting full import

2017-04-18 Thread ankur.168
Yes, both column names are same. But if we just use property_id=property_id in child entity, then how zipper gets to know which child document to merge with which parent? Any how I just tried with ur suggested where condition which result in arrayindexoutofbound exception, here are the logs

Re: Innerjoin streaming expressions - Invalid JoinStream error

2017-04-18 Thread Dominique Bejean
Hi, I reply to myself I just had to invert the "on" clause to make it work curl --data-urlencode 'expr=innerJoin( search(books, q="*:*", fl="id", sort="id

Re: Upgrading cluster from 4 to 5. Slow replication detected.

2017-04-18 Thread Shawn Heisey
On 4/14/2017 2:10 AM, Himanshu Sachdeva wrote: > We're starting to upgrade our solr cluster to version 5.5. So we > removed one slave node from the cluster and installed solr 5.5.4 on it > and started solr. So it started copying the index from the master. > However, we noticed a drop in the

Re: Solr Index size keeps fluctuating, becomes ~4x normal size.

2017-04-18 Thread Shawn Heisey
On 4/10/2017 1:57 AM, Himanshu Sachdeva wrote: > Thanks for your time and quick response. As you said, I changed our > logging level from SEVERE to INFO and indeed found the performance > warning *Overlapping onDeckSearchers=2* in the logs. I am considering > limiting the *maxWarmingSearchers*

Re: Getting error while excuting full import

2017-04-18 Thread Mikhail Khludnev
Hello, Shouldn't it just be where="PROPERTY_ID=PROPERTY_ID'" since fields are named the same in both tables. On Tue, Apr 18, 2017 at 4:02 PM, ankur.168 wrote: > Hi Mikhail, > > I tried with a simplest zipper entity. Here are the config details- > > >

Re: Moving solr home

2017-04-18 Thread tedsolr
Looks like the issues are self inflicted. I have custom start/stop scripts that actually specify the solr home directory as a param to the start command (start -c -s ...). This was overriding my include variable. As for the magical solr.xml file, that's also my doing because as part of shutdown

Re: Innerjoin streaming expressions - Invalid JoinStream error

2017-04-18 Thread Joel Bernstein
I looked through the test cases I don't think we have this covered exactly as it's written. Can you log a jira for this? Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Apr 18, 2017 at 6:33 AM, Dominique Bejean wrote: > Hi, > > I don not understand what I am

Re: Get handler not working

2017-04-18 Thread PeterCiuffetti
We've bumped into this issue too, but it was through the MoreLikeThis query parser. Internally it uses the get handler to obtain the seed document. One of our SOLR collections uses a shard router that is not the document id. The get handler will fail if the value of the document id is not the

Innerjoin streaming expressions - Invalid JoinStream error

2017-04-18 Thread Dominique Bejean
Hi, I don not understand what I am doing wrong il this simple query. curl --data-urlencode 'expr=innerJoin( search(books, q="*:*", fl="id", sort="id asc"),

Re: Solr Child="true" flag in version 6.4

2017-04-18 Thread Alexandre Rafalovitch
I am not sure I can explain it better than the link I gave. Basically you select parent records and then use fl=*,[child] to add children records into that. I mostly wanted to make sure you knew about the flatten-by-default search. Another way is to look for common _root_ field value. It

Index and query time suggester behavior in a SolrCloud environment

2017-04-18 Thread Andrea Gazzarini
Hi, I have a project, with SolrCloud, where I'm going to use the Suggester component (BlendedInfixLookupFactory with DocumentDictionaryFactory). Some info: * I will have a suggest-only collection, with no NRT requirements (indexes will be updated with a daily frequency) * I'm not yet sure

Re: Solr Child="true" flag in version 6.4

2017-04-18 Thread donjose
Hello Alex, Thanks for your reply. This is the first time i am doing with nested entities. Yes you are right am getting flat list combining parent & child. Could you please explain bit more detail, how to apply child transformer for the below mentioned response. Response == {

Re: Getting error while excuting full import

2017-04-18 Thread ankur.168
Hi Mikhail, Thanks for replying, I am currently trying to use zipper join but getting null pointer exception as given below stacktrace 2017-04-18 09:11:51.154 INFO (qtp1348949648-13) [ x:sample_content] o.a.s.u.p.LogUpdateProcessorFactory [sample_content] webapp=/solr path=/dataimport

Re: AnalyzingInfixSuggester performance

2017-04-18 Thread Michael McCandless
AnalyzingInfixSuggester uses index-time sort, to sort all postings by the suggest weight, so that lookup, as long as your sort by the suggest weight is extremely fast. But if you need to rank at lookup time by something not "congruent" with the index-time sort then you lose that benefit. Mike

Re: Solr Child="true" flag in version 6.4

2017-04-18 Thread Alexandre Rafalovitch
You say you are trying to use child=true, but the definition you gave does not actually have one. Is it possible you tested with it once, but then did not keep it for later tests accidentally? Also, if that`s your first time working with nested entities, the query returns parents and children all

Re: Solr Child="true" flag in version 6.4

2017-04-18 Thread donjose
Verbose debug output { "responseHeader": { "status": 0, "QTime": 109 }, "initArgs": [ "defaults", [ "config", "data-config.xml" ] ], "command": "full-import", "mode": "debug", "documents": [ { "category": [ "Shirt" ],

Re: Filter Facet Query

2017-04-18 Thread Alexandre Rafalovitch
Are you saying that all the values in the facet are zero with that query? The query you gave seems to be the super-basic faceting code, so maybe something super-basic is missing. E.g. *) Did you check that the documents you get back actually have any values in that field to facet on? *) Did you

Re: Filter Facet Query

2017-04-18 Thread Furkan KAMACI
OK, it returns 0 results every time. So, I want to filter out research values with empty string ("") from facet result. How can I do that? On Tue, Apr 18, 2017 at 8:53 AM, Furkan KAMACI wrote: > First problem is they do not match with main query. > > 18 Nis 2017 Sal,

Re: Solr Child="true" flag in version 6.4

2017-04-18 Thread Mikhail Khludnev
Hello, This is puzzling. Are you sure you have recent DIH jar at that core? Sometimes the old one can remain at lib directory. One odd thing in the config is that category values are not limited with something like WHERE categoryvalues.categoryid=${category.id} Can you share verbose debug output?

Re: Getting error while excuting full import

2017-04-18 Thread ankur.168
Hi Erick, Thanks for replying, As you suggest I can use solrJ to map RDBMS fetched data and index/search it later on. but DIH gives multi db connection for full import and other benefits. Does solrJ supports this or we need to put efforts to make a multithreaded connection pool similar to DIH?