change default id in results clustering

2016-02-18 Thread Dmitry Kan
Hi, is it possible to change the id field, which defaults to 'id', in Carrot2-based result clustering? I have another field, 'externalId', that is stamped on each document, and I would like to return it in clusters instead. -- Dmitry Kan Luke Toolbox: http://github.com/DmitryKey/luke Blog:

Re: Facet Filter

2016-02-18 Thread Anil
Thanks Shawn. This really helps. We are using 4.10.3 now and will look into 5.4.1. Thanks. Regards, Anil On 18 February 2016 at 20:04, Shawn Heisey wrote: > On 2/18/2016 7:12 AM, Anil wrote: > > Thank you, i just checked in 5.1. > > > > as facet fields has to be Strings and

Re: Do all SolrCloud nodes communicate with the database when indexing a collection?

2016-02-18 Thread Anshum Gupta
I'd suggest using CloudSolrClient. It uses ConcurrentUpdateSolrClient under the hood and is zk aware so it would route the documents from the Client to your Solr nodes correctly, saving you an extra hop. Another thing to remember here is to reuse the Solr client as it is thread-safe. Reading up

Re: Do all SolrCloud nodes communicate with the database when indexing a collection?

2016-02-18 Thread Shawn Heisey
On 2/18/2016 3:08 PM, Colin Freas wrote: > Thanks for the info, Anshum. > > Writing up a SolrJ program to do this is entirely within my wheelhouse. > > Read through some of the SolrJ docs and found some examples to start. > > A handful of questions if anyone has some pointers. > > 1. From a

Re: SOLR ranking

2016-02-18 Thread Binoy Dalal
Hi Alessandro, Don't get me wrong. Using mm, ps and pf can and absolutely will solve his problem. Like I said above, my solution is meant to be a quick and dirty fix. It's really not that complex and shouldn't take more than an hour to set up at the app level. Moreover, I suggested it because he

RE: Hitting complex multilevel pivot queries in solr

2016-02-18 Thread Lewin Joy (TMS)
Hi, the fields are single-valued, but the requirement applies at query time rather than index time, because we will have many such scenarios with different fields. I hoped we could concatenate at query time. I just need the top 100 counts from the leaf level of the pivot. I'm also

Re: SOLR ranking

2016-02-18 Thread Alessandro Benedetti
Hey Binoy, to be honest I can't understand why such complexity is needed :/ Can you explain why playing with edismax mm (the percentage of query terms you want to be in the results), pf (the fields you want to be boosted if the phrase matches) and ps (the slop to allow) should not solve the problem instead of the
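
For readers following this thread, here is a minimal sketch of how the three edismax parameters mentioned above could be combined into one request. The field names (`title`, `description`), the boost, and the values chosen for `mm`, `pf` and `ps` are assumptions made for illustration; they do not come from the original messages.

```python
from urllib.parse import urlencode, parse_qs


def build_edismax_params(user_query, mm="75%", pf="title^10", ps=2):
    """Return a URL-encoded parameter string for an edismax request."""
    params = {
        "q": user_query,
        "defType": "edismax",
        "qf": "title description",  # hypothetical fields to search
        "mm": mm,                   # minimum share of query terms that must match
        "pf": pf,                   # hypothetical field boosted on phrase matches
        "ps": str(ps),              # slop allowed for the phrase boost
    }
    return urlencode(params)


query_string = build_edismax_params("solr ranking example")
decoded = parse_qs(query_string)
print(decoded["mm"], decoded["pf"], decoded["ps"])
```

The resulting string would be appended to a select URL; which values actually rank phrase matches first depends on the data, so they should be tuned against the debug output.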

SolrCloud shards marked as down and do not recover connection to zk

2016-02-18 Thread KNitin
Hi, I am indexing about 5M docs in a 4 shard and 1 replica setup. During indexing, one of the shards is marked as down in ZooKeeper, but when I tail the logs all the updates are received by the shard, and a hard commit at the end of the job also succeeds. (The auto commit is set to trigger every

Re: Hitting complex multilevel pivot queries in solr

2016-02-18 Thread Alvaro Cabrerizo
Hi, The idea of copying fields into a new one (or several) during indexing and then faceting on the new field (or fields) looks promising. More information about the data would be helpful (for example, whether the fields country, state, ... are single- or multivalued). For example if all of the fields are single
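
As a rough illustration of the index-time idea described above, the sketch below concatenates single-valued fields into one combined field on the client before indexing and then counts the combined values locally. The field name `pivot_path`, the separator, the `city` field and the sample documents are all hypothetical; in Solr itself the counting would be done by faceting on the combined field rather than in Python.

```python
from collections import Counter


def add_path_field(doc, fields, sep="/"):
    """Return a copy of doc with the given fields joined into 'pivot_path'."""
    enriched = dict(doc)
    enriched["pivot_path"] = sep.join(str(doc[name]) for name in fields)
    return enriched


# Hypothetical single-valued documents.
docs = [
    {"id": "1", "country": "US", "state": "CA", "city": "LA"},
    {"id": "2", "country": "US", "state": "CA", "city": "LA"},
    {"id": "3", "country": "US", "state": "NY", "city": "NYC"},
]

enriched_docs = [add_path_field(d, ["country", "state", "city"]) for d in docs]

# Local stand-in for faceting on the combined field: top 100 leaf-level counts.
top_paths = Counter(d["pivot_path"] for d in enriched_docs).most_common(100)
print(top_paths)
```

This only works cleanly when every source field is single-valued, which is why that question matters in the thread.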

Re: How to use DocValues with TextField

2016-02-18 Thread Harry Yoo
Thanks for the pointer. Please advise me how I can contribute. H > On Jan 27, 2016, at 2:16 AM, Toke Eskildsen wrote: > > Erick Erickson wrote: >> DocValues was designed to support unanalyzed types >> originally. I don't know that code, but

Re: How to use DocValues with TextField

2016-02-18 Thread Harry Yoo
RE: separating a column into two for different behavior. Yes, that is exactly what I have been advised multiple times. However, it causes a problem when I apply it to my application. I have one core in which more than 50 columns (out of 100) need to be searched by case-insensitive and partial

Re: Do all SolrCloud nodes communicate with the database when indexing a collection?

2016-02-18 Thread Colin Freas
Thanks for the info, Anshum. Writing up a SolrJ program to do this is entirely within my wheelhouse. Read through some of the SolrJ docs and found some examples to start. A handful of questions if anyone has some pointers. 1. From a performance perspective, is it worth it to use

Re: Display entire string containing query string

2016-02-18 Thread Alvaro Cabrerizo
Hi, To understand Binoy's answer, please check the documentation for the fl (Field List) parameter. If you want to include: "*fragments of documents that match the user's query to be included

RE: Hitting complex multilevel pivot queries in solr

2016-02-18 Thread Lewin Joy (TMS)
Still splitting my head over this one. Let me know if anyone has any idea I could try. Or, is there a way to concatenate these 4 fields onto a dynamic field and do a facet.field on top of this one? Thanks. Any idea is helpful to try. -Lewin -Original Message- From: Lewin Joy (TMS)

Re: Filter query and Faceting problem

2016-02-18 Thread Filippo La Torre
Thank you Alessandro for the suggestion (I have to look into SolrJ), and thank you Mikhail for the explanation. My problem isn't related to that simple query and it doesn't depend on the Java framework. Maybe I'm not smart enough, but if I have this query: WHERE (macro_category = DRINKS AND

Re: Display entire string containing query string

2016-02-18 Thread Tom Running
Hello, Thank you for your reply. I am wondering if you can clarify a bit more for me. Is field_where_string_may_be_present something that I have to specify? I am searching an HTML page. For example, if I search for the word "name" I am trying to display the entire sentence containing "name = T" or

Re: SOLR ranking

2016-02-18 Thread Binoy Dalal
Here's an alternative solution that may be of some help. Here I'm assuming that you are not directly outputting the search results to the user and have some sort of layer between the results from solr and presentation to the user where some additional processing can be performed. 1) You already

Re: Reverse Engineer Query For a Given Result Set?

2016-02-18 Thread Jack Krupansky
Out of the box? No. Could you develop one? Probably, or at least a rough approximation, at least some of the time... but probably at a cost significantly greater than converting queries by hand. If it is taking you 2-4 hours per query then that suggests that the query complexity is not amenable

Re: Filter query and Faceting problem

2016-02-18 Thread Alessandro Benedetti
As Mikhail suggests, are you sure Spring Data Solr is the right tool for you? It is probably a great tool for a newbie (to be honest, I just noticed it right now), but maybe you need a more customisable approach to build tag/exclusion filter queries on top of facets. You could use spring data but

Re: Facet Filter

2016-02-18 Thread Shawn Heisey
On 2/18/2016 7:12 AM, Anil wrote: > Thank you, i just checked in 5.1. > > as facet fields has to be Strings and cannot be tokenized. is there any > way to search on case insensitive search on this field (not in a facet > filter scenario). If you configure docValues on the field in schema.xml and

Re: Unable to query the spellchecker in a distributed way

2016-02-18 Thread Damien Picard
I got it: in Solr 4.4, the component org.apache.solr.handler.component.SpellCheckComponent didn't implement the method distributedProcess(ResponseBuilder rb), which org.apache.solr.handler.component.SearchHandler needs in order to handle distributed searches the right way. And it seems that with

Re: Error creating document SolrInputDocument

2016-02-18 Thread Shawn Heisey
On 2/18/2016 7:07 AM, Bernd Fehling wrote: > the DIH is doing the splitting: > > ... > > xpath="/documents/document/element[@name='dccreator']/value" /> > > ... This DIH config says it's the "dccreator" field, but the schema.xml excerpts you included earlier were the "creator" field. Can you

Re: Facet Filter

2016-02-18 Thread Anil
Thank you, I just checked in 5.1. As facet fields have to be Strings and cannot be tokenized, is there any way to do a case-insensitive search on this field (not in a facet filter scenario)? Regards, Anil On 18 February 2016 at 17:34, Upayavira wrote: > facet.contains=

Re: Error creating document SolrInputDocument

2016-02-18 Thread Bernd Fehling
Hi Shawn, the DIH is doing the splitting: ... ... Bernd Am 18.02.2016 um 14:42 schrieb Shawn Heisey: > On 2/18/2016 3:45 AM, Bernd Fehling wrote: >> Now this is strange with solr 4.10.4, >> I have a multivalue string field for creator. >> > multiValued="true" /> >> >> And a multivalue

Re: Error creating document SolrInputDocument

2016-02-18 Thread Shawn Heisey
On 2/18/2016 3:45 AM, Bernd Fehling wrote: > Now this is strange with solr 4.10.4, > I have a multivalue string field for creator. > multiValued="true" /> > > And a multivalue string field for f_person, prepared for facetting with > docValues. > multiValued="true" docValues="true" /> > > To

Re: Filter query and Faceting problem

2016-02-18 Thread Mikhail Khludnev
Tagging works only in the way I describe. Otherwise you might need to issue separate queries. However, are you sure that this case shouldn't be done in a more regular way: fq={!tag=foo}department:foods={!tag=foo}macro_category.key:(drinks food) On Thu, Feb 18, 2016 at 1:48 PM, Filippo La Torre

Re: Facet Filter

2016-02-18 Thread Upayavira
facet.contains= Beware that it is relatively new, so will only be in the latest few Solr releases. I think this was it [1], which suggests it is in 5.1+ Upayavira [1] https://issues.apache.org/jira/browse/SOLR-1387 On Thu, Feb 18, 2016, at 10:38 AM, Anil wrote: > HI , > > Following are the
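
A minimal sketch of a request using `facet.contains`, for anyone trying the suggestion above. The field name `status` is an assumption (the thread does not name the field), and `PROCESS` is taken from the example in the original question; the optional `facet.contains.ignoreCase` flag is shown as one way to make the substring match case-insensitive on Solr versions that support it.

```python
from urllib.parse import urlencode, parse_qsl


def build_facet_contains_params(field, substring, ignore_case=False):
    """Return URL-encoded facet parameters restricted by facet.contains."""
    params = [
        ("q", "*:*"),
        ("rows", "0"),                  # only facet counts are wanted
        ("facet", "true"),
        ("facet.field", field),         # hypothetical string field
        ("facet.contains", substring),  # keep facet values containing this text
    ]
    if ignore_case:
        params.append(("facet.contains.ignoreCase", "true"))
    return urlencode(params)


query_string = build_facet_contains_params("status", "PROCESS", ignore_case=True)
print(parse_qsl(query_string))
```

With the facet values listed in the question, a request like this would be expected to keep only the values that contain PROCESS, but that should be verified against the Solr version in use.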

Re: SOLR ranking

2016-02-18 Thread Emir Arnautovic
Hi Nitin, Can you send us what your parsed query looks like (from the debug output)? Thanks, Emir On 17.02.2016 08:38, Nitin.K wrote: Hi Binoy, We are searching for both phrases and individual words but we want that only those documents which are having phrases will come first in the order and

Re: Filter query and Faceting problem

2016-02-18 Thread Filippo La Torre
Hi, thanks for your response. The problem is that I build this query with Spring Data Solr and I have to express complex AND/OR conditions. Example: (department:foods AND macro_category.key:drinks) OR (department:foods AND macro_category.key:food) What is the best practice to make complex AND/OR queries and

Error creating document SolrInputDocument

2016-02-18 Thread Bernd Fehling
Now this is strange with solr 4.10.4, I have a multivalue string field for creator. And a multivalue string field for f_person, prepared for facetting with docValues. To fill f_person I use copyField. The input to creator is 43470 bytes long with names, split at ";" for each subfield.

Facet Filter

2016-02-18 Thread Anil
Hi, Following are the facets in my use case: CLOSED, IN PROCESS, RE PROCESS, OPEN. I know facet.prefix returns the facets starting with it. I just want to check whether any facet parameter exists in current Solr to return facets matching any word in the facet text. Ex: PROCESS must

Re: Filter query and Faceting problem

2016-02-18 Thread Mikhail Khludnev
just do fq={!tag=DEPARTMENT}department:foods={!tag=MACROCATEGORY}macro_category.key:drinks tagging in the middle of the query may somehow work, but it's not a recommended way. On Thu, Feb 18, 2016 at 11:48 AM, Filippo La Torre < filippo.lato...@stentle.com> wrote: > Hello everyone, > > this is
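
A hedged sketch of the tagging approach suggested above: each filter goes into its own `fq` parameter with a `{!tag=...}` local parameter, and a facet can then exclude a tagged filter with `{!ex=...}`. That the two tagged clauses in the message were meant as separate `fq` parameters is an assumption, as is the choice of facet fields.

```python
from urllib.parse import urlencode, parse_qsl


def build_tagged_filter_params(filters, excluded_facets):
    """Build one tagged fq per filter and one facet.field per tag exclusion."""
    params = [("q", "*:*"), ("facet", "true")]
    for tag, clause in filters:
        # Produces e.g. {!tag=DEPARTMENT}department:foods
        params.append(("fq", f"{{!tag={tag}}}{clause}"))
    for tag, field in excluded_facets:
        # Produces e.g. {!ex=DEPARTMENT}department
        params.append(("facet.field", f"{{!ex={tag}}}{field}"))
    return params


params = build_tagged_filter_params(
    filters=[
        ("DEPARTMENT", "department:foods"),
        ("MACROCATEGORY", "macro_category.key:drinks"),
    ],
    excluded_facets=[
        ("DEPARTMENT", "department"),
        ("MACROCATEGORY", "macro_category.key"),
    ],
)
query_string = urlencode(params)
print(parse_qsl(query_string))
```

Keeping each tag at the start of its own `fq` matches the advice that tagging in the middle of a query is not recommended.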

Re: Reverse Engineer Query For a Given Result Set?

2016-02-18 Thread Charlie Hull
On 18/02/2016 09:07, Christian Effertz wrote: Hi, Can I somehow feed Solr with a result set or a list of primary keys and get the shortest query that leads to this result? In other terms, can I reverse engineer a query for a given result set? Some background why I ask this question: We are

Re: join and NOT together

2016-02-18 Thread Sergio García Maroto
Hi Mikhail. Sorry for all the confusion. This is the original query which doesn't work: q=PersonName:peter AND {!type=join from=DocPersonID to=PersonID fromIndex=document v='(*:* -DocType:pdf)' } I figured out that negating outside the cross join query does the trick for me. I take the negation

Re: Field exclusion from fl and hl.fl

2016-02-18 Thread Anil
I am looking for the same. Please let me know if you find a workaround. On 18 February 2016 at 14:18, Zheng Lin Edwin Yeo wrote: > Hi, > > Would like to find out, is there already a way to exclude field from the > Solr response. I did came across SOLR-3191 which

Reverse Engineer Query For a Given Result Set?

2016-02-18 Thread Christian Effertz
Hi, Can I somehow feed Solr with a result set or a list of primary keys and get the shortest query that leads to this result? In other terms, can I reverse engineer a query for a given result set? Some background why I ask this question: We are currently migrating a search application from

Filter query and Faceting problem

2016-02-18 Thread Filippo La Torre
Hello everyone, this is my first mail to the Solr user mailing list. I'm new to Solr too; my Solr version is 5.4.1. I have a problem with filter queries and faceting: I have to make a filter query with AND/OR while also using faceting (I will make this query using Spring Data Solr). It seems that the same

Field exclusion from fl and hl.fl

2016-02-18 Thread Zheng Lin Edwin Yeo
Hi, I would like to find out whether there is already a way to exclude fields from the Solr response. I did come across SOLR-3191, which was created about 4 years ago, but could not find any workable solutions there. As my collections can have more than 100 fields, and I would need to return the

Does hl.fl show fields with type int, tdate

2016-02-18 Thread Zheng Lin Edwin Yeo
Hi, In my configuration, I have fields that are of type "string", "int" and "tdate". However, when I tried to do highlighting by setting fl=* and hl.fl=*, all the fields are returned in the "fl" parameters, but only those of type=string are returned in the "hl.fl" parameters. Is this correct

Re: Querying data based on field type

2016-02-18 Thread Binoy Dalal
My apologies. I thought you wanted to remove all the arr values. On Thu, 18 Feb 2016, 13:55 Salman Ansari wrote: > Not sure if I am getting this but I am not interested in updating > documents. I am interested in getting documents that has the field type of > a specific

Re: Querying data based on field type

2016-02-18 Thread Salman Ansari
Not sure if I am getting this, but I am not interested in updating documents. I am interested in getting documents that have the field type of a specific field as array. Regards, Salman On Thu, Feb 18, 2016 at 11:13 AM, Binoy Dalal wrote: > Take a look at atomic updates

Re: Querying data based on field type

2016-02-18 Thread Binoy Dalal
Take a look at atomic updates and remove regex. https://cwiki.apache.org/confluence/display/solr/Updating+Parts+of+Documents On Thu, 18 Feb 2016, 13:07 Salman Ansari wrote: > Hi, > > Due to some mis-configuration issues, I have a field that has values as > single string