RE: how can we use multi term search along with stop words

2016-05-27 Thread Preeti Bhat
Hi, I was firing both leading and trailing queries. And the tollc is an example where we have llc which is stopword for me but needs to be retrieved during search. Thanks and Regards, Preeti Bhat Shore Group Associates LLC (C) +91-996-644-8187 www.ShoreGroupAssociates.com -Original

RE: how can we use multi term search along with stop words

2016-05-27 Thread Preeti Bhat
Hi, I was firing both leading and trailing queries. And the tollc is an example where we have llc which is stopword for me but needs to be retrieved during search. Thanks and Regards, Preeti Bhat -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com.INVALID] Sent: Thursday,

Recall: how can we use multi term search along with stop words

2016-05-27 Thread Preeti Bhat
Preeti Bhat would like to recall the message, "how can we use multi term search along with stop words". NOTICE TO RECIPIENTS: This communication may contain confidential and/or privileged information. If you are not the intended recipient (or have received this communication in error) please

RE: Metadata and HTML ending up in searchable text

2016-05-27 Thread Allison, Timothy B.
I'm only minimally familiar with Solr Cell, but... 1) It looks like you aren't setting extractFormat=text. According to [0]...the default is xhtml which will include a bunch of the metadata. 2) is there an attr_* dynamic field in your index with type="ignored"? This would strip out the attr_

OutOfMemoryError does not fire the script

2016-05-27 Thread Pablo Anzorena
Hello, I am using solr 5.2.1 in cloud mode. My jvm arguments for the OutOfMemoryError is -XX:OnOutOfMemoryError='/etc/init.d/solrcloud;restart' In the Solr UI, the event is beign fired, but nothing happens. What am I missing? Regards.

Re: Metadata and HTML ending up in searchable text

2016-05-27 Thread Simon Blandford
Hi Timothy, Thanks for responding. java -jar tika-app-1.13.jar -t "/home/user/Documents/library/UsingMailingLists.txt" ...gives a clean result with no CSS or other nasties in the output. So it looks like the latest version of tika itself is OK. I was basing the test case on this doc page as

Re: Can a DocTransformer access the whole results tree?

2016-05-27 Thread Upayavira
In a JSON response, we get this: { "responseHeader": {...}, "response": { "docs": [...] }, "highlighting": {...} ... } I'm assuming that the getProcessedDocuments call would give me the docs: {} element, whereas I'm after the whole response so I can retrieve the "highlighting" element.

Re: debugging solr query

2016-05-27 Thread Ahmet Arslan
Hi Jay, Please separate the clauses. Feed one of them to the main q parameter with content score operator =^ since you are sorting on a structured field(e.g. date) q:fieldB:(123 OR 456)^=1.0 =dt1:[date1 TO *] =dt2:[* TO NOW/DAY+1] =fieldA:abc =dt1 asc,field2 asc, fieldC desc Play with the

Re: debugging solr query

2016-05-27 Thread Jay Potharaju
I updated almost 1/3 of the data and ran my queries with new columns as mentioned earlier. The query returns data in almost half the time as compared to before. I am thinking that if I update all the columns there would not be much difference in query response time. Are there any

Re: Can a DocTransformer access the whole results tree?

2016-05-27 Thread Upayavira
Yes, I've seen that. I can see the getDocList() method will presumably give me the results themselves, but I need the full response so I can get the highlighting details, but I can't see them anywhere. On Thu, 26 May 2016, at 09:39 PM, Mikhail Khludnev wrote: > public abstract class ResultContext

Fw: SolrCloud and Zookeeper integration issue in .net application

2016-05-27 Thread shivendra.tiwari
Hi, This is my first time asking the question. I am facing some problems in Solr. Could you please help me out. Below is my question: Currently I am using Solr lower version it is working fine but now, we are trying to configure SolrCloud for load balance so, I have configured- 2 Solr nodes

Re: How can Most Popular Search be implemented in Solr?

2016-05-27 Thread Ahmet Arslan
Hi, Solr does not explicitly save incoming/maintain queries. * Some people save queries at the UI side. * Some folks enable Solr logging and then extract useful query, numFound, QTime, etc information from logs: http://soleami.com * Others identify searches that return zero documents (missing

RE: Metadata and HTML ending up in searchable text

2016-05-27 Thread Allison, Timothy B.
Of course, for greater control over indexing (and for more robust handling of exceedingly rare (but real) infinite loops/OOM caused by Tika), consider SolrJ: http://searchhub.org/2012/02/14/indexing-with-solrj/ -Original Message- From: Simon Blandford

Re: Fw: SolrCloud and Zookeeper integration issue in .net application

2016-05-27 Thread Shawn Heisey
On 5/27/2016 5:57 AM, shivendra.tiwari wrote: > Currently I am using Solr lower version it is working fine but now, we are > trying to configure SolrCloud for load balance so, > I have configured- 2 Solr nodes and 1 ZooKeeper node, created collections > and > shards also getting data from SQL

Re: Solr Cloud and Multi-word Synonyms :: synonym_edismax parser

2016-05-27 Thread Steve Rowe
I’m working on addressing problems using multi-term synonyms at query time in Lucene and Solr. I recommend these two blogs for understanding the issues (the second one was mentioned earlier in this thread):

Re: Auto Suggestion in solr

2016-05-27 Thread Mugeesh Husain
Hi Erick, I have followed this article http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/. could yo explain this line of bf product(log(sum(popularity,1)),100)^20. what kind of score, this line should be populated ? I have a similar requirement which he build over this

Re: how can we use multi term search along with stop words

2016-05-27 Thread Walter Underwood
If “llc” is a stopword, it is not in the index and can never be searched. Period. I never use stopwords. They cause odd problems, like not being able to search for “vitamin a”. When I was at Netflix, I discovered that the movie title “Being There” had zero tokens after stemming and

Re: how can we use multi term search along with stop words

2016-05-27 Thread Erick Erickson
And if you want an absolutely beautiful example, try searching for this blog post: https://lucidworks.com/blog/2011/12/28/why-not-and-or-and-not/ Best, Erick On Fri, May 27, 2016 at 8:23 AM, Walter Underwood wrote: > If “llc” is a stopword, it is not in the index and can

Re: Can a DocTransformer access the whole results tree?

2016-05-27 Thread Erick Erickson
Maybe you'd be better off using a custom search component. instead of a doc transformer. The intent of a doc transformer is, as you've discovered, working on single docs at a time. You want to manipulate the whole response which seems to fit more naturally into a search component. Make sure to put

Re: Facet data type

2016-05-27 Thread Erick Erickson
bq: my index size grew by 20%. Is this expected Yes. But don't worry about it ;). Basically, you've serialized to disk the "uninverted" form of the field. But, that is accessed through Lucene by MMapDirectory, see: http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html If you

Re: OutOfMemoryError does not fire the script

2016-05-27 Thread Pablo Anzorena
Perfect, thank you very much. 2016-05-27 12:44 GMT-03:00 Shawn Heisey : > On 5/27/2016 7:05 AM, Pablo Anzorena wrote: > > I am using solr 5.2.1 in cloud mode. My jvm arguments for the > > OutOfMemoryError is > > -XX:OnOutOfMemoryError='/etc/init.d/solrcloud;restart' > > > >

Solr 4.9.1 MoreLikeThis Handler CursorMark

2016-05-27 Thread Esan London
Hi The MoreLikeThis handler doesn't appear to be returning a nextcursor value in solr 4.9.1. Is this by design or something that was changed in the later versions? thanks for the help

Solr vs JDBC driver

2016-05-27 Thread Vachon , Jean-Sébastien
Hi All, I am trying to use Solr’s JDBC driver in Java and I’m stuck with the following error message: 14:52:37,802 ERROR [consoleLogger] java.sql.SQLException: No suitable driver found for jdbc:solr://10.28.213.133:2181/solr?collection=Current My pom.xml contains: org.apache.solr

Re: Facet data type

2016-05-27 Thread Steven White
Thanks Erick. What about Solr defect SOLR-7495 that Nick mentioned? It sounds like because of this defect, I should NOT set docValues="true" on a filed when: a) type="int" and b) multiValued="true". Can you confirm that I got this right? I'm on Solr 5.2.1 Steve On Fri, May 27, 2016 at 1:30

Re: Metadata and HTML ending up in searchable text

2016-05-27 Thread Alexandre Rafalovitch
I think Solr's layer above Tika was merging in metadata and text all together without a way (that I could see) to separate them. That's all I remember of my examination of this issue when I run into something similar. Not very helpful, I know. Regards, Alex. Newsletter and resources for

Re: Not (!) operator

2016-05-27 Thread Alexandre Rafalovitch
If you are worried about performance, bake the present/absent as a signal in a separate field during the document processing as a special UpdateRequestProcessor sequence. Regards, Alex. Newsletter and resources for Solr beginners and intermediates: http://www.solr-start.com/ On 27 May

Re: Solr vs JDBC driver

2016-05-27 Thread Vachon , Jean-Sébastien
Never mindŠ I had to load the class just like any database driver: Class.forName("org.apache.solr.client.solrj.io.sql.DriverImpl").newInstance (); On 2016-05-27, 2:59 PM, "Vachon, Jean-Sébastien" wrote: >Hi All, > > > >I am trying to use Solr¹s JDBC driver in Java

Re: Solr Cloud and Multi-word Synonyms :: synonym_edismax parser

2016-05-27 Thread John Bickerstaff
Thank you Steve -- very helpful. I can see that whatever implementation I decide to try, some testing will be in order. If anyone is aware of significant gotchas with this synonym thing that are not mentioned in the already-listed URLs, please feel free to comment. On Fri, May 27, 2016 at 10:28

Re: debugging solr query

2016-05-27 Thread Jay Potharaju
Thanks for the suggestion. At this time I wont be able to change any code in the API ...my options are limited to changing things at the solr level. Any suggestions regarding solr settings in config or schema changes are something in my control. On Fri, May 27, 2016 at 7:03 AM, Ahmet Arslan

Re: How can Most Popular Search be implemented in Solr?

2016-05-27 Thread Alexandre Rafalovitch
A think to watch out for is also that some of the parameters are defined Solr-side in the request handler. So, a server-level log will not capture all the parameters. Only something that extracts it from Solr after request handler defaults/overrides are applied. This may not matter for just

Slow date filter query

2016-05-27 Thread Jay Potharaju
Hi, I am running filter query(range query) on date fields(high cardinality) and the performance is really bad ...it takes about 2-5 seconds for it to come back with response. I am rebuilding the index to have docvalues & tdates instead of "date" field. But not sure if that will alleviate the

Re: OutOfMemoryError does not fire the script

2016-05-27 Thread Shawn Heisey
On 5/27/2016 7:05 AM, Pablo Anzorena wrote: > I am using solr 5.2.1 in cloud mode. My jvm arguments for the > OutOfMemoryError is > -XX:OnOutOfMemoryError='/etc/init.d/solrcloud;restart' > > In the Solr UI, the event is beign fired, but nothing happens. In all versions before 5.5.1, that -XX

How can Most Popular Search be implemented in Solr?

2016-05-27 Thread Syedabbasmehdi Rizvi
Hi, Below is my question: I want to implement Most Popular search in Solr. Is there any OOTB functionality in Solr that can achieve this? I have had a good look in StatsComponent as well as

How can Most Popular Search be implemented in Solr?

2016-05-27 Thread Syedabbasmehdi Rizvi
Hi, This is my first time asking the question. I am facing some problems in Solr. Could you please help me out. Below is my question: I want to implement Most Popular search in Solr. Is there any OOTB functionality in Solr that can achieve this? I have had a good look in

Re: Not (!) operator

2016-05-27 Thread Shawn Heisey
On 5/26/2016 11:13 PM, Anil wrote: > We have status text field in our solr document and it is optional. > search query status: !Closed returning documents with no status as > well. how to get only documents having status and it is !Closed ? one > way is status:* AND status:!Closed . any other way

Re: How to save index data to other place? [scottchu]

2016-05-27 Thread Shawn Heisey
On 5/26/2016 8:51 PM, scott.chu wrote: > I want to migrate my Solrcloud from Windows to CentOS. Because I am > new to CentOS, not familiar with how to install Solr on it and I did a > lot of config in my Solrcloud on Windows, so I use ftp to upload > solr-5.4.1 and zookeeper-3.4.6 folders to 3

Re: Not (!) operator

2016-05-27 Thread Anil
Hi Shawn, Thanks for reply. i am also worried wither performance. I will check if there is another way to design the documents in case of parent and child relationship. Regards, Anil On 27 May 2016 at 12:39, Shawn Heisey wrote: > On 5/26/2016 11:13 PM, Anil wrote: > > We

Re: How can Most Popular Search be implemented in Solr?

2016-05-27 Thread Shawn Heisey
On 5/26/2016 11:49 PM, Syedabbasmehdi Rizvi wrote: > I want to implement Most Popular search in Solr. Is there any OOTB > functionality in Solr that can achieve this? > I have had a good look in > StatsComponent as well as >

SolrCloud and Zookeeper integration issue in .net application

2016-05-27 Thread shivendra.tiwari
Hi, Currently I am using Solr lower version it is working fine but now, we are trying to configure SolrCloud for load balance so, I have configured- 2 Solr nodes and 1 ZooKeeper node, created collections and shards also getting data from SQL server on Solr but i need to call SolrCloud in .net

Re: Facet data type

2016-05-27 Thread Nick D
Steven, The case that I was pointing to was specifically talking about the need for a int to be set to multivalued=true for the field to be used as a facet.field. I personally ran into it when upgrading to 5.x from 4.10.2. I believe setting docValues=true will not have an affect (untested by me