Activate Fuzzy Queries for each term by default

2016-05-30 Thread Sebastian Landwehr
Hi there, I got a question regarding fuzzy queries: I know that I can create a fuzzy query by appending a „~" with the maximal edit distance to a word. Is it also possible to automatically create a fuzzy query for each search term? I know that I could theoretically append the „~"

Re: Recommended api/lib to search Solr using PHP

2016-05-30 Thread Georg Sorst
We've had good experiences with Solarium, so it's probably worth spending some time in getting it to run. scott.chu schrieb am Mo., 30. Mai 2016 um 09:30 Uhr: > > We have two legacy in-house applications written in PHP 5.2.6 and 5.5.3. > Our engineers currently just use

Re: Activate Fuzzy Queries for each term by default

2016-05-30 Thread Georg Sorst
AFAIK this is not possible, but it probably doesn't make so much sense either. In my experience fuzzy search should be explicit to the user (Google does a pretty good job at this, eg. "Did you mean" etc.). What are you trying to achieve and what results do you want to return? Sebastian Landwehr

Recommended api/lib to search Solr using PHP

2016-05-30 Thread scott.chu
We have two legacy in-house applications written in PHP 5.2.6 and 5.5.3. Our engineers currently just use fopen with url to search Solr but it's kinda unenough when we want to do more advanced, complex queries. We've tried to use something called 'Solarium' but its installtion steps has

Re: Solr 6 CDCR does not work

2016-05-30 Thread Renaud Delbru
Hi Adam, could you check the response of the monitoring commands [1], QUEUES, ERRORS, OPS. This might help undeerstanding if documents are flowing or if there are issues. Also, do you have an autocommit configured on the target ? CDCR does not replicate commit, and therefore you have to

Re: Activate Fuzzy Queries for each term by default

2016-05-30 Thread Sebastian Landwehr
The „Did you mean“ thing is more the spell checker, which I already included. Fuzzy Queries are for terms where multiple spellings in fact exist in the index. At least that’s what I’m aiming at. > Am 30.05.2016 um 10:33 schrieb Georg Sorst : > > AFAIK this is not

searching in two indices

2016-05-30 Thread Bernd Fehling
Has anyone experiences with searching in two indices? E.g. having one index with nearly static data (like personal data) and a second index with articles which changes pretty much. A search would then start for articles and from the list of results (e.g. first page, 10 articles) start a sub

Re: After Solr 5.5, mm parameter doesn't work properly

2016-05-30 Thread Jan Høydahl
Hi, This may be related to SOLR-8812, but still different. Please file a JIRA issue for this. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 29. mai 2016 kl. 18.20 skrev Issei Nishigata : > > Hi, > > “mm" parameter does not work properly,

Re: searching in two indices

2016-05-30 Thread John Blythe
We had previously done something of the sort. With some sources of truth type of cores we would do initial searches on customer transaction data before fetching the related information from those "truth" tables. We would use the various pertinent fields from results #1 to find related data in

Re: searching in two indices

2016-05-30 Thread Bernd Fehling
Thanks for sharing your solution and experience. I'm just thinking about to load all article data (100 mio.) and all personal data (4 mio.) into one core with a selector field "db" containing either "article" or "pdata". But still not really satisfied with this solution. Anyway, MySQL is a good

Re: Boost(bf) function in solr

2016-05-30 Thread Doug Turnbull
Let's say you're building search for your blog. If popularity is say number of page views, than a handful might have a million (they made it to hacker news and slashdot). A few dozen may have hundreds of thousand (they only made it to slashdot). The vast majority might have less than 100 page

Boost(bf) function in solr

2016-05-30 Thread Mugeesh Husain
Hi, could any one explain me why people use log function for boosting like below product(log(sum(popularity,wiegh),100)^20 what is log function please elaborate it ? -- View this message in context: http://lucene.472066.n3.nabble.com/Boost-bf-function-in-solr-tp4279792.html Sent from the Solr

SQL Interface vs geofilter (radius search)

2016-05-30 Thread Vachon , Jean-Sébastien
Hi All, Does the SQL interface allow searching around a specific lat/long coordinates for all documents within a radius of 50 kilometers? If so, what is the syntax to perform such a query? Thanks CEB Canada Inc. Registration No: 1781071. Registered office: 199 Bay Street Commerce Court

Re: Recommended api/lib to search Solr using PHP

2016-05-30 Thread Shawn Heisey
On 5/30/2016 1:29 AM, scott.chu wrote: > We have two legacy in-house applications written in PHP 5.2.6 and 5.5.3. Our > engineers currently just use fopen with url to search Solr but it's kinda > unenough when we want to do more advanced, complex queries. We've tried to > use something called

Re: Recommended api/lib to search Solr using PHP

2016-05-30 Thread John Blythe
we also use Solarium. the documentation is pretty spotty in some cases (tho they've recently updated it, or at least the formatting, which seems to be a move in the right direction), but overall pretty simple to use. some good plugins at hand to help extend the base power, too. i'd say give it a

Re: Solr Cloud and Multi-word Synonyms :: synonym_edismax parser

2016-05-30 Thread MaryJo Sminkey
This is a very timely discussion for me as well as we're trying to tackle the multi term synonym issue as well and have not been able to hon-lucene plugin to work, the jar shows up as installed but when we set up the sample request handler it throws this error:

Re: Solr Cloud and Multi-word Synonyms :: synonym_edismax parser

2016-05-30 Thread MaryJo Sminkey
It's been awhile since I installed it so I really can't say. I'm more of a code monkey than a server gal (particularly Linux... I'm amazed I got Solr installed in the first place, LOL!) So I had asked our network guy to look it over recently and see if it looked like I did it okay. He said since

Clarity on Sharding Concepts.

2016-05-30 Thread Siddhartha Singh Sandhu
Hi Community, I need some help understanding some concepts. I have the config on 2 severs: 2 shards each with 1 replica. Hence, on each server I have 1. shard1_replica1 2 . shard2_replica1 Suppose I have 50 documents then, shard1_replica1 + shard2_replica1 = 50 ? or shard2_replica1 = 50 &&

Re: SolrCloud and Zookeeper integration issue in .net application

2016-05-30 Thread Erick Erickson
You'd probably get a more knowledgeable response on the SolrNet user's list. I have no idea the state of that project, the Java client is the one maintained by the Apache Solr project. On a quick look at the Apache SolrNet project, I don't see any activity recently, but I have no clue what the

Re: Cloud Solr 5.3.1 + 6.0.1 cannot delete documents

2016-05-30 Thread Erick Erickson
bq: I checked in the Solr Admin and noticed that the same document resided in both shards on the same node If this means two _different_ shards (as opposed to two replicas in the _same_ shard) showed the document, then that's the proverbial "smoking gun", somehow your setup isn't what you

Re(2): Recommended api/lib to search Solr using PHP

2016-05-30 Thread scott.chu
Thanks, guys! My engineers just find another thing called 'SolrPhpClient'. But I am trying solarium again. It just looks like a well-structured API. (Note: Actually, I've noticed it from very beginning when it's developed but never give it a try.) scott.chu,scott@udngroup.com 2016/5/31

Re: Recommended api/lib to search Solr using PHP

2016-05-30 Thread scott.chu
For those who might have same need to use Solarium, this is the best tutorial I can find by googling, it's actually a chapter in the book "Apache Solr PHP Integration" https://www.packtpub.com/packtlib/book/Big-Data-and-Business-Intelligence/9781782164920/1/ch01lvl1sec13/Installing%20Solarium

Re: Can "Using cp replica and modifying core.properties" rather than ADDREPLICA API work?

2016-05-30 Thread Erick Erickson
Well, that'll work, but you better know _exactly_ what you're doing. For instance, you better not be indexing and have committed before you start your copy. You better make sure your third node is up before you index anything. Etc. Etc. Why do you think this "saves time"? Have you measured? Is

Re(2): Can "Using cp replica and modifying core.properties" rather thanADDREPLICA API work?

2016-05-30 Thread scott.chu
Thanks for your advice, Erick. I think you point out what I didn't think of and a possible side effect in the future. I'll go back to the "normal" way next time I do the same job. scott.chu,scott@udngroup.com 2016/5/31 (週二) - Original Message - From: Erick Erickson To: solr-user

Re: float or string type for a field with whole number and decimal number values?

2016-05-30 Thread Erick Erickson
bq: Should I change the field type to "float" or "string"? I'd go with float. Let's assume you want to sort by this field. 10.00 sorts before 9.0 if you just use Strings. Plus floats are generally much more compact. bq: do I need to delete all documents in the index and do a full

Re: Slow date filter query

2016-05-30 Thread Erick Erickson
That does seem long, but you haven't provided many details about the fields. Are there 100 docs in your index? 100M docs? 500M docs? Are you using NOW in appropriately? See: On Fri, May 27, 2016 at 1:32 PM, Jay Potharaju wrote: > Hi, > I am running filter query(range

Re: Slow date filter query

2016-05-30 Thread Erick Erickson
Oops, fat fingers. see: searchhub.org/2012/02/23/date-math-now-and-filter-queries/ If you're not re-using the _same_ filter query, you'll be better off using fq={!cache=false}range_query Best, Erick On Mon, May 30, 2016 at 8:16 PM, Erick Erickson wrote: > That does

Re: Clarity on Sharding Concepts.

2016-05-30 Thread Mugeesh Husain
Hi, To read out this document https://cwiki.apache.org/confluence/display/solr/Shards+and+Indexing+Data+in+SolrCloud for proper understanding. FYI, you are using implicit router, a document will be divided randomly based on hashing technique. If you indexed 50 documents, it will be divided

Re: Solr Cloud and Multi-word Synonyms :: synonym_edismax parser

2016-05-30 Thread John Bickerstaff
Thanks for the comment Mary Jo... The error loading the class rings a bell - did you find and follow instructions for adding that to the WAR file? I vaguely remember seeing something about that. I'm going to try my own tests on the auto phrasing one.. If I'm successful, I'll post back. On

Re: Boost(bf) function in solr

2016-05-30 Thread Mugeesh Husain
Thanks Doug, that clear my understanding, when i get free, will study your book. -- View this message in context: http://lucene.472066.n3.nabble.com/Boost-bf-function-in-solr-tp4279792p4279860.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Slow date filter query

2016-05-30 Thread Jay Potharaju
There are about 30 Million Docs and the index size is 75 GB. Using a full timestamp value when querying and not using NOW. The fq queries covers almost all the docs(20+ million) in the index. Thanks On Mon, May 30, 2016 at 8:17 PM, Erick Erickson wrote: > Oops, fat

Re: Solr Cloud and Multi-word Synonyms :: synonym_edismax parser

2016-05-30 Thread John Bickerstaff
So I'm looking at the solution mentioned here: https://lucidworks.com/blog/2014/07/12/solution-for-multi-term-synonyms-in-lucenesolr-using-the-auto-phrasing-tokenfilter/ The thing that's troubling me slightly is that the way it's documented it seems to be missing a small but important link...

Re: Recommended api/lib to search Solr using PHP

2016-05-30 Thread Shawn Heisey
On 5/30/2016 12:32 PM, GW wrote: > I would say look at the urls for searches you build in the query tool > > In my case > > http://172.16.0.1:8983/solr/#/products/query > > When you build queries with the Query tool, for example an edismax query, > the URL is there for you to copy. > Use the url

Re: Recommended api/lib to search Solr using PHP

2016-05-30 Thread GW
I would say look at the urls for searches you build in the query tool In my case http://172.16.0.1:8983/solr/#/products/query When you build queries with the Query tool, for example an edismax query, the URL is there for you to copy. Use the url structure with curl in your

Cloud Solr 5.3.1 + 6.0.1 cannot delete documents

2016-05-30 Thread Moritz Becker
Hi,   I have the following issue: I initially started with a Solr 5.3.1 + Zookeeper 3.4.6 cloud setup with 2 solr nodes and with one collection consisting of 2 shards and 2 replicas. I am accessing the cluster using the CloudSolrClient. When I tried to delete a document, no error occurred but