Re: export with collapse filter runs into NPE

2016-06-10 Thread Joel Bernstein
Yeah, it sounds like we've got two good bugs here. Feel free to create jira tickets for them, I don't believe they've been created yet. It would be good to get these fixed for the next release. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Jun 10, 2016 at 7:25 PM, Susmit Shukla

Re: Issues with coordinates in Solr during updating of fields

2016-06-10 Thread Zheng Lin Edwin Yeo
Would like to check, what is the use of the gps_0_coordinate and gps_1_coordinate field then? Is it just to store the data points, or does it have any other use? When I do the query, I found that we are only querying the gps_field, which is something like this:

Re: export with collapse filter runs into NPE

2016-06-10 Thread Susmit Shukla
Hi Joel, I would need to join results from 2 solr clouds before collapsing so it would not be an issue right now. I ran into another issue - if data in any of the shards is empty, export throws an error- Once i have atleast one document in each shard, it works fine.

RE: using spell check on phrases

2016-06-10 Thread Dyer, James
Kaveh, If your query has "mm" set to zero or a low value, then you may want to override this when the spellchecker checks possible collations. For example: spellcheck.collateParam.mm=100% You may also want to consider adding "spellcheck.maxResultsForSuggest" to your query, so that it will

RE: Questions regarding re-index when using Solr as a data source

2016-06-10 Thread Hui Liu
Thank you Walter. -Original Message- From: Walter Underwood [mailto:wun...@wunderwood.org] Sent: Friday, June 10, 2016 3:53 PM To: solr-user@lucene.apache.org Subject: Re: Questions regarding re-index when using Solr as a data source Those are brand new features that I have not used, so

Re: OT: is Heliosearch discontinued?

2016-06-10 Thread tedsolr
That's fantastic! Thanks Joel -- View this message in context: http://lucene.472066.n3.nabble.com/OT-is-Heliosearch-discontinued-tp4242345p4281792.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: export with collapse filter runs into NPE

2016-06-10 Thread Joel Bernstein
This sounds like a bug. I'm pretty sure there are no tests that use collapse with the export handler. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Jun 10, 2016 at 3:59 PM, Susmit Shukla wrote: > Hi, > > I'm running this export query, it is working fine. f1 is

Re: OT: is Heliosearch discontinued?

2016-06-10 Thread Joel Bernstein
You can actually find those old articles on https://archive.org/web/. I haven't gone back and collected the writings to repost. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Jun 10, 2016 at 3:31 PM, tedsolr wrote: > There were many great white papers hosted on that

export with collapse filter runs into NPE

2016-06-10 Thread Susmit Shukla
Hi, I'm running this export query, it is working fine. f1 is the uniqueKey and running solr 5.3.1 /export?q=f1:term1=f1+desc=f1,f2 if I add collapsing filter, it is giving NullPointerException /export?q=f1:term1=f1+desc=f1,f2={!collapse field=f2} does collapsing filter work with /export

Re: Questions regarding re-index when using Solr as a data source

2016-06-10 Thread Walter Underwood
Those are brand new features that I have not used, so I can’t comment on them. But I know they do not make Solr into a database. If you need a transactional database that can support search, you probably want MarkLogic. I worked at MarkLogic for a couple of years. In some ways, MarkLogic is

Re: Simulate doc linking via post filter cache check

2016-06-10 Thread tedsolr
The terms component will not work for me because it holds on to terms from deleted documents. My indexes are too volatile. I could perform a search for every match - but that would not perform. Maybe I need something that can compare two searches. Anyone know of an existing filter component does

RE: Questions regarding re-index when using Solr as a data source

2016-06-10 Thread Hui Liu
What if we plan to use Solr version 6.x? this url says it support 2 different update modes: atomic update and optimistic concurrency: https://cwiki.apache.org/confluence/display/solr/Updating+Parts+of+Documents I tested 'optimistic concurrency' and it appears to be working, i.e if a document I

Re: Re-create shard with compositeId router and known hash range

2016-06-10 Thread Henrik Brautaset Aronsen
On Fri, Jun 10, 2016 at 6:18 PM, Erick Erickson wrote: > Well, how brave do you want to be ;)? Hi Erick, thanks for your reply! > There's no great magic to the > Zookeeper nodes here. If you do everything just right you could create > one manually. By that I mean you

Re: OT: is Heliosearch discontinued?

2016-06-10 Thread tedsolr
There were many great white papers hosted on that old site. Does anyone know if they were moved? I've got lots of broken links - I wish I could get to that reference material. -- View this message in context:

Re: Solr Schema for same field names within different input entities

2016-06-10 Thread Aniruddh Sharma
Thanks a lot Eric. Thanks and Regards Aniruddh On Fri, Jun 10, 2016 at 12:25 PM, Erick Erickson wrote: > Usually people put an application layer between the Business User and > the actual query to form complex Solr queries that "do the right > thing". Unfortunately

Re: Solr Schema for same field names within different input entities

2016-06-10 Thread Erick Erickson
Usually people put an application layer between the Business User and the actual query to form complex Solr queries that "do the right thing". Unfortunately there's no good automated ways to do this that I know of as each app has its own set of peculiarities. Best, Erick On Wed, Jun 8, 2016 at

Re: Query exact match with ASCIIFoldingFilterFactory

2016-06-10 Thread Erick Erickson
What query are you using? From what you've shown, the exact match should work. Perhaps use a phrase query? And while the analyzer is very cool, it has its limitations, particularly it doesn't show the interactions with the _parser_. So add to the URL and look at the parsed_query bits of the

Bug in ExtractingRequestHandler

2016-06-10 Thread Gilbert Boyreau
Hello, I think there's a bug in the |ExtractingRequestHandler|Handler (Tika parser). Some tika's exception are not catch, and the handler return a 0 status, indicating no problem's with that content. I give a look at the code (Solr 5.1, ExtractingDocumentLoader:221), only TikaException are

Re: Re-create shard with compositeId router and known hash range

2016-06-10 Thread Erick Erickson
Well, how brave do you want to be ;)? There's no great magic to the Zookeeper nodes here. If you do everything just right you could create one manually. By that I mean you could "hand edit" the znode with the Zookeeper commands, you'd have to dig for the exact commands You _may_ be able to use

Re: Questions regarding re-index when using Solr as a data source

2016-06-10 Thread Walter Underwood
Solr does not have transactions at all. The “commit” is really “submit batch”. Solr does not have update. You can add, delete, or replace an entire document. There is no optimistic concurrency control because there is no concurrency control. Clients can concurrently add documents to a batch,

RE: Questions regarding re-index when using Solr as a data source

2016-06-10 Thread Hui Liu
Walter, Thank you for your advice. We are new to Solr and have been using Oracle for past 10+ years, so we are used to the idea of having a tool that can be used as both data store and also searchable by having indexes on top of it. I guess the reason we are considering Solr as data

Re: Scoring changes between 4.10 and 5.5

2016-06-10 Thread Upayavira
Tracked it down to this ticket: https://issues.apache.org/jira/browse/LUCENE-6590 which changed the implementation of normalize() in org.apache.lucene.search.similarities.TFIDFSimilarity. I've asked for comment on that ticket. Upayavira On Fri, 10 Jun 2016, at 01:39 AM, Ahmet Arslan wrote: >

Re: Bypassing ExtractingRequestHandler

2016-06-10 Thread Charlie Hull
On 10/06/2016 02:20, Justin Lee wrote: Has anybody had any experience bypassing ExtractingRequestHandler and simply managing Tika manually? I want to make a small modification to Tika to get and save additional data from my PDFs, but I have been procrastinating in no small part due to the

Re: Question about multiple fq parameters

2016-06-10 Thread Mikhail Khludnev
Ahmet, Honestly I don't know, but googling gives: More DateRangeField Details https://cwiki.apache.org/confluence/display/solr/Working+with+Dates On Fri, Jun 10, 2016 at 3:44 AM, Ahmet Arslan wrote: > Hi Mikhail, > > Can you please explain what this mysterious op

Re: Solutions for Multi-word Synonyms

2016-06-10 Thread Bernd Fehling
As Doug said, you should really try to build your own solution for Multi-word Synonyms because every need is different and you can customize it for your special use case, like adding a Thesaurus. http://www.ub.uni-bielefeld.de/~befehl/base/solr/InsideBase_eurovocThesaurus.html Regards Bernd Am