Is there a way to specify word position in solr search query on text fields?

2017-04-25 Thread Sundeep T
Hello, We have a text field in our schema that is indexed using the StandardTokenizerFactory. We have set omitPositions=false, so that positional information of individual tokens is also included in the index data. The question is whether there is a way to construct a query in which we can specify the

Re: Poll: Master-Slave or SolrCloud?

2017-04-25 Thread Walter Underwood
1. I never saw the poll. 2. It looks better than the previous poll, which was poorly worded. I couldn’t answer “yes” or “no”, really. Here is what we have in production. Solr 3: Using every threat I can think of to get the remaining clients off of it. It has been shut down in test for months.

Re: SolrIndexSearcher#getDocList() method returns zero results, if query includes tdate range query

2017-04-25 Thread Victor Solakhian
Hi Chris, I think you are leading me in the right direction. I'm not sure how that would have worked in Solr 4.5, ... unless > perhaps your definition of a "date" field was different in the schema's > you used in 4.5, and did not involve a Trie based date field (the very > old legacy date

Re: Poll: Master-Slave or SolrCloud?

2017-04-25 Thread Otis Gospodnetić
Hi Erick, Could one run *only* embedded ZK on some SolrCloud nodes, sans any data? It would be the equivalent of dedicated Elasticsearch nodes, which is the current ES best practice/recommendation. I've never heard of anyone being scared of running 3 dedicated master ES nodes, so if SolrCloud

Pointless query parsing before distributed processing

2017-04-25 Thread Mikhail Khludnev
Hello, Before distributed requests are submitted, QueryComponent.prepare() is invoked and parses the query, but then that parsed query is just thrown away (probably it appears in debug). It's negligible in most cases until a heavy wildcarded {!complexphrase} query is submitted. It can

Re: Version conflict during data import from another Solr instance into clean Solr

2017-04-25 Thread deansg
Hi, I ran into the same problem. Chris' first solution worked for us, however the second solution on its own doesn't work, as the conflict error arises before the update processors' code is even reached. However, creating an alias for the _version_ field in the dataconfig file, together with an

Re: Atomic Updates

2017-04-25 Thread Chris Ulicny
All fields are being explicitly populated on the initial document load without copyFields, and the atomic updates come after. This situation actually came up while we were planning to remove copyField properties from one of the fields in a schema for a new collection. On Tue, Apr 25, 2017 at 3:54

Re: Poll: Master-Slave or SolrCloud?

2017-04-25 Thread Erick Erickson
bq: I read somewhere that you should run your own ZK externally, and turn off SolrCloud this is a bit confused. "turn off SolrCloud" has nothing to do with running ZK internally or externally. SolrCloud requires ZK, whether internal or external is irrelevant to the term SolrCloud. On to running

Re: Troubleshooting solr errors

2017-04-25 Thread Erick Erickson
Solr likes memory. A lot. Even 2G is quite small by recent installations I have seen. There is an "oom killer script" that can be specified to kill Solr if it gets an OOM, at least then you have something to warn you. After an OOM, Java is in an indeterminate state so all bets are off. Best,

Re: Atomic Updates

2017-04-25 Thread Erick Erickson
How is "otherText" getting values in the first place? If it's the destination of a copyField directive, it'll be repopulated if the source of the copyField is stored=true. Best, Erick On Tue, Apr 25, 2017 at 12:40 PM, Chris Ulicny wrote: > Hello all, > > Suppose I have the

SolrServerException: Invalid use of BasicClientConnManager: connection still allocated.

2017-04-25 Thread Putul S
Hello, I am using a single instance of CloudSolrClient with my own HttpClient. The problem with using this HttpClient is that, whenever I add more than one document, LBHttpSolrClient complains about the connection not being released. Everything works fine if I do not use my own HttpClient. HttpClient httpClient =

Atomic Updates

2017-04-25 Thread Chris Ulicny
Hello all, Suppose I have the following fields in a document and populate all 4 fields for every document. id: uniqueKey, indexed and stored integer_field: indexed and stored text_field: indexed and stored othertext_field: indexed but not stored No default values, multivalues, docvalues,
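A minimal sketch of what an atomic update request body for the fields listed above might look like, assuming Solr's JSON update format with the `set` modifier. The helper name `atomic_set`, the document id "1" and the value 42 are made up for illustration; the sketch only builds the payload and does not send it to a server.

```python
import json


def atomic_set(doc_id, field, value):
    """Build one atomic-update document that sets a single field.

    Only the uniqueKey and the modified field are sent; any other field
    is left for the server to carry over from the existing document.
    """
    return {"id": doc_id, field: {"set": value}}


# Hypothetical update: change integer_field on the document with id "1".
payload = json.dumps([atomic_set("1", "integer_field", 42)])
print(payload)
```

Because the request names only `id` and `integer_field`, whatever happens to `othertext_field` (indexed but not stored) depends entirely on what the server can reconstruct, which is the case the thread is asking about.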

Re: Caused by: org.noggit.JSONParser$ParseException: Expected ',' or '}': char=",position=312 BEFORE='ssions"

2017-04-25 Thread Fuad Efendi
Yes, absolutely correct, a comma is missing at the end of line 10. All key-value pairs inside the same block should be comma separated, except the last one. From: Shawn Heisey Reply: solr-user@lucene.apache.org
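The rule described above can be reproduced with any strict JSON parser. The following sketch uses Python's standard `json` module rather than noggit, and the two-key fragment is a made-up illustration, not the poster's actual security.json.

```python
import json

# Illustrative fragment only: the comma after the first pair is missing.
broken = '{\n  "blockUnknown": true\n  "class": "solr.BasicAuthPlugin"\n}'
# Same fragment with the pairs comma separated (no comma after the last one).
fixed = '{\n  "blockUnknown": true,\n  "class": "solr.BasicAuthPlugin"\n}'


def parse_or_report(text):
    """Return the parsed object, or a short description of the parse error."""
    try:
        return json.loads(text)
    except json.JSONDecodeError as err:
        return f"line {err.lineno}: {err.msg}"


print(parse_or_report(broken))
print(parse_or_report(fixed))
```

The broken fragment is rejected with a message pointing at the missing delimiter, while the fixed one parses into a dictionary.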

Re: SolrIndexSearcher#getDocList() method returns zero results, if query includes tdate range query

2017-04-25 Thread Chris Hostetter
: The correct way for a plugin to do the sort of thing you are trying to do : would be to use an instance of SolrQueryParser -- see for example the code : in LuceneQParser and how it uses SolrQueryParser ... you'll most likely : just want to use LuceneQParser directly in your plugin to

Re: SolrIndexSearcher#getDocList() method returns zero results, if query includes tdate range query

2017-04-25 Thread Chris Hostetter
Different FieldTypes encode different values into terms in different ways. At query time the FieldTypes need to be consulted to know how to build the resulting query object. Solr's query parsers are "schema aware" and delegate to the appropriate FieldType to handle any index term encoding needed -- but

Re: Caused by: org.noggit.JSONParser$ParseException: Expected ',' or '}': char=",position=312 BEFORE='ssions"

2017-04-25 Thread bay chae
doh Thanks for the tip It worked perfectly!! > On 25 Apr 2017, at 19:28, Shawn Heisey wrote: > >

Re: Caused by: org.noggit.JSONParser$ParseException: Expected ',' or '}': char=",position=312 BEFORE='ssions"

2017-04-25 Thread Shawn Heisey
On 4/25/2017 12:10 PM, bay chae wrote: > https://stackoverflow.com/questions/43618000/solr-standalone-basicauth-org-noggit-jsonparserparseexception > > > > Hi I am following guides on

Caused by: org.noggit.JSONParser$ParseException: Expected ',' or '}': char=",position=312 BEFORE='ssions"

2017-04-25 Thread bay chae
https://stackoverflow.com/questions/43618000/solr-standalone-basicauth-org-noggit-jsonparserparseexception Hi I am following guides on security.json in

Re: Troubleshooting solr errors

2017-04-25 Thread Daniel Miller
The problem isn't a particular email message - I get a cascade of those errors (every time a new message is received) once the server "breaks". The fix is to restart the server. I did find a Java heap error in the log - so I've increased the memory allocation (now to -Xms512m -Xmx2048m). I

Re: SolrIndexSearcher#getDocList() method returns zero results, if query includes tdate range query

2017-04-25 Thread Rick Leir
Victor, In SolrAdmin, do a query then look at the top bar on the screen. Sorry, I cannot do a screenshot here. The actual query that SolrAdmin generated is in that top bar. It is difficult to cut and paste the query but possible. Or you can click on it and jump to a results page. SolrAdmin has

Re: Poll: Master-Slave or SolrCloud?

2017-04-25 Thread David Hastings
I can definitely attest to this. The really nice thing about the standard Solr/Jetty configuration is that it's all there, Lucene+Solr+Jetty and you just turn it on and run, and after only minor tweaks to JVM and memory settings, it's effectively production ready with a reliable master-slave

Re: Poll: Master-Slave or SolrCloud?

2017-04-25 Thread Rick Leir
All, I read somewhere that you should run your own ZK externally, and turn off SolrCloud. Comments please! Rick On April 25, 2017 1:33:31 PM EDT, "Otis Gospodnetić" wrote: > This is interesting - that ZK is seen as adding so much complexity that it turns people

Re: Poll: Master-Slave or SolrCloud?

2017-04-25 Thread Otis Gospodnetić
This is interesting - that ZK is seen as adding so much complexity that it turns people off! If you think about it, Elasticsearch users have no choice -- except their "ZK" is built-in, hidden, so one doesn't have to think about it, at least not initially. I think I saw mentions (maybe on user or

Re: SolrIndexSearcher#getDocList() method returns zero results, if query includes tdate range query

2017-04-25 Thread Victor Solakhian
Rick, Solr Admin does not generate a query. I use it just to confirm that the query generated by our code returns results. The getDocList() method also does not generate a query. It returns a list of document IDs for the query created by the QueryParser.parse(query,...) method. Thanks, Victor On

Re: Poll: Master-Slave or SolrCloud?

2017-04-25 Thread Sales
> On Apr 25, 2017, at 11:23 AM, Erick Erickson wrote: > > Maybe the other thing in play here is that use-cases that "just work" > in the master/slave environment are less likely to employ consultants > so we get something of a skewed sense of who uses what ;) > So,

Re: DIH Issues

2017-04-25 Thread Sales
> On Apr 25, 2017, at 10:28 AM, AJ Lemke wrote: > > Thanks for the thought Alex! > The fields that have this happen most often are numeric and boolean fields. > These fields have real data (id numbers, true/false, etc.) > > AJ > We had an identical problem a few

Re: SolrIndexSearcher#getDocList() method returns zero results, if query includes tdate range query

2017-04-25 Thread Rick Leir
Victor, When you do a query in SolrAdmin, the generated query is shown at the top of the page. Can you compare that with the query that getDocList generates? Or did I misunderstand your question? Cheers -- Rick On April 25, 2017 11:34:17 AM EDT, Victor Solakhian

Re: DIH Issues

2017-04-25 Thread Alexandre Rafalovitch
I wonder if it is possible to write a component/URP/something that will intercept exceptions like these and dump out full record. Regards, Alex. http://www.solr-start.com/ - Resources for Solr users, new and experienced On 25 April 2017 at 12:19, Erick Erickson

Re: Poll: Master-Slave or SolrCloud?

2017-04-25 Thread Erick Erickson
Maybe the other thing in play here is that use-cases that "just work" in the master/slave environment are less likely to employ consultants so we get something of a skewed sense of who uses what ;) On Tue, Apr 25, 2017 at 1:50 AM, Charlie Hull wrote: > On 24/04/2017 15:58,

Re: DIH Issues

2017-04-25 Thread Erick Erickson
You say your SQL database always has the values, but does the output from the SQL query you actually use have them? I've been fooled before when the query I form "somehow" doesn't have a value for all the fields I expect. You could also crank the logging level up enough to see the docs that are

Re: Graph traversel

2017-04-25 Thread mganeshs
Dear Solr experts, Can anyone here explain why graph traversal is not working as expected in Solr 6.5? It's not traversing all the child nodes. It traverses only a few nodes and does not reach all the mid-level and leaf nodes. As I explained below, For this query

SolrIndexSearcher#getDocList() method returns zero results, if query includes tdate range query

2017-04-25 Thread Victor Solakhian
We have code that uses *SolrIndexSearcher#getDocList()* method to get document IDs for the query. First a Solr query string is generated from UI, then the following code creates a Lucene Query org.apache.lucene.search.Query query = parser.parse(solrQueryString); where parser is

RE: DIH Issues

2017-04-25 Thread AJ Lemke
Thanks for the thought Alex! The fields that have this happen most often are numeric and boolean fields. These fields have real data (id numbers, true/false, etc.) AJ -Original Message- From: Alexandre Rafalovitch [mailto:arafa...@gmail.com] Sent: Tuesday, April 25, 2017 8:27 AM To:

Re: DIH Issues

2017-04-25 Thread Alexandre Rafalovitch
Maybe the content gets simplified away between the database and the Solr schema. For example if your field contains just spaces and you have UpdateRequestProcessors to do trim and removal of empty fields? Schemaless mode will remove empty fields, but will not trim for example. Regards, Alex.

DIH Issues

2017-04-25 Thread AJ Lemke
Hey all, We are using 6.3.0 and we have issues with DIH throwing errors. We are seeing an intermittent issue where on a full index a single error will be thrown. The error is always "missing required field: fieldname". Our SQL database always has data in the field that comes up with the

JSON Response for Spellcheck Collate

2017-04-25 Thread Zoran
Hi Guys, I’m using Solr 6.5.0, which is fantastic, and I’ve come across an issue with the collations in the spellcheck response. The way the JSON is structured, collations is an object with each collation named ‘collation’, where it should be an array with multiple ‘collation’ object

Re: termfreq usage/syntax

2017-04-25 Thread Shawn Heisey
On 4/25/2017 4:46 AM, Saman Rasheed wrote: > what i'm looking for is syntax similar to: 'termfreq(content,*house*)' > which doesn't work. I doubt this function knows how to deal with wildcards. It sounds like it can only do exact terms. One option you have is to use the /terms handler with the

Re: Huge cfs files

2017-04-25 Thread Shawn Heisey
On 4/25/2017 3:33 AM, Avi Steiner wrote: > We have a customer with Solr 5.3.1. > The index contains less than 3.5 million docs, and index folder size is about > 240GB. If 3.5 million documents create a 240GB index, then this is a very atypical index. The documents must be HUGE, or else you are

Re: Issues with ingesting to Solr using Flume

2017-04-25 Thread Shawn Heisey
On 4/20/2017 9:02 AM, Anantharaman, Srinatha (Contractor) wrote: > Hi all, > > I am trying to ingest data to Solr 6.3 using flume 1.5 on Hortonworks 2.5 > platform. Facing below issue while sinking the data > > 19 Apr 2017 19:54:26,943 ERROR [lifecycleSupervisor-1-3] >

termfreq usage/syntax

2017-04-25 Thread Saman Rasheed
hi Solr team, i'm starting to have fun with solr, and i'm in a big project that requires me to index some books and then do certain term lookups on them. I'm using windows 10 and i've successfully managed to index a book containing more than 118,000 words! which is normal i guess. in the

Huge cfs files

2017-04-25 Thread Avi Steiner
Hi, We have a customer with Solr 5.3.1. The index contains less than 3.5 million docs, and the index folder size is about 240GB. I found that the largest files are .cfs files (compound files) that were created recently, although only a few documents were added. The useCompoundFile parameter is

Re: Poll: Master-Slave or SolrCloud?

2017-04-25 Thread Charlie Hull
On 24/04/2017 15:58, Otis Gospodnetić wrote: Hi, I'm really really surprised here. Back in 2013 we did a poll to see how people were running Master-Slave (4.x back then) and SolrCloud was a bit more popular than Master-Slave: https://sematext.com/blog/2013/02/25/poll-solr-cloud-or-not/ Here

Re: Poll: Master-Slave or SolrCloud?

2017-04-25 Thread Bernd Fehling
Hi, bq: What amazes me that in 2017 we don't see a lot more SolrCloud users! Really? SolrCloud is much more complex. All of a sudden you have to deal with ZooKeeper, which brings a new level of complexity into play where you only want to have some data stored and searchable. The easiness of