RE: FW: NRTCachingDirectory threads stuck

2015-02-23 Thread Moshe Recanati
Thank you. Regards, Moshe Recanati SVP Engineering Office + 972-73-2617564 Mobile  + 972-52-6194481 Skype    :  recanati More at:  www.kmslh.com | LinkedIn | FB -Original Message- From: Mikhail Khludnev [mailto:mkhlud...@griddynamics.com] Sent: Sunday, February 22, 2015 6:16 PM To:

highlighting the boolean query

2015-02-23 Thread Dmitry Kan
Hello! In solr 4.3.1 there seem to be some inconsistency with the highlighting of the boolean query: a OR (b c) OR d This returns a proper hit, which shows that only d was included into the document score calculation. But the highlighter returns both d and c in em tags. Is this a known issue

Re: Question on CloudSolrServer API

2015-02-23 Thread Shalin Shekhar Mangar
By default the max connections is set to 128 and max connections per host is 32. You can configure an HttpClient as per your needs and pass it as a parameter to CloudSolrServer's constructor. On Mon, Feb 23, 2015 at 3:49 PM, Manohar Sripada manohar...@gmail.com wrote: Thanks for the response.

Re: Question on CloudSolrServer API

2015-02-23 Thread Manohar Sripada
Thanks for the response. How to control the number of connections pooled here in SolrJ Client? Also, what will be the default values for maximum Connections and all. - Thanks On Thu, Feb 19, 2015 at 6:09 PM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: No, you should reuse the same

CollationKeyFilterFactory stops suggestions and collations

2015-02-23 Thread Nitin Solanki
Hello all, I am working on collations. Somewhere in Solr, I found that UnicodeCollation will do searching fast. But after applying CollationKeyFilterFactory in schema.xml, it stops the suggestions and collations both. Please check the configurations and help me. *Schema.xml:*

Atomic Update while having fields with attribute stored=true in schema

2015-02-23 Thread Rahul Bhooteshwar
Hi, I have around 50 fields in my schema and having 20 fields are stored=”true” and rest of them stored=”false” In case partial update (atomic update), it is mentioned at many places that the fields in schema should have stored=”true”. I have also tried atomic update on documents having fields

Re: Atomic Update while having fields with attribute stored=true in schema

2015-02-23 Thread Yago Riveiro
Field with store=true has the downside of disk space. Your index will grow in space requirements. Maybe update the whole document can be an option ... — /Yago Riveiro On Mon, Feb 23, 2015 at 1:02 PM, Rahul Bhooteshwar rahul.bhootesh...@hotwaxsystems.com wrote: Hi Yago Riveiro,

Re: Atomic Update while having fields with attribute stored=true in schema

2015-02-23 Thread Yago Riveiro
Which means I have to change all my fields to stored=”true” if I want to use atomic update.Right?” Yes, and re-index all your data. Will it affect the performance of the Solr?” What type of queries are you doing now? — /Yago Riveiro On Mon, Feb 23, 2015 at 12:05 PM, Rahul

Re: Solr 4.x to Solr 5 = org.noggit.JSONParser$ParseException

2015-02-23 Thread Alan Woodward
I think this means you've got an older version of noggit around. You need version 0.6. Alan Woodward www.flax.co.uk On 23 Feb 2015, at 13:00, Clemens Wyss DEV wrote: Just about to upgrade to Solr5. My UnitTests fail: 13:50:41.178 [main] ERROR org.apache.solr.core.CoreContainer - Error

Re: CollationKeyFilterFactory stops suggestions and collations

2015-02-23 Thread Nitin Solanki
Hi all, I have found to use UnicodeCollation. I need *lucene-collation-2.9.1.jar. *I am using solr 4.10.2. I have download lucene-collation-2.9.1.jar where I have to store this or Is it already in-built in solr? If it already in solr then why suggestions and collations are not coming? Any

Solr 4.x to Solr 5 = org.noggit.JSONParser$ParseException

2015-02-23 Thread Clemens Wyss DEV
Just about to upgrade to Solr5. My UnitTests fail: 13:50:41.178 [main] ERROR org.apache.solr.core.CoreContainer - Error creating core [1-de_CH]: null java.lang.ExceptionInInitializerError: null at org.apache.solr.core.SolrConfig.getConfigOverlay(SolrConfig.java:359)

Re: Atomic Update while having fields with attribute stored=true in schema

2015-02-23 Thread Rahul Bhooteshwar
Hi Yago Riveiro, Thanks for your quick reply. I am using Solr for faceted search using *Solr**j. *I am using facet queries and filter queries. I am new to Solr so I would like to know what is the best practice to handle such scenarios. Thanks and Regards, Rahul Bhooteshwar Enterprise Software

Re: Solr 4.x to Solr 5 = org.noggit.JSONParser$ParseException

2015-02-23 Thread Noble Paul
This code is executed every time Solr is initialized and it is unlikely that it is a bug. Are you using an older version of noggit.jar by any chance? On Mon, Feb 23, 2015 at 6:30 PM, Clemens Wyss DEV clemens...@mysign.ch wrote: Just about to upgrade to Solr5. My UnitTests fail: 13:50:41.178

Stop solr query

2015-02-23 Thread Moshe Recanati
Hi, Recently there were some scenarios in which queries that user sent to solr got stuck and increased our solr heap. Is there any option to kill or timeout query that wasn't returned from solr by external command? Thank you, Regards, Moshe Recanati SVP Engineering Office + 972-73-2617564

incorrect Java version reported in solr dashboard

2015-02-23 Thread SolrUser1543
I have upgraded Java version from 1.7 to 1.8 on Linux server. After the upgrade, if I run Java -version I can see that it really changed to the new one. But when I run Solr, it is still reporting the old version in dashboard JVM section. What could be the reason? -- View this message

Re: incorrect Java version reported in solr dashboard

2015-02-23 Thread Michael Della Bitta
You're probably launching Solr using the older version of Java somehow. You should make sure your PATH and JAVA_HOME variables point at your Java 8 install from the point of view of the script or configuration that launches Solr. Hope that helps. Michael Della Bitta Senior Software Engineer o:

Re: Used CollationKeyFilterFactory, Seems not to be working

2015-02-23 Thread Ahmet Arslan
Hi Nitin, How can you pass empty value to the language attribute? Is this intentional? What is your intention to use that filter with suggestion functionality? Ahmet On Monday, February 23, 2015 5:03 PM, Nitin Solanki nitinml...@gmail.com wrote: Hi, I have integrate

Re: Stop solr query

2015-02-23 Thread Shawn Heisey
On 2/23/2015 7:23 AM, Moshe Recanati wrote: Recently there were some scenarios in which queries that user sent to solr got stuck and increased our solr heap. Is there any option to kill or timeout query that wasn't returned from solr by external command? The best thing you can do is examine

[ANNOUNCE] Luke 4.10.3 released

2015-02-23 Thread Dmitry Kan
Hello, Luke 4.10.3 has been released. Download it here: https://github.com/DmitryKey/luke/releases/tag/luke-4.10.3 The release has been tested against the solr-4.10.3 based index. Issues fixed in this release: #13 https://github.com/DmitryKey/luke/pull/13 Apache License 2.0 abbreviation

Used CollationKeyFilterFactory, Seems not to be working

2015-02-23 Thread Nitin Solanki
Hi, I have integrate CollationKeyFilterFactory in schema.xml and re-index the data again. *filter class=solr.CollationKeyFilterFactory language= strength=primary/* I need to use this becuase I want to build collations fast. Referred link: http://wiki.apache.org/solr/UnicodeCollation But

AW: Solr 4.x to Solr 5 = org.noggit.JSONParser$ParseException

2015-02-23 Thread Clemens Wyss DEV
Bingo! thx for the hint -Ursprüngliche Nachricht- Von: Alan Woodward [mailto:a...@flax.co.uk] Gesendet: Montag, 23. Februar 2015 15:00 An: solr-user@lucene.apache.org Betreff: Re: Solr 4.x to Solr 5 = org.noggit.JSONParser$ParseException I think this means you've got an older version

Re: Strange search behaviour when upgrading to 4.10.3

2015-02-23 Thread Rishi Easwaran
Thanks Shawn. Just ran the analysis between 4.6 and 4.10, there seems to be only difference between the outputs positionLength value is set in 4.10. Does that mean anything. Version 4.10 SF text raw_bytes start end positionLength type position message [6d 65 73 73 61 67

Is Solr best for did you mean functionality just like Google?

2015-02-23 Thread Nitin Solanki
Hello, I came in the worst condition. I want to do spell/query correction functionality. I have 49 GB indexed data where I have applied spellchecker. I want to do same as Google - *did you mean*. *Example* - If any user types any question/query which might be misspell or wrong typed. I

Re: Collations are not working fine.

2015-02-23 Thread Nitin Solanki
Hi Charles, How you patch the suggester to get frequency information in the spellcheck response? It's very good. I also want to do that? On Mon, Feb 16, 2015 at 7:59 PM, Reitzel, Charles charles.reit...@tiaa-cref.org wrote: I have been working with collations the last couple days

Re: syntax for increasing java memory

2015-02-23 Thread Walter Underwood
That depends on the JVM you are using. For the Oracle JVMs, use this to get a list of extended options: java -X wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) On Feb 23, 2015, at 8:21 AM, Kevin Laurie superinterstel...@gmail.com wrote: Hi Guys, I

Re: syntax for increasing java memory

2015-02-23 Thread Kevin Laurie
Hi Walter Got it. java -Xmx1024m -jar start.jar Thanks Kevin On Tue, Feb 24, 2015 at 1:00 AM, Kevin Laurie superinterstel...@gmail.com wrote: Hi Walter, I am running :- Oracle Corporation OpenJDK 64-Bit Server VM (1.7.0_65 24.65-b04) I tried running with this command:- java -jar

Re: Used CollationKeyFilterFactory, Seems not to be working

2015-02-23 Thread Ahmet Arslan
Hi Nitin, I think that token filter factory has nothing to do with collations in spellchecker domain. Single term from different domains causing confusion. solr.CollationKeyFilterFactory targets mainly for locale sensitive sorting. For example, I used below type to fix sorting problem of

Re: Collations are not working fine.

2015-02-23 Thread Rajesh Hazari
Hi, we have used spellcheck component the below configs to get a best collation (exact collation) when a query has either single term or multiple terms. As charles, mentioned above we do have a check for getOriginalFrequency() for each term in our service before we send spellcheck response to

syntax for increasing java memory

2015-02-23 Thread Kevin Laurie
Hi Guys, I am a newbie on Solr and I am just using it for dovecot sake. Could you help advise the correct syntax to increase java heap size using the -xmx option(or advise some easy-to-read literature for configuring) ? Much appreciate if you could help. I just need this to sort out the problem

Re: highlighting the boolean query

2015-02-23 Thread Dmitry Kan
Erick, nope, we are using std lucene qparser with some customizations, that do not affect the boolean query parsing logic. Should we try some other highlighter? On Mon, Feb 23, 2015 at 6:57 PM, Erick Erickson erickerick...@gmail.com wrote: Are you using edismax? On Mon, Feb 23, 2015 at 3:28

Re: highlighting the boolean query

2015-02-23 Thread Erick Erickson
Are you using edismax? On Mon, Feb 23, 2015 at 3:28 AM, Dmitry Kan solrexp...@gmail.com wrote: Hello! In solr 4.3.1 there seem to be some inconsistency with the highlighting of the boolean query: a OR (b c) OR d This returns a proper hit, which shows that only d was included into the

Re: Used CollationKeyFilterFactory, Seems not to be working

2015-02-23 Thread Nitin Solanki
Hi Ahmet, language= means that it is used for any language - simply define the language as the empty string for most languages *Intention:* I am working on spell/question correction. Just like google, I want to do same as did you mean. Using spellchecker, I got suggestions and

Optimize maxSegments=2 not working right with Solr 4.10.2

2015-02-23 Thread Tom Burton-West
Hello, We normally run an optimize with maxSegments=2 after our daily indexing. This has worked without problem on Solr 3.6. We recently moved to Solr 4.10.2 and on several shards the optimize completed with no errors in the logs, but left more than 2 segments. We send this xml to Solr

Re: syntax for increasing java memory

2015-02-23 Thread Kevin Laurie
Hi Walter, I am running :- Oracle Corporation OpenJDK 64-Bit Server VM (1.7.0_65 24.65-b04) I tried running with this command:- java -jar start.jar -Xmx1024m WARNING: System properties and/or JVM args set. Consider using --dry-run or --exec 0[main] INFO org.eclipse.jetty.server.Server ?

RE: Collations are not working fine.

2015-02-23 Thread Reitzel, Charles
I filed issue SOLR-7144 with the patch attached. It's probably best to get some feedback from developers. It may not be the right approach, etc. Also, spellcheck.maxCollationTries 0 is the parameter needed to get collation results that respect the current filter queries, etc. Set

Re: Suggestion on distinct/ group by for a field ?

2015-02-23 Thread Erick Erickson
Maybe pivot facets will do what you need? See: https://cwiki.apache.org/confluence/display/solr/Faceting#Faceting-Pivot(DecisionTree)Faceting Best, Erick On Mon, Feb 23, 2015 at 11:31 AM, Vishal Swaroop vishal@gmail.com wrote: Please suggest on how to get the distinct count for a field

Basic Multilingual search capability

2015-02-23 Thread Rishi Easwaran
Hi All, For our use case we don't really need to do a lot of manipulation of incoming text during index time. At most removal of common stop words, tokenize emails/ filenames etc if possible. We get text documents from our end users, which can be in any language (sometimes combination) and we

Re: highlighting the boolean query

2015-02-23 Thread Erick Erickson
Highlighting is such a pain... what does the parsed query look like? If the default operator is OR, then this seems correct as both 'd' and 'c' appear in the doc. So I'm a bit puzzled by your statement that c didn't contribute to the score. If the parsed query is, indeed a +b +c d then it does

Suggestion on distinct/ group by for a field ?

2015-02-23 Thread Vishal Swaroop
Please suggest on how to get the distinct count for a field (name). Summary : I have data indexed in the following format category name value Cat1 A 1 Cat1 A 2 Cat1 B 3 Cat1 B 4 I tried getting the distinct name count... but it returns 4 records instaed of 2 (i.e. A, B)

SolrCloud 4.10.3 Security

2015-02-23 Thread mihaela olteanu
Hello, Does anyone know why the Basic authentication was not yet released for SolrCloud as described on the wiki page:  https://wiki.apache.org/solr/SolrSecurity? Is there any plan in the near future for closing this issue: https://issues.apache.org/jira/browse/SOLR-4470 ? Isn't already a very

more like this and term vectors

2015-02-23 Thread Scott C. Cote
Is there a way to configure the more like this query handler and also receive the corresponding term vectors? (tf-idf) ? I tried by creating a “search component” for the term vectors and adding it to the mlt handler, but that did not work. Here is what I tried: searchComponent

Re: more like this and term vectors

2015-02-23 Thread Jack Krupansky
It's never helpful when you merely say that it did not work - detail the symptom, please. Post both the query and the response. As well as the field and type definitions for the fields for which you expected term vectors - no term vectors are enabled by default. -- Jack Krupansky On Mon, Feb

Re: Basic Multilingual search capability

2015-02-23 Thread Alexandre Rafalovitch
Which languages are you expecting to deal with? Multilingual support is a complex issue. Even if you think you don't need much, it is usually a lot more complex than expected, especially around relevancy. Regards, Alex. Sign up for my Solr resources newsletter at

Error instantiating class: 'org.apache.lucene.collation.CollationKeyFilterFactory'

2015-02-23 Thread Nitin Solanki
Hi, I am using Collation Key Filter. After adding it into schema.xml. *Schema.xml* field name=gram type=textSpell indexed=true stored=true required=true multiValued=false/ /fieldTypefieldType name=textSpell class=solr.TextField positionIncrementGap=100 analyzer type=index

Geo Aggregations and Search Alerts in Solr

2015-02-23 Thread Richard Gibbs
Hi There, I am in the process of choosing a search technology for one of my projects and I was looking into Solr and Elasticsearch. Two features that I am more interested are geo aggregations (for map clustering) and search alerts. Elasticsearch seem to have these two features built-in.

Query: no result returned if use AND OR operators

2015-02-23 Thread arthur.hk.c...@gmail.com
Hi, My Solr is 4.10.2 When I use the web UI to run a simple query: 1+AND+2 1) from the log, I can see the hits=8 7629109 [qtp1702388274-16] INFO org.apache.solr.core.SolrCore – [infocast] webapp=/solr path=/clustering params={q=1+AND+2wt=velocityv.template=cluster_results}

Re: Basic Multilingual search capability

2015-02-23 Thread Walter Underwood
It isn’t just complicated, it can be impossible. Do you have content in Chinese or Japanese? Those languages (and some others) do not separate words with spaces. You cannot even do word search without a language-specific, dictionary-based parser. German is space separated, except many noun

Re: Special character and wildcard matching

2015-02-23 Thread Jack Krupansky
Is it really a string field - as opposed to a text field? Show us the field and field type. Besides, if it really were a raw name, wouldn't that be a capital B? -- Jack Krupansky On Mon, Feb 23, 2015 at 6:52 PM, Arun Rangarajan arunrangara...@gmail.com wrote: I have a string field raw_name

Re: Special character and wildcard matching

2015-02-23 Thread Jack Krupansky
But how is that lowercasing occurring? I mean, solr.StrField doesn't do that. Some containers default to automatically mapping accented characters, so that the accented e would then get indexed as a normal e, and then your wildcard would match it, and an accented e in a query would get mapped as

Re: Basic Multilingual search capability

2015-02-23 Thread Rishi Easwaran
Hi Alex, There is no specific language list. For example: the documents that needs to be indexed are emails or any messages for a global customer base. The messages back and forth could be in any language or mix of languages. I understand relevancy, stemming etc becomes extremely

Special character and wildcard matching

2015-02-23 Thread Arun Rangarajan
I have a string field raw_name like this in my document: {raw_name: beyoncé} (Notice that the last character is a special character.) When I issue this wildcard query: q=raw_name:beyonce* i.e. with the last character simply being the ASCII 'e', Solr returns me the above document. How do I

Re: Special character and wildcard matching

2015-02-23 Thread Arun Rangarajan
Yes, it is a string field and not a text field. fieldType name=string class=solr.StrField sortMissingLast=true omitNorms=true/ field name=raw_name type=string indexed=true stored=true / Lower-casing done to do case-insensitive matching. On Mon, Feb 23, 2015 at 4:01 PM, Jack Krupansky

apache solr - dovecot - some search fields works some dont

2015-02-23 Thread Kevin Laurie
Hi, I finally understand how Solr works(somewhat) its a bit complicated as I am new to the whole concept but I understand it as a search engine. I am using Solr with dovecot. and I found out that some seach fields from the inbox work and other dont. For example if I were to search To and From

snapinstaller does not start newSearcher

2015-02-23 Thread alxsss
Hello, I am using latest solr (solr trunk) . I run snapinstaller, and see that it copies snapshot to index folder but changes are not picked up and logs in slave after running snapinstaller are 44302 [qtp1312571113-14] INFO org.apache.solr.update.UpdateHandler – start

Re: Basic Multilingual search capability

2015-02-23 Thread Rishi Easwaran
Hi Wunder, Yes we do expect incoming documents to contain Chinese/Japanese/Arabic languages. From what you have mentioned, it looks like we need to auto detect the incoming content language and tokenize/filter after that. But I thought the ICU tokenizer had capability to do that

Setting Up an External ZooKeeper Ensemble

2015-02-23 Thread CKReddy Bhimavarapu
Hi, I did follow all the steps in [ https://cwiki.apache.org/confluence/display/solr/Setting+Up+an+External+ZooKeeper+Ensemble] but still I am getting this error bWaiting to see Solr listening on port 8983 [-] Still not seeing Solr listening on 8983 after 30 seconds!/b WARN - 2015-02-24

Re: Basic Multilingual search capability

2015-02-23 Thread Trey Grainger
Hi Rishi, I don't generally recommend a language-insensitive approach except for really simple multilingual use cases (for most of the reasons Walter mentioned), but the ICUTokenizer is probably the best bet you're going to have if you really want to go that route and only need exact-match on the