RE: can't make sense of spellchecker results when using techproducts example

2015-01-14 Thread Chris Hostetter
James: everything you said made perfect sense, and in hindsight was actually covered on the page -- it was just hte example that was bogus in light of the current config defaults I went ahead and fixed it based on your feedback, and beefed up the explanation of spellcheck.collateParam.* (now

Re: Distributed mode for stats component?

2015-01-14 Thread Jack Krupansky
Thanks, Chris. I just needed to stare at the code I already knew about more intently to see what was really going on. It's super convoluted and super confusing. The keys were the handleResponses method in the main component class and the AbstractStatsValues class that is hidden in the

Re: Engage custom hit collector for special search processing

2015-01-14 Thread tedsolr
Thank you so much Alex and Joel for your ideas. I am pouring through the documentation and code now to try an understand it all. A post filter sounds promising. As 99% of my doc fields are character based I should try to compliment the collapsing Q parser with an option that compares string fields

WordDelimiter Works differently in solr3X vs SolrCloud..?

2015-01-14 Thread gouthsmsimhadri
Problem: While migrating the solr version from 3.X(schema version is 1.4) to cloud 4.10.0 (schema version 1.5), I see a difference in the way the worddelimiterfilter works for the below configuration filter class=solr.WordDelimiterFilterFactory generateWordParts=1generateNumberParts=1

Conditions in function query

2015-01-14 Thread Shamik Bandopadhyay
Hi, Just wanted to know if it's possible to provide conditions with a function query. Right now,I'm using the following functions to boost on Likes data. bf=recip(ms(NOW/DAY,PublishDate),3.16e-11,1,1)^2.0 sum(Likes,2) What I would like to do is to apply the boost on Likes based on source.

Re: Load existing Lucene sharded indexes onto single Solr collection

2015-01-14 Thread Jaikit Savla
Yes, I wanted to get rid of merge step. But looks like merge is not that cumbersome either. Thanks Mikhail and Erick for pointers, that helped. Jaikit On Wednesday, January 14, 2015 8:24 AM, Erick Erickson erickerick...@gmail.com wrote: You certainly can't do this into a single

Re: Occasionally getting error in solr suggester component.

2015-01-14 Thread Michael Sokolov
did you build the spellcheck index using spellcheck.build as described here: https://cwiki.apache.org/confluence/display/solr/Spell+Checking ? -Mike On 01/14/2015 07:19 AM, Dhanesh Radhakrishnan wrote: Hi, Thanks for the reply. As you mentioned in the previous mail I changed

Re: WordDelimiter Works differently in solr3X vs SolrCloud..?

2015-01-14 Thread Ahmet Arslan
Hi, You could try passing luceneMatchVersion argument to WordDelimiterFilterFactory and see if it works for you. Factory returns Lucene47WordDelimiterFilter before LUCENE_4_8_0. Ahmet On Wednesday, January 14, 2015 11:10 PM, gouthsmsimhadri gouthamsimha...@gmail.com wrote: Problem: While

Re: WordDelimiter Works differently in solr3X vs SolrCloud..?

2015-01-14 Thread gouthsmsimhadri
Thanks Ahmet, that works. - -goutham -- View this message in context: http://lucene.472066.n3.nabble.com/WordDelimiter-Works-differently-in-solr3X-vs-SolrCloud-tp4179647p4179662.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: How to configure Solr PostingsFormat block size

2015-01-14 Thread Chris Hostetter
: As a foolish dev (not malicious I hope!), I did mess around with something : like this once; I was writing my own Codec. I found I had to create a file : called META-INF/services/org.apache.lucene.codecs.Codec in my solr plugin jar : that contained the fully-qualified class name of my codec: I

Re: Conditions in function query

2015-01-14 Thread Erick Erickson
Nest them perhaps? Best Erick On Wed, Jan 14, 2015 at 7:07 PM, shamik sham...@gmail.com wrote: Thanks Eric, I did take a look at the if condition earlier, but not sure how that can be used for multiple conditions. It works for a single condition : if(termfreq(Source2,'A'),sum(Likes,3),0)

Re: Conditions in function query

2015-01-14 Thread Erick Erickson
Why won't the if clause work? See: https://cwiki.apache.org/confluence/display/solr/Function+Queries On Wed, Jan 14, 2015 at 5:29 PM, Shamik Bandopadhyay sham...@gmail.com wrote: Hi, Just wanted to know if it's possible to provide conditions with a function query. Right now,I'm using the

Re: Conditions in function query

2015-01-14 Thread shamik
Thanks Eric, I did take a look at the if condition earlier, but not sure how that can be used for multiple conditions. It works for a single condition : if(termfreq(Source2,'A'),sum(Likes,3),0) But for multiple, I'm struggling to find the right syntax. I tried using OR in conjunction but hasn't

Distributed Search returns Empty document list

2015-01-14 Thread Jaikit Savla
Hello, I am running Solr (4.10) in cloud mode by configuring multiple collections (1 for each day). Structure is as shown below. I can fetch documents for given query, if I query individual collection. However when I send distributed request to multiple shards, I only see numFound and no

Re: Tokenizer or Filter ?

2015-01-14 Thread Jack Krupansky
It's what Java has, whatever that is: http://docs.oracle.com/javase/7/docs/api/java/util/regex/Pattern.html So, maybe the correct answer is neither, but similar to both. -- Jack Krupansky On Wed, Jan 14, 2015 at 9:06 AM, tomas.kalas kala...@email.cz wrote: Oh yeah, that is it. Thank you very

Re: How to do fuzzy search on phrases

2015-01-14 Thread Tomoko Uchida
Iam using solr 4.7 and solr php client. Back to original question, I've missed it. ComplexPhraseQueryParser is not available in Solr 4.7, so sorry for misleading information. Regards, Tomoko 2015-01-14 23:44 GMT+09:00 Tomoko Uchida tomoko.uchida.1...@gmail.com: Hi Adrien, No, you cannot

Re: How to do fuzzy search on phrases

2015-01-14 Thread Tomoko Uchida
Hi Adrien, No, you cannot use ComplexPhraseQueryParser in Solr 3.3.0 since this was introduced at Solr 4.8 (it's a pretty new feature...) https://issues.apache.org/jira/browse/SOLR-1604 お邪魔しました。 You do not need this phrase here, we rarely use this in mails. :) Thanks, Tomoko 2015-01-14 23:19

Solr groups not matching with terms in a field

2015-01-14 Thread Naresh Yadav
Hi all, I had done following configuration to test Solr grouping concept. solr version : 4.6.1 (tried in latest version 4.10.3 also) Schema : http://www.imagesup.net/?di=10142124357616 Solrj code to insert docs :http://www.imagesup.net/?di=10142124381116 Response Group's :

Re: Solr grouping problem - need help

2015-01-14 Thread Naresh Yadav
as my problem is changed completely from first post so i had created new thread for that. On Wed, Jan 14, 2015 at 7:31 PM, Naresh Yadav nyadav@gmail.com wrote: just wanted to share schema and results for same : solr version : 4.6.1 Schema : http://www.imagesup.net/?di=10142124357616

Re: Engage custom hit collector for special search processing

2015-01-14 Thread William Bell
We all need example data, and a sample query to help you. You can use group to group by a field and remove dupes. If you want to remove dupes you can do something like: q=field1:DOG AND NOT field2:DOG AND NOT field3:DOG That will remove DOG from field2 or field3. If you don't care if it is in

Re: Distributed search across Solr cores in a collection - NPE

2015-01-14 Thread Jaikit Savla
It was because I did not have unique id's in my index. I added that and it worked. Also it is mentioned as one of the requirement for Distributed Search. Thanks,Jaikit On Wednesday, January 14, 2015 1:53 AM, Jaikit Savla jaikit.sa...@yahoo.com wrote: Folks, I have set up 3 cores

Core deletion

2015-01-14 Thread phiroc
Hello, I am running SOLR 4.10.0 on Tomcat 8. The solr.xml file in .../apache-tomcat-8.0.15_solr_8983/conf/Catalina/localhost looks like this: ?xml version=1.0 encoding=utf-8? Context docBase=/archives/solr/example/solr/solr.war debug=0 crossContext=true Environment name=solr/home

Re: Load existing Lucene sharded indexes onto single Solr collection

2015-01-14 Thread Erick Erickson
You certainly can't do this into a single directory, there would be zillions of name conflicts. I believe I saw Uwe make a comment on the Lucene list about using MultiReaders and keeping the sub-indexes in different directories, but that's lower-level than Solr has access to Plus, you'd have to

Distributed mode for stats component?

2015-01-14 Thread Jack Krupansky
Does anybody know for sure whether the stats component fully supports distributed mode? It is listed in the doc as supporting distributed mode (at least for old, non-SolrCloud distrib mode), but... I don't see any code that actually does that. Nor any tests, unless they are hidden somewhere I

OutOfMemoryError for PDF document upload into Solr

2015-01-14 Thread Ganesh.Yadav
Hello, Can someone pass on the hints to get around following error? Is there any Heap Size parameter I can set in Tomcat or in Solr webApp that gets deployed in Solr? I am running Solr webapp inside Tomcat on my local machine which has RAM of 12 GB. I have PDF document which is 4 GB max in

Re: OutOfMemoryError for PDF document upload into Solr

2015-01-14 Thread Michael Della Bitta
Yep, you'll have to increase the heap size for your Tomcat container. http://stackoverflow.com/questions/6897476/tomcat-7-how-to-set-initial-heap-size-correctly Michael Della Bitta Senior Software Engineer o: +1 646 532 3062 appinions inc. “The Science of Influence Marketing” 18 East 41st

Re: How to configure Solr PostingsFormat block size

2015-01-14 Thread Michael Sokolov
As a foolish dev (not malicious I hope!), I did mess around with something like this once; I was writing my own Codec. I found I had to create a file called META-INF/services/org.apache.lucene.codecs.Codec in my solr plugin jar that contained the fully-qualified class name of my codec: I

Re: Distributed mode for stats component?

2015-01-14 Thread Chris Hostetter
: Does anybody know for sure whether the stats component fully supports : distributed mode? It is listed in the doc as supporting distributed mode it's been supported for as long as i can remember -- since Day 1 of the StatsComponent i believe. : (at least for old, non-SolrCloud distrib mode),

Re: Solr fails to start with log file not found error

2015-01-14 Thread Graeme Pietersz
I use the same user every time and the /logs directory and everything in it is owned by that user. I get the same problem occasionally developing on my Ubuntu 14.10 laptop as well, and all the files in the solr directory are owned by me on that machine (and I run Solr as me there as well).

Load existing Lucene sharded indexes onto single Solr collection

2015-01-14 Thread Jaikit Savla
Folks, I have generated multiple (count of 100) sharded Lucene indexes on Hadoop and they are of format. The total indexed data (sum of all the index-*) is of size 500GB and hence the number of shards.drwxr-x--- 2 index-66drwxr-x--- 2 index-68drwxr-x--- 2 index-9 and each index

Re: Load existing Lucene sharded indexes onto single Solr collection

2015-01-14 Thread Mikhail Khludnev
On Wed, Jan 14, 2015 at 11:42 AM, Jaikit Savla jaikit.sa...@yahoo.com.invalid wrote: Now to load this index, I am currently using Lucene IndexMergeTool to merge all the shards into one giant index. My question is, is there a way to load shared index without merging into one giant index on to

Re: Load existing Lucene sharded indexes onto single Solr collection

2015-01-14 Thread Jaikit Savla
This solution will merge the index as well. I want to find out if merge is required before loading indexes onto Solr ?  If that is possible than I can just point solrconfig.xml to directory where I have all the shards. Jaikit On Wednesday, January 14, 2015 1:11 AM, Mikhail Khludnev

How to do fuzzy search on phrases

2015-01-14 Thread madhav bahuguna
HI Iam using solr 4.7 and solr php client. So heres the issue ,i have data indexed in solr eg mist spa I have applied fuzzy to my search and If i search myst or mysty i get the correct result i get mist spa in result. But if i write mysty spa or must spa i do not get ant results.Iam not able

Re: Tokenizer or Filter ?

2015-01-14 Thread Jack Krupansky
It should replace all occurrences of the pattern. Post your specific filter XML. Patterns can be very tricky. Use the Solr Admin UI analysis page to see how the filtering is occurring. -- Jack Krupansky On Wed, Jan 14, 2015 at 7:16 AM, tomas.kalas kala...@email.cz wrote: Jack, thanks for

Re: Solr grouping problem - need help

2015-01-14 Thread Naresh Yadav
just wanted to share schema and results for same : solr version : 4.6.1 Schema : http://www.imagesup.net/?di=10142124357616 Code :http://www.imagesup.net/?di=10142124381116 Response Group : http://www.imagesup.net/?di=1114212438351 Response Terms : http://www.imagesup.net/?di=614212438580

Distributed search across Solr cores in a collection - NPE

2015-01-14 Thread Jaikit Savla
Folks, I have set up 3 cores in a single collection and they all have same schema but different index. I have set unique Id required field to false.field name=id type=string indexed=true stored=true required=false/ When I run query against single core, it works fine. But when I add the shard

Re: Solr grouping problem - need help

2015-01-14 Thread Naresh Yadav
Thanks much, now i got better idea on stored indexed works internally in solr. From this i tried and modified few things to understand grouping logic. *Schema :* field name=tenant_pool type=text stored=true *indexed=true*/ field name=type type=text stored=true indexed=true/ *Code :* SolrQuery q

Re: Solr grouping problem - need help

2015-01-14 Thread Norgorn
Can u get raw SOLR response? For me grouping works exactly the way u expect it to work. Try direct query in browser to be sure the problem is not in your code. http://192.168.0.1:8983/solr/collection1/select?q=*:*group=truegroup.field=tenant_pool -- View this message in context:

Re: Tokenizer or Filter ?

2015-01-14 Thread tomas.kalas
I just used Solr UI Analyzer for my test, or must i indexed it firstly? I used this XML code in my schema: fieldType name=direction1 class=solr.TextField positionIncrementGap=100 analyzer charFilter class=solr.PatternReplaceCharFilterFactory

Re: Tokenizer or Filter ?

2015-01-14 Thread Jack Krupansky
I was suspecting it might do that - the pattern is greedy and takes the longest matching pattern. Add a question mark after the asterisk to use stingy mode that matches the shortest pattern. -- Jack Krupansky On Wed, Jan 14, 2015 at 8:37 AM, tomas.kalas kala...@email.cz wrote: I just used Solr

Re: Distributed search across Solr cores in a collection - NPE

2015-01-14 Thread Mikhail Khludnev
Jaikit, uniq key is mandatory for distributed search. if most of your docs have ids assigned, you can drop remaining ones by adding something like ..fq=id:[* TO *] On Wed, Jan 14, 2015 at 12:53 PM, Jaikit Savla jaikit.sa...@yahoo.com.invalid wrote: Folks, I have set up 3 cores in a single

Re: Solr grouping problem - need help

2015-01-14 Thread Naresh Yadav
I tried what you said also appended group.ngroups=true and got same result not expected onengroups coming is 1. i am on solr-4.6.1 single machine default setup. On Wed, Jan 14, 2015 at 4:43 PM, Norgorn lsunnyd...@mail.ru wrote: Can u get raw SOLR response? For me grouping works exactly

Re: How to do fuzzy search on phrases

2015-01-14 Thread Tomoko Uchida
Hi, I suspect you are likely to misunderstand fuzzy search. You should append ~N to end of each query term, not whole query string / phrase. (You can debug your query and get useful information by specifying debugQuery=true parameter, try it if you have not.) At first glance, I guess Complex

Re: Tokenizer or Filter ?

2015-01-14 Thread tomas.kalas
Jack, thanks for help, but if i used PatternReplaceCharFilterFactory for example for this : d1text d1/d1d2text d2/d2d1text d1/d1d2text 2 ok/d2 then at output i only get segment d2text 2 ok/d2 when is d2 text d2/d2 between marks d1 ./d1.d2.../d2 d1.../d1so the filter probably takes only first

Re: Occasionally getting error in solr suggester component.

2015-01-14 Thread Dhanesh Radhakrishnan
Hi, Thanks for the reply. As you mentioned in the previous mail I changed buildOnCommit=false in solrConfig. After that change, suggestions are not working. In Solr 4.7 introduced a new approach based on a dedicated SuggestComponent I'm using that component to build suggestions and lookup

Re: Tokenizer or Filter ?

2015-01-14 Thread tomas.kalas
Oh yeah, that is it. Thank you very much for your patience. And a last question at the end what type regEx Solr actually using ? POSIX or PCRE ? Thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/Tokenizer-or-Filter-tp4178346p4179505.html Sent from the Solr - User

RE: How to do fuzzy search on phrases

2015-01-14 Thread Adrien RUFFIE
Tomokoさん、おはようございます。 Can I use ComplexPhraseQueryParser with Core Solr 3.3.0 ? どうもありがとうございます。 お邪魔しました。 宜しくお願いします。 Bien cordialement, ルフフィエ アドリエン Adrien Ruffié LD : +33 1 73 03 26 40 Tél : +33 1 73 03 29 80 E-DEAL Innover la Relation Client -Message d'origine- De : Tomoko Uchida