Re: performance crossover between single index and sharding

2011-08-03 Thread Dmitry Kan
scaling. Regards, Dmitry Kan twitter.com/DmitryKan On Wed, Aug 3, 2011 at 12:24 PM, Bernd Fehling bernd.fehl...@uni-bielefeld.de wrote: On 02.08.2011 21:00, Shawn Heisey wrote: ... I did try some early tests with a single large index. Performance was pretty decent once it got warmed up

Re: Need help indexing/querying a particular type of hierarchy

2011-08-11 Thread Dmitry Kan
PathHierarchyTokenizerFactory is quite the answer either. What kind of Solr magic, if any, am I looking for here? Thanks in advance for any help or advice. Michael --- Michael B. Klein Digitization Workflow Engineer Stanford University Libraries -- Regards, Dmitry Kan

solr 1.4 facet.limit behaviour in merging from several shards

2011-09-02 Thread Dmitry Kan
the merging SOLR combine the results from shard, when they exceed the facet.limit? Please ask questions, if something isn't clear or you need more details. Thanks, Dmitry Kan

solr 1.4 highlighting issue

2011-09-14 Thread Dmitry Kan
(semisubmersibles and drillships) are 21 deepwater lt;emdrillinglt;/em /str /arr /lst Why did solr highlight drilling even though there is no ships in the text? * *-- Regards, Dmitry Kan

Re: solr 1.4 facet.limit behaviour in merging from several shards

2011-09-14 Thread Dmitry Kan
), the results you are getting, and the results you are expecting? actually providing the response xml is very helpful. (change the fl to hide any fields you consider sensitive) -Hoss -- Regards, Dmitry Kan

Re: Out of memory

2011-09-14 Thread Dmitry Kan
Hi Rohit, Do you use caching? How big is your index in size on the disk? What is the stack trace contents? The OOM problems that we have seen so far were related to the index physical size and usage of caching. I don't think we have ever found the exact cause of these problems, but sharding has

Re: Out of memory

2011-09-14 Thread Dmitry Kan
: Dmitry Kan [mailto:dmitry@gmail.com] Sent: 14 September 2011 08:15 To: solr-user@lucene.apache.org Subject: Re: Out of memory Hi Rohit, Do you use caching? How big is your index in size on the disk? What is the stack trace contents? The OOM problems that we have seen so far were related

Re: solr 1.4 highlighting issue

2011-09-15 Thread Dmitry Kan
, 2011 at 2:20 PM, Koji Sekiguchi k...@r.email.ne.jp wrote: (11/09/14 15:54), Dmitry Kan wrote: Hello list, Not sure how many of you are still using solr 1.4 in production, but here is an issue with highlighting, that we've noticed: The query is: (drill AND ships) OR rigs Excerpt from

Re: solr 1.4 highlighting issue

2011-09-15 Thread Dmitry Kan
document should satisfy the query (ie it probably has ships/s somewhere else in it), but each snippet won't generally have all the terms. -Mike On 9/14/2011 2:54 AM, Dmitry Kan wrote: Hello list, Not sure how many of you are still using solr 1.4 in production, but here is an issue

Re: Out of memory

2011-09-15 Thread Dmitry Kan
again Rohit -Original Message- From: Dmitry Kan [mailto:dmitry@gmail.com] Sent: 14 September 2011 10:23 To: solr-user@lucene.apache.org Subject: Re: Out of memory Hi, OK 64GB fits into one shard quite nicely in our setup. But I have never used multicore setup. In total you

Re: Out of memory

2011-09-15 Thread Dmitry Kan
...@in-rev.com wrote: It's happening more in search and search has become very slow particularly on the core with 69GB index data. Regards, Rohit -Original Message- From: Dmitry Kan [mailto:dmitry@gmail.com] Sent: 15 September 2011 07:51 To: solr-user@lucene.apache.org Subject

Re: solr 1.4 facet.limit behaviour in merging from several shards

2011-09-23 Thread Dmitry Kan
about the X so that we can understand the : full issue. Perhaps the best solution doesn't involve Y at all? : See Also: http://www.perlmonks.org/index.pl?node_id=542341 -Hoss -- Regards, Dmitry Kan

wild card search and lower-casing

2011-11-18 Thread Dmitry Kan
, Dmitry Kan

Re: wild card search and lower-casing

2011-11-18 Thread Dmitry Kan
://wiki.apache.org/lucene-java/LuceneFAQ#Are_Wildcard.2C_Prefix.2C_and_Fuzzy_queries_case_sensitive.3F -- Regards, Dmitry Kan

Re: wild card search and lower-casing

2011-11-18 Thread Dmitry Kan
. -- Regards, Dmitry Kan

Re: wild card search and lower-casing

2011-11-18 Thread Dmitry Kan
. For clarification see source code of SolrQueryParser. -- Regards, Dmitry Kan

Re: wild card search and lower-casing

2011-11-20 Thread Dmitry Kan
Thanks Erick. Do you think the patch you are working on will be applicable as well to 3.4? Best, Dmitry On Mon, Nov 21, 2011 at 5:06 AM, Erick Erickson erickerick...@gmail.comwrote: As it happens I'm working on SOLR-2438 which should address this. This patch will provide two things: The

Re: wild card search and lower-casing

2011-11-22 Thread Dmitry Kan
. You'll have to change it after applying the patch for this to work for you. Should be trivial, I'll leave a note in the code about this, look for SOLR-2438 in the 3x code line for the place to change. On Mon, Nov 21, 2011 at 2:14 AM, Dmitry Kan dmitry@gmail.com wrote: Thanks Erick. Do

Re: wild card search and lower-casing

2011-11-22 Thread Dmitry Kan
if that worked for you. But you can't do any of this until the JIRA (SOLR-2438) is marked Resolution: Fixed. Don't be fooled by Fix Version. Fix Version simply says that those are the earliest versions it *could* go in. Best Erick Best Erick On Tue, Nov 22, 2011 at 6:32 AM, Dmitry Kan dmitry

Re: Search on multiple fields is not working

2011-11-23 Thread Dmitry Kan
am doing wrong? Regards, Siva -- View this message in context: http://lucene.472066.n3.nabble.com/Search-on-multiple-fields-is-not-working-tp3530145p3530145.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

Re: wild card search and lower-casing

2011-11-23 Thread Dmitry Kan
Erick On Tue, Nov 22, 2011 at 10:46 AM, Dmitry Kan dmitry@gmail.com wrote: Thanks, Erick. I was in fact reading the patch (the one attached as a file to the aforementioned jira) you updated sometime yesterday. I'll watch the issue, but as said the change of a hard-coded boolean to its

Re: Huge Performance: Solr distributed search

2011-11-23 Thread Dmitry Kan
Hello, Is this log from the frontend SOLR (aggregator) or from a shard? Can you merge, e.g. 3 shards together or is it much effort for your team? In our setup we currently have 16 shards with ~30GB each, but we rarely search in all of them at once. Best, Dmitry On Wed, Nov 23, 2011 at 3:12 PM,

Re: Huge Performance: Solr distributed search

2011-11-23 Thread Dmitry Kan
aggregator Can you merge, e.g. 3 shards together or is it much effort for your team? Yes, we can merge. We'll try to do this and review how it will works Thanks, Dmitry Any another ideas? On Wed, Nov 23, 2011 at 4:01 PM, Dmitry Kan dmitry@gmail.com wrote: Hello, Is this log from

Re: Huge Performance: Solr distributed search

2011-11-25 Thread Dmitry Kan
this and review how it will works Thanks, Dmitry Any another ideas? On Wed, Nov 23, 2011 at 4:01 PM, Dmitry Kan dmitry@gmail.com wrote: Hello, Is this log from the frontend SOLR (aggregator) or from a shard? Can you merge, e.g. 3 shards together or is it much effort for your team

cache monitoring tools?

2011-12-06 Thread Dmitry Kan
/ /analyzer /fieldType -- Regards, Dmitry Kan

Re: Solr request handler queries in fiddler

2011-12-06 Thread Dmitry Kan
out with this. Thank u in advance -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-request-handler-queries-in-fiddler-tp3564260p3564260.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

Re: cache monitoring tools?

2011-12-07 Thread Dmitry Kan
, increase the maxsize value to your acceptable limit. Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/cache-monitoring-tools-tp3566645p3566811.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

Re: Solr request handler queries in fiddler

2011-12-07 Thread Dmitry Kan
On Wed, Dec 7, 2011 at 12:54 PM, Dmitry Kan [via Lucene] ml-node+s472066n351...@n3.nabble.com wrote: If you mean debugging the queries, you can use eclipse+jetty plugin setup ( http://code.google.com/p/run-jetty-run/) with solr web app ( http://hokiesuns.blogspot.com/2010/01/setting-up

Re: cache monitoring tools?

2011-12-07 Thread Dmitry Kan
tool. See http://wiki.apache.org/solr/SolrJmx On Wed, Dec 7, 2011 at 6:19 AM, Dmitry Kan dmitry@gmail.com wrote: Yes, we do require that much. Ok, thanks, I will try increasing the maxsize. On Wed, Dec 7, 2011 at 10:56 AM, pravesh suyalprav...@yahoo.com wrote: facet.limit

Re: cache monitoring tools?

2011-12-07 Thread Dmitry Kan
The culprit seems to be the merger (frontend) SOLR. Talking to one shard directly takes substantially less time (1-2 sec). On Wed, Dec 7, 2011 at 4:10 PM, Dmitry Kan dmitry@gmail.com wrote: Tomás: thanks. The page you gave didn't mention cache specifically, is there more documentation

Re: cache monitoring tools?

2011-12-07 Thread Dmitry Kan
-and-lucidworks-with-zabbix/ On Wed, Dec 7, 2011 at 11:49 AM, Dmitry Kan dmitry@gmail.com wrote: The culprit seems to be the merger (frontend) SOLR. Talking to one shard directly takes substantially less time (1-2 sec). On Wed, Dec 7, 2011 at 4:10 PM, Dmitry Kan dmitry@gmail.com wrote

Re: cache monitoring tools?

2011-12-12 Thread Dmitry Kan
, multipled by some constant (i think it's 2 but it might be higher) in order to over request facet constriant counts from each shard to aggregate them. the dominant factor in the slow speed you are seeing is most likeley Network IO between the shards. -Hoss -- Regards, Dmitry Kan

Re: cache monitoring tools?

2011-12-12 Thread Dmitry Kan
to show graphs of loads, e.g. cache counts or CPU load, in parallel to a console log or to an http request log?? I am working on such a tool currently but I have a bad feeling of reinventing the wheel. thanks in advance Paul Le 8 déc. 2011 à 08:53, Dmitry Kan a écrit : Otis, Tomás: thanks

Re: cache monitoring tools?

2011-12-12 Thread Dmitry Kan
to a console log or to an http request log?? I am working on such a tool currently but I have a bad feeling of reinventing the wheel. thanks in advance Paul Le 8 déc. 2011 à 08:53, Dmitry Kan a écrit : Otis, Tomás: thanks for the great links! 2011/12/7 Tomás Fernández Löbbe

Re: Virtual Memory very high

2011-12-13 Thread Dmitry Kan
If you allow me to chime in, is there a way to check for which DirectoryFactory is in use, if ${solr.directoryFactory:solr.StandardDirectoryFactory} has been configured? Dmitry 2011/12/12 Yury Kats yuryk...@yahoo.com On 12/11/2011 4:57 AM, Rohit wrote: What are the difference in the

Re: NumericRangeQuery: what am I doing wrong?

2011-12-14 Thread Dmitry Kan
Maybe you should index your values differently? Here is what Lucene's 2.9 javadoc says: To use this, you must first index the numeric values using NumericFieldhttp://lucene.apache.org/java/2_9_0/api/all/org/apache/lucene/document/NumericField.html(expert:

Re: cache monitoring tools?

2011-12-14 Thread Dmitry Kan
indexes that handle the queries, while our master handles updates/commits. Justin Dmitry Kan dmitry@gmail.com writes: Justin, in terms of the overhead, have you noticed if Munin puts much of it when used in production? In terms of the solr farm: how big is a shard's index (given

Re: disable stemming on query parser.

2011-12-16 Thread Dmitry Kan
You can disable stemming in a copy field. So you need to define one field with your input data on which stemming will be done and the other field (copy field), on which stemming will not be done. Then on the client you can decide which field to search against. Dmitry On Fri, Dec 16, 2011 at 2:00

Re: disable stemming on query parser.

2011-12-19 Thread Dmitry Kan
the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

Re: cache monitoring tools?

2011-12-19 Thread Dmitry Kan
-monitoring.org/plugins/search?keyword=solr Dmitry Kan dmitry@gmail.com writes: Thanks, Justin. With zabbix I can gather jmx exposed stats from SOLR, how about munin, what protocol / way it uses to accumulate stats? It wasn't obvious from their online documentation... On Mon, Dec 12

a question on jmx solr exposure

2011-12-21 Thread Dmitry Kan
going to see under solr/ ? From the numbers (e.g. numDocs of searcher), jconsole see the stats of A. Where do stats of B go? Or is firstly activated core will capture the jmx pipe and won't let B's stats to go through? -- Regards, Dmitry Kan

Re: a question on jmx solr exposure

2011-12-21 Thread Dmitry Kan
Solved by exposing jmx only on one of the cores, as it is of a more interest than the other one. Dmitry On Wed, Dec 21, 2011 at 11:56 AM, Dmitry Kan dmitry@gmail.com wrote: Hello list, This might be not the right place to ask the jmx specific questions, but I decided to try, as we

faceting over ngrams

2011-03-16 Thread Dmitry Kan
on a cloud with Hadoop make it faster (to seconds) as faceting seems to be a natural map-reduce task? Are there any other options to look into before stepping into the cloud? Please let me know, if you need specific details on the schema / solrconfig setup or the like. -- Regards, Dmitry Kan

Re: faceting over ngrams

2011-03-16 Thread Dmitry Kan
. Jonathan On 3/16/2011 11:51 AM, Toke Eskildsen wrote: On Wed, 2011-03-16 at 13:05 +0100, Dmitry Kan wrote: Hello guys. We are using shard'ed solr 1.4 for heavy faceted search over the trigrams field with about 1 million of entries in the result set and more than 100 million of entries

Re: faceting over ngrams

2011-03-16 Thread Dmitry Kan
much worry for you? I'll check out the direct query and let you know. On Wed, Mar 16, 2011 at 5:51 PM, Toke Eskildsen t...@statsbiblioteket.dkwrote: On Wed, 2011-03-16 at 13:05 +0100, Dmitry Kan wrote: Hello guys. We are using shard'ed solr 1.4 for heavy faceted search over the trigrams

Re: faceting over ngrams

2011-03-16 Thread Dmitry Kan
AM, Dmitry Kan dmitry@gmail.com wrote: Hello guys. We are using shard'ed solr 1.4 for heavy faceted search over the trigrams field with about 1 million of entries in the result set and more than 100 million of entries to facet on in the index. Currently the faceted search is very slow

solr on the cloud

2011-03-22 Thread Dmitry Kan
/solr/SolrCloud ? Basically, of high interest is checking out the Map-Reduce for distributed faceting, is it even possible with the trunk? -- Regards, Dmitry Kan

Re: solr on the cloud

2011-03-25 Thread Dmitry Kan
with the trunk? Hm, MR for distributed faceting? Maybe I missed this... can you point to a place that mentions this? Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ -- Regards, Dmitry Kan

Re: solr on the cloud

2011-03-25 Thread Dmitry Kan
...@lucidimagination.comwrote: On Tue, Mar 22, 2011 at 7:51 AM, Dmitry Kan dmitry@gmail.com wrote: Basically, of high interest is checking out the Map-Reduce for distributed faceting, is it even possible with the trunk? Solr already has distributed faceting, and it's much more performant than a map

Re: solr on the cloud

2011-03-25 Thread Dmitry Kan
you mean when you refer to distribution of facet requests across hosts?) On Fri, Mar 25, 2011 at 1:57 PM, Upayavira u...@odoko.co.uk wrote: On Fri, 25 Mar 2011 13:44 +0200, Dmitry Kan dmitry@gmail.com wrote: Hi Yonik, Oh, this is great. Is distributed faceting available in the trunk

Re: Create 2 index with solr

2011-03-25 Thread Dmitry Kan
and generate 2 index. So I don't know how I could generate 2 different index? Thank you for your help. Amel. -- Regards, Dmitry Kan

Re: solr on the cloud

2011-03-25 Thread Dmitry Kan
- Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Dmitry Kan dmitry@gmail.com To: solr-user@lucene.apache.org Cc: Upayavira u...@odoko.co.uk Sent: Fri, March 25, 2011 8:26:33 AM Subject: Re: solr on the cloud Hi, Upayavira Probably

Re: solr on the cloud

2011-03-26 Thread Dmitry Kan
for reading the index [randomly] out of HDFS. Benchmarking'll be implemented next. It's not production ready, suggestions are welcome. Jason On Fri, Mar 25, 2011 at 2:03 PM, Dmitry Kan dmitry@gmail.com wrote: Hi Otis, Thanks for elaborating on this and the link (funny!). I have

Re: Form too large ...

2011-03-30 Thread Dmitry Kan
-tp2752676p2752676.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

stemming for English

2011-05-03 Thread Dmitry Kan
such situations, is going all the way to modifying the Porter stemming source code the best choice? -- Regards, Dmitry Kan

Re: stemming for English

2011-05-03 Thread Dmitry Kan
to use stemming on both side (index and query) if you really want to use stemming. Ludovic 2011/5/3 Dmitry Kan [via Lucene] ml-node+2893599-894006307-383...@n3.nabble.com Dear list, In SOLR schema on the index side we use no stemming to support favor wildcard search. On the query

Re: stemming for English

2011-05-03 Thread Dmitry Kan
and consistent wildcard support in the same field. To me, you have to create 2 different fields. Ludovic. 2011/5/3 Dmitry Kan [via Lucene] ml-node+2893628-993677979-383...@n3.nabble.com Hi Ludovic, That's an option we had before we decided to go for a full-blown support of wildcards

Re: stemming for English

2011-05-04 Thread Dmitry Kan
as the first does *not* double your index size at all. -Original Message- From: Dmitry Kan [mailto:dmitry@gmail.com] Sent: Tuesday, May 03, 2011 4:06 AM To: solr-user@lucene.apache.org Subject: Re: stemming for English Yes, Ludovic. Thus effectively we get index doubled. Given

Re: Want to Delete Existing Index create fresh index

2011-05-14 Thread Dmitry Kan
Hi Pawan, Which SOLR version do you have installed? It should be absolutely normal for the data/ sub directory to create when starting up SOLR. So just go ahead and post your data into SOLR, if you have changed the schema already. -- Regards, Dmitry Kan On Sat, May 14, 2011 at 4:01 PM

boolean versus non-boolean search

2011-05-16 Thread Dmitry Kan
as the original query and same debug output. Why is that? -- Regards, Dmitry Kan

Re: boolean versus non-boolean search

2011-05-16 Thread Dmitry Kan
to the one you specifically did add the AND in. I guess that it basically groups left-to-right. So: term1 AND term2 OR X OR Y is the same as: term1 AND (term2 OR (X OR Y)) But I guess you already figured this all out, yeah? On 5/16/2011 9:24 AM, Dmitry Kan wrote: Dear list, Might have

sorting on date field in facet query

2011-05-18 Thread Dmitry Kan
Hello list, Is it possible to sort on date field in a facet query in SOLR 3.1? -- Regards, Dmitry Kan

Re: sorting on date field in facet query

2011-05-19 Thread Dmitry Kan
, Erick Erickson erickerick...@gmail.comwrote: Can you provide an example of what you are trying to do? Are you referring to ordering the result set or the facet information? Best Erick On Wed, May 18, 2011 at 7:21 AM, Dmitry Kan dmitry@gmail.com wrote: Hello list, Is it possible

Re: sorting on date field in facet query

2011-05-19 Thread Dmitry Kan
, that would/could not really work. Perhaps we have to ask another Question .. what are you trying to achieve? Boost by Date? Regards Stefan On Thu, May 19, 2011 at 2:24 PM, Dmitry Kan dmitry@gmail.com wrote: Hi Erick, It is about ordering the facet information. The result set is empty via

Re: sorting on date field in facet query

2011-05-19 Thread Dmitry Kan
the top of my head here Erick On Thu, May 19, 2011 at 9:46 AM, Dmitry Kan dmitry@gmail.com wrote: Hi, Thanks for the questions, guys, and sorry for the confusion. I should start with a broader picture of what we are trying to achieve. The only problem is that I cannot speak

Re: sorting on date field in facet query

2011-05-19 Thread Dmitry Kan
in context: http://lucene.472066.n3.nabble.com/sorting-on-date-field-in-facet-query-tp2956540p2961612.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

query parser and other query filters

2011-05-23 Thread Dmitry Kan
way to tell SOLR to execute query parser the last after all other filters have processes the query? -- Regards, Dmitry Kan

Re: Termscomponent sort question

2011-05-25 Thread Dmitry Kan
-sort-question-tp2980683p2980683.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

Re: What is omitNorms

2011-05-26 Thread Dmitry Kan
archive at Nabble.com. -- Regards, Dmitry Kan

Re: Termscomponent sort question

2011-05-26 Thread Dmitry Kan
PM, antonio antonio...@email.it wrote: Hi Dmitry Kan, thanks for your anwser. This is an idea, but i think that will be not so performing. Because if the terms are 1000, i must reorder 1000 terms by own length, and i think the time will be high for make autocomplete. Don't you think

query routing with shards

2011-06-02 Thread Dmitry Kan
routing of queries to the shards they map to. Can you please give some pointers as to what would be an optimal way to achieve such a routing inside the front end solr? Is there a way to configure mapping inside the solrconfig? Thanks. -- Regards, Dmitry Kan

Re: query routing with shards

2011-06-03 Thread Dmitry Kan
- Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Dmitry Kan dmitry@gmail.com To: solr-user@lucene.apache.org; yo...@lucidimagination.com Sent: Thu, June 2, 2011 7:00:53 AM Subject: query routing with shards Hello all, We

Re: query routing with shards

2011-06-03 Thread Dmitry Kan
the request, it has the correct shard in there already. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Dmitry Kan dmitry@gmail.com To: solr-user@lucene.apache.org Sent: Fri, June 3

Re: query routing with shards

2011-06-03 Thread Dmitry Kan
-lucene.com/ - Original Message From: Dmitry Kan dmitry@gmail.com To: solr-user@lucene.apache.org Sent: Fri, June 3, 2011 12:56:15 PM Subject: Re: query routing with shards Hi Otis, Thanks! This sounds promising. This custom implementation, will it hurt in any

Re: Hitting the URI limit, how to get around this?

2011-06-03 Thread Dmitry Kan
. -- Ken Krugler +1 530-210-6378 http://bixolabs.com custom data mining solutions -- Regards, Dmitry Kan

huge shards (300GB each) and load balancing

2011-06-08 Thread Dmitry Kan
. -- Regards, Dmitry Kan

Re: huge shards (300GB each) and load balancing

2011-06-08 Thread Dmitry Kan
experience / pointers to load balancer software that was tried with SOLR? Dmitry On Wed, Jun 8, 2011 at 12:32 PM, Upayavira u...@odoko.co.uk wrote: On Wed, 08 Jun 2011 10:42 +0300, Dmitry Kan dmitry@gmail.com wrote: Hello list, Thanks for attending to my previous questions so far

Re: huge shards (300GB each) and load balancing

2011-06-08 Thread Dmitry Kan
authentication. Nginx and others can also do load balancing. Bill Bell Sent from mobile On Jun 8, 2011, at 3:32 AM, Upayavira u...@odoko.co.uk wrote: On Wed, 08 Jun 2011 10:42 +0300, Dmitry Kan dmitry@gmail.com wrote: Hello list, Thanks for attending to my previous questions so far

Re: huge shards (300GB each) and load balancing

2011-06-14 Thread Dmitry Kan
indexing so instead changed the index time parameter termIndexInterval from 128 to 1024. (More details here: http://www.hathitrust.org/blogs/large-scale-search) Tom Burton-West -- Regards, Dmitry Kan

Re: query parsing - removes a term

2011-06-14 Thread Dmitry Kan
is only sending in the +text:(wasp) on parsing, and doesn't use the was value. Why is it removing one of the terms? Thanks! Andrea -- Regards, Dmitry Kan

Re: query routing with shards

2011-06-16 Thread Dmitry Kan
param. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Dmitry Kan dmitry@gmail.com To: solr-user@lucene.apache.org Sent: Fri, June 3, 2011 12:56:15 PM Subject: Re: query

Re: Showing facet of first N docs

2011-06-16 Thread Dmitry Kan
-- Regards, Dmitry Kan

Re: query routing with shards

2011-06-16 Thread Dmitry Kan
Hi Otis, I have fixed it by assigning the value to rb same as assigned to sreq: rb.shards = shards.toString().split(,); not tested that fully yet, but distributed faceting works at least on my pc _3 shards 1 router_ setup. Dmitry On Thu, Jun 16, 2011 at 4:53 PM, Dmitry Kan dmitry

Re: Solr and Tag Cloud

2011-06-18 Thread Dmitry Kan
sequence (ngram) length. On Sat, Jun 18, 2011 at 7:12 AM, Jamie Johnson jej2...@gmail.com wrote: Does anyone have details of how to generate a tag cloud of popular terms across an entire data set and then also across a query? -- Regards, Dmitry Kan

Re: Showing facet of first N docs

2011-06-18 Thread Dmitry Kan
a 'give me only facets related to the most relevant docs in the results' functionality. Any idea on how to do that? Tommaso 2011/6/16 Dmitry Kan dmitry@gmail.com http://wiki.apache.org/solr/SimpleFacetParameters facet.offset This param indicates an offset into the list of constraints

Re: Garbage Collection: I have given bad advice in the past!

2011-06-24 Thread Dmitry Kan
will make it easier for others to find the right solution for themselves. Thanks, Shawn -- Regards, Dmitry Kan

Re: testing subscription.

2011-06-24 Thread Dmitry Kan
passed On Thu, Jun 23, 2011 at 10:38 PM, Esteban Donato esteban.don...@gmail.comwrote: -- Regards, Dmitry Kan

Re: Custom Query Processing

2011-06-28 Thread Dmitry Kan
to it on the Solr end. Our previous implementation used a QueryWrapperFilter along with some custom code to build a new Filter from the query provided. How can we plug this filter into Solr? -- Regards, Dmitry Kan

Re: Unique document count from index?

2011-06-28 Thread Dmitry Kan
this message and its attachments, along with any copies thereof. This message does not create any contractual obligation on behalf of the sender or Law Bulletin Publishing Company. Thank you. -- Regards, Dmitry Kan

Re: how to improve query result time.

2011-07-04 Thread Dmitry Kan
=English protected=protwords.txt / /analyzer /fieldType Thanks in advance -- View this message in context: http://lucene.472066.n3.nabble.com/how-to-improve-query-result-time-tp3136554p3136554.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

solr.StandardTokenizerFactory: more info needed

2011-07-06 Thread Dmitry Kan
Hi all! solr.StandardTokenizerFactory -- is it possible to see the full description of its behaviour for solr.1.4 somewhere? Wiki http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.StandardTokenizerFactory is very short.. -- Regards, Dmitry Kan

Re: solr.StandardTokenizerFactory: more info needed

2011-07-06 Thread Dmitry Kan
, The underlying Lucene implementation is here: http://svn.apache.org/repos/asf/lucene/java/tags/lucene_2_9_1/src/java/org/apache/lucene/analysis/standard/ StandardTokenizerImpl.jflex is probably where you should start. Steve -Original Message- From: Dmitry Kan [mailto:dmitry

Re: solr.StandardTokenizerFactory: more info needed

2011-07-06 Thread Dmitry Kan
-Original Message- From: Dmitry Kan [mailto:dmitry@gmail.com] Sent: Wednesday, July 06, 2011 9:21 AM To: solr-user@lucene.apache.org Subject: Re: solr.StandardTokenizerFactory: more info needed Hi Steven, This looks very good. Thanks. Do I understand correctly, that I were

Re: solr.StandardTokenizerFactory: more info needed

2011-07-06 Thread Dmitry Kan
ones of your own to exercise your changes to insure that they do what you want Best Erick On Wed, Jul 6, 2011 at 9:31 AM, Dmitry Kan dmitry@gmail.com wrote: OK, thanks. Do you know if there are tokenizer specific tests to run after compilation? On Wed, Jul 6, 2011 at 4:25 PM

Re: sorting on date field in facet query

2011-07-08 Thread Dmitry Kan
.n3.nabble.com/sorting-on-date-field-in-facet-query-tp2956540p2961612.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

Re: Query Rewrite

2011-07-09 Thread Dmitry Kan
a completely blank slate so any guidance would be appreciated. -- Regards, Dmitry Kan

Re: Query Rewrite

2011-07-10 Thread Dmitry Kan
was typying this on-the-go from my phone, I meant LuceneQParserPlugin of course. On Sat, Jul 9, 2011 at 6:39 PM, Dmitry Kan dmitry@gmail.com wrote: you can try extending LuceneQParser. In its createParser method (lucene 2.9.3 and solr 1.4) you can analyze the input query in the param q

Re: any detailed tutorials on plugin development?

2011-07-19 Thread Dmitry Kan
... - Zeki ama calismiyor... Calissa yapar... -- View this message in context: http://lucene.472066.n3.nabble.com/any-detailed-tutorials-on-plugin-development-tp3177821p3177821.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

Re: any detailed tutorials on plugin development?

2011-07-21 Thread Dmitry Kan
.n3.nabble.com/any-detailed-tutorials-on-plugin-development-tp3177821p3184160.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

Re: Wildcard-Search Solr 3.5.0

2012-05-23 Thread Dmitry Kan
://wiki.apache.org/solr/MultitermQueryAnalysis Well, it works in 3.6. With one exception: If I use german umlauts it does not work anymore. Text: Bär Bä* - no hits Bär - hits What can I do in this case? Thank you -- Regards, Dmitry Kan

<    1   2   3   4   5   6   >