Re: DateMathParser question

2014-10-10 Thread Jack Krupansky
Sounds reasonable. File a Jira! -- Jack Krupansky -Original Message- From: Jamie Johnson Sent: Friday, October 10, 2014 11:45 AM To: solr-user@lucene.apache.org Subject: DateMathParser question I have found that DateMathParser is extremely useful in providing nice labels back

Re: does one need to reindex when changing similarity class

2014-10-09 Thread Jack Krupansky
The similarity class is only invoked at query time, so it doesn't participate in indexing. -- Jack Krupansky -Original Message- From: Markus Jelsma Sent: Thursday, October 9, 2014 6:59 AM To: solr-user@lucene.apache.org Subject: RE: does one need to reindex when changing similarity

Re: WhitespaceTokenizer to consider incorrectly encoded c2a0?

2014-10-08 Thread Jack Krupansky
white space as white space here. And update the Lucene Javadoc contract to be more explicit. -- Jack Krupansky -Original Message- From: Markus Jelsma Sent: Wednesday, October 8, 2014 10:16 AM To: solr-user@lucene.apache.org ; solr-user Subject: RE: WhitespaceTokenizer to consider

Re: eDisMax parser and special characters

2014-10-08 Thread Jack Krupansky
quoted, and then analyzed to nothing for text fields but is still a string for string fields. -- Jack Krupansky -Original Message- From: Lanke,Aniruddha Sent: Wednesday, October 8, 2014 4:38 PM To: solr-user@lucene.apache.org Subject: Re: eDisMax parser and special characters Sorry

Re: Edismax parser and boosts

2014-10-08 Thread Jack Krupansky
Definitely sounds like a bug! File a Jira. Thanks for reporting this. What release of Solr? -- Jack Krupansky -Original Message- From: Pawel Rog Sent: Wednesday, October 8, 2014 3:57 PM To: solr-user@lucene.apache.org Subject: Edismax parser and boosts Hi, I use edismax query

Re: Best way to index wordpress blogs in solr

2014-10-08 Thread Jack Krupansky
The LucidWorks product has builtin crawler support so you could crawl one or more web sites. http://lucidworks.com/product/fusion/ -- Jack Krupansky -Original Message- From: Vishal Sharma Sent: Tuesday, October 7, 2014 2:08 PM To: solr-user@lucene.apache.org Subject: Best way

Re: Advise on an architecture with lot of cores

2014-10-07 Thread Jack Krupansky
separate clusters for larger groups of customers, maybe with a smaller cluster with a collection that maps the customer ID to a Solr cluster, and then the application layer can direct requests to the Solr cluster that owns that customer. -- Jack Krupansky -Original Message- From
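
The routing idea in this reply (a small lookup from customer ID to the cluster that owns that customer) might look roughly like the Python sketch below. The mapping contents, host names, and the helper `select_url` are all hypothetical; in practice the lookup would live in a small Solr collection or another datastore rather than in a dict.

```python
# Hypothetical customer-to-cluster table (assumption, not from the thread).
CUSTOMER_TO_CLUSTER = {
    "acme": "http://solr-cluster-a.example.com:8983/solr",
    "globex": "http://solr-cluster-b.example.com:8983/solr",
}


def select_url(customer_id, collection, mapping=CUSTOMER_TO_CLUSTER):
    """Return the /select URL on the cluster that owns this customer."""
    base = mapping[customer_id]  # raises KeyError for an unknown customer
    return f"{base}/{collection}/select"


print(select_url("acme", "docs"))
```

The application layer then sends each request to the returned URL, so no single cluster has to host every customer.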

Re: dismax query does not match with additional field in qf

2014-10-07 Thread Jack Krupansky
the string term to match, which won't happen since only the full string is indexed. Generally, you need to escape all special characters in a query. Then hopefully your string field will match. -- Jack Krupansky -Original Message- From: Andreas Hubold Sent: Tuesday, September 30, 2014 11
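
As a rough illustration of the escaping advice above, here is a minimal Python sketch. The helper name `escape_query_term` is hypothetical, and the character list is an assumption based on the special characters documented for the Lucene classic query parser; check it against the parser you actually use.

```python
# Characters the Lucene classic query parser treats as syntax (assumed list).
# "&&" and "||" are covered by escaping each "&" and "|" individually.
SPECIAL_CHARS = set('+-&|!(){}[]^"~*?:\\/')


def escape_query_term(term: str) -> str:
    """Backslash-escape every query-syntax character in a raw term."""
    return "".join("\\" + ch if ch in SPECIAL_CHARS else ch for ch in term)


print(escape_query_term("foo:bar(1)"))
```

The escaped term can then be placed after a field name, so that the string field sees the full original value rather than a parsed expression.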

Re: dismax query does not match with additional field in qf

2014-10-07 Thread Jack Krupansky
on the string field, but a tokenized phrase match on the text field, and support partial matches on the text field as a phrase of contiguous terms. -- Jack Krupansky -Original Message- From: Andreas Hubold Sent: Tuesday, October 7, 2014 12:08 PM To: solr-user@lucene.apache.org Subject: Re

Re: Flexible search field analyser/tokenizer configuration

2014-10-04 Thread Jack Krupansky
What exactly do you think that filter query is doing? Explain it in plain English. My guess is that it eliminates all your document matches. -- Jack Krupansky -Original Message- From: PeterKerk Sent: Saturday, October 4, 2014 12:34 AM To: solr-user@lucene.apache.org Subject: Re

Re: Flexible search field analyser/tokenizer configuration

2014-10-04 Thread Jack Krupansky
to apply a boost to all un-fielded terms for a field, you otherwise need to apply any boost on a term, not a field. -- Jack Krupansky -Original Message- From: PeterKerk Sent: Saturday, October 4, 2014 10:43 AM To: solr-user@lucene.apache.org Subject: Re: Flexible search field analyser

Re: Regarding Default Scoring For Solr

2014-10-03 Thread Jack Krupansky
That's a reasonable description for Solr/Lucene scoring, but use the latest release: http://lucene.apache.org/core/4_10_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html -- Jack Krupansky -Original Message- From: mdemarco123 Sent: Thursday, October 2, 2014 6:06 PM

Re: Solr + Federated Search Question

2014-10-03 Thread Jack Krupansky
Yes, either term can be used to confuse people equally well! -- Jack Krupansky -Original Message- From: Alejandro Calbazana Sent: Thursday, October 2, 2014 3:28 PM To: solr-user@lucene.apache.org ; Ahmet Arslan Subject: Re: Solr + Federated Search Question Thanks Ahmet. Yay! New

Re: Wildcard search makes no sense!!

2014-10-01 Thread Jack Krupansky
token gets analyzed into - that's what your wildcard prefix must match. Sometimes (usually!) you will be surprised. -- Jack Krupansky -Original Message- From: Wayne W Sent: Wednesday, October 1, 2014 7:16 AM To: solr-user@lucene.apache.org Subject: Wildcard search makes no sense!! Hi

Re: Adding filter in custom query parser

2014-10-01 Thread Jack Krupansky
Unless you consider yourself to be a Solr expert, it would be best to implement such query translation in an application layer. -- Jack Krupansky -Original Message- From: sagarprasad Sent: Wednesday, October 1, 2014 3:27 AM To: solr-user@lucene.apache.org Subject: Adding filter

Re: Solr + Federated Search Question

2014-10-01 Thread Jack Krupansky
. -- Jack Krupansky -Original Message- From: Ahmet Arslan Sent: Wednesday, October 1, 2014 9:35 AM To: solr-user@lucene.apache.org Subject: Re: Solr + Federated Search Question Hi, Federation is possible. Solr has distributed search support with shards parameter. Ahmet On Wednesday

Re: Search multiple values with wildcards

2014-09-30 Thread Jack Krupansky
The special characters (colon) are treated as term delimiters for text fields. How do you really intend to query this string? You could make it simply a string field. -- Jack Krupansky -Original Message- From: J'roo Sent: Tuesday, September 30, 2014 11:08 AM To: solr-user

Re: Boost Query (bq) syntax/usage

2014-09-30 Thread Jack Krupansky
of the terms rather than any of the three terms. -- Jack Krupansky -Original Message- From: shamik Sent: Tuesday, September 30, 2014 5:38 PM To: solr-user@lucene.apache.org Subject: Boost Query (bq) syntax/usage Hi, I'm little confused with the right syntax of defining boost

Re: Boost Query (bq) syntax/usage

2014-09-30 Thread Jack Krupansky
The + signs in the parsed boost query indicated the terms were ANDed together, but maybe you can use the q.op and mm parameters to change the default operator (I forget!). -- Jack Krupansky -Original Message- From: shamik Sent: Tuesday, September 30, 2014 7:19 PM To: solr-user

Re: Boost Query (bq) syntax/usage

2014-09-30 Thread Jack Krupansky
with dismax and then specify edismax for bq using the localParam notation. -- Jack Krupansky -Original Message- From: Jack Krupansky Sent: Tuesday, September 30, 2014 8:19 PM To: solr-user@lucene.apache.org Subject: Re: Boost Query (bq) syntax/usage The + signs in the parsed boost query

Re: multiple terms order in query - eDismax

2014-09-29 Thread Jack Krupansky
That's called phrase query - selecting documents based on the order of the terms. Just enclose the terms in quotes. -- Jack Krupansky -Original Message- From: Tomer Levi Sent: Monday, September 29, 2014 2:41 AM To: solr-user@lucene.apache.org Subject: RE: multiple terms order

Re: How to query certain fields filtered by a condition

2014-09-29 Thread Jack Krupansky
You can perform boolean operations using parentheses. So you can OR a sequence of sub-queries, and each sub-query can be an AND of the desired search term and the constraining values for other fields. -- Jack Krupansky -Original Message- From: Shamik Bandopadhyay Sent: Monday
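
The OR-of-ANDs shape described here can be sketched in Python as follows. The function name and the field names (`title`, `summary`, `source`) are hypothetical, chosen only to show the structure of the query string.

```python
def build_filtered_query(term, field_constraints):
    """OR together one sub-query per field; each sub-query ANDs the search
    term in that field with the constraint that applies to it."""
    sub_queries = [
        f"({field}:{term} AND {constraint})"
        for field, constraint in field_constraints
    ]
    return " OR ".join(sub_queries)


query = build_filtered_query(
    "solr", [("title", "source:blog"), ("summary", "source:wiki")]
)
print(query)
```

The parentheses keep each AND pair together, so the OR applies between complete sub-queries rather than between individual terms.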

Re: demo app explaining solr features

2014-09-28 Thread Jack Krupansky
And you can also check out the tutorials in any of the Solr books, including my Solr Deep Dive e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product-21203548.html -- Jack Krupansky -Original Message- From: Mikhail Khludnev Sent

Re: multiple terms order in query - eDismax

2014-09-28 Thread Jack Krupansky
pf and ps merely control boosting of documents, not selection of documents. mm controls selection of documents. So, hopefully at least doc3 is returned before doc2. -- Jack Krupansky From: Tomer Levi Sent: Sunday, September 28, 2014 5:39 AM To: solr-user@lucene.apache.org Subject: multiple

Re: java.lang.NumberFormatException: For input string: string;#-6.872515521, 53.28853084

2014-09-27 Thread Jack Krupansky
And how is the schema field declared? Seems like it's a TrieDoubleField, which should be a simple floating point value. You should be using the spatial field types. -- Jack Krupansky -Original Message- From: Erick Erickson Sent: Friday, September 26, 2014 12:20 PM To: solr-user

Re: Scoring with wild cars

2014-09-25 Thread Jack Krupansky
The wildcard query is “constant score” to make it faster, so unfortunately that means there is no score differentiation between the wildcard matches. You can simply add the wildcard prefix as a separate query term and boost it: q=text:carre* text:carre^1.5 -- Jack Krupansky From: Pigeyre
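
A small Python sketch of the same trick, for anyone generating such queries from code. The helper name `wildcard_with_boost` is hypothetical; it only assembles the query string shown in the reply.

```python
def wildcard_with_boost(field, prefix, boost):
    """Pair a constant-score wildcard clause with a boosted exact-term clause
    so that documents containing the bare prefix as a term rank higher."""
    return f"{field}:{prefix}* {field}:{prefix}^{boost}"


print(wildcard_with_boost("text", "carre", 1.5))
```

With the default OR operator the second clause is optional, so it only adds score to documents that contain the exact term.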

Re: Changed behavior in solr 4 ??

2014-09-25 Thread Jack Krupansky
I am not aware of any such feature! That doesn't mean it doesn't exist, but I don't recall seeing it in the Solr source code. -- Jack Krupansky -Original Message- From: Jorge Luis Betancourt Gonzalez Sent: Wednesday, September 24, 2014 1:31 AM To: solr-user@lucene.apache.org Subject

Re: query for space character in text field ...

2014-09-23 Thread Jack Krupansky
Or simply enclose the full term in quotes: q=path:"my path" Which is more properly encoded as: q=path:%22my+path%22 -- Jack Krupansky -Original Message- From: Erick Erickson Sent: Tuesday, September 23, 2014 11:02 PM To: solr-user@lucene.apache.org Subject: Re: query for space
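
The encoded form can be reproduced with Python's standard library, shown here as a minimal sketch. Keeping the colon unescaped via `safe=":"` is a choice made only to match the form quoted in the reply; encoding it as `%3A` would work as well.

```python
from urllib.parse import quote_plus

raw_query = 'path:"my path"'

# quote_plus turns the double quotes into %22 and the space into "+".
encoded = quote_plus(raw_query, safe=":")
print("q=" + encoded)
```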

Re: Changed behavior in solr 4 ??

2014-09-23 Thread Jack Krupansky
<str name="echoParams">explicit</str> <int name="rows">10</int> <str name="df">text</str> </lst> ... -- Jack Krupansky -Original Message- From: Jorge Luis Betancourt Gonzalez Sent: Tuesday, September 23, 2014 11:02 AM To: solr-user@lucene.apache.org Subject: Changed behavior in solr 4 ?? Hi: I’m

Re: How to summarize a String Field ?

2014-09-18 Thread Jack Krupansky
Do a copyField to a numeric field. -- Jack Krupansky -Original Message- From: Erick Erickson Sent: Thursday, September 18, 2014 11:35 AM To: solr-user@lucene.apache.org Subject: Re: How to summarize a String Field ? You cannot do this as far as I know, it must be a numeric field

Re: Solr Exceptions -- immense terms

2014-09-15 Thread Jack Krupansky
You can use an update request processor to filter the input for large values. You could write a script with the stateless script processor which ignores or trims large input values. -- Jack Krupansky -Original Message- From: Christopher Gross Sent: Monday, September 15, 2014 7:58 AM
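
The reply suggests doing the trimming inside Solr with a script update processor. As an alternative sketch only, the same trimming logic could run on the client before documents are sent; this is a swapped-in technique, not the processor itself. The function names are hypothetical, and the 32766-byte figure is the term-length limit that Lucene reports in its "immense term" error.

```python
MAX_TERM_BYTES = 32766  # Lucene's limit on a single indexed term, in UTF-8 bytes


def trim_value(value, max_bytes=MAX_TERM_BYTES):
    """Cut a string so that its UTF-8 encoding fits within max_bytes."""
    encoded = value.encode("utf-8")
    if len(encoded) <= max_bytes:
        return value
    # errors="ignore" drops a multi-byte character cut in half at the boundary.
    return encoded[:max_bytes].decode("utf-8", errors="ignore")


def trim_document(doc, fields, max_bytes=MAX_TERM_BYTES):
    """Return a copy of doc with the named string fields trimmed."""
    return {
        key: trim_value(val, max_bytes)
        if key in fields and isinstance(val, str)
        else val
        for key, val in doc.items()
    }


print(trim_document({"id": "1", "body": "abcdef"}, {"body"}, max_bytes=3))
```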

Re: Solr Exceptions -- immense terms

2014-09-15 Thread Jack Krupansky
page as a string field. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Monday, September 15, 2014 8:39 AM To: solr-user Subject: Re: Solr Exceptions -- immense terms May not need a script for that: http://www.solr-start.com/javadoc/solr-lucene/org/apache/solr

Re: Mongo DB Users

2014-09-15 Thread Jack Krupansky
Waiting for a positive response! -1 -- Jack Krupansky -Original Message- From: Rakesh Varna Sent: Monday, September 15, 2014 10:18 AM To: solr-user@lucene.apache.org Subject: Re: Mongo DB Users Remove Regards, Rakesh Varna On Mon, Sep 15, 2014 at 9:29 AM, Ed Smiley ed.smi

Re: Tricky exact match, unwanted search results

2014-09-14 Thread Jack Krupansky
for string field queries, and that don't use wildcard, fuzzy, or range queries. And maybe also keyword tokenizer text fields that don't have any filters, which might as well be string fields. -- Jack Krupansky -Original Message- From: FiMka Sent: Sunday, September 14, 2014 9:34 AM To: solr

Re: Solr multiple sources configuration

2014-09-09 Thread Jack Krupansky
It is mostly a matter of how you expect to query that data - do you need different queries for different sources, or do you have a common conceptual model that covers all sources with a common set of queries? -- Jack Krupansky -Original Message- From: vineet yadav Sent: Tuesday

Re: Is there any sentence tokenizers in sold 4.9.0?

2014-09-08 Thread Jack Krupansky
Out of curiosity, what would be an example query for your application that would depend on sentence tokenization, as opposed to simple term tokenization? I mean, there are no sentence-based query operators in the Solr query parsers. -- Jack Krupansky -Original Message- From: Sandeep

Re: How to implement multilingual word components fields schema?

2014-09-08 Thread Jack Krupansky
to short queries. Keep in mind that auto-detection for indexing full documents is a different problem than auto-detection for very short queries. -- Jack Krupansky -Original Message- From: Ilia Sretenskii Sent: Sunday, September 7, 2014 10:33 PM To: solr-user@lucene.apache.org Subject: Re

Re: How to solve?

2014-09-06 Thread Jack Krupansky
Payloads really don't have first class support in Solr. It's a solid feature of Lucene, but never expressed well in Solr. Any thoughts or proposals are welcome! (Hmmm... I wonder what the good folks at Heliosearch have up their sleeves in this area?!) -- Jack Krupansky -Original Message

Re: FAST-like document vector data structures in Solr?

2014-09-05 Thread Jack Krupansky
relevance. The similarity vector is created during item processing and indicates the most important terms or concepts in the item and the corresponding weight.” See: http://msdn.microsoft.com/en-us/library/office/ff521597(v=office.14).aspx -- Jack Krupansky From: Jürgen Wagner (DVT) Sent: Friday

Re: How to implement multilingual word components fields schema?

2014-09-05 Thread Jack Krupansky
the same source text in multiple fields, one for each language. You can then do a dismax query on that set of fields. -- Jack Krupansky -Original Message- From: Ilia Sretenskii Sent: Friday, September 5, 2014 10:06 AM To: solr-user@lucene.apache.org Subject: How to implement
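
A dismax request over one field per language might be assembled as in the Python sketch below. The field names `text_en`, `text_fr`, and `text_de` are hypothetical, as is the helper name; `defType`, `q`, and `qf` are the standard Solr request parameters.

```python
from urllib.parse import urlencode


def multilingual_dismax_params(user_query, language_fields):
    """Build dismax parameters that search every per-language field."""
    return {
        "defType": "dismax",
        "q": user_query,
        "qf": " ".join(language_fields),  # qf takes a space-separated field list
    }


params = multilingual_dismax_params("maison", ["text_en", "text_fr", "text_de"])
print(urlencode(params))
```

Each field in `qf` is analyzed with its own language-specific field type, so the same user query is stemmed and filtered appropriately for each language.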

Re: FAST-like document vector data structures in Solr?

2014-09-05 Thread Jack Krupansky
Sounds like a great feature to add to Solr, especially if it would facilitate more automatic relevancy enhancement. LucidWorks Search has a feature called unsupervised feedback that does that but something like a docvector might make it a more realistic default. -- Jack Krupansky

Re: looking for a solr/search expert in Paris

2014-09-03 Thread Jack Krupansky
listing up to date, including regional availability and any specialties. -- Jack Krupansky -Original Message- From: elisabeth benoit Sent: Wednesday, September 3, 2014 4:02 AM To: solr-user@lucene.apache.org Subject: looking for a solr/search expert in Paris Hello, We are looking

Re: Specify Analyzer per field

2014-09-01 Thread Jack Krupansky
how to define and use custom analyzers as well. No, Solr does not have that feature per se - you have to specify a custom field TYPE to specify the analyzer. -- Jack Krupansky -Original Message- From: Ankit Jain Sent: Monday, September 1, 2014 2:14 AM To: solr-user@lucene.apache.org

Re: AW: Scaling to large Number of Collections

2014-09-01 Thread Jack Krupansky
: How many of your collections need to be simultaneously active? Say, in a one-hour period, how many of them will be updating and serving queries, and what query load per-collection and total query load do you need to design for? -- Jack Krupansky -Original Message- From: Christoph

Re: external indexer for Solr Cloud

2014-09-01 Thread Jack Krupansky
be great to have a standalone DIH that runs as a separate server and then sends standard Solr update requests to a Solr cluster. -- Jack Krupansky -Original Message- From: Lee Chunki Sent: Sunday, August 31, 2014 8:55 PM To: solr-user@lucene.apache.org Subject: Re: external indexer

Re: external indexer for Solr Cloud

2014-09-01 Thread Jack Krupansky
. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Monday, September 1, 2014 11:42 AM To: solr-user@lucene.apache.org Subject: Re: external indexer for Solr Cloud On 9/1/2014 7:19 AM, Jack Krupansky wrote: It would be great to have a standalone DIH that runs as a separate

Re: Indexing search list of Key/Value pairs

2014-09-01 Thread Jack Krupansky
simply query: php_skill:[5 TO *] AND ruby_skill:[2 TO *] -- Jack Krupansky -Original Message- From: amid Sent: Monday, September 1, 2014 12:24 PM To: solr-user@lucene.apache.org Subject: Indexing search list of Key/Value pairs Hi, I'm using solr and trying to index a list of key/value
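
For code that generates such queries, a minimal Python sketch follows. It assumes, as the query in the reply does, one numeric field per skill named `<skill>_skill`; the helper name is hypothetical.

```python
def skills_query(minimum_levels):
    """AND together one open-ended range clause per required skill."""
    clauses = [
        f"{skill}_skill:[{level} TO *]"
        for skill, level in minimum_levels.items()
    ]
    return " AND ".join(clauses)


print(skills_query({"php": 5, "ruby": 2}))
```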

Re: Indexing search list of Key/Value pairs

2014-09-01 Thread Jack Krupansky
AND 10 or more years of Software Development. -- Jack Krupansky -Original Message- From: amid Sent: Monday, September 1, 2014 12:50 PM To: solr-user@lucene.apache.org Subject: Re: Indexing search list of Key/Value pairs Hi Jack, Thanks for the fast response. I assume that using

Re: Scaling to large Number of Collections

2014-08-31 Thread Jack Krupansky
scenario at this time. Certainly suggestions for future enhancement can be made though. -- Jack Krupansky -Original Message- From: Christoph Schmidt Sent: Sunday, August 31, 2014 4:04 AM To: solr-user@lucene.apache.org Subject: Scaling to large Number of Collections we see at least

Re: AW: Scaling to large Number of Collections

2014-08-31 Thread Jack Krupansky
You close with two great questions for the community! We have a similar issue over in Apache Cassandra database land (thousands of tables). There is no immediate, easy, great answer. Other than the kinds of workarounds being suggested. -- Jack Krupansky -Original Message- From

Re: Scaling to large Number of Collections

2014-08-31 Thread Jack Krupansky
for a few shards or even just a single shard, and to instead focus the attention on large number of collections rather than heavily-sharded collections. -- Jack Krupansky -Original Message- From: Erick Erickson Sent: Sunday, August 31, 2014 12:04 PM To: solr-user@lucene.apache.org

Re: solr result handler??

2014-08-30 Thread Jack Krupansky
an application layer that forces that filter to be added. -- Jack Krupansky -Original Message- From: cmd.ares Sent: Saturday, August 30, 2014 2:10 AM To: solr-user@lucene.apache.org Subject: solr result handler?? I have a blackliststring save some keywords,and the query results need

Re: external indexer for Solr Cloud

2014-08-29 Thread Jack Krupansky
What exactly are you referring to by the term external indexer? -- Jack Krupansky -Original Message- From: Lee Chunki Sent: Friday, August 29, 2014 7:21 AM To: solr-user@lucene.apache.org Subject: external indexer for Solr Cloud Hi, Is there any way to run external indexer for solar

Re: Specify Analyzer per field

2014-08-29 Thread Jack Krupansky
That said, maybe you could provide a couple of examples of exactly what you want to do. -- Jack Krupansky -Original Message- From: Ankit Jain Sent: Friday, August 29, 2014 8:16 AM To: solr-user@lucene.apache.org Subject: Specify Analyzer per field Hi All, I would like to use schema less

Re: Specify Analyzer per field

2014-08-29 Thread Jack Krupansky
Different field TYPES, not different fields. -- Jack Krupansky -Original Message- From: Ahmet Arslan Sent: Friday, August 29, 2014 8:49 AM To: solr-user@lucene.apache.org Subject: Re: Specify Analyzer per field Hi, I think he wants to change query analyzer dynamically, where index

Re: Specify Analyzer per field

2014-08-29 Thread Jack Krupansky
But that doesn't let him change or override the analyzer for the field type. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Friday, August 29, 2014 11:55 AM To: solr-user Subject: Re: Specify Analyzer per field Can't you just use old fashion dynamic fields

Re: external indexer for Solr Cloud

2014-08-29 Thread Jack Krupansky
My other thought was that maybe he wants to do index updates outside of the cluster that is handling queries, and then copy in the completed index. Or... maybe take replicas out of the query rotation while they are updated. Or... maybe this is yet another X-Y problem! -- Jack Krupansky

Re: Query regarding URL Analysers

2014-08-28 Thread Jack Krupansky
-core/org/apache/solr/update/processor/URLClassifyProcessor.html The official doc is... pitiful, but I have doc and examples in my e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product-21203548.html -- Jack Krupansky -Original Message

Re: Solr range query issue

2014-08-27 Thread Jack Krupansky
, you could use a regex query term, but better to avoid that if at all possible. -- Jack Krupansky -Original Message- From: nutchsolruser Sent: Wednesday, August 27, 2014 12:21 AM To: solr-user@lucene.apache.org Subject: Solr range query issue Hi , I Am using solr 4.6.1 . I have name

Re: Solr content limits?

2014-08-27 Thread Jack Krupansky
chosen hardware, both a single machine and a small cluster, and measure how much load it can handle and how it performs. And then scale your cluster based on that application-specific performance data. -- Jack Krupansky -Original Message- From: lalitjangra Sent: Tuesday, August 26, 2014

Re: Solr CPU Usage

2014-08-27 Thread Jack Krupansky
Is the high usage just suddenly happening after a long period of up-time without it, or is this on a server restart? The latter can happen if you have a large commit log to replay because you haven't done hard commits. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent

Re: Help with StopFilterFactory

2014-08-26 Thread Jack Krupansky
by adding this attribute: luceneMatchVersion=4.3 But... the old behavior is now deprecated, so it most likely will not be in Solr 5.0. I'll think about this some more as to whether there might be some workaround or alternative. -- Jack Krupansky -Original Message- From: heaven

Re: Help with StopFilterFactory

2014-08-26 Thread Jack Krupansky
I agree that it's a bad situation, and wasn't handled well by the Lucene guys. They may have had good reasons, but they didn't execute a decent plan for how to migrate existing behavior. -- Jack Krupansky -Original Message- From: heaven Sent: Tuesday, August 26, 2014 6:51 AM

Re: embedded documents

2014-08-25 Thread Jack Krupansky
That's a completely different concept, I think - the ability to return a single field value as a structured JSON object in the writer, rather than simply loading from a nested JSON object and distributing the key values to normal Solr fields. -- Jack Krupansky -Original Message

Re: Exact search with special characters

2014-08-25 Thread Jack Krupansky
. -- Jack Krupansky -Original Message- From: Shay Sofer Sent: Monday, August 25, 2014 8:02 AM To: solr-user@lucene.apache.org Subject: RE: Exact search with special characters Hi, Thanks for your reply. I thought that google search work the same (quotes stand for exact match). Example

Re: Help with StopFilterFactory

2014-08-25 Thread Jack Krupansky
, try adding enablePositionIncrements=false, reindex, and see what happens. -- Jack Krupansky -Original Message- From: heaven Sent: Monday, August 25, 2014 3:37 AM To: solr-user@lucene.apache.org Subject: Re: Help with StopFilterFactory A valid search: http://pastie.org/pastes/9500661

Re: embedded documents

2014-08-25 Thread Jack Krupansky
to make Solr more automatic and more approachable, not an even more complicated toolkit. -- Jack Krupansky -Original Message- From: Erik Hatcher Sent: Monday, August 25, 2014 9:32 AM To: solr-user@lucene.apache.org Subject: Re: embedded documents Jack et al - there’s now this, which

Re: embedded documents

2014-08-25 Thread Jack Krupansky
And a comparison to Elasticsearch would be helpful, since ES gets a lot of mileage from their super-easy JSON support. IOW, how much of the ES advantage is eliminated. -- Jack Krupansky -Original Message- From: Noble Paul Sent: Monday, August 25, 2014 1:59 PM To: solr-user

Re: Exact search with special characters

2014-08-24 Thread Jack Krupansky
, but the concept of exact match is not supported for tokenized and filtered text fields. So, please describe, in plain English, plus examples, exactly what you expect your analyzer to do, both in terms of how it treats text to be indexed and how you expect to be able to query that text. -- Jack

Re: embedded documents

2014-08-24 Thread Jack Krupansky
be problematic. -- Jack Krupansky -Original Message- From: Michael Pitsounis Sent: Wednesday, August 20, 2014 7:14 PM To: solr-user@lucene.apache.org Subject: embedded documents Hello everybody, I had a requirement to store complicated json documents in solr. i have modified

Re: Help with StopFilterFactory

2014-08-24 Thread Jack Krupansky
confused the discussion here by failing to do so on at least one occasion, and possibly in this latest response although I can't tell for sure. 5. We'll confirm any mistakes you've made, make recommendations, and say whether there are any bugs. Fair enough? -- Jack Krupansky -Original Message

Re: Help with StopFilterFactory

2014-08-24 Thread Jack Krupansky
so that we can see what was really generated for the query. -- Jack Krupansky -Original Message- From: heaven Sent: Sunday, August 24, 2014 12:04 PM To: solr-user@lucene.apache.org Subject: Re: Help with StopFilterFactory I don't see any confusions, the problem is clearly explained

Re: Help with StopFilterFactory

2014-08-24 Thread Jack Krupansky
Just to confirm, the generated phrase query is generated using the analyzed terms, so if the stop filter is removing the terms, they won't appear in the generated query. It will be interesting to see what does get generated. -- Jack Krupansky -Original Message- From: heaven Sent

Re: Strange Behavior

2014-08-23 Thread Jack Krupansky
use case clearly - there may be some better way to try to achieve it. Use the analysis page of the Solr Admin UI to see the detailed query and index analysis of your terms. You'll be surprised. -- Jack Krupansky -Original Message- From: EXTERNAL Taminidi Ravi (ETI, Automotive-Service

Re: Minimum Match with filters that add tokens

2014-08-23 Thread Jack Krupansky
in the original query, the implementation (BooleanQuery) uses the terms generated by the analysis process, which can break up source terms into multiple terms and generate extra terms as well. Any MM number or percentage will count the terms output by analysis, not the source terms. -- Jack Krupansky

Re: Integrating DictionaryAnnotator and Solr

2014-08-23 Thread Jack Krupansky
Uhhh... UIMA... and parameter checking... NOT. You're probably missing something, but there is so much stuff. I have some examples in my e-book that show various errors you can get for missing/incorrect parameters for UIMA: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early

Re: Help with StopFilterFactory

2014-08-21 Thread Jack Krupansky
For the sake of completeness, please post the parsed query that you get when you add the debug=true parameter. IOW, how Solr/Lucene actually interprets the query itself. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Thursday, August 21, 2014 10:03 AM To: solr-user

Re: Substring and Case In sensitive Search

2014-08-21 Thread Jack Krupansky
performance, as long as the prefix isn't too short (e.g., cat*). See PrefixQuery: http://lucene.apache.org/core/4_9_0/core/org/apache/lucene/search/PrefixQuery.html ngram filters can also be used, but... that can make the index rather large. -- Jack Krupansky -Original Message- From: Umesh

Re: Substring and Case In sensitive Search

2014-08-19 Thread Jack Krupansky
to confirm whether you really need to use string as opposed to text field. -- Jack Krupansky -Original Message- From: Nishanth S Sent: Tuesday, August 19, 2014 12:03 PM To: solr-user@lucene.apache.org Subject: Substring and Case In sensitive Search Hi, I am very new to solr.How can I

Re: Performance of Boolean query with hundreds of OR clauses.

2014-08-19 Thread Jack Krupansky
CPU-bound or I/O-bound? -- Jack Krupansky -Original Message- From: SolrUser1543 Sent: Tuesday, August 19, 2014 2:57 PM To: solr-user@lucene.apache.org Subject: Performance of Boolean query with hundreds of OR clauses. I am using Solr to perform search for finding similar pictures

Re: Help with StopFilterFactory

2014-08-19 Thread Jack Krupansky
What release of Solr? Do you have autoGeneratePhraseQueries=true on the field? And when you said "But any of these does", did you mean "But NONE of these does"? -- Jack Krupansky -Original Message- From: heaven Sent: Tuesday, August 19, 2014 2:34 PM To: solr-user@lucene.apache.org

Re: explaination of query processing in SOLR

2014-08-17 Thread Jack Krupansky
In any case, besides the raw code and the similarity Javadoc, Lucene does have Javadoc for file formats: http://lucene.apache.org/core/4_9_0/core/org/apache/lucene/codecs/lucene49/package-summary.html -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Sunday, August 17

Re: Question

2014-08-14 Thread Jack Krupansky
query, and pivot query, with QTime, and debug=true timing to show which search components are consuming the time. -- Jack Krupansky -Original Message- From: Oded Sofer Sent: Thursday, August 14, 2014 6:29 AM To: solr-user@lucene.apache.org Subject: Question Hello We are implementing

Re: Solr cloud performance degradation with billions of documents

2014-08-14 Thread Jack Krupansky
patterns, which you will have to test for yourself, you will probably need to use an application layer to shard your 100s of billions to specific SolrCloud clusters. -- Jack Krupansky -Original Message- From: Wilburn, Scott Sent: Thursday, August 14, 2014 11:05 AM To: solr-user

Re: explaination of query processing in SOLR

2014-08-13 Thread Jack Krupansky
Why? The semantics are defined by the code and similarity matching algorithm, not... files. -- Jack Krupansky -Original Message- From: abhi Abhishek Sent: Wednesday, August 13, 2014 2:40 AM To: solr-user@lucene.apache.org Subject: Re: explaination of query processing in SOLR Thanks

Re: Solr cloud performance degradation with billions of documents

2014-08-13 Thread Jack Krupansky
Could you clarify what you mean with the term "cloud", as in "per cloud" and "individual clouds"? That's not a proper Solr or SolrCloud concept per se. SolrCloud works with a single cluster of nodes. And there is no interaction between separate SolrCloud clusters. -- Jack Krupansky -Original

Re: Solr cloud performance degradation with billions of documents

2014-08-13 Thread Jack Krupansky
with a rule of thumb of 100 million documents per node (and that is million, not billion.) That could be a lot higher - or a lot lower - based on your actual schema and data value distribution. -- Jack Krupansky -Original Message- From: Wilburn, Scott Sent: Wednesday, August 13, 2014

Re: Modifying date format when using TrieDateField.

2014-08-12 Thread Jack Krupansky
Use the parse date update request processor: http://lucene.apache.org/solr/4_9_0/solr-core/org/apache/solr/update/processor/ParseDateFieldUpdateProcessorFactory.html Additional examples are in my e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7

Re: Solr search \ special cases

2014-08-11 Thread Jack Krupansky
The use of a wildcard suppresses analysis of the query term, so the special characters remain, but... they were removed when the terms were indexed, so no match. You must manually emulate the index term analysis in order to use wildcards. -- Jack Krupansky -Original Message- From

Re: How can I request a big list of values ?

2014-08-10 Thread Jack Krupansky
Generally, large requests are an anti-pattern in modern distributed systems. Better to have a number of smaller requests executing in parallel and then merge the results in the application layer. -- Jack Krupansky -Original Message- From: Bruno Mannina Sent: Saturday, August 9, 2014
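
The pattern of several smaller requests run in parallel and merged in the application layer can be sketched in Python as below. `search_batch` is a stand-in for a real Solr request (an assumption for illustration); the batch size and worker count are arbitrary.

```python
from concurrent.futures import ThreadPoolExecutor


def chunked(values, size):
    """Split a list into consecutive batches of at most `size` items."""
    return [values[i:i + size] for i in range(0, len(values), size)]


def search_batch(batch):
    # Placeholder for a real Solr request; it just echoes one result per value.
    return [f"doc-for-{value}" for value in batch]


def search_in_parallel(values, batch_size=100, workers=4):
    """Run one request per batch concurrently and merge the results."""
    batches = chunked(values, batch_size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() yields results in batch order, so the merged list is stable.
        results = pool.map(search_batch, batches)
        return [doc for batch_result in results for doc in batch_result]


print(search_in_parallel(list(range(5)), batch_size=2))
```

Replacing `search_batch` with a function that issues one modest query per batch keeps each individual request small.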

Re: How can I request a big list of values ?

2014-08-10 Thread Jack Krupansky
that it is not a massive, blocking request. -- Jack Krupansky -Original Message- From: Bruno Mannina Sent: Sunday, August 10, 2014 6:04 PM To: solr-user@lucene.apache.org Subject: Re: How can I request a big list of values ? Hi Anshum, I can do it with 3.6 release no ? my main problem, it's that I have

Re: How can I request a big list of values ?

2014-08-10 Thread Jack Krupansky
potential of a system, which in this case is parallel execution of distributed components. -- Jack Krupansky -Original Message- From: Bruno Mannina Sent: Sunday, August 10, 2014 6:01 PM To: solr-user@lucene.apache.org Subject: Re: How can I request a big list of values ? Hi Jack, ok

Re: explaination of query processing in SOLR

2014-08-08 Thread Jack Krupansky
(Search Components), but none of it is down at that Lucene file level. -- Jack Krupansky -Original Message- From: abhi Abhishek Sent: Friday, August 8, 2014 7:59 AM To: solr-user@lucene.apache.org Subject: explaination of query processing in SOLR Hello, I am fairly new to SOLR, can

Re: Help Required

2014-08-08 Thread Jack Krupansky
And the Solr Support list is where people register their available consulting services: http://wiki.apache.org/solr/Support -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Friday, August 8, 2014 9:12 AM To: solr-user Subject: Re: Help Required We don't mediate

Re: Is it OK to have very big number of fields in solr/lucene ?

2014-08-08 Thread Jack Krupansky
and use more powerful hardware. Architect your application and model your data around the strengths of Solr (and Lucene.) And also look at your queries first, to make sure they will make sense. -- Jack Krupansky -Original Message- From: Lisheng Zhang Sent: Friday, August 8, 2014 5:25

Re: WordDelimiter

2014-08-08 Thread Jack Krupansky
The word delimiter filter is actually combining 100-001 into 11. You have BOTH catenateNumbers AND catenateAll, so 100-R8989 should generate THREE tokens: the concatenated numbers 100, the concatenated words R8989, and both numbers and words concatenated, 100R8989 . -- Jack Krupansky

Re: indexing comments with Apache Solr

2014-08-06 Thread Jack Krupansky
almost anything you want, but it's up to you to decide what you want to index. IOW, it is your obligation to come up with a data model. And the data model should be driven in large part by the query and access requirements mentioned above. -- Jack Krupansky -Original Message- From

Re: Suggestion for term searches

2014-08-06 Thread Jack Krupansky
OR tractor. -- Jack Krupansky -Original Message- From: Corey Gerhardt Sent: Wednesday, August 6, 2014 1:14 PM To: Solr User List Subject: Suggestion for term searches I have an interesting situation of searching Business Names where results should be partially sorted by position. Searching

Re: indexing comments with Apache Solr

2014-08-06 Thread Jack Krupansky
a stream of flat documents. -- Jack Krupansky -Original Message- From: Ali Nazemian Sent: Wednesday, August 6, 2014 9:35 AM To: solr-user@lucene.apache.org Subject: Re: indexing comments with Apache Solr Dear Alexandre, Hi, Thank you very much. I think nested document is what I need. Do you

Re: how to change field value during index time?

2014-08-06 Thread Jack Krupansky
An update request processor could do the trick. You can use the stateless script update processor to code a JavaScript snippet to do whatever logic you want. Plenty of examples in my e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product
