Re: Does updating a child document destroy the parent - child relationship

2014-06-24 Thread Jack Krupansky
, this limitation does not appear to be documented in the Solr ref guide. -- Jack Krupansky -Original Message- From: Vinay B, Sent: Tuesday, June 24, 2014 10:40 PM To: solr-user Subject: Does updating a child document destroy the parent - child relationship When I edit a child document

Re: Question about sending solrconfig and schema files with java

2014-06-20 Thread Jack Krupansky
it is probably not Solr-related. Sorry for the inconvenience! -- Jack Krupansky -Original Message- From: Frederic Esnault Sent: Friday, June 20, 2014 11:50 AM To: solr-user@lucene.apache.org Subject: Re: Question about sending solrconfig and schema files with java Hi Shawn, Actually i should

Re: Question about sending solrconfig and schema files with java

2014-06-20 Thread Jack Krupansky
Oops! Sorry I missed it. Please post of the rest of the info on SO as well. We'll get to it! -- Jack Krupansky -Original Message- From: Frederic Esnault Sent: Friday, June 20, 2014 7:03 PM To: solr-user@lucene.apache.org Subject: Re: Question about sending solrconfig and schema files

Re: Multivalue wild card search

2014-06-19 Thread Jack Krupansky
1. Wildcards do not work within quoted terms. 2. Spaces in terms need to be escaped. 3. The quotes embedded in a term do not need to be escaped. So, try: q=*[Steve,\ Wonder,\ ]] or q=*[Steve,\ Wonder,\ ]* -- Jack Krupansky -Original Message- From: Ethan Sent: Thursday, June 19

Re: docFreq coming to be more than 1 for unique id field

2014-06-17 Thread Jack Krupansky
, that model is clearly obsolete. -- Jack Krupansky -Original Message- From: Apoorva Gaurav Sent: Tuesday, June 17, 2014 11:15 AM To: solr-user ; Ahmet Arslan Subject: Re: docFreq coming to be more than 1 for unique id field Yes we have updates on these. Didn't try optimizing will do

Re: How to update one field without losing the others?

2014-06-14 Thread Jack Krupansky
Is this a schema in when the text field is actually populated via copyField from other fields? Or maybe not, but the text field may not have been a stored field. It needs to be one of the two. -- Jack Krupansky -Original Message- From: librarymark Sent: Saturday, June 14, 2014 10

Re: split field on json update

2014-06-12 Thread Jack Krupansky
You can easily write a JavaScript snippet using the stateless script update processor and do whatever string manipulation you want on an input value, and then write extracted strings to whatever field(s) you want. My e-book has plenty of script examples. -- Jack Krupansky -Original

Re: How to query for content with ACLs?

2014-06-12 Thread Jack Krupansky
Take a look at this: http://www.slideshare.net/lucenerevolution/wright-nokia-manifoldcfeurocon-2011 Karl has an old Jira patch somewhere for doing the ACLs processing in Solr. -- Jack Krupansky -Original Message- From: lalitjangra Sent: Thursday, June 12, 2014 9:28 AM To: solr-user

Re: Large disjunction query practices

2014-06-09 Thread Jack Krupansky
. Small is better. -- Jack Krupansky -Original Message- From: Joe Gresock Sent: Monday, June 9, 2014 8:50 AM To: solr-user@lucene.apache.org Subject: Large disjunction query practices I'm wondering what the best practice for large disjunct queries in Solr is. A user wants to submit

Re: accessing individual elements of a multivalued field

2014-06-09 Thread Jack Krupansky
Not currently. You could have separate explicit fields for the categories such as cat_1, cat_2, etc. The data would need to be replicated (possibly using a copyField), but redundancy to facilitate access is a reasonable approach. -- Jack Krupansky -Original Message- From

Re: Deepy nested structure

2014-06-07 Thread Jack Krupansky
you believe are the most complex queries your users are likely to need. Start by expressing them clearly in simple, plain English, unless the structured query is quite obvious. -- Jack Krupansky -Original Message- From: harikrishna Sent: Friday, June 6, 2014 9:35 AM To: solr-user

Re: Date number to text

2014-06-06 Thread Jack Krupansky
to keep it as text. You can also use your own update processor to convert to some other text format. See the stateless script update processor to write a simple JavaScript script. -- Jack Krupansky -Original Message- From: O. Klein Sent: Friday, June 6, 2014 4:32 AM To: solr-user

Re: Multivalue wild card search

2014-06-06 Thread Jack Krupansky
Wildcard, fuzzy, and regex query do work on multivalued fields, but on only a single value at a time. Your match cannot occur across values. -- Jack Krupansky -Original Message- From: Ethan Sent: Thursday, June 5, 2014 12:52 PM To: solr-user Subject: Re: Multivalue wild card search

Re: Mapping a field name before queryParser

2014-06-06 Thread Jack Krupansky
Edismax has field aliasing: http://wiki.apache.org/solr/ExtendedDisMax#Field_aliasing_.2F_renaming f.my_alias.qf=actual_field f.brand.qf=brand_name -- Jack Krupansky -Original Message- From: Antoine LE FLOC'H Sent: Friday, June 6, 2014 5:56 PM To: solr-user@lucene.apache.org

Re: Mapping a field name before queryParser

2014-06-06 Thread Jack Krupansky
Oops... for your example it would be: f.brand_name.qf=brand -- Jack Krupansky -Original Message- From: Antoine LE FLOC'H Sent: Friday, June 6, 2014 6:54 PM To: solr-user@lucene.apache.org Subject: Re: Mapping a field name before queryParser OK thank you. I wanted to see what's

Re: change in EnumField configuration - what do you think?

2014-06-05 Thread Jack Krupansky
Yeah, leaving holes as reservations for future fill-in seems like a reasonable use case. Why not just have explicit=n as an XML attribute for value. If no explicit attribute is given, then simply use the previous value plus one. No need for a separate pair then. -- Jack Krupansky

Re: Easises way to insatll solr cloud with tomcat

2014-06-05 Thread Jack Krupansky
As has been suggested on other threads here, the HDS distribution of Solr from Heliosearch comes preconfigured with Tomcat, so it really is the easiest way to install Solr with Tomcat. -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Thursday, June 5, 2014 8:32 AM

Re: Does lucene uses tries?

2014-06-05 Thread Jack Krupansky
Lucene uses trie for numeric and date fields: http://lucene.apache.org/core/4_8_0/core/org/apache/lucene/search/NumericRangeQuery.html -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Thursday, June 5, 2014 5:32 AM To: solr-user@lucene.apache.org Subject: Does lucene uses

Re: Use a field with space in qf

2014-06-04 Thread Jack Krupansky
Unfortunately, field aliasing works above the level of the qf parameter values. Maybe the Lucene guys could suggest a way to forcibly rename a field on disk. -- Jack Krupansky -Original Message- From: devraj.jaiman Sent: Wednesday, June 4, 2014 6:27 AM To: solr-user

Re: Solr maximum Optimal Index Size per Shard

2014-06-04 Thread Jack Krupansky
need to know. -- Jack Krupansky -Original Message- From: Vineet Mishra Sent: Wednesday, June 4, 2014 2:45 AM To: solr-user@lucene.apache.org Subject: Re: Solr maximum Optimal Index Size per Shard Thanks all for your response. I presume this conversation concludes that indexing around

Re: Multivalue wild card search

2014-06-04 Thread Jack Krupansky
Wildcard, fuzzy, and regex query operate on a single term of a single tokenized field value or a single string field value. -- Jack Krupansky -Original Message- From: Ethan Sent: Wednesday, June 4, 2014 6:59 PM To: solr-user Subject: Multivalue wild card search I can't seem to find

Re: suspect SOLR query from D029 (SOLR master)

2014-06-03 Thread Jack Krupansky
11 * 11 or 121 query terms, which shouldn't be so bad. But... maybe the Lucene FST for your synonym list is huge. Someone with deeper Lucene knowledge would have to address that. -- Jack Krupansky -Original Message- From: Branham, Jeremy [HR] Sent: Tuesday, June 3, 2014 3:57 AM

Re: Nested boosting in map function in solr?

2014-06-03 Thread Jack Krupansky
I think you need to use parameter substitution for those nested queries since the boost parameter takes a white-space delimited sequence of function queries. -- Jack Krupansky -Original Message- From: Kamal Kishore Aggarwal Sent: Tuesday, June 3, 2014 2:22 AM To: solr-user

Re: Solr maximum Optimal Index Size per Shard

2014-06-03 Thread Jack Krupansky
. -- Jack Krupansky -Original Message- From: Vineet Mishra Sent: Tuesday, June 3, 2014 8:43 AM To: solr-user@lucene.apache.org Subject: Solr maximum Optimal Index Size per Shard Hi All, Has anyone came across the maximum threshold document or size wise for each core of solr to hold. As I

Re: Solr maximum Optimal Index Size per Shard

2014-06-03 Thread Jack Krupansky
Anybody care to forecast when hardware will catch up with Solr and we can routinely look forward to newbies complaining that they indexed some data and after only 10 minutes they hit this weird 2G document count limit? -- Jack Krupansky -Original Message- From: Shawn Heisey Sent

Re: Solr 4.8 synonyms expansion for each primary term

2014-06-02 Thread Jack Krupansky
dug into the new REST API to know the answer. -- Jack Krupansky -Original Message- From: Archana R Sent: Monday, June 2, 2014 10:44 AM To: solr-user@lucene.apache.org Subject: Solr 4.8 synonyms expansion for each primary term we recently upgraded to Solr 4.8 and we are using REST API

Re: change in EnumField configuration - what do you think?

2014-06-02 Thread Jack Krupansky
or docvalues, right? -- Jack Krupansky -Original Message- From: Erick Erickson Sent: Monday, June 2, 2014 10:46 AM To: solr-user@lucene.apache.org Subject: Re: change in EnumField configuration - what do you think? Would both then be supported? I see where it would be easily detectable. And I

Re: solr multi-tenant: anyone use per-tenant synonyms file?

2014-06-02 Thread Jack Krupansky
Try to stay with a separate collection/core for each tenant - otherwise relevancy for document scores gets polluted by other tenants, even if you do use filter queries to isolate what documents get returned for a tenant in a multi-tenant core. -- Jack Krupansky -Original Message

Re: Uneven shard heap usage

2014-05-31 Thread Jack Krupansky
When you restart, how long does it take it hit the problem? And how much query or update activity is happening in that time? Is there any other activity showing up in the log? If you bring up only a single node in that problematic shard, do you still see the problem? -- Jack Krupansky

Re: Block document for certain search words

2014-05-30 Thread Jack Krupansky
Explain your use case a little more, but you can define terms as stop words with a stop filter, which means they won't appear in the index. -- Jack Krupansky -Original Message- From: Christian Loock Sent: Friday, May 30, 2014 5:38 AM To: solr-user@lucene.apache.org Subject: Block

Re: How to check and handle empty Solr query

2014-05-30 Thread Jack Krupansky
Consider using the edismax query parser, which allows you to specify q.alt which is an alternative query such as *:. Or use an application layer to handle the situation before it gets to Solr. -- Jack Krupansky -Original Message- From: vit Sent: Friday, May 30, 2014 9:01 AM To: solr

Re: search component needs access to results of previous component

2014-05-30 Thread Jack Krupansky
Solr-based product. It re-executes enhanced queries in a query component. I don't think the source is available though. And I don't recall if it was cloud-enabled. See: http://docs.lucidworks.com/display/lweug/Queries+and+Relevance -- Jack Krupansky -Original Message- From: Jitka Sent

Re: How to build Solr4.0 Final?

2014-05-30 Thread Jack Krupansky
Make sure you checkout from the tagged branch, not trunk or the actual release branch which has updates since the release. See: http://svn.apache.org/viewvc/lucene/dev/tags/ -- Jack Krupansky -Original Message- From: eShard Sent: Friday, May 30, 2014 11:55 AM To: solr-user

Re: wildcard matches in EnumField - what do I need to change in code to enable wildcard matches?

2014-05-29 Thread Jack Krupansky
://cwiki.apache.org/confluence/display/solr/Working+with+Enum+Fields -- Jack Krupansky -Original Message- From: Elran Dvir Sent: Thursday, May 29, 2014 2:50 AM To: solr-user@lucene.apache.org Subject: wildcard matches in EnumField - what do I need to change in code to enable wildcard matches? Hi all

Re: wildcard matches in EnumField - what do I need to change in code to enable wildcard matches?

2014-05-29 Thread Jack Krupansky
for this particular thread. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Thursday, May 29, 2014 9:46 AM To: solr-user@lucene.apache.org Subject: Re: wildcard matches in EnumField - what do I need to change in code to enable wildcard matches? On 5/29/2014 12:50 AM, Elran Dvir wrote

Re: Regex with local params is not working

2014-05-28 Thread Jack Krupansky
Post the parsed query itself. Yes, edismax should always generate a disjunctionmaxquery - in addition to the regexquery. -- Jack Krupansky -Original Message- From: Lokn Sent: Wednesday, May 28, 2014 1:53 AM To: solr-user@lucene.apache.org Subject: Re: Regex with local params

Re: Solr Cell Tika - date.formats

2014-05-28 Thread Jack Krupansky
-MM-dd hh:mm:ss -MM-dd HH:mm:ss EEE MMM d hh:mm:ss z EEE, dd MMM HH:mm:ss zzz , dd-MMM-yy HH:mm:ss zzz EEE MMM d HH:mm:ss See: https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika -- Jack Krupansky -Original Message

Re: Any Solrj API to obtain field list?

2014-05-27 Thread Jack Krupansky
You might consider an update request processor as an alternative. It runs on the server and might be simpler. You can even use the stateless script update processor to avoid having to write any custom Java code. -- Jack Krupansky -Original Message- From: T. Kuro Kurosaka Sent

Re: about analyzer and tokenizer

2014-05-26 Thread Jack Krupansky
: macbook,mac book Only use the synonyms filter at index time. The standard query parsers don't support phrases for synonyms. -- Jack Krupansky -Original Message- From: rachun Sent: Monday, May 26, 2014 5:26 AM To: solr-user@lucene.apache.org Subject: about analyzer and tokenizer Dear

Re: Solr - Cores not initialised

2014-05-26 Thread Jack Krupansky
mangled that message as well! Feel free to post the complete message here as well. -- Jack Krupansky -Original Message- From: Manikandan Saravanan Sent: Monday, May 26, 2014 1:52 AM To: solr-user@lucene.apache.org Cc: Varuna Venkatesh Subject: Solr - Cores not initialised Hi, I’m running

Re: Using SolrCloud with RDBMS or without

2014-05-26 Thread Jack Krupansky
You could also consider DataStax Enterprise, which integrates Apache Cassandra as the primary database and Solr for indexing and query. See: http://www.datastax.com/what-we-offer/products-services/datastax-enterprise -- Jack Krupansky -Original Message- From: Ali Nazemian Sent

Re: Query translation of User Fields

2014-05-25 Thread Jack Krupansky
. -- Jack Krupansky -Original Message- From: Yonik Seeley Sent: Sunday, May 25, 2014 10:58 AM To: solr-user@lucene.apache.org Subject: Re: Query translation of User Fields On Thu, May 22, 2014 at 10:56 AM, Jack Krupansky j...@basetechnology.com wrote: Hmmm... that doesn't sound like

Re: How to Disable Commit Option and Just Manage it via SolrConfig?

2014-05-23 Thread Jack Krupansky
, or maybe you would have to copy and edit it. Also, note that the delete command also has a commitWithin setting. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Thursday, May 22, 2014 6:36 AM To: solr-user@lucene.apache.org Subject: How to Disable Commit Option and Just Manage

Re: index a repository of documents(.doc) without using post.jar

2014-05-23 Thread Jack Krupansky
Is there a particular reason you are adverse to using post.jar? I mean, if there is some bug or inconvenience, let us know so we can fix it! The Solr server itself does not provide any ability to crawl file systems (LucidWorks Search does.) post.jar does provide that convenience. -- Jack

Re: index a repository of documents(.doc) without using post.jar

2014-05-23 Thread Jack Krupansky
Feel free to look at the source code for post.jar. I mean, all it is really doing is scanning the directory (optionally recursively) and then streaming each file to Solr. -- Jack Krupansky -Original Message- From: benjelloun Sent: Friday, May 23, 2014 8:15 AM To: solr-user

Re: Solr 4.7.2 ValueSourceParser classCast exception

2014-05-23 Thread Jack Krupansky
Are you sure that you compiled your code with the proper Solr jars so that the class signature (extends, implements, and constructors) matches the Solr 4.7.2 jars? I mean, Java is simply complaining that your class is not a valid value source class of the specified type. -- Jack Krupansky

Re: pdfs

2014-05-22 Thread Jack Krupansky
Yeah, I recall running into infinite loop issues with PDFBox in Solr years ago. They keep fixing these issues, but they keep popping up again. Sigh. -- Jack Krupansky -Original Message- From: Siegfried Goeschl Sent: Thursday, May 22, 2014 4:35 AM To: solr-user@lucene.apache.org

Re: multiple queries in single request

2014-05-22 Thread Jack Krupansky
No, I was rejecting BOTH methods 1 and 2. I was suggesting a different method. I'll leave it to somebody else to describe the method so that it is easier to understand. -- Jack Krupansky -Original Message- From: Pavel Belenkovich Sent: Thursday, May 22, 2014 4:00 AM To: solr-user

Re: Query translation of User Fields

2014-05-22 Thread Jack Krupansky
Hmmm... that doesn't sound like what I would have expected - I would have thought that Solr would throw an exception on the user field, rather than simply treat it as a text keyword. File a Jira. Either it's a bug or the doc is not complete. -- Jack Krupansky -Original Message- From

Re: Atomic update by query instead of ID

2014-05-22 Thread Jack Krupansky
No, but it sure would be nice to have the Elasticsearch feature of supplying a script for update. -- Jack Krupansky -Original Message- From: Saumitra Srivastav Sent: Thursday, May 22, 2014 11:13 AM To: solr-user@lucene.apache.org Subject: Atomic update by query instead of ID

Re: solr-user Digest of: get.100322

2014-05-21 Thread Jack Krupansky
it. Or, maybe we should enhance Solr to check available memory and log a stern warning if the index size exceeds system memory when Solr is started. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Tuesday, May 20, 2014 1:49 PM To: solr-user@lucene.apache.org Subject: Re

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
that is different? For the latter, the fq is purely a filter that removes documents, but the terms in fq do not participate in relevance or scoring calculation of documents, so documents will not get boosted based on the terms present in fq. -- Jack Krupansky -Original Message- From

Re: Distributed Search in Solr with different queries per shard

2014-05-21 Thread Jack Krupansky
Unfortunately the same query will be sent to all cores if you use the shards parameter to query multiple cores. Is there some characteristic of the first core that is distinct from the second core so that you could OR the differences between the two? -- Jack Krupansky -Original Message

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
the filtering terms to participate in the document scoring. In other words, what exactly were you trying to achieve by using fq? -- Jack Krupansky -Original Message- From: johnmu...@aol.com Sent: Wednesday, May 21, 2014 12:19 PM To: solr-user@lucene.apache.org Subject: Re: Using fq

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
, or add a boosting query using the bq parameter. The latter approach works for the dismax and edismax query parsers only. -- Jack Krupansky -Original Message- From: johnmu...@aol.com Sent: Wednesday, May 21, 2014 5:51 PM To: solr-user@lucene.apache.org Subject: Re: Using fq as OR Hi

Re: multiple queries in single request

2014-05-21 Thread Jack Krupansky
Nothing special for this use case. This seems to be a use case that I would call bulk data retrieval - based on ID. I would suggest batching your requests - limit each request query to, say, 50 or 100 IDs. -- Jack Krupansky -Original Message- From: Pavel Belenkovich Sent

Re: boosting multivalued fields

2014-05-21 Thread Jack Krupansky
Yes. -- Jack Krupansky -Original Message- From: vit Sent: Wednesday, May 21, 2014 11:20 AM To: solr-user@lucene.apache.org Subject: boosting multivalued fields is it posiible to boost values of the same field. For example in a query like that: category_id:(2271578^0.5 22718986^0.4

Re: Using fq as OR

2014-05-21 Thread Jack Krupansky
The results will be scored, but only based on terms in q, not terms in fq. -- Jack Krupansky -Original Message- From: johnmu...@aol.com Sent: Wednesday, May 21, 2014 6:41 PM To: solr-user@lucene.apache.org Subject: Re: Using fq as OR Interesting!! I did not know that using fq means

Re: Applying boosting for keyword search

2014-05-21 Thread Jack Krupansky
Just add the boost to the keyword: q=toyota^100. Or, use the dismax or edismax query parsers and then the boost can be specified for the field: qf=keyword^100. -- Jack Krupansky -Original Message- From: manju16832003 Sent: Thursday, May 22, 2014 12:04 AM To: solr-user

Re: pdfs

2014-05-21 Thread Jack Krupansky
symptom? -- Jack Krupansky -Original Message- From: Brian McDowell Sent: Thursday, May 22, 2014 12:24 AM To: solr-user@lucene.apache.org Subject: pdfs Has anyone had issues with indexing pdf files? Some pdfs are bringing down Solr completely so that it actually needs to be manually

Re: Applying boosting for keyword search

2014-05-21 Thread Jack Krupansky
Your original message had q=toyota featured:true^100 and also using bq - both are valid. If either is not working for you, please be specific about what exactly is not behaving as you expected - what the symptom is. Sometimes you have to experiment with the boost factor. -- Jack Krupansky

Re: Howto Search word which contains the character

2014-05-20 Thread Jack Krupansky
, but there are some examples in my e-book that shows how to map @ and _ to ALPHA. -- Jack Krupansky -Original Message- From: Ahmet Arslan Sent: Tuesday, May 20, 2014 4:55 AM To: solr-user@lucene.apache.org Subject: Re: Howto Search word which contains the character Hi, It is special query parser

Re: trigger delete on nested documents

2014-05-20 Thread Jack Krupansky
document would make it discontiguous with the rest of the block of documents. Just update the block by resending the entire block of documents. For e previous discussion of this limitation: http://lucene.472066.n3.nabble.com/block-join-and-atomic-updates-td4117178.html -- Jack Krupansky

Re: How to delete docs only in a spec shard within a collection

2014-05-19 Thread Jack Krupansky
Normally a delete-by-query needs to be distributed to all shards since it is not known which document keys it will affect. It sounds like SolrCloud does not support the feature you would really like: support for distrb=false. You can file a Jira request for a feature improvement. -- Jack

Re: Subscribe for Solr Mails

2014-05-19 Thread Jack Krupansky
Read this web page for information about subscribing: https://lucene.apache.org/solr/discussion.html -- Jack Krupansky -Original Message- From: Kamal Kishore Aggarwal Sent: Monday, May 19, 2014 6:12 AM To: solr-user@lucene.apache.org Subject: Subscribe for Solr Mails Dear Team

Re: Index / Query IP Address as number.

2014-05-19 Thread Jack Krupansky
addresses. And then you could output to multiple fields - one for the raw string for wildcard matches, say, and one as an integer for proximity or range checks. -- Jack Krupansky -Original Message- From: SolrUser1543 Sent: Monday, May 19, 2014 3:04 PM To: solr-user@lucene.apache.org Subject

Re: Index / Query IP Address as number.

2014-05-18 Thread Jack Krupansky
What are you using for your default query operator, and do you have autoGeneratePhraseQueries set to true for your field type? I mean, a query for 192.168.1.4 shouldn't match 192.168.1.3 - unless you have autoGeneratePhraseQueries set to false (the default.) -- Jack Krupansky -Original

Re: What is the usage of solr.NumericPayloadTokenFilterFactory

2014-05-17 Thread Jack Krupansky
, it sure would be nice to see more substantial and easier to use support for Payload in Solr. -- Jack Krupansky -Original Message- From: ienjreny Sent: Saturday, May 17, 2014 1:24 AM To: solr-user@lucene.apache.org Subject: Re: What is the usage of solr.NumericPayloadTokenFilterFactory

Re: Solr 4.8: Does eDisMax parser calls analyzer chain to tokenize?

2014-05-17 Thread Jack Krupansky
on WDF in my e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product-21203548.html -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Saturday, May 17, 2014 1:13 PM To: solr-user@lucene.apache.org Subject: Re: Solr

Status of mail?

2014-05-16 Thread Jack Krupansky
Is the mail list working again yet?? -- Jack Krupansky

Re: Difference between search strings

2014-05-16 Thread Jack Krupansky
For these specific examples, the results should be the same, but mostly that's because the term is a simple sequence of letters. I have an extended discussion of characters in terms in my e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook

Re: date range queries efficiency

2014-05-16 Thread Jack Krupansky
query matches such as year without expanding the date to a range for the full interval. -- Jack Krupansky -Original Message- From: Dmitry Kan Sent: Friday, May 9, 2014 6:41 AM To: solr-user@lucene.apache.org Subject: date range queries efficiency Hi, There was a mention either

Re: slow performance on simple filter

2014-05-16 Thread Jack Krupansky
Add the debugQuery=true parameter and look at the timing section to see which search component is consuming the time. Are you using faceting or highlighting? 7 million documents is actually a fairly small index. -- Jack Krupansky -Original Message- From: mizayah Sent: Wednesday

Re: Difference between search strings

2014-05-15 Thread Jack Krupansky
Inside of quotes you only have to escape quote and backslash. Add the debugQuery=true parameter to see exactly how Solr processes characters and generates queries. But... in a URL you have to URL-encode URL query parameters: http://en.wikipedia.org/wiki/Query_string -- Jack Krupansky

Re: Help to Understand a Solr Query

2014-05-15 Thread Jack Krupansky
Please don't re-use an existing message thread for a new, completely independent question! Also, try to make the subject line indicate something about the actual issue. -- Jack Krupansky -Original Message- From: nativecoder Sent: Tuesday, May 13, 2014 10:56 AM To: solr-user

Re: distrib=false is not honoring

2014-05-14 Thread Jack Krupansky
rationale? What is this odd mcat.intent query response writer type that you are specifying with the qt parameter? -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Wednesday, May 14, 2014 1:35 AM To: solr-user@lucene.apache.org Subject: distrib=false is not honoring I am trying

Re: KeywordTokenizerFactory splits the string for the exclamation mark

2014-05-14 Thread Jack Krupansky
://lucene.apache.org/core/4_8_0/queryparser/org/apache/lucene/queryparser/classic/package-summary.html#Escaping_Special_Characters -- Jack Krupansky -Original Message- From: Romani Rupasinghe Sent: Tuesday, May 13, 2014 11:14 AM To: solr-user@lucene.apache.org Subject: KeywordTokenizerFactory

Re: Indexing DateField timezone problem

2014-05-14 Thread Jack Krupansky
that to dates. -- Jack Krupansky -Original Message- From: hakanbillur Sent: Friday, May 9, 2014 4:38 AM To: solr-user@lucene.apache.org Subject: Indexing DateField timezone problem http://lucene.472066.n3.nabble.com/file/n4135079/Capture2.png http://lucene.472066.n3.nabble.com/file

Re: Easises way to insatll solr cloud with tomcat

2014-05-13 Thread Jack Krupansky
Check out HDS from Heliosearch - it comes packaged with Tomcat, ready to go: http://heliosearch.com/download.html -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Monday, May 12, 2014 8:23 AM To: solr-user@lucene.apache.org Subject: Re: Easises way to insatll solr cloud

Re: Too many documents Exception

2014-05-13 Thread Jack Krupansky
One of the hard-core Lucene guys is going to have to help you out. Or you may have to write some custom code to fix the index for any such shard. If you have deleted any documents, it may be sufficient to simply optimize the index. -- Jack Krupansky -Original Message- From: yamazaki

Re: What is the usage of solr.NumericPayloadTokenFilterFactory

2014-05-13 Thread Jack Krupansky
I do have basic coverage for that filter (and all other filters) and the parameter values in my e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product-21203548.html That said, are you sure you want to be using the payload feature of Lucene

Re: LetterTokenizerFactory doesn't work as expected

2014-05-11 Thread Jack Krupansky
Please post your full field type analyzer. The letter tokenizer should in fact return I, can, and t - if it is used properly. -- Jack Krupansky -Original Message- From: ienjreny Sent: Saturday, May 10, 2014 8:28 AM To: solr-user@lucene.apache.org Subject: LetterTokenizerFactory

Re: is it possible for solr to calculate and give back the price of a product based on its sub-products

2014-05-11 Thread Jack Krupansky
clarify your use case. -- Jack Krupansky -Original Message- From: Gharbi Mohamed Sent: Friday, May 9, 2014 3:29 PM To: solr-user@lucene.apache.org Subject: is it possible for solr to calculate and give back the price of a product based on its sub-products Hi, I am using Solr

Re: Too many documents Exception

2014-05-07 Thread Jack Krupansky
such an overstuffed index. -- Jack Krupansky -Original Message- From: [Tech Fun]山崎 Sent: Tuesday, May 6, 2014 8:54 PM To: solr-user@lucene.apache.org Subject: Too many documents Exception Hello everybody, Solr 4.3.1(and 4.7.1), Num Docs + Deleted Docs 2147483647(Integer.MAX_VALUE) over Caused

Re: Use XSD or DTD to make Solr schema?

2014-05-07 Thread Jack Krupansky
Sorry, but you have to create the schema manually, but... you could possibly get by with Solr schemaless mode to dynamically create the schema based on the actual data values. See: https://cwiki.apache.org/confluence/display/solr/Schemaless+Mode -- Jack Krupansky -Original Message

Re: Indexing scanned PDFs

2014-05-06 Thread Jack Krupansky
a word processor source file will retain smooth curves on characters while a PDF generated from scanned page images will show heavy pixelation. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Tuesday, May 6, 2014 1:30 AM To: solr-user@lucene.apache.org Subject: Re

Re: Help to Understand a Solr Query

2014-05-06 Thread Jack Krupansky
The + symbol means a clause of a boolean query that must be present, as opposed to should (optionally) be present. This is equivalent to the AND operator. The | means OR for a disjunction maximum query to indicates the alternatives - at least one of the alternatives must match. -- Jack

Re: Help to Understand a Solr Query

2014-05-06 Thread Jack Krupansky
Think of debugQuery as your Solr BFF! -- Jack Krupansky -Original Message- From: nativecoder Sent: Tuesday, May 6, 2014 7:36 AM To: solr-user@lucene.apache.org Subject: Re: Help to Understand a Solr Query Thank you this is what I was looking for all this time I wanted to understand

Re: Wildcard malfunctioning

2014-05-05 Thread Jack Krupansky
or MappingCharFilterFactory. -- Jack Krupansky -Original Message- From: Román González Sent: Monday, May 5, 2014 7:00 AM To: solr-user@lucene.apache.org Subject: Wildcard malfunctioning Hi all! Sorry in advance if this question was posted but I were unable to find it with search engines

Re: Help to Understand a Solr Query

2014-05-05 Thread Jack Krupansky
Read up on the edismax query parser first: http://wiki.apache.org/solr/ExtendedDisMax The ^ operator is known as boosting or field boosting and is used to influence document scores for relevancy. It has no analog in SQL. -- Jack Krupansky -Original Message- From: nativecoder Sent

Re: Help to Understand a Solr Query

2014-05-05 Thread Jack Krupansky
dismax means Disjunction Maximum, which means Lucene takes the highest scoring clause (field), for each search term. This is effectively an OR of the clauses. -- Jack Krupansky -Original Message- From: nativecoder Sent: Monday, May 5, 2014 11:21 AM To: solr-user@lucene.apache.org

Re: Turning on KeywordRepeat and RemoveDups on an existing fieldType.

2014-05-05 Thread Jack Krupansky
the reindex incrementally (as you replace existing documents) as well if you don't mind if the difference in relevancy takes an extended time to become apparent. -- Jack Krupansky -Original Message- From: Michael Tracey Sent: Monday, May 5, 2014 4:52 PM To: solr-user@lucene.apache.org

Re: Relevancy help

2014-05-05 Thread Jack Krupansky
the document scores look like. -- Jack Krupansky -Original Message- From: Ravi Solr Sent: Monday, May 5, 2014 5:41 PM To: solr-user@lucene.apache.org Subject: Relevancy help Hello, I have a weird relevancy requirement. We search news content hence chronology is very important and also

Re: Strict Search in Apache Solr

2014-05-05 Thread Jack Krupansky
The term strict search is not in the Lucene/Solr nomenclature - it could mean any number of things. It sounds as if maybe you want to do a phrase search, looking for an exact phrase - yes, you can do that by enclosing the phrase in quotes. -- Jack Krupansky -Original Message- From

Re: Explain Solr Query Execution

2014-05-04 Thread Jack Krupansky
should start by telling us what you are trying to achieve, in plain English. -- Jack Krupansky -Original Message- From: nativecoder Sent: Sunday, May 4, 2014 2:20 PM To: solr-user@lucene.apache.org Subject: Explain Solr Query Execution How will a query like below will get executed

Re: Wildcard search not working with search term having special characters and digits

2014-04-28 Thread Jack Krupansky
be analyzed at index time for wildcard to work. Ditto is your filed type uses the word delimiter filter with the split digits option enabled - the alpha and numeric portions will generate separate terms - and cause a wildcard to fail. -- Jack Krupansky -Original Message- From: Geepalem

Re: Stemming not working with wildcard search

2014-04-28 Thread Jack Krupansky
Wildcards and stemming are incompatible at query time - you need to manually stem the term before applying your wildcard. Wildcards are not supported in quoted phrases. They will be treated as punctuation, and ignored by the standard tokenizer or the word delimiter filter. -- Jack Krupansky

Re: How can I convert xml message for updating a Solr index to a javabin file

2014-04-27 Thread Jack Krupansky
Look at the SolrJ source code and doc. JavaBin is more of a protocol than a file format. -- Jack Krupansky -Original Message- From: Elran Dvir Sent: Sunday, April 27, 2014 2:16 AM To: solr-user@lucene.apache.org Subject: RE: How can I convert xml message for updating a Solr index

Re: get term frequency, just only keywords search

2014-04-26 Thread Jack Krupansky
documents containing a phrase - just use the pf, pf2, and pf3 parameters of edsimax or explicitly boost the phrase, such as research development^20. -- Jack Krupansky -Original Message- From: ksmith Sent: Saturday, April 26, 2014 5:38 AM To: solr-user@lucene.apache.org Subject: Re

Re: dynamic field assignments

2014-04-25 Thread Jack Krupansky
, but it is not a substitute for professional review. -- Jack Krupansky -Original Message- From: John Thorhauer Sent: Friday, April 25, 2014 7:10 AM To: solr-user@lucene.apache.org Subject: dynamic field assignments I have a scenario where I would like dynamically assign incoming document fields to two

<    2   3   4   5   6   7   8   9   10   11   >