Re: Solr irregularly having QTime 50000ms, stracing solr cures the problem

2014-07-21 Thread Harald Kirsch
Hi IJ, yes indeed, there are multiple node. But I have a 50 seconds delay, not 5 seconds. Anyway I will keep this in mind and will experiment with the hosts file if it starts to get annoying again. Cheers, Harald. On 16.07.2014 19:44, IJ wrote: I know u mentioned you have a single

Re: Plugin init failure for custom analysis filter

2014-07-21 Thread ssivakumaran
Hi, I am not able to find anything in the log or rather not that specific. This error is being thrown when I add a string argument to my filter in schema. If I remove the same, I donot get any error. I tried changing the datatype but still same error. A little more detail regarding the filter

stats.facet with multi-valued field in Solr 4.9

2014-07-21 Thread Nico Kaiser
Hi! I am storing aggregated article click statistics for a website in a Lucene database. Website articles (i.e., pages in this case) can have multiple associated financial instruments, which – for statistics reasons – I also copy to Lucene. So basically this data is stored (and regularly

Re: stats.facet with multi-valued field in Solr 4.9

2014-07-21 Thread Yonik Seeley
On Mon, Jul 21, 2014 at 7:09 AM, Nico Kaiser n...@kaiser.me wrote: After the upgrade to Solr 4.9 (from 3.6) this seems not to be possible anymore: Stats can only facet on single-valued fields, not: instrumentIds https://issues.apache.org/jira/browse/SOLR-3642 It looks like perhaps it never

Re: stats.facet with multi-valued field in Solr 4.9

2014-07-21 Thread Nico Kaiser
Yonik, thanks for your reply! I also found https://issues.apache.org/jira/browse/SOLR-1782 which also sees to deal with this, but I did not find out wether there is a workaround. For our use case the previous behaviour was ok and seemed (!) to be consistent. However I understand that this

Re: stats.facet with multi-valued field in Solr 4.9

2014-07-21 Thread Yonik Seeley
On Mon, Jul 21, 2014 at 7:32 AM, Nico Kaiser n...@kaiser.me wrote: Yonik, thanks for your reply! I also found https://issues.apache.org/jira/browse/SOLR-1782 which also sees to deal with this, but I did not find out wether there is a workaround. For our use case the previous behaviour was

AUTO: Nicholas M. Wertzberger is out of the office (returning 07/23/2014)

2014-07-21 Thread Nicholas M. Wertzberger
I am out of the office until 07/23/2014. I'm out of town for the next few days. I am reachable by Blackberry, if needed. Please contact Jason Brown for anything JAS Team related. Note: This is an automated response to your message Re: questions on Solr WordBreakSolrSpellChecker and

faceting within facets

2014-07-21 Thread David Flower
Hi Is it possible to create a facet within another facet in a single query, currently I'm having to filter the query with facet.query=type:foo and running the query multiple times to return the number and type of object created on a given date. Is it even possible to return this in a single

Re: faceting within facets

2014-07-21 Thread Yonik Seeley
On Mon, Jul 21, 2014 at 8:08 AM, David Flower dflo...@amplience.com wrote: Is it possible to create a facet within another facet in a single query For simple field facets, there's pivot faceting. For more complex nested facets, there are sub-facets in heliosearch (a solr fork):

Solr Cassandra MySQL Best Practice Indexing

2014-07-21 Thread Yavar Husain
So my full text data lies on Cassandra along with an ID. Now I have a lot of structured data linked to the ID which lies on an RDBMS (read MySQL). I need this structured data as it would help me with my faceting and other needs. What is the best practice in going about indexing in this scenario.

Re: Solr Cassandra MySQL Best Practice Indexing

2014-07-21 Thread Jack Krupansky
Solandra is not a supported product. DataStax Enterprise (DSE) supersedes it. With DSE, just load your data into a Solr-enabled Cassandra data center and it will be indexed automatically in the embedded Solr within DSE, as per a Solr schema that you provide. Then use any of the nodes in that

RE: SolrCloud performance issues regarding hardware configuration

2014-07-21 Thread Toke Eskildsen
search engn dev [sachinyadav0...@gmail.com] wrote: Yes, You are right my facet queries are for text analytic purpose. Does this mean that facet calls are rare (at most one at a time)? Users will send boolean and spatial queries. current performance for spatial queries is 100qps with 150

Query about Solr

2014-07-21 Thread Ameya Aware
Hi, How can i stop content of file from being getting indexed?? Will removing content field from schema.xml do that job? Thanks, Ameya

Edit Example Post.jar to read ALL file types

2014-07-21 Thread jrusnak
I am working with Solr 4.8.1 to set up an enterprise search system. The file system I am working with has numerous files with unique extension types (ex .20039 .20040 .20041 etc.) I am using the post.jar file included in the binary download (src: SimplePostTool.java

Re: Query about Solr

2014-07-21 Thread Alexandre Rafalovitch
Nothing gets indexed automatically. So you must be doing something (e.g. Nutch). Tell us what that something is first so we know your baseline setup. Regards, Alex On 21/07/2014 9:43 pm, Ameya Aware ameya.aw...@gmail.com wrote: Hi, How can i stop content of file from being getting

Re: Query about Solr

2014-07-21 Thread Ameya Aware
Hi, The data coming into Solr is different metadata such as author, created time, last modified time etc along with content of the file. So indexing content is giving me different errors, so i just simply want to skip indexing content part. Thanks, Ameya On Mon, Jul 21, 2014 at 11:07 AM,

Re: Query about Solr

2014-07-21 Thread Jack Krupansky
Set the field type for such a field to ignored. Or set it to string and then you can still examine or query the data even if it is not properly formatted. -- Jack Krupansky -Original Message- From: Ameya Aware Sent: Monday, July 21, 2014 11:12 AM To: solr-user@lucene.apache.org

Solr schema.xml query analyser

2014-07-21 Thread prashantc88
0 down vote favorite I am a complete beginner to Solr and need some help. My task is to provide a match when the search term contains the indexed field. For example: If query= foo bar and textExactMatch= foo, I should not get a MATCH If query= foo bar and textExactMatch= foo

Re: Solr schema.xml query analyser

2014-07-21 Thread Jack Krupansky
If you don't specify a query analyzer, Solr will use the index analyzer at query time. But... at query time there is something called a query parser which typically breaks the query into separate terms, delimited by white space, and then calls the analyzer for each term, separately. You can

Re: Solr schema.xml query analyser

2014-07-21 Thread prashantc88
Thanks Jack for the reply. I did not mention the query time analyzer in my post because I wasn't sure what should be put there. With regards to your reply, If I put the query term in quotes, would I get a match for the following: Indexed field value: foo bar Query term: foo bar xyz/xyz foo bar

Re: Solr schema.xml query analyser

2014-07-21 Thread Jack Krupansky
Based on your stated requirements, there is no obvious need to use the keyword tokenizer. So fix that and then quoted phrases or escaped spaces should work. -- Jack Krupansky -Original Message- From: prashantc88 Sent: Monday, July 21, 2014 11:51 AM To: solr-user@lucene.apache.org

Re: Solr schema.xml query analyser

2014-07-21 Thread newBie88
My apologies Jack. But there was a mistake in my question. I actually switched query and textExactMatch in my question. I would be really helpful if you could have a look at the scenario once again: My task is to provide a match when the search term contains the indexed field. For example:

RE: text search problem

2014-07-21 Thread EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)
Thanks for the reply Erick, I will try as you suggested. There I have another question related to this lines. When I have - in my description , name then the search results are different. For e.g. ABC-123 , it look sofr ABC or 123, I want to treat this search as exact match, i.e if my

Re: Solr schema.xml query analyser

2014-07-21 Thread Jack Krupansky
That sounds more like a reverse query - trying to match documents against the query rather than matching the query against the documents. Solr doesn't have that feature currently. Although I'm not absolutely sure what your textExactMatch is. I'm guessing that it is a document field in your

RE: Multiterm analysis in complexphrase query

2014-07-21 Thread Gopal Agarwal
That would be really useful. Can you upload the jar and its requirements? It also makes it pluggable with diff versions of solr. On Jul 1, 2014 9:01 PM, Allison, Timothy B. talli...@mitre.org wrote: If there's enough interest, I might get back into the code and throw a standalone src (and

How do I disable distributed search feature when I have only one shard

2014-07-21 Thread pramodEbay
Hi there, We have a solr cloud set up with only one shard. There is one leader and 15 followers. So the data is replicated on 15 nodes. When we run a solr query, only one node should handle the request and we do not need any distributed search feature as all the nodes are exact copies of each

SolrCloud replica dies under high throughput

2014-07-21 Thread Darren Lee
Hi, I'm doing some benchmarking with Solr Cloud 4.9.0. I am trying to work out exactly how much throughput my cluster can handle. Consistently in my test I see a replica go into recovering state forever caused by what looks like a timeout during replication. I can understand the timeout and

Re: SolrCloud replica dies under high throughput

2014-07-21 Thread Mark Miller
Looks like you probably have to raise the http client connection pool limits to handle that kind of load currently. They are specified as top level config in solr.xml: maxUpdateConnections maxUpdateConnectionsPerHost -- Mark Miller about.me/markrmiller On July 21, 2014 at 7:14:59 PM, Darren

SolrCloud extended warmup support

2014-07-21 Thread Jeff Wartes
I’d like to ensure an extended warmup is done on each SolrCloud node prior to that node serving traffic. I can do certain things prior to starting Solr, such as pump the index dir through /dev/null to pre-warm the filesystem cache, and post-start I can use the ping handler with a health check

Re: SolrCloud extended warmup support

2014-07-21 Thread Shawn Heisey
On 7/21/2014 5:37 PM, Jeff Wartes wrote: I’d like to ensure an extended warmup is done on each SolrCloud node prior to that node serving traffic. I can do certain things prior to starting Solr, such as pump the index dir through /dev/null to pre-warm the filesystem cache, and post-start I

Re: SolrCloud extended warmup support

2014-07-21 Thread Jeff Wartes
On 7/21/14, 4:50 PM, Shawn Heisey s...@elyograg.org wrote: On 7/21/2014 5:37 PM, Jeff Wartes wrote: I¹d like to ensure an extended warmup is done on each SolrCloud node prior to that node serving traffic. I can do certain things prior to starting Solr, such as pump the index dir through

Re: Edit Example Post.jar to read ALL file types

2014-07-21 Thread Erick Erickson
So how do you expect these to be indexed? I mean what happens if you run across a Word document? How about an mp3? Just blasting all files up seems chancy. And doesn't just 'java -jar post.jar * ' do what you ask? This seems like an XY problem, _why_ do you want to do this? Because unless the

Re: text search problem

2014-07-21 Thread Erick Erickson
Try escaping the hyphen as \-. Or enclosing it all in quotes. But you _really_ have to spend some time with the debug option an admin/analysis page or you will find endless surprises. Best, Erick On Mon, Jul 21, 2014 at 11:12 AM, EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)

Re: How do I disable distributed search feature when I have only one shard

2014-07-21 Thread Erick Erickson
Are you using CloudSolrServer in your SolrJ program? No matter what, the distrib=false should be keeping the query from going to more than one shard So I'd check the logs and see if the suspect query appears in more than one node. FWIW, Erick On Mon, Jul 21, 2014 at 4:13 PM, pramodEbay

Re: SolrCloud extended warmup support

2014-07-21 Thread Erick Erickson
I've never seen it necessary to run thousands of queries to warm Solr. Usually less than a dozen will work fine. My challenge would be for you to measure performance differences on queries after running, say, 12 well-chosen queries as opposed to hundreds/thousands. I bet that if 1 you search

DocValues without re-index?

2014-07-21 Thread Michael Ryan
Is it possible to use DocValues on an existing index without first re-indexing? -Michael