Re: initializing parameters of an alayzer

2015-09-03 Thread Roxana Danger
Hi Ahmet, You are absolutely right, I will divide my analyzer as suggested. Thank you very much, Roxana On 3 September 2015 at 16:23, Ahmet Arslan wrote: > Hi Roxana, > > Although it is possible to plug Lucene analyser in schema.xml, it is not > the recommended

Re: Not able to get logging in solr

2015-09-03 Thread Erick Erickson
If you insist on using Tomcat, the log is often in catalina.out On Thu, Sep 3, 2015 at 7:04 AM, Upayavira wrote: > > > On Thu, Sep 3, 2015, at 02:31 PM, shahper wrote: >> >> On Thursday 03 September 2015 05:48 PM, Upayavira wrote: >> > >> > On Thu, Sep 3, 2015, at 11:32 AM,

Re: Stemming words Using Solr

2015-09-03 Thread Jack Krupansky
The # in the URL says to send the request to the admin UI, which of course returns an HTML web page. Instead, send the analysis URL fragment directly to the analysis API (not UI) for the Solr core, without the #. -- Jack Krupansky On Thu, Sep 3, 2015 at 8:45 AM, Ritesh Sinha <

Re: Custom merge logic in SolrCloud.

2015-09-03 Thread Upayavira
Mohan, I asked you to give more details of what the actual problem is you are trying to solve. What is going wrong? Why do you need to "blend" - what's wrong with the results you are getting now? Why do you need custom logic? If we can understand the problem more, we may be better able to help

Re: Stemming words Using Solr

2015-09-03 Thread Upayavira
yes, the URL should be something like: http://localhost:8983/solr/images/analysis/field?wt=json=true== Upayavira On Thu, Sep 3, 2015, at 03:23 PM, Jack Krupansky wrote: > The # in the URL says to send the request to the admin UI, which of > course > returns an HTML web page. Instead, send the

Re: Not able to get logging in solr

2015-09-03 Thread Upayavira
On Thu, Sep 3, 2015, at 02:31 PM, shahper wrote: > > On Thursday 03 September 2015 05:48 PM, Upayavira wrote: > > > > On Thu, Sep 3, 2015, at 11:32 AM, shahper wrote: > >> Hi, > >> > >> I have setup solr when I am clicking on logging there nothing coming. > >> Its just showing loading. > > Have

Re: Custom merge logic in SolrCloud.

2015-09-03 Thread Mohan gupta
Folks, Really looking forward to any help on this. On Tue, Sep 1, 2015 at 8:39 PM, Mohan gupta wrote: > *Bump* > > On Tue, Sep 1, 2015 at 1:17 AM, Mohan gupta > wrote: > >> Hi Folks, >> >> I need to merge docs received from multiple shards via a

Local Params for Stats field

2015-09-03 Thread Iana Bondarska
Hi, I'm trying to use localparams for stats component on Solr 4.4, exact query: q=*:*=hotel_reviews=hotel_reviews=checkout_date:[* TO *]={!tag=period1}checkout_date:[2011-12-25T00:00:00.000Z TO 2012-01-02T00:00:00.000Z}={!tag=period2}checkout_date:[2011-12-25T00:00:00.000Z TO

RE: Rules for pre-processing queries

2015-09-03 Thread Siamak Rowshan
Thanks Arcadius! Great and helpful article. Siamak Rowshan | Software Engineer Softmart | 450 Acorn Lane Downingtown, PA 19335 P | 888-763-8627 siamak.rows...@softmart.com EEO Employer/Protected Veteran/Disabled The information in this

Re: initializing parameters of an alayzer

2015-09-03 Thread Ahmet Arslan
Hi Roxana, Although it is possible to plug Lucene analyser in schema.xml, it is not the recommended approach. Is there a special reason for doing that? I suggest you to divide your analyser into pieces (tokeniser, token filter, char filter) and switch to the usual usage. Then you can read any

Re: String bytes can be at most 32766 characters in length?

2015-09-03 Thread Zheng Lin Edwin Yeo
Thanks for your advice Alexandre. On 3 September 2015 at 20:29, Alexandre Rafalovitch wrote: > Probably because your signatureField and your fields are the same! You > need to point signatureField at a new (not-ID) field. > > You will still get duplicates, as you requested

initializing parameters of an alayzer

2015-09-03 Thread Roxana Danger
Hello, I constructed a custom analyser that needs to get some data from a file. Is there any attribute in the analyzer element of the schema.xml that I can use to initialize the parametersf my analyzer? Thank you very much in advance, Roxana --

Re: Position of Document in Listing (Search Result)

2015-09-03 Thread Erick Erickson
OK, but how are they ordered on the public pages? I'm guessing that what I'm missing here is something like "If I did an identical search on the public pages, my car would be number 208" _Something_ has to glue the two separate pages together. If it's something like that I don't see a good way to

Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread scott chu
solr-user,妳好 I keep forgeting to mention one thing along the discussion session. Our data is Chinese news articles and we use CJK tokenizer (i.e. 2-gram) currently. The time spent to indexing is quite slow, compared to indexing english articles. That's why I am so worrying about indexing

Re: Error in creating a new collection

2015-09-03 Thread Kevin Lee
Configuration upload to zookeeper and collection creation are two separate things, although they can be accompished at the same time using /bin/solr. You can upload configurations before you create collections and you can have mutiple configurations uploaded to zookeeper at the same time. I

RE: Merging documents from a distributed search

2015-09-03 Thread Markus Jelsma
Hello - another current topic is also covering this issue, you may want to check it out: http://lucene.472066.n3.nabble.com/Merging-documents-from-a-distributed-search-td4226802.html -Original message- > From:Markus Jelsma > Sent: Thursday 3rd September

Re: Position of Document in Listing (Search Result)

2015-09-03 Thread Upayavira
Here's the way to do it, based on Alexandre's idea: Write a PostFilter which takes in a document id. It lets through all documents until it sees that document id. Once it sees it, it stops letting them through. Thus, the total count of documents would be the position of your queried car. Then,

RE: Merging documents from a distributed search

2015-09-03 Thread tedsolr
Markus, did you mistakingly post a link to this same thread? -- View this message in context: http://lucene.472066.n3.nabble.com/Merging-documents-from-a-distributed-search-tp4226802p4227035.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Merging documents from a distributed search

2015-09-03 Thread tedsolr
Thanks Joel, that link looks promising. The CloudSolrStream bypasses my issue of multiple shards. Perhaps the ReducerStream would provide what I need. At first glance I worry that the the buffer would grow too large - if its really holding the values for all the fields in each document

Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread scott chu
solr-user,妳好 If you switch to SolrCloud, will you still keep numShards parameter to 1? If you are migrating to SolrCloud and going to split that single shard into multple shards, Wouldn't you have to reindex the data? Is it possible just put that single shard into SolrCloud and call

Re: Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread scott chu
solr-user,妳好 No, both. But first I have to face the indexing performance problem. Where can I see information about concurrent/parallel indexing on Solr? Thanks in advance. - Original Message - From: Toke Eskildsen To: solr_user lucene_apache Date: 2015-09-04, 00:57:51 Subject: Re:

Re: Re: Re: Re: Re: Re: concept and choice: custom sharding or autosharding?

2015-09-03 Thread scott chu
solr-user,妳好 Thanks for the info. I also find this: Parallel indexing for Apache Solr Search Integration | Nick Veenhof http://nickveenhof.be/blog/parallel-indexing-apache-solr-search-integration - Original Message - From: Toke Eskildsen To: solr_user lucene_apache Date: 2015-09-04,

Re: Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Erick Erickson
Ah, that may make my suggestions unworkable re: just reindexing. Still, how much time are we talking about here? I've very often found that indexing performance isn't gated by the Solr processing, but by whatever is feeding Solr. A quick test is to fire up your indexing and see if the CPU

Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Toke Eskildsen
scott chu wrote:   > I keep forgeting to mention one thing along the discussion session. > Our data is Chinese news articles and we use CJK tokenizer > (i.e. 2-gram) currently. The time spent to indexing is quite slow, > compared to indexing english articles. That's why I

Re: Error in creating a new collection

2015-09-03 Thread Erick Erickson
bq: The configuration should be removed from Zookeeper if the collection was not created due to an error... Well, how about an enhancement? It is a bit confusing that if the configset already exists in ZK, then the second attempt to create a collection with the same configset fails because

Re: Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Toke Eskildsen
scott chu wrote: > No, both. But first I have to face the indexing performance problem. > Where can I see information about concurrent/parallel indexing on Solr? Depends on how you index. If you use a Java program,

Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Erick Erickson
bq: If you switch to SolrCloud, will you still keep numShards parameter to 1 yes. Although if you want to add more replicas you might want to specify that. For 10M documents, I wouldn't be very fancy. Indexing them shouldn't take very long, and I think your time would be better spent on other

Re: Local Params for Stats field

2015-09-03 Thread Chris Hostetter
: I'm trying to use localparams for stats component on Solr 4.4, exact query: : q=*:*=hotel_reviews=hotel_reviews=checkout_date:[* TO : *]={!tag=period1}checkout_date:[2011-12-25T00:00:00.000Z TO : 2012-01-02T00:00:00.000Z}={!tag=period2}checkout_date:[2011-12-25T00:00:00.000Z : TO :

Re: Position of Document in Listing (Search Result)

2015-09-03 Thread Alexandre Rafalovitch
So, basically for each car, you want to generate a query with the same parameter (e.g. make) and then say where in the results for that query, your particular car would be. Right? I think the only way is to run the query and to see where the car is in the result. So, a custom code of some sort.

Re: Why is Process Total Time greater than Elapsed Time?

2015-09-03 Thread Chris Hostetter
depends on where you are reading "Process Total Time" from. that terminology isn't something i've ever sen used in the context of solr (fairly certain nothing in solr refers to anything that way) QTime is the amount of time spent processing a request before it starts being written out over

Re: Custom merge logic in SolrCloud.

2015-09-03 Thread tedsolr
I am facing a similar issue. See this thread -- View this message in context: http://lucene.472066.n3.nabble.com/Custom-merge-logic-in-SolrCloud-tp4226325p4227073.html Sent from the Solr - User

Re: Why is Process Total Time greater than Elapsed Time?

2015-09-03 Thread Scott Stults
Thanks Hoss, sorry I wasn't clear. By Process Total Time I mean this structure in the debug response: debug timing process time Elapsed time is what I get from SolrJ's API: SolrClient.quey().getElapsedTime(). So I really expect elapsed time to be the greatest duration of all

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Renee Sun
hum... at beginning I also assumed segment index files will only be deleted or added, but not modified. But I did a test with heavy indexing on going, and observed the index file in [core]/index with a latest updated timestamp keep growing for about 7 minutes... not sure if the new write caused

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Renee Sun
[core]/index is a folder holding index files. But index files in that folder is not just being deleted or added, they are also being updated. on Linux file system, the folder's timestamp will only be updated if the files in it is being added or deleted, NOT updated. So if I check the index

Re: Cached fq decreases performance

2015-09-03 Thread Jeff Wartes
I’m measuring performance in the aggregate, over several minutes and tens of thousands of distinct queries that all use this specific fq. The cache hit count reported is roughly identical to the number of queries I’ve sent, so no, this isn’t a first-query cache-miss situation. The fq result will

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Upayavira
in a lucene index, files are never updated, only ever added or deleted. You may well be able to use the ReplicationHandler to answer that question for you, as it can tell stuff about an index for the purpose of replicating it - I'm not sure what precisely it tells. Why do you need to know this?

Re: Position of Document in Listing (Search Result)

2015-09-03 Thread Alexandre Rafalovitch
That's a good point. What is the query sorting on? Shayan, can you give an example of a query with sorting/etc shown. Regards, Alex. Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: http://www.solr-start.com/ On 3 September 2015 at 16:24, Chris Hostetter

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Toke Eskildsen
Renee Sun wrote: > [core]/index is a folder holding index files. Agree so far. > But index files in that folder is not just being deleted or added, they are > also being updated. So you say. That contradicts my understanding, as well as the first 10 hits in Google for

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Upayavira
what matters here is the time between hard commits. If you do infrequent hard commits, then it is possible that files will be written to over that time. Those files are part-complete segment files, and are not yet referred to by the segments file, and thus are not really yet a part of the index. A

Re: Merging documents from a distributed search

2015-09-03 Thread Upayavira
On Wed, Sep 2, 2015, at 10:12 PM, tedsolr wrote: > I've read from http://heliosearch.org/solrs-mergestrategy/ > that the AnalyticsQuery > component only works for a single instance of Solr. I'm planning to > "migrate" to the SolrCloud soon and I

Re: Position of Document in Listing (Search Result)

2015-09-03 Thread Chris Hostetter
: Write a PostFilter which takes in a document id. It lets through all : documents until it sees that document id. Once it sees it, it stops : letting them through. : : Thus, the total count of documents would be the position of your queried : car. Sorry guys, that won't work. PostFilter's can

Why is Process Total Time greater than Elapsed Time?

2015-09-03 Thread Scott Stults
>From what I can tell, each component processes the request sequentially. So how can I see an Elapsed Time of 750ms (SolrJ client) and a Process Total Time of 1300ms? Does the Process Total Time add up the amount of time each leaf reader takes, or some other concurrent things? Thank you, Scott

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Toke Eskildsen
Renee Sun wrote: > But I did a test with heavy indexing on going, and observed the index file > in [core]/index with a latest updated timestamp keep growing for about 7 > minutes... That is not a file, but the folder that holds the immutable segment files. What you observe

Cached fq decreases performance

2015-09-03 Thread Jeff Wartes
I have a query like: q==enabled:true For purposes of this conversation, "fq=enabled:true" is set for every query, I never open a new searcher, and this is the only fq I ever use, so the filter cache size is 1, and the hit ratio is 1. The fq=enabled:true clause matches about 15% of my

Socket \ Connection Timeout Values

2015-09-03 Thread Arnon Yogev
Hi, I wanted to ask about the implications of different timeout values one can use. For example: >From what I see in the code, the default socket timeout value for Solr is 0. Does that mean Solr nodes will wait to update \ receive update from each other without any timeout? In other words,

Re: Position of Document in Listing (Search Result)

2015-09-03 Thread Shayan Haque
Let me give an example: for suppose we have "public pages" which list cars for makes (they use q=make:Chery or q=make:Toyota etc...) and list cars for all members. Now suppose, in our member account area, there is already a page that lists cars that the current logged in member had posted. In

Re: Error in creating a new collection

2015-09-03 Thread Shalin Shekhar Mangar
Did you upload the modified schema.xml to ZooKeeper? If you had already (unsuccessfully) attempted to create a collection called "test2" then you must first delete that collection (using the collection API) and create it fresh again. On Thu, Sep 3, 2015 at 12:28 PM, shacky

Elevation working for some queries, not others

2015-09-03 Thread Sébastien Muller
Hi! I have a Solr 4.10.4 installation where elevation is behaving strangely i.e. some queries correctly elevate the appropriate doc(s) while others don't. I've debugged the queries and other than the query term itself they are identical. Also confirmed that the doc that should be elevated actually

RE: Merging documents from a distributed search

2015-09-03 Thread Markus Jelsma
Hello - We're doing something similar ended up overriding QueryComponent (https://issues.apache.org/jira/browse/SOLR-7968) which needs protected members instead of private members first. We could do a RankQuery and use its cool MergeStrategy, but we would also ened RankQuery to provide an entry

Re: Socket \ Connection Timeout Values

2015-09-03 Thread Shawn Heisey
On 9/3/2015 12:06 AM, Arnon Yogev wrote: > I wanted to ask about the implications of different timeout values one can > use. > > For example: > From what I see in the code, the default socket timeout value for Solr is > 0. > Does that mean Solr nodes will wait to update \ receive update from

Error in creating a new collection

2015-09-03 Thread shacky
Hi, I have a three nodes Solr 5.3.0 cluster with Zookeeper 3.4.5. New collections are creating without any problem, so the cluster seems to work well. I created a new collection "test2" using an old configuration, and I got this error: enablePositionIncrements is not a valid option as of Lucene

Re: which solrconfig.xml

2015-09-03 Thread Mikhail Khludnev
Hello, fwiw, a handy tool to answer such questions is $lsof -p On Wed, Sep 2, 2015 at 11:03 PM, Mark Fenbers wrote: > Hi, I've been fiddling with Solr for two whole days since > downloading/unzipping it. I've learned a lot by reading 4 documents and > the web site.

Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Charlie Hull
On 02/09/2015 15:47, scott chu wrote: > I post a question on Stackoverflow > http://stackoverflow.com/questions/32343813/custom-sharding-or-auto-sharding-on-solrcloud: > However, since this is a mail-list, I repost the question below to request > for suggestion and more subtle concept of

Re: Error in creating a new collection

2015-09-03 Thread shacky
Hi Shalin, thank you very much for your answer. I found out and managed in recreating the problem. I created a new collection, with the wrong configset. I got the error and the collection was not created, good. But after that I continue to see the "SolrCore Initialization Failures" in the Solr

Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Toke Eskildsen
On Wed, 2015-09-02 at 08:30 -0700, Erick Erickson wrote: > Because I routinely see 50M docs on a single node and I've seen over 300M docs > on a single node with sub-second responses. For what it's worth, we also do article-based search of newspaper based material (old OCR'ed papers). We use a

Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Scott Chu
Do you use master-slave or SolrCloud for that single shard? Erick suggest that I can still can use SolrCloud for HA/DR purpose cause Zookeeper can do the work for me. Should I just give up master-slave choice even there's only one single shard? Scott Chu,scott@udngroup.com 2015/9/3 -

Re: Stemming words Using Solr

2015-09-03 Thread Upayavira
On Thu, Sep 3, 2015, at 11:19 AM, Ritesh Sinha wrote: > I am learning solr and want to use solr for stemming words.I'll be > passing > the word to the solr and it should send the stemmed word back.I know how > to > configure solr core for different stemming patterns and also i am able to > view

Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Toke Eskildsen
On Thu, 2015-09-03 at 18:24 +0800, Scott Chu wrote: > Do you use master-slave or SolrCloud for that single shard? Due to legacy reasons we are just using 2 fully independent Solrs, each indexing independently, with an Apache load balancer in front for the searches. It does give us the occasional

Not able to get logging in solr

2015-09-03 Thread shahper
Hi, I have setup solr when I am clicking on logging there nothing coming. Its just showing loading. Shahper

Stemming words Using Solr

2015-09-03 Thread Ritesh Sinha
I am learning solr and want to use solr for stemming words.I'll be passing the word to the solr and it should send the stemmed word back.I know how to configure solr core for different stemming patterns and also i am able to view their stemmed words in the analyzer (solr admin ui) but i am not

SOLR last modified different than filesystem last modified

2015-09-03 Thread sat
I am indexing a bunch of PDF's, Docs, PPT's etc with Solr 4.10.1. When I retrieve the records each has a last-modified value, but the value is different than what I see on the file system (a difference of exact 2 hours). Aren't those supposed to match, and if not, how can I retrieve the same

Re: Not able to get logging in solr

2015-09-03 Thread Upayavira
On Thu, Sep 3, 2015, at 11:32 AM, shahper wrote: > Hi, > > I have setup solr when I am clicking on logging there nothing coming. > Its just showing loading. Have a look in your browser's developer tools and see what happens to the request for logging information. If you give us more

Re: SOLR last modified different than filesystem last modified

2015-09-03 Thread Toke Eskildsen
On Thu, 2015-09-03 at 04:10 -0700, sat wrote: > I am indexing a bunch of PDF's, Docs, PPT's etc with Solr 4.10.1. When I > retrieve the records each has a last-modified value, but the value is > different than what I see on the file system (a difference of exact 2 > hours). I guess your local

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Walter Underwood
Instead of writing new code, you could configure an autocommit interval in Solr. That already does what you want, no more than one commit in the interval and no commits if there were no adds or deletes. Then the clients would never need to commit. wunder Walter Underwood wun...@wunderwood.org

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Erik Hatcher
/admin/luke can give you a lastModified time stamp. The Solr admin UI makes a request to display this on the core overview screen, making a request like this: http://localhost:8983/solr//admin/luke?wt=json=index=0 and the

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Renee Sun
thank you! I will look into that. Also I came across autosoftcommit, it seems to be useful... we are still using solr 3.5, I hope autosoftcommit is included in solr 3.5... -- View this message in context:

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Renee Sun
Walter, thanks! I will do some tests using auto commit, I guess if there is requirement for console UI to make documents searchable in 10 minutes, we will need to use the autocommit with maxTime instead of maxDoc. I wonder if in case we need to do a 'force commit', the autocommit will not get

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Alexandre Rafalovitch
Put the IgnoreCommit on the default handler to stop clients from forcing the commit: http://www.solr-start.com/javadoc/solr-lucene/org/apache/solr/update/processor/IgnoreCommitOptimizeUpdateProcessorFactory.html Then have a separate normal handler and send your real commits through that if you

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Erick Erickson
I'm pretty sure, soft commits didn't come along until 4.0 Best, Erick On Thu, Sep 3, 2015 at 4:24 PM, Erik Hatcher wrote: > /admin/luke can give you a lastModified time stamp. The Solr admin UI makes > a request to display this on the core overview screen, making a

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Renee Sun
this make sense now. Thanks! why I got on this idea is: In our system we have large customer base and lots of cores, each customer may have multiple cores. there are also a lot of processes running in our system processing the data for these customers, and once a while, they would ask a center

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Shawn Heisey
On 9/3/2015 4:20 PM, Renee Sun wrote: > Now I just discovered some of the processes send in large amount of commit > requests on many cores which never had any changes in the last interval. If you are using a Solr version released in the last two years (at least version 4.4), then a commit on an

Order of hosts in zkHost

2015-09-03 Thread Arcadius Ahouansou
Hello. Let's say we have 10 SolrJ clients all configured with zkhost=zk1:port,zk2:port,zk3:port For each of the 10 SolrJ clients, would it make a difference in term of load on zk1 (the server on the list) if we shuffle around the order of the ZK servers in zkHost or is it all the same? I would

Re: Difference between Legacy Facets and JSON Facets

2015-09-03 Thread Zheng Lin Edwin Yeo
Hi Yonik, What do you mean by the "counts-only" case? Regards, Edwin On 3 September 2015 at 21:17, Yonik Seeley wrote: > On Wed, Sep 2, 2015 at 2:44 PM, Toke Eskildsen > wrote: > > When incrementing counters for String faceting, segment ordinal

Re: Custom merge logic in SolrCloud.

2015-09-03 Thread Mohan gupta
Hi Folks, Please help me with this. On Thu, Sep 3, 2015 at 7:34 PM, Mohan gupta wrote: > Folks, > > Really looking forward to any help on this. > > On Tue, Sep 1, 2015 at 8:39 PM, Mohan gupta > wrote: > >> *Bump* >> >> On Tue, Sep 1, 2015 at

Zookeeper Quorum leader election

2015-09-03 Thread Arcadius Ahouansou
We have a quorum of 3 ZK nodes zk1, zk2 and zk3. All nodes are identicals. After multiple restart of the ZK nodes, always keeping the majority of 2, we have noticed that the node zk1 has never become the leader. Only zk2 and zk3 become leader. 1) Is there any know reason or possible

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Renee Sun
unfortunately we are still using solr 3.5 with lucene 2.9.3 :-( If we upgrade to solr 4.x it will require upgrade of lucene away from 2.x.x which will need re-index of all our data. With current measures, it might take about 8-9 for the data we have to be re-indexed, a big concern. so to

Re: any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Shawn Heisey
On 9/3/2015 10:00 PM, Renee Sun wrote: > unfortunately we are still using solr 3.5 with lucene 2.9.3 :-( If we > upgrade to solr 4.x it will require upgrade of lucene away from 2.x.x > which will need re-index of all our data. With current measures, it > might take about 8-9 for the data we have

Re: Stemming words Using Solr

2015-09-03 Thread Ritesh Sinha
Yeah, I got. Thanks. It returns a json which have the stemmed words.I just need to parse it and get the value. But, isn't there any JAVA API available for it ? On Thu, Sep 3, 2015 at 7:58 PM, Upayavira wrote: > yes, the URL should be something like: > > >

Re: http client mismatch

2015-09-03 Thread Shawn Heisey
On 9/3/2015 8:11 PM, Firas Khasawneh wrote: > I am trying to use SolrHttpClient in solrj 5.3.0 but I am getting the > following execption: > > Caused by: java.lang.VerifyError: Bad return type > Exception Details: > Location: > >

any easy way to find out when a core's index physical file has been last updated?

2015-09-03 Thread Renee Sun
I will need to figure out when was last index activity on a core. I can't use [corename]/index timestamp, because it only reflex the file deletion or addition, not file update. I am curious if any solr core admin RESTful api sort of thing thing I can use to get last modified timestamp on

RE: Merging documents from a distributed search

2015-09-03 Thread Markus Jelsma
It seems so indeed. Please look up the thread titled "Custom merge logic in SolrCloud." -Original message- > From:tedsolr > Sent: Thursday 3rd September 2015 21:28 > To: solr-user@lucene.apache.org > Subject: RE: Merging documents from a distributed

Re: Cached fq decreases performance

2015-09-03 Thread Alexandre Rafalovitch
FQ has to calculate the result bit set for every document to be able to cache it. Q will only calculate it for the documents it matches on and there is some intersection hopping going on. Are you seeing this performance hit on first query only or or every one? I would expect on first query only

Re: http client mismatch

2015-09-03 Thread Shawn Heisey
On 9/3/2015 11:04 PM, Shawn Heisey wrote: > It sounds like your code is trying to assign the result of the > createClient method to an object of type SystemDefaultHttpClient. This > is a derivative type of CloseableHttpClient. This would work if the > derivation were the other direction. > >

Re: Difference between Legacy Facets and JSON Facets

2015-09-03 Thread Yonik Seeley
On Wed, Sep 2, 2015 at 2:44 PM, Toke Eskildsen wrote: > When incrementing counters for String faceting, segment ordinal -> index > ordinal mapping takes place. Legacy facets has a mechanism where temporary > segment-specific counters are used. These are updated

Re: Not able to get logging in solr

2015-09-03 Thread shahper
On Thursday 03 September 2015 05:48 PM, Upayavira wrote: On Thu, Sep 3, 2015, at 11:32 AM, shahper wrote: Hi, I have setup solr when I am clicking on logging there nothing coming. Its just showing loading. Have a look in your browser's developer tools and see what happens to the request for

Re: Stemming words Using Solr

2015-09-03 Thread Ritesh Sinha
Hi, I observed the inspect element and wrote a code to give back the content. I have included the url which was getting generated. public class URLConnectionReader { public static void main(String[] args) throws Exception { URL solr = new URL( "

Re: String bytes can be at most 32766 characters in length?

2015-09-03 Thread Alexandre Rafalovitch
Probably because your signatureField and your fields are the same! You need to point signatureField at a new (not-ID) field. You will still get duplicates, as you requested that in your other emails, but now you would be able to group on that new signature field. If you have any further

Re: Order of hosts in zkHost

2015-09-03 Thread Shawn Heisey
On 9/3/2015 9:47 PM, Arcadius Ahouansou wrote: > Let's say we have 10 SolrJ clients all configured with > zkhost=zk1:port,zk2:port,zk3:port > > For each of the 10 SolrJ clients, would it make a difference in term of > load on zk1 (the server on the list) if we shuffle around the order of the > ZK

Re: Issue Using Solr 5.3 Authentication and Authorization Plugins

2015-09-03 Thread Noble Paul
Both these are committed. If you could test with the latest 5.3 branch it would be helpful On Wed, Sep 2, 2015 at 5:11 PM, Noble Paul wrote: > I opened a ticket for the same > https://issues.apache.org/jira/browse/SOLR-8004 > > On Wed, Sep 2, 2015 at 1:36 PM, Kevin Lee

http client mismatch

2015-09-03 Thread Firas Khasawneh
Hi, I am trying to use SolrHttpClient in solrj 5.3.0 but I am getting the following execption: Caused by: java.lang.VerifyError: Bad return type Exception Details: Location: