Re: Adding a new shard

2016-04-15 Thread Jay Potharaju
I found ticket https://issues.apache.org/jira/browse/SOLR-5025 which talks about sharding in solrcloud. Are there any plans to address this issue in near future? Can any of the users on the forum comment how they are handling this scenario in production? Thanks On Fri, Apr 15, 2016 at 4:28 PM,

dataimport db-data-config.xml

2016-04-15 Thread kishor
I am try to run two pgsql query on same data-source. is this possible in db-data-config.xml. This code is not working please suggest any more example -- View this message in context:

dataimport db-data-config.xml

2016-04-15 Thread kishor
I am try to run two pgsql query on same data-source. is this possible in db-data-config.xml. This code is not working please suggest any more example -- View this message in context:

Re: Getting duplicate output while doing auto suggestion based on multiple filed using copy filed in solr 5.5

2016-04-15 Thread Chris Hostetter
I can't explain the results you are seeing, but you also didn't provide us with your schema.xml (ie; how are "text" and "text_auto" defined?) or enough details ot try and reproduce on a small scale (ie: what does the source data look like in the documents where these suggestion values are

Re: MiniSolrCloudCluster usage in solr 7.0.0

2016-04-15 Thread Chris Hostetter
: At first, I saw the same exception you got ... but after a little while : I figured out that this is because I was running the program more than : once without deleting everything in the baseDir -- so the zookeeper : server was starting with an existing database already containing the :

Adding a new shard

2016-04-15 Thread Jay Potharaju
Hi, I have an existing collection which has 2 shards, one on each node in the cloud. Now I want to split the existing collection into 3 shards because of increase in volume of data. And create this new shard on a new node in the solrCloud. I read about splitting a shard & creating a shard, but

SOLR-3666

2016-04-15 Thread Jay Potharaju
Hi, I am using solrCloud with DIH for indexing my data. Is it possible to get status of all my DIH across all nodes in the cloud? I saw this jira ticket from couple of years ago. https://issues.apache.org/jira/browse/SOLR-3666 Can any of contributors comment on whether this would be resolved? The

Re: Question on Solr JDBC driver with SQL client like DB Visualizer

2016-04-15 Thread Joel Bernstein
Ok, I think I know the problem you're running into. You'll need to load the solr-solrj jar after loading the jars in the solrj-lib. Otherwise DbVis seems to get confused and lose the driver class. We'll work on putting out a single jar for the JDBC driver. Joel Bernstein

Re: MiniSolrCloudCluster usage in solr 7.0.0

2016-04-15 Thread Shawn Heisey
On 4/14/2016 8:32 AM, Rohana Rajapakse wrote: > I have added few dependency jars into my project. There are no compilation > errors or ClassNotFound exceptions, but Zookeeper exception " > KeeperException$NodeExistsException: KeeperErrorCode = NodeExists for > /solr/solr.xml ". My temporary

Re: Question on Solr JDBC driver with SQL client like DB Visualizer

2016-04-15 Thread Joel Bernstein
What version of DbVisualizer are you using? When I tested I was using the latest version. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Apr 15, 2016 at 12:47 PM, Reth RM wrote: > output of command : > > org/apache/solr/client/solrj/io/sql/ >

Re: Getting duplicate output while doing auto suggestion based on multiple filed using copy filed in solr 5.5

2016-04-15 Thread Tejas Bhanushali
For more info PFA config file . URL http://localhost:8983/solr/products/suggest?suggest=true=true=json=Fruit=none On Fri, Apr 15, 2016 at 11:18 PM, Tejas Bhanushali < contact.tejasbhanush...@gmail.com> wrote: > Hi Team, > > Im getting the duplicate result when i do auto suggestion based on >

Getting duplicate output while doing auto suggestion based on multiple filed using copy filed in solr 5.5

2016-04-15 Thread Tejas Bhanushali
Hi Team, Im getting the duplicate result when i do auto suggestion based on multiple filed by using copy filed . i have below table configuration . Segment -- have multiple category -- have multiple sub category -- have multiple products. suggestion are given based on segment name, category

Re: Adding replica on solr - 5.50

2016-04-15 Thread Jay Potharaju
I have multiple solr instances running in my dev sandbox. When adding a replica i was passing the host IP instead of 127.0.1.1 which is recorded in the live nodes section. Thanks Eric for pointing that out. Working URL:

Re: Adding replica on solr - 5.50

2016-04-15 Thread John Bickerstaff
Oh, and what, if any directories need to exist for the ADDREPLICA command to work? Hopefully nothing past the already existing /var/solr/data created by the Solr install script? On Fri, Apr 15, 2016 at 11:18 AM, John Bickerstaff wrote: > Oh, and what, if any

Re: Adding replica on solr - 5.50

2016-04-15 Thread John Bickerstaff
Oh, and what, if any directories need to exist for the ADDREPLICA On Fri, Apr 15, 2016 at 11:09 AM, John Bickerstaff wrote: > Thanks again Eric - I'm going to be trying the ADDREPLICA again today or > Monday. I much prefer that to hand-edit hackery... > > Thanks also

Re: Adding replica on solr - 5.50

2016-04-15 Thread John Bickerstaff
Thanks again Eric - I'm going to be trying the ADDREPLICA again today or Monday. I much prefer that to hand-edit hackery... Thanks also for pointing out that cURL makes it "scriptable"... On Fri, Apr 15, 2016 at 10:50 AM, Erick Erickson wrote: > bq: Shouldn't this:

Re: Re: solr 5.2.1, data import issue, shown processed rows doesn't match acturally indexed doc quantity.

2016-04-15 Thread Erick Erickson
The simplest test to see if there are duplicates is to check the maxDoc and numDocs in the admin UI. If they're different then you have duplicates. NOTE: this is not definitive, and you MUST NOT run optimize before you look. But it's quick. I'd delete all docs before trying this first though.

Re: Adding replica on solr - 5.50

2016-04-15 Thread Erick Erickson
bq: Shouldn't this: =x.x.x.x:9001_solr Actually be this? =x.x.x.x:9001/solr (Note the / instead of _ )

Re: Question on Solr JDBC driver with SQL client like DB Visualizer

2016-04-15 Thread Reth RM
output of command : org/apache/solr/client/solrj/io/sql/ META-INF/services/java.sql.Driver org/apache/solr/client/solrj/io/sql/ConnectionImpl.class org/apache/solr/client/solrj/io/sql/DatabaseMetaDataImpl.class org/apache/solr/client/solrj/io/sql/DriverImpl.class

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-15 Thread Walter Underwood
I looked at the PHP clients a couple of years ago and they didn’t seem to add much. I wrote PHP code to make GET requests to Solr and parse the JSON response. It wasn’t much more code than doing it with a client library. The client libraries don’t really do much for you. They can’t even keep

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-15 Thread Sara Woodmansee
Hi Shawn, No clue what PHP client they are using. Thanks for the info! Sara > On Apr 15, 2016, at 10:35 AM, Shawn Heisey wrote: > > On 4/15/2016 8:15 AM, Sara Woodmansee wrote: >> When I suggested the developer consider upgrading to v5.5 or 6.0 (from >> v3.6), this was

Re: Question on Solr JDBC driver with SQL client like DB Visualizer

2016-04-15 Thread Kevin Risden
> > Page 11, the screenshot specifies to select a > "solr-solrj-6.0.0-SNAPSHOT.jar" which is equivalent into > "solr-solrj-6.0.0.jar" shipped with released version, correct? > Correct the PDF was generated before 6.0.0 was released. The documentation from SOLR-8521 is being migrated to here:

RE: Shard ranges seem incorrect

2016-04-15 Thread Markus Jelsma
Thanks both. I completely missed Shawn's response. -Original message- > From:Chris Hostetter > Sent: Thursday 14th April 2016 22:48 > To: solr-user@lucene.apache.org > Subject: RE: Shard ranges seem incorrect > > > : Hi - bumping this issue. Any thoughts

Re: Solr best practices for many to many relations...

2016-04-15 Thread Jack Krupansky
And it may also be that there are whole classes of user for whom denormalization is just too heavy a cross to bear and for who a little extra money spent on more hardware is a great tradeoff. And... Lucene's indexing may be superior to your average SQL database, so that a Solr JOIN could be so

Re: Anticipated Solr 5.5.1 release date

2016-04-15 Thread Tom Evans
Awesome, thanks :) On Fri, Apr 15, 2016 at 4:19 PM, Anshum Gupta wrote: > Hi Tom, > > I plan on getting a release candidate out for vote by Monday. If all goes > well, it'd be about a week from then for the official release. > > On Fri, Apr 15, 2016 at 6:52 AM, Tom Evans

Re: Question on Solr JDBC driver with SQL client like DB Visualizer

2016-04-15 Thread Joel Bernstein
Can you post the output from the command below. Notice the diver classes in the trunk snapshot on my desktop. jar -tvf solr-solrj-7.0.0-SNAPSHOT.jar | grep sql 0 Sun Apr 03 20:20:28 EDT 2016 org/apache/solr/client/solrj/io/sql/ 842 Sun Apr 03 20:20:28 EDT 2016

Re: Anticipated Solr 5.5.1 release date

2016-04-15 Thread Anshum Gupta
Hi Tom, I plan on getting a release candidate out for vote by Monday. If all goes well, it'd be about a week from then for the official release. On Fri, Apr 15, 2016 at 6:52 AM, Tom Evans wrote: > Hi all > > We're currently using Solr 5.5.0 and converting our regular

Re: Can a field be an array of fields?

2016-04-15 Thread Jack Krupansky
It all depends on what your queries look like - what input data does your application have and what data does it need to retrieve. My recommendation is that you store first name and last name as separate, multivalued fields if you indeed need to query by precisely a first or last name, but also

Re: Solr best practices for many to many relations...

2016-04-15 Thread Joel Bernstein
I think people are going to be surprised though by the speed of the joins. The joins also get faster as the number of shards, replicas and worker nodes grow in the cluster. So we may see people building out large clusters and and using the joins in OLTP scenarios. Joel Bernstein

RE: Can a field be an array of fields?

2016-04-15 Thread Davis, Daniel (NIH/NLM) [C]
Short answer - JOINs, external query outside Solr, Elastic Search ;) Alternatives: * You get back an id for each document when you query on "Nino". You look up the last names in some other system that has the full list. * You index the authors in another collection and use JOINs * You

Re: Adding docValues in schema - Solr Cloud 4.8.1

2016-04-15 Thread Vincenzo D'Amore
Thanks Shawn, just to confirm your claim. Following your suggestion I have double checked my queries with grouping and faceting. Faceting and grouping became empty immediately after I have added docValues . Thanks again for your support, Vincenzo On Fri, Apr 15, 2016 at 4:31 PM, Shawn Heisey

Re: Solr best practices for many to many relations...

2016-04-15 Thread Jack Krupansky
And of course it depends on the specific queries, both in terms of what fields will be searched and which fields need to be returned. Yes, OLAP is the clear sweet spot, where taking 500 ms to 2 or even 20 seconds for a complex query may be just fine vs. OLTP/search where under 150 ms is the

Re: Solr best practices for many to many relations...

2016-04-15 Thread Joel Bernstein
In general the Streaming Expression joins are designed for interactive OLAP type work loads. So BI and data warehousing scenarios are the sweet spot. There may be scenarios where high QPS search applications will work with the distributed joins, particularly if the joins themselves are not huge.

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-15 Thread Shawn Heisey
On 4/15/2016 8:15 AM, Sara Woodmansee wrote: > When I suggested the developer consider upgrading to v5.5 or 6.0 (from v3.6), > this was their response. It’s clear that upgrading is not going to happen > any time soon. > > Developer response: "But to use SOLR 5, there is a need to find a stable

Re: Adding docValues in schema - Solr Cloud 4.8.1

2016-04-15 Thread Shawn Heisey
On 4/15/2016 7:42 AM, Vincenzo D'Amore wrote: > I would like to add docValues to few fields definition in production. > > I first tried in a test environment during partial reindexing and it seems > have no effect, (i.e. no real benefits with small number of documents to > reindex, 30% of total).

Re: Solr best practices for many to many relations...

2016-04-15 Thread Jack Krupansky
It will be interesting to see which use cases work best with the new streaming JOIN vs. which will remain best with full denormalization, or whether you simply have to try both and benchmark them. My impression had been that streaming JOIN would be ideal for bulk operations rather than

Solr json api,metrics calculation

2016-04-15 Thread Iana Bondarska
Hi All, could you please help me with solr metrics on json api: 1) I don't see count metric in list of supported metrics -- is it really not supported now? E.g. I have records like this: city name NY johnson LA smith NYnull LA johnson. And I want to count of names grouped by city.

Question on Solr JDBC driver with SQL client like DB Visualizer

2016-04-15 Thread Reth RM
Note: I followed the steps mentioned in the pdf attached on this Jira https://issues.apache.org/jira/browse/SOLR-8521 Page 11, the screenshot specifies to select a "solr-solrj-6.0.0-SNAPSHOT.jar" which is equivalent into "solr-solrj-6.0.0.jar" shipped with released version, correct? When I try

Re: Singular Plural Results Inconsistent - SOLR v3.6 and EnglishMinimalStemFilterFactor

2016-04-15 Thread Sara Woodmansee
Hi all, When I suggested the developer consider upgrading to v5.5 or 6.0 (from v3.6), this was their response. It’s clear that upgrading is not going to happen any time soon. Developer response: "But to use SOLR 5, there is a need to find a stable and reliable php client. And until very

Re: Solr best practices for many to many relations...

2016-04-15 Thread Joel Bernstein
You may also want to keep an eye on SOLR-8925 which supports distributed, cross collection graph traversals. This may be useful in traversing the relationships. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Apr 15, 2016 at 9:56 AM, Joel Bernstein wrote: > Solr now

Re: Solr best practices for many to many relations...

2016-04-15 Thread Joel Bernstein
Solr now has full distributed join capabilities as part of the Streaming Expression library. Keep in mind that these are distributed joins so they shuffle records to worker nodes to perform the joins. These are comparable to joins done by SQL over MapReduce systems, but they are very responsive

Anticipated Solr 5.5.1 release date

2016-04-15 Thread Tom Evans
Hi all We're currently using Solr 5.5.0 and converting our regular old style facets into JSON facets, and are running in to SOLR-8155 and SOLR-8835. I can see these have already been back-ported to 5.5.x branch, does anyone know when 5.5.1 may be released? We don't particularly want to move to

Adding docValues in schema - Solr Cloud 4.8.1

2016-04-15 Thread Vincenzo D'Amore
Dear Solr Gurus :), I would like to add docValues to few fields definition in production. I first tried in a test environment during partial reindexing and it seems have no effect, (i.e. no real benefits with small number of documents to reindex, 30% of total). So I have to wait a full

Re: Solr best practices for many to many relations...

2016-04-15 Thread Dennis Gove
The Streaming API with Streaming Expressions (or Parallel SQL if you want to use SQL) can give you the functionality you're looking for. See https://cwiki.apache.org/confluence/display/solr/Streaming+Expressions and https://cwiki.apache.org/confluence/display/solr/Parallel+SQL+Interface. SQL

Can a field be an array of fields?

2016-04-15 Thread Bastien Latard - MDPI AG
The same email, but with formatting... (email below) Forwarded Message Subject:Can a field be an array of fields? Date: Fri, 15 Apr 2016 13:51:48 +0200 From: Bastien Latard - MDPI AG To: solr-user@lucene.apache.org Hi everybody! /I

Can a field be an array of fields?

2016-04-15 Thread Bastien Latard - MDPI AG
Hi everybody! /I described a bit what I found in another thread, but I prefer to create a new thread for this specific question.../ *It's **possible to create an array of string by doing (incomplete example): - in the data-conf.xml:* *- in schema.xml:

Error starting Solr-6.0.0 in HDFS mode (in Windows 7)

2016-04-15 Thread Rohana Rajapakse
java.nio.file.InvalidPathException:java.nio.file.InvalidPathException: Illegal char <:> at index 4: hdfs:\\myserver:9000\solr It doesn't like the colon. I have tried starting solr on windows command line with: bin/solr start -Dsolr.directoryFactory=HdfsDirectoryFactory

Re: Adding replica on solr - 5.50

2016-04-15 Thread Jaroslaw Rozanski
Hi, Does the `=...` actually work for you? When attempting similar with Solr 5.3.1, despite what documentation said, I had to use `node_name=...`. Thanks, Jarek On Fri, 15 Apr 2016, at 05:48, John Bickerstaff wrote: > Another thought - again probably not it, but just in case... > > Shouldn't

Re: Solr Sharding Strategy

2016-04-15 Thread Bhaumik Joshi
Hi , Toke - I tried with pausing the indexing fully but got the slight improvement so the impact of indexing is not that much. Shawn - Answer to your question - I am sending one document in one update request. I have test solr cloud configured like 2 shards on one machine and each of has one

Re: Solr best practices for many to many relations...

2016-04-15 Thread Bastien Latard - MDPI AG
'/would I then be able to query a specific field of articles or other "table" (with the same OR BETTER performances)?/' -> And especially, would I be able to get only 1 article in the result... On 15/04/2016 09:06, Bastien Latard - MDPI AG wrote: Thanks Jack. I know that Solr is a search

Re: Solr best practices for many to many relations...

2016-04-15 Thread Bastien Latard - MDPI AG
Thanks Jack. I know that Solr is a search engine, but this replace a search in my mysql DB with this model: *My goal is to improve my environment (and my performances at the same time).* / //Yes, I have a Solr data model... but atm I created 4 different indexes for "similar service

Re:Re: solr 5.2.1, data import issue, shown processed rows doesn't match acturally indexed doc quantity.

2016-04-15 Thread cqlangyi
hi guys, thank you very much for the help. sorry been so lated to reply. 1. "commit" didn't help. after commit, the 'numFound' of "*:*" query is still the same. 2. "id" field in every doc is generated by solr using UUID, i have idea how to check if there is a duplicated one. but i

Re: Growing memory?

2016-04-15 Thread Shawn Heisey
On 4/14/2016 1:25 PM, Betsey Benagh wrote: > bin/solr status shows the memory usage increasing, as does the admin ui. > > I¹m running this on a shared machine that is supporting several other > applications, so I can¹t be particularly greedy with memory usage. Is > there anything out there that