Re: Memory Usage increases by a lot during and after optimization .

2016-01-06 Thread Zheng Lin Edwin Yeo
Hi Shawn,

Thank you for your explanation.

Yes, both of the top two processes are Solr. I have two Solr processes on
one machine now, as the second one is a replica of the first one. In the
future, the plan is to have them on separate machine.


>Subtracting SHR from RES (or in your case, Shareable from Working)
>reveals the actual memory being used, and I believe you can see this
>actual number in the Private column, which is approximately the
>difference between Working and Shareable.  If I'm right, this means that
>the actual memory usage is almost 14GB lower than Windows is reporting.

Does this means that sometimes when I see the high memory usage (can be up
to 100%), it is just a memory reporting error by Windows, but Solr is
working exactly as it should?

Regards,
Edwin


On 7 January 2016 at 04:42, Shawn Heisey  wrote:

> On 1/5/2016 11:50 PM, Zheng Lin Edwin Yeo wrote:
> > Here is the new screenshot of the Memory tab of the Resource Monitor.
> > https://www.dropbox.com/s/w4bnrb66r16lpx1/Resource%20Monitor.png?dl=0
> >
> > Yes, I found that the value under the "Working Set" column is much higher
> > than the others. Also, the value which I was previously looking at under
> > the Task Manager is under the Private Column here.
> > It says that I have about 14GB of available memory, but the "Free" number
> > is much lower, at 79MB.
>
> You'll probably think I'm nuts, but I believe everything is working
> exactly as it should.
>
> The first two processes, which I assume are Solr processes, show a
> Shareable size near 7GB each.  I have seen something similar happen on
> Linux where SHR memory is huge for the Solr process, and when this
> happens, the combination of memory numbers would turn out to be
> impossible, so I think it's a memory reporting bug related to Java, one
> that affects both Linux and Windows.
>
> Subtracting SHR from RES (or in your case, Shareable from Working)
> reveals the actual memory being used, and I believe you can see this
> actual number in the Private column, which is approximately the
> difference between Working and Shareable.  If I'm right, this means that
> the actual memory usage is almost 14GB lower than Windows is reporting.
>
> If both of the top processes are Solr, I'm not sure why you have two
> Solr processes on one machine.  One Solr instance can handle multiple
> indexes with no problem.
>
> As evidence that I'm not insane, consider the following screenshot, from
> another of my servers:
>
> https://www.dropbox.com/s/64en3sar4cr1ytj/linux-solr-mem-high-shr.png?dl=0
>
> On the screenshot, the solr process shows RES size of 22GB ... which is
> highly unusual, because this Solr install has a max heap of 8GB ... but
> notice that SHR is 13GB.  The difference between 22GB and 13GB is 9GB,
> which is much more reasonable, and if we assume that the 22GB is rounded
> up and/or the 13GB is rounded down, then the difference is much closer
> to 8GB.  Looking at some other numbers, the "cached" value is 48GB.  If
> you add the 48GB cache allocation to the *reported* resident size of
> 22GB for Solr, you get a total of 70GB ... which is more memory than the
> machine even has (64GB).  This is why I am sure that when SHR is really
> high on a Java process, it is a memory reporting error.
>
> Thanks,
> Shawn
>
>


Parent Child Schema Design

2016-01-06 Thread Pranaya Behera

Hi,
 I have read yonik.com/solr-nested-objects/ which states that there 
is no need for additional schema changes other than having a _root_ 
which is already present in the schema.xml. But it never specified on 
for the child elements what would the schema look like. And the post 
actually uses curl with json to index data to solr. I am using python 
client to index data to solr.


I have products as the core. This is one document but it has more 
interlinked child documents. As of now it is a single flat structure 
schema. But if I would like to use the parent-child relationship how 
would I go about it. Sample current schema:


required="true" multiValued="false" />



multiValued="true"/>
required="false" multiValued="false" />
required="false" />


Now I would like to add child document to it. Lets say I would like to 
add another field named steps which will contain id, product_id, name, 
description. This steps would be a multivalued as per product we have 
multiple steps.


Can someone help me figure out how to go about this ?

--
Thanks & Regards
Pranaya Behera



Re: Query behavior difference.

2016-01-06 Thread Modassar Ather
Thanks for your responses.

Best,
Modassar

On Wed, Jan 6, 2016 at 9:27 PM, Jack Krupansky 
wrote:

> The motivation for the constant-score rewrite is simply performance. As per
> the Javadoc:
>
> "*This method is faster than the BooleanQuery rewrite methods when the
> number of matched terms or matched documents is non-trivial. Also, it will
> never hit an errant BooleanQuery.TooManyClauses exception.*"
>
> So that's a second reason - to avoid the max clause count limitation of
> Boolean Query.
>
> See:
>
> https://lucene.apache.org/core/5_4_0/core/org/apache/lucene/search/MultiTermQuery.html#CONSTANT_SCORE_REWRITE
>
> https://lucene.apache.org/core/5_4_0/core/org/apache/lucene/search/WildcardQuery.html
>
>
> -- Jack Krupansky
>
> On Wed, Jan 6, 2016 at 6:07 AM, Modassar Ather 
> wrote:
>
> > Please help me understand why queries like wildcard, prefix and few
> others
> > are re-written into constant score query?
> > Why the scoring factors are not taken into consideration in such queries?
> >
> > Please correct me if I am wrong that this behavior is per the query type
> > irrespective of the parser used.
> >
> > Thanks,
> > Modassar
> >
> > On Wed, Jan 6, 2016 at 12:56 PM, Modassar Ather 
> > wrote:
> >
> > > Thanks for your response Ahmet.
> > >
> > > Best,
> > > Modassar
> > >
> > > On Mon, Jan 4, 2016 at 5:07 PM, Ahmet Arslan  >
> > > wrote:
> > >
> > >> Hi,
> > >>
> > >> I think wildcard queries fl:networ* are re-written into Constant Score
> > >> Query.
> > >> fl=*,score should returns same score for all documents that are
> > retrieved.
> > >>
> > >> Ahmet
> > >>
> > >>
> > >>
> > >> On Monday, January 4, 2016 12:22 PM, Modassar Ather <
> > >> modather1...@gmail.com> wrote:
> > >> Hi,
> > >>
> > >> Kindly help me understand how will relevance ranking differ int
> > following
> > >> searches.
> > >>
> > >> query : fl:network
> > >> query : fl:networ*
> > >>
> > >> What I am observing that the results returned are different in both of
> > >> them
> > >> in a way that the top documents returned for q=fl:network is not
> present
> > >> in
> > >> the top results of q=fl:networ*.
> > >> For example for q=fl:network I am getting top documents having around
> 20
> > >> occurrence of network whereas the top result of q=fl:networ* has only
> > >> couple of occurrence of network.
> > >> I am aware of the underlying normalization process participation in
> > >> relevance ranking of documents but not able to understand such a
> > >> difference
> > >> in the ranking of result for the queries.
> > >>
> > >> Thanks,
> > >> Modassar
> > >>
> > >
> > >
> >
>


Re: Issue with Join

2016-01-06 Thread William Bell
Can a committer take a look at this please?


On Wed, Dec 30, 2015 at 4:35 PM, William Bell  wrote:

> Yeah I ran it. It appears to only have an issue if the field for the core
> on the outside is MultiValue, and the inside field is single value, and the
> order of the field has the code not as the first entry. Like:
>
>  practicing_specialties_codes:
> >  [
> > - "PS1010",
> > - "PS282",
> > - "*PS1226*"
> > ]
> >  }
>
> Here is the output of the debug:
>
>
>- join:
>{
>   - {!join from=prac_spec_code to=practicing_specialties_codes
>   fromIndex=autosuggest}*:*:
>   {
>  - time: 24,
>  - fromSetSize: 170179,
>  - toSetSize: 3194542,
>  - fromTermCount: 1091,
>  - fromTermTotalDf: 1091,
>  - fromTermDirectCount: 1091,
>  - fromTermHits: 1091,
>  - fromTermHitsTotalDf: 1091,
>  - toTermHits: 1069,
>  - toTermHitsTotalDf: 4002538,
>  - toTermDirectCount: 697,
>  - smallSetsDeferred: 1,
>  - toSetDocsAdded: 4002538
>  }
>   },
>
>
> On Wed, Dec 30, 2015 at 2:02 AM, Mikhail Khludnev <
> mkhlud...@griddynamics.com> wrote:
>
>> Hello,
>>
>> Have you considered to pass NOT {!join from=prac_spec_code
>> to=practicing_specialties_codes
>>  fromIndex=autosuggest}*:* as a q and look at debug output.
>> Also there was a trick for dealing with pure negative queries q=*:*
>> -foo:bar though it might be obsolete.
>>
>> On Tue, Dec 29, 2015 at 7:02 AM, William Bell 
>> wrote:
>>
>> > I have having issues with {!join}. If the core have multiValued field
>> and
>> > the inner join does not have a multiValued field it does not find the
>> > ones...
>> >
>> > Solr 5.3.1... 5.3.1
>> >
>> > Example.
>> >
>> > PS1226 is in practicing_specialties_codes in providersearch core. This
>> > field is multiValued.
>> >
>> > in the autosuggest core there is NOT a field for PS1226 in there. This
>> > field is called prac_spec_code and is single values.
>> >
>> >
>> >
>> >
>> http://localhost:8983/solr/providersearch/select?q=*%3A*&wt=json&indent=true&fq=practicing_specialties_codes:PS1226&fl=practicing_specialties_codes
>> >
>> > I get:
>> >
>> >
>> >- docs:
>> >[
>> >   -
>> >   {
>> >  - practicing_specialties_codes:
>> >  [
>> > - "PS1010",
>> > - "PS282",
>> > - "PS1226"
>> > ]
>> >  }
>> >   ]
>> >
>> >
>> >
>> > In autosuggest there is nothing:
>> >
>> >
>> >
>> http://localhost:8983/solr/autosuggest/select?q=*%3A*&wt=json&indent=true&fq=prac_spec_code:PS1226&fl=prac_spec_code
>> >
>> > Nothing.
>> >
>> > Then a join should find what is in providersearch but missing in
>> > autosuggest.
>> >
>> >
>> >
>> http://localhost:8983/solr/providersearch/select?debugQuery=true&wt=json&q=*:*&rows=10&fq=practicing_specialties_codes:PS1226&fl=practicing_specialties_codes&fq=NOT%20{!join%20from=prac_spec_code%20to=practicing_specialties_codes%20fromIndex=autosuggest}auto_type:PRACSPEC
>> 
>> >
>> > or
>> >
>> >
>> >
>> http://hgsolr2sl1:8983/solr/providersearch/select?debugQuery=true&wt=json&q=*:*&rows=10&fl=practicing_specialties_codes&fq=NOT%20{!join%20from=prac_spec_code%20to=practicing_specialties_codes%20fromIndex=autosuggest}auto_type:PRACSPEC
>> 
>> >
>> > or
>> >
>> >
>> >
>> http://hgsolr2sl1:8983/solr/providersearch/select?debugQuery=true&wt=json&q=*:*&rows=10&fl=practicing_specialties_codes&fq=NOT%20{!join%20from=prac_spec_code%20to=practicing_specialties_codes%20fromIndex=autosuggest}*:*
>> 
>> >
>> > I also tried *:* AND NOT {!join}
>> >
>> > I get 0 results. This seems to be a bug.
>> >
>> > {
>> >
>> >- responseHeader:
>> >{
>> >   - status: 0,
>> >   - QTime: 178,
>> >   - params:
>> >   {
>> >  - q: "*:*",
>> >  - fl: "practicing_specialties_codes",
>> >  - fq: "NOT {!join from=prac_spec_code
>> >  to=practicing_specialties_codes fromIndex=autosuggest}*:*",
>> >  - rows: "10",
>> >  - wt: "json",
>> >  - debugQuery: "true"
>> >  }
>> >   },
>> >- response:
>> >{
>> >   - numFound: 0,
>> >   - start:

Re: I cannot create replica in Solr

2016-01-06 Thread Erick Erickson
It looks like you haven't uploaded the configset to Zookeeper so it
can be found by the
create command. See:
https://cwiki.apache.org/confluence/display/solr/Using+ZooKeeper+to+Manage+Configuration+Files.

Best,
Erick

On Wed, Jan 6, 2016 at 1:33 PM, persoy  wrote:
> Hi
> I'm using Solr clouds. I created one solr and zookeeper instances. Then I
> want to create replica for Solr.
> I'm using Windows 7
>  But the following errors are throwing :
>
> This is warning message:
> OverseerCollectionProcessor.processMessage : create , {
>   "name":"contracts",
>   "fromApi":"true",
>   "replicationFactor":"1",
>   "collection.configName":"contracts",
>   "numShards":"1",
>   "stateFormat":"2",
>   "operation":"create"}
> After this warning these error messages are throwing
> Can someone help me?
> Thanks
> This is Error messages:
> org.apache.solr.common.SolrException: Could not load conf for core
> contracts_shard1_replica1: Error loading solr config from solrconfig.xml
> at
> org.apache.solr.core.ConfigSetService.getConfig(ConfigSetService.java:78)
> at org.apache.solr.core.CoreContainer.create(CoreContainer.java:635)
> at org.apache.solr.core.CoreContainer.create(CoreContainer.java:611)
> at
> org.apache.solr.handler.admin.CoreAdminHandler.handleCreateAction(CoreAdminHandler.java:628)
> at
> org.apache.solr.handler.admin.CoreAdminHandler.handleRequestInternal(CoreAdminHandler.java:213)
> at
> org.apache.solr.handler.admin.CoreAdminHandler.handleRequestBody(CoreAdminHandler.java:193)
> at
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:143)
> at
> org.apache.solr.servlet.HttpSolrCall.handleAdminRequest(HttpSolrCall.java:660)
> at org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:431)
> at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:227)
> at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:196)
> at
> org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1652)
> at
> org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:585)
> at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)
> at
> org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:577)
> at
> org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:223)
> at
> org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1127)
> at
> org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:515)
> at
> org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)
> at
> org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1061)
> at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
> at
> org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:215)
> at
> org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:110)
> at
> org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)
> at org.eclipse.jetty.server.Server.handle(Server.java:497)
> at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:310)
> at
> org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:257)
> at
> org.eclipse.jetty.io.AbstractConnection$2.run(AbstractConnection.java:540)
> at
> org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:635)
> at
> org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:555)
> at java.lang.Thread.run(Thread.java:785)
> Caused by: org.apache.solr.common.SolrException: Error loading solr config
> from solrconfig.xml
> at
> org.apache.solr.core.SolrConfig.readFromResourceLoader(SolrConfig.java:187)
> at
> org.apache.solr.core.ConfigSetService.createSolrConfig(ConfigSetService.java:92)
> at
> org.apache.solr.core.ConfigSetService.getConfig(ConfigSetService.java:73)
> ... 30 more
> Caused by: java.io.IOException: Can't find resource 'solrconfig.xml' in
> classpath or '/configs/contracts',
> cwd=C:\CM_10.1.1\INDEXSERVER\searchserver-distribution\target\searchserver\solr\server
> at
> org.apache.solr.cloud.ZkSolrResourceLoader.openResource(ZkSolrResourceLoader.java:96)
> at
> org.apache.solr.core.SolrResourceLoader.openConfig(SolrResourceLoader.java:309)
> at org.apache.solr.core.Config.(Config.java:122)
> at org.apache.solr.core.Config.(Config.java:92)
> at org.apache.solr.core.SolrConfig.(SolrConfig.java:202)
> at
> org.apache.solr.core.SolrConfig.readFromResourceLoader(SolrConfig.java:179)
> ... 32 more
>
> org.apache.solr.common.SolrException: Error CREATEing SolrCore

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina

Hi,

is it possible that was the problem wrote by Shawn and you have 
SolrCloud mode (with zookeeper) ?


The solution gives by Esther works fine so it's ok for me :)

**

Are you running in SolrCloud mode (with zookeeper)?  If you're not, then
the collection parameter doesn't do anything, and old-style distributed
search (with the shards parameter) will be your only option.

Thanks,
Shawn

***

Le 06/01/2016 19:17, Susheel Kumar a écrit :

Hi Bruno,  I just tested on 5.4 for your sake and it works fine.  You are
somewhere goofing up.  Please create a new simple schema different from
your use case with 2-3 fields with 2-3 documents and test this out
independently on your current problem.  That's what i can make suggestion
and did same to confirm this.

On Wed, Jan 6, 2016 at 11:48 AM, Bruno Mannina  wrote:


Same result on my dev' server, it seems that collection param haven't
effect on the query...

Q: I don't see on the solr 5.4 doc, the "collection" param for select
handler, is it always present in 5.4 version ?


Le 06/01/2016 17:38, Bruno Mannina a écrit :


I have a dev' server, I will do some test on it...

Le 06/01/2016 17:31, Susheel Kumar a écrit :


I'll suggest if you can setup some some test data locally and try this
out.  This will confirm your understanding.

Thanks,
Susheel

On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina  wrote:

Hi Susheel, Emir,

yes I check, and I have one result in c1 and in c2 with the same query
fid:34520196

http://xxx.xxx.xxx.xxx:
/solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2


{ "responseHeader":{ "status":0, "QTime":1, "params":{
"fl":"fid,cc*,st",
"indent":"true", "q":"fid:34520196", "collection":"c1,c2",
"wt":"json"}},
"response":{"numFound":1,"start":0,"docs":[ {

  "id":"EP1680447",
  "st":"LAPSED",
  "fid":"34520196"}]
}
}


http://xxx.xxx.xxx.xxx:
/solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2


{
"responseHeader":{
  "status":0,
  "QTime":0,
  "params":{
"fl":"id,fid,cc*,st",
"indent":"true",
"q":"fid:34520196",
"collection":"c1,c2",
"wt":"json"}},
"response":{"numFound":1,"start":0,"docs":[
{
  "id":"WO2005040212",
  "st":"PENDING",
  "cc_CA":"LAPSED",
  "cc_EP":"LAPSED",
  "cc_JP":"PENDING",
  "cc_US":"LAPSED",
  "fid":"34520196"}]
}}


I have the same xxx.xxx.xxx.xxx: (server:port).
unique key field C1, C2 : id

id data in C1 is different of id data in C2

Must I config/set something in solr ?

thanks,
Bruno


Le 06/01/2016 14:56, Emir Arnautovic a écrit :

Hi Bruno,

Can you check counts? Is it possible that first page is only with
results
from collection that you sent request to so you assumed it returns only
results from single collection?

Thanks,
Emir

On 06.01.2016 14:33, Susheel Kumar wrote:

Hi Bruno,

I just tested this scenario in my local solr 5.3.1 and it returned
results
from two identical collections. I doubt if it is broken in 5.4 just
double
check if you are not missing anything else.

Thanks,
Susheel



http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2

responseHeader": {"status": 0,"QTime": 98,"params": {"q":
"id_type:hello","
indent": "true","collection": "c1,c2","wt": "json"}},
response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id":
"1","
id_type": "hello","_version_": 1522623395043213300},{"id":
"3","id_type":"
hello","_version_": 1522623422397415400}]}

On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina 
wrote:

yes id value is unique in C1 and unique in C2.


id in C1 is never present in C2
id in C2 is never present in C1


Le 06/01/2016 11:12, Binoy Dalal a écrit :

Are Id values for docs in both the collections exactly same?


To get proper results, the ids should be unique across both the
cores.

On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:

Hi All,

Solr 5.4, Ubuntu

I thought it was simple to request across two collections with the
same
schema but not.
I have one solr instance launch. 300 000 records in each
collection.

I try to use this request without having both results:

http://my_adress:my_port
/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json

this request returns only C1 results and if I do:

http://my_adress:my_port
/solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json

it returns only C2 results.

I have 5 identical fields on both collection
id, fid, st, cc, timestamp
where id is the unique key field.

Can someone could explain me why it doesn't work ?

Thanks a lot !
Bruno

---
L'absence de virus dans ce courrier électronique a été vérifiée
par le
logiciel antivirus Avast.
http://www.avast.com

--

Regards,


Binoy Dalal


---


L'absence de virus dans ce courrier électronique a été vérifiée par
le
logiciel antivirus Avast.
http://www.avast.com



---

L'absence de virus dans ce courrier électronique a été vé

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina

Hi Shawn,

thanks for this info, I use solr alone on my own server.

Le 06/01/2016 20:13, Shawn Heisey a écrit :

On 1/6/2016 2:41 AM, Bruno Mannina wrote:

I try to use this request without having both results:

http://my_adress:my_port/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json


this request returns only C1 results and if I do:

http://my_adress:my_port/solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json


it returns only C2 results.

Are you running in SolrCloud mode (with zookeeper)?  If you're not, then
the collection parameter doesn't do anything, and old-style distributed
search (with the shards parameter) will be your only option.

Thanks,
Shawn






---
L'absence de virus dans ce courrier électronique a été vérifiée par le logiciel 
antivirus Avast.
http://www.avast.com



Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina

Yeah ! it works with your method !

thanks a lot Esther !


Le 06/01/2016 19:15, Esther-Melaine Quansah a écrit :

Ok, so join won’t work. Distributed search is your answer. This worked for me:

http://localhost:8983/solr/temp/select?shards=localhost:8983/solr/job,localhost:8983/solr/temp&q=*:*
 


so for you it’d look something like

http://localhost:8983/solr/c1/select?shards=localhost:8983/solr/c1,localhost:8983/solr/c2&q=fid:34520196
 

and obviously, you’ll just choose the ports that correspond to your 
configuration.

Esther

On Jan 6, 2016, at 9:36 AM, Bruno Mannina  wrote:

:( not work for me

http://my_adress:my_port/solr/c1/select?q={!join from=fid to=fid 
fromIndex=c2}fid:34520196&wt=json

the result is always the same, it answer only for c1
34520196 has result in both collections



Le 06/01/2016 18:16, Binoy Dalal a écrit :

Bruno,
Use join like so:
{!join from=f1 to=f2 fromIndex=c2}
On c1

On Wed, 6 Jan 2016, 22:30 Bruno Mannina  wrote:


Hi Ester,

yes, i saw it, but if I use:

q={!join from=fid to=fid}fid:34520196 (with or not &collection=c1,c2)

I have only the result from the collection used in the select/c1

Le 06/01/2016 17:52, esther.quan...@lucidworks.com a écrit :

Hi Bruno,

You might consider using the JoinQueryParser. Details here :

https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-JoinQueryParser

Best,
Esther


Le 6 janv. 2016 à 08:48, Bruno Mannina  a écrit :

Same result on my dev' server, it seems that collection param haven't

effect on the query...

Q: I don't see on the solr 5.4 doc, the "collection" param for select

handler, is it always present in 5.4 version ?

Le 06/01/2016 17:38, Bruno Mannina a écrit :

I have a dev' server, I will do some test on it...

Le 06/01/2016 17:31, Susheel Kumar a écrit :

I'll suggest if you can setup some some test data locally and try this
out.  This will confirm your understanding.

Thanks,
Susheel


On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina 

wrote:

Hi Susheel, Emir,

yes I check, and I have one result in c1 and in c2 with the same

query

fid:34520196

http://xxx.xxx.xxx.xxx:


/solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2

{ "responseHeader":{ "status":0, "QTime":1, "params":{

"fl":"fid,cc*,st",

"indent":"true", "q":"fid:34520196", "collection":"c1,c2",

"wt":"json"}},

"response":{"numFound":1,"start":0,"docs":[ {

  "id":"EP1680447",
  "st":"LAPSED",
  "fid":"34520196"}]
}
}


http://xxx.xxx.xxx.xxx:


/solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2

{
"responseHeader":{
  "status":0,
  "QTime":0,
  "params":{
"fl":"id,fid,cc*,st",
"indent":"true",
"q":"fid:34520196",
"collection":"c1,c2",
"wt":"json"}},
"response":{"numFound":1,"start":0,"docs":[
{
  "id":"WO2005040212",
  "st":"PENDING",
  "cc_CA":"LAPSED",
  "cc_EP":"LAPSED",
  "cc_JP":"PENDING",
  "cc_US":"LAPSED",
  "fid":"34520196"}]
}}


I have the same xxx.xxx.xxx.xxx: (server:port).
unique key field C1, C2 : id

id data in C1 is different of id data in C2

Must I config/set something in solr ?

thanks,
Bruno


Le 06/01/2016 14:56, Emir Arnautovic a écrit :


Hi Bruno,
Can you check counts? Is it possible that first page is only with

results

from collection that you sent request to so you assumed it returns

only

results from single collection?

Thanks,
Emir


On 06.01.2016 14:33, Susheel Kumar wrote:

Hi Bruno,

I just tested this scenario in my local solr 5.3.1 and it returned
results
from two identical collections. I doubt if it is broken in 5.4 just
double
check if you are not missing anything else.

Thanks,
Susheel




http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2

responseHeader": {"status": 0,"QTime": 98,"params": {"q":
"id_type:hello","
indent": "true","collection": "c1,c2","wt": "json"}},
response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id":

"1","

id_type": "hello","_version_": 1522623395043213300},{"id":
"3","id_type":"
hello","_version_": 1522623422397415400}]}

On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina 

wrote:

yes id value is unique in C1 and unique in C2.

id in C1 is never present in C2
id in C2 is never present in C1


Le 06/01/2016 11:12, Binoy Dalal a écrit :

Are Id values for docs in both the collections exactly same?

To get proper results, the ids should be unique across both the

cores.

On Wed, 6 Jan 2016, 15:11 Bruno Mannina 

wrote:

Hi All,


Solr 5.4, Ubuntu

I thought it was simple to request across two collections with

the

same
schema but not.
I have one solr instance launch. 300 000 records in 

I cannot create replica in Solr

2016-01-06 Thread persoy
Hi
I'm using Solr clouds. I created one solr and zookeeper instances. Then I
want to create replica for Solr.
I'm using Windows 7 
 But the following errors are throwing :

This is warning message:
OverseerCollectionProcessor.processMessage : create , {
  "name":"contracts",
  "fromApi":"true",
  "replicationFactor":"1",
  "collection.configName":"contracts",
  "numShards":"1",
  "stateFormat":"2",
  "operation":"create"}
After this warning these error messages are throwing
Can someone help me?
Thanks
This is Error messages:
org.apache.solr.common.SolrException: Could not load conf for core
contracts_shard1_replica1: Error loading solr config from solrconfig.xml
at
org.apache.solr.core.ConfigSetService.getConfig(ConfigSetService.java:78)
at org.apache.solr.core.CoreContainer.create(CoreContainer.java:635)
at org.apache.solr.core.CoreContainer.create(CoreContainer.java:611)
at
org.apache.solr.handler.admin.CoreAdminHandler.handleCreateAction(CoreAdminHandler.java:628)
at
org.apache.solr.handler.admin.CoreAdminHandler.handleRequestInternal(CoreAdminHandler.java:213)
at
org.apache.solr.handler.admin.CoreAdminHandler.handleRequestBody(CoreAdminHandler.java:193)
at
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:143)
at
org.apache.solr.servlet.HttpSolrCall.handleAdminRequest(HttpSolrCall.java:660)
at org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:431)
at
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:227)
at
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:196)
at
org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1652)
at
org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:585)
at
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)
at
org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:577)
at
org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:223)
at
org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1127)
at
org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:515)
at
org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)
at
org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1061)
at
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
at
org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:215)
at
org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:110)
at
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)
at org.eclipse.jetty.server.Server.handle(Server.java:497)
at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:310)
at
org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:257)
at
org.eclipse.jetty.io.AbstractConnection$2.run(AbstractConnection.java:540)
at
org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:635)
at
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:555)
at java.lang.Thread.run(Thread.java:785)
Caused by: org.apache.solr.common.SolrException: Error loading solr config
from solrconfig.xml
at
org.apache.solr.core.SolrConfig.readFromResourceLoader(SolrConfig.java:187)
at
org.apache.solr.core.ConfigSetService.createSolrConfig(ConfigSetService.java:92)
at
org.apache.solr.core.ConfigSetService.getConfig(ConfigSetService.java:73)
... 30 more
Caused by: java.io.IOException: Can't find resource 'solrconfig.xml' in
classpath or '/configs/contracts',
cwd=C:\CM_10.1.1\INDEXSERVER\searchserver-distribution\target\searchserver\solr\server
at
org.apache.solr.cloud.ZkSolrResourceLoader.openResource(ZkSolrResourceLoader.java:96)
at
org.apache.solr.core.SolrResourceLoader.openConfig(SolrResourceLoader.java:309)
at org.apache.solr.core.Config.(Config.java:122)
at org.apache.solr.core.Config.(Config.java:92)
at org.apache.solr.core.SolrConfig.(SolrConfig.java:202)
at
org.apache.solr.core.SolrConfig.readFromResourceLoader(SolrConfig.java:179)
... 32 more

org.apache.solr.common.SolrException: Error CREATEing SolrCore
'contracts_shard1_replica1': Unable to create core
[contracts_shard1_replica1] Caused by: Can't find resource 'solrconfig.xml'
in classpath or '/configs/contracts',
cwd=C:\CM_10.1.1\INDEXSERVER\searchserver-distribution\target\searchserver\solr\server
at
org.apache.solr.handler.admin.CoreAdminHandler.handleCreateAction(CoreAdminHandler.java:661)
at
org.apache.solr.handler.admin.CoreAdminHandler.handleRequestInternal(CoreAdminHandler.java:213)
 

Re: Memory Usage increases by a lot during and after optimization .

2016-01-06 Thread Shawn Heisey
On 1/5/2016 11:50 PM, Zheng Lin Edwin Yeo wrote:
> Here is the new screenshot of the Memory tab of the Resource Monitor.
> https://www.dropbox.com/s/w4bnrb66r16lpx1/Resource%20Monitor.png?dl=0
>
> Yes, I found that the value under the "Working Set" column is much higher
> than the others. Also, the value which I was previously looking at under
> the Task Manager is under the Private Column here.
> It says that I have about 14GB of available memory, but the "Free" number
> is much lower, at 79MB.

You'll probably think I'm nuts, but I believe everything is working
exactly as it should.

The first two processes, which I assume are Solr processes, show a
Shareable size near 7GB each.  I have seen something similar happen on
Linux where SHR memory is huge for the Solr process, and when this
happens, the combination of memory numbers would turn out to be
impossible, so I think it's a memory reporting bug related to Java, one
that affects both Linux and Windows.

Subtracting SHR from RES (or in your case, Shareable from Working)
reveals the actual memory being used, and I believe you can see this
actual number in the Private column, which is approximately the
difference between Working and Shareable.  If I'm right, this means that
the actual memory usage is almost 14GB lower than Windows is reporting.

If both of the top processes are Solr, I'm not sure why you have two
Solr processes on one machine.  One Solr instance can handle multiple
indexes with no problem.

As evidence that I'm not insane, consider the following screenshot, from
another of my servers:

https://www.dropbox.com/s/64en3sar4cr1ytj/linux-solr-mem-high-shr.png?dl=0

On the screenshot, the solr process shows RES size of 22GB ... which is
highly unusual, because this Solr install has a max heap of 8GB ... but
notice that SHR is 13GB.  The difference between 22GB and 13GB is 9GB,
which is much more reasonable, and if we assume that the 22GB is rounded
up and/or the 13GB is rounded down, then the difference is much closer
to 8GB.  Looking at some other numbers, the "cached" value is 48GB.  If
you add the 48GB cache allocation to the *reported* resident size of
22GB for Solr, you get a total of 70GB ... which is more memory than the
machine even has (64GB).  This is why I am sure that when SHR is really
high on a Java process, it is a memory reporting error.

Thanks,
Shawn



RE: Cleanup solr cloud after failure in collection creation

2016-01-06 Thread Gian Maria Ricci - aka Alkampfer
Thanks Erick,

I did not remember that image attached are not permitted. Here is the links to 
the two images

http://screencast.com/t/HQnWDV6GDoe
http://screencast.com/t/qUpt0HxF 

--
Gian Maria Ricci
Cell: +39 320 0136949


-Original Message-
From: Erick Erickson [mailto:erickerick...@gmail.com] 
Sent: mercoledì 6 gennaio 2016 18:39
To: solr-user 
Subject: Re: Cleanup solr cloud after failure in collection creation

The mail server is quite aggressive about removing attachments, none of yours 
came through.
Perhaps put them somewhere else and provide a link?

Best,
Erick

On Wed, Jan 6, 2016 at 3:22 AM, Gian Maria Ricci - aka Alkampfer < 
alkamp...@nablasoft.com> wrote:

> I’ve issued a command to create some collections, but there were an 
> error in solrconfig.xml (I’ve specified wrong path to 
> dataimporthandler.jar files). The creation of the collection failed 
> but now I don’t know how to cleanup everything.
>
>
>
> This is a test solrcloud where I’m experimenting in windows environment.
>
>
>
> Here is what I see in the UI. I’ve also checked disk and all 
> directories for various shards are there. I cannot use the DELETE 
> collection command because the collection was not really created, but 
> I need to know what is the correct way to cleanup the situation.
>
>
>
>
>
>
>
> Here is the disk, where all the directories are present
>
>
>
> --
> Gian Maria Ricci
> Cell: +39 320 0136949
>
> [image:
> https://ci5.googleusercontent.com/proxy/5oNMOYAeFXZ_LDKanNfoLRHC37mAZk
> VVhkPN7QxMdA0K5JW2m0bm8azJe7oWZMNt8fKHNX1bzrUTd-kIyE40CmwT2Mlf8OI=s0-d
> -e1-ft#http://www.codewrecks.com/files/signature/mvp.png]
>  [image:
> https://ci3.googleusercontent.com/proxy/f-unQbmk6NtkHFspO5Y6x4jlIf_xrm
> GLUT3fU9y_7VUHSFUjLs7aUIMdZQYTh3eWIA0sBnvNX3WGXCU59chKXLuAHi2ArWdAcBcl
> KA=s0-d-e1-ft#http://www.codewrecks.com/files/signature/linkedin.jpg]
>  [image:
> https://ci3.googleusercontent.com/proxy/gjapMzu3KEakBQUstx_-cN7gHJ_Gpc
> IZNEPjCzOYMrPl-r1DViPE378qNAQyEWbXMTj6mcduIAGaApe9qHG1KN_hyFxQAIkdNSVT
> =s0-d-e1-ft#http://www.codewrecks.com/files/signature/twitter.jpg]
>  [image:
> https://ci5.googleusercontent.com/proxy/iuDOD2sdaxRDvTwS8MO7-CcXchpNJX
> 96uaWuvagoVLcjpAPsJi88XeOonE4vHT6udVimo7yL9ZtdrYueEfH7jXnudmi_Vvw=s0-d
> -e1-ft#http://www.codewrecks.com/files/signature/rss.jpg]
>  [image:
> https://ci6.googleusercontent.com/proxy/EBJjfkBzcsSlAzlyR88y86YXcwaKfn
> 3x7ydAObL1vtjJYclQr_l5TvrFx4PQ5qLNYW3yp7Ig66DJ-0tPJCDbDmYAFcamPQehwg=s
> 0-d-e1-ft#http://www.codewrecks.com/files/signature/skype.jpg]
>
>
>


Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Shawn Heisey
On 1/6/2016 2:41 AM, Bruno Mannina wrote:
> I try to use this request without having both results:
>
> http://my_adress:my_port/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json
>
>
> this request returns only C1 results and if I do:
>
> http://my_adress:my_port/solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json
>
>
> it returns only C2 results.

Are you running in SolrCloud mode (with zookeeper)?  If you're not, then
the collection parameter doesn't do anything, and old-style distributed
search (with the shards parameter) will be your only option.

Thanks,
Shawn



Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Susheel Kumar
Hi Bruno,  I just tested on 5.4 for your sake and it works fine.  You are
somewhere goofing up.  Please create a new simple schema different from
your use case with 2-3 fields with 2-3 documents and test this out
independently on your current problem.  That's what i can make suggestion
and did same to confirm this.

On Wed, Jan 6, 2016 at 11:48 AM, Bruno Mannina  wrote:

> Same result on my dev' server, it seems that collection param haven't
> effect on the query...
>
> Q: I don't see on the solr 5.4 doc, the "collection" param for select
> handler, is it always present in 5.4 version ?
>
>
> Le 06/01/2016 17:38, Bruno Mannina a écrit :
>
>> I have a dev' server, I will do some test on it...
>>
>> Le 06/01/2016 17:31, Susheel Kumar a écrit :
>>
>>> I'll suggest if you can setup some some test data locally and try this
>>> out.  This will confirm your understanding.
>>>
>>> Thanks,
>>> Susheel
>>>
>>> On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina  wrote:
>>>
>>> Hi Susheel, Emir,

 yes I check, and I have one result in c1 and in c2 with the same query
 fid:34520196

 http://xxx.xxx.xxx.xxx:
 /solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2


 { "responseHeader":{ "status":0, "QTime":1, "params":{
 "fl":"fid,cc*,st",
 "indent":"true", "q":"fid:34520196", "collection":"c1,c2",
 "wt":"json"}},
 "response":{"numFound":1,"start":0,"docs":[ {

  "id":"EP1680447",
  "st":"LAPSED",
  "fid":"34520196"}]
}
 }


 http://xxx.xxx.xxx.xxx:
 /solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2


 {
"responseHeader":{
  "status":0,
  "QTime":0,
  "params":{
"fl":"id,fid,cc*,st",
"indent":"true",
"q":"fid:34520196",
"collection":"c1,c2",
"wt":"json"}},
"response":{"numFound":1,"start":0,"docs":[
{
  "id":"WO2005040212",
  "st":"PENDING",
  "cc_CA":"LAPSED",
  "cc_EP":"LAPSED",
  "cc_JP":"PENDING",
  "cc_US":"LAPSED",
  "fid":"34520196"}]
}}


 I have the same xxx.xxx.xxx.xxx: (server:port).
 unique key field C1, C2 : id

 id data in C1 is different of id data in C2

 Must I config/set something in solr ?

 thanks,
 Bruno


 Le 06/01/2016 14:56, Emir Arnautovic a écrit :

 Hi Bruno,
> Can you check counts? Is it possible that first page is only with
> results
> from collection that you sent request to so you assumed it returns only
> results from single collection?
>
> Thanks,
> Emir
>
> On 06.01.2016 14:33, Susheel Kumar wrote:
>
> Hi Bruno,
>>
>> I just tested this scenario in my local solr 5.3.1 and it returned
>> results
>> from two identical collections. I doubt if it is broken in 5.4 just
>> double
>> check if you are not missing anything else.
>>
>> Thanks,
>> Susheel
>>
>>
>>
>> http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2
>>
>> responseHeader": {"status": 0,"QTime": 98,"params": {"q":
>> "id_type:hello","
>> indent": "true","collection": "c1,c2","wt": "json"}},
>> response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id":
>> "1","
>> id_type": "hello","_version_": 1522623395043213300},{"id":
>> "3","id_type":"
>> hello","_version_": 1522623422397415400}]}
>>
>> On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina 
>> wrote:
>>
>> yes id value is unique in C1 and unique in C2.
>>
>>> id in C1 is never present in C2
>>> id in C2 is never present in C1
>>>
>>>
>>> Le 06/01/2016 11:12, Binoy Dalal a écrit :
>>>
>>> Are Id values for docs in both the collections exactly same?
>>>
 To get proper results, the ids should be unique across both the
 cores.

 On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:

 Hi All,

 Solr 5.4, Ubuntu
>
> I thought it was simple to request across two collections with the
> same
> schema but not.
> I have one solr instance launch. 300 000 records in each
> collection.
>
> I try to use this request without having both results:
>
> http://my_adress:my_port
> /solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json
>
> this request returns only C1 results and if I do:
>
> http://my_adress:my_port
> /solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json
>
> it returns only C2 results.
>
> I have 5 identical fields on both collection
> id, fid, st, cc, timestamp
>>

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Esther-Melaine Quansah
Ok, so join won’t work. Distributed search is your answer. This worked for me:

http://localhost:8983/solr/temp/select?shards=localhost:8983/solr/job,localhost:8983/solr/temp&q=*:*
 


so for you it’d look something like 

http://localhost:8983/solr/c1/select?shards=localhost:8983/solr/c1,localhost:8983/solr/c2&q=fid:34520196
 

and obviously, you’ll just choose the ports that correspond to your 
configuration.

Esther 
> On Jan 6, 2016, at 9:36 AM, Bruno Mannina  wrote:
> 
> :( not work for me
> 
> http://my_adress:my_port/solr/c1/select?q={!join from=fid to=fid 
> fromIndex=c2}fid:34520196&wt=json
> 
> the result is always the same, it answer only for c1
> 34520196 has result in both collections
> 
> 
> 
> Le 06/01/2016 18:16, Binoy Dalal a écrit :
>> Bruno,
>> Use join like so:
>> {!join from=f1 to=f2 fromIndex=c2}
>> On c1
>> 
>> On Wed, 6 Jan 2016, 22:30 Bruno Mannina  wrote:
>> 
>>> Hi Ester,
>>> 
>>> yes, i saw it, but if I use:
>>> 
>>> q={!join from=fid to=fid}fid:34520196 (with or not &collection=c1,c2)
>>> 
>>> I have only the result from the collection used in the select/c1
>>> 
>>> Le 06/01/2016 17:52, esther.quan...@lucidworks.com a écrit :
 Hi Bruno,
 
 You might consider using the JoinQueryParser. Details here :
>>> https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-JoinQueryParser
 Best,
 Esther
 
> Le 6 janv. 2016 à 08:48, Bruno Mannina  a écrit :
> 
> Same result on my dev' server, it seems that collection param haven't
>>> effect on the query...
> Q: I don't see on the solr 5.4 doc, the "collection" param for select
>>> handler, is it always present in 5.4 version ?
> Le 06/01/2016 17:38, Bruno Mannina a écrit :
>> I have a dev' server, I will do some test on it...
>> 
>> Le 06/01/2016 17:31, Susheel Kumar a écrit :
>>> I'll suggest if you can setup some some test data locally and try this
>>> out.  This will confirm your understanding.
>>> 
>>> Thanks,
>>> Susheel
>>> 
 On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina 
>>> wrote:
 Hi Susheel, Emir,
 
 yes I check, and I have one result in c1 and in c2 with the same
>>> query
 fid:34520196
 
 http://xxx.xxx.xxx.xxx:
 
>>> /solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2
 { "responseHeader":{ "status":0, "QTime":1, "params":{
>>> "fl":"fid,cc*,st",
 "indent":"true", "q":"fid:34520196", "collection":"c1,c2",
>>> "wt":"json"}},
 "response":{"numFound":1,"start":0,"docs":[ {
 
  "id":"EP1680447",
  "st":"LAPSED",
  "fid":"34520196"}]
}
 }
 
 
 http://xxx.xxx.xxx.xxx:
 
>>> /solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2
 {
"responseHeader":{
  "status":0,
  "QTime":0,
  "params":{
"fl":"id,fid,cc*,st",
"indent":"true",
"q":"fid:34520196",
"collection":"c1,c2",
"wt":"json"}},
"response":{"numFound":1,"start":0,"docs":[
{
  "id":"WO2005040212",
  "st":"PENDING",
  "cc_CA":"LAPSED",
  "cc_EP":"LAPSED",
  "cc_JP":"PENDING",
  "cc_US":"LAPSED",
  "fid":"34520196"}]
}}
 
 
 I have the same xxx.xxx.xxx.xxx: (server:port).
 unique key field C1, C2 : id
 
 id data in C1 is different of id data in C2
 
 Must I config/set something in solr ?
 
 thanks,
 Bruno
 
 
 Le 06/01/2016 14:56, Emir Arnautovic a écrit :
 
> Hi Bruno,
> Can you check counts? Is it possible that first page is only with
>>> results
> from collection that you sent request to so you assumed it returns
>>> only
> results from single collection?
> 
> Thanks,
> Emir
> 
>> On 06.01.2016 14:33, Susheel Kumar wrote:
>> 
>> Hi Bruno,
>> 
>> I just tested this scenario in my local solr 5.3.1 and it returned
>> results
>> from two identical collections. I doubt if it is broken in 5.4 just
>> double
>> check if you are not missing anything else.
>> 
>> Thanks,
>> Susheel
>> 
>> 
>> 
>>> http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2
>> responseHeader": {"status": 0,"QTime": 98,"p

Re: solr 5.2.0 need to build high query response

2016-01-06 Thread Erick Erickson
Right, the point of the newSearcher and firstSearcher events
is to smooth out spikes in response time, and it looks like
that's happening now.

>From here, you're in to query tuning. Without knowing your queries,
your schema, how many docs on a server, memory allocation
 and all that it's impossible to say much.

Here are some places to start looking:
https://wiki.apache.org/solr/SolrPerformanceProblems

Start with adding &debug=timing to see where Solr thinks it's
spending time, you'll see breakdowns for how much time
is spent searching, highlighting, faceting, whatever. Likely the
time is in search, but you never know.

Best,
Erick

On Wed, Jan 6, 2016 at 2:15 AM, Novin Novin  wrote:
> Thanks Erick, this listener doing quite a good job. But not what  I needed.
> Do the solr has any other things that I can look into to make it faster.
> FYI  speed goes to 1 sec to 1.2 sec. I actually needed around 500 ms.
>
> On Tue, 5 Jan 2016 at 18:24 Erick Erickson  wrote:
>
>> Yep. Do note what's happening here. You're executing a query
>> that potentially takes 10 seconds to execute (based on your
>> earlier post). But you may be opening a new searcher every
>> 2 seconds. You may start to see "too many on deck searchers"
>> in your log. If you do do _not_ try to "fix" this by upping the
>> maxWarmingSearchers in solrconfig.xml, that's really an
>> anti-pattern.
>>
>> Really, I'd consider relaxing this 2 second limit. I've often found
>> it easier to tell users "it may take up to 30 seconds for newly-added
>> docs to appear in search results" than try to satisfy overly-tight
>> requirements.
>>
>> As a former co-worker often said, "Users are much more comfortable
>> with predictable delays than unpredictable ones". It's surprising how
>> often it's the case.
>>
>> Best,
>> Erick
>>
>> P.S. What's the difference between newSearcher and firstSearcher?
>> newSearcher is fired every time a commit (soft or hard with
>> openSearcher=true)
>> where firstSearcher is fired up only when Solr starts. This is to
>> accommodate
>> the fact that the autowarm counts on things like filterCacher aren't
>> available when Solr starts. In practice, though, many (most?) people
>> put the same query in both.
>>
>> On Tue, Jan 5, 2016 at 9:17 AM, Novin Novin  wrote:
>> > If I'm correct, you are talking about this
>> >
>> > 
>> > 
>> > 
>> > 
>> > 
>> > 
>> > 
>> > 
>> >
>> > *or may be here too.*
>> >
>> > static firstSearcher warming in
>> > solrconfig.xml
>> > 
>> > 
>> > 
>> >
>> > Thanks,
>> > Novin
>> >
>> > On Tue, 5 Jan 2016 at 16:22 Erick Erickson 
>> wrote:
>> >
>> >> It sounds like you're not doing proper autowarming,
>> >> which you'd need to do either with hard or
>> >> soft commits that open new searchers.
>> >>
>> >> see:
>> >> https://wiki.apache.org/solr/SolrCaching#Cache_Warming_and_Autowarming
>> >>
>> >> In particular, you should have a newSearcher event
>> >> that facets on the fields you expect to need.
>> >>
>> >> Best,
>> >> Erick
>> >>
>> >> On Tue, Jan 5, 2016 at 8:17 AM, Novin Novin 
>> wrote:
>> >> > Thanks David. It is quite good to use for NRT.
>> >> >
>> >> > Apologies, I didn't mention that facet search is really slow.
>> >> >
>> >> > I found the below reason which could be the reason because I am using
>> >> facet
>> >> > spatial search which is getting slow.
>> >> >
>> >> > To know more about solr hard and soft commits, have a look at this
>> blog :
>> >> >
>> >>
>> https://lucidworks.com/blog/2013/08/23/understanding-transaction-logs-softcommit-and-commit-in-sorlcloud/
>> >> >
>> >> > In this article, "soft commits are that they will make documents
>> visible,
>> >> > but at some cost. In particular the “top level” caches, which include
>> >> what
>> >> > you configure in solrconfig.xml (filterCache, queryResultCache, etc)
>> will
>> >> > be invalidated! Autowarming will be performed on your top level caches
>> >> > (e.g. filterCache, queryResultCache), and any newSearcher queries
>> will be
>> >> > executed. Also, the FieldValueCache is invalidated, so facet queries
>> will
>> >> > have to wait until the cache is refreshed."
>> >> >
>> >> > Do you have any idea what could possible be do about this?
>> >> >
>> >> >
>> >> >
>> >> > On Tue, 5 Jan 2016 at 12:31 davidphilip cherian <
>> >> > davidphilipcher...@gmail.com> wrote:
>> >> >
>> >> >> You should use solr softcommit for this use case. So, by setting
>> >> softcommit
>> >> >> to 5 seconds and autoCommit to minute with openSearcher=false should
>> do
>> >> the
>> >> >> work.
>> >> >>
>> >> >>  
>> >> >>  6
>> >> >> false
>> >> >>  
>> >> >>
>> >> >> 
>> >> >> 2000
>> >> >> 
>> >> >>
>> >> >> Reference link-
>> >> >>
>> >>
>> https://cwiki.apache.org/confluence/display/solr/Near+Real+Time+Searching
>> >> >>
>> >> >> To know more about solr hard and soft commits, have a look at this
>> blog
>> >> :
>> >> >>
>> >> >

Re: Cleanup solr cloud after failure in collection creation

2016-01-06 Thread Erick Erickson
The mail server is quite aggressive about removing attachments, none of
yours came through.
Perhaps put them somewhere else and provide a link?

Best,
Erick

On Wed, Jan 6, 2016 at 3:22 AM, Gian Maria Ricci - aka Alkampfer <
alkamp...@nablasoft.com> wrote:

> I’ve issued a command to create some collections, but there were an error
> in solrconfig.xml (I’ve specified wrong path to dataimporthandler.jar
> files). The creation of the collection failed but now I don’t know how to
> cleanup everything.
>
>
>
> This is a test solrcloud where I’m experimenting in windows environment.
>
>
>
> Here is what I see in the UI. I’ve also checked disk and all directories
> for various shards are there. I cannot use the DELETE collection command
> because the collection was not really created, but I need to know what is
> the correct way to cleanup the situation.
>
>
>
>
>
>
>
> Here is the disk, where all the directories are present
>
>
>
> --
> Gian Maria Ricci
> Cell: +39 320 0136949
>
> [image:
> https://ci5.googleusercontent.com/proxy/5oNMOYAeFXZ_LDKanNfoLRHC37mAZkVVhkPN7QxMdA0K5JW2m0bm8azJe7oWZMNt8fKHNX1bzrUTd-kIyE40CmwT2Mlf8OI=s0-d-e1-ft#http://www.codewrecks.com/files/signature/mvp.png]
>  [image:
> https://ci3.googleusercontent.com/proxy/f-unQbmk6NtkHFspO5Y6x4jlIf_xrmGLUT3fU9y_7VUHSFUjLs7aUIMdZQYTh3eWIA0sBnvNX3WGXCU59chKXLuAHi2ArWdAcBclKA=s0-d-e1-ft#http://www.codewrecks.com/files/signature/linkedin.jpg]
>  [image:
> https://ci3.googleusercontent.com/proxy/gjapMzu3KEakBQUstx_-cN7gHJ_GpcIZNEPjCzOYMrPl-r1DViPE378qNAQyEWbXMTj6mcduIAGaApe9qHG1KN_hyFxQAIkdNSVT=s0-d-e1-ft#http://www.codewrecks.com/files/signature/twitter.jpg]
>  [image:
> https://ci5.googleusercontent.com/proxy/iuDOD2sdaxRDvTwS8MO7-CcXchpNJX96uaWuvagoVLcjpAPsJi88XeOonE4vHT6udVimo7yL9ZtdrYueEfH7jXnudmi_Vvw=s0-d-e1-ft#http://www.codewrecks.com/files/signature/rss.jpg]
>  [image:
> https://ci6.googleusercontent.com/proxy/EBJjfkBzcsSlAzlyR88y86YXcwaKfn3x7ydAObL1vtjJYclQr_l5TvrFx4PQ5qLNYW3yp7Ig66DJ-0tPJCDbDmYAFcamPQehwg=s0-d-e1-ft#http://www.codewrecks.com/files/signature/skype.jpg]
>
>
>


Re: core,Collection,Shard,Replication

2016-01-06 Thread Erick Erickson
bq: But when indexing a document in one shard,it gets reflected in every shard
of that collection

This is a misunderstanding (and I'm being a bit pedantic here). Each shard
contains a portion of the entire corpus. Say you have 1M docs and 2 shards.
Each shard will have very close to 500K documents.

If a shard has multiple _replicas_, each replica has a copy of the doc.

Please take the time to work through the Solr tutorials, much will become
clearer. You don't need any kind of extensive setup, you can see how things
run on any machine you have.

Best,
Erick

On Wed, Jan 6, 2016 at 5:19 AM, Binoy Dalal  wrote:
> The machines part may have been a bit misleading. I am sorry for that. What
> I actually meant was shards. Now, you can have multiple shards hosted on a
> single machine or multiple machines as in the example I gave.
>
> "I have to make sure that all those machines have solr server or gateway
> should be deplyed ?"
>
> Yes you do need a solr process running on all machines on which you plan to
> distribute your index.
>
> "And what multiple JVM processes run behind a solr server running?"
>
> If you mean how many jvms are running for a solr server, the answer's 1.
> "then what is a solr instance?"
> One solr process on your machine.
>
> On Wed, 6 Jan 2016, 18:33 vidya  wrote:
>
>> Hi
>> You described that sharding is to distribute data over multiple machines.Do
>> I have to make sure that all those machines have solr server or gateway
>> should be deplyed ?
>> And what multiple JVM processes run behind a solr server running?
>> I wanted to know what is a node. -> I understood like a mchine with solr
>> server deployed.
>> then what is a solr instance?
>>
>> Am I correct.If not,please help me
>>
>> Thanks in advance
>>
>>
>>
>> --
>> View this message in context:
>> http://lucene.472066.n3.nabble.com/core-Collection-Shard-Replication-tp4248850p4248865.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
> --
> Regards,
> Binoy Dalal


Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina

:( not work for me

http://my_adress:my_port/solr/c1/select?q={!join from=fid to=fid 
fromIndex=c2}fid:34520196&wt=json

the result is always the same, it answer only for c1
34520196 has result in both collections



Le 06/01/2016 18:16, Binoy Dalal a écrit :

Bruno,
Use join like so:
{!join from=f1 to=f2 fromIndex=c2}
On c1

On Wed, 6 Jan 2016, 22:30 Bruno Mannina  wrote:


Hi Ester,

yes, i saw it, but if I use:

q={!join from=fid to=fid}fid:34520196 (with or not &collection=c1,c2)

I have only the result from the collection used in the select/c1

Le 06/01/2016 17:52, esther.quan...@lucidworks.com a écrit :

Hi Bruno,

You might consider using the JoinQueryParser. Details here :

https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-JoinQueryParser

Best,
Esther


Le 6 janv. 2016 à 08:48, Bruno Mannina  a écrit :

Same result on my dev' server, it seems that collection param haven't

effect on the query...

Q: I don't see on the solr 5.4 doc, the "collection" param for select

handler, is it always present in 5.4 version ?

Le 06/01/2016 17:38, Bruno Mannina a écrit :

I have a dev' server, I will do some test on it...

Le 06/01/2016 17:31, Susheel Kumar a écrit :

I'll suggest if you can setup some some test data locally and try this
out.  This will confirm your understanding.

Thanks,
Susheel


On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina 

wrote:

Hi Susheel, Emir,

yes I check, and I have one result in c1 and in c2 with the same

query

fid:34520196

http://xxx.xxx.xxx.xxx:


/solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2

{ "responseHeader":{ "status":0, "QTime":1, "params":{

"fl":"fid,cc*,st",

"indent":"true", "q":"fid:34520196", "collection":"c1,c2",

"wt":"json"}},

"response":{"numFound":1,"start":0,"docs":[ {

  "id":"EP1680447",
  "st":"LAPSED",
  "fid":"34520196"}]
}
}


http://xxx.xxx.xxx.xxx:


/solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2

{
"responseHeader":{
  "status":0,
  "QTime":0,
  "params":{
"fl":"id,fid,cc*,st",
"indent":"true",
"q":"fid:34520196",
"collection":"c1,c2",
"wt":"json"}},
"response":{"numFound":1,"start":0,"docs":[
{
  "id":"WO2005040212",
  "st":"PENDING",
  "cc_CA":"LAPSED",
  "cc_EP":"LAPSED",
  "cc_JP":"PENDING",
  "cc_US":"LAPSED",
  "fid":"34520196"}]
}}


I have the same xxx.xxx.xxx.xxx: (server:port).
unique key field C1, C2 : id

id data in C1 is different of id data in C2

Must I config/set something in solr ?

thanks,
Bruno


Le 06/01/2016 14:56, Emir Arnautovic a écrit :


Hi Bruno,
Can you check counts? Is it possible that first page is only with

results

from collection that you sent request to so you assumed it returns

only

results from single collection?

Thanks,
Emir


On 06.01.2016 14:33, Susheel Kumar wrote:

Hi Bruno,

I just tested this scenario in my local solr 5.3.1 and it returned
results
from two identical collections. I doubt if it is broken in 5.4 just
double
check if you are not missing anything else.

Thanks,
Susheel




http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2

responseHeader": {"status": 0,"QTime": 98,"params": {"q":
"id_type:hello","
indent": "true","collection": "c1,c2","wt": "json"}},
response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id":

"1","

id_type": "hello","_version_": 1522623395043213300},{"id":
"3","id_type":"
hello","_version_": 1522623422397415400}]}

On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina 

wrote:

yes id value is unique in C1 and unique in C2.

id in C1 is never present in C2
id in C2 is never present in C1


Le 06/01/2016 11:12, Binoy Dalal a écrit :

Are Id values for docs in both the collections exactly same?

To get proper results, the ids should be unique across both the

cores.

On Wed, 6 Jan 2016, 15:11 Bruno Mannina 

wrote:

Hi All,


Solr 5.4, Ubuntu

I thought it was simple to request across two collections with

the

same
schema but not.
I have one solr instance launch. 300 000 records in each

collection.

I try to use this request without having both results:

http://my_adress:my_port
/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json

this request returns only C1 results and if I do:

http://my_adress:my_port
/solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json

it returns only C2 results.

I have 5 identical fields on both collection
id, fid, st, cc, timestamp
where id is the unique key field.

Can someone could explain me why it doesn't work ?

Thanks a lot !
Bruno

---
L'absence de virus dans ce courrier électronique a été vérifiée

par le

logiciel antivirus Avast.
http://www.avast.com

--

Regards,

Binoy Dalal


---

L'absence de virus dans ce courrier électronique a été vérifiée

par le

logiciel antivirus Avast.
http://www.avast.com

---
L'absence de v

Re: Solr server not starting

2016-01-06 Thread Erick Erickson
I doubt we'll be much help, it's probably best to talk to the
echoprint people, assuming any are still available. I took a quick
look at the project and the Solr implementation is from 4+ years
ago...

Best,
Erick

On Wed, Jan 6, 2016 at 8:11 AM, agonn Qurdina  wrote:
> Hi,
>
>
>
> I am using Solr server with Echoprint service
> (https://github.com/echonest/echoprint-server). The first time I started
>  it everything worked perfectly. This is the way I started it:
>
>
>
> java -Dsolr.solr.home=/home/echoprint-server/solr/solr/solr/ 
> -Djava.awt.headless=true -Xmx2048m -Xms2048m -jar start.jar
>
>
>
> Then I stopped it and I cannot start it anymore as it gets stuck at the 3rd 
> row of execution:
>
>
>
> 2016-01-06 11:04:19.030::INFO:  Logging to STDERR via 
> org.mortbay.log.StdErrLog
>
> 2016-01-06 11:04:19.165::INFO:  jetty-6.1.3
>
> 2016-01-06 11:04:19.231::INFO:  Extract
> jar:file:/home/echoprint-server/solr/solr/webapps/solr.war!/ to
> /tmp/Jetty_0_0_0_0_8502_solr.war__solr__-rnc92a/webapp
>
>
> It does not continue to execute anymore. I check if it is running in the
>  processes list and it turns out it is NOT. Please help me to solve this
>  problem!
>
>
>
> Best regards,
>
> Agon
>


Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Binoy Dalal
Bruno,
Use join like so:
{!join from=f1 to=f2 fromIndex=c2}
On c1

On Wed, 6 Jan 2016, 22:30 Bruno Mannina  wrote:

> Hi Ester,
>
> yes, i saw it, but if I use:
>
> q={!join from=fid to=fid}fid:34520196 (with or not &collection=c1,c2)
>
> I have only the result from the collection used in the select/c1
>
> Le 06/01/2016 17:52, esther.quan...@lucidworks.com a écrit :
> > Hi Bruno,
> >
> > You might consider using the JoinQueryParser. Details here :
> https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-JoinQueryParser
> >
> > Best,
> > Esther
> >
> >> Le 6 janv. 2016 à 08:48, Bruno Mannina  a écrit :
> >>
> >> Same result on my dev' server, it seems that collection param haven't
> effect on the query...
> >>
> >> Q: I don't see on the solr 5.4 doc, the "collection" param for select
> handler, is it always present in 5.4 version ?
> >>
> >> Le 06/01/2016 17:38, Bruno Mannina a écrit :
> >>> I have a dev' server, I will do some test on it...
> >>>
> >>> Le 06/01/2016 17:31, Susheel Kumar a écrit :
>  I'll suggest if you can setup some some test data locally and try this
>  out.  This will confirm your understanding.
> 
>  Thanks,
>  Susheel
> 
> > On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina 
> wrote:
> >
> > Hi Susheel, Emir,
> >
> > yes I check, and I have one result in c1 and in c2 with the same
> query
> > fid:34520196
> >
> > http://xxx.xxx.xxx.xxx:
> >
> /solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2
> >
> > { "responseHeader":{ "status":0, "QTime":1, "params":{
> "fl":"fid,cc*,st",
> > "indent":"true", "q":"fid:34520196", "collection":"c1,c2",
> "wt":"json"}},
> > "response":{"numFound":1,"start":0,"docs":[ {
> >
> >  "id":"EP1680447",
> >  "st":"LAPSED",
> >  "fid":"34520196"}]
> >}
> > }
> >
> >
> > http://xxx.xxx.xxx.xxx:
> >
> /solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2
> >
> > {
> >"responseHeader":{
> >  "status":0,
> >  "QTime":0,
> >  "params":{
> >"fl":"id,fid,cc*,st",
> >"indent":"true",
> >"q":"fid:34520196",
> >"collection":"c1,c2",
> >"wt":"json"}},
> >"response":{"numFound":1,"start":0,"docs":[
> >{
> >  "id":"WO2005040212",
> >  "st":"PENDING",
> >  "cc_CA":"LAPSED",
> >  "cc_EP":"LAPSED",
> >  "cc_JP":"PENDING",
> >  "cc_US":"LAPSED",
> >  "fid":"34520196"}]
> >}}
> >
> >
> > I have the same xxx.xxx.xxx.xxx: (server:port).
> > unique key field C1, C2 : id
> >
> > id data in C1 is different of id data in C2
> >
> > Must I config/set something in solr ?
> >
> > thanks,
> > Bruno
> >
> >
> > Le 06/01/2016 14:56, Emir Arnautovic a écrit :
> >
> >> Hi Bruno,
> >> Can you check counts? Is it possible that first page is only with
> results
> >> from collection that you sent request to so you assumed it returns
> only
> >> results from single collection?
> >>
> >> Thanks,
> >> Emir
> >>
> >>> On 06.01.2016 14:33, Susheel Kumar wrote:
> >>>
> >>> Hi Bruno,
> >>>
> >>> I just tested this scenario in my local solr 5.3.1 and it returned
> >>> results
> >>> from two identical collections. I doubt if it is broken in 5.4 just
> >>> double
> >>> check if you are not missing anything else.
> >>>
> >>> Thanks,
> >>> Susheel
> >>>
> >>>
> >>>
> http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2
> >>>
> >>> responseHeader": {"status": 0,"QTime": 98,"params": {"q":
> >>> "id_type:hello","
> >>> indent": "true","collection": "c1,c2","wt": "json"}},
> >>> response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id":
> "1","
> >>> id_type": "hello","_version_": 1522623395043213300},{"id":
> >>> "3","id_type":"
> >>> hello","_version_": 1522623422397415400}]}
> >>>
> >>> On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina 
> wrote:
> >>>
> >>> yes id value is unique in C1 and unique in C2.
>  id in C1 is never present in C2
>  id in C2 is never present in C1
> 
> 
>  Le 06/01/2016 11:12, Binoy Dalal a écrit :
> 
>  Are Id values for docs in both the collections exactly same?
> > To get proper results, the ids should be unique across both the
> cores.
> >
> > On Wed, 6 Jan 2016, 15:11 Bruno Mannina 
> wrote:
> >
> > Hi All,
> >
> >> Solr 5.4, Ubuntu
> >>
> >> I thought it was simple to request across two collections with
> the
> >> same
> >> schema but not.
> >> I 

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina

Hi Ester,

yes, i saw it, but if I use:

q={!join from=fid to=fid}fid:34520196 (with or not &collection=c1,c2)

I have only the result from the collection used in the select/c1

Le 06/01/2016 17:52, esther.quan...@lucidworks.com a écrit :

Hi Bruno,

You might consider using the JoinQueryParser. Details here : 
https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-JoinQueryParser

Best,
Esther


Le 6 janv. 2016 à 08:48, Bruno Mannina  a écrit :

Same result on my dev' server, it seems that collection param haven't effect on 
the query...

Q: I don't see on the solr 5.4 doc, the "collection" param for select handler, 
is it always present in 5.4 version ?

Le 06/01/2016 17:38, Bruno Mannina a écrit :

I have a dev' server, I will do some test on it...

Le 06/01/2016 17:31, Susheel Kumar a écrit :

I'll suggest if you can setup some some test data locally and try this
out.  This will confirm your understanding.

Thanks,
Susheel


On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina  wrote:

Hi Susheel, Emir,

yes I check, and I have one result in c1 and in c2 with the same query
fid:34520196

http://xxx.xxx.xxx.xxx:
/solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2

{ "responseHeader":{ "status":0, "QTime":1, "params":{ "fl":"fid,cc*,st",
"indent":"true", "q":"fid:34520196", "collection":"c1,c2", "wt":"json"}},
"response":{"numFound":1,"start":0,"docs":[ {

 "id":"EP1680447",
 "st":"LAPSED",
 "fid":"34520196"}]
   }
}


http://xxx.xxx.xxx.xxx:
/solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2

{
   "responseHeader":{
 "status":0,
 "QTime":0,
 "params":{
   "fl":"id,fid,cc*,st",
   "indent":"true",
   "q":"fid:34520196",
   "collection":"c1,c2",
   "wt":"json"}},
   "response":{"numFound":1,"start":0,"docs":[
   {
 "id":"WO2005040212",
 "st":"PENDING",
 "cc_CA":"LAPSED",
 "cc_EP":"LAPSED",
 "cc_JP":"PENDING",
 "cc_US":"LAPSED",
 "fid":"34520196"}]
   }}


I have the same xxx.xxx.xxx.xxx: (server:port).
unique key field C1, C2 : id

id data in C1 is different of id data in C2

Must I config/set something in solr ?

thanks,
Bruno


Le 06/01/2016 14:56, Emir Arnautovic a écrit :


Hi Bruno,
Can you check counts? Is it possible that first page is only with results
from collection that you sent request to so you assumed it returns only
results from single collection?

Thanks,
Emir


On 06.01.2016 14:33, Susheel Kumar wrote:

Hi Bruno,

I just tested this scenario in my local solr 5.3.1 and it returned
results
from two identical collections. I doubt if it is broken in 5.4 just
double
check if you are not missing anything else.

Thanks,
Susheel


http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2

responseHeader": {"status": 0,"QTime": 98,"params": {"q":
"id_type:hello","
indent": "true","collection": "c1,c2","wt": "json"}},
response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id": "1","
id_type": "hello","_version_": 1522623395043213300},{"id":
"3","id_type":"
hello","_version_": 1522623422397415400}]}

On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina  wrote:

yes id value is unique in C1 and unique in C2.

id in C1 is never present in C2
id in C2 is never present in C1


Le 06/01/2016 11:12, Binoy Dalal a écrit :

Are Id values for docs in both the collections exactly same?

To get proper results, the ids should be unique across both the cores.

On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:

Hi All,


Solr 5.4, Ubuntu

I thought it was simple to request across two collections with the
same
schema but not.
I have one solr instance launch. 300 000 records in each collection.

I try to use this request without having both results:

http://my_adress:my_port
/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json

this request returns only C1 results and if I do:

http://my_adress:my_port
/solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json

it returns only C2 results.

I have 5 identical fields on both collection
id, fid, st, cc, timestamp
where id is the unique key field.

Can someone could explain me why it doesn't work ?

Thanks a lot !
Bruno

---
L'absence de virus dans ce courrier électronique a été vérifiée par le
logiciel antivirus Avast.
http://www.avast.com

--

Regards,

Binoy Dalal


---

L'absence de virus dans ce courrier électronique a été vérifiée par le
logiciel antivirus Avast.
http://www.avast.com

---
L'absence de virus dans ce courrier électronique a été vérifiée par le
logiciel antivirus Avast.
http://www.avast.com


---
L'absence de virus dans ce courrier électronique a été vérifiée par le logiciel 
antivirus Avast.
http://www.avast.com


---
L'absence de virus dans ce courrier électronique a été vérifiée par le logiciel 
antivirus Avast.
http://www.avast.com




---
L'absence de virus dans ce courrier électronique a été vérifiée par 

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread esther . quansah
Hi Bruno,

You might consider using the JoinQueryParser. Details here : 
https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-JoinQueryParser

Best,
Esther

> Le 6 janv. 2016 à 08:48, Bruno Mannina  a écrit :
> 
> Same result on my dev' server, it seems that collection param haven't effect 
> on the query...
> 
> Q: I don't see on the solr 5.4 doc, the "collection" param for select 
> handler, is it always present in 5.4 version ?
> 
> Le 06/01/2016 17:38, Bruno Mannina a écrit :
>> I have a dev' server, I will do some test on it...
>> 
>> Le 06/01/2016 17:31, Susheel Kumar a écrit :
>>> I'll suggest if you can setup some some test data locally and try this
>>> out.  This will confirm your understanding.
>>> 
>>> Thanks,
>>> Susheel
>>> 
 On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina  wrote:
 
 Hi Susheel, Emir,
 
 yes I check, and I have one result in c1 and in c2 with the same query
 fid:34520196
 
 http://xxx.xxx.xxx.xxx:
 /solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2
  
 
 { "responseHeader":{ "status":0, "QTime":1, "params":{ "fl":"fid,cc*,st",
 "indent":"true", "q":"fid:34520196", "collection":"c1,c2", "wt":"json"}},
 "response":{"numFound":1,"start":0,"docs":[ {
 
 "id":"EP1680447",
 "st":"LAPSED",
 "fid":"34520196"}]
   }
 }
 
 
 http://xxx.xxx.xxx.xxx:
 /solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2
  
 
 {
   "responseHeader":{
 "status":0,
 "QTime":0,
 "params":{
   "fl":"id,fid,cc*,st",
   "indent":"true",
   "q":"fid:34520196",
   "collection":"c1,c2",
   "wt":"json"}},
   "response":{"numFound":1,"start":0,"docs":[
   {
 "id":"WO2005040212",
 "st":"PENDING",
 "cc_CA":"LAPSED",
 "cc_EP":"LAPSED",
 "cc_JP":"PENDING",
 "cc_US":"LAPSED",
 "fid":"34520196"}]
   }}
 
 
 I have the same xxx.xxx.xxx.xxx: (server:port).
 unique key field C1, C2 : id
 
 id data in C1 is different of id data in C2
 
 Must I config/set something in solr ?
 
 thanks,
 Bruno
 
 
 Le 06/01/2016 14:56, Emir Arnautovic a écrit :
 
> Hi Bruno,
> Can you check counts? Is it possible that first page is only with results
> from collection that you sent request to so you assumed it returns only
> results from single collection?
> 
> Thanks,
> Emir
> 
>> On 06.01.2016 14:33, Susheel Kumar wrote:
>> 
>> Hi Bruno,
>> 
>> I just tested this scenario in my local solr 5.3.1 and it returned
>> results
>> from two identical collections. I doubt if it is broken in 5.4 just
>> double
>> check if you are not missing anything else.
>> 
>> Thanks,
>> Susheel
>> 
>> 
>> http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2
>>  
>> 
>> responseHeader": {"status": 0,"QTime": 98,"params": {"q":
>> "id_type:hello","
>> indent": "true","collection": "c1,c2","wt": "json"}},
>> response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id": "1","
>> id_type": "hello","_version_": 1522623395043213300},{"id":
>> "3","id_type":"
>> hello","_version_": 1522623422397415400}]}
>> 
>> On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina  wrote:
>> 
>> yes id value is unique in C1 and unique in C2.
>>> id in C1 is never present in C2
>>> id in C2 is never present in C1
>>> 
>>> 
>>> Le 06/01/2016 11:12, Binoy Dalal a écrit :
>>> 
>>> Are Id values for docs in both the collections exactly same?
 To get proper results, the ids should be unique across both the cores.
 
 On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:
 
 Hi All,
 
> Solr 5.4, Ubuntu
> 
> I thought it was simple to request across two collections with the
> same
> schema but not.
> I have one solr instance launch. 300 000 records in each collection.
> 
> I try to use this request without having both results:
> 
> http://my_adress:my_port
> /solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json
> 
> this request returns only C1 results and if I do:
> 
> http://my_adress:my_port
> /solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json
> 
> it returns only C2 results.
> 
> I have 5 identical fields on both collection
> id, fid, st, cc, timestamp
> where id is the unique key field.
> 
> Can someone could explain me why it doesn't work ?
> 
> Thanks a lot !
> Bruno
> 

Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina
Same result on my dev' server, it seems that collection param haven't 
effect on the query...


Q: I don't see on the solr 5.4 doc, the "collection" param for select 
handler, is it always present in 5.4 version ?


Le 06/01/2016 17:38, Bruno Mannina a écrit :

I have a dev' server, I will do some test on it...

Le 06/01/2016 17:31, Susheel Kumar a écrit :

I'll suggest if you can setup some some test data locally and try this
out.  This will confirm your understanding.

Thanks,
Susheel

On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina  wrote:


Hi Susheel, Emir,

yes I check, and I have one result in c1 and in c2 with the same query
fid:34520196

http://xxx.xxx.xxx.xxx:
/solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2 



{ "responseHeader":{ "status":0, "QTime":1, "params":{ 
"fl":"fid,cc*,st",
"indent":"true", "q":"fid:34520196", "collection":"c1,c2", 
"wt":"json"}},

"response":{"numFound":1,"start":0,"docs":[ {

 "id":"EP1680447",
 "st":"LAPSED",
 "fid":"34520196"}]
   }
}


http://xxx.xxx.xxx.xxx:
/solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2 



{
   "responseHeader":{
 "status":0,
 "QTime":0,
 "params":{
   "fl":"id,fid,cc*,st",
   "indent":"true",
   "q":"fid:34520196",
   "collection":"c1,c2",
   "wt":"json"}},
   "response":{"numFound":1,"start":0,"docs":[
   {
 "id":"WO2005040212",
 "st":"PENDING",
 "cc_CA":"LAPSED",
 "cc_EP":"LAPSED",
 "cc_JP":"PENDING",
 "cc_US":"LAPSED",
 "fid":"34520196"}]
   }}


I have the same xxx.xxx.xxx.xxx: (server:port).
unique key field C1, C2 : id

id data in C1 is different of id data in C2

Must I config/set something in solr ?

thanks,
Bruno


Le 06/01/2016 14:56, Emir Arnautovic a écrit :


Hi Bruno,
Can you check counts? Is it possible that first page is only with 
results
from collection that you sent request to so you assumed it returns 
only

results from single collection?

Thanks,
Emir

On 06.01.2016 14:33, Susheel Kumar wrote:


Hi Bruno,

I just tested this scenario in my local solr 5.3.1 and it returned
results
from two identical collections. I doubt if it is broken in 5.4 just
double
check if you are not missing anything else.

Thanks,
Susheel


http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2 



responseHeader": {"status": 0,"QTime": 98,"params": {"q":
"id_type:hello","
indent": "true","collection": "c1,c2","wt": "json"}},
response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id": 
"1","

id_type": "hello","_version_": 1522623395043213300},{"id":
"3","id_type":"
hello","_version_": 1522623422397415400}]}

On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina  
wrote:


yes id value is unique in C1 and unique in C2.

id in C1 is never present in C2
id in C2 is never present in C1


Le 06/01/2016 11:12, Binoy Dalal a écrit :

Are Id values for docs in both the collections exactly same?
To get proper results, the ids should be unique across both the 
cores.


On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:

Hi All,


Solr 5.4, Ubuntu

I thought it was simple to request across two collections with the
same
schema but not.
I have one solr instance launch. 300 000 records in each 
collection.


I try to use this request without having both results:

http://my_adress:my_port
/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json

this request returns only C1 results and if I do:

http://my_adress:my_port
/solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json

it returns only C2 results.

I have 5 identical fields on both collection
id, fid, st, cc, timestamp
where id is the unique key field.

Can someone could explain me why it doesn't work ?

Thanks a lot !
Bruno

---
L'absence de virus dans ce courrier électronique a été vérifiée 
par le

logiciel antivirus Avast.
http://www.avast.com

--

Regards,

Binoy Dalal


---
L'absence de virus dans ce courrier électronique a été vérifiée 
par le

logiciel antivirus Avast.
http://www.avast.com




---
L'absence de virus dans ce courrier électronique a été vérifiée par le
logiciel antivirus Avast.
http://www.avast.com





---
L'absence de virus dans ce courrier électronique a été vérifiée par le 
logiciel antivirus Avast.

http://www.avast.com






---
L'absence de virus dans ce courrier électronique a été vérifiée par le logiciel 
antivirus Avast.
http://www.avast.com



Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina

I have a dev' server, I will do some test on it...

Le 06/01/2016 17:31, Susheel Kumar a écrit :

I'll suggest if you can setup some some test data locally and try this
out.  This will confirm your understanding.

Thanks,
Susheel

On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina  wrote:


Hi Susheel, Emir,

yes I check, and I have one result in c1 and in c2 with the same query
fid:34520196

http://xxx.xxx.xxx.xxx:
/solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2

{ "responseHeader":{ "status":0, "QTime":1, "params":{ "fl":"fid,cc*,st",
"indent":"true", "q":"fid:34520196", "collection":"c1,c2", "wt":"json"}},
"response":{"numFound":1,"start":0,"docs":[ {

 "id":"EP1680447",
 "st":"LAPSED",
 "fid":"34520196"}]
   }
}


http://xxx.xxx.xxx.xxx:
/solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2

{
   "responseHeader":{
 "status":0,
 "QTime":0,
 "params":{
   "fl":"id,fid,cc*,st",
   "indent":"true",
   "q":"fid:34520196",
   "collection":"c1,c2",
   "wt":"json"}},
   "response":{"numFound":1,"start":0,"docs":[
   {
 "id":"WO2005040212",
 "st":"PENDING",
 "cc_CA":"LAPSED",
 "cc_EP":"LAPSED",
 "cc_JP":"PENDING",
 "cc_US":"LAPSED",
 "fid":"34520196"}]
   }}


I have the same xxx.xxx.xxx.xxx: (server:port).
unique key field C1, C2 : id

id data in C1 is different of id data in C2

Must I config/set something in solr ?

thanks,
Bruno


Le 06/01/2016 14:56, Emir Arnautovic a écrit :


Hi Bruno,
Can you check counts? Is it possible that first page is only with results
from collection that you sent request to so you assumed it returns only
results from single collection?

Thanks,
Emir

On 06.01.2016 14:33, Susheel Kumar wrote:


Hi Bruno,

I just tested this scenario in my local solr 5.3.1 and it returned
results
from two identical collections. I doubt if it is broken in 5.4 just
double
check if you are not missing anything else.

Thanks,
Susheel


http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2

responseHeader": {"status": 0,"QTime": 98,"params": {"q":
"id_type:hello","
indent": "true","collection": "c1,c2","wt": "json"}},
response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id": "1","
id_type": "hello","_version_": 1522623395043213300},{"id":
"3","id_type":"
hello","_version_": 1522623422397415400}]}

On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina  wrote:

yes id value is unique in C1 and unique in C2.

id in C1 is never present in C2
id in C2 is never present in C1


Le 06/01/2016 11:12, Binoy Dalal a écrit :

Are Id values for docs in both the collections exactly same?

To get proper results, the ids should be unique across both the cores.

On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:

Hi All,


Solr 5.4, Ubuntu

I thought it was simple to request across two collections with the
same
schema but not.
I have one solr instance launch. 300 000 records in each collection.

I try to use this request without having both results:

http://my_adress:my_port
/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json

this request returns only C1 results and if I do:

http://my_adress:my_port
/solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json

it returns only C2 results.

I have 5 identical fields on both collection
id, fid, st, cc, timestamp
where id is the unique key field.

Can someone could explain me why it doesn't work ?

Thanks a lot !
Bruno

---
L'absence de virus dans ce courrier électronique a été vérifiée par le
logiciel antivirus Avast.
http://www.avast.com

--

Regards,

Binoy Dalal


---

L'absence de virus dans ce courrier électronique a été vérifiée par le
logiciel antivirus Avast.
http://www.avast.com




---
L'absence de virus dans ce courrier électronique a été vérifiée par le
logiciel antivirus Avast.
http://www.avast.com





---
L'absence de virus dans ce courrier électronique a été vérifiée par le logiciel 
antivirus Avast.
http://www.avast.com



Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Susheel Kumar
I'll suggest if you can setup some some test data locally and try this
out.  This will confirm your understanding.

Thanks,
Susheel

On Wed, Jan 6, 2016 at 10:39 AM, Bruno Mannina  wrote:

> Hi Susheel, Emir,
>
> yes I check, and I have one result in c1 and in c2 with the same query
> fid:34520196
>
> http://xxx.xxx.xxx.xxx:
> /solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2
>
> { "responseHeader":{ "status":0, "QTime":1, "params":{ "fl":"fid,cc*,st",
> "indent":"true", "q":"fid:34520196", "collection":"c1,c2", "wt":"json"}},
> "response":{"numFound":1,"start":0,"docs":[ {
>
> "id":"EP1680447",
> "st":"LAPSED",
> "fid":"34520196"}]
>   }
> }
>
>
> http://xxx.xxx.xxx.xxx:
> /solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2
>
> {
>   "responseHeader":{
> "status":0,
> "QTime":0,
> "params":{
>   "fl":"id,fid,cc*,st",
>   "indent":"true",
>   "q":"fid:34520196",
>   "collection":"c1,c2",
>   "wt":"json"}},
>   "response":{"numFound":1,"start":0,"docs":[
>   {
> "id":"WO2005040212",
> "st":"PENDING",
> "cc_CA":"LAPSED",
> "cc_EP":"LAPSED",
> "cc_JP":"PENDING",
> "cc_US":"LAPSED",
> "fid":"34520196"}]
>   }}
>
>
> I have the same xxx.xxx.xxx.xxx: (server:port).
> unique key field C1, C2 : id
>
> id data in C1 is different of id data in C2
>
> Must I config/set something in solr ?
>
> thanks,
> Bruno
>
>
> Le 06/01/2016 14:56, Emir Arnautovic a écrit :
>
>> Hi Bruno,
>> Can you check counts? Is it possible that first page is only with results
>> from collection that you sent request to so you assumed it returns only
>> results from single collection?
>>
>> Thanks,
>> Emir
>>
>> On 06.01.2016 14:33, Susheel Kumar wrote:
>>
>>> Hi Bruno,
>>>
>>> I just tested this scenario in my local solr 5.3.1 and it returned
>>> results
>>> from two identical collections. I doubt if it is broken in 5.4 just
>>> double
>>> check if you are not missing anything else.
>>>
>>> Thanks,
>>> Susheel
>>>
>>>
>>> http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2
>>>
>>> responseHeader": {"status": 0,"QTime": 98,"params": {"q":
>>> "id_type:hello","
>>> indent": "true","collection": "c1,c2","wt": "json"}},
>>> response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id": "1","
>>> id_type": "hello","_version_": 1522623395043213300},{"id":
>>> "3","id_type": "
>>> hello","_version_": 1522623422397415400}]}
>>>
>>> On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina  wrote:
>>>
>>> yes id value is unique in C1 and unique in C2.
 id in C1 is never present in C2
 id in C2 is never present in C1


 Le 06/01/2016 11:12, Binoy Dalal a écrit :

 Are Id values for docs in both the collections exactly same?
> To get proper results, the ids should be unique across both the cores.
>
> On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:
>
> Hi All,
>
>> Solr 5.4, Ubuntu
>>
>> I thought it was simple to request across two collections with the
>> same
>> schema but not.
>> I have one solr instance launch. 300 000 records in each collection.
>>
>> I try to use this request without having both results:
>>
>> http://my_adress:my_port
>> /solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json
>>
>> this request returns only C1 results and if I do:
>>
>> http://my_adress:my_port
>> /solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json
>>
>> it returns only C2 results.
>>
>> I have 5 identical fields on both collection
>> id, fid, st, cc, timestamp
>> where id is the unique key field.
>>
>> Can someone could explain me why it doesn't work ?
>>
>> Thanks a lot !
>> Bruno
>>
>> ---
>> L'absence de virus dans ce courrier électronique a été vérifiée par le
>> logiciel antivirus Avast.
>> http://www.avast.com
>>
>> --
>>
>> Regards,
> Binoy Dalal
>
>
> ---
 L'absence de virus dans ce courrier électronique a été vérifiée par le
 logiciel antivirus Avast.
 http://www.avast.com



>>
>
> ---
> L'absence de virus dans ce courrier électronique a été vérifiée par le
> logiciel antivirus Avast.
> http://www.avast.com
>
>


Solr server not starting

2016-01-06 Thread agonn Qurdina
Hi,



I am using Solr server with Echoprint service 
(https://github.com/echonest/echoprint-server). The first time I started
 it everything worked perfectly. This is the way I started it:



java -Dsolr.solr.home=/home/echoprint-server/solr/solr/solr/ 
-Djava.awt.headless=true -Xmx2048m -Xms2048m -jar start.jar



Then I stopped it and I cannot start it anymore as it gets stuck at the 3rd row 
of execution:



2016-01-06 11:04:19.030::INFO:  Logging to STDERR via org.mortbay.log.StdErrLog

2016-01-06 11:04:19.165::INFO:  jetty-6.1.3

2016-01-06 11:04:19.231::INFO:  Extract 
jar:file:/home/echoprint-server/solr/solr/webapps/solr.war!/ to 
/tmp/Jetty_0_0_0_0_8502_solr.war__solr__-rnc92a/webapp


It does not continue to execute anymore. I check if it is running in the
 processes list and it turns out it is NOT. Please help me to solve this
 problem!



Best regards,

Agon
  

Re: Query behavior difference.

2016-01-06 Thread Jack Krupansky
The motivation for the constant-score rewrite is simply performance. As per
the Javadoc:

"*This method is faster than the BooleanQuery rewrite methods when the
number of matched terms or matched documents is non-trivial. Also, it will
never hit an errant BooleanQuery.TooManyClauses exception.*"

So that's a second reason - to avoid the max clause count limitation of
Boolean Query.

See:
https://lucene.apache.org/core/5_4_0/core/org/apache/lucene/search/MultiTermQuery.html#CONSTANT_SCORE_REWRITE
https://lucene.apache.org/core/5_4_0/core/org/apache/lucene/search/WildcardQuery.html


-- Jack Krupansky

On Wed, Jan 6, 2016 at 6:07 AM, Modassar Ather 
wrote:

> Please help me understand why queries like wildcard, prefix and few others
> are re-written into constant score query?
> Why the scoring factors are not taken into consideration in such queries?
>
> Please correct me if I am wrong that this behavior is per the query type
> irrespective of the parser used.
>
> Thanks,
> Modassar
>
> On Wed, Jan 6, 2016 at 12:56 PM, Modassar Ather 
> wrote:
>
> > Thanks for your response Ahmet.
> >
> > Best,
> > Modassar
> >
> > On Mon, Jan 4, 2016 at 5:07 PM, Ahmet Arslan 
> > wrote:
> >
> >> Hi,
> >>
> >> I think wildcard queries fl:networ* are re-written into Constant Score
> >> Query.
> >> fl=*,score should returns same score for all documents that are
> retrieved.
> >>
> >> Ahmet
> >>
> >>
> >>
> >> On Monday, January 4, 2016 12:22 PM, Modassar Ather <
> >> modather1...@gmail.com> wrote:
> >> Hi,
> >>
> >> Kindly help me understand how will relevance ranking differ int
> following
> >> searches.
> >>
> >> query : fl:network
> >> query : fl:networ*
> >>
> >> What I am observing that the results returned are different in both of
> >> them
> >> in a way that the top documents returned for q=fl:network is not present
> >> in
> >> the top results of q=fl:networ*.
> >> For example for q=fl:network I am getting top documents having around 20
> >> occurrence of network whereas the top result of q=fl:networ* has only
> >> couple of occurrence of network.
> >> I am aware of the underlying normalization process participation in
> >> relevance ranking of documents but not able to understand such a
> >> difference
> >> in the ranking of result for the queries.
> >>
> >> Thanks,
> >> Modassar
> >>
> >
> >
>


Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina

Hi Susheel, Emir,

yes I check, and I have one result in c1 and in c2 with the same query 
fid:34520196


http://xxx.xxx.xxx.xxx:/solr/c1/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2

{ "responseHeader":{ "status":0, "QTime":1, "params":{ 
"fl":"fid,cc*,st", "indent":"true", "q":"fid:34520196", 
"collection":"c1,c2", "wt":"json"}}, 
"response":{"numFound":1,"start":0,"docs":[ {


"id":"EP1680447",
"st":"LAPSED",
"fid":"34520196"}]
  }
}


http://xxx.xxx.xxx.xxx:/solr/c2/select?q=fid:34520196&wt=json&indent=true&fl=id,fid,cc*,st&collection=c1,c2

{
  "responseHeader":{
"status":0,
"QTime":0,
"params":{
  "fl":"id,fid,cc*,st",
  "indent":"true",
  "q":"fid:34520196",
  "collection":"c1,c2",
  "wt":"json"}},
  "response":{"numFound":1,"start":0,"docs":[
  {
"id":"WO2005040212",
"st":"PENDING",
"cc_CA":"LAPSED",
"cc_EP":"LAPSED",
"cc_JP":"PENDING",
"cc_US":"LAPSED",
"fid":"34520196"}]
  }}


I have the same xxx.xxx.xxx.xxx: (server:port).
unique key field C1, C2 : id

id data in C1 is different of id data in C2

Must I config/set something in solr ?

thanks,
Bruno

Le 06/01/2016 14:56, Emir Arnautovic a écrit :

Hi Bruno,
Can you check counts? Is it possible that first page is only with 
results from collection that you sent request to so you assumed it 
returns only results from single collection?


Thanks,
Emir

On 06.01.2016 14:33, Susheel Kumar wrote:

Hi Bruno,

I just tested this scenario in my local solr 5.3.1 and it returned 
results
from two identical collections. I doubt if it is broken in 5.4 just 
double

check if you are not missing anything else.

Thanks,
Susheel

http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2 



responseHeader": {"status": 0,"QTime": 98,"params": {"q": 
"id_type:hello","

indent": "true","collection": "c1,c2","wt": "json"}},
response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id": "1","
id_type": "hello","_version_": 1522623395043213300},{"id": 
"3","id_type": "

hello","_version_": 1522623422397415400}]}

On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina  wrote:


yes id value is unique in C1 and unique in C2.
id in C1 is never present in C2
id in C2 is never present in C1


Le 06/01/2016 11:12, Binoy Dalal a écrit :


Are Id values for docs in both the collections exactly same?
To get proper results, the ids should be unique across both the cores.

On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:

Hi All,

Solr 5.4, Ubuntu

I thought it was simple to request across two collections with the 
same

schema but not.
I have one solr instance launch. 300 000 records in each collection.

I try to use this request without having both results:

http://my_adress:my_port
/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json

this request returns only C1 results and if I do:

http://my_adress:my_port
/solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json

it returns only C2 results.

I have 5 identical fields on both collection
id, fid, st, cc, timestamp
where id is the unique key field.

Can someone could explain me why it doesn't work ?

Thanks a lot !
Bruno

---
L'absence de virus dans ce courrier électronique a été vérifiée 
par le

logiciel antivirus Avast.
http://www.avast.com

--


Regards,
Binoy Dalal



---
L'absence de virus dans ce courrier électronique a été vérifiée par le
logiciel antivirus Avast.
http://www.avast.com







---
L'absence de virus dans ce courrier électronique a été vérifiée par le logiciel 
antivirus Avast.
http://www.avast.com



Re: Count multivalued field issue

2016-01-06 Thread Jack Krupansky
Out of curiosity, where did you get your example code from - so we can
assure that it gets corrected?

Here's a valid example, from de-dupe:


  
dedupe
  
...


Note it is the request handler for "/update", not the "update handler."

See:
https://cwiki.apache.org/confluence/display/solr/De-Duplication

It is unfortunate that such an example is not given in the actual update
request processor doc, which only shows an example for the Solr Cell
request handler:
https://cwiki.apache.org/confluence/display/solr/Update+Request+Processors

If that still doesn't work, be sure to provide detail of what the symptom
is rather than simply saying that it doesn't work.


-- Jack Krupansky

On Wed, Jan 6, 2016 at 8:43 AM, marotosg  wrote:

> Hi,
>
> I am trying to add a new field to my schema to add the number of items of a
> multivalued field.
> I am using solr 4.11
>
> These are my fields on *schema.xml*
>  multiValued="true" stored="true" />
> 
>
> Here is the update done to my *solrconfig.xml*. I created an
> updateRequestProcessorChain
> and add it to the update handler
>
> 
> 
> countfields
> 
> 
>
> 
>
>  EmailListS
>  EmailListCountD
>
>
>  EmailListCountD
>
>
>  EmailListCountD
>  0
>
>
>
>  
>
> Am I doing somwthing wrong here?
>
> Thanks for your help.
>
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Count-multivalued-field-issue-tp4248878.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Binoy Dalal
Another diagnostic you can perform is to query for a doc you know is on the
other collection and see if it is returned. If it is then you might simply
not be seeing the results as emir says.

On Wed, 6 Jan 2016, 19:27 Emir Arnautovic 
wrote:

> Hi Bruno,
> Can you check counts? Is it possible that first page is only with
> results from collection that you sent request to so you assumed it
> returns only results from single collection?
>
> Thanks,
> Emir
>
> On 06.01.2016 14:33, Susheel Kumar wrote:
> > Hi Bruno,
> >
> > I just tested this scenario in my local solr 5.3.1 and it returned
> results
> > from two identical collections. I doubt if it is broken in 5.4 just
> double
> > check if you are not missing anything else.
> >
> > Thanks,
> > Susheel
> >
> >
> http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2
> >
> > responseHeader": {"status": 0,"QTime": 98,"params": {"q":
> "id_type:hello","
> > indent": "true","collection": "c1,c2","wt": "json"}},
> > response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id": "1","
> > id_type": "hello","_version_": 1522623395043213300},{"id":
> "3","id_type": "
> > hello","_version_": 1522623422397415400}]}
> >
> > On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina  wrote:
> >
> >> yes id value is unique in C1 and unique in C2.
> >> id in C1 is never present in C2
> >> id in C2 is never present in C1
> >>
> >>
> >> Le 06/01/2016 11:12, Binoy Dalal a écrit :
> >>
> >>> Are Id values for docs in both the collections exactly same?
> >>> To get proper results, the ids should be unique across both the cores.
> >>>
> >>> On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:
> >>>
> >>> Hi All,
>  Solr 5.4, Ubuntu
> 
>  I thought it was simple to request across two collections with the
> same
>  schema but not.
>  I have one solr instance launch. 300 000 records in each collection.
> 
>  I try to use this request without having both results:
> 
>  http://my_adress:my_port
>  /solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json
> 
>  this request returns only C1 results and if I do:
> 
>  http://my_adress:my_port
>  /solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json
> 
>  it returns only C2 results.
> 
>  I have 5 identical fields on both collection
>  id, fid, st, cc, timestamp
>  where id is the unique key field.
> 
>  Can someone could explain me why it doesn't work ?
> 
>  Thanks a lot !
>  Bruno
> 
>  ---
>  L'absence de virus dans ce courrier électronique a été vérifiée par le
>  logiciel antivirus Avast.
>  http://www.avast.com
> 
>  --
> 
> >>> Regards,
> >>> Binoy Dalal
> >>>
> >>>
> >> ---
> >> L'absence de virus dans ce courrier électronique a été vérifiée par le
> >> logiciel antivirus Avast.
> >> http://www.avast.com
> >>
> >>
>
> --
> Monitoring * Alerting * Anomaly Detection * Centralized Log Management
> Solr & Elasticsearch Support * http://sematext.com/
>
> --
Regards,
Binoy Dalal


Re: Count multivalued field issue

2016-01-06 Thread Binoy Dalal
You haven't given any name to your update handler. How are you planning to
call it?

On Wed, 6 Jan 2016, 19:41 GW  wrote:

> When dealing with Solr data you need to decide whether or not to go
> programming.
>
> when I want to count a multi-value I go programming.
>
>
> $count = array_count($array);
>
>
>
> On 6 January 2016 at 08:43, marotosg  wrote:
>
> > Hi,
> >
> > I am trying to add a new field to my schema to add the number of items
> of a
> > multivalued field.
> > I am using solr 4.11
> >
> > These are my fields on *schema.xml*
> >  > multiValued="true" stored="true" />
> > 
> >
> > Here is the update done to my *solrconfig.xml*. I created an
> > updateRequestProcessorChain
> > and add it to the update handler
> >
> > 
> > 
> > countfields
> > 
> > 
> >
> > 
> >
> >  EmailListS
> >  EmailListCountD
> >
> >
> >  EmailListCountD
> >
> >
> >  EmailListCountD
> >  0
> >
> >
> >
> >  
> >
> > Am I doing somwthing wrong here?
> >
> > Thanks for your help.
> >
> >
> >
> >
> > --
> > View this message in context:
> >
> http://lucene.472066.n3.nabble.com/Count-multivalued-field-issue-tp4248878.html
> > Sent from the Solr - User mailing list archive at Nabble.com.
> >
>
-- 
Regards,
Binoy Dalal


Re: Count multivalued field issue

2016-01-06 Thread GW
When dealing with Solr data you need to decide whether or not to go
programming.

when I want to count a multi-value I go programming.


$count = array_count($array);



On 6 January 2016 at 08:43, marotosg  wrote:

> Hi,
>
> I am trying to add a new field to my schema to add the number of items of a
> multivalued field.
> I am using solr 4.11
>
> These are my fields on *schema.xml*
>  multiValued="true" stored="true" />
> 
>
> Here is the update done to my *solrconfig.xml*. I created an
> updateRequestProcessorChain
> and add it to the update handler
>
> 
> 
> countfields
> 
> 
>
> 
>
>  EmailListS
>  EmailListCountD
>
>
>  EmailListCountD
>
>
>  EmailListCountD
>  0
>
>
>
>  
>
> Am I doing somwthing wrong here?
>
> Thanks for your help.
>
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Count-multivalued-field-issue-tp4248878.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Emir Arnautovic

Hi Bruno,
Can you check counts? Is it possible that first page is only with 
results from collection that you sent request to so you assumed it 
returns only results from single collection?


Thanks,
Emir

On 06.01.2016 14:33, Susheel Kumar wrote:

Hi Bruno,

I just tested this scenario in my local solr 5.3.1 and it returned results
from two identical collections. I doubt if it is broken in 5.4 just double
check if you are not missing anything else.

Thanks,
Susheel

http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2

responseHeader": {"status": 0,"QTime": 98,"params": {"q": "id_type:hello","
indent": "true","collection": "c1,c2","wt": "json"}},
response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id": "1","
id_type": "hello","_version_": 1522623395043213300},{"id": "3","id_type": "
hello","_version_": 1522623422397415400}]}

On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina  wrote:


yes id value is unique in C1 and unique in C2.
id in C1 is never present in C2
id in C2 is never present in C1


Le 06/01/2016 11:12, Binoy Dalal a écrit :


Are Id values for docs in both the collections exactly same?
To get proper results, the ids should be unique across both the cores.

On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:

Hi All,

Solr 5.4, Ubuntu

I thought it was simple to request across two collections with the same
schema but not.
I have one solr instance launch. 300 000 records in each collection.

I try to use this request without having both results:

http://my_adress:my_port
/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json

this request returns only C1 results and if I do:

http://my_adress:my_port
/solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json

it returns only C2 results.

I have 5 identical fields on both collection
id, fid, st, cc, timestamp
where id is the unique key field.

Can someone could explain me why it doesn't work ?

Thanks a lot !
Bruno

---
L'absence de virus dans ce courrier électronique a été vérifiée par le
logiciel antivirus Avast.
http://www.avast.com

--


Regards,
Binoy Dalal



---
L'absence de virus dans ce courrier électronique a été vérifiée par le
logiciel antivirus Avast.
http://www.avast.com




--
Monitoring * Alerting * Anomaly Detection * Centralized Log Management
Solr & Elasticsearch Support * http://sematext.com/



Count multivalued field issue

2016-01-06 Thread marotosg
Hi,

I am trying to add a new field to my schema to add the number of items of a
multivalued field.
I am using solr 4.11

These are my fields on *schema.xml*
  
  

Here is the update done to my *solrconfig.xml*. I created an
updateRequestProcessorChain 
and add it to the update handler



countfields




   
 EmailListS
 EmailListCountD
   
   
 EmailListCountD
   
   
 EmailListCountD
 0
   
   
   
 

Am I doing somwthing wrong here?

Thanks for your help.




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Count-multivalued-field-issue-tp4248878.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Susheel Kumar
Hi Bruno,

I just tested this scenario in my local solr 5.3.1 and it returned results
from two identical collections. I doubt if it is broken in 5.4 just double
check if you are not missing anything else.

Thanks,
Susheel

http://localhost:8983/solr/c1/select?q=id_type%3Ahello&wt=json&indent=true&collection=c1,c2

responseHeader": {"status": 0,"QTime": 98,"params": {"q": "id_type:hello","
indent": "true","collection": "c1,c2","wt": "json"}},
response": {"numFound": 2,"start": 0,"maxScore": 1,"docs": [{"id": "1","
id_type": "hello","_version_": 1522623395043213300},{"id": "3","id_type": "
hello","_version_": 1522623422397415400}]}

On Wed, Jan 6, 2016 at 6:13 AM, Bruno Mannina  wrote:

> yes id value is unique in C1 and unique in C2.
> id in C1 is never present in C2
> id in C2 is never present in C1
>
>
> Le 06/01/2016 11:12, Binoy Dalal a écrit :
>
>> Are Id values for docs in both the collections exactly same?
>> To get proper results, the ids should be unique across both the cores.
>>
>> On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:
>>
>> Hi All,
>>>
>>> Solr 5.4, Ubuntu
>>>
>>> I thought it was simple to request across two collections with the same
>>> schema but not.
>>> I have one solr instance launch. 300 000 records in each collection.
>>>
>>> I try to use this request without having both results:
>>>
>>> http://my_adress:my_port
>>> /solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json
>>>
>>> this request returns only C1 results and if I do:
>>>
>>> http://my_adress:my_port
>>> /solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json
>>>
>>> it returns only C2 results.
>>>
>>> I have 5 identical fields on both collection
>>> id, fid, st, cc, timestamp
>>> where id is the unique key field.
>>>
>>> Can someone could explain me why it doesn't work ?
>>>
>>> Thanks a lot !
>>> Bruno
>>>
>>> ---
>>> L'absence de virus dans ce courrier électronique a été vérifiée par le
>>> logiciel antivirus Avast.
>>> http://www.avast.com
>>>
>>> --
>>>
>> Regards,
>> Binoy Dalal
>>
>>
>
> ---
> L'absence de virus dans ce courrier électronique a été vérifiée par le
> logiciel antivirus Avast.
> http://www.avast.com
>
>


Re: core,Collection,Shard,Replication

2016-01-06 Thread Binoy Dalal
The machines part may have been a bit misleading. I am sorry for that. What
I actually meant was shards. Now, you can have multiple shards hosted on a
single machine or multiple machines as in the example I gave.

"I have to make sure that all those machines have solr server or gateway
should be deplyed ?"

Yes you do need a solr process running on all machines on which you plan to
distribute your index.

"And what multiple JVM processes run behind a solr server running?"

If you mean how many jvms are running for a solr server, the answer's 1.
"then what is a solr instance?"
One solr process on your machine.

On Wed, 6 Jan 2016, 18:33 vidya  wrote:

> Hi
> You described that sharding is to distribute data over multiple machines.Do
> I have to make sure that all those machines have solr server or gateway
> should be deplyed ?
> And what multiple JVM processes run behind a solr server running?
> I wanted to know what is a node. -> I understood like a mchine with solr
> server deployed.
> then what is a solr instance?
>
> Am I correct.If not,please help me
>
> Thanks in advance
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/core-Collection-Shard-Replication-tp4248850p4248865.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
-- 
Regards,
Binoy Dalal


Re: core,Collection,Shard,Replication

2016-01-06 Thread vidya
Hi 
You described that sharding is to distribute data over multiple machines.Do
I have to make sure that all those machines have solr server or gateway
should be deplyed ?
And what multiple JVM processes run behind a solr server running?
I wanted to know what is a node. -> I understood like a mchine with solr
server deployed.
then what is a solr instance?

Am I correct.If not,please help me

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/core-Collection-Shard-Replication-tp4248850p4248865.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Query behavior difference.

2016-01-06 Thread Emir Arnautovic

Hi Modassar,
It usually helps if you analyze extreme case: e.g. fl:a*
What terms should be better match? Those who are shorter or all should 
be equally good?
What should be top document? Assuming standard TF/IDF scoring is used, 
that would be one with the most terms that start with 'a' especially 
those that are not frequent in corpus. Calculating that could be 
expensive and irrelevant in most cases so constant score makes sense.


Thanks,
Emir

On 06.01.2016 12:07, Modassar Ather wrote:

Please help me understand why queries like wildcard, prefix and few others
are re-written into constant score query?
Why the scoring factors are not taken into consideration in such queries?

Please correct me if I am wrong that this behavior is per the query type
irrespective of the parser used.

Thanks,
Modassar

On Wed, Jan 6, 2016 at 12:56 PM, Modassar Ather 
wrote:


Thanks for your response Ahmet.

Best,
Modassar

On Mon, Jan 4, 2016 at 5:07 PM, Ahmet Arslan 
wrote:


Hi,

I think wildcard queries fl:networ* are re-written into Constant Score
Query.
fl=*,score should returns same score for all documents that are retrieved.

Ahmet



On Monday, January 4, 2016 12:22 PM, Modassar Ather <
modather1...@gmail.com> wrote:
Hi,

Kindly help me understand how will relevance ranking differ int following
searches.

query : fl:network
query : fl:networ*

What I am observing that the results returned are different in both of
them
in a way that the top documents returned for q=fl:network is not present
in
the top results of q=fl:networ*.
For example for q=fl:network I am getting top documents having around 20
occurrence of network whereas the top result of q=fl:networ* has only
couple of occurrence of network.
I am aware of the underlying normalization process participation in
relevance ranking of documents but not able to understand such a
difference
in the ranking of result for the queries.

Thanks,
Modassar





--
Monitoring * Alerting * Anomaly Detection * Centralized Log Management
Solr & Elasticsearch Support * http://sematext.com/



Re: core,Collection,Shard,Replication

2016-01-06 Thread Binoy Dalal
1) A collection is simply a logical group and can consist of multiple
cores. The core is a representation of a single physical index or part of
an index. Both cores and collections can be created in local as well as
cloud modes.
2) Sharding is performed to distribute your index over multiple machines
when the index becomes too big for one machine.
So if you have a 10TB index and 10 machines each with 1TB of disk space
then you'll divide your index into 10 shards and put those 10 shards on
your 10 machines.
"But when indexing a document in one shard,it gets reflected in every shard
of that collection"
This is what happens only logically, meaning you can query any one shard
for a doc that might be on another and you'll still get the proper results.
Physically, one doc is only present on one shard, which is determined by
the hash value of the doc id at index time.
3) The main purpose of replication is to provide redundancy. When you're
running solr on cloud mode with multiple shards and one of your shards goes
down, your entire cluster will stop responding. In such a case a replica
for such a shard will serve as a backup and take over the responsibilities
of that shard.
This will keep your app running.

What I've written above is a very coarse grained view of all these concepts.
You should take a look at the wiki pages to gain a better fuller
understanding of these concepts.

On Wed, 6 Jan 2016, 17:00 vidya  wrote:

> Hi
>
> I am new to solr. i have a doubt in understanding difference between core
> and collection.
> >As far as i understand, cores can be created when solr is run in local
> mode
> and collections in solrCloud.
> Can you please help me if i am wrong.
> >And why do we shard a collection? i read like -
> When your data is too large for one node, you can break it up and store it
> in sections by creating one or more shards. Each is a portion of the
> logical
> index, or core, and it's the set of all nodes containing that section of
> the
> index.
> But when indexing a document in one shard,it gets reflected in every shard
> of that collection. But main intention of creating shard is to break up the
> data.
> >Why do we replicate a collection?
>
> Thanks in advance
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/core-Collection-Shard-Replication-tp4248850.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
-- 
Regards,
Binoy Dalal


Re: Solr 6 Distributed Join

2016-01-06 Thread Akiel Ahmed
Hi Dennis/Joel,

Thank you for your help to date - I must say this user group is very 
responsive :-)

Cheers

Akiel



From:   Dennis Gove 
To: solr-user@lucene.apache.org
Date:   05/01/2016 13:22
Subject:Re: Solr 6 Distributed Join



Akiel,

https://issues.apache.org/jira/browse/SOLR-7554 added checks on the sort
with streams, where required. If a particular stream requires that 
incoming
streams be ordered in a compatible way then that check will be performed
during creation of the stream and an error will be thrown if that check
fails. This is only a check on the sorts of the incoming streams and
doesn't do any checks related to if expected fields are included in the
incoming streams. In your case, it'd have found the error and told you 
that
the streams aren't sorted in a compatible way.

- Dennis

On Tue, Jan 5, 2016 at 8:11 AM, Akiel Ahmed  wrote:

> Hi Joel,
>
> Sorry there was an error between my chair and keyboard; there isn't a 
bug
> - the right hand stream was not ordered by the joined-on field. So, the
> following query does what I expected:
>
>
> 
http://localhost:8983/solr/gettingstarted/stream?stream=innerJoin(search(gettingstarted

> ,fl="id",q=text:John,sort="id 
asc",zkHost="localhost:9983",qt="/export"),
> search(gettingstarted,fl="id,e1",q=type:DEF,sort="e1
> asc",zkHost="localhost:9983",qt="/export"), on="id=e1")
>
> Do you know if on the release of Solr 6, the stream handler will contain
> validation code which does a syntax check as well as checking if
> appropriate fields have been used in the fl and sort properties? For
> example, for the above query, I am joining the id field on the e1 field,
> so the id field needs to be in the fl and sort property of the left-hand
> stream, and e1 needs to be in the fl and sort property in the right-hand
> stream foe the join to work.
>
> Cheers
>
> Akiel
>
>
>
> From:   Joel Bernstein 
> To: solr-user@lucene.apache.org
> Date:   24/12/2015 15:51
> Subject:Re: Solr 6 Distributed Join
>
>
>
> I haven't had a chance to review. If you have a reproducible failure on 
a
> one-to-many join go ahead and create a jira ticket.
>
> Joel Bernstein
> http://joelsolr.blogspot.com/
>
> On Thu, Dec 24, 2015 at 3:25 AM, Akiel Ahmed  
wrote:
>
> > Hi
> >
> > Did you get a chance to check whether one-to-many joins were covered 
in
> > your tests? If yes, can you make any suggestions for what I could be
> doing
> > wrong?
> >
> > Cheers
> >
> > Akiel
> >
> >
> >
> > From:   Joel Bernstein 
> > To: solr-user@lucene.apache.org
> > Date:   22/12/2015 13:03
> > Subject:Re: Solr 6 Distributed Join
> >
> >
> >
> > Just did a quick review of the InnerJoinStream and it appears that it
> > should handle one-to-one, one-to-many, many-to-one and many-to-many
> joins.
> > It will take a closer review of the tests to see if all these cases 
are
> > covered. So the innerJoin is designed to handle the case you describe.
> If
> > it doesn't work properly it makes sense to file a bug report.
> >
> > Joel Bernstein
> > http://joelsolr.blogspot.com/
> >
> > On Tue, Dec 22, 2015 at 5:55 AM, Akiel Ahmed 
> wrote:
> >
> > > Hi,
> > >
> > > I tried a straight forward join against something that is connected 
to
> > > many things but didn't get the results I expected - I wanted to 
check
> > > whether my expectations are off, and whether I can do anything in 
Solr
> > to
> > > do what I want. So given the data:
> > >
> > > id,type,e1,e2,text
> > > 1,ABC,,,John Smith
> > > 2,ABC,,,Jane Doe
> > > 3,DEF,1,2,1
> > > 4,DEF,1,2,2
> > > 5,DEF,1,2,4
> > > 6,DEF,1,2,8
> > >
> > > and the query
> > >
> > >
> > >
> >
> >
>
> 
http://localhost:8983/solr/gettingstarted/stream?stream=innerJoin(search(gettingstarted

>
> >
> > > , fl="id", q=text:John, sort="id
> > > asc",zkHost="localhost:9983",qt="/export"), search(gettingstarted,
> > > fl="id,e1", q=type:DEF, sort="id
> > > asc",zkHost="localhost:9983",qt="/export"), on="id=e1")
> > >
> > > I expected
> > >
> > > {"result-set":{"docs":[
> > > {"e1":"1","id":"3"},
> > > {"e1":"1","id":"4"},
> > > {"e1":"1","id":"5"},
> > > {"e1":"1","id":"6"},
> > > {"EOF":true,"RESPONSE_TIME":56}]}}
> > >
> > > but instead I got
> > >
> > > {"result-set":{"docs":[
> > > {"e1":"1","id":"3"},
> > > {"EOF":true,"RESPONSE_TIME":58}]}}
> > >
> > > Deleting the document with id 3, and rerunning the query (see above)
> > > returned
> > >
> > > {"result-set":{"docs":[
> > > {"e1":"1","id":"4"},
> > > {"EOF":true,"RESPONSE_TIME":56}]}}
> > >
> > > So it looks like the join finds the first thing to join on. Is this
> > > expected behaviour? If so, is there anyway I can do to convince Solr
> to
> > > return all the things it is connected to?
> > >
> > > Cheers
> > >
> > > Akiel
> > > - Forwarded by Akiel Ahmed/UK/IBM on 22/12/2015 10:47 -
> > >
> > > From:   Akiel Ahmed/UK/IBM
> > > To: solr-user@lucene.apache.org
> > > Date:   21/12/2015 11:16
> > > Subject:Re: Solr 6 Distributed Join
> > >
> > >
> > > Thank you for t

core,Collection,Shard,Replication

2016-01-06 Thread vidya
Hi

I am new to solr. i have a doubt in understanding difference between core
and collection.
>As far as i understand, cores can be created when solr is run in local mode
and collections in solrCloud.
Can you please help me if i am wrong.
>And why do we shard a collection? i read like -
When your data is too large for one node, you can break it up and store it
in sections by creating one or more shards. Each is a portion of the logical
index, or core, and it's the set of all nodes containing that section of the
index.
But when indexing a document in one shard,it gets reflected in every shard
of that collection. But main intention of creating shard is to break up the
data.
>Why do we replicate a collection?

Thanks in advance



--
View this message in context: 
http://lucene.472066.n3.nabble.com/core-Collection-Shard-Replication-tp4248850.html
Sent from the Solr - User mailing list archive at Nabble.com.


Cleanup solr cloud after failure in collection creation

2016-01-06 Thread Gian Maria Ricci - aka Alkampfer
I've issued a command to create some collections, but there were an error in
solrconfig.xml (I've specified wrong path to dataimporthandler.jar files).
The creation of the collection failed but now I don't know how to cleanup
everything.

 

This is a test solrcloud where I'm experimenting in windows environment. 

 

Here is what I see in the UI. I've also checked disk and all directories for
various shards are there. I cannot use the DELETE collection command because
the collection was not really created, but I need to know what is the
correct way to cleanup the situation.

 



 

 

Here is the disk, where all the directories are present

 



--
Gian Maria Ricci
Cell: +39 320 0136949

 

   


 



Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina

yes id value is unique in C1 and unique in C2.
id in C1 is never present in C2
id in C2 is never present in C1

Le 06/01/2016 11:12, Binoy Dalal a écrit :

Are Id values for docs in both the collections exactly same?
To get proper results, the ids should be unique across both the cores.

On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:


Hi All,

Solr 5.4, Ubuntu

I thought it was simple to request across two collections with the same
schema but not.
I have one solr instance launch. 300 000 records in each collection.

I try to use this request without having both results:

http://my_adress:my_port
/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json

this request returns only C1 results and if I do:

http://my_adress:my_port
/solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json

it returns only C2 results.

I have 5 identical fields on both collection
id, fid, st, cc, timestamp
where id is the unique key field.

Can someone could explain me why it doesn't work ?

Thanks a lot !
Bruno

---
L'absence de virus dans ce courrier électronique a été vérifiée par le
logiciel antivirus Avast.
http://www.avast.com

--

Regards,
Binoy Dalal




---
L'absence de virus dans ce courrier électronique a été vérifiée par le logiciel 
antivirus Avast.
http://www.avast.com



Re: Query behavior difference.

2016-01-06 Thread Modassar Ather
Please help me understand why queries like wildcard, prefix and few others
are re-written into constant score query?
Why the scoring factors are not taken into consideration in such queries?

Please correct me if I am wrong that this behavior is per the query type
irrespective of the parser used.

Thanks,
Modassar

On Wed, Jan 6, 2016 at 12:56 PM, Modassar Ather 
wrote:

> Thanks for your response Ahmet.
>
> Best,
> Modassar
>
> On Mon, Jan 4, 2016 at 5:07 PM, Ahmet Arslan 
> wrote:
>
>> Hi,
>>
>> I think wildcard queries fl:networ* are re-written into Constant Score
>> Query.
>> fl=*,score should returns same score for all documents that are retrieved.
>>
>> Ahmet
>>
>>
>>
>> On Monday, January 4, 2016 12:22 PM, Modassar Ather <
>> modather1...@gmail.com> wrote:
>> Hi,
>>
>> Kindly help me understand how will relevance ranking differ int following
>> searches.
>>
>> query : fl:network
>> query : fl:networ*
>>
>> What I am observing that the results returned are different in both of
>> them
>> in a way that the top documents returned for q=fl:network is not present
>> in
>> the top results of q=fl:networ*.
>> For example for q=fl:network I am getting top documents having around 20
>> occurrence of network whereas the top result of q=fl:networ* has only
>> couple of occurrence of network.
>> I am aware of the underlying normalization process participation in
>> relevance ranking of documents but not able to understand such a
>> difference
>> in the ranking of result for the queries.
>>
>> Thanks,
>> Modassar
>>
>
>


Re: Many patterns against many sentences, storing all results

2016-01-06 Thread Charlie Hull

On 05/01/2016 16:05, Allison, Timothy B. wrote:

Might want to look into:

https://github.com/flaxsearch/luwak


Yes, this sounds like a very good fit for Luwak. We built it originally 
for media monitoring applications where one also needs just a hit/no-hit 
result. It's running in production at much larger scale than this.


Best

Charlie



or
  https://github.com/OpenSextant/SolrTextTagger

-Original Message-
From: Will Moy [mailto:w...@fullfact.org]
Sent: Tuesday, January 05, 2016 11:02 AM
To: solr-user@lucene.apache.org
Subject: Many patterns against many sentences, storing all results

Hello

Please may I have your advice as to whether Solr is a good tool for this job?

We have (per year) –
Up to 50,000,000 sentences
And about 5,000 search patterns (i.e. queries)

Our task is to identify all matches between any sentence and any search pattern.

That list of detections must be kept up to date as patterns are added or 
updated (a handful an hour), and as new sentences are added.

Some of the sentences will be added in real time, at probably max 100 / second 
and usually much less. The detections on these should be provided within 3 
seconds.

It's an unusual application in that we want all results in an external DB, and 
also in that every sentence is either a hit or not. we don't care about scoring 
results, only about matches for the exact search pattern entered.

The application is automatically detecting instances of factchecked statements.

The smaller-scale prototype was done with postgres full text searching, but 
that can't do exact phrase matching or other more sophisticated searches, so 
it's out.

Thanks very much

Will




--
Charlie Hull
Flax - Open Source Enterprise Search

tel/fax: +44 (0)8700 118334
mobile:  +44 (0)7767 825828
web: www.flax.co.uk


Re: solr 5.2.0 need to build high query response

2016-01-06 Thread Novin Novin
Thanks Erick, this listener doing quite a good job. But not what  I needed.
Do the solr has any other things that I can look into to make it faster.
FYI  speed goes to 1 sec to 1.2 sec. I actually needed around 500 ms.

On Tue, 5 Jan 2016 at 18:24 Erick Erickson  wrote:

> Yep. Do note what's happening here. You're executing a query
> that potentially takes 10 seconds to execute (based on your
> earlier post). But you may be opening a new searcher every
> 2 seconds. You may start to see "too many on deck searchers"
> in your log. If you do do _not_ try to "fix" this by upping the
> maxWarmingSearchers in solrconfig.xml, that's really an
> anti-pattern.
>
> Really, I'd consider relaxing this 2 second limit. I've often found
> it easier to tell users "it may take up to 30 seconds for newly-added
> docs to appear in search results" than try to satisfy overly-tight
> requirements.
>
> As a former co-worker often said, "Users are much more comfortable
> with predictable delays than unpredictable ones". It's surprising how
> often it's the case.
>
> Best,
> Erick
>
> P.S. What's the difference between newSearcher and firstSearcher?
> newSearcher is fired every time a commit (soft or hard with
> openSearcher=true)
> where firstSearcher is fired up only when Solr starts. This is to
> accommodate
> the fact that the autowarm counts on things like filterCacher aren't
> available when Solr starts. In practice, though, many (most?) people
> put the same query in both.
>
> On Tue, Jan 5, 2016 at 9:17 AM, Novin Novin  wrote:
> > If I'm correct, you are talking about this
> >
> > 
> > 
> > 
> > 
> > 
> > 
> > 
> > 
> >
> > *or may be here too.*
> >
> > static firstSearcher warming in
> > solrconfig.xml
> > 
> > 
> > 
> >
> > Thanks,
> > Novin
> >
> > On Tue, 5 Jan 2016 at 16:22 Erick Erickson 
> wrote:
> >
> >> It sounds like you're not doing proper autowarming,
> >> which you'd need to do either with hard or
> >> soft commits that open new searchers.
> >>
> >> see:
> >> https://wiki.apache.org/solr/SolrCaching#Cache_Warming_and_Autowarming
> >>
> >> In particular, you should have a newSearcher event
> >> that facets on the fields you expect to need.
> >>
> >> Best,
> >> Erick
> >>
> >> On Tue, Jan 5, 2016 at 8:17 AM, Novin Novin 
> wrote:
> >> > Thanks David. It is quite good to use for NRT.
> >> >
> >> > Apologies, I didn't mention that facet search is really slow.
> >> >
> >> > I found the below reason which could be the reason because I am using
> >> facet
> >> > spatial search which is getting slow.
> >> >
> >> > To know more about solr hard and soft commits, have a look at this
> blog :
> >> >
> >>
> https://lucidworks.com/blog/2013/08/23/understanding-transaction-logs-softcommit-and-commit-in-sorlcloud/
> >> >
> >> > In this article, "soft commits are that they will make documents
> visible,
> >> > but at some cost. In particular the “top level” caches, which include
> >> what
> >> > you configure in solrconfig.xml (filterCache, queryResultCache, etc)
> will
> >> > be invalidated! Autowarming will be performed on your top level caches
> >> > (e.g. filterCache, queryResultCache), and any newSearcher queries
> will be
> >> > executed. Also, the FieldValueCache is invalidated, so facet queries
> will
> >> > have to wait until the cache is refreshed."
> >> >
> >> > Do you have any idea what could possible be do about this?
> >> >
> >> >
> >> >
> >> > On Tue, 5 Jan 2016 at 12:31 davidphilip cherian <
> >> > davidphilipcher...@gmail.com> wrote:
> >> >
> >> >> You should use solr softcommit for this use case. So, by setting
> >> softcommit
> >> >> to 5 seconds and autoCommit to minute with openSearcher=false should
> do
> >> the
> >> >> work.
> >> >>
> >> >>  
> >> >>  6
> >> >> false
> >> >>  
> >> >>
> >> >> 
> >> >> 2000
> >> >> 
> >> >>
> >> >> Reference link-
> >> >>
> >>
> https://cwiki.apache.org/confluence/display/solr/Near+Real+Time+Searching
> >> >>
> >> >> To know more about solr hard and soft commits, have a look at this
> blog
> >> :
> >> >>
> >> >>
> >>
> https://lucidworks.com/blog/2013/08/23/understanding-transaction-logs-softcommit-and-commit-in-sorlcloud/
> >> >>
> >> >> On Tue, Jan 5, 2016 at 5:44 PM, Novin Novin 
> >> wrote:
> >> >>
> >> >> > Hi guys,
> >> >> >
> >> >> > I'm having trouble to figure what would be idle solr config for
> where:
> >> >> >
> >> >> > I'm doing hard commit in every minute   for very few number of
> users
> >> >> > because I have to show those docs in search results quickly when
> user
> >> >> save
> >> >> > the changes.
> >> >> >
> >> >> > It is causing the response in around  2 secs to show even I am
> getting
> >> >> only
> >> >> > 10 records.
> >> >> >
> >> >> > Could you able to give some idea where to look at.
> >> >> >
> >> >> >
> >> >> > Thanks in advance,
> >> >> > Novin
> >> >> >
> >> >>
> >>
>


Re: Newbie: Searching across 2 collections ?

2016-01-06 Thread Binoy Dalal
Are Id values for docs in both the collections exactly same?
To get proper results, the ids should be unique across both the cores.

On Wed, 6 Jan 2016, 15:11 Bruno Mannina  wrote:

> Hi All,
>
> Solr 5.4, Ubuntu
>
> I thought it was simple to request across two collections with the same
> schema but not.
> I have one solr instance launch. 300 000 records in each collection.
>
> I try to use this request without having both results:
>
> http://my_adress:my_port
> /solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json
>
> this request returns only C1 results and if I do:
>
> http://my_adress:my_port
> /solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json
>
> it returns only C2 results.
>
> I have 5 identical fields on both collection
> id, fid, st, cc, timestamp
> where id is the unique key field.
>
> Can someone could explain me why it doesn't work ?
>
> Thanks a lot !
> Bruno
>
> ---
> L'absence de virus dans ce courrier électronique a été vérifiée par le
> logiciel antivirus Avast.
> http://www.avast.com
>
> --
Regards,
Binoy Dalal


Newbie: Searching across 2 collections ?

2016-01-06 Thread Bruno Mannina

Hi All,

Solr 5.4, Ubuntu

I thought it was simple to request across two collections with the same
schema but not.
I have one solr instance launch. 300 000 records in each collection.

I try to use this request without having both results:

http://my_adress:my_port/solr/C1/select?collection=C1,C2&q=fid:34520196&wt=json

this request returns only C1 results and if I do:

http://my_adress:my_port/solr/C2/select?collection=C1,C2&q=fid:34520196&wt=json

it returns only C2 results.

I have 5 identical fields on both collection
id, fid, st, cc, timestamp
where id is the unique key field.

Can someone could explain me why it doesn't work ?

Thanks a lot !
Bruno

---
L'absence de virus dans ce courrier électronique a été vérifiée par le logiciel 
antivirus Avast.
http://www.avast.com