Commit required after delete ?

2017-01-05 Thread Dorian Hoxha
Hello friends, Based on what I've read, I think "commit" isn't needed to make deletes active (like we do with index/update), right ? Since it just marks an in-memory deleted-id bitmap, right ? Thank You

error during running my code java.lang.VerifyError: Bad type on operand stack

2017-01-05 Thread gayathri...@tcs.com
Hi Im using solr 5.4.0 while running my code i get below eroor please suggest what has to be done public static void main(String[] args) throws SolrServerException, IOException { String urlString = "http://localhost:8983/solr/;;

Need help for this scenario

2017-01-05 Thread capgemini_india . shashi
Hello Team, I am looking for your valuable suggestions/solutions for the below scenario: > Scenario : When any user gives a request by giving the name of the filename.zip, then he wants the "filename.zip" zip file. > Description: *The data is a collection.zip where it consists of

Re: Subqueries

2017-01-05 Thread Mikhail Khludnev
Peter, Subquery should also log its' request. Can't you find it in log? On Fri, Jan 6, 2017 at 1:19 AM, Peter Matthew Eichman wrote: > Hello Mikhail, > > I put pcdm_members into the fl, and it is definitely stored. I tried adding > the logParamsList, but all I see in the log

Re: How to train the model using user clicks when use ltr(learning to rank) module?

2017-01-05 Thread Will Martin
In the Assemble training data part: the third column indicates the relative importance or relevance of that doc Could you please give more info about how to give a score based on what user clicks? Hi Jeffery, Give your questions more detail and there may be more feedback; just a suggestion.

Re: SolrCloud and LVM

2017-01-05 Thread Shawn Heisey
On 1/5/2017 3:12 PM, Chris Ulicny wrote: > Is there any known significant performance impact of running solrcloud with > lvm on linux? > > While migrating to solrcloud we don't have the storage capacity for our > expected final size, so we are planning on setting up the solrcloud > instances on a

Re: Subqueries

2017-01-05 Thread Peter Matthew Eichman
Hello Mikhail, I put pcdm_members into the fl, and it is definitely stored. I tried adding the logParamsList, but all I see in the log is 183866104 [qtp1778535015-14] INFO org.apache.solr.core.SolrCore – [fedora4] webapp=/solr path=/select params={q=id:"https://

SolrCloud and LVM

2017-01-05 Thread Chris Ulicny
Is there any known significant performance impact of running solrcloud with lvm on linux? While migrating to solrcloud we don't have the storage capacity for our expected final size, so we are planning on setting up the solrcloud instances on a logical volume that we can grow when hardware

Re: Subqueries

2017-01-05 Thread Mikhail Khludnev
Hello, Can you add pcdm_members into fl to make sure it's stored? Also please add the following param members.logParamsList=q,fl,rows,row.pcdm_members, and check logs then. On Thu, Jan 5, 2017 at 9:46 PM, Peter Matthew Eichman wrote: > Hello all, > > I am attempting to use a

Re: reuse a org.apache.lucene.search.Query in Solrj?

2017-01-05 Thread Mikhail Khludnev
If I've got you right, it's not possible. It's an obvious problem to pass Lucene Query through SolrJ API. On Thu, Jan 5, 2017 at 8:32 PM, xavier jmlucjav wrote: > Hi, > > I have a lucene Query (Boolean query with a bunch of possibly complex > spatial queries, even polygon

Re: How to train the model using user clicks when use ltr(learning to rank) module?

2017-01-05 Thread Jeffery Yuan
Thanks , Will Martin. I checked the pdf it's great. but seems not very useful for my question: How to train the model using user clicks when use ltr(learning to rank) module. I know the concept after reading these papers. But still not sure how to code them. -- View this message in context:

Re: Is there Solr limitation on size for document retrieval?

2017-01-05 Thread Erick Erickson
The problem is probably somewhere in the max allowed packet size you have configured between your client and server. Solr has no a-priori limit here (well, I think > 2B won't return). What is you symptom? Often the browser will sit there blank because it's taking forever to render. Try submitting

Is there Solr limitation on size for document retrieval?

2017-01-05 Thread Kaushik
Hello, Is there a limit on the size of a document that can be indexed and rendered by Solr? We use Solr 5.3.1 and while we are able to index a document of 40 mb size withouot any issue, we are unable to retrieve the indexed SolrDocument. Is there any configuration that we can use to spit out the

Facet date - autogap

2017-01-05 Thread sn00py
Is it possible to make an "autogap" for a daterange? I would like to send a query and depending on the daterange, the gap should be 1 Year 1 Month 1 Day depending on the date range of the results My only possibility i see at the moment ist to make a query to get first and last date and send

Re: Regarding /sql -- WHERE <> IS NULL and IS NOT NULL

2017-01-05 Thread Joel Bernstein
IS NULL and IS NOT NULL predicate are not currently supported. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Jan 5, 2017 at 2:05 PM, radha krishnan wrote: > Hi, > > solr version : 6.3 > > will WHERE <> IS NULL / IS NOT NULL work with the /sql handler > ? > >

Regarding /sql -- WHERE <> IS NULL and IS NOT NULL

2017-01-05 Thread radha krishnan
Hi, solr version : 6.3 will WHERE <> IS NULL / IS NOT NULL work with the /sql handler ? " select name from gettingstarted where name is not null " the above query is not returning any documents in the response even if there are documents with "name"defined Thanks, Radhakrishnan D

Re: Search for ISBN-like identifiers

2017-01-05 Thread Josh Lincoln
Sebastian, You may want to try adding autoGeneratePhraseQueries="true" to the fieldtype. With that setting, a query for 978-3-8052-5094-8 will behave just like "978 3 8052 5094 8" (with the quotes) A few notes about autoGeneratePhraseQueries a) it used to be set to true by default, but that was

Subqueries

2017-01-05 Thread Peter Matthew Eichman
Hello all, I am attempting to use a subquery to enrich a query with the titles of related objects. Each document in my index may have 1 or more pcdm_members and pcdm_related_objects fields, whose values are ids of other documents in the index. Those documents in turn have reciprocal

AW: Search for ISBN-like identifiers

2017-01-05 Thread Sebastian Riemer
Thank you very much for taking the time to help me! I'll definitely have a look at the link you've posted. @ShawnHeisey Thanks too for shedding light on the wildcard behaviour! Allow me one further question: - Assuming that I define a separate field for storing the ISBNs, using the awesome

Re: Search for ISBN-like identifiers

2017-01-05 Thread Erik Hatcher
Sebastian - There’s some precedent out there for ISBN’s. Bill Dueber and the UMICH/code4lib folks have done amazing work, check it out here - https://github.com/mlibrary/umich_solr_library_filters - Erik > On Jan 5, 2017,

Re: Search for ISBN-like identifiers

2017-01-05 Thread Shawn Heisey
On 1/5/2017 3:08 AM, Sebastian Riemer wrote: > I now face the problem, that searching for a book with > text:978-3-8052-5094-8* does not return the single result I expect. > However searching for text:9783805250948* instead returns a result. > Note, that I am adding a wildcard at the end

Re: Search for ISBN-like identifiers

2017-01-05 Thread Erick Erickson
bq: How does the left side correlate with the right side?... You've got it right, the left is the indexed and the right is the query bq: the contents I see In the column text represents the _stored_ value of the field text, right... Correct bq: ...are only the tokenized values stored for

reuse a org.apache.lucene.search.Query in Solrj?

2017-01-05 Thread xavier jmlucjav
Hi, I have a lucene Query (Boolean query with a bunch of possibly complex spatial queries, even polygon etc) that I am building for some MemoryIndex stuff. Now I need to add that same query to a Solr query (adding it to a bunch of other fq I am using). Is there a some way to piggyback the lucene

Re: Howto reload "all" cores?

2017-01-05 Thread Shawn Heisey
On 1/5/2017 6:16 AM, Clemens Wyss DEV wrote: > does http://localhost:8983/solr/admin/cores?action=RELOAD reload all > cores? No. It would complain that you didn't give it a core name. If you want to reload all cores, restart Solr ... or ask Solr for a list of cores, and reload each of them.

AW: Re: update/extract override ExtractTyp

2017-01-05 Thread sn00py
I am useing the Extract URL And Renamed the File to test.txtBut it is still Parsed with the XML ParserCan I force the txt Parser for all .txt Files?  Von meinem Samsung Gerät gesendet. Ursprüngliche Nachricht Von: Shawn Heisey Datum: 04.01.17

Re: SolrCloud different score for same document on different replicas.

2017-01-05 Thread Charlie Hull
On 05/01/2017 13:30, Morten Bøgeskov wrote: Hi. We've got a SolrCloud which is sharded and has a replication factor of 2. The 2 replicas of a shard may look like this: Num Docs:5401023 Max Doc:6388614 Deleted Docs:987591 Num Docs:5401023 Max Doc:5948122 Deleted Docs:

LineEntityProcessor | Separator --- /update/csv | OnError

2017-01-05 Thread Moenieb Davids
Hi, Just wanted to know if anybody can assist with the following scenario: I have a pipe delimited mainframe file\s that sometimes misses certain fields in a row, which obviously causes issues when I try the /update/csv handler. Scenario 1: The csv handler is quite fast, however, when it picks

StringIndexOutOfBoundsException "in" SpellCheckCollator.getCollation

2017-01-05 Thread Clemens Wyss DEV
I am seeing many exceptions like this in my Solr [5.4.1] log: null:java.lang.StringIndexOutOfBoundsException: String index out of range: -2 at java.lang.AbstractStringBuilder.replace(AbstractStringBuilder.java:824) at java.lang.StringBuilder.replace(StringBuilder.java:262)

RE: SolrCloud different score for same document on different replicas.

2017-01-05 Thread Markus Jelsma
Hello - you need a custom similarity and use docCount as divisor instead of maxDoc when calculating IDF. I believe this was fixed in some version but i'm not sure. Markus -Original message- > From:Morten Bøgeskov > Sent: Thursday 5th January 2017 14:33 > To:

SolrCloud different score for same document on different replicas.

2017-01-05 Thread Morten Bøgeskov
Hi. We've got a SolrCloud which is sharded and has a replication factor of 2. The 2 replicas of a shard may look like this: Num Docs:5401023 Max Doc:6388614 Deleted Docs:987591 Num Docs:5401023 Max Doc:5948122 Deleted Docs:547099 We've seen >10% difference in Max

Howto reload "all" cores?

2017-01-05 Thread Clemens Wyss DEV
does http://localhost:8983/solr/admin/cores?action=RELOAD reload all cores? Thx Clemens

Re: ClusterStateMutator

2017-01-05 Thread Hendrik Haddorp
The UI warning was quite easy to resolve. I'm currently testing Solr with HDFS but for some reason the core ended up on the local storage of the node. After a delete and restart the problem was gone. On 05.01.2017 12:42, Hendrik Haddorp wrote: Right, I had to do that multiple times already

Re: ClusterStateMutator

2017-01-05 Thread Hendrik Haddorp
Right, I had to do that multiple times already when I restarted nodes during collection creation. In such cases I was left with data in the clusterstate.json, which at least on 6.2.1, blocked further collection creations. Once manually deleted or set to {} collection creation worked again.

Solr json facet api

2017-01-05 Thread kshitij tyagi
Hi, We were earlier using solr 4.0 and now moved to solr 5.2: I am debugging queries and seeing that most of the time in queries are taken by solr facet queries. I have read about solr json facet api in solr 5 on wards, can anyone help me out to understand the difference between these both?

Search for ISBN-like identifiers

2017-01-05 Thread Sebastian Riemer
Hi folks, TL;DR: Is there an easy way, to copy ISBNs with hyphens to the general text field, respectively configure the analyser on that field, so that a search for the hyphenated ISBN returns exactly the matching document? Long version: I've defined a field "text" of type "text_general",

Re: SolrJ doesn't work with Json facet api

2017-01-05 Thread Sandeep Khanzode
For me, these variants have worked ... solrQuery.add("json.facet", "..."); solrQuery.setParam("json.facet", "...");   You get ... QueryResponse.getResponse().get("facets"); SRK On Thursday, January 5, 2017 1:19 PM, Jeffery Yuan wrote: Thanks for your response.

Re: Solr query *:* timeout

2017-01-05 Thread sn00py
Hmmm i have to check something it seems, that it's no error There are some zip files which are indexed, and on the admin page there are fetched all fields, including the contents ... and the zip document has a realy big content :O Zitat von sn0...@ulysses-erp.com: Hello - an hour ago,

Solr query *:* timeout

2017-01-05 Thread sn00py
Hello - an hour ago, solr worked fine i had about 2 documents in the index. I had made an upadte/extract process from the batch, and saw that on document has blocked the batch I waited fo about 2 minutes than i killed the update batch process. After a restart of the server, i started

Re: How to train the model using user clicks when use ltr(learning to rank) module?

2017-01-05 Thread Will Martin
http://www.dcc.fc.up.pt/~pribeiro/aulas/na1516/slides/na1516-slides-ir.pdf see the relevant sections for good info On 1/5/2017 3:02 AM, Jeffery Yuan wrote: > Thanks very much for integrating machine learning to Solr. >

How to train the model using user clicks when use ltr(learning to rank) module?

2017-01-05 Thread Jeffery Yuan
Thanks very much for integrating machine learning to Solr. https://github.com/apache/lucene-solr/blob/f62874e47a0c790b9e396f58ef6f14ea04e2280b/solr/contrib/ltr/README.md In the Assemble training data part: the third column indicates the relative importance or relevance of that doc Could you