Solr Hit Highlighting

2016-10-24 Thread Al Hudson
Hello All,

I’m new to the world of Solr and hoping someone on this list can help me hit 
highlighting in solr.

I am trying to set up a hit highlighting in Solr and have been seeing some 
strange issues.

My core.xml file has a single tag   which houses all the 
text in a document.

Using the Solr web interface I submit the following query : What is milk? – I 
get back many answers and in addition, just by selecting the hl box and 
entering ‘content’ in the hl.fl box I get hit highlighted portions of text.

However things stop working when I change the query to : What is lactose 
intolerance? I still get valid results but the highlighting section is full of 
empty arrays.

I’ve tried different combinations of commenting out the copyField, making 
content multivalued, but to be honest I’m trying things and hoping some 
configuration will work.

   











Can someone help?

Thank you,
Al


Sent from Mail for Windows 10



Re: OOM Error

2016-10-24 Thread Susheel Kumar
Thanks, Pushkar. The Solr was already killed by OOM script so i believe we
can't get heap dump.

Hi Shawn, I used Solr service scripts to launch Solr and it looks like
bin/solr doesn't include by default the below JVM parameter.

"-XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=/path/to/the/dump"

Is that something we should add to the Solr launch scripts to have it
included or may be at least in disabled (comment) mode?

Thanks,
Susheel

On Mon, Oct 24, 2016 at 8:20 PM, Shawn Heisey  wrote:

> On 10/24/2016 4:27 PM, Susheel Kumar wrote:
> > I am seeing OOM script killed solr (solr 6.0.0) on couple of our VM's
> > today. So far our solr cluster has been running fine but suddenly today
> > many of the VM's Solr instance got killed. I had 8G of heap allocated on
> 64
> > GB machines with 20+ GB of index size on each shards.
> >
> > What could be looked to find the exact root cause. I am suspecting of any
> > query (wildcard prefix query etc.) might have caused this issue.  The
> > ingestion and query load looks normal as other days.  I have the solr GC
> > logs as well.
>
> It is unlikely that you will be able to figure out exactly what is using
> too much memory from Solr logs.  The place where the OOM happens may be
> completely unrelated to the parts of the system that are using large
> amounts of memory.  That point is just the place where Java ran out of
> memory to allocate, which could happen when allocating a tiny amount of
> memory just as easily as it could happen when allocating a large amount
> of memory.
>
> What I can tell you has been placed on this wiki page:
>
> https://wiki.apache.org/solr/SolrPerformanceProblems#Java_Heap
>
> Thanks,
> Shawn
>
>


Re: Related Search

2016-10-24 Thread Erick Erickson
Rick:

The priority isn't particularly helpful for two reasons:

1> it's the default so often gets set without intent.
2> what the originator thinks of as major may or may not translate
into someone actually doing work on it.

In this case there's a lot of work that'd need to be done. "some
model" just begs for clarification. In this case it might be a major
feature, but nobody's felt the need or had the time to put into making
it a reality. This is really just a topic for conversation at this
point

Best,
Erick

On Mon, Oct 24, 2016 at 5:32 PM, Rick Leir  wrote:
> Hi all,
>
> There is an issue 'Create a Related Search Component' which has been open
> for some years now.
>
> It has a priority: major.
>
> https://issues.apache.org/jira/browse/SOLR-2080
>
>
> I discovered it linked from Lucidwork's very useful blog on ecommerce:
>
> https://lucidworks.com/blog/2011/01/25/implementing-the-ecommerce-checklist-with-apache-solr-and-lucidworks/
>
>
> Did people find a better way to accomplish Related Search? Perhaps MLT
> http://wiki.apache.org/solr/MoreLikeThis ?
>
> cheers -- Rick
>
>


Related Search

2016-10-24 Thread Rick Leir

Hi all,

There is an issue 'Create a Related Search Component' which has been 
open for some years now.


It has a priority: major.

https://issues.apache.org/jira/browse/SOLR-2080


I discovered it linked from Lucidwork's very useful blog on ecommerce:

https://lucidworks.com/blog/2011/01/25/implementing-the-ecommerce-checklist-with-apache-solr-and-lucidworks/


Did people find a better way to accomplish Related Search? Perhaps MLT 
http://wiki.apache.org/solr/MoreLikeThis ?


cheers -- Rick




Re: OOM Error

2016-10-24 Thread Shawn Heisey
On 10/24/2016 4:27 PM, Susheel Kumar wrote:
> I am seeing OOM script killed solr (solr 6.0.0) on couple of our VM's
> today. So far our solr cluster has been running fine but suddenly today
> many of the VM's Solr instance got killed. I had 8G of heap allocated on 64
> GB machines with 20+ GB of index size on each shards.
>
> What could be looked to find the exact root cause. I am suspecting of any
> query (wildcard prefix query etc.) might have caused this issue.  The
> ingestion and query load looks normal as other days.  I have the solr GC
> logs as well.

It is unlikely that you will be able to figure out exactly what is using
too much memory from Solr logs.  The place where the OOM happens may be
completely unrelated to the parts of the system that are using large
amounts of memory.  That point is just the place where Java ran out of
memory to allocate, which could happen when allocating a tiny amount of
memory just as easily as it could happen when allocating a large amount
of memory.

What I can tell you has been placed on this wiki page:

https://wiki.apache.org/solr/SolrPerformanceProblems#Java_Heap

Thanks,
Shawn



Re: OOM Error

2016-10-24 Thread Pushkar Raste
Did you look into the heap dump ?

On Mon, Oct 24, 2016 at 6:27 PM, Susheel Kumar 
wrote:

> Hello,
>
> I am seeing OOM script killed solr (solr 6.0.0) on couple of our VM's
> today. So far our solr cluster has been running fine but suddenly today
> many of the VM's Solr instance got killed. I had 8G of heap allocated on 64
> GB machines with 20+ GB of index size on each shards.
>
> What could be looked to find the exact root cause. I am suspecting of any
> query (wildcard prefix query etc.) might have caused this issue.  The
> ingestion and query load looks normal as other days.  I have the solr GC
> logs as well.
>
> Thanks,
> Susheel
>


OOM Error

2016-10-24 Thread Susheel Kumar
Hello,

I am seeing OOM script killed solr (solr 6.0.0) on couple of our VM's
today. So far our solr cluster has been running fine but suddenly today
many of the VM's Solr instance got killed. I had 8G of heap allocated on 64
GB machines with 20+ GB of index size on each shards.

What could be looked to find the exact root cause. I am suspecting of any
query (wildcard prefix query etc.) might have caused this issue.  The
ingestion and query load looks normal as other days.  I have the solr GC
logs as well.

Thanks,
Susheel


RE: CachedSqlEntityProcessor with delta-import

2016-10-24 Thread Mohan, Sowmya
Thanks James. That's what I was using before. But I also wanted to perform 
deletes using deletedPkQuery and hence switched to delta imports. The problem 
with using deletedPkQuery with the full import is that 
dataimporter.last_index_time is no longer accurate. 

Below is an example of my deletedPkQuery. If run the full-import for a 
differential index, that would update the last index time. Running the delta 
import to remove the deleted records then wouldn't do anything since nothing 
changed since the last index time. 


 deletedPkQuery="SELECT id
FROM content
WHERE active = 1 AND lastUpdate > 
'${dataimporter.last_index_time}'"






-Original Message-
From: Dyer, James [mailto:james.d...@ingramcontent.com] 
Sent: Friday, October 21, 2016 4:23 PM
To: solr-user@lucene.apache.org
Subject: RE: CachedSqlEntityProcessor with delta-import

Sowmya,

My memory is that the cache feature does not work with Delta Imports.  In fact, 
I believe that nearly all DIH features except straight JDBC imports do not work 
with Delta Imports.  My advice is to not use the Delta Import feature at all as 
the same result can (often more-efficiently) be accomplished following the 
approach outlined here: 
https://wiki.apache.org/solr/DataImportHandlerDeltaQueryViaFullImport

James Dyer
Ingram Content Group

-Original Message-
From: Mohan, Sowmya [mailto:sowmya.mo...@icf.com] 
Sent: Tuesday, October 18, 2016 10:07 AM
To: solr-user@lucene.apache.org
Subject: CachedSqlEntityProcessor with delta-import

Good morning,

Can CachedSqlEntityProcessor be used with delta-import? In my setup when 
running a delta-import with CachedSqlEntityProcessor, the child entity values 
are not correctly updated for the parent record. I am on Solr 4.3. Has anyone 
experienced this and if so how to resolve it?

Thanks,
Sowmya.



Re: Solr 6.0 Highlighting Not Working

2016-10-24 Thread jimtronic
Perhaps you need to wrap your inner "" and "" tags in the CDATA
structure?





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-6-0-Highlighting-Not-Working-tp4302787p4302835.html
Sent from the Solr - User mailing list archive at Nabble.com.


autoAddReplicas:true not working

2016-10-24 Thread Chetas Joshi
Hello,

I have the following configuration for the Solr cloud and a Solr collection
This is Solr on HDFS and Solr version I am using is 5.5.0

No. of hosts: 52 (Solr Cloud)

shard count:   50
replicationFactor:   1
MaxShardsPerNode: 1
autoAddReplicas:   true

Now, one of my shards is down. Although there are two hosts which are
available in my cloud on which a new replica could be created, it just does
not create a replica. All 52 hosts are healthy. What could be the reason
for this?

Thanks,

Chetas.


Solr Cloud A/B Deployment Issue

2016-10-24 Thread jimtronic
We are running into a timing issue when trying to do a scripted deployment of
our Solr Cloud cluster.

Scenario to reproduce (sometimes):

1. launch 3 clean solr nodes connected to zookeeper.
2. create a 1 shard collection with replicas on each node.
3. load data (more will make the problem worse)
4. launch 3 more nodes
5. add replicas to each new node
6. once entire cluster is healthy, start killing first three nodes.

Depending on the timing, the second three nodes end up all in RECOVERING
state without a leader.  

This appears to be happening because when the first leader dies, all the new
nodes go into full replication recovery and if all the old boxes happen to
die during that state, the boxes are stuck. The boxes cannot serve requests
and they eventually (1-8 hours) go into RECOVERY_FAILED state. 

This state is easy to fix with a FORCELEADER call to the collections API,
but that's only remediation, not prevention.

My question is this: Why do the new nodes have to go into full replication
recovery when they are already up to date? I just added the replica, so it
shouldn't have to a new full replication again.

Jim




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-Cloud-A-B-Deployment-Issue-tp4302810.html
Sent from the Solr - User mailing list archive at Nabble.com.


Solr Job opportunity - Noida, India

2016-10-24 Thread Aman Tandon
Hi Everyone,

If anyone is interested to apply for noida, India location for Solr
Developer position, then please forward me your resume with the contact
number and email.

*Company Name: Genpact Headstrong Capital Markey*
*Experience required:- 3 - 7 years*

With Regards
Aman Tandon


Solr 6.0 Highlighting Not Working

2016-10-24 Thread Teague James
Can someone please help me troubleshoot my Solr 6.0 highlighting issue? I
have a production Solr 4.9.0 unit configured to highlight responses and it
has worked for a long time now without issues. I have recently been testing
Solr 6.0 and have been unable to get highlighting to work. I used my 4.9
configuration as a guide when configuring my 6.0 machine. Here are the
primary configs:

solrconfig.xml
In my  query requestHandler I have the following:
on
text
html
b
/b

It is worth noting here that the documentation in the wiki says
hl.simple.pre and hl.simple.post both accept the following:


Using this config in 6.0 causes the core to malfunction at startup throwing
an error that essentially says that an XML statement was not closed. I had
to add the escaped characters just to get the solrconfig to load! Why? That
isn't documented anywhere I looked. It makes me wonder if this is the source
of the problems with highlighting since it works in my 4.9 implementation
without escaping. Is there something wrong with 6's ability to parse XML?

I upload documents using cURL:
curl http://localhost:8983/solr/[CORENAME]/update?commit=true -H
"Content-Type:text/xml" --data-binary '7518TEST02. This is the second
test.'

When I search using a browser:
http://50.16.13.37:8983/solr/pp/query?indent=true=TEST04=xml

The response I get is:
 
7518  TEST02. This is the
second test.



TEST02. This is the second test.


1548827202660859904
2.2499826






Note that nothing appears in the highlight section. Why?

Any help would be appreciated - thanks!

-Teague



Re: Using nested subdocuments in SOLR

2016-10-24 Thread Erick Erickson
1> I'd add a field "type" (or something) that had some way to
distinguish the two records. Consider "product" and "company" for the
two field values. As Alexandre said, internally these are _separate_
records.

2> Each record needs a unique ID ( in your schema). If you
don't, subsequent records will replace previous ones.

You asked about nested documents and that's what Alexandre gave you,
but have you consider denormalizing the data? That is, store the
company info with each separate product? If possible that's usually
preferred.

Here are some references:
http://yonik.com/solr-nested-objects/
https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Index+Handlers#UploadingDatawithIndexHandlers-NestedChildDocuments

Best,
Erick

On Mon, Oct 24, 2016 at 6:05 AM, Preeti Bhat  wrote:
> HI Alexandre,
>
> I have below questions.
> 1) need to tag your parent/child documents with document type-> What do we 
> mean by this? Should this be done in the schema.xml /managed-schema.
> 2) When we say individual id's what do we mean? I am asking this because, I 
> might have the child ids and parent ids as same. For example, the Company 
> Link Id could be 1 and so could be id=1
>
>
> Thanks and Regards,
> Preeti Bhat
>
> -Original Message-
> From: Alexandre Rafalovitch [mailto:arafa...@gmail.com]
> Sent: Monday, October 24, 2016 6:06 PM
> To: solr-user
> Subject: Re: Using nested subdocuments in SOLR
>
> (tl;dr) Yes
>
> Solr supports nested documents, with the company link instances becoming 
> children records/documents. However, they are stored separately, so the 
> parent and child documents would be searched and return separately unless you 
> use appropriate queries, document transformers, etc.
>
> You'd also need to tag your parent/child documents with document type 
> (obviously, CompanyLink for child documents here) and assign individual IDs.
>
> SolrJ supports nested documents too.
>
> Regards,
>Alex.
> 
> Solr Example reading group is starting November 2016, join us at 
> http://j.mp/SolrERG Newsletter and resources for Solr beginners and 
> intermediates:
> http://www.solr-start.com/
>
>
> On 24 October 2016 at 05:46, Preeti Bhat  wrote:
>> HI All,
>>
>> I have a requirement which needs the documents to be stored in the below 
>> format. Could someone please advise on whether this is possible in SOLR?
>>
>> {
>> Id:1
>> Name:"ABC"
>> CompanyLink:[ {CompanyId:2, email:abc@dba.com} , {CompanyId:4,
>> email:abc@dbcc.com}
>> ]
>> }
>>
>> Is this achievable through SOLRJ? Also, is there any expected impact on 
>> performance of the search or index due to storage in this format?
>>
>>
>> Thanks and Regards,
>> Preeti Bhat
>>
>>
>>
>> NOTICE TO RECIPIENTS: This communication may contain confidential and/or 
>> privileged information. If you are not the intended recipient (or have 
>> received this communication in error) please notify the sender and 
>> it-supp...@shoregrp.com immediately, and destroy this communication. Any 
>> unauthorized copying, disclosure or distribution of the material in this 
>> communication is strictly forbidden. Any views or opinions presented in this 
>> email are solely those of the author and do not necessarily represent those 
>> of the company. Finally, the recipient should check this email and any 
>> attachments for the presence of viruses. The company accepts no liability 
>> for any damage caused by any virus transmitted by this email.
>>
>>
>
> NOTICE TO RECIPIENTS: This communication may contain confidential and/or 
> privileged information. If you are not the intended recipient (or have 
> received this communication in error) please notify the sender and 
> it-supp...@shoregrp.com immediately, and destroy this communication. Any 
> unauthorized copying, disclosure or distribution of the material in this 
> communication is strictly forbidden. Any views or opinions presented in this 
> email are solely those of the author and do not necessarily represent those 
> of the company. Finally, the recipient should check this email and any 
> attachments for the presence of viruses. The company accepts no liability for 
> any damage caused by any virus transmitted by this email.
>
>


RE: Using nested subdocuments in SOLR

2016-10-24 Thread Preeti Bhat
HI Alexandre,

I have below questions.
1) need to tag your parent/child documents with document type-> What do we mean 
by this? Should this be done in the schema.xml /managed-schema.
2) When we say individual id's what do we mean? I am asking this because, I 
might have the child ids and parent ids as same. For example, the Company Link 
Id could be 1 and so could be id=1


Thanks and Regards,
Preeti Bhat

-Original Message-
From: Alexandre Rafalovitch [mailto:arafa...@gmail.com]
Sent: Monday, October 24, 2016 6:06 PM
To: solr-user
Subject: Re: Using nested subdocuments in SOLR

(tl;dr) Yes

Solr supports nested documents, with the company link instances becoming 
children records/documents. However, they are stored separately, so the parent 
and child documents would be searched and return separately unless you use 
appropriate queries, document transformers, etc.

You'd also need to tag your parent/child documents with document type 
(obviously, CompanyLink for child documents here) and assign individual IDs.

SolrJ supports nested documents too.

Regards,
   Alex.

Solr Example reading group is starting November 2016, join us at 
http://j.mp/SolrERG Newsletter and resources for Solr beginners and 
intermediates:
http://www.solr-start.com/


On 24 October 2016 at 05:46, Preeti Bhat  wrote:
> HI All,
>
> I have a requirement which needs the documents to be stored in the below 
> format. Could someone please advise on whether this is possible in SOLR?
>
> {
> Id:1
> Name:"ABC"
> CompanyLink:[ {CompanyId:2, email:abc@dba.com} , {CompanyId:4,
> email:abc@dbcc.com}
> ]
> }
>
> Is this achievable through SOLRJ? Also, is there any expected impact on 
> performance of the search or index due to storage in this format?
>
>
> Thanks and Regards,
> Preeti Bhat
>
>
>
> NOTICE TO RECIPIENTS: This communication may contain confidential and/or 
> privileged information. If you are not the intended recipient (or have 
> received this communication in error) please notify the sender and 
> it-supp...@shoregrp.com immediately, and destroy this communication. Any 
> unauthorized copying, disclosure or distribution of the material in this 
> communication is strictly forbidden. Any views or opinions presented in this 
> email are solely those of the author and do not necessarily represent those 
> of the company. Finally, the recipient should check this email and any 
> attachments for the presence of viruses. The company accepts no liability for 
> any damage caused by any virus transmitted by this email.
>
>

NOTICE TO RECIPIENTS: This communication may contain confidential and/or 
privileged information. If you are not the intended recipient (or have received 
this communication in error) please notify the sender and 
it-supp...@shoregrp.com immediately, and destroy this communication. Any 
unauthorized copying, disclosure or distribution of the material in this 
communication is strictly forbidden. Any views or opinions presented in this 
email are solely those of the author and do not necessarily represent those of 
the company. Finally, the recipient should check this email and any attachments 
for the presence of viruses. The company accepts no liability for any damage 
caused by any virus transmitted by this email.




Re: Using nested subdocuments in SOLR

2016-10-24 Thread Alexandre Rafalovitch
(tl;dr) Yes

Solr supports nested documents, with the company link instances
becoming children records/documents. However, they are stored
separately, so the parent and child documents would be searched and
return separately unless you use appropriate queries, document
transformers, etc.

You'd also need to tag your parent/child documents with document type
(obviously, CompanyLink for child documents here) and assign
individual IDs.

SolrJ supports nested documents too.

Regards,
   Alex.

Solr Example reading group is starting November 2016, join us at
http://j.mp/SolrERG
Newsletter and resources for Solr beginners and intermediates:
http://www.solr-start.com/


On 24 October 2016 at 05:46, Preeti Bhat  wrote:
> HI All,
>
> I have a requirement which needs the documents to be stored in the below 
> format. Could someone please advise on whether this is possible in SOLR?
>
> {
> Id:1
> Name:"ABC"
> CompanyLink:[ {CompanyId:2, email:abc@dba.com}
> , {CompanyId:4, email:abc@dbcc.com}
> ]
> }
>
> Is this achievable through SOLRJ? Also, is there any expected impact on 
> performance of the search or index due to storage in this format?
>
>
> Thanks and Regards,
> Preeti Bhat
>
>
>
> NOTICE TO RECIPIENTS: This communication may contain confidential and/or 
> privileged information. If you are not the intended recipient (or have 
> received this communication in error) please notify the sender and 
> it-supp...@shoregrp.com immediately, and destroy this communication. Any 
> unauthorized copying, disclosure or distribution of the material in this 
> communication is strictly forbidden. Any views or opinions presented in this 
> email are solely those of the author and do not necessarily represent those 
> of the company. Finally, the recipient should check this email and any 
> attachments for the presence of viruses. The company accepts no liability for 
> any damage caused by any virus transmitted by this email.
>
>


RE: Can we query across collections in SOLR?

2016-10-24 Thread Preeti Bhat
Thanks every one for the insights, I am able to get some ideas from the links.


Thanks and Regards,
Preeti Bhat
Shore Group Associates LLC
(C) +91-996-644-8187
www.ShoreGroupAssociates.com

-Original Message-
From: Dennis Gove [mailto:dpg...@gmail.com]
Sent: Sunday, October 23, 2016 10:06 AM
To: solr-user@lucene.apache.org
Subject: Re: Can we query across collections in SOLR?

Take a look at Streaming Expressions in Solr 6.x - 
https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=61330338.
This supports cross-collection joining.

- Denns

On Fri, Oct 21, 2016 at 7:42 AM, Preeti Bhat 
wrote:

> Hi All,
>
>
>
> I have an requirement where in SQL we have two different sets of data
> like Company and Contact in SQL.
>
> We are planning to get this to SOLR, I wanted to know whether we can
> have two separate collections in SOLR and say have a link between them
> with say id of one collection or if there any better way to get this
> done. Can we query this data directly?
>
>
>
> I am searching by say below fields.
>
>
>
> ? Company ID  -> Get all the contacts linked to this?
>
>
>
> ? Contact ID-> Get all details of the company linked to this contact
> along with contact details.
>
>
>
> Could someone suggest if there is any expected impact on performance
> while indexing  or searching?
>
>
>
> Thanks and Regards,
>
> Preeti Bhat
>
>
>
>
>
>
> NOTICE TO RECIPIENTS: This communication may contain confidential
> and/or privileged information. If you are not the intended recipient
> (or have received this communication in error) please notify the
> sender and it-supp...@shoregrp.com immediately, and destroy this
> communication. Any unauthorized copying, disclosure or distribution of
> the material in this communication is strictly forbidden. Any views or
> opinions presented in this email are solely those of the author and do
> not necessarily represent those of the company. Finally, the recipient
> should check this email and any attachments for the presence of
> viruses. The company accepts no liability for any damage caused by any virus 
> transmitted by this email.
>
>
>

NOTICE TO RECIPIENTS: This communication may contain confidential and/or 
privileged information. If you are not the intended recipient (or have received 
this communication in error) please notify the sender and 
it-supp...@shoregrp.com immediately, and destroy this communication. Any 
unauthorized copying, disclosure or distribution of the material in this 
communication is strictly forbidden. Any views or opinions presented in this 
email are solely those of the author and do not necessarily represent those of 
the company. Finally, the recipient should check this email and any attachments 
for the presence of viruses. The company accepts no liability for any damage 
caused by any virus transmitted by this email.




Using nested subdocuments in SOLR

2016-10-24 Thread Preeti Bhat
HI All,

I have a requirement which needs the documents to be stored in the below 
format. Could someone please advise on whether this is possible in SOLR?

{
Id:1
Name:"ABC"
CompanyLink:[ {CompanyId:2, email:abc@dba.com}
, {CompanyId:4, email:abc@dbcc.com}
]
}

Is this achievable through SOLRJ? Also, is there any expected impact on 
performance of the search or index due to storage in this format?


Thanks and Regards,
Preeti Bhat



NOTICE TO RECIPIENTS: This communication may contain confidential and/or 
privileged information. If you are not the intended recipient (or have received 
this communication in error) please notify the sender and 
it-supp...@shoregrp.com immediately, and destroy this communication. Any 
unauthorized copying, disclosure or distribution of the material in this 
communication is strictly forbidden. Any views or opinions presented in this 
email are solely those of the author and do not necessarily represent those of 
the company. Finally, the recipient should check this email and any attachments 
for the presence of viruses. The company accepts no liability for any damage 
caused by any virus transmitted by this email.




Re: Transactions behaviour on Batch insert / update

2016-10-24 Thread Mikhail Khludnev
Hello Ganesh,
By default the half of batch fails.
https://lucene.apache.org/solr/6_2_1/solr-core/org/apache/solr/update/processor/TolerantUpdateProcessorFactory.html
can fail only certain problematic rows. FWIW, you can send delete for
uncommitted documents, it wipes them from ram buffer, that's how you can
rollback a failed batch.

On Mon, Oct 24, 2016 at 10:41 AM, Ganesh M  wrote:

> Hi all,
>
> We are planning to make use of batch update / insert of solr documents,
> with batch size of around 100 documents per batch.
>
> Bit curious on how transactions are maintained per batch. I do knew SOLR
> is not meant for transaction based, but want to know whether SOLR is
> designed to throw error even if one document in batch fails due to issue in
> data or invalid fields.
>
> In case of such errors, whether complete batch fails or only one specific
> document fails and we get error for that specific document.
>
> Pls let me know how SOLR behaves.
>
> Regards,
> Ganesh
>



-- 
Sincerely yours
Mikhail Khludnev


Transactions behaviour on Batch insert / update

2016-10-24 Thread Ganesh M
Hi all,

We are planning to make use of batch update / insert of solr documents, with 
batch size of around 100 documents per batch.

Bit curious on how transactions are maintained per batch. I do knew SOLR is not 
meant for transaction based, but want to know whether SOLR is designed to throw 
error even if one document in batch fails due to issue in data or invalid 
fields.

In case of such errors, whether complete batch fails or only one specific 
document fails and we get error for that specific document.

Pls let me know how SOLR behaves.

Regards,
Ganesh