[jira] [Commented] (RYA-307) MongoDB Bulk Load methods should use Secondary Indexer Bulk Loading

2017-08-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/RYA-307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16113809#comment-16113809
 ] 

ASF GitHub Bot commented on RYA-307:


GitHub user asfgit closed the pull request at:

https://github.com/apache/incubator-rya/pull/181


> MongoDB Bulk Load methods should use Secondary Indexer Bulk Loading
> ---
>
> Key: RYA-307
> URL: https://issues.apache.org/jira/browse/RYA-307
> Project: Rya
> Issue Type: Improvement
> Components: dao
> Reporter: Aaron Mihalik
> Assignee: Eric White
>
> The MongoDB secondary indexers *really* slow down inserts via the bulk load 
> methods for the DAO.  The DAO should use the bulk load methods on the 
> secondary indexers.  Here is where the call is made in the DAO [1].
> [1] https://github.com/apache/incubator-rya/blob/master/dao/mongodb.rya/src/main/java/org/apache/rya/mongodb/MongoDBRyaDAO.java#L158
> Some version of a BatchWriter should be created for Mongo that is used in the 
> DAO and any secondary indexers. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (RYA-307) MongoDB Bulk Load methods should use Secondary Indexer Bulk Loading

2017-07-21 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/RYA-307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16096897#comment-16096897
 ] 

ASF GitHub Bot commented on RYA-307:


GitHub user ejwhite922 opened a pull request:

https://github.com/apache/incubator-rya/pull/181

Rya-307 MongoDB Rya DAO Batch Writer

## Description
Improved Rya MongoDB ingest of statements through the Sail Layer and Rya 
DAO by queueing up multiple inserts so that they can be written as a single 
batch. If a batch has not been written by the time a set time limit elapses, 
its statements are flushed to the datastore. The size of the batch and the 
time limit are configurable.
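
The batching behavior described above can be illustrated with a minimal sketch. This is not the class added by this pull request: `BatchingWriter`, its `sink` callback, and the `batchSize` and `flushIntervalMillis` parameters are hypothetical names chosen for illustration, and the periodic timer is a simplification of the time-limit flush. In a MongoDB-backed version the sink would presumably issue one bulk insert per batch.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;

/** Hypothetical sketch of a size- and time-bounded batch writer. */
class BatchingWriter<T> implements AutoCloseable {
    private final int batchSize;
    private final Consumer<List<T>> sink; // assumed callback that writes one batch
    private final List<T> pending = new ArrayList<>();
    private final ScheduledExecutorService timer;

    BatchingWriter(int batchSize, long flushIntervalMillis, Consumer<List<T>> sink) {
        if (batchSize < 1 || flushIntervalMillis < 1) {
            throw new IllegalArgumentException(
                "batchSize and flushIntervalMillis must be positive");
        }
        this.batchSize = batchSize;
        this.sink = sink;
        // Daemon thread so an unclosed writer does not keep the JVM alive.
        this.timer = Executors.newSingleThreadScheduledExecutor(r -> {
            Thread t = new Thread(r, "batch-flush-timer");
            t.setDaemon(true);
            return t;
        });
        // Simplification: flush on a fixed period rather than tracking the age
        // of each batch. If the sink throws, later timer runs are suppressed.
        this.timer.scheduleAtFixedRate(this::flush,
            flushIntervalMillis, flushIntervalMillis, TimeUnit.MILLISECONDS);
    }

    /** Queues one item and writes the batch as soon as it is full. */
    synchronized void add(T item) {
        pending.add(item);
        if (pending.size() >= batchSize) {
            flush();
        }
    }

    /** Writes all queued items as a single batch; does nothing when empty. */
    synchronized void flush() {
        if (pending.isEmpty()) {
            return;
        }
        List<T> batch = new ArrayList<>(pending);
        pending.clear();
        sink.accept(batch);
    }

    /** Stops the timer and writes whatever is still queued. */
    @Override
    public synchronized void close() {
        timer.shutdownNow();
        flush();
    }
}
```

With a batch size of 3, adding three items triggers one call to the sink; a fourth item stays queued until the timer fires or `close()` is called. The sketch holds its lock while the sink runs, which keeps the example short but blocks concurrent `add` calls during a write.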

### Tests
Unit Tests/Integration Tests

### Links
[Jira](https://issues.apache.org/jira/browse/RYA-307)

### Checklist
- [ ] Code Review
- [ ] Squash Commits

### People To Review
@amihalik 
@meiercaleb 
@DLotts
@jessehatfield 
@isper3at 


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/ejwhite922/incubator-rya RYA-307_MongoIngest

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/incubator-rya/pull/181.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #181


commit 08596dd004c5f35867c7a6780a800d2387e05676
Author: eric.white 
Date:   2017-07-19T13:08:32Z

RYA-307 Improved Rya MongoDB ingest of statements through the Sail Layer 
and Rya DAO by queueing up multiple inserts at a time so can be written as a 
single batch.  If no statements in the batch have been written after a set time 
limit then they are flushed out into the datastore.  The size of the batch and 
the time limit are configurable.

commit ec0ccc4ce1cc329edd02cfc5d09d543447ca59af
Author: eric.white 
Date:   2017-07-20T20:22:47Z

Rya-307 Commit #2. Added config options for flushing MongoDB batch writer.

commit 154e589a082b91c4556079be4fabd55d37480360
Author: eric.white 
Date:   2017-07-21T21:15:19Z

RYA_307 Commit #3. Fixed integration tests. Made BatchWriter compatible 
with MongoCollection.



