[ 
https://issues.apache.org/jira/browse/ACCUMULO-931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13545949#comment-13545949
 ] 

Jeremy Kepner commented on ACCUMULO-931:
----------------------------------------

So the phenomenon appears to be that mutations can return control to the 
ingestor before the minor compactions they cause have completed.  Thus a single 
ingestor can cause multiple simultaneous compactions on the same tablet.  If 
the table has only one tablet, this causes the ingest process to back up (see 
file 1ingestor_1table_1tablet.pdf).  The solution is to presplit the table into 
several tablets.  The files 1ingestor_1table_2tablet.pdf and 
1ingestor_1table_4tablet.pdf show that these splits solve the problem.  The 
file 2,3,4ingestor_1table_4tablet.pdf shows the performance of three separate 
runs using 2, 3, and 4 ingestors into a table with 4 tablets.  Likewise, the 
file 10,r8,r10,r12ingestor_1table_12tablet.pdf shows four separate runs using 
10 local, 8 remote, 10 remote, and 12 remote ingestors.  In all cases, the 
splitting resolves the performance issue.
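
For anyone wanting to reproduce the presplit, one possible way to choose the 
split points is sketched below.  This is only an illustration and not the 
method used in the runs above: the class name SplitPoints, the method choose, 
and the sample keys are hypothetical, and it assumes a sorted sample of row 
keys that is representative of the data being ingested.  The sketch picks 
(tablets - 1) keys at evenly spaced positions in that sample.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, for illustration only: picks presplit points from a
// sorted sample of row keys so each tablet covers a similar share of the sample.
public class SplitPoints {

    // Returns at most (tablets - 1) strictly increasing split points taken
    // at evenly spaced positions of sortedSample.
    static List<String> choose(List<String> sortedSample, int tablets) {
        if (tablets < 1) {
            throw new IllegalArgumentException("tablets must be >= 1");
        }
        List<String> splits = new ArrayList<>();
        int n = sortedSample.size();
        for (int i = 1; i < tablets; i++) {
            int idx = (int) ((long) i * n / tablets);
            if (idx >= n) {
                break; // only reachable when the sample is empty
            }
            String candidate = sortedSample.get(idx);
            // Skip repeats so the split list stays strictly increasing.
            if (splits.isEmpty()
                    || !splits.get(splits.size() - 1).equals(candidate)) {
                splits.add(candidate);
            }
        }
        return splits;
    }

    public static void main(String[] args) {
        // Made-up sample keys; real row keys would come from the ingest data.
        List<String> sample =
                Arrays.asList("a", "c", "e", "g", "i", "k", "m", "o");
        System.out.println(choose(sample, 4)); // three split points, four tablets
    }
}
```

The resulting keys could then be wrapped as Hadoop Text values and handed to 
the Accumulo client's TableOperations.addSplits(tableName, splits) before 
ingest starts; that wiring is left out here so the sketch stays free of 
external dependencies.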

If there are no objections, this issue can be closed.
                
> Oscillations in Accumulo Ingest Performance
> -------------------------------------------
>
>                 Key: ACCUMULO-931
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-931
>             Project: Accumulo
>          Issue Type: Bug
>    Affects Versions: 1.4.2
>         Environment: 
>            Reporter: Jeremy Kepner
>         Attachments: 1ingestor_1table_1tablet.pdf, 1ingestor.pdf, 
> 2ingestor.pdf, 3ingestor.pdf, 4ingestor.pdf, ingest_performance_explained.pdf
>
>
> Linux 2.6.32, single node, 32 cores, 96 GB RAM, 3x3TB SATA drives, RAID5
> Ingest performance into Accumulo varies by 2.5x, depending upon the number 
> of ingestors.
> Performance tests were carried out using the Graph500 benchmark (see 
> d4m_api/examples/3Scaling/2ParallelDatabase/pDB10_EdgeInsertTEST.m from 
> http://www.mit.edu/~kepner/D4M/).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
