[ https://issues.apache.org/jira/browse/MAPREDUCE-4522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15179395#comment-15179395 ]
Tsuyoshi Ozawa commented on MAPREDUCE-4522:
-------------------------------------------
In addition to the above comment, MR_DBOUTPUTFORMAT_BATCH_SIZE should be
renamed to MR_DB_OUTPUT_FORMAT_BATCH_SIZE.
> DBOutputFormat Times out on large batch inserts
> -----------------------------------------------
>
> Key: MAPREDUCE-4522
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4522
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: task-controller
> Affects Versions: 0.20.205.0
> Reporter: Nathan Jarus
> Assignee: Shyam Gavulla
> Labels: newbie
>
> In DBRecordWriter#close(), progress is never updated. In large batch inserts,
> this can cause the reduce task to time out due to the amount of time it takes
> the SQL engine to process that insert.
> Potential solutions I can see:
> * Don't batch inserts; do the insert when DBRecordWriter#write() is called
>   (awful)
> * Spin up a thread in DBRecordWriter#close() and update progress in that
>   (gross)
> I can provide code for either if you're interested.
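
For reference, the second option quoted above (a thread in
DBRecordWriter#close() that keeps reporting progress) could look roughly like
the sketch below. This is not the patch attached to the issue. The class
ProgressHeartbeat, the method runWithProgress and the interval parameter are
hypothetical names, and the nested Progressable interface is only a local
stand-in that mirrors the single progress() method of
org.apache.hadoop.util.Progressable, so that the example compiles without
Hadoop on the classpath. The sleep in main() stands in for whatever blocking
JDBC calls close() makes.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

public class ProgressHeartbeat {

    /** Local stand-in (assumption) for org.apache.hadoop.util.Progressable. */
    public interface Progressable {
        void progress();
    }

    /**
     * Runs a blocking task on the calling thread while a daemon thread calls
     * reporter.progress() every intervalMillis. The heartbeat is stopped when
     * the task returns or throws.
     */
    public static <T> T runWithProgress(Callable<T> task,
                                        Progressable reporter,
                                        long intervalMillis) throws Exception {
        ScheduledExecutorService heartbeat =
            Executors.newSingleThreadScheduledExecutor(r -> {
                Thread t = new Thread(r, "db-close-progress");
                t.setDaemon(true); // never keep the JVM alive for the heartbeat
                return t;
            });
        heartbeat.scheduleAtFixedRate(reporter::progress,
            intervalMillis, intervalMillis, TimeUnit.MILLISECONDS);
        try {
            return task.call();
        } finally {
            heartbeat.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        AtomicInteger ticks = new AtomicInteger();
        String result = runWithProgress(() -> {
            Thread.sleep(300); // placeholder for the slow batch insert and commit
            return "committed";
        }, ticks::incrementAndGet, 50);
        System.out.println(result + ", progress calls: " + ticks.get());
    }
}
```

In the real writer, the Callable would wrap the batch execution and commit
done in close(), and the reporter would be whatever progress handle the task
makes available there. The interval should be well below the task timeout.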