[
https://issues.apache.org/jira/browse/TAJO-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14354410#comment-14354410
]
ASF GitHub Bot commented on TAJO-1384:
--------------------------------------
GitHub user jihoonson opened a pull request:
https://github.com/apache/tajo/pull/406
TAJO-1384: Duplicated output file path problem
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/jihoonson/tajo-2 TAJO-1384
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/tajo/pull/406.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #406
----
commit afc25a285e6ce7ca999971d9e53e365b698a2d91
Author: Jihoon Son <[email protected]>
Date: 2015-03-10T06:36:04Z
TAJO-1384
----
> Duplicated output file path problem
> -----------------------------------
>
> Key: TAJO-1384
> URL: https://issues.apache.org/jira/browse/TAJO-1384
> Project: Tajo
> Issue Type: Bug
> Components: storage
> Reporter: Jihoon Son
> Assignee: Jihoon Son
>
> *Introduction*
> When consecutively inserting data into an existing directory, each files must
> have different names. To avoid the duplicated name problem, a unique sequence
> number is appended to the end of the file name. In addition, the max sequence
> number of pre-existing files is first found before starting writing to the
> final output directory.
> *Problem*
> When finding the max sequence number of pre-existing files, those files are
> not sorted by their sequence numbers. This finally causes the duplicated file
> name problem.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)