[
https://issues.apache.org/jira/browse/HADOOP-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13435997#comment-13435997
]
Hudson commented on HADOOP-8654:
--------------------------------
Integrated in Hadoop-Common-trunk-Commit #2587 (See
[https://builds.apache.org/job/Hadoop-Common-trunk-Commit/2587/])
HADOOP-8654. TextInputFormat delimiter bug (Gelesh and Jason Lowe via
bobby) (Revision 1373859)
Result = SUCCESS
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1373859
Files :
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/util/LineReader.java
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/util/TestLineReader.java
> TextInputFormat delimiter bug:- Input Text portion ends with & Delimiter
> starts with same char/char sequence
> -------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8654
> URL: https://issues.apache.org/jira/browse/HADOOP-8654
> Project: Hadoop Common
> Issue Type: Bug
> Components: util
> Affects Versions: 0.20.204.0, 1.0.3, 0.21.0, 2.0.0-alpha
> Environment: Linux
> Reporter: Gelesh
> Labels: patch
> Fix For: 3.0.0, 2.2.0-alpha
>
> Attachments: HADOOP-8654.patch, MAPREDUCE-4512.txt
>
> Original Estimate: 1m
> Remaining Estimate: 1m
>
> TextInputFormat delimiter bug scenario: the input text contains a character
> sequence in which the character (or characters) immediately before a
> delimiter match the start of the delimiter itself, so that a partial match of
> the delimiter is followed directly by the complete delimiter.
> For example, with delimiter = "record"
> and text = " record 1:- name = Gelesh e mail = [email protected]
> Location Bangalore record 2: name = sdf .. location =Bangalorrecord 3: name
> .... "
> the string "=Bangalorrecord 3: " satisfies two conditions:
> 1) It contains the delimiter "record".
> 2) The character (or character sequence) immediately before the delimiter
> (i.e. 'r') matches the first character (or character sequence) of the
> delimiter; that is, "=Bangalor" ends with the same character 'r' that the
> delimiter starts with.
> In this case the program does not detect the delimiter, so the value text
> passed to the map is wrong: it still contains the delimiter.
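
The scenario described above can be reproduced with a small standalone sketch. This is not the code from LineReader.java or from the attached patch; the class DelimiterSplitSketch and the methods splitNaive and splitWithBacktrack are hypothetical names used only for illustration, and the sketch works on a String rather than on a byte buffer. It assumes a non-empty delimiter. The first method resets its match position on a mismatch without re-examining the character that caused it, which is the kind of behaviour the report describes; the second steps back so that the character is checked again as a possible delimiter start.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not the Hadoop LineReader implementation.
public class DelimiterSplitSketch {

    // Naive scan: on a mismatch the match position is reset to 0, but the
    // current character is never re-checked against the delimiter's first
    // character, so a delimiter that starts right there is missed.
    public static List<String> splitNaive(String text, String delimiter) {
        List<String> records = new ArrayList<>();
        int start = 0;    // start of the current record
        int matched = 0;  // number of delimiter characters matched so far
        for (int i = 0; i < text.length(); i++) {
            if (text.charAt(i) == delimiter.charAt(matched)) {
                matched++;
                if (matched == delimiter.length()) {
                    records.add(text.substring(start, i + 1 - matched));
                    start = i + 1;
                    matched = 0;
                }
            } else {
                matched = 0;
            }
        }
        records.add(text.substring(start));
        return records;
    }

    // Backtracking scan: after a failed partial match, resume scanning one
    // character past the position where that partial match began.
    public static List<String> splitWithBacktrack(String text, String delimiter) {
        List<String> records = new ArrayList<>();
        int start = 0;
        int matched = 0;
        for (int i = 0; i < text.length(); i++) {
            if (text.charAt(i) == delimiter.charAt(matched)) {
                matched++;
                if (matched == delimiter.length()) {
                    records.add(text.substring(start, i + 1 - matched));
                    start = i + 1;
                    matched = 0;
                }
            } else if (matched > 0) {
                // The failed match began at i - matched; the loop's i++ then
                // moves to the next candidate start position.
                i -= matched;
                matched = 0;
            }
        }
        records.add(text.substring(start));
        return records;
    }

    public static void main(String[] args) {
        String text = "=Bangalorrecord 3: ";  // fragment quoted in the report
        String delimiter = "record";
        System.out.println(splitNaive(text, delimiter));
        System.out.println(splitWithBacktrack(text, delimiter));
    }
}
```

With the fragment from the report, the naive scan returns the whole string as a single record, while the backtracking scan splits it into "=Bangalor" and " 3: ".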
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira