[jira] [Commented] (HDFS-12828) OIV ReverseXML Processor fails with escaped characters
[ https://issues.apache.org/jira/browse/HDFS-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16448366#comment-16448366 ] Erik Krogen commented on HDFS-12828: Thanks [~ajisakaa]! > OIV ReverseXML Processor fails with escaped characters > -- > > Key: HDFS-12828 > URL: https://issues.apache.org/jira/browse/HDFS-12828 > Project: Hadoop HDFS > Issue Type: Bug > Components: hdfs >Affects Versions: 2.8.0 >Reporter: Erik Krogen >Assignee: Erik Krogen >Priority: Critical > Fix For: 2.10.0, 3.2.0, 3.1.1, 2.9.2, 3.0.3, 2.8.5 > > Attachments: HDFS-12828.000.patch, fsimage_008.xml > > > The HDFS OIV ReverseXML processor fails if the XML file contains escaped > characters: > {code} > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs dfs -fs hdfs://localhost:9000/ -ls / > Found 4 items > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:48 /foo > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:49 /foo" > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:50 /foo` > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:49 /foo& > {code} > Then after doing {{saveNamespace}} on that NameNode... > {code} > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs oiv -i > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008 -o > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml -p XML > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs oiv -i > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml -o > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml.rev -p > ReverseXML > OfflineImageReconstructor failed: unterminated entity ref starting with & > org.apache.hadoop.hdfs.util.XMLUtils$UnmanglingError: unterminated entity ref > starting with & > at > org.apache.hadoop.hdfs.util.XMLUtils.unmangleXmlString(XMLUtils.java:232) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildrenHelper(OfflineImageReconstructor.java:383) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildrenHelper(OfflineImageReconstructor.java:379) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildren(OfflineImageReconstructor.java:418) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.access$1000(OfflineImageReconstructor.java:95) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor$INodeSectionProcessor.process(OfflineImageReconstructor.java:524) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.processXml(OfflineImageReconstructor.java:1710) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.run(OfflineImageReconstructor.java:1765) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageViewerPB.run(OfflineImageViewerPB.java:191) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageViewerPB.main(OfflineImageViewerPB.java:134) > {code} > See attachments for relevant fsimage XML file. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12828) OIV ReverseXML Processor Fails With Escaped Characters
[ https://issues.apache.org/jira/browse/HDFS-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441925#comment-16441925 ] Akira Ajisaka commented on HDFS-12828: -- +1, nice catch! > OIV ReverseXML Processor Fails With Escaped Characters > -- > > Key: HDFS-12828 > URL: https://issues.apache.org/jira/browse/HDFS-12828 > Project: Hadoop HDFS > Issue Type: Bug > Components: hdfs >Affects Versions: 2.8.0 >Reporter: Erik Krogen >Assignee: Erik Krogen >Priority: Major > Attachments: HDFS-12828.000.patch, fsimage_008.xml > > > The HDFS OIV ReverseXML processor fails if the XML file contains escaped > characters: > {code} > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs dfs -fs hdfs://localhost:9000/ -ls / > Found 4 items > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:48 /foo > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:49 /foo" > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:50 /foo` > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:49 /foo& > {code} > Then after doing {{saveNamespace}} on that NameNode... > {code} > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs oiv -i > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008 -o > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml -p XML > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs oiv -i > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml -o > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml.rev -p > ReverseXML > OfflineImageReconstructor failed: unterminated entity ref starting with & > org.apache.hadoop.hdfs.util.XMLUtils$UnmanglingError: unterminated entity ref > starting with & > at > org.apache.hadoop.hdfs.util.XMLUtils.unmangleXmlString(XMLUtils.java:232) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildrenHelper(OfflineImageReconstructor.java:383) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildrenHelper(OfflineImageReconstructor.java:379) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildren(OfflineImageReconstructor.java:418) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.access$1000(OfflineImageReconstructor.java:95) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor$INodeSectionProcessor.process(OfflineImageReconstructor.java:524) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.processXml(OfflineImageReconstructor.java:1710) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.run(OfflineImageReconstructor.java:1765) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageViewerPB.run(OfflineImageViewerPB.java:191) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageViewerPB.main(OfflineImageViewerPB.java:134) > {code} > See attachments for relevant fsimage XML file. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12828) OIV ReverseXML Processor Fails With Escaped Characters
[ https://issues.apache.org/jira/browse/HDFS-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437851#comment-16437851 ] Erik Krogen commented on HDFS-12828: Unit test failures look unrelated... > OIV ReverseXML Processor Fails With Escaped Characters > -- > > Key: HDFS-12828 > URL: https://issues.apache.org/jira/browse/HDFS-12828 > Project: Hadoop HDFS > Issue Type: Bug > Components: hdfs >Affects Versions: 2.8.0 >Reporter: Erik Krogen >Assignee: Erik Krogen >Priority: Major > Attachments: HDFS-12828.000.patch, fsimage_008.xml > > > The HDFS OIV ReverseXML processor fails if the XML file contains escaped > characters: > {code} > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs dfs -fs hdfs://localhost:9000/ -ls / > Found 4 items > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:48 /foo > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:49 /foo" > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:50 /foo` > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:49 /foo& > {code} > Then after doing {{saveNamespace}} on that NameNode... > {code} > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs oiv -i > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008 -o > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml -p XML > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs oiv -i > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml -o > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml.rev -p > ReverseXML > OfflineImageReconstructor failed: unterminated entity ref starting with & > org.apache.hadoop.hdfs.util.XMLUtils$UnmanglingError: unterminated entity ref > starting with & > at > org.apache.hadoop.hdfs.util.XMLUtils.unmangleXmlString(XMLUtils.java:232) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildrenHelper(OfflineImageReconstructor.java:383) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildrenHelper(OfflineImageReconstructor.java:379) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildren(OfflineImageReconstructor.java:418) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.access$1000(OfflineImageReconstructor.java:95) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor$INodeSectionProcessor.process(OfflineImageReconstructor.java:524) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.processXml(OfflineImageReconstructor.java:1710) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.run(OfflineImageReconstructor.java:1765) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageViewerPB.run(OfflineImageViewerPB.java:191) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageViewerPB.main(OfflineImageViewerPB.java:134) > {code} > See attachments for relevant fsimage XML file. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12828) OIV ReverseXML Processor Fails With Escaped Characters
[ https://issues.apache.org/jira/browse/HDFS-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437845#comment-16437845 ] genericqa commented on HDFS-12828: -- | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime || Comment || | {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 21s{color} | {color:blue} Docker mode activated. {color} | || || || || {color:brown} Prechecks {color} || | {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} | | {color:green}+1{color} | {color:green} test4tests {color} | {color:green} 0m 0s{color} | {color:green} The patch appears to include 1 new or modified test files. {color} | || || || || {color:brown} trunk Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 26m 25s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 0m 56s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 49s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 1m 0s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} shadedclient {color} | {color:green} 11m 53s{color} | {color:green} branch has no errors when building and testing our client artifacts. {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 1m 51s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 44s{color} | {color:green} trunk passed {color} | || || || || {color:brown} Patch Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 0m 59s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 0m 50s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javac {color} | {color:green} 0m 50s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 45s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 0m 56s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} | | {color:green}+1{color} | {color:green} shadedclient {color} | {color:green} 11m 10s{color} | {color:green} patch has no errors when building and testing our client artifacts. {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 1m 53s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 43s{color} | {color:green} the patch passed {color} | || || || || {color:brown} Other Tests {color} || | {color:red}-1{color} | {color:red} unit {color} | {color:red} 82m 6s{color} | {color:red} hadoop-hdfs in the patch failed. {color} | | {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 25s{color} | {color:green} The patch does not generate ASF License warnings. {color} | | {color:black}{color} | {color:black} {color} | {color:black}143m 35s{color} | {color:black} {color} | \\ \\ || Reason || Tests || | Failed junit tests | hadoop.hdfs.server.namenode.TestDecommissioningStatus | | | hadoop.hdfs.server.namenode.TestReencryptionWithKMS | | | hadoop.hdfs.server.datanode.TestDataNodeVolumeFailureReporting | | | hadoop.hdfs.qjournal.server.TestJournalNodeSync | \\ \\ || Subsystem || Report/Notes || | Docker | Client=17.05.0-ce Server=17.05.0-ce Image:yetus/hadoop:8620d2b | | JIRA Issue | HDFS-12828 | | JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12918981/HDFS-12828.000.patch | | Optional Tests | asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle | | uname | Linux 775eea79b119 3.13.0-139-generic #188-Ubuntu SMP Tue Jan 9 14:43:09 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux | | Build tool | maven | | Personality | /testptch/patchprocess/precommit/personality/provided.sh | | git revision | trunk / e66e287 | | maven | version: Apache Maven 3.3.9 | | Default Java | 1.8.0_162 | | findbugs | v3.1.0-RC1 | | unit | https://builds.apache.org/job/PreCommit-HDFS-Build/23929/artifact/out/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt | | Test Results | https://builds.apache.org/job/PreCommit-HDFS-Build/23929/testReport/ | | Max. process+thread count | 3492 (vs. ulimit of 1) | | modules | C: hadoop-hdfs-project/hadoop-hdfs U:
[jira] [Commented] (HDFS-12828) OIV ReverseXML Processor Fails With Escaped Characters
[ https://issues.apache.org/jira/browse/HDFS-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437612#comment-16437612 ] Erik Krogen commented on HDFS-12828: The issue comes from how the {{XMLEventReader}} processing entity references. The code assumed that a XML block like {{foobar}} would be parsed as a {{START_ELEMENT}}, a {{CHARACTERS}} with "foobar", and an {{END_ELEMENT}}. However what actually happens between the start/end element is three {{CHARACTERS}} blocks, "foo", "&", and "bar" (note that the entity reference has already been handled). So, remove the flag to process entity references, and support multiple contiguous {{CHARACTERS}} blocks. Attached a patch with the fix and supplementing existing unit tests. > OIV ReverseXML Processor Fails With Escaped Characters > -- > > Key: HDFS-12828 > URL: https://issues.apache.org/jira/browse/HDFS-12828 > Project: Hadoop HDFS > Issue Type: Bug > Components: hdfs >Affects Versions: 2.8.0 >Reporter: Erik Krogen >Assignee: Erik Krogen >Priority: Major > Attachments: HDFS-12828.000.patch, fsimage_008.xml > > > The HDFS OIV ReverseXML processor fails if the XML file contains escaped > characters: > {code} > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs dfs -fs hdfs://localhost:9000/ -ls / > Found 4 items > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:48 /foo > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:49 /foo" > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:50 /foo` > drwxr-xr-x - ekrogen supergroup 0 2017-11-16 14:49 /foo& > {code} > Then after doing {{saveNamespace}} on that NameNode... > {code} > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs oiv -i > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008 -o > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml -p XML > ekrogen at ekrogen-ld1 in > ~/dev/hadoop/trunk/hadoop-dist/target/hadoop-3.0.0-beta1-SNAPSHOT on trunk! > ± $HADOOP_HOME/bin/hdfs oiv -i > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml -o > /tmp/hadoop-ekrogen/dfs/name/current/fsimage_008.xml.rev -p > ReverseXML > OfflineImageReconstructor failed: unterminated entity ref starting with & > org.apache.hadoop.hdfs.util.XMLUtils$UnmanglingError: unterminated entity ref > starting with & > at > org.apache.hadoop.hdfs.util.XMLUtils.unmangleXmlString(XMLUtils.java:232) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildrenHelper(OfflineImageReconstructor.java:383) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildrenHelper(OfflineImageReconstructor.java:379) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.loadNodeChildren(OfflineImageReconstructor.java:418) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.access$1000(OfflineImageReconstructor.java:95) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor$INodeSectionProcessor.process(OfflineImageReconstructor.java:524) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.processXml(OfflineImageReconstructor.java:1710) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageReconstructor.run(OfflineImageReconstructor.java:1765) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageViewerPB.run(OfflineImageViewerPB.java:191) > at > org.apache.hadoop.hdfs.tools.offlineImageViewer.OfflineImageViewerPB.main(OfflineImageViewerPB.java:134) > {code} > See attachments for relevant fsimage XML file. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org