[ https://issues.apache.org/jira/browse/HADOOP-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720616#action_12720616 ]
Steve Loughran commented on HADOOP-6059: ---------------------------------------- +1 for restrictions. I'd go for names that allow lots of * all valid HDFS names are valid within XML files. That is, at a minimum, the only values <ASCII 32 are tab, cr, and lf. And I can think of some good reasons to stop that too. No < or > either. * all valid HDFS names are valid within string database tables. * All valid names can be represnted with strings in JSON documents, possibly with some escaping * the normal POSIX forbidden paths are still forbidden I have no pressing need for XML, JSON or in-database representation, but I can imagine it being useful in the future. Valid XML can also be used inside HTML reports..you don't want to do XSS tricks by creating filenames with <script> in their name to try and catch out anyone browsing the directory tree > Should HDFS restrict the names used for files? > ---------------------------------------------- > > Key: HADOOP-6059 > URL: https://issues.apache.org/jira/browse/HADOOP-6059 > Project: Hadoop Core > Issue Type: New Feature > Components: dfs > Affects Versions: 0.20.0 > Reporter: Robert Chansler > > When reviewing the consequences of Hadoop:6017 (the name system could not > start because a file name interpreted as a regex caused a fault), the > discussion turned to improving the test set for file system functions by > broadening the set of names used for testing. Presently, HDFS allows any name > without a slash. _Should the space of names be restricted?_ If most funny > names are unintended, maybe the user would benefit from an early error > indication. A contrary view is that restricting names is so 20th-century. > Should be or shouldn't we? -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.