[ 
https://issues.apache.org/jira/browse/HADOOP-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720616#action_12720616
 ] 

Steve Loughran commented on HADOOP-6059:
----------------------------------------

+1 for restrictions. 

I'd go for names that allow lots of 

* all valid HDFS names are valid within XML files. That is, at a minimum, the 
only values <ASCII 32 are tab, cr, and lf. And I can think of some good reasons 
to stop that too. No < or > either.
* all valid HDFS names are valid within string database tables. 
* All valid names can be represnted with strings in JSON documents, possibly 
with some escaping
* the normal POSIX forbidden paths are still forbidden

I have no pressing need for XML, JSON or in-database representation, but I can 
imagine it being useful in the future. Valid XML can also be used inside HTML 
reports..you don't want to do XSS tricks by creating filenames with <script> in 
their name to try and catch out anyone browsing the directory tree


> Should HDFS restrict the names used for files?
> ----------------------------------------------
>
>                 Key: HADOOP-6059
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6059
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: dfs
>    Affects Versions: 0.20.0
>            Reporter: Robert Chansler
>
> When reviewing the consequences of Hadoop:6017 (the name system could not 
> start because a file name interpreted as a regex caused a fault), the 
> discussion turned to improving the test set for file system functions by 
> broadening the set of names used for testing. Presently, HDFS allows any name 
> without a slash. _Should the space of names be restricted?_ If most funny 
> names are unintended, maybe the user would benefit from an early error 
> indication. A contrary view is that restricting names is so 20th-century.
> Should be or shouldn't we?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to