Review and document '_' prefix convention in input directories
--------------------------------------------------------------

                 Key: HADOOP-2715
                 URL: https://issues.apache.org/jira/browse/HADOOP-2715
             Project: Hadoop Core
          Issue Type: Bug
            Reporter: eric baldeschwieler


We use files and directories prefixed with '_' to store logs, metadata and 
other info that might be useful to the owner of a job within the output 
directory.  The standard input methods then ignore such files by default.

HADOOP-2391 led to some discussion of the '_' convention in output 
directories.  Not all developers' input formats support it.  We should 
review the convention and document it well so that future input methods support 
it.  Or we should come up with an alternate approach.  

My hope is that after some discussion we will close this bug by creating a 
documentation patch explaining the convention.

It sounds like the convention is implemented via some input filter classes.  We 
should discuss whether this generic solution is helping or obscuring the intent 
of the convention.  Perhaps we should just have a non-configurable filter, so 
that '_'-prefixed files are treated the way most Unix tools treat '.'-prefixed 
files.
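
To make the non-configurable option concrete, here is a rough sketch of what 
such a fixed filter could look like.  It is plain Java and deliberately does not 
use any Hadoop classes; the class name HiddenNameFilter, its methods and the 
sample file names are all made up for illustration and are not the existing 
filter implementation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch only: a fixed (non-configurable) name filter that
// treats '_'-prefixed names the same way as '.'-prefixed names.
final class HiddenNameFilter {
    private HiddenNameFilter() {}

    /** Returns true if a file or directory name should be read as job input. */
    static boolean isVisible(String name) {
        return !name.startsWith("_") && !name.startsWith(".");
    }

    /** Returns the names from a directory listing that pass the filter, in order. */
    static List<String> visibleNames(List<String> names) {
        List<String> result = new ArrayList<>();
        for (String name : names) {
            if (isVisible(name)) {
                result.add(name);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Illustrative listing of a job output directory (names are assumptions).
        List<String> listing =
            Arrays.asList("part-00000", "part-00001", "_logs", ".part-00000.crc");
        System.out.println(visibleNames(listing)); // prints [part-00000, part-00001]
    }
}
```

An input method that applied a check like this to every listing would skip logs 
and metadata without each input format having to opt in, which is the behaviour 
the documentation patch would need to describe.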



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
