[ https://issues.apache.org/jira/browse/HDFS-7602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
James Kinley updated HDFS-7602:
-------------------------------
Description:
Provide a utility to determine HDFS file formats and compression types, akin to
Linux's file utility.
There is no easy way to do this today, short of downloading a file and running
Linux's file utility on it. Even that yields limited information, because
Linux's magic file has no entries for the leading bytes of Hadoop's common
file formats, for example 'S', 'E', 'Q' for SequenceFiles or
'P', 'A', 'R', '1' for Parquet.
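A minimal sketch of the magic-byte check such a utility might perform, written
in Python for brevity. The 'SEQ' and 'PAR1' signatures are the ones named
above; the gzip and bzip2 signatures are added for illustration, and the names
MAGIC_NUMBERS, identify_format and identify_stream are hypothetical. The sketch
assumes the caller can obtain a binary file-like object for the HDFS file from
whatever client library is in use, and it reads only the first few bytes
rather than downloading the whole file.

```python
import io

# Leading-byte signatures mapped to a format label.
# 'SEQ' and 'PAR1' come from the issue description; the gzip and bzip2
# entries are assumptions added for illustration.
MAGIC_NUMBERS = [
    (b"PAR1", "Parquet"),
    (b"SEQ", "SequenceFile"),
    (b"\x1f\x8b", "gzip"),
    (b"BZh", "bzip2"),
]


def identify_format(header: bytes) -> str:
    """Return a format label for the given leading bytes, or 'unknown'."""
    for magic, label in MAGIC_NUMBERS:
        if header.startswith(magic):
            return label
    return "unknown"


def identify_stream(stream, probe_size: int = 4) -> str:
    """Read only the first probe_size bytes of a binary stream and classify them."""
    return identify_format(stream.read(probe_size))


if __name__ == "__main__":
    # An in-memory stream stands in for a file opened from HDFS.
    sample = io.BytesIO(b"SEQ\x06" + b"\x00" * 16)
    print(identify_stream(sample))
```

Matching on a table of prefixes keeps the check cheap: one short read per
file, and new formats can be supported by adding an entry to the table.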
was: Provide a utility to determine HDFS file formats and compression types,
akin to Linux's file utility.
> HDFS file utility
> -----------------
>
> Key: HDFS-7602
> URL: https://issues.apache.org/jira/browse/HDFS-7602
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: hdfs-client, tools
> Affects Versions: 2.5.0
> Reporter: James Kinley
> Priority: Minor