Re: [jira] Commented: (HADOOP-51) per-file replication counts

Bryan A. Pendleton Mon, 10 Apr 2006 11:36:44 -0700

I realize that the value will come from hadoop-default.xml when it's not
defined elsewhere. I guess I was just suggesting that it makes the code
less clear to have an explicit default in cases where the value *would* be
provided from a -default file, or the equivalent.


Having spent enough time reading others' code over the years, I've found
that figuring out *why* a value gets set by walking the code is a tricky
operation, especially with a fancy configuration environment (yes, hadoop's
configuration is definitely in the "fancy" realm). My comment was merely
that we might want to have a better way of being explicit when things are
really "system defaults". The idiom used makes plenty of sense for
user-created or rarely-set values which have no specific defaults. Just my
$0.02, though, perhaps no one else agrees.
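
To make the lookup order under discussion concrete, here is a minimal
sketch in Java. It is not Hadoop's Configuration class: LayeredConf,
setDefault, setSite, and getRequiredInt are hypothetical names made up for
illustration, and the two maps merely stand in for hadoop-default.xml and
an overriding config file. getInt mirrors the conf.getType("config.value",
defaultValue) idiom, where the argument is only a last resort.
getRequiredInt is one possible way of being explicit about "system
defaults": it takes no fallback and fails if no file supplies the value.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a layered configuration (NOT Hadoop's real class).
public class LayeredConf {
    // Plays the role of hadoop-default.xml.
    private final Map<String, String> defaults = new HashMap<>();
    // Plays the role of any config file that overrides the defaults.
    private final Map<String, String> site = new HashMap<>();

    public void setDefault(String key, String value) { defaults.put(key, value); }
    public void setSite(String key, String value) { site.put(key, value); }

    // Returns the override if present, else the defaults-file value, else null.
    private String lookup(String key) {
        return site.containsKey(key) ? site.get(key) : defaults.get(key);
    }

    // The idiom from the thread: the fallback is used only when no file has a value.
    public int getInt(String key, int fallback) {
        String raw = lookup(key);
        return raw == null ? fallback : Integer.parseInt(raw.trim());
    }

    // Hypothetical stricter accessor for system defaults: no in-code fallback.
    public int getRequiredInt(String key) {
        String raw = lookup(key);
        if (raw == null) {
            throw new IllegalStateException("No value configured for " + key);
        }
        return Integer.parseInt(raw.trim());
    }

    public static void main(String[] args) {
        LayeredConf conf = new LayeredConf();
        conf.setDefault("config.value", "3");

        // The defaults file wins over the in-code fallback.
        System.out.println(conf.getInt("config.value", 1));
        // Only a key that no file defines falls through to the fallback.
        System.out.println(conf.getInt("user.custom.value", 7));
        // A system default is read without repeating a value in code.
        System.out.println(conf.getRequiredInt("config.value"));
    }
}
```

With getInt, a reader has to check the config files to learn whether the
in-code fallback is ever used. With the stricter accessor, the call site
itself says the value must come from a file.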

On 4/9/06, Doug Cutting (JIRA) <[EMAIL PROTECTED]> wrote:
>
>     [
> http://issues.apache.org/jira/browse/HADOOP-51?page=comments#action_12373803]
>
> Doug Cutting commented on HADOOP-51:
> ------------------------------------
>
> > the idiom of conf.getType("config.value",defaultValue) is good for
> > user-defined values, but shouldn't the default be skipped for things that
> > are defined in hadoop-default.xml, in general?
>
> The value from hadoop-default.xml is used in preference to the
> defaultValue parameter.  The parameter is only used as a last resort when no
> value is found in hadoop-default.xml or any other config file.
>
> > per-file replication counts
> > ---------------------------
> >
> >          Key: HADOOP-51
> >          URL: http://issues.apache.org/jira/browse/HADOOP-51
> >      Project: Hadoop
> >         Type: New Feature
> >
> >   Components: dfs
> >     Versions: 0.2
> >     Reporter: Doug Cutting
> >     Assignee: Konstantin Shvachko
> >      Fix For: 0.2
> >  Attachments: Replication.patch
> >
> > It should be possible to specify different replication counts for
> > different files.  Perhaps an option when creating a new file should be the
> > desired replication count.  MapReduce should take advantage of this feature
> > so that job.xml and job.jar files, which are frequently accessed by lots
> > of machines, are more highly replicated than large data files.
>
> --
> This message is automatically generated by JIRA.
> -
> If you think it was sent incorrectly contact one of the administrators:
>    http://issues.apache.org/jira/secure/Administrators.jspa
> -
> For more information on JIRA, see:
>    http://www.atlassian.com/software/jira
>
>


--
Bryan A. Pendleton
Ph: (877) geek-1-bp
