core-site.xml
<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://s-pd-hadoop01:9000</value>
<description>URI of NameNode.</description>
</property>
</configuration>
hdfs-site.xml
<configuration>
<property>
<name>dfs.name.dir</name>
<value>/hdfs1/nameNode/</value>
<description>If this is a comma-delimited list of directories then the
name table is replicated in all of the directories, for
redundancy.</description>
</property>
<property>
<name>dfs.data.dir</name>
<value>/hdfs1/dataNode/,/hdfs2/dataNode/</value>
<description>If this is a comma-delimited list of directories, then data
will be stored in all named directories, typically on different devices.
</description>
</property>
<property>
<name>dfs.block.size</name>
<value>134217728</value>
<description>HDFS blocksize of 128MB for large file-systems.
</description>
</property>
</configuration>
mapred-site.xml
<configuration>
<property>
<name>mapred.job.tracker</name>
<value>192.168.1.202:9001</value>
<description>Host (or IP address) and port of the JobTracker, as a
host:port pair.</description>
</property>
<property>
<name>mapred.system.dir</name>
<value>/hadoop/mapred/system/</value>
<description>This is in the default filesystem (HDFS) and must be
accessible from both the server and client machines. </description>
</property>
<property>
<name>mapred.local.dir</name>
<value>/hdfs1/mapredtmp/</value>
<description>Multiple paths help spread disk i/o.</description>
</property>
<property>
<name>mapred.tasktracker.{map|reduce}.tasks.maximum</name>
<value>4</value>
<description>Defaults to 2 (2 maps and 2 reduces), but vary it depending
on your hardware.</description>
</property>
</configuration>
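For reference, the name mapred.tasktracker.{map|reduce}.tasks.maximum above is shorthand for two separate properties rather than a literal key. Below is a minimal sketch of how the pair might be written out in mapred-site.xml, assuming the value 4 is intended for both. That value and the description wording are illustrative assumptions, not taken from the actual file.

```xml
<!-- Sketch only: expands the {map|reduce} shorthand into the two
     separate TaskTracker properties. The value 4 is assumed for both. -->
<property>
<name>mapred.tasktracker.map.tasks.maximum</name>
<value>4</value>
<description>Maximum number of map tasks run simultaneously by one
TaskTracker.</description>
</property>
<property>
<name>mapred.tasktracker.reduce.tasks.maximum</name>
<value>4</value>
<description>Maximum number of reduce tasks run simultaneously by one
TaskTracker.</description>
</property>
```

If the shorthand is present in the file literally, Hadoop would probably not recognize it and would fall back to the default of 2 for each. That would be consistent with the 2 mappers and 2 reducers per slave mentioned in the original message.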
I have not changed anything in hive-default.xml.
Thanks,
Gary
On April 13, 2010 at 9:28 AM, Namit Jain <[email protected]> wrote:
> I meant hive/hadoop conf.
>
>
>
>
>
> *From:* 朱鹤群 [mailto:[email protected]]
> *Sent:* Monday, April 12, 2010 5:59 PM
>
> *To:* [email protected]
> *Subject:* Re: Problem about 'like' condition
>
>
>
> Do you mean the Hadoop conf or the Hive DDL?
>
> On April 13, 2010 at 8:39 AM, Namit Jain <[email protected]> wrote:
>
> Can you send your conf?
>
>
>
> *From:* 朱鹤群 [mailto:[email protected]]
> *Sent:* Monday, April 12, 2010 5:35 PM
>
>
> *To:* [email protected]
>
> *Subject:* Re: Problem about 'like' condition
>
>
>
> Hi Namit,
>
>
>
> No, I'm using the TEXTFILE input format.
>
> Yes. My files are gzip files.
>
>
>
> Thanks,
>
> Gary
>
> On April 13, 2010 at 2:20 AM, Namit Jain <[email protected]> wrote:
>
> Are you using CombineHiveInputFormat?
>
> And are you using compressed text files?
>
>
>
>
>
> Thanks,
>
> -namit
>
>
>
>
>
> *From:* 朱鹤群 [mailto:[email protected]]
> *Sent:* Monday, April 12, 2010 12:53 AM
> *To:* [email protected]
> *Subject:* Problem about 'like' condition
>
>
>
> Hi,
>
>
>
> Has anybody had a problem when querying in Hive with a condition such as
> (field like '%string%')?
>
> My problem is that the returned records are duplicated.
>
>
>
> My Hadoop cluster has 1 NameNode, 1 JobTracker and 3 slaves. On each slave,
> there are 2 mappers and 2 reducers.
>
> All data is kept in gz files.
>
> In this case, each record is duplicated 15 times.
>
> Can anybody give me a clue? Thanks.
>
>
> --
> Best Regards,
> Gary Zhu
>