Hi,

I remember that the sync module will rename the file after merge. More details 
about sync could be given by @Tianan.

Best,
--
Jialin Qiao
School of Software, Tsinghua University

乔嘉林
清华大学 软件学院

> -----原始邮件-----
> 发件人: "Xiangdong Huang" <[email protected]>
> 发送时间: 2019-09-02 13:28:50 (星期一)
> 收件人: [email protected]
> 抄送: 
> 主题: Re: [IoTDB-189] question about the feature of compatibility of HDFS
> 
> Hi,
> 
> Look forward to the performance comparison.
> 
> > Currently, renaming only occurs in two case
> > (1) updating the version files for each storage group periodically in
> system/storage_group_x/version-xxx
> > (2) updating the compression-ratio file when dynamically adjusting
> parameters
> 
> How about the sync module? Seems there is a file for recording the file
> list (well, the sync module execution is not frequent)
> 
> Best,
> 
> -----------------------------------
> Xiangdong Huang
> School of Software, Tsinghua University
> 
>  黄向东
> 清华大学 软件学院
> 
> 
> Jialin Qiao <[email protected]> 于2019年9月1日周日 下午8:00写道:
> 
> > Hi,
> >
> > > Ah... I remember that there is a file whose name is "version-xxx", in
> > which
> > the "xxx" is a counter, and will be updated if the version difference
> > between the disk and memory >= 50.
> >
> > Yes, each storage group has a file named version-xxx under
> > system/storage_group_xxx folder.
> >
> > > I am not sure whether it is correct that "renaming is not very frequent"
> > because each ChunkGroup has a version (am I right?)..
> >
> > Right, each ChunkGroup will increase this version, and each new data file
> > will increase this version.
> >
> > Currently, renaming only occurs in two case
> > (1) updating the version files for each storage group periodically in
> > system/storage_group_x/version-xxx
> > (2) updating the compression-ratio file when dynamically adjusting
> > parameters
> >
> > It's hard to say whether it is frequent or not, we can compare the
> > performance between local file system and HDFS after supporting HDFS.
> >
> > Best,
> > --
> > Jialin Qiao
> > School of Software, Tsinghua University
> >
> > 乔嘉林
> > 清华大学 软件学院
> >
> > > -----原始邮件-----
> > > 发件人: "Xiangdong Huang" <[email protected]>
> > > 发送时间: 2019-09-01 23:10:08 (星期日)
> > > 收件人: [email protected]
> > > 抄送:
> > > 主题: Re: [IoTDB-189] question about the feature of compatibility of HDFS
> > >
> > > Hi,
> > >
> > > > since renaming is not very frequent, maybe we do not need to worry
> > about
> > > performance.
> > >
> > > Ah... I remember that there is a file whose name is "version-xxx", in
> > which
> > > the "xxx" is a counter, and will be updated if the version difference
> > > between the disk and memory >= 50.
> > >
> > > I am not sure whether it is correct that "renaming is not very frequent"
> > > because each ChunkGroup has a version (am I right?)..
> > >
> > > But, I am not saying that writing on HDFS is bad, I just want to remind
> > you
> > > that reducing renaming operations, or proving that renaming on HDFS is
> > not
> > > a time consuming operation.
> > >
> > > Best,
> > > -----------------------------------
> > > Xiangdong Huang
> > > School of Software, Tsinghua University
> > >
> > >  黄向东
> > > 清华大学 软件学院
> > >
> > >
> > > Zesong Sun <[email protected]> 于2019年8月31日周六 下午8:28写道:
> > >
> > > > Hi,
> > > >
> > > >
> > > > This issue is a sub-task of IoTDB-187 [1], which enables users to
> > choose
> > > > whether files are storied in local file system or HDFS.
> > > >
> > > >
> > > > In this sub-task, I intend to use an encapsulated "File" to replace
> > > > existed Java File in our project.
> > > >
> > > >
> > > > Currently in our design, all files could be written on HDFS.
> > > >
> > > >
> > > > HDFS supports truncating files, and since renaming is not very
> > frequent,
> > > > maybe we do not need to worry about performance.
> > > >
> > > >
> > > > If anyone has any other ideas, please discuss with me.
> > > >
> > > >
> > > > [1] https://issues.apache.org/jira/browse/IOTDB-187
> > > >
> > > >
> > > > ------------------
> > > > Zesong Sun
> > > > School of Software, Tsinghua University
> > > >
> > > > 孙泽嵩
> > > > 清华大学 软件学院
> > > >
> > > >
> > > >
> > > >
> > > >
> > > >
> > > >
> > > > ------------------ 原始邮件 ------------------
> > > > 发件人: "Xiangdong Huang"<[email protected]>;
> > > > 发送时间: 2019年8月31日(星期六) 晚上10:50
> > > > 收件人: "dev"<[email protected]>;
> > > >
> > > > 主题: [IoTDB-189] question about the feature of compatibility of HDFS
> > > >
> > > >
> > > >
> > > > Hi,
> > > >
> > > > This issue only has a title.. without any description...
> > > >
> > > > I think it is for letting IoTDB write TsFiles on HDFS directly.
> > > >
> > > > My question is, is only TsFile written on HDFS? or all files (e.g.,
> > wal,
> > > > system data file)?
> > > >
> > > > Besides, if all files are on HDFS, do we need to avoid using File name
> > as a
> > > > mark or a lock (i.e., create a file and then rename it) for performance
> > > > consideration?
> > > >
> > > > (I am on my trip and then I realize clearly that current info on issue
> > is
> > > > not enough for guys who are on the internet)
> > > >
> > > > Best,
> > > > -----------------------------------
> > > > Xiangdong Huang
> > > > School of Software, Tsinghua University
> > > >
> > > >  黄向东
> > > > 清华大学 软件学院
> >

Reply via email to