Hi guys,

I think it really depends on the application that you are using.
In my case on HDFS, we have quite often thousands of custom applications
that read the same data/block at more or less the same time. I can see that
it saves us a lot of IOPS because after some time it reads straight from
the memory and I see less load on the disk.
I don't know how much benefit we get from the caching compared to
re-read each time from the disk but I imagine it's not null plus it
decreases the load on the disk allowing them to do other stuff.
Now you are the expert guys ^^, if you think there are other things to do
to improve the performance, then it's clearly not a priority.

I don't know if other people have opinions on it.

Best regards,
Michel



Le jeu. 7 janv. 2021 à 08:38, Wei-Chiu Chuang <weic...@apache.org> a écrit :

> To be honest, the HDFS caching didn't take off because somehow it didn't
> improve application performance a lot (speaking from an Impala perspective,
> I've been told it seems to help with Solr).
>
> As a matter of fact we did some performance tests recently comparing HDFS
> and Ozone.
> Depending on the set up you have and the applications you are running,
> there are variant levels of perf gaps that are identified, and I think
> there are more critical features we need to implement than caching which
> will improve performance more.
>
> If you run Impala, Impala supports remote caching which works on several
> file systems including Ozone, and it helps with improving performance quite
> nicely.
>
> On Wed, Jan 6, 2021 at 10:24 PM Lin, Yiqun <yiq...@ebay.com.invalid>
> wrote:
>
> > Hi Michel,
> >
> > As far as I know, currently Ozone hasn't a feature like HDFS cache block.
> > Not sure if someone else has done the performance test between Ozone read
> > and HDFS read.
> > But anyway, cache block on Ozone seems a good idea if it can greatly
> speed
> > the data read.
> >
> > Thanks,
> > Yiqun
> >
> > On 2021/1/6, 11:47 PM, "Michel Sumbul" <michelsum...@gmail.com> wrote:
> >
> >     External Email
> >
> >     Hi guys,
> >
> >     I would like to know if Ozone has a caching feature like HDFS? (
> >
> >
> https://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fhadoop.apache.org%2Fdocs%2Fcurrent%2Fhadoop-project-dist%2Fhadoop-hdfs%2FCentralizedCacheManagement.html&amp;data=04%7C01%7Cyiqlin%40ebay.com%7Ca80ab0cd29194003f00008d8b25a6411%7C46326bff992841a0baca17c16c94ea99%7C0%7C0%7C637455448585079421%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&amp;sdata=MvuLw8eEANxVuHiCEprR%2BWkb%2FriKjhOoDlZPaX%2FEPVo%3D&amp;reserved=0
> >     )
> >
> >     I didn’t find any doc on it for Ozone, but caching blocks frequently
> > access
> >     really improve the performance on HDFS and it might be a nice to have
> > for
> >     Ozone to avoid unnecessary IOPS.
> >
> >     Thanks,
> >     Michel
> >
> >     PS: sorry for the resend, it was the old mailing list
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: dev-unsubscr...@ozone.apache.org
> > For additional commands, e-mail: dev-h...@ozone.apache.org
> >
>

Reply via email to