srinivas created PIG-4216:
-----------------------------
Summary: Making use of Centralized Cache Management in HDFS
Key: PIG-4216
URL: https://issues.apache.org/jira/browse/PIG-4216
Project: Pig
Issue Type: Improvement
Components: impl
Affects Versions: 0.12.0
Reporter: srinivas
I am working on optimizing joins , came across new feature in HDFS "Centralized
Cache Management in HDFS" , I tried to cache a dataset in hdfs and ran pig
script join, but I don't see any improvement in performance, I am not sure if
this feature is abstracted from pig and map reduce takes care of it or pig
needs some modifications.
http://www.cloudera.com/content/cloudera/en/documentation/cdh5/latest/CDH5-Installation-Guide/cdh5ig_hdfs_caching.html
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)