srinivas created PIG-4216:
-----------------------------

             Summary: Making use of Centralized Cache Management in HDFS
                 Key: PIG-4216
                 URL: https://issues.apache.org/jira/browse/PIG-4216
             Project: Pig
          Issue Type: Improvement
          Components: impl
    Affects Versions: 0.12.0
            Reporter: srinivas


I am working on optimizing joins and came across a new HDFS feature,
"Centralized Cache Management in HDFS". I cached a dataset in HDFS and ran a
Pig join script against it, but I don't see any improvement in performance. I
am not sure whether this feature is transparent to Pig, with MapReduce taking
care of it, or whether Pig needs some modifications to make use of it.


http://www.cloudera.com/content/cloudera/en/documentation/cdh5/latest/CDH5-Installation-Guide/cdh5ig_hdfs_caching.html

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
