[ https://issues.apache.org/jira/browse/YARN-2032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Li Lu updated YARN-2032: ------------------------ Attachment: YARN-2032-091114.patch Hi everyone, based on [~mayank_bansal]'s patch, I've done an updated version for the HBase timeline storage. General strategy and data schema are kept the same as the original patch. Here're what I've done: 1. Finished fromTs and fromId function in getEntities. For fromId, I added one more check to change the first record of a scan if necessary. For fromTs, I added a separate column qualifier in the entity table to store insert time for each entity, and, during query, filter out later records if necessary. 2. I restructured the code such that data schema and operations are decoupled from the actual HBase operations. Most timeline data storage logic is in the abstract store class now. While HBase storage class only needs to implement the abstract methods and interfaces required by the abstract storage class, including table creation, get, put, scan, and a few other helper functions. I hope this "pluggable" interface would simplify future extensions, and help to provide a unified abstract storage schema across different data storages. Comments on this design would certainly be more than welcome. 3. I've added two more unit test cases, to test fromId and fromTs, for HBase storage. Currently, the UTs work fine with branch-2, or with a HttpServer.java accessible by HBaseClient. But UTs are failing in trunk branch since HttpServer.java has been replaced. I added "Ignore" tags to the UT for now, but feel free to check them under branch-2. > Implement a scalable, available TimelineStore using HBase > --------------------------------------------------------- > > Key: YARN-2032 > URL: https://issues.apache.org/jira/browse/YARN-2032 > Project: Hadoop YARN > Issue Type: Sub-task > Reporter: Vinod Kumar Vavilapalli > Assignee: Mayank Bansal > Attachments: YARN-2032-091114.patch, YARN-2032-branch-2-1.patch, > YARN-2032-branch2-2.patch > > > As discussed on YARN-1530, we should pursue implementing a scalable, > available Timeline store using HBase. > One goal is to reuse most of the code from the levelDB Based store - > YARN-1635. -- This message was sent by Atlassian JIRA (v6.3.4#6332)