[ 
https://issues.apache.org/jira/browse/YARN-3908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14643188#comment-14643188
 ] 

Vrushali C commented on YARN-3908:
----------------------------------

Hi Zhijie

Thanks.. that is a good point. But if we put the event timestamp first, we have 
no way of querying for a particular event key unless we know the exact 
timestamp. I think knowing the exact time is probably almost impossible. 

Imagine that there is another event that occurs between the two kill events, so 
it has a timestamp > kill1 and < kill2. Now we still have to fetch all those 
and filter them out. So placing the timestamp first does not help in this case. 
But if we have the event key first, the columns will be placed together and the 
event timestamps will be stored in a chronological order (using the long.max - 
ts value). So the first one being fetched for kill event would be the latest 
for that event key. 

thanks
Vrushali




> Bugs in HBaseTimelineWriterImpl
> -------------------------------
>
>                 Key: YARN-3908
>                 URL: https://issues.apache.org/jira/browse/YARN-3908
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: timelineserver
>            Reporter: Zhijie Shen
>            Assignee: Vrushali C
>         Attachments: YARN-3908-YARN-2928.001.patch, 
> YARN-3908-YARN-2928.002.patch, YARN-3908-YARN-2928.003.patch, 
> YARN-3908-YARN-2928.004.patch, YARN-3908-YARN-2928.004.patch, 
> YARN-3908-YARN-2928.005.patch
>
>
> 1. In HBaseTimelineWriterImpl, the info column family contains the basic 
> fields of a timeline entity plus events. However, entity#info map is not 
> stored at all.
> 2 event#timestamp is also not persisted.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to