[
https://issues.apache.org/jira/browse/YARN-6027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15817073#comment-15817073
]
Varun Saxena commented on YARN-6027:
------------------------------------
bq. This should not happen. There should be exactly one row for a flow on a
given day.
Yes. I think they were retrieving data based on last 24 hours instead of
specific dates. That's why duplicate records came.
bq. We do have a lot of runs of a flow on a given day, for instance hRaven is
running constantly on our cluster. So we do expect several runs of a flow in a
day.
How many do we expect typically ? Can it run into thousands ? I had raised a
JIRA to limit flow runs within a flow. We should probably have that support
then.
> Support fromId for flows API
> -----------------------------
>
> Key: YARN-6027
> URL: https://issues.apache.org/jira/browse/YARN-6027
> Project: Hadoop YARN
> Issue Type: Sub-task
> Components: timelineserver
> Reporter: Rohith Sharma K S
> Assignee: Rohith Sharma K S
> Labels: yarn-5355-merge-blocker
>
> In YARN-5585 , fromId is supported for retrieving entities. We need similar
> filter for flows/flowRun apps and flow run and flow as well.
> Along with supporting fromId, this JIRA should also discuss following points
> * Should we throw an exception for entities/entity retrieval if duplicates
> found?
> * TimelieEntity :
> ** Should equals method also check for idPrefix?
> ** Does idPrefix is part of identifiers?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]