[
https://issues.apache.org/jira/browse/FLINK-1579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15859955#comment-15859955
]
ASF GitHub Bot commented on FLINK-1579:
---------------------------------------
Github user uce commented on the issue:
https://github.com/apache/flink/pull/3286
Thanks for this PR. I think it's super cool (and long overdue) to finally
have this. I think the general approach is sound and has good things going for
it.
But I see two problems with maintaining the current implementation in the
long run:
- Changing a URL in the web dashboard requires that we also change it in
the history server file generation. It's easy to accidentally break the history
server and it's cumbersome to write tests when we extend it.
- We have all JSON generation in the JsonUtil. That results in a giant
class (as you say) that is really tightly coupled to the handlers but resides
in a different place.
I think we can word around this by moving the archival logic to the
handlers. I think it's fair that handlers that want to take part in historical
archiving implement their archival logic on their own. The handlers would need
to return their path and also their JSON. That way we keep all the handler
logic in one place for both the regular frontend and history server. A
"historic execution graph" might be a simple collection of `path, JSON` entries
that the history server persists on receive.
Another minor thing is that the final status of the operators is missing on
the history server. I don't know whether that is a front end issue or something
is missing in the JSON.
I think it's best to close this PR for now and think about how we want to
extend this a bit for better maintainability.
> Create a Flink History Server
> -----------------------------
>
> Key: FLINK-1579
> URL: https://issues.apache.org/jira/browse/FLINK-1579
> Project: Flink
> Issue Type: New Feature
> Components: Distributed Coordination
> Affects Versions: 0.9
> Reporter: Robert Metzger
> Assignee: Chesnay Schepler
>
> Right now its not possible to analyze the job results for jobs that ran on
> YARN, because we'll loose the information once the JobManager has stopped.
> Therefore, I propose to implement a "Flink History Server" which serves the
> results from these jobs.
> I haven't started thinking about the implementation, but I suspect it
> involves some JSON files stored in HDFS :)
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)