[
https://issues.apache.org/jira/browse/YARN-4696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Steve Loughran updated YARN-4696:
---------------------------------
Attachment: YARN-4696-008.patch
Patch -008. This removes a subclass of RawLocalFileSystem that I'd been trying
to instantiate directly. That doesn't work...I won't go into the details.
Note also that patch -007
# has the code to remember the cache option before the
{{FileSystemTimelineWriter}} gets a file, and restores it after
# has commented out the entire action of disabling the cache.
Why #2? It's to try to get a local FS with checksumming disabled picked up in
test cases. I've not got that working.
Why #1? Because some other part of the JVM may want caching, and so they won't
want this class disabling it for them.
I'm assuming that the caching was disabled to ensure that if this class closed
the fs instance then the solution there is: don't close the FS when the service
is stopped. We can rely on Hadoop itself to stop all filesystems in JVM
shutdown. Of course, if the concern is that its other bits of code closing the
FS, that's harder. In such a case, if I do manage to get my local FS test
working, then we may need a test-time option to not-disable the cache
> EntityGroupFSTimelineStore to work in the absence of an RM
> ----------------------------------------------------------
>
> Key: YARN-4696
> URL: https://issues.apache.org/jira/browse/YARN-4696
> Project: Hadoop YARN
> Issue Type: Sub-task
> Components: timelineserver
> Affects Versions: 2.8.0
> Reporter: Steve Loughran
> Assignee: Steve Loughran
> Attachments: YARN-4696-001.patch, YARN-4696-002.patch,
> YARN-4696-003.patch, YARN-4696-005.patch, YARN-4696-006.patch,
> YARN-4696-007.patch, YARN-4696-008.patch
>
>
> {{EntityGroupFSTimelineStore}} now depends on an RM being up and running; the
> configuration pointing to it. This is a new change, and impacts testing where
> you have historically been able to test without an RM running.
> The sole purpose of the probe is to automatically determine if an app is
> running; it falls back to "unknown" if not. If the RM connection was
> optional, the "unknown" codepath could be called directly, relying on age of
> file as a metric of completion
> Options
> # add a flag to disable RM connect
> # skip automatically if RM not defined/set to 0.0.0.0
> # disable retries on yarn client IPC; if it fails, tag app as unknown.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)