Jarek Jarcec Cecho created SQOOP-2299:
-----------------------------------------
Summary: Sqoop2: Store Context classes in repository
Key: SQOOP-2299
URL: https://issues.apache.org/jira/browse/SQOOP-2299
Project: Sqoop
Issue Type: Bug
Affects Versions: 1.99.5
Reporter: Jarek Jarcec Cecho
Assignee: Jarek Jarcec Cecho
Fix For: 1.99.7
While looking into persisting state from incremental job (SQOOP-1803), I've
uncover a Hadoop bug where any Hadoop 2 will return incorrect {{job.xml}} when
using {{JobClient}} APIs to get job's details. The issue is harder to track as
it was initially fixed in Hadoop 2.7.0 via MAPREDUCE-5875, but subsequently
reverted because of MAPREDUCE-6288 and it's not clear to me when/if the fix
will be provided. This is relevant to us as we are storing our {{Context}}
classes in job conf. I've looked around why nobody seen this problem before and
it seems that projects are generally persisting properties in their
repositories rather then using Hadoop APIs to retrieve the {{Configuration}}
object back.
Thinking about it a bit more, I think that it would be useful to keep track of
the context classes as they contain additional information that can be useful
for debugging purpose. I'm not yet sure whether we should expose those objects
over the REST interface as they can possibly contain sensitive information, but
it seems useful to at least persist those.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)