[
https://issues.apache.org/jira/browse/SQOOP-1869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241237#comment-14241237
]
Gwen Shapira edited comment on SQOOP-1869 at 12/10/14 3:29 PM:
---------------------------------------------------------------
I didn't get this to work yet, but the plan is to add new class ByteArraySchema
with a single Binary column.
The current logic in Matcher is:
{code}
if (fromSchema.isEmpty() && toSchema.isEmpty()) {
throw new SqoopException(MatcherError.MATCHER_0000, "Neither a FROM or TO
schemas been provided.");
{code}
I'm planning to change it to
{code}
if (fromSchema.isEmpty() && toSchema.isEmpty()) {
this.fromSchema = ByteArraySchema.getInstance()
this.toSchema = ByteArraySchema.getInstance()
{code}
This will keep the current logic that if one direction has a schema, this
schema will be used, and will not require changes to connectors.
As I said, its not working yet, so the plan may change.
was (Author: gwenshap):
I didn't get this to work yet, but the plan is to add new class ByteArraySchema
with a single Binary column.
The current logic in Matcher is:
if (fromSchema.isEmpty() && toSchema.isEmpty()) {
throw new SqoopException(MatcherError.MATCHER_0000, "Neither a FROM or TO
schemas been provided.");
I'm planning to change it to
if (fromSchema.isEmpty() && toSchema.isEmpty()) {
this.fromSchema = ByteArraySchema.getInstance()
this.toSchema = ByteArraySchema.getInstance()
This will keep the current logic that if one direction has a schema, this
schema will be used, and will not require changes to connectors.
As I said, its not working yet, so the plan may change.
> Sqoop2: Expand schema matching to support two schemaless connectors
> -------------------------------------------------------------------
>
> Key: SQOOP-1869
> URL: https://issues.apache.org/jira/browse/SQOOP-1869
> Project: Sqoop
> Issue Type: Improvement
> Reporter: Gwen Shapira
> Assignee: Gwen Shapira
>
> Currently the schema matches errors out if both FROM and TO connectors are
> empty. This prevents us from supporting HDFS->Kafka.
> I suggest to change the code to support the following:
> 1. Empty schema will contain a single byte[] field with whatever the
> connector writes into it.
> 2. As happens now, one connector is null and the other has a schema, the
> schema that exists will be used to parse the data.
> 3. If we have two empty schemas, the TO connector will get a byte[] and
> presumably know what to do with it.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)