[ https://issues.apache.org/jira/browse/GORA-206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13591600#comment-13591600 ]
Lewis John McGibbney commented on GORA-206:
-------------------------------------------
I've still not tracked this one down but I am getting closer.
When we use a patched version of GoraCompiler to compile the latest
webpage.json schema available on NUTCH-1477 (which, for the record, IS
syntactically fine), the following is generated in the WebPage.java class:
{code}
public static final Schema _SCHEMA =
Schema.parse("{\"type\":\"record\",\"name\":\"WebPage\",\"namespace\":\"org.apache.nutch.storage\",\"fields\":[{\"name\":\"baseurl\",\"type\":[\"null\",\"string\"]}},{\"name\":\"status\",\"type\":\"int\"},{\"name\":\"fetchtime\",\"type\":\"long\"},{\"name\":\"prevfetchtime\",\"type\":\"long\"},{\"name\":\"fetchinterval\",\"type\":\"int\"},{\"name\":\"retriessincefetch\",\"type\":\"int\"},{\"name\":\"modifiedtime\",\"type\":\"long\"},{\"name\":\"protocolstatus\",\"type\":[\"null\",\"protocolstatus\"]}},{\"name\":\"content\",\"type\":[\"null\",\"bytes\"]}},{\"name\":\"contenttype\",\"type\":[\"null\",\"string\"]}},{\"name\":\"prevsignature\",\"type\":[\"null\",\"bytes\"]}},{\"name\":\"signature\",\"type\":[\"null\",\"bytes\"]}},{\"name\":\"title\",\"type\":[\"null\",\"string\"]}},{\"name\":\"text\",\"type\":[\"null\",\"string\"]}},{\"name\":\"parsestatus\",\"type\":[\"null\",\"parsestatus\"]}},{\"name\":\"score\",\"type\":\"float\"},{\"name\":\"reprurl\",\"type\":[\"null\",\"string\"]}},{\"name\":\"headers\",\"type\":\"map\"},{\"name\":\"outlinks\",\"type\":\"map\"},{\"name\":\"inlinks\",\"type\":\"map\"},{\"name\":\"markers\",\"type\":\"map\"},{\"name\":\"metadata\",\"type\":\"map\"}]}");
{code}
This does not look good when I do some simple bracket matching: every field
whose type is a union (e.g. [\"null\",\"string\"]) is closed with "}}" instead
of "}". I think we've introduced a bug in GoraCompiler which needs to be
ironed out.
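
The bracket matching described above can be automated. What follows is a
minimal sketch, not part of Gora or Avro: the class name BracketCheck and the
method firstMismatch are hypothetical, and the input in main is a shortened
fragment of the generated schema string rather than the full one. It walks the
JSON text, ignores brackets inside string literals, and reports the position
of the first bracket that does not pair up.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical helper for illustration only; not part of Gora or Avro.
final class BracketCheck {

    /**
     * Returns the index of the first bracket that breaks the nesting,
     * or -1 if all braces and square brackets are balanced.
     * Brackets inside JSON string literals are ignored.
     */
    static int firstMismatch(String json) {
        Deque<Integer> open = new ArrayDeque<>(); // indices of unclosed openers
        boolean inString = false;
        for (int i = 0; i < json.length(); i++) {
            char c = json.charAt(i);
            if (inString) {
                if (c == '\\') {
                    i++;                 // skip the escaped character
                } else if (c == '"') {
                    inString = false;    // end of string literal
                }
                continue;
            }
            switch (c) {
                case '"':
                    inString = true;
                    break;
                case '{':
                case '[':
                    open.push(i);
                    break;
                case '}':
                case ']':
                    if (open.isEmpty()) {
                        return i;        // closer with nothing to close
                    }
                    char opener = json.charAt(open.pop());
                    if ((c == '}') != (opener == '{')) {
                        return i;        // closer does not match its opener
                    }
                    break;
                default:
                    break;
            }
        }
        // Anything left open is reported at the innermost unclosed opener.
        return open.isEmpty() ? -1 : open.peek();
    }

    public static void main(String[] args) {
        // Shortened fragment of the generated schema, with the same "}}" pattern.
        String fragment = "{\"fields\":[{\"name\":\"baseurl\","
                + "\"type\":[\"null\",\"string\"]}},"
                + "{\"name\":\"status\",\"type\":\"int\"}]}";
        int at = firstMismatch(fragment);
        System.out.println(at < 0 ? "balanced" : "first mismatch at index " + at);
    }
}
```

Run against the full string passed to Schema.parse, a check like this should
point at the second "}" of the first union field, which narrows the search to
the part of GoraCompiler that writes out union types.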
> Verify storage and retrieval of Avro null-single-type Union data type within
> Gora-Cassandra
> -------------------------------------------------------------------------------------------
>
> Key: GORA-206
> URL: https://issues.apache.org/jira/browse/GORA-206
> Project: Apache Gora
> Issue Type: Sub-task
> Components: storage-cassandra
> Affects Versions: 0.3
> Reporter: Renato Javier Marroquín Mogrovejo
> Assignee: Renato Javier Marroquín Mogrovejo
> Labels: gora-cassandra, gora-core
> Fix For: 0.3
>
> Attachments: GORA-206.v1.patch, GORA-206.v2.patch
>
>
> The necessary features should be added to confirm that we are able to support
> Avro Union data types.
> This refers specifically to null-single-type unions. We will open another
> issue to address the multi-type unions.