[jira] [Commented] (NIFI-1935) Added ConvertDynamicJsonToAvro processor

Daniel Cave (JIRA) Mon, 13 Jun 2016 13:04:55 -0700

    [ 
https://issues.apache.org/jira/browse/NIFI-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15328084#comment-15328084
 ]


Daniel Cave commented on NIFI-1935:
-----------------------------------

1) Thursday it was NiFi-1987 that broke a fresh master build on Windows due to 
the deletion of a Cloudera repository on org.sonatype on 6/7 that was a deep 
sub-dependency in nifi-spark-receivers (it was deep under avro:avro.mapred 
1.7).  This mysteriously fixed itself and was not repeatable on 6/10 on a fresh 
Linux build (0.x or master).  
2) On our 0.x Linux server (with NiFi-1933/1934/1935 merged in, which had been 
developed against master since they started before the 0.x/master split) I was 
seeing old issues such as AttributesToJson failing to convert schema as 
attribute due to not being a string (InferAvroSchema writing it as avro.binary 
to attribute), and EvaluateJsonPath not properly interpretting the avro.binary, 
and other issues.  After trying the fresh 0.x version without the master based 
changes being merged in the attribute type changed to string and everything 
worked again.
3) Flow is OOB GetFile->SplitJSON->InverAvroSchema->ConvertAvroSchema->PutFile 
(attached).  Today, on fresh Linux server with fresh 0.x and master builds, 
with a higher throughput I was seeing larger strings passing through 
ConvertJsonToAvro in 0.x ok but in master they were changing formed and were 
being stripped of whitespaces.  At lower throughput I can't repeat the issue, 
and since our update to fix the filenames in SplitJson can't be added in (due 
to issue 2), I can't revalidate since all the filenames are the same.

Basically, I need to know what branch to use as a true master branch so that I 
can have builds and make needed changes and have consistent results.  Having 
essentially two masters is causing gremlins that are affecting my customer 
solutions.

> Added ConvertDynamicJsonToAvro processor
> ----------------------------------------
>
>                 Key: NIFI-1935
>                 URL: https://issues.apache.org/jira/browse/NIFI-1935
>             Project: Apache NiFi
>          Issue Type: New Feature
>          Components: Extensions
>    Affects Versions: 1.0.0, 0.7.0
>            Reporter: Daniel Cave
>            Assignee: Alex Halldin
>            Priority: Minor
>             Fix For: 1.0.0, 0.7.0
>
>         Attachments: 
> 0001-NIFI-1935-Added-ConvertDynamicJSONToAvro.java.-Added.patch, 
> GetFileSplitInferConvertAvro.xml
>
>
> ConvertJsonToAvro required a predefined Avro schema to convert JSON and 
> required the presence of all field on the incoming JSON.  
> ConvertDynamicJsonToAvro functions similarly, however it now accepts the JSON 
> and schema as incoming flowfiles and creates the Avro dynamically.
> This processor requires the InferAvroSchema processor in its upstream flow so 
> that it can use the original and schema flowfiles as input.  These two 
> flowfiles will have the unique attribute inferredAvroId set on them by 
> InferAvroSchema so that they can be properly matched in 
> ConvertDynamicJsonToAvro.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (NIFI-1935) Added ConvertDynamicJsonToAvro processor

Reply via email to