[ 
https://issues.apache.org/jira/browse/NIFI-1942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306957#comment-15306957
 ] 

ASF GitHub Bot commented on NIFI-1942:
--------------------------------------

GitHub user pvillard31 opened a pull request:

    https://github.com/apache/nifi/pull/476

    NIFI-1942 Processor to validate CSV against user-supplied schema

    This processor validates a CSV-formatted FlowFile against a user-supplied
    schema.
    
    It uses cell processors from the Super CSV library. The expected schema can
    be defined with the following cell processors:
    
    - ParseBigDecimal
    - ParseBool
    - ParseChar
    - ParseDate
    - ParseDouble
    - ParseInt
    - ParseLong
    - Optional
    - DMinMax
    - Equals
    - ForbidSubStr
    - LMinMax
    - NotNull
    - Null
    - RequireHashCode
    - RequireSubStr
    - Strlen
    - StrMinMax
    - StrNotNullOrEmpty
    - StrRegEx
    - Unique
    - UniqueHashCode
    
    Nested cell processors are not supported except with Optional.
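
To illustrate the idea of a per-column schema, here is a minimal sketch in plain Java. It is not the code from this pull request and it does not use the Super CSV API: the class `CsvSchemaSketch` and its methods are hypothetical names, the checks only loosely mirror a few of the cell processors listed above, and the comma split is naive (no quoting or escaping). It assumes that an empty cell stands for a missing value.

```java
import java.util.List;
import java.util.function.Predicate;

/**
 * Hypothetical sketch, not the PR's code and not the Super CSV API:
 * a schema is one check per column, and a line is valid only if every
 * cell passes the check for its column.
 */
final class CsvSchemaSketch {

    // Passes when the cell is not empty (empty is treated as missing here).
    static Predicate<String> notNull() {
        return cell -> !cell.isEmpty();
    }

    // Passes when the cell parses as a 32-bit integer.
    static Predicate<String> parseInt() {
        return cell -> {
            try {
                Integer.parseInt(cell);
                return true;
            } catch (NumberFormatException e) {
                return false;
            }
        };
    }

    // Passes when the cell length is within [min, max].
    static Predicate<String> strMinMax(int min, int max) {
        return cell -> cell.length() >= min && cell.length() <= max;
    }

    // Wraps another check: an empty cell passes, anything else is delegated.
    static Predicate<String> optional(Predicate<String> next) {
        return cell -> cell.isEmpty() || next.test(cell);
    }

    // Naive split on commas; real CSV quoting and escaping are ignored.
    static boolean isValidLine(String line, List<Predicate<String>> schema) {
        String[] cells = line.split(",", -1); // -1 keeps trailing empty cells
        if (cells.length != schema.size()) {
            return false;
        }
        for (int i = 0; i < cells.length; i++) {
            if (!schema.get(i).test(cells[i])) {
                return false;
            }
        }
        return true;
    }

    // Mirrors the "valid" / "invalid" routing described in the issue.
    static String route(String csv, List<Predicate<String>> schema) {
        for (String line : csv.split("\\R")) {
            if (!isValidLine(line, schema)) {
                return "invalid";
            }
        }
        return "valid";
    }
}
```

With a three-column schema of `parseInt()`, `notNull()` and `optional(strMinMax(2, 5))`, the content `1,alice,abc` followed by `2,bob,` would be routed to "valid", while a line such as `x,bob,` would send the whole content to "invalid".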

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/pvillard31/nifi validate-csv

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nifi/pull/476.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #476
    
----
commit 30b8ad2e3dddbad07ce39e54db78f9426aed6001
Author: Pierre Villard <[email protected]>
Date:   2016-05-27T08:05:16Z

    NIFI-1942 Processor to validate CSV against user-supplied schema

----


> Create a processor to validate CSV against a user-supplied schema
> -----------------------------------------------------------------
>
>                 Key: NIFI-1942
>                 URL: https://issues.apache.org/jira/browse/NIFI-1942
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Extensions
>            Reporter: Pierre Villard
>            Assignee: Pierre Villard
>            Priority: Minor
>
> To extend the set of "quality control" processors, it would be useful to 
> have a processor that validates CSV-formatted flow files against a 
> user-specified schema.
> Flow files that conform to the schema would be routed to the "valid" 
> relationship, whereas flow files that do not would be routed to the 
> "invalid" relationship.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
