[ 
https://issues.apache.org/jira/browse/NIFI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15275174#comment-15275174
 ] 

ASF GitHub Bot commented on NIFI-1280:
--------------------------------------

GitHub user ToivoAdams opened a pull request:

    https://github.com/apache/nifi/pull/420

    NIFI-1280 Create FilterCSVColumns Processor.

    First version. 
    SQL select statement is used to specify how CSV data should be transformed.
    Modified Calcite CSV adapter is used for SQL execution.
    Some code is borrowed from HiveJdbcCommon, methods convertToCsvStream() 

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/ToivoAdams/nifi nifi-1280

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nifi/pull/420.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #420
    
----
commit 72cdeb684cde342d6b240d496c0553afa78a4a6f
Author: Toivo Adams <[email protected]>
Date:   2016-05-07T09:29:15Z

    NIFI-1280 Create FilterCSVColumns Processor.

----


> Create FilterCSVColumns Processor
> ---------------------------------
>
>                 Key: NIFI-1280
>                 URL: https://issues.apache.org/jira/browse/NIFI-1280
>             Project: Apache NiFi
>          Issue Type: Task
>          Components: Extensions
>            Reporter: Mark Payne
>            Assignee: Toivo Adams
>
> We should have a Processor that allows users to easily filter out specific 
> columns from CSV data. For instance, a user would configure two different 
> properties: "Columns of Interest" (a comma-separated list of column indexes) 
> and "Filtering Strategy" (Keep Only These Columns, Remove Only These Columns).
> We can do this today with ReplaceText, but it is far more difficult than it 
> would be with this Processor, as the user has to use Regular Expressions, 
> etc. with ReplaceText.
> Eventually a Custom UI could even be built that allows a user to upload a 
> Sample CSV and choose which columns from there, similar to the way that Excel 
> works when importing CSV by dragging and selecting the desired columns? That 
> would certainly be a larger undertaking and would not need to be done for an 
> initial implementation.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to