[ 
https://issues.apache.org/jira/browse/NIFI-4356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16260245#comment-16260245
 ] 

Henry Sowell commented on NIFI-4356:
------------------------------------

+1 on Andy’s proposal. Under the current logic the processor is significantly 
handicapped in scenarios where you want to continue working with the original 
HTML content. An example would be using  HTML elements in attributes for things 
like routing the original content based attributes extracted with this 
processor. 

> GetHTMLElement removes content from resulting flowfiles even when destination 
> is attribute
> ------------------------------------------------------------------------------------------
>
>                 Key: NIFI-4356
>                 URL: https://issues.apache.org/jira/browse/NIFI-4356
>             Project: Apache NiFi
>          Issue Type: Bug
>          Components: Extensions
>    Affects Versions: 1.3.0
>            Reporter: Andy LoPresto
>              Labels: html
>
> As noted in 
> [StackOverflow|https://stackoverflow.com/questions/46070288/multiple-gethtml-attributes-in-apache-nifi],
>  {{GetHTMLElement}} processors cannot be chained because the success 
> relationship clears the flowfile content even if the destination is an 
> attribute. I propose the logic be changed to the following:
> * Destination = content
>   * Replace the flowfile content for the *success* relationship
>   * Maintain the content for *original*
> * Destination = attribute
>   * Maintain the content for *success* and *original*
>   * Add a property descriptor for the attribute name
>   * Name attributes {{Attribute.1}} as in {{Extract*}} in the event that 
> multiple elements are matched



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to