Aldrin Piri created NIFI-921:
--------------------------------
Summary: Create a processor to promote character delimited data to
attributes
Key: NIFI-921
URL: https://issues.apache.org/jira/browse/NIFI-921
Project: Apache NiFi
Issue Type: Improvement
Components: Extensions
Reporter: Aldrin Piri
Priority: Minor
A processor that can analyze content and promote character delimited data to
attributes could prove quite helpful.
There are a large number of "schemas"/formats that are simply character
delimited formats. Typically these records are quite small in format but
"rich" in terms of the values that they possess. This processor would provide
an easy means to handle these simpler formats and make for an easy way to
reason about data in this class of formats.
We can approximate this by performing a regular expression within ExtractText
and capturing groups, but this is not a good fit for regexes.
The processor would handle likely be fed by a split text processor but, with
some reasonable consideration, could handle this splitting of text along rows
generating a unique flowfile for each. Exact contract would need some
consideration in terms of the content that passes through (entirety of original
file, row by itself, row with header if it exists)
Additionally, the processor could also consider if there is a header, delimited
in the same fashion as each of its constituent records.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)