[
https://issues.apache.org/jira/browse/TIKA-1357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14046140#comment-14046140
]
Tyler Palsulich commented on TIKA-1357:
---------------------------------------
See below for how to read and output line by line (copy & paste between the xml
start/end in EnviHeaderParser). I have a hunch this isn't really what we want
-- what if a metadata field has a newline in it? What if the line is too long
to fit into a string? On the other hand, with nice input, it's much nicer
output.
{code}
// text contents of the xhtml
String line;
while ((line = reader.readLine()) != null) {
xhtml.startElement("p");
xhtml.characters(line);
xhtml.endElement("p");
}
{code}
> Buffered text in EnviHeaderParser
> ---------------------------------
>
> Key: TIKA-1357
> URL: https://issues.apache.org/jira/browse/TIKA-1357
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.6
> Reporter: Ann Burgess
> Priority: Minor
> Labels: parser
>
> User BufferedReader to insert line by line p tags when parsing ENVI headers
> per reviewer comment: https://reviews.apache.org/r/22892/#comment81964
--
This message was sent by Atlassian JIRA
(v6.2#6252)