[ 
https://issues.apache.org/jira/browse/TIKA-1357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14046140#comment-14046140
 ] 

Tyler Palsulich commented on TIKA-1357:
---------------------------------------

See below for how to read and output line by line (copy & paste between the xml 
start/end in EnviHeaderParser). I have a hunch this isn't really what we want 
-- what if a metadata field has a newline in it? What if the line is too long 
to fit into a string? On the other hand, with nice input, it's much nicer 
output.

{code}
            // text contents of the xhtml
            String line;
            while ((line = reader.readLine()) != null) {
                xhtml.startElement("p");
                xhtml.characters(line);
                xhtml.endElement("p");
            }
{code}

> Buffered text in EnviHeaderParser
> ---------------------------------
>
>                 Key: TIKA-1357
>                 URL: https://issues.apache.org/jira/browse/TIKA-1357
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.6
>            Reporter: Ann Burgess
>            Priority: Minor
>              Labels: parser
>
> User BufferedReader to insert line by line p tags when parsing ENVI headers 
> per reviewer comment: https://reviews.apache.org/r/22892/#comment81964



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to