sebastian-nagel opened a new pull request #529: URL: https://github.com/apache/nutch/pull/529
- switch to multi-line presentation of Metadata in ParseData::toString - default implementation of Metadata::toString is still single-line - replace StringBuffer by StringBuilder in modified methods Parsechecker will now show metadata as follows: ``` $> bin/nutch parsechecker -Dplugin.includes='parse-(tika|metatags)|protocol-okhttp' http://localhost/ fetching: http://localhost/ ... Title: Apache2 Ubuntu Default Page: It works Outlinks: 2 outlink: toUrl: http://localhost/icons/ubuntu-logo.png anchor: Ubuntu Logo outlink: toUrl: http://localhost/manual anchor: manual Content Metadata: Accept-Ranges = bytes Keep-Alive = timeout=5, max=100 nutch.fetch.time = 1591696071739 Server = Apache/2.4.41 (Ubuntu) ETag = "2aa6-59647cb960db3-gzip" Connection = Keep-Alive Vary = Accept-Encoding Last-Modified = Fri, 01 Nov 2019 12:06:26 GMT Date = Tue, 09 Jun 2020 09:47:51 GMT Content-Type = text/html Parse Metadata: dc:title = Apache2 Ubuntu Default Page: It works Content-Encoding = UTF-8 Content-Type-Hint = text/html; charset=UTF-8 Content-Type = application/xhtml+xml; charset=UTF-8 ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]

