Modified: tika/site/publish/1.9/examples.html
URL: 
http://svn.apache.org/viewvc/tika/site/publish/1.9/examples.html?rev=1841351&r1=1841350&r2=1841351&view=diff
==============================================================================
--- tika/site/publish/1.9/examples.html (original)
+++ tika/site/publish/1.9/examples.html Wed Sep 19 15:33:39 2018
@@ -116,23 +116,23 @@
 <p>The <a href="./api/org/apache/tika/Tika.html">Tika facade</a>, provides a 
number of very quick and easy ways to have your content parsed by Tika, and 
return the resulting plain text</p><style type="text/css">
    @import url('attached-includes/css/shCoreDefault.css');
 </style>
-<div id="highlighter_428116" class="syntaxhighlighter nogutter  java"><table 
border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div 
class="container"><div class="line number54 index0 alt1"><code class="java 
keyword">public</code> <code class="java plain">String parseToStringExample() 
</code><code class="java keyword">throws</code> <code class="java 
plain">IOException, SAXException, TikaException {</code></div><div class="line 
number55 index1 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Tika tika = 
</code><code class="java keyword">new</code> <code class="java 
plain">Tika();</code></div><div class="line number56 index2 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
keyword">try</code> <code class="java plain">(InputStream stream = 
ParsingExample.</code><code class="java keyword">class</code><code class="java 
plain">.getResourceAsStream(</code><code class="java string">"test.doc"</code><c
 ode class="java plain">)) {</code></div><div class="line number57 index3 
alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">tika.parseToString(stream);</code></div><div class="line number58 index4 
alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number59 index5 alt2"><code 
class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
+<div id="highlighter_38152" class="syntaxhighlighter nogutter  java"><table 
border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div 
class="container"><div class="line number54 index0 alt1"><code class="java 
keyword">public</code> <code class="java plain">String parseToStringExample() 
</code><code class="java keyword">throws</code> <code class="java 
plain">IOException, SAXException, TikaException {</code></div><div class="line 
number55 index1 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Tika tika = 
</code><code class="java keyword">new</code> <code class="java 
plain">Tika();</code></div><div class="line number56 index2 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
keyword">try</code> <code class="java plain">(InputStream stream = 
ParsingExample.</code><code class="java keyword">class</code><code class="java 
plain">.getResourceAsStream(</code><code class="java 
string">"test.doc"</code><co
 de class="java plain">)) {</code></div><div class="line number57 index3 
alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">tika.parseToString(stream);</code></div><div class="line number58 index4 
alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number59 index5 alt2"><code 
class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
 <div class="section">
 <h4><a name="Parsing_using_the_Auto-Detect_Parser">Parsing using the 
Auto-Detect Parser</a></h4>
-<p>For more control, you can call the <a 
href="./api/org/apache/tika/parser/Parser.html">Tika Parsers</a> directly. Most 
likely, you'll want to start out using the <a 
href="./api/org/apache/tika/parser/AutoDetectParser.html">Auto-Detect 
Parser</a>, which automatically figures out what kind of content you have, then 
calls the appropriate parser for you.</p><div id="highlighter_333246" 
class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" 
cellspacing="0"><tbody><tr><td class="code"><div class="container"><div 
class="line number85 index0 alt2"><code class="java keyword">public</code> 
<code class="java plain">String parseExample() </code><code class="java 
keyword">throws</code> <code class="java plain">IOException, SAXException, 
TikaException {</code></div><div class="line number86 index1 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">AutoDetectParser parser = </code><code class="java keyword">new</code> 
<code class="java pla
 in">AutoDetectParser();</code></div><div class="line number87 index2 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">BodyContentHandler handler = </code><code class="java 
keyword">new</code> <code class="java 
plain">BodyContentHandler();</code></div><div class="line number88 index3 
alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">Metadata metadata = </code><code class="java 
keyword">new</code> <code class="java plain">Metadata();</code></div><div 
class="line number89 index4 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> 
<code class="java plain">(InputStream stream = ParsingExample.</code><code 
class="java keyword">class</code><code class="java 
plain">.getResourceAsStream(</code><code class="java 
string">"test.doc"</code><code class="java plain">)) {</code></div><div 
class="line number90 index5 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nb
 sp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, 
handler, metadata);</code></div><div class="line number91 index6 alt2"><code 
class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">handler.toString();</code></div><div class="line number92 index7 
alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number93 index8 alt2"><code 
class="java 
plain">}</code></div></div></td></tr></tbody></table></div></div></div>
+<p>For more control, you can call the <a 
href="./api/org/apache/tika/parser/Parser.html">Tika Parsers</a> directly. Most 
likely, you'll want to start out using the <a 
href="./api/org/apache/tika/parser/AutoDetectParser.html">Auto-Detect 
Parser</a>, which automatically figures out what kind of content you have, then 
calls the appropriate parser for you.</p><div id="highlighter_790262" 
class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" 
cellspacing="0"><tbody><tr><td class="code"><div class="container"><div 
class="line number85 index0 alt2"><code class="java keyword">public</code> 
<code class="java plain">String parseExample() </code><code class="java 
keyword">throws</code> <code class="java plain">IOException, SAXException, 
TikaException {</code></div><div class="line number86 index1 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">AutoDetectParser parser = </code><code class="java keyword">new</code> 
<code class="java pla
 in">AutoDetectParser();</code></div><div class="line number87 index2 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">BodyContentHandler handler = </code><code class="java 
keyword">new</code> <code class="java 
plain">BodyContentHandler();</code></div><div class="line number88 index3 
alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">Metadata metadata = </code><code class="java 
keyword">new</code> <code class="java plain">Metadata();</code></div><div 
class="line number89 index4 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> 
<code class="java plain">(InputStream stream = ParsingExample.</code><code 
class="java keyword">class</code><code class="java 
plain">.getResourceAsStream(</code><code class="java 
string">"test.doc"</code><code class="java plain">)) {</code></div><div 
class="line number90 index5 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nb
 sp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">parser.parse(stream, 
handler, metadata);</code></div><div class="line number91 index6 alt2"><code 
class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">handler.toString();</code></div><div class="line number92 index7 
alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number93 index8 alt2"><code 
class="java 
plain">}</code></div></div></td></tr></tbody></table></div></div></div>
 <div class="section">
 <h3><a name="Picking_different_output_formats">Picking different output 
formats</a></h3>
 <p>With Tika, you can get the textual content of your files returned in a 
number of different formats. These can be plain text, html, xhtml, xhtml of one 
part of the file etc. This is controlled based on the <a class="externalLink" 
href="http://docs.oracle.com/javase/7/docs/api/org/xml/sax/ContentHandler.html";>ContentHandler</a>
 you supply to the Parser.</p>
 <div class="section">
 <h4><a name="Parsing_to_Plain_Text">Parsing to Plain Text</a></h4>
-<p>By using the <a 
href="./api/org/apache/tika/sax/BodyContentHandler.html">BodyContentHandler</a>,
 you can request that Tika return only the content of the document's body as a 
plain-text string.</p><div id="highlighter_791483" class="syntaxhighlighter 
nogutter  java"><table border="0" cellpadding="0" 
cellspacing="0"><tbody><tr><td class="code"><div class="container"><div 
class="line number47 index0 alt2"><code class="java keyword">public</code> 
<code class="java plain">String parseToPlainText() </code><code class="java 
keyword">throws</code> <code class="java plain">IOException, SAXException, 
TikaException {</code></div><div class="line number48 index1 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">BodyContentHandler handler = </code><code class="java 
keyword">new</code> <code class="java 
plain">BodyContentHandler();</code></div><div class="line number49 index2 
alt2">&nbsp;</div><div class="line number50 index3 alt1"><code class="java space
 s">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser 
parser = </code><code class="java keyword">new</code> <code class="java 
plain">AutoDetectParser();</code></div><div class="line number51 index4 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">Metadata metadata = </code><code class="java 
keyword">new</code> <code class="java plain">Metadata();</code></div><div 
class="line number52 index5 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> 
<code class="java plain">(InputStream stream = 
ContentHandlerExample.</code><code class="java keyword">class</code><code 
class="java plain">.getResourceAsStream(</code><code class="java 
string">"test.doc"</code><code class="java plain">)) {</code></div><div 
class="line number53 index6 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">parser.parse(stream, handler, metadata);</c
 ode></div><div class="line number54 index7 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">handler.toString();</code></div><div class="line number55 index8 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number56 index9 alt1"><code 
class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
+<p>By using the <a 
href="./api/org/apache/tika/sax/BodyContentHandler.html">BodyContentHandler</a>,
 you can request that Tika return only the content of the document's body as a 
plain-text string.</p><div id="highlighter_431922" class="syntaxhighlighter 
nogutter  java"><table border="0" cellpadding="0" 
cellspacing="0"><tbody><tr><td class="code"><div class="container"><div 
class="line number47 index0 alt2"><code class="java keyword">public</code> 
<code class="java plain">String parseToPlainText() </code><code class="java 
keyword">throws</code> <code class="java plain">IOException, SAXException, 
TikaException {</code></div><div class="line number48 index1 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">BodyContentHandler handler = </code><code class="java 
keyword">new</code> <code class="java 
plain">BodyContentHandler();</code></div><div class="line number49 index2 
alt2">&nbsp;</div><div class="line number50 index3 alt1"><code class="java space
 s">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">AutoDetectParser 
parser = </code><code class="java keyword">new</code> <code class="java 
plain">AutoDetectParser();</code></div><div class="line number51 index4 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">Metadata metadata = </code><code class="java 
keyword">new</code> <code class="java plain">Metadata();</code></div><div 
class="line number52 index5 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> 
<code class="java plain">(InputStream stream = 
ContentHandlerExample.</code><code class="java keyword">class</code><code 
class="java plain">.getResourceAsStream(</code><code class="java 
string">"test.doc"</code><code class="java plain">)) {</code></div><div 
class="line number53 index6 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">parser.parse(stream, handler, metadata);</c
 ode></div><div class="line number54 index7 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">handler.toString();</code></div><div class="line number55 index8 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number56 index9 alt1"><code 
class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
 <div class="section">
 <h4><a name="Parsing_to_XHTML">Parsing to XHTML</a></h4>
-<p>By using the <a 
href="./api/org/apache/tika/sax/ToXMLContentHandler.html">ToXMLContentHandler</a>,
 you can get the XHTML content of the whole document as a string.</p><div 
id="highlighter_476293" class="syntaxhighlighter nogutter  java"><table 
border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div 
class="container"><div class="line number61 index0 alt2"><code class="java 
keyword">public</code> <code class="java plain">String parseToHTML() 
</code><code class="java keyword">throws</code> <code class="java 
plain">IOException, SAXException, TikaException {</code></div><div class="line 
number62 index1 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandler 
handler = </code><code class="java keyword">new</code> <code class="java 
plain">ToXMLContentHandler();</code></div><div class="line number63 index2 
alt2">&nbsp;</div><div class="line number64 index3 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><cod
 e class="java plain">AutoDetectParser parser = </code><code class="java 
keyword">new</code> <code class="java 
plain">AutoDetectParser();</code></div><div class="line number65 index4 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">Metadata metadata = </code><code class="java 
keyword">new</code> <code class="java plain">Metadata();</code></div><div 
class="line number66 index5 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> 
<code class="java plain">(InputStream stream = 
ContentHandlerExample.</code><code class="java keyword">class</code><code 
class="java plain">.getResourceAsStream(</code><code class="java 
string">"test.doc"</code><code class="java plain">)) {</code></div><div 
class="line number67 index6 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">parser.parse(stream, handler, metadata);</code></div><div 
class="line number68 in
 dex7 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">handler.toString();</code></div><div class="line number69 index8 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number70 index9 alt1"><code 
class="java plain">}</code></div></div></td></tr></tbody></table></div>
-<p>If you just want the body of the xhtml document, without the header, you 
can chain together a <a 
href="./api/org/apache/tika/sax/BodyContentHandler.html">BodyContentHandler</a> 
and a <a 
href="./api/org/apache/tika/sax/ToXMLContentHandler.html">ToXMLContentHandler</a>
 as shown:</p><div id="highlighter_754746" class="syntaxhighlighter nogutter  
java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td 
class="code"><div class="container"><div class="line number76 index0 
alt1"><code class="java keyword">public</code> <code class="java plain">String 
parseBodyToHTML() </code><code class="java keyword">throws</code> <code 
class="java plain">IOException, SAXException, TikaException {</code></div><div 
class="line number77 index1 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandler 
handler = </code><code class="java keyword">new</code> <code class="java 
plain">BodyContentHandler(</code></div><div class="line number78 index2 alt
 1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java keyword">new</code> <code class="java 
plain">ToXMLContentHandler());</code></div><div class="line number79 index3 
alt2">&nbsp;</div><div class="line number80 index4 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">AutoDetectParser parser = </code><code class="java keyword">new</code> 
<code class="java plain">AutoDetectParser();</code></div><div class="line 
number81 index5 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata 
metadata = </code><code class="java keyword">new</code> <code class="java 
plain">Metadata();</code></div><div class="line number82 index6 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
keyword">try</code> <code class="java plain">(InputStream stream = 
ContentHandlerExample.</code><code class="java keyword">class</code><code 
 class="java plain">.getResourceAsStream(</code><code class="java 
string">"test.doc"</code><code class="java plain">)) {</code></div><div 
class="line number83 index7 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">parser.parse(stream, handler, metadata);</code></div><div 
class="line number84 index8 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">handler.toString();</code></div><div class="line number85 index9 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number86 index10 alt1"><code 
class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
+<p>By using the <a 
href="./api/org/apache/tika/sax/ToXMLContentHandler.html">ToXMLContentHandler</a>,
 you can get the XHTML content of the whole document as a string.</p><div 
id="highlighter_787347" class="syntaxhighlighter nogutter  java"><table 
border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div 
class="container"><div class="line number61 index0 alt2"><code class="java 
keyword">public</code> <code class="java plain">String parseToHTML() 
</code><code class="java keyword">throws</code> <code class="java 
plain">IOException, SAXException, TikaException {</code></div><div class="line 
number62 index1 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandler 
handler = </code><code class="java keyword">new</code> <code class="java 
plain">ToXMLContentHandler();</code></div><div class="line number63 index2 
alt2">&nbsp;</div><div class="line number64 index3 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><cod
 e class="java plain">AutoDetectParser parser = </code><code class="java 
keyword">new</code> <code class="java 
plain">AutoDetectParser();</code></div><div class="line number65 index4 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">Metadata metadata = </code><code class="java 
keyword">new</code> <code class="java plain">Metadata();</code></div><div 
class="line number66 index5 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> 
<code class="java plain">(InputStream stream = 
ContentHandlerExample.</code><code class="java keyword">class</code><code 
class="java plain">.getResourceAsStream(</code><code class="java 
string">"test.doc"</code><code class="java plain">)) {</code></div><div 
class="line number67 index6 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">parser.parse(stream, handler, metadata);</code></div><div 
class="line number68 in
 dex7 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">handler.toString();</code></div><div class="line number69 index8 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number70 index9 alt1"><code 
class="java plain">}</code></div></div></td></tr></tbody></table></div>
+<p>If you just want the body of the xhtml document, without the header, you 
can chain together a <a 
href="./api/org/apache/tika/sax/BodyContentHandler.html">BodyContentHandler</a> 
and a <a 
href="./api/org/apache/tika/sax/ToXMLContentHandler.html">ToXMLContentHandler</a>
 as shown:</p><div id="highlighter_34408" class="syntaxhighlighter nogutter  
java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td 
class="code"><div class="container"><div class="line number76 index0 
alt1"><code class="java keyword">public</code> <code class="java plain">String 
parseBodyToHTML() </code><code class="java keyword">throws</code> <code 
class="java plain">IOException, SAXException, TikaException {</code></div><div 
class="line number77 index1 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">ContentHandler 
handler = </code><code class="java keyword">new</code> <code class="java 
plain">BodyContentHandler(</code></div><div class="line number78 index2 alt1
 "><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java keyword">new</code> <code class="java 
plain">ToXMLContentHandler());</code></div><div class="line number79 index3 
alt2">&nbsp;</div><div class="line number80 index4 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">AutoDetectParser parser = </code><code class="java keyword">new</code> 
<code class="java plain">AutoDetectParser();</code></div><div class="line 
number81 index5 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata 
metadata = </code><code class="java keyword">new</code> <code class="java 
plain">Metadata();</code></div><div class="line number82 index6 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
keyword">try</code> <code class="java plain">(InputStream stream = 
ContentHandlerExample.</code><code class="java keyword">class</code><code c
 lass="java plain">.getResourceAsStream(</code><code class="java 
string">"test.doc"</code><code class="java plain">)) {</code></div><div 
class="line number83 index7 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">parser.parse(stream, handler, metadata);</code></div><div 
class="line number84 index8 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">handler.toString();</code></div><div class="line number85 index9 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number86 index10 alt1"><code 
class="java plain">}</code></div></div></td></tr></tbody></table></div></div>
 <div class="section">
 <h4><a name="Fetching_just_certain_bits_of_the_XHTML">Fetching just certain 
bits of the XHTML</a></h4>
-<p>It possible to execute XPath queries on the parse results, to fetch only 
certain bits of the XHTML. </p><div id="highlighter_992485" 
class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" 
cellspacing="0"><tbody><tr><td class="code"><div class="container"><div 
class="line number92 index0 alt1"><code class="java keyword">public</code> 
<code class="java plain">String parseOnePartToHTML() </code><code class="java 
keyword">throws</code> <code class="java plain">IOException, SAXException, 
TikaException {</code></div><div class="line number93 index1 alt2"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
comments">// Only get things under html -> body -> div 
(class=header)</code></div><div class="line number94 index2 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">XPathParser xhtmlParser = </code><code class="java keyword">new</code> 
<code class="java plain">XPathParser(</code><code class="java string">
 "xhtml"</code><code class="java plain">, 
XHTMLContentHandler.XHTML);</code></div><div class="line number95 index3 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">Matcher divContentMatcher = xhtmlParser.parse(</code><code 
class="java 
string">"/xhtml:html/xhtml:body/xhtml:div/descendant::node()"</code><code 
class="java plain">);</code></div><div class="line number96 index4 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">ContentHandler handler = </code><code class="java keyword">new</code> 
<code class="java plain">MatchingContentHandler(</code></div><div class="line 
number97 index5 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java keyword">new</code> <code class="java 
plain">ToXMLContentHandler(), divContentMatcher);</code></div><div class="line 
number98 index6 alt1">&nbsp;</div><div class="line number99 index7 alt2"><code 
class=
 "java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">AutoDetectParser parser = </code><code class="java keyword">new</code> 
<code class="java plain">AutoDetectParser();</code></div><div class="line 
number100 index8 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata 
metadata = </code><code class="java keyword">new</code> <code class="java 
plain">Metadata();</code></div><div class="line number101 index9 alt2"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
keyword">try</code> <code class="java plain">(InputStream stream = 
ContentHandlerExample.</code><code class="java keyword">class</code><code 
class="java plain">.getResourceAsStream(</code><code class="java 
string">"test2.doc"</code><code class="java plain">)) {</code></div><div 
class="line number102 index10 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">parser.parse(stream, handle
 r, metadata);</code></div><div class="line number103 index11 alt2"><code 
class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">handler.toString();</code></div><div class="line number104 index12 
alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number105 index13 alt2"><code 
class="java 
plain">}</code></div></div></td></tr></tbody></table></div></div></div>
+<p>It possible to execute XPath queries on the parse results, to fetch only 
certain bits of the XHTML. </p><div id="highlighter_633188" 
class="syntaxhighlighter nogutter  java"><table border="0" cellpadding="0" 
cellspacing="0"><tbody><tr><td class="code"><div class="container"><div 
class="line number92 index0 alt1"><code class="java keyword">public</code> 
<code class="java plain">String parseOnePartToHTML() </code><code class="java 
keyword">throws</code> <code class="java plain">IOException, SAXException, 
TikaException {</code></div><div class="line number93 index1 alt2"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
comments">// Only get things under html -> body -> div 
(class=header)</code></div><div class="line number94 index2 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">XPathParser xhtmlParser = </code><code class="java keyword">new</code> 
<code class="java plain">XPathParser(</code><code class="java string">
 "xhtml"</code><code class="java plain">, 
XHTMLContentHandler.XHTML);</code></div><div class="line number95 index3 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">Matcher divContentMatcher = xhtmlParser.parse(</code><code 
class="java 
string">"/xhtml:html/xhtml:body/xhtml:div/descendant::node()"</code><code 
class="java plain">);</code></div><div class="line number96 index4 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">ContentHandler handler = </code><code class="java keyword">new</code> 
<code class="java plain">MatchingContentHandler(</code></div><div class="line 
number97 index5 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java keyword">new</code> <code class="java 
plain">ToXMLContentHandler(), divContentMatcher);</code></div><div class="line 
number98 index6 alt1">&nbsp;</div><div class="line number99 index7 alt2"><code 
class=
 "java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">AutoDetectParser parser = </code><code class="java keyword">new</code> 
<code class="java plain">AutoDetectParser();</code></div><div class="line 
number100 index8 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata 
metadata = </code><code class="java keyword">new</code> <code class="java 
plain">Metadata();</code></div><div class="line number101 index9 alt2"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
keyword">try</code> <code class="java plain">(InputStream stream = 
ContentHandlerExample.</code><code class="java keyword">class</code><code 
class="java plain">.getResourceAsStream(</code><code class="java 
string">"test2.doc"</code><code class="java plain">)) {</code></div><div 
class="line number102 index10 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">parser.parse(stream, handle
 r, metadata);</code></div><div class="line number103 index11 alt2"><code 
class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">handler.toString();</code></div><div class="line number104 index12 
alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number105 index13 alt2"><code 
class="java 
plain">}</code></div></div></td></tr></tbody></table></div></div></div>
 <div class="section">
 <h3><a name="Custom_Content_Handlers">Custom Content Handlers</a></h3>
 <p>The textual output of parsing a file with Tika is returned via the SAX <a 
class="externalLink" 
href="http://docs.oracle.com/javase/7/docs/api/org/xml/sax/ContentHandler.html";>ContentHandler</a>
 you pass to the parse method. It is possible to customise your parsing by 
supplying your own ContentHandler which does special things.</p>
@@ -141,16 +141,16 @@
 <p>By using the <a 
href="./api/org/apache/tika/sax/PhoneExtractingContentHandler.html">PhoneExtractingContentHandler</a>,
 you can have any phone numbers found in the textual content of the document 
extracted and placed into the Metadata object for you.</p></div>
 <div class="section">
 <h4><a name="Streaming_the_plain_text_in_chunks">Streaming the plain text in 
chunks</a></h4>
-<p>Sometimes, you want to chunk the resulting text up, perhaps to output as 
you go minimising memory use, perhaps to output to HDFS files, or any other 
reason! With a small custom content handler, you can do that.</p><div 
id="highlighter_132420" class="syntaxhighlighter nogutter  java"><table 
border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div 
class="container"><div class="line number113 index0 alt2"><code class="java 
keyword">public</code> <code class="java plain">List&lt;String> 
parseToPlainTextChunks() </code><code class="java keyword">throws</code> <code 
class="java plain">IOException, SAXException, TikaException {</code></div><div 
class="line number114 index1 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">final</code> 
<code class="java plain">List&lt;String> chunks = </code><code class="java 
keyword">new</code> <code class="java plain">ArrayList&lt;>();</code></div><div 
class="line number115 index2 alt2"><code c
 lass="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">chunks.add(</code><code class="java string">""</code><code class="java 
plain">);</code></div><div class="line number116 index3 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">ContentHandlerDecorator handler = </code><code class="java 
keyword">new</code> <code class="java plain">ContentHandlerDecorator() 
{</code></div><div class="line number117 index4 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java color1">@Override</code></div><div class="line number118 index5 
alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">public</code> <code class="java keyword">void</code> <code 
class="java plain">characters(</code><code class="java 
keyword">char</code><code class="java plain">[] ch, </code><code class="java 
keyword">int</code> <code class="java plain">start, </c
 ode><code class="java keyword">int</code> <code class="java plain">length) 
{</code></div><div class="line number119 index6 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">String lastChunk = chunks.get(chunks.size() - </code><code 
class="java value">1</code><code class="java plain">);</code></div><div 
class="line number120 index7 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">String thisStr = </code><code class="java 
keyword">new</code> <code class="java plain">String(ch, start, 
length);</code></div><div class="line number121 index8 alt2">&nbsp;</div><div 
class="line number122 index9 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java keyword">if</code> <code class="java plain">(lastChunk.length() + 
length > MAXIMUM_TEXT_CHUNK_SIZE) {
 </code></div><div class="line number123 index10 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">chunks.add(thisStr);</code></div><div class="line number124 
index11 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">} </code><code class="java keyword">else</code> <code 
class="java plain">{</code></div><div class="line number125 index12 alt2"><code 
class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">chunks.set(chunks.size() - </code><code class="java 
value">1</code><code class="java plain">, lastChunk + 
thisStr);</code></div><div class="line number126 index13 alt1"><code 
class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">}</c
 ode></div><div class="line number127 index14 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number128 index15 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">};</code></div><div class="line number129 index16 alt2">&nbsp;</div><div 
class="line number130 index17 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">AutoDetectParser parser = </code><code class="java keyword">new</code> 
<code class="java plain">AutoDetectParser();</code></div><div class="line 
number131 index18 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata 
metadata = </code><code class="java keyword">new</code> <code class="java 
plain">Metadata();</code></div><div class="line number132 index19 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
keyword">try</code> <code class
 ="java plain">(InputStream stream = ContentHandlerExample.</code><code 
class="java keyword">class</code><code class="java 
plain">.getResourceAsStream(</code><code class="java 
string">"test2.doc"</code><code class="java plain">)) {</code></div><div 
class="line number133 index20 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">parser.parse(stream, handler, metadata);</code></div><div 
class="line number134 index21 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">chunks;</code></div><div class="line number135 index22 alt2"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">}</code></div><div class="line number136 index23 alt1"><code class="java 
plain">}</code></div></div></td></tr></tbody></table></div></div></div>
+<p>Sometimes, you want to chunk the resulting text up, perhaps to output as 
you go minimising memory use, perhaps to output to HDFS files, or any other 
reason! With a small custom content handler, you can do that.</p><div 
id="highlighter_469094" class="syntaxhighlighter nogutter  java"><table 
border="0" cellpadding="0" cellspacing="0"><tbody><tr><td class="code"><div 
class="container"><div class="line number113 index0 alt2"><code class="java 
keyword">public</code> <code class="java plain">List&lt;String> 
parseToPlainTextChunks() </code><code class="java keyword">throws</code> <code 
class="java plain">IOException, SAXException, TikaException {</code></div><div 
class="line number114 index1 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">final</code> 
<code class="java plain">List&lt;String> chunks = </code><code class="java 
keyword">new</code> <code class="java plain">ArrayList&lt;>();</code></div><div 
class="line number115 index2 alt2"><code c
 lass="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">chunks.add(</code><code class="java string">""</code><code class="java 
plain">);</code></div><div class="line number116 index3 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">ContentHandlerDecorator handler = </code><code class="java 
keyword">new</code> <code class="java plain">ContentHandlerDecorator() 
{</code></div><div class="line number117 index4 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java color1">@Override</code></div><div class="line number118 index5 
alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">public</code> <code class="java keyword">void</code> <code 
class="java plain">characters(</code><code class="java 
keyword">char</code><code class="java plain">[] ch, </code><code class="java 
keyword">int</code> <code class="java plain">start, </c
 ode><code class="java keyword">int</code> <code class="java plain">length) 
{</code></div><div class="line number119 index6 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">String lastChunk = chunks.get(chunks.size() - </code><code 
class="java value">1</code><code class="java plain">);</code></div><div 
class="line number120 index7 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">String thisStr = </code><code class="java 
keyword">new</code> <code class="java plain">String(ch, start, 
length);</code></div><div class="line number121 index8 alt2">&nbsp;</div><div 
class="line number122 index9 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java keyword">if</code> <code class="java plain">(lastChunk.length() + 
length > MAXIMUM_TEXT_CHUNK_SIZE) {
 </code></div><div class="line number123 index10 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">chunks.add(thisStr);</code></div><div class="line number124 
index11 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">} </code><code class="java keyword">else</code> <code 
class="java plain">{</code></div><div class="line number125 index12 alt2"><code 
class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">chunks.set(chunks.size() - </code><code class="java 
value">1</code><code class="java plain">, lastChunk + 
thisStr);</code></div><div class="line number126 index13 alt1"><code 
class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code
 class="java plain">}</c
 ode></div><div class="line number127 index14 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">}</code></div><div class="line number128 index15 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">};</code></div><div class="line number129 index16 alt2">&nbsp;</div><div 
class="line number130 index17 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">AutoDetectParser parser = </code><code class="java keyword">new</code> 
<code class="java plain">AutoDetectParser();</code></div><div class="line 
number131 index18 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">Metadata 
metadata = </code><code class="java keyword">new</code> <code class="java 
plain">Metadata();</code></div><div class="line number132 index19 alt1"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
keyword">try</code> <code class
 ="java plain">(InputStream stream = ContentHandlerExample.</code><code 
class="java keyword">class</code><code class="java 
plain">.getResourceAsStream(</code><code class="java 
string">"test2.doc"</code><code class="java plain">)) {</code></div><div 
class="line number133 index20 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">parser.parse(stream, handler, metadata);</code></div><div 
class="line number134 index21 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">chunks;</code></div><div class="line number135 index22 alt2"><code 
class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">}</code></div><div class="line number136 index23 alt1"><code class="java 
plain">}</code></div></div></td></tr></tbody></table></div></div></div>
 <div class="section">
 <h3><a name="Translation">Translation</a></h3>
 <p>Tika provides a pluggable Translation system, which allow you to send the 
results of parsing off to an external system or program to have the text 
translated into another language.</p>
 <div class="section">
 <h4><a name="Translation_using_the_Microsoft_Translation_API">Translation 
using the Microsoft Translation API</a></h4>
-<p>In order to use the Microsoft Translation API, you need to sign up for a 
Microsoft account, get an API key, then pass the key to Tika before 
translating.</p><div id="highlighter_332291" class="syntaxhighlighter nogutter  
java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td 
class="code"><div class="container"><div class="line number23 index0 
alt2"><code class="java keyword">public</code> <code class="java plain">String 
microsoftTranslateToFrench(String text) {</code></div><div class="line number24 
index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">MicrosoftTranslator translator = </code><code class="java 
keyword">new</code> <code class="java 
plain">MicrosoftTranslator();</code></div><div class="line number25 index2 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java comments">// Change the id and secret! See <a 
href="http://msdn.microsoft.com/en-us/library/hh454950.aspx.";>http://msdn.microso
 ft.com/en-us/library/hh454950.aspx.</a></code></div><div class="line number26 
index3 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">translator.setId(</code><code class="java 
string">"dummy-id"</code><code class="java plain">);</code></div><div 
class="line number27 index4 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">translator.setSecret(</code><code class="java 
string">"dummy-secret"</code><code class="java plain">);</code></div><div 
class="line number28 index5 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> 
<code class="java plain">{</code></div><div class="line number29 index6 
alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">translator.translate(text, </code><code class="java 
string">"fr"</code><code class="java plain">);</code></div><div class=
 "line number30 index7 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">} </code><code 
class="java keyword">catch</code> <code class="java plain">(Exception e) 
{</code></div><div class="line number31 index8 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java string">"Error while 
translating."</code><code class="java plain">;</code></div><div class="line 
number32 index9 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">}</code></div><div class="line number33 index10 alt2"><code class="java 
plain">}</code></div></div></td></tr></tbody></table></div></div></div>
+<p>In order to use the Microsoft Translation API, you need to sign up for a 
Microsoft account, get an API key, then pass the key to Tika before 
translating.</p><div id="highlighter_943197" class="syntaxhighlighter nogutter  
java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td 
class="code"><div class="container"><div class="line number23 index0 
alt2"><code class="java keyword">public</code> <code class="java plain">String 
microsoftTranslateToFrench(String text) {</code></div><div class="line number24 
index1 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">MicrosoftTranslator translator = </code><code class="java 
keyword">new</code> <code class="java 
plain">MicrosoftTranslator();</code></div><div class="line number25 index2 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java comments">// Change the id and secret! See <a 
href="http://msdn.microsoft.com/en-us/library/hh454950.aspx.";>http://msdn.microso
 ft.com/en-us/library/hh454950.aspx.</a></code></div><div class="line number26 
index3 alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">translator.setId(</code><code class="java 
string">"dummy-id"</code><code class="java plain">);</code></div><div 
class="line number27 index4 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">translator.setSecret(</code><code class="java 
string">"dummy-secret"</code><code class="java plain">);</code></div><div 
class="line number28 index5 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java keyword">try</code> 
<code class="java plain">{</code></div><div class="line number29 index6 
alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">translator.translate(text, </code><code class="java 
string">"fr"</code><code class="java plain">);</code></div><div class=
 "line number30 index7 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java plain">} </code><code 
class="java keyword">catch</code> <code class="java plain">(Exception e) 
{</code></div><div class="line number31 index8 alt2"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java string">"Error while 
translating."</code><code class="java plain">;</code></div><div class="line 
number32 index9 alt1"><code class="java 
spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code class="java 
plain">}</code></div><div class="line number33 index10 alt2"><code class="java 
plain">}</code></div></div></td></tr></tbody></table></div></div></div>
 <div class="section">
 <h3><a name="Language_Identification">Language Identification</a></h3>
-<p>Tika provides support for identifying the language of text, through the <a 
href="./api/org/apache/tika/language/LanguageIdentifier.html">LanguageIdentifier</a>
 class.</p><div id="highlighter_92105" class="syntaxhighlighter nogutter  
java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td 
class="code"><div class="container"><div class="line number23 index0 
alt2"><code class="java keyword">public</code> <code class="java plain">String 
identifyLanguage(String text) {</code></div><div class="line number24 index1 
alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">LanguageIdentifier identifier = </code><code class="java 
keyword">new</code> <code class="java 
plain">LanguageIdentifier(text);</code></div><div class="line number25 index2 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">identifier.getLanguage();</code></div><div class="line number26 index3 
alt1
 "><code class="java 
plain">}</code></div></div></td></tr></tbody></table></div></div>
+<p>Tika provides support for identifying the language of text, through the <a 
href="./api/org/apache/tika/language/LanguageIdentifier.html">LanguageIdentifier</a>
 class.</p><div id="highlighter_643397" class="syntaxhighlighter nogutter  
java"><table border="0" cellpadding="0" cellspacing="0"><tbody><tr><td 
class="code"><div class="container"><div class="line number23 index0 
alt2"><code class="java keyword">public</code> <code class="java plain">String 
identifyLanguage(String text) {</code></div><div class="line number24 index1 
alt1"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java plain">LanguageIdentifier identifier = </code><code class="java 
keyword">new</code> <code class="java 
plain">LanguageIdentifier(text);</code></div><div class="line number25 index2 
alt2"><code class="java spaces">&nbsp;&nbsp;&nbsp;&nbsp;</code><code 
class="java keyword">return</code> <code class="java 
plain">identifier.getLanguage();</code></div><div class="line number26 index3 
alt
 1"><code class="java 
plain">}</code></div></div></td></tr></tbody></table></div></div>
 <div class="section">
 <h3><a name="Additional_Examples">Additional Examples</a></h3>
 <p>A number of other examples are also available, including all of the 
examples from the <a class="externalLink" 
href="http://manning.com/mattmann/";>Tika In Action book</a>. These can all be 
found in the <a class="externalLink" 
href="https://svn.apache.org/repos/asf/tika/trunk/tika-example";>Tika Example 
module</a> in SVN.</p></div></div>

Modified: tika/site/publish/download.html
URL: 
http://svn.apache.org/viewvc/tika/site/publish/download.html?rev=1841351&r1=1841350&r2=1841351&view=diff
==============================================================================
--- tika/site/publish/download.html (original)
+++ tika/site/publish/download.html Wed Sep 19 15:33:39 2018
@@ -86,26 +86,26 @@
       <div id="content">
         <!-- Licensed to the Apache Software Foundation (ASF) under one or 
more --><!-- contributor license agreements.  See the NOTICE file distributed 
with --><!-- this work for additional information regarding copyright 
ownership. --><!-- The ASF licenses this file to You under the Apache License, 
Version 2.0 --><!-- (the "License"); you may not use this file except in 
compliance with --><!-- the License.  You may obtain a copy of the License at 
--><!--  --><!-- http://www.apache.org/licenses/LICENSE-2.0 --><!--  --><!-- 
Unless required by applicable law or agreed to in writing, software --><!-- 
distributed under the License is distributed on an "AS IS" BASIS, --><!-- 
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. 
--><!-- See the License for the specific language governing permissions and 
--><!-- limitations under the License. --><div class="section">
 <h2><a name="Download_Apache_Tika"></a>Download Apache Tika</h2>
-<p>Apache Tika 1.19 is now available. See the <a class="externalLink" 
href="http://www.apache.org/dist/tika/CHANGES-1.19.txt";>CHANGES.txt</a> file 
for more information on the list of updates in this initial release.</p>
+<p>Apache Tika 1.19 is now available. See the <a class="externalLink" 
href="https://www.apache.org/dist/tika/CHANGES-1.19.txt";>CHANGES.txt</a> file 
for more information on the list of updates in this initial release.</p>
 <ul>
-<li><a class="externalLink" 
href="http://www.apache.org/dyn/closer.cgi/tika/tika-1.19-src.zip";>Mirrors for 
tika-1.19-src.zip</a> (source archive, <a class="externalLink" 
href="http://www.apache.org/dist/tika/tika-1.19-src.zip.asc";>PGP 
signature</a>)<br />SHA512: 
<tt>b0ec5f1746ceb002e3f33d2a55680952dad63ec9421f5245d28e33398d077547<br 
/>b88a6f521a4b76563f38bf887aa33b8a07de318c5c546039623be3ae65d34eec</tt></li>
-<li><a class="externalLink" 
href="http://www.apache.org/dyn/closer.cgi/tika/tika-app-1.19.jar";>Mirrors for 
tika-app-1.19.jar</a> (runnable jar, <a class="externalLink" 
href="http://www.apache.org/dist/tika/tika-app-1.19.jar.asc";>PGP 
signature</a>)<br />SHA512: 
<tt>668cd4770ce98fca79c9af63bed718011f6e229ed8834ef5ead81536aad476fc<br 
/>1e3d99a07bee213acb36e102347d3a91dad9dfabd92c20017ea5620fd8b824db</tt></li>
-<li><a class="externalLink" 
href="http://www.apache.org/dyn/closer.cgi/tika/tika-server-1.19.jar";>Mirrors 
for tika-server-1.19.jar</a> (runnable jar, <a class="externalLink" 
href="http://www.apache.org/dist/tika/tika-server-1.19.jar.asc";>PGP 
signature</a>)<br />SHA512: 
<tt>a9e2b6186cdb9872466d3eda791d0e1cd059da923035940d4b51bb1adc4a3566<br 
/>70fde46995725844a2dd500a09f3a5631d0ca5fbc2d61a59e8e0bd95c9dfa6c2</tt></li>
-<li><a class="externalLink" 
href="http://www.apache.org/dyn/closer.cgi/tika/tika-eval-1.19.jar";>Mirrors for 
tika-eval-1.19.jar</a> (runnable jar, <a class="externalLink" 
href="http://www.apache.org/dist/tika/tika-eval-1.19.jar.asc";>PGP 
signature</a>)<br />SHA512: 
<tt>b9d087b38c22bc16b73e4c3d5f66c752e601efa6a82105572c6a2359e119c244<br 
/>2c6d5eaa5270c29d040ecace398b570d02fdd3b112b495775f67122066614a47</tt></li></ul>
-<p>Apache Tika releases are available under the <a class="externalLink" 
href="http://www.apache.org/licenses/LICENSE-2.0";>Apache License, Version 
2.0</a>. See the NOTICE.txt file contained in each release artifact for 
applicable copyright attribution notices.</p>
-<p>If you are looking for previous releases of Apache Tika, have a look in the 
<a class="externalLink" 
href="http://archive.apache.org/dist/tika/";>archives</a>.</p>
-<p>If you are looking for releases of Apache Tika from the Apache Lucene 
project (pre-0.8 releases), have a look in the <a class="externalLink" 
href="http://archive.apache.org/dist/lucene/tika/";>lucene archives</a>. If you 
are looking for releases of Apache Tika from the Apache Incubator (pre-0.2 
releases), have a look in the <a class="externalLink" 
href="http://archive.apache.org/dist/incubator/tika/";>incubator 
archives</a>.</p></div>
+<li><a class="externalLink" 
href="https://www.apache.org/dyn/closer.cgi/tika/tika-1.19-src.zip";>Mirrors for 
tika-1.19-src.zip</a> (source archive, <a class="externalLink" 
href="https://www.apache.org/dist/tika/tika-1.19-src.zip.asc";>PGP 
signature</a>, <a class="externalLink" 
href="https://www.apache.org/dist/tika/tika-1.19-src.zip.sha512";>SHA512</a>)<br 
/>SHA512: 
<tt>b0ec5f1746ceb002e3f33d2a55680952dad63ec9421f5245d28e33398d077547<br 
/>b88a6f521a4b76563f38bf887aa33b8a07de318c5c546039623be3ae65d34eec</tt></li>
+<li><a class="externalLink" 
href="https://www.apache.org/dyn/closer.cgi/tika/tika-app-1.19.jar";>Mirrors for 
tika-app-1.19.jar</a> (runnable jar, <a class="externalLink" 
href="https://www.apache.org/dist/tika/tika-app-1.19.jar.asc";>PGP 
signature</a>, <a class="externalLink" 
href="https://www.apache.org/dist/tika/tika-app-1.19.jar.sha512";>SHA512</a>)<br 
/>SHA512: 
<tt>668cd4770ce98fca79c9af63bed718011f6e229ed8834ef5ead81536aad476fc<br 
/>1e3d99a07bee213acb36e102347d3a91dad9dfabd92c20017ea5620fd8b824db</tt></li>
+<li><a class="externalLink" 
href="https://www.apache.org/dyn/closer.cgi/tika/tika-server-1.19.jar";>Mirrors 
for tika-server-1.19.jar</a> (runnable jar, <a class="externalLink" 
href="https://www.apache.org/dist/tika/tika-server-1.19.jar.asc";>PGP 
signature</a>, <a class="externalLink" 
href="https://www.apache.org/dist/tika/tika-server-1.19.jar.sha512";>SHA512</a>)<br
 />SHA512: 
<tt>a9e2b6186cdb9872466d3eda791d0e1cd059da923035940d4b51bb1adc4a3566<br 
/>70fde46995725844a2dd500a09f3a5631d0ca5fbc2d61a59e8e0bd95c9dfa6c2</tt></li>
+<li><a class="externalLink" 
href="https://www.apache.org/dyn/closer.cgi/tika/tika-eval-1.19.jar";>Mirrors 
for tika-eval-1.19.jar</a> (runnable jar, <a class="externalLink" 
href="https://www.apache.org/dist/tika/tika-eval-1.19.jar.asc";>PGP 
signature</a>, <a class="externalLink" 
href="https://www.apache.org/dist/tika/tika-eval-1.19.jar.sha512";>SHA512</a>)<br
 />SHA512: 
<tt>b9d087b38c22bc16b73e4c3d5f66c752e601efa6a82105572c6a2359e119c244<br 
/>2c6d5eaa5270c29d040ecace398b570d02fdd3b112b495775f67122066614a47</tt></li></ul>
+<p>Apache Tika releases are available under the <a class="externalLink" 
href="https://www.apache.org/licenses/LICENSE-2.0";>Apache License, Version 
2.0</a>. See the NOTICE.txt file contained in each release artifact for 
applicable copyright attribution notices.</p>
+<p>If you are looking for previous releases of Apache Tika, have a look in the 
<a class="externalLink" 
href="https://archive.apache.org/dist/tika/";>archives</a>.</p>
+<p>If you are looking for releases of Apache Tika from the Apache Lucene 
project (pre-0.8 releases), have a look in the <a class="externalLink" 
href="https://archive.apache.org/dist/lucene/tika/";>lucene archives</a>. If you 
are looking for releases of Apache Tika from the Apache Incubator (pre-0.2 
releases), have a look in the <a class="externalLink" 
href="https://archive.apache.org/dist/incubator/tika/";>incubator 
archives</a>.</p></div>
 <div class="section">
 <h2><a name="Export_control"></a>Export control</h2>
 <p>Apache Tika includes cryptographic software. The country in which you 
currently reside may have restrictions on the import, possession, use, and/or 
re-export to another country, of encryption software. BEFORE using any 
encryption software, please check your country's laws, regulations and policies 
concerning the import, possession, or use, and re-export of encryption 
software, to see if this is permitted. See &lt;<a class="externalLink" 
href="http://www.wassenaar.org/";>http://www.wassenaar.org/</a>&gt; for more 
information.</p>
 <p>The U.S. Government Department of Commerce, Bureau of Industry and Security 
(BIS), has classified this software as Export Commodity Control Number (ECCN) 
5D002.C.1, which includes information security software using or performing 
cryptographic functions with asymmetric algorithms. The form and manner of this 
Apache Software Foundation distribution makes it eligible for export under the 
License Exception ENC Technology Software Unrestricted (TSU) exception (see the 
BIS Export Administration Regulations, Section 740.13) for both object code and 
source code.</p>
 <p>The following provides more details on the included cryptographic 
software:</p>
 <ul>
-<li>Apache Tika uses the Bouncy Castle generic encryption libraries for 
extracting text content and metadata from encrypted PDF files. See <a 
class="externalLink" 
href="http://www.bouncycastle.org/";>http://www.bouncycastle.org/</a> for more 
details on Bouncy Castle.</li></ul></div>
+<li>Apache Tika uses the Bouncy Castle generic encryption libraries for 
extracting text content and metadata from encrypted PDF files. See <a 
class="externalLink" 
href="https://www.bouncycastle.org/";>https://www.bouncycastle.org/</a> for more 
details on Bouncy Castle.</li></ul></div>
 <div class="section">
 <h2><a name="Verify"></a>Verify</h2>
-<p>It is essential that you verify the integrity of the downloaded files using 
the PGP signatures. Please read <a class="externalLink" 
href="http://httpd.apache.org/dev/verification.html";>Verifying Apache HTTP 
Server Releases</a> for more information on why you should verify our 
releases.</p>
-<p>The PGP signatures can be verified using PGP or GPG. First download the <a 
class="externalLink" 
href="https://people.apache.org/keys/group/tika.asc";>KEYS</a> file as well as 
the .asc signature files for the relevant release packages. Make sure you get 
these files from the main distribution directory, rather than from a mirror. 
Then verify the signatures using</p>
+<p>It is essential that you verify the integrity of the downloaded files using 
the PGP signatures. Please read <a class="externalLink" 
href="https://httpd.apache.org/dev/verification.html";>Verifying Apache HTTP 
Server Releases</a> for more information on why you should verify our 
releases.</p>
+<p>The PGP signatures can be verified using PGP or GPG. First download the <a 
class="externalLink" href="https://www.apache.org/dist/tika/KEYS";>KEYS</a> file 
as well as the .asc signature files for the relevant release packages. Make 
sure you get these files from the main distribution directory, rather than from 
a mirror. Then verify the signatures using</p>
 <div class="source">
 <pre>% pgpk -a KEYS
 % pgpv apache-tika-X.Y.Z.tar.gz.asc</pre></div>

Modified: tika/site/src/site/apt/download.apt.vm
URL: 
http://svn.apache.org/viewvc/tika/site/src/site/apt/download.apt.vm?rev=1841351&r1=1841350&r2=1841351&view=diff
==============================================================================
--- tika/site/src/site/apt/download.apt.vm (original)
+++ tika/site/src/site/apt/download.apt.vm Wed Sep 19 15:33:39 2018
@@ -20,43 +20,47 @@
 Download Apache Tika
 
    Apache Tika ${project.parent.version} is now available.
-   See the 
{{{http://www.apache.org/dist/tika/CHANGES-${project.parent.version}.txt}CHANGES.txt}}
+   See the 
{{{https://www.apache.org/dist/tika/CHANGES-${project.parent.version}.txt}CHANGES.txt}}
    file for more information on the list of updates in this initial release.
 
-   * 
{{{http://www.apache.org/dyn/closer.cgi/tika/tika-${project.parent.version}-src.zip}Mirrors
 for tika-${project.parent.version}-src.zip}}
-     (source archive, 
{{{http://www.apache.org/dist/tika/tika-${project.parent.version}-src.zip.asc}PGP
 signature}})\
+   * 
{{{https://www.apache.org/dyn/closer.cgi/tika/tika-${project.parent.version}-src.zip}Mirrors
 for tika-${project.parent.version}-src.zip}}
+     (source archive, 
{{{https://www.apache.org/dist/tika/tika-${project.parent.version}-src.zip.asc}PGP
 signature}},
+      
{{{https://www.apache.org/dist/tika/tika-${project.parent.version}-src.zip.sha512}SHA512}})\
      SHA512: 
<<<b0ec5f1746ceb002e3f33d2a55680952dad63ec9421f5245d28e33398d077547\
                 
b88a6f521a4b76563f38bf887aa33b8a07de318c5c546039623be3ae65d34eec>>>
 
-   * 
{{{http://www.apache.org/dyn/closer.cgi/tika/tika-app-${project.parent.version}.jar}Mirrors
 for tika-app-${project.parent.version}.jar}}
-     (runnable jar, 
{{{http://www.apache.org/dist/tika/tika-app-${project.parent.version}.jar.asc}PGP
 signature}})\
+   * 
{{{https://www.apache.org/dyn/closer.cgi/tika/tika-app-${project.parent.version}.jar}Mirrors
 for tika-app-${project.parent.version}.jar}}
+     (runnable jar, 
{{{https://www.apache.org/dist/tika/tika-app-${project.parent.version}.jar.asc}PGP
 signature}},
+     
{{{https://www.apache.org/dist/tika/tika-app-${project.parent.version}.jar.sha512}SHA512}})\
      SHA512: 
<<<668cd4770ce98fca79c9af63bed718011f6e229ed8834ef5ead81536aad476fc\
                 
1e3d99a07bee213acb36e102347d3a91dad9dfabd92c20017ea5620fd8b824db>>>
 
-   * 
{{{http://www.apache.org/dyn/closer.cgi/tika/tika-server-${project.parent.version}.jar}Mirrors
 for tika-server-${project.parent.version}.jar}}
-     (runnable jar, 
{{{http://www.apache.org/dist/tika/tika-server-${project.parent.version}.jar.asc}PGP
 signature}})\
+   * 
{{{https://www.apache.org/dyn/closer.cgi/tika/tika-server-${project.parent.version}.jar}Mirrors
 for tika-server-${project.parent.version}.jar}}
+     (runnable jar, 
{{{https://www.apache.org/dist/tika/tika-server-${project.parent.version}.jar.asc}PGP
 signature}},
+          
{{{https://www.apache.org/dist/tika/tika-server-${project.parent.version}.jar.sha512}SHA512}})\
      SHA512: 
<<<a9e2b6186cdb9872466d3eda791d0e1cd059da923035940d4b51bb1adc4a3566\
                 
70fde46995725844a2dd500a09f3a5631d0ca5fbc2d61a59e8e0bd95c9dfa6c2>>>
 
-   * 
{{{http://www.apache.org/dyn/closer.cgi/tika/tika-eval-${project.parent.version}.jar}Mirrors
 for tika-eval-${project.parent.version}.jar}}
-     (runnable jar, 
{{{http://www.apache.org/dist/tika/tika-eval-${project.parent.version}.jar.asc}PGP
 signature}})\
+   * 
{{{https://www.apache.org/dyn/closer.cgi/tika/tika-eval-${project.parent.version}.jar}Mirrors
 for tika-eval-${project.parent.version}.jar}}
+     (runnable jar, 
{{{https://www.apache.org/dist/tika/tika-eval-${project.parent.version}.jar.asc}PGP
 signature}},
+          
{{{https://www.apache.org/dist/tika/tika-eval-${project.parent.version}.jar.sha512}SHA512}})\
      SHA512: 
<<<b9d087b38c22bc16b73e4c3d5f66c752e601efa6a82105572c6a2359e119c244\
                 
2c6d5eaa5270c29d040ecace398b570d02fdd3b112b495775f67122066614a47>>>
 
    []
 
    Apache Tika releases are available under the
-   {{{http://www.apache.org/licenses/LICENSE-2.0}Apache License, Version 2.0}}.
+   {{{https://www.apache.org/licenses/LICENSE-2.0}Apache License, Version 
2.0}}.
    See the NOTICE.txt file contained in each release artifact for applicable
    copyright attribution notices.
 
    If you are looking for previous releases of Apache Tika, have a look in the 
-   {{{http://archive.apache.org/dist/tika/}archives}}.
+   {{{https://archive.apache.org/dist/tika/}archives}}.
    
    If you are looking for releases of Apache Tika from the Apache Lucene 
project (pre-0.8 releases), 
-   have a look in the {{{http://archive.apache.org/dist/lucene/tika/}lucene 
archives}}. If you are 
+   have a look in the {{{https://archive.apache.org/dist/lucene/tika/}lucene 
archives}}. If you are
    looking for releases of Apache Tika from the Apache Incubator (pre-0.2 
releases), have a look in the 
-   {{{http://archive.apache.org/dist/incubator/tika/}incubator archives}}.
+   {{{https://archive.apache.org/dist/incubator/tika/}incubator archives}}.
 
 Export control
 
@@ -81,17 +85,17 @@ Export control
 
    * Apache Tika uses the Bouncy Castle generic encryption libraries for
      extracting text content and metadata from encrypted PDF files.
-     See {{http://www.bouncycastle.org/}} for more details on Bouncy Castle.
+     See {{https://www.bouncycastle.org/}} for more details on Bouncy Castle.
 
 Verify
 
    It is essential that you verify the integrity of the downloaded files
    using the PGP signatures. Please read
-   {{{http://httpd.apache.org/dev/verification.html}Verifying Apache HTTP 
Server Releases}}
+   {{{https://httpd.apache.org/dev/verification.html}Verifying Apache HTTP 
Server Releases}}
    for more information on why you should verify our releases.
 
    The PGP signatures can be verified using PGP or GPG. First download the 
-   {{{https://people.apache.org/keys/group/tika.asc}KEYS}} file as well as the 
.asc signature files for the relevant release
+   {{{https://www.apache.org/dist/tika/KEYS}KEYS}} file as well as the .asc 
signature files for the relevant release
    packages. Make sure you get these files from the main distribution
    directory, rather than from a mirror. Then verify the signatures using
 


Reply via email to