Author: nick
Date: Mon Oct 26 11:49:33 2015
New Revision: 1710579
URL: http://svn.apache.org/viewvc?rev=1710579&view=rev
Log:
Make the "Full list of Supported Formats" a formal section, and list it in the
ToC
Modified:
tika/site/publish/1.10/formats.html
tika/site/publish/1.11/formats.html
tika/site/publish/1.12/formats.html
tika/site/publish/1.8/formats.html
tika/site/publish/1.9/formats.html
tika/site/src/site/apt/1.10/formats.apt
tika/site/src/site/apt/1.11/formats.apt
tika/site/src/site/apt/1.12/formats.apt
tika/site/src/site/apt/1.8/formats.apt
tika/site/src/site/apt/1.9/formats.apt
Modified: tika/site/publish/1.10/formats.html
URL:
http://svn.apache.org/viewvc/tika/site/publish/1.10/formats.html?rev=1710579&r1=1710578&r2=1710579&view=diff
==============================================================================
--- tika/site/publish/1.10/formats.html (original)
+++ tika/site/publish/1.10/formats.html Mon Oct 26 11:49:33 2015
@@ -114,7 +114,8 @@
<li><a href="#Scientific_formats">Scientific formats</a></li>
<li><a href="#Executable_programs_and_libraries">Executable programs and
libraries</a></li>
<li><a href="#Crypto_formats">Crypto formats</a></li>
-<li><a href="#Database_formats">Database formats</a></li></ul></li></ul>
+<li><a href="#Database_formats">Database formats</a></li></ul></li>
+<li><a href="#Full_list_of_Supported_Formats">Full list of Supported
Formats</a></li></ul>
<div class="section">
<h3><a name="HyperText_Markup_Language">HyperText Markup Language</a></h3>
<p>The HyperText Markup Language (HTML) is the lingua franca of the web. Tika
uses the <a class="externalLink"
href="http://home.ccil.org/~cowan/XML/tagsoup/">TagSoup</a> library to support
virtually any kind of HTML found on the web. The output from the <a
href="./api/org/apache/tika/parser/html/HtmlParser.html">HtmlParser</a> class
is guaranteed to be well-formed and valid XHTML, and various heuristics are
used to prevent things like inline scripts from cluttering the extracted text
content.</p></div>
@@ -207,7 +208,7 @@
<p>The <a
href="./api/org/apache/tika/parser/jdbc/SQLite3Parser.html">SQLite3Parser</a>
is able to extract content from SQLite3 files, in a tabular form. However, it
requires that the <a href="#org.xerial_sqlite-jdbc_jar"></a> is manually added
to the classpath first, as that binary jar isn't shipped as standard.</p>
<p>The <a
href="./api/org/apache/tika/parser/microsoft/JackcessParser.html">JackcessParser</a>
is able to extract metadata and content in a tabular form, from Microsoft
Access database files.</p></div></div>
<div class="section">
-<h2><a name="Full_list_of_supported_formats:"></a>Full list of supported
formats:</h2>
+<h2><a name="Full_list_of_Supported_Formats"></a>Full list of Supported
Formats</h2>
<ul>
<li>org.apache.tika.parser.asm.<a
href="./api/org/apache/tika/parser/asm/ClassParser">ClassParser</a>
<ul>
Modified: tika/site/publish/1.11/formats.html
URL:
http://svn.apache.org/viewvc/tika/site/publish/1.11/formats.html?rev=1710579&r1=1710578&r2=1710579&view=diff
==============================================================================
--- tika/site/publish/1.11/formats.html (original)
+++ tika/site/publish/1.11/formats.html Mon Oct 26 11:49:33 2015
@@ -114,7 +114,8 @@
<li><a href="#Scientific_formats">Scientific formats</a></li>
<li><a href="#Executable_programs_and_libraries">Executable programs and
libraries</a></li>
<li><a href="#Crypto_formats">Crypto formats</a></li>
-<li><a href="#Database_formats">Database formats</a></li></ul></li></ul>
+<li><a href="#Database_formats">Database formats</a></li></ul></li>
+<li><a href="#Full_list_of_Supported_Formats">Full list of Supported
Formats</a></li></ul>
<div class="section">
<h3><a name="HyperText_Markup_Language">HyperText Markup Language</a></h3>
<p>The HyperText Markup Language (HTML) is the lingua franca of the web. Tika
uses the <a class="externalLink"
href="http://home.ccil.org/~cowan/XML/tagsoup/">TagSoup</a> library to support
virtually any kind of HTML found on the web. The output from the <a
href="./api/org/apache/tika/parser/html/HtmlParser.html">HtmlParser</a> class
is guaranteed to be well-formed and valid XHTML, and various heuristics are
used to prevent things like inline scripts from cluttering the extracted text
content.</p></div>
@@ -207,7 +208,7 @@
<p>The <a
href="./api/org/apache/tika/parser/jdbc/SQLite3Parser.html">SQLite3Parser</a>
is able to extract content from SQLite3 files, in a tabular form. However, it
requires that the <a href="#org.xerial_sqlite-jdbc_jar"></a> is manually added
to the classpath first, as that binary jar isn't shipped as standard.</p>
<p>The <a
href="./api/org/apache/tika/parser/microsoft/JackcessParser.html">JackcessParser</a>
is able to extract metadata and content in a tabular form, from Microsoft
Access database files.</p></div></div>
<div class="section">
-<h2><a name="Full_list_of_supported_formats:"></a>Full list of supported
formats:</h2>
+<h2><a name="Full_list_of_Supported_Formats"></a>Full list of Supported
Formats</h2>
<ul>
<li>org.apache.tika.parser.asm.<a
href="./api/org/apache/tika/parser/asm/ClassParser">ClassParser</a>
<ul>
Modified: tika/site/publish/1.12/formats.html
URL:
http://svn.apache.org/viewvc/tika/site/publish/1.12/formats.html?rev=1710579&r1=1710578&r2=1710579&view=diff
==============================================================================
--- tika/site/publish/1.12/formats.html (original)
+++ tika/site/publish/1.12/formats.html Mon Oct 26 11:49:33 2015
@@ -114,7 +114,8 @@
<li><a href="#Scientific_formats">Scientific formats</a></li>
<li><a href="#Executable_programs_and_libraries">Executable programs and
libraries</a></li>
<li><a href="#Crypto_formats">Crypto formats</a></li>
-<li><a href="#Database_formats">Database formats</a></li></ul></li></ul>
+<li><a href="#Database_formats">Database formats</a></li></ul></li>
+<li><a href="#Full_list_of_Supported_Formats">Full list of Supported
Formats</a></li></ul>
<div class="section">
<h3><a name="HyperText_Markup_Language">HyperText Markup Language</a></h3>
<p>The HyperText Markup Language (HTML) is the lingua franca of the web. Tika
uses the <a class="externalLink"
href="http://home.ccil.org/~cowan/XML/tagsoup/">TagSoup</a> library to support
virtually any kind of HTML found on the web. The output from the <a
href="./api/org/apache/tika/parser/html/HtmlParser.html">HtmlParser</a> class
is guaranteed to be well-formed and valid XHTML, and various heuristics are
used to prevent things like inline scripts from cluttering the extracted text
content.</p></div>
@@ -207,7 +208,7 @@
<p>The <a
href="./api/org/apache/tika/parser/jdbc/SQLite3Parser.html">SQLite3Parser</a>
is able to extract content from SQLite3 files, in a tabular form. However, it
requires that the <a href="#org.xerial_sqlite-jdbc_jar"></a> is manually added
to the classpath first, as that binary jar isn't shipped as standard.</p>
<p>The <a
href="./api/org/apache/tika/parser/microsoft/JackcessParser.html">JackcessParser</a>
is able to extract metadata and content in a tabular form, from Microsoft
Access database files.</p></div></div>
<div class="section">
-<h2><a name="Full_list_of_supported_formats:"></a>Full list of supported
formats:</h2>
+<h2><a name="Full_list_of_Supported_Formats"></a>Full list of Supported
Formats</h2>
<p>TODO Populate this at release time</p></div>
</div>
<div id="sidebar">
Modified: tika/site/publish/1.8/formats.html
URL:
http://svn.apache.org/viewvc/tika/site/publish/1.8/formats.html?rev=1710579&r1=1710578&r2=1710579&view=diff
==============================================================================
--- tika/site/publish/1.8/formats.html (original)
+++ tika/site/publish/1.8/formats.html Mon Oct 26 11:49:33 2015
@@ -113,7 +113,8 @@
<li><a href="#Font_formats">Font formats</a></li>
<li><a href="#Scientific_formats">Scientific formats</a></li>
<li><a href="#Executable_programs_and_libraries">Executable programs and
libraries</a></li>
-<li><a href="#Crypto_formats">Crypto formats</a></li></ul></li></ul>
+<li><a href="#Crypto_formats">Crypto formats</a></li></ul></li>
+<li><a href="#Full_list_of_Supported_Formats">Full list of Supported
Formats</a></li></ul>
<div class="section">
<h3><a name="HyperText_Markup_Language">HyperText Markup Language</a></h3>
<p>The HyperText Markup Language (HTML) is the lingua franca of the web. Tika
uses the <a class="externalLink"
href="http://home.ccil.org/~cowan/XML/tagsoup/">TagSoup</a> library to support
virtually any kind of HTML found on the web. The output from the <a
href="./api/org/apache/tika/parser/html/HtmlParser.html">HtmlParser</a> class
is guaranteed to be well-formed and valid XHTML, and various heuristics are
used to prevent things like inline scripts from cluttering the extracted text
content.</p></div>
@@ -196,7 +197,7 @@
<h3><a name="Crypto_formats">Crypto formats</a></h3>
<p>The <a
href="./api/org/apache/tika/parser/crypto/Pkcs7Parser.html">Pkcs7Parser</a> is
able to parse the contents of PKCS7 signed messages, but doesn't include any
information from the outer PKCS7 wrapper.</p></div></div>
<div class="section">
-<h2><a name="Full_list_of_supported_formats:"></a>Full list of supported
formats:</h2>
+<h2><a name="Full_list_of_Supported_Formats"></a>Full list of Supported
Formats</h2>
<ul>
<li>org.apache.tika.parser.asm.<a
href="./api/org/apache/tika/parser/asm/ClassParser">ClassParser</a>
<ul>
Modified: tika/site/publish/1.9/formats.html
URL:
http://svn.apache.org/viewvc/tika/site/publish/1.9/formats.html?rev=1710579&r1=1710578&r2=1710579&view=diff
==============================================================================
--- tika/site/publish/1.9/formats.html (original)
+++ tika/site/publish/1.9/formats.html Mon Oct 26 11:49:33 2015
@@ -113,7 +113,8 @@
<li><a href="#Font_formats">Font formats</a></li>
<li><a href="#Scientific_formats">Scientific formats</a></li>
<li><a href="#Executable_programs_and_libraries">Executable programs and
libraries</a></li>
-<li><a href="#Crypto_formats">Crypto formats</a></li></ul></li></ul>
+<li><a href="#Crypto_formats">Crypto formats</a></li></ul></li>
+<li><a href="#Full_list_of_Supported_Formats">Full list of Supported
Formats</a></li></ul>
<div class="section">
<h3><a name="HyperText_Markup_Language">HyperText Markup Language</a></h3>
<p>The HyperText Markup Language (HTML) is the lingua franca of the web. Tika
uses the <a class="externalLink"
href="http://home.ccil.org/~cowan/XML/tagsoup/">TagSoup</a> library to support
virtually any kind of HTML found on the web. The output from the <a
href="./api/org/apache/tika/parser/html/HtmlParser.html">HtmlParser</a> class
is guaranteed to be well-formed and valid XHTML, and various heuristics are
used to prevent things like inline scripts from cluttering the extracted text
content.</p></div>
@@ -202,7 +203,7 @@
<h3><a name="Crypto_formats">Crypto formats</a></h3>
<p>The <a
href="./api/org/apache/tika/parser/crypto/Pkcs7Parser.html">Pkcs7Parser</a> is
able to parse the contents of PKCS7 signed messages, but doesn't include any
information from the outer PKCS7 wrapper.</p></div></div>
<div class="section">
-<h2><a name="Full_list_of_supported_formats:"></a>Full list of supported
formats:</h2>
+<h2><a name="Full_list_of_Supported_Formats"></a>Full list of Supported
Formats</h2>
<ul>
<li>org.apache.tika.parser.asm.<a
href="./api/org/apache/tika/parser/asm/ClassParser">ClassParser</a>
<ul>
Modified: tika/site/src/site/apt/1.10/formats.apt
URL:
http://svn.apache.org/viewvc/tika/site/src/site/apt/1.10/formats.apt?rev=1710579&r1=1710578&r2=1710579&view=diff
==============================================================================
--- tika/site/src/site/apt/1.10/formats.apt (original)
+++ tika/site/src/site/apt/1.10/formats.apt Mon Oct 26 11:49:33 2015
@@ -28,7 +28,7 @@ Supported Document Formats
formats than those listed below, this page only documents those formats
from which Tika is able to extract metadata and/or textual content.
-%{toc|section=1|fromDepth=1}
+%{toc|fromDepth=1}
* {HyperText Markup Language}
@@ -297,7 +297,8 @@ Supported Document Formats
able to extract metadata and content in a tabular form, from Microsoft
Access
database files.
-Full list of supported formats:
+
+Full list of Supported Formats
*
org.apache.tika.parser.asm.{{{./api/org/apache/tika/parser/asm/ClassParser}ClassParser}}
Modified: tika/site/src/site/apt/1.11/formats.apt
URL:
http://svn.apache.org/viewvc/tika/site/src/site/apt/1.11/formats.apt?rev=1710579&r1=1710578&r2=1710579&view=diff
==============================================================================
--- tika/site/src/site/apt/1.11/formats.apt (original)
+++ tika/site/src/site/apt/1.11/formats.apt Mon Oct 26 11:49:33 2015
@@ -28,7 +28,7 @@ Supported Document Formats
formats than those listed below, this page only documents those formats
from which Tika is able to extract metadata and/or textual content.
-%{toc|section=1|fromDepth=1}
+%{toc|fromDepth=1}
* {HyperText Markup Language}
@@ -297,7 +297,8 @@ Supported Document Formats
able to extract metadata and content in a tabular form, from Microsoft
Access
database files.
-Full list of supported formats:
+
+Full list of Supported Formats
*
org.apache.tika.parser.asm.{{{./api/org/apache/tika/parser/asm/ClassParser}ClassParser}}
Modified: tika/site/src/site/apt/1.12/formats.apt
URL:
http://svn.apache.org/viewvc/tika/site/src/site/apt/1.12/formats.apt?rev=1710579&r1=1710578&r2=1710579&view=diff
==============================================================================
--- tika/site/src/site/apt/1.12/formats.apt (original)
+++ tika/site/src/site/apt/1.12/formats.apt Mon Oct 26 11:49:33 2015
@@ -28,7 +28,7 @@ Supported Document Formats
formats than those listed below, this page only documents those formats
from which Tika is able to extract metadata and/or textual content.
-%{toc|section=1|fromDepth=1}
+%{toc|fromDepth=1}
* {HyperText Markup Language}
@@ -297,6 +297,7 @@ Supported Document Formats
able to extract metadata and content in a tabular form, from Microsoft
Access
database files.
-Full list of supported formats:
+
+Full list of Supported Formats
TODO Populate this at release time
Modified: tika/site/src/site/apt/1.8/formats.apt
URL:
http://svn.apache.org/viewvc/tika/site/src/site/apt/1.8/formats.apt?rev=1710579&r1=1710578&r2=1710579&view=diff
==============================================================================
--- tika/site/src/site/apt/1.8/formats.apt (original)
+++ tika/site/src/site/apt/1.8/formats.apt Mon Oct 26 11:49:33 2015
@@ -28,7 +28,7 @@ Supported Document Formats
formats than those listed below, this page only documents those formats
from which Tika is able to extract metadata and/or textual content)
-%{toc|section=1|fromDepth=1}
+%{toc|fromDepth=1}
* {HyperText Markup Language}
@@ -259,7 +259,8 @@ Supported Document Formats
parse the contents of PKCS7 signed messages, but doesn't include any
information from
the outer PKCS7 wrapper.
-Full list of supported formats:
+
+Full list of Supported Formats
*
org.apache.tika.parser.asm.{{{./api/org/apache/tika/parser/asm/ClassParser}ClassParser}}
Modified: tika/site/src/site/apt/1.9/formats.apt
URL:
http://svn.apache.org/viewvc/tika/site/src/site/apt/1.9/formats.apt?rev=1710579&r1=1710578&r2=1710579&view=diff
==============================================================================
--- tika/site/src/site/apt/1.9/formats.apt (original)
+++ tika/site/src/site/apt/1.9/formats.apt Mon Oct 26 11:49:33 2015
@@ -28,7 +28,7 @@ Supported Document Formats
formats than those listed below, this page only documents those formats
from which Tika is able to extract metadata and/or textual content.
-%{toc|section=1|fromDepth=1}
+%{toc|fromDepth=1}
* {HyperText Markup Language}
@@ -286,7 +286,8 @@ Supported Document Formats
parse the contents of PKCS7 signed messages, but doesn't include any
information from
the outer PKCS7 wrapper.
-Full list of supported formats:
+
+Full list of Supported Formats
*
org.apache.tika.parser.asm.{{{./api/org/apache/tika/parser/asm/ClassParser}ClassParser}}