Modified: tika/site/publish/1.18/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.18/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.18/gettingstarted.html (original) +++ tika/site/publish/1.18/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 7 or higher to build Tika.</p></div> <div class="section"> @@ -120,36 +119,31 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>1.18</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on <tt> tika-parsers </tt> instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>1.18</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_a_Gradle-built_project"></a>Using Tika in a Gradle-built project</h2> <p>To add a dependency on Apache Tika to your Gradle built project, including the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact:</p> <div> <pre>dependencies { runtime 'org.apache.tika:tika-parsers:1.18' -} -</pre></div></div> +}</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>If you are using <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a> as your dependency manager tool with Ant, then to include Tika with the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact like this:</p> <div> <pre> <dependencies> <dependency org="org.apache.tika" name="tika-parsers" rev="1.18"/> - </dependencies> -</pre></div> + </dependencies></pre></div> <p>Otherwise, probably the easiest way to use Tika is to include the full <tt> tika-app </tt> jar on your classpath. For just core functionality, you can add the <tt> tika-core </tt> jar, but be aware that the full set of parsers have a large number of dependencies which must be included which is very fiddly to do by hand with Ant! To include Tika in your Ant project, you should do something like:</p> <div> <pre><classpath> @@ -160,8 +154,7 @@ <!-- or: Tika with all Parsers--> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -277,15 +270,13 @@ Batch Options: To modify child process jvm args, prepend "J" as in: -JXmx4g or -JDlog4j.configuration=file:log4j.xml. - </pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> <div class="section"> <h2><a name="Wrappers"></a>Wrappers</h2> <p>Several wrappers are available to use Tika in another programming language, such as <a class="externalLink" href="https://github.com/aviks/Taro.jl">Julia</a> or <a class="externalLink" href="https://github.com/chrismattmann/tika-python">Python</a>.</p></div>
Modified: tika/site/publish/1.19.1/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.19.1/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.19.1/gettingstarted.html (original) +++ tika/site/publish/1.19.1/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 8 or higher to build Tika.</p></div> <div class="section"> @@ -120,36 +119,31 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>1.19.1</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on <tt> tika-parsers </tt> instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>1.19.1</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_a_Gradle-built_project"></a>Using Tika in a Gradle-built project</h2> <p>To add a dependency on Apache Tika to your Gradle built project, including the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact:</p> <div> <pre>dependencies { runtime 'org.apache.tika:tika-parsers:1.19.1' -} -</pre></div></div> +}</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>If you are using <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a> as your dependency manager tool with Ant, then to include Tika with the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact like this:</p> <div> <pre> <dependencies> <dependency org="org.apache.tika" name="tika-parsers" rev="1.19.1"/> - </dependencies> -</pre></div> + </dependencies></pre></div> <p>Otherwise, probably the easiest way to use Tika is to include the full <tt> tika-app </tt> jar on your classpath. For just core functionality, you can add the <tt> tika-core </tt> jar, but be aware that the full set of parsers have a large number of dependencies which must be included which is very fiddly to do by hand with Ant! To include Tika in your Ant project, you should do something like:</p> <div> <pre><classpath> @@ -160,8 +154,7 @@ <!-- or: Tika with all Parsers--> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -277,15 +270,13 @@ Batch Options: To modify child process jvm args, prepend "J" as in: -JXmx4g or -JDlog4j.configuration=file:log4j.xml. - </pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> <div class="section"> <h2><a name="Wrappers"></a>Wrappers</h2> <p>Several wrappers are available to use Tika in another programming language, such as <a class="externalLink" href="https://github.com/aviks/Taro.jl">Julia</a> or <a class="externalLink" href="https://github.com/chrismattmann/tika-python">Python</a>.</p></div> Modified: tika/site/publish/1.19/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.19/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.19/gettingstarted.html (original) +++ tika/site/publish/1.19/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 8 or higher to build Tika.</p></div> <div class="section"> @@ -120,36 +119,31 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>1.19</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on <tt> tika-parsers </tt> instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>1.19</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_a_Gradle-built_project"></a>Using Tika in a Gradle-built project</h2> <p>To add a dependency on Apache Tika to your Gradle built project, including the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact:</p> <div> <pre>dependencies { runtime 'org.apache.tika:tika-parsers:1.19' -} -</pre></div></div> +}</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>If you are using <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a> as your dependency manager tool with Ant, then to include Tika with the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact like this:</p> <div> <pre> <dependencies> <dependency org="org.apache.tika" name="tika-parsers" rev="1.19"/> - </dependencies> -</pre></div> + </dependencies></pre></div> <p>Otherwise, probably the easiest way to use Tika is to include the full <tt> tika-app </tt> jar on your classpath. For just core functionality, you can add the <tt> tika-core </tt> jar, but be aware that the full set of parsers have a large number of dependencies which must be included which is very fiddly to do by hand with Ant! To include Tika in your Ant project, you should do something like:</p> <div> <pre><classpath> @@ -160,8 +154,7 @@ <!-- or: Tika with all Parsers--> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -277,15 +270,13 @@ Batch Options: To modify child process jvm args, prepend "J" as in: -JXmx4g or -JDlog4j.configuration=file:log4j.xml. - </pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> <div class="section"> <h2><a name="Wrappers"></a>Wrappers</h2> <p>Several wrappers are available to use Tika in another programming language, such as <a class="externalLink" href="https://github.com/aviks/Taro.jl">Julia</a> or <a class="externalLink" href="https://github.com/chrismattmann/tika-python">Python</a>.</p></div> Modified: tika/site/publish/1.2/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.2/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.2/gettingstarted.html (original) +++ tika/site/publish/1.2/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 5 or higher to build Tika.</p></div> <div class="section"> @@ -116,16 +115,14 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>1.2</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on tika-parsers instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>1.2</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. The listing below shows all the compile-scope dependencies of tika-parsers in the Tika 1.2 release.</p> <div> <pre>+- org.apache.tika:tika-core:jar:1.2:compile @@ -161,7 +158,6 @@ +- rome:rome:jar:0.9:compile | \- jdom:jdom:jar:1.0:compile +- org.gagravarr:vorbis-java-core:jar:0.1:compile - </pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> @@ -175,8 +171,7 @@ <!-- or: --> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-1.2.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -244,15 +239,13 @@ Description: Use the "--server" (or "-s") option to start the Apache Tika server. The server will listen to the - ports you specify as one or more arguments. -</pre></div> + ports you specify as one or more arguments.</pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app-1.2.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> </div> <div id="sidebar"> <div id="navigation"> Modified: tika/site/publish/1.20/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.20/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.20/gettingstarted.html (original) +++ tika/site/publish/1.20/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="https://maven.apache.org/">Maven 3</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 8 or higher to build Tika.</p></div> <div class="section"> @@ -120,36 +119,31 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>1.20</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on <tt> tika-parsers </tt> instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>1.20</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_a_Gradle-built_project"></a>Using Tika in a Gradle-built project</h2> <p>To add a dependency on Apache Tika to your Gradle built project, including the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact:</p> <div> <pre>dependencies { runtime 'org.apache.tika:tika-parsers:1.20' -} -</pre></div></div> +}</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>If you are using <a class="externalLink" href="https://ant.apache.org/ivy/">Apache Ivy</a> as your dependency manager tool with Ant, then to include Tika with the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact like this:</p> <div> <pre> <dependencies> <dependency org="org.apache.tika" name="tika-parsers" rev="1.20"/> - </dependencies> -</pre></div> + </dependencies></pre></div> <p>Otherwise, probably the easiest way to use Tika is to include the full <tt> tika-app </tt> jar on your classpath. For just core functionality, you can add the <tt> tika-core </tt> jar, but be aware that the full set of parsers have a large number of dependencies which must be included which is very fiddly to do by hand with Ant! To include Tika in your Ant project, you should do something like:</p> <div> <pre><classpath> @@ -160,8 +154,7 @@ <!-- or: Tika with all Parsers--> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -277,15 +270,13 @@ Batch Options: To modify child process jvm args, prepend "J" as in: -JXmx4g or -JDlog4j.configuration=file:log4j.xml. - </pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> <div class="section"> <h2><a name="Wrappers"></a>Wrappers</h2> <p>Several wrappers are available to use Tika in another programming language, such as <a class="externalLink" href="https://github.com/aviks/Taro.jl">Julia</a> or <a class="externalLink" href="https://github.com/chrismattmann/tika-python">Python</a>.</p></div> Modified: tika/site/publish/1.21/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.21/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.21/gettingstarted.html (original) +++ tika/site/publish/1.21/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 8 or higher to build Tika.</p></div> <div class="section"> @@ -120,36 +119,31 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>1.21</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on <tt> tika-parsers </tt> instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>1.21</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_a_Gradle-built_project"></a>Using Tika in a Gradle-built project</h2> <p>To add a dependency on Apache Tika to your Gradle built project, including the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact:</p> <div> <pre>dependencies { runtime 'org.apache.tika:tika-parsers:1.21' -} -</pre></div></div> +}</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>If you are using <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a> as your dependency manager tool with Ant, then to include Tika with the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact like this:</p> <div> <pre> <dependencies> <dependency org="org.apache.tika" name="tika-parsers" rev="1.21"/> - </dependencies> -</pre></div> + </dependencies></pre></div> <p>Otherwise, probably the easiest way to use Tika is to include the full <tt> tika-app </tt> jar on your classpath. For just core functionality, you can add the <tt> tika-core </tt> jar, but be aware that the full set of parsers have a large number of dependencies which must be included which is very fiddly to do by hand with Ant! To include Tika in your Ant project, you should do something like:</p> <div> <pre><classpath> @@ -160,8 +154,7 @@ <!-- or: Tika with all Parsers--> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -277,15 +270,13 @@ Batch Options: To modify child process jvm args, prepend "J" as in: -JXmx4g or -JDlog4j.configuration=file:log4j.xml. - </pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> <div class="section"> <h2><a name="Wrappers"></a>Wrappers</h2> <p>Several wrappers are available to use Tika in another programming language, such as <a class="externalLink" href="https://github.com/aviks/Taro.jl">Julia</a> or <a class="externalLink" href="https://github.com/chrismattmann/tika-python">Python</a>.</p></div> Modified: tika/site/publish/1.22/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.22/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.22/gettingstarted.html (original) +++ tika/site/publish/1.22/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 8 or higher to build Tika.</p></div> <div class="section"> @@ -120,36 +119,31 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>1.22</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on <tt> tika-parsers </tt> instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>1.22</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_a_Gradle-built_project"></a>Using Tika in a Gradle-built project</h2> <p>To add a dependency on Apache Tika to your Gradle built project, including the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact:</p> <div> <pre>dependencies { runtime 'org.apache.tika:tika-parsers:1.22' -} -</pre></div></div> +}</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>If you are using <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a> as your dependency manager tool with Ant, then to include Tika with the full set of parsers, you should depend on the <tt> tika-parsers </tt> artifact like this:</p> <div> <pre> <dependencies> <dependency org="org.apache.tika" name="tika-parsers" rev="1.22"/> - </dependencies> -</pre></div> + </dependencies></pre></div> <p>Otherwise, probably the easiest way to use Tika is to include the full <tt> tika-app </tt> jar on your classpath. For just core functionality, you can add the <tt> tika-core </tt> jar, but be aware that the full set of parsers have a large number of dependencies which must be included which is very fiddly to do by hand with Ant! To include Tika in your Ant project, you should do something like:</p> <div> <pre><classpath> @@ -160,8 +154,7 @@ <!-- or: Tika with all Parsers--> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -277,15 +270,13 @@ Batch Options: To modify child process jvm args, prepend "J" as in: -JXmx4g or -JDlog4j.configuration=file:log4j.xml. - </pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> <div class="section"> <h2><a name="Wrappers"></a>Wrappers</h2> <p>Several wrappers are available to use Tika in another programming language, such as <a class="externalLink" href="https://github.com/aviks/Taro.jl">Julia</a> or <a class="externalLink" href="https://github.com/chrismattmann/tika-python">Python</a>.</p></div> Modified: tika/site/publish/1.3/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.3/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.3/gettingstarted.html (original) +++ tika/site/publish/1.3/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 5 or higher to build Tika.</p></div> <div class="section"> @@ -116,20 +115,17 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on tika-parsers instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>Unless you use a dependency manager tool like <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a>, the easiest way to use Tika is to include either the tika-core or the tika-app jar in your classpath, depending on whether you want just the core functionality or also all the parser implementations.</p> @@ -142,8 +138,7 @@ <!-- or: --> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -213,15 +208,13 @@ Description: Use the "--server" (or "-s") option to start the Apache Tika server. The server will listen to the - ports you specify as one or more arguments. -</pre></div> + ports you specify as one or more arguments.</pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> </div> <div id="sidebar"> <div id="navigation"> Modified: tika/site/publish/1.4/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.4/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.4/gettingstarted.html (original) +++ tika/site/publish/1.4/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 6 or higher to build Tika.</p></div> <div class="section"> @@ -116,20 +115,17 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on tika-parsers instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>Unless you use a dependency manager tool like <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a>, the easiest way to use Tika is to include either the tika-core or the tika-app jar in your classpath, depending on whether you want just the core functionality or also all the parser implementations.</p> @@ -142,8 +138,7 @@ <!-- or: --> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -213,15 +208,13 @@ Description: Use the "--server" (or "-s") option to start the Apache Tika server. The server will listen to the - ports you specify as one or more arguments. -</pre></div> + ports you specify as one or more arguments.</pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> </div> <div id="sidebar"> <div id="navigation"> Modified: tika/site/publish/1.5/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.5/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.5/gettingstarted.html (original) +++ tika/site/publish/1.5/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 6 or higher to build Tika.</p></div> <div class="section"> @@ -116,20 +115,17 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on tika-parsers instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>Unless you use a dependency manager tool like <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a>, the easiest way to use Tika is to include either the tika-core or the tika-app jar in your classpath, depending on whether you want just the core functionality or also all the parser implementations.</p> @@ -142,8 +138,7 @@ <!-- or: --> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -213,15 +208,13 @@ Description: Use the "--server" (or "-s") option to start the Apache Tika server. The server will listen to the - ports you specify as one or more arguments. -</pre></div> + ports you specify as one or more arguments.</pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> </div> <div id="sidebar"> <div id="navigation"> Modified: tika/site/publish/1.6/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.6/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.6/gettingstarted.html (original) +++ tika/site/publish/1.6/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 6 or higher to build Tika.</p></div> <div class="section"> @@ -116,20 +115,17 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on tika-parsers instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>Unless you use a dependency manager tool like <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a>, the easiest way to use Tika is to include either the tika-core or the tika-app jar in your classpath, depending on whether you want just the core functionality or also all the parser implementations.</p> @@ -142,8 +138,7 @@ <!-- or: --> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -213,15 +208,13 @@ Description: Use the "--server" (or "-s") option to start the Apache Tika server. The server will listen to the - ports you specify as one or more arguments. -</pre></div> + ports you specify as one or more arguments.</pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> <div class="section"> <h2><a name="Wrappers"></a>Wrappers</h2> <p>Several wrappers are available to use Tika in another programming language, such as <a class="externalLink" href="https://github.com/aviks/Taro.jl">Julia</a> or <a class="externalLink" href="https://github.com/chrismattmann/tika-python">Python</a>.</p></div> Modified: tika/site/publish/1.7/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.7/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.7/gettingstarted.html (original) +++ tika/site/publish/1.7/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 6 or higher to build Tika.</p></div> <div class="section"> @@ -116,20 +115,17 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on tika-parsers instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>Unless you use a dependency manager tool like <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a>, the easiest way to use Tika is to include either the tika-core or the tika-app jar in your classpath, depending on whether you want just the core functionality or also all the parser implementations.</p> @@ -142,8 +138,7 @@ <!-- or: --> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -213,15 +208,13 @@ Description: Use the "--server" (or "-s") option to start the Apache Tika server. The server will listen to the - ports you specify as one or more arguments. -</pre></div> + ports you specify as one or more arguments.</pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> <div class="section"> <h2><a name="Wrappers"></a>Wrappers</h2> <p>Several wrappers are available to use Tika in another programming language, such as <a class="externalLink" href="https://github.com/aviks/Taro.jl">Julia</a> or <a class="externalLink" href="https://github.com/chrismattmann/tika-python">Python</a>.</p></div> Modified: tika/site/publish/1.8/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.8/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.8/gettingstarted.html (original) +++ tika/site/publish/1.8/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 6 or higher to build Tika.</p></div> <div class="section"> @@ -116,20 +115,17 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on tika-parsers instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>Unless you use a dependency manager tool like <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a>, the easiest way to use Tika is to include either the tika-core or the tika-app jar in your classpath, depending on whether you want just the core functionality or also all the parser implementations.</p> @@ -142,8 +138,7 @@ <!-- or: --> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -213,15 +208,13 @@ Description: Use the "--server" (or "-s") option to start the Apache Tika server. The server will listen to the - ports you specify as one or more arguments. -</pre></div> + ports you specify as one or more arguments.</pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> <div class="section"> <h2><a name="Wrappers"></a>Wrappers</h2> <p>Several wrappers are available to use Tika in another programming language, such as <a class="externalLink" href="https://github.com/aviks/Taro.jl">Julia</a> or <a class="externalLink" href="https://github.com/chrismattmann/tika-python">Python</a>.</p></div> Modified: tika/site/publish/1.9/gettingstarted.html URL: http://svn.apache.org/viewvc/tika/site/publish/1.9/gettingstarted.html?rev=1867122&r1=1867121&r2=1867122&view=diff ============================================================================== --- tika/site/publish/1.9/gettingstarted.html (original) +++ tika/site/publish/1.9/gettingstarted.html Wed Sep 18 13:50:40 2019 @@ -89,11 +89,10 @@ <p>This document describes how to build Apache Tika from sources and how to start using Tika in an application.</p></div> <div class="section"> <h2><a name="Getting_and_building_the_sources"></a>Getting and building the sources</h2> -<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../source-repository.html">checkout</a> the latest sources from version control.</p> +<p>To build Tika from sources you first need to either <a href="../download.html">download</a> a source release or <a href="../contribute.html#Source_Code">checkout</a> the latest sources from version control.</p> <p>Once you have the sources, you can build them using the <a class="externalLink" href="http://maven.apache.org/">Maven 2</a> build system. Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository.</p> <div> -<pre>mvn install -</pre></div> +<pre>mvn install</pre></div> <p>See the Maven documentation for more information about the available build options.</p> <p>Note that you need Java 6 or higher to build Tika.</p></div> <div class="section"> @@ -118,20 +117,17 @@ <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>If you want to use Tika to parse documents (instead of simply detecting document types, etc.), you'll want to depend on tika-parsers instead: </p> <div> <pre> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>...</version> - </dependency> -</pre></div> + </dependency></pre></div> <p>Note that adding this dependency will introduce a number of transitive dependencies to your project, including one on tika-core. You need to make sure that these dependencies won't conflict with your existing project dependencies. You can use the following command in the tika-parsers directory to get a full listing of all the dependencies.</p> <div> -<pre>$ mvn dependency:tree | grep :compile -</pre></div></div> +<pre>$ mvn dependency:tree | grep :compile</pre></div></div> <div class="section"> <h2><a name="Using_Tika_in_an_Ant_project"></a>Using Tika in an Ant project</h2> <p>Unless you use a dependency manager tool like <a class="externalLink" href="http://ant.apache.org/ivy/">Apache Ivy</a>, the easiest way to use Tika is to include either the tika-core or the tika-app jar in your classpath, depending on whether you want just the core functionality or also all the parser implementations.</p> @@ -144,8 +140,7 @@ <!-- or: --> <pathelement location="path/to/tika-app-${tika.version}.jar"/> -</classpath> -</pre></div></div> +</classpath></pre></div></div> <div class="section"> <h2><a name="Using_Tika_as_a_command_line_utility"></a>Using Tika as a command line utility</h2> <p>The Tika application jar (tika-app-*.jar) can be used as a command line utility for extracting text content and metadata from all sorts of files. This runnable jar contains all the dependencies it needs, so you don't need to worry about classpath settings to run it.</p> @@ -215,15 +210,13 @@ Description: Use the "--server" (or "-s") option to start the Apache Tika server. The server will listen to the - ports you specify as one or more arguments. -</pre></div> + ports you specify as one or more arguments.</pre></div> <p>You can also use the jar as a component in a Unix pipeline or as an external tool in many scripting languages.</p> <div> <pre># Check if an Internet resource contains a specific keyword curl http://.../document.doc \ | java -jar tika-app.jar --text \ - | grep -q keyword -</pre></div></div> + | grep -q keyword</pre></div></div> <div class="section"> <h2><a name="Wrappers"></a>Wrappers</h2> <p>Several wrappers are available to use Tika in another programming language, such as <a class="externalLink" href="https://github.com/aviks/Taro.jl">Julia</a> or <a class="externalLink" href="https://github.com/chrismattmann/tika-python">Python</a>.</p></div>
