Author: nick
Date: Sun May 26 17:02:45 2013
New Revision: 1486433
URL: http://svn.apache.org/r1486433
Log:
Update site publish
Modified:
poi/site/publish/subversion.html
poi/site/publish/text-extraction.html
Modified: poi/site/publish/subversion.html
URL:
http://svn.apache.org/viewvc/poi/site/publish/subversion.html?rev=1486433&r1=1486432&r2=1486433&view=diff
==============================================================================
--- poi/site/publish/subversion.html (original)
+++ poi/site/publish/subversion.html Sun May 26 17:02:45 2013
@@ -255,11 +255,21 @@ if (VERSION > 3) {
<p>
Most users of the source code probably don't need to have day to
- day access to the source code as it changes. For these users we
- provide easy to unpack source code from releases via our
+ day access to the source code as it changes. Most users will want
+ to make use of our <a href="download.html">source release</a>
+ packages, which contain the complete source tree for each binary
+ release, suitable for browsing or debugging. These source releases
+ are available from our
<a href="download.html">download page.</a>
</p>
+
+<p>
+ The Apache POI sourcecode is also available as source artifacts
+ in the Maven Central repository, which may be helpful for those
+ users who make use of POI and wish to inspect the source (eg when
+ debugging in an IDE).
+ </p>
<a name="Access+the+Version+Controlled+Source+Code"></a>
@@ -268,18 +278,18 @@ if (VERSION > 3) {
</div>
<p>
- For information on connecting to the ASF Subversion repositories,
- see the
+ For general information on connecting to the ASF Subversion,
+ repositories, see the
<a href="http://www.apache.org/dev/version-control.html">version
control page.</a>
</p>
<p>Subversion is an open-source version control system. It has been
contributed to the Apache Software Foundation and is
- now available <a
href="http://incubator.apache.org/projects/subversion.html">here</a>.
+ now available <a href="http://subversion.apache.org/">here</a>.
</p>
<p>
- The root url of the ASF Subversion repository is
+ The root url of the ASF Subversion repository is
<a
href="http://svn.apache.org/repos/asf/">http://svn.apache.org/repos/asf/</a>
for non-committers and
<a
href="https://svn.apache.org/repos/asf/">https://svn.apache.org/repos/asf/</a>
@@ -324,11 +334,32 @@ if (VERSION > 3) {
</div>
<p>
- Git read-only access to POI sources is now available.
- Please see the <a href="http://git.apache.org/">Git at Apache</a> page
for details.
- Git Clone URL: <a
href="git://git.apache.org/poi.git">git://git.apache.org/poi.git</a>
- and Http Clone URL: <a
href="http://git.apache.org/poi.git">http://git.apache.org/poi.git</a>.
+ The master source repository for Apache POI is the Subversion
+ one listed above. To support those users and developers who prefer
+ to use the Git tooling, read-only access to the POI source tree is
+ also available via Git. The Git mirrors normally track SVN to
+ within a few minutes.
+ </p>
+
+<p>
+ The official read-only Git repository for Apache POI is available
+ from <a href="http://git.apache.org/">git.apache.org/</a> .
+ The Git Clone URL is: <a
href="git://git.apache.org/poi.git">git://git.apache.org/poi.git</a>
+ and Http Clone URL: <a
href="http://git.apache.org/poi.git">http://git.apache.org/poi.git</a> .
+ Please see the <a href="http://git.apache.org/">Git at
+ Apache</a> page for more details on the service.
</p>
+
+<p>
+ In addition to the <a href="http://git.apache.org/">git.apache.org/</a>
+ repository, changes are also mirrored in near-realtime to GitHub.
+ The GitHub repository is available at
+ <a
href="https://github.com/apache/poi">https://github.com/apache/poi</a> .
+ Please note that the GitHub repository is read-only, and all
+ contributions should continue to be sent via Bugzilla for tracking.
+ (Git patches are fine though). Please see the
+ <a href="guidelines.html">contribution guidelines</a> for more
+ information on getting involved in the project.</p>
Modified: poi/site/publish/text-extraction.html
URL:
http://svn.apache.org/viewvc/poi/site/publish/text-extraction.html?rev=1486433&r1=1486432&r2=1486433&view=diff
==============================================================================
--- poi/site/publish/text-extraction.html (original)
+++ poi/site/publish/text-extraction.html Sun May 26 17:02:45 2013
@@ -253,15 +253,25 @@ if (VERSION > 3) {
<h3>Overview</h3>
</div>
-<p>Apache POI provides text extraction for all the supported file
- formats. In addition, it provides access to the metadata
- associated with a given file, such as title and author.</p>
+<p>For a number of years now, Apache POI has provided basic
+ text extraction for all the project supported file formats. In
+ addition, as well as the (plain) text, these provides access to
+ the metadata associated with a given file, such as title and
+ author.</p>
-<p>In addition to providing direct text extraction classes,
- POI works closely with the
- <a href="http://incubator.apache.org/tika/">Apache Tika</a>
- text extraction library. Users may wish to simply utilise
- the functionality provided by Tika.</p>
+<p>For more advanced text extraction needs, including Rich Text
+ extraction (such as formatting and styling), along with XML and
+ HTML output, Apache POI works closely with
+ <a href="http://tika.apache.org/">Apache Tika</a> to deliver
+ POI-powered Tika Parsers for all the project supported file formats.</p>
+
+<p>If you are after turn-key text extraction, including the latest
+ support, styles etc, you are strongly advised to make use of
+ <a href="http://tika.apache.org/">Apache Tika</a>, which builds
+ on top of POI to provide Text and Metadata extraction. If you wish
+ to have something very simple and stand-alone, or you wish to make
+ heavy modificiations, then the POI provided text extractors documented
+ below might be a better fit for your needs.</p>
@@ -289,16 +299,21 @@ if (VERSION > 3) {
-<a name="Text+Extractor+Factory+-+POI+3.5+or+later"></a>
+<a name="Text+Extractor+Factory"></a>
<div class="h3">
-<h3>Text Extractor Factory - POI 3.5 or later</h3>
+<h3>Text Extractor Factory</h3>
</div>
-<p>A new class in POI 3.5,
- <em>org.apache.poi.extractor.ExtractorFactory</em> provides a
+<p>As part of the addition of OOXML support in Apache POI 3.5, there
+ is a common class to select the appropriate POI text extractor for
+ you. <em>org.apache.poi.extractor.ExtractorFactory</em> provides a
similar function to WorkbookFactory. You simply pass it an
- InputStream, a file, a POIFSFileSystem or a OOXML Package. It
+ InputStream, a File, a POIFSFileSystem or a OOXML Package. It
figures out the correct text extractor for you, and returns it.</p>
+
+<p>For complete detection and text extractor auto-selection, users
+ are strongly encouraged to investigate
+ <a href="http://tika.apache.org/">Apache Tika</a>.</p>
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]