Add an HTML2XHTML converter as Starter
--------------------------------------
Key: COCOON3-5
URL: https://issues.apache.org/jira/browse/COCOON3-5
Project: Cocoon 3
Issue Type: Improvement
Components: cocoon-optional
Affects Versions: 3.0.0-alpha-2
Reporter: Simone Tripodi
Assignee: Cocoon Developers Team
Priority: Minor
Fix For: 3.0.0-alpha-2
This pipeline starter component takes HTML content from a specified URL and
transforms it into XHTML or, at least, into a well-formed XML document.
The original document can then be processed in the pipeline in various ways:
* following links;
* implementing crawlers;
* easily transforming the original document into various other formats;
* etc.
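As a rough illustration of the "following links" case, here is a minimal
sketch of what becomes possible once the page is well-formed XML. It uses only
the JDK's standard XML APIs, not the Cocoon 3 API; the class name
LinkExtractor and the sample markup are made up for this example.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper, not part of Cocoon: collects link targets from XHTML.
public class LinkExtractor {

    // Parses a well-formed XHTML string and returns the href of every <a>.
    public static List<String> extractLinks(String xhtml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        Document doc = factory.newDocumentBuilder()
                .parse(new InputSource(new StringReader(xhtml)));

        // local-name() matches <a> whether or not the XHTML namespace is declared.
        NodeList hrefs = (NodeList) XPathFactory.newInstance().newXPath()
                .evaluate("//*[local-name()='a']/@href", doc, XPathConstants.NODESET);

        List<String> links = new ArrayList<>();
        for (int i = 0; i < hrefs.getLength(); i++) {
            links.add(hrefs.item(i).getNodeValue());
        }
        return links;
    }

    public static void main(String[] args) throws Exception {
        // Sample input standing in for the converter's output (an assumption).
        String xhtml = "<html xmlns=\"http://www.w3.org/1999/xhtml\"><body>"
                + "<p><a href=\"http://cocoon.apache.org/\">Cocoon</a> and "
                + "<a href=\"/docs/index.html\">docs</a></p></body></html>";
        for (String link : extractLinks(xhtml)) {
            System.out.println(link);
        }
    }
}
```

The same parse would fail on typical tag-soup HTML, which is why the
conversion step has to come first in the pipeline.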
I want to illustrate the need for this component with a use case: last week I
had to solve a particular problem, implementing a simple service that takes
the URL of an HTML page as input and transforms the page, through the Optimus
XSLT
(http://microformatique.com/optimus -
http://code.google.com/p/mf-optimus/source/browse/#svn/trunk/xsl), into an XML
document that contains the original document's Microformats in an
easier-to-parse format.
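The XSLT step of such a service could be sketched as follows, again with the
JDK's javax.xml.transform API rather than Cocoon components. The inline
stylesheet is a tiny stand-in written for this example, not the Optimus
stylesheet: it only pulls out elements whose class contains "fn". The class
name MicroformatSketch and the sample input are likewise assumptions.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

// Hypothetical sketch: applies an XSLT stylesheet to already-converted XHTML.
public class MicroformatSketch {

    // Stand-in stylesheet (NOT Optimus): lists the text of every "fn" element.
    static final String STYLESHEET =
            "<xsl:stylesheet version=\"1.0\""
            + " xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\""
            + " xmlns:h=\"http://www.w3.org/1999/xhtml\""
            + " exclude-result-prefixes=\"h\">"
            + "<xsl:output method=\"xml\" omit-xml-declaration=\"yes\"/>"
            + "<xsl:template match=\"/\"><names>"
            + "<xsl:for-each select=\"//h:*[contains(concat(' ',"
            + " normalize-space(@class), ' '), ' fn ')]\">"
            + "<name><xsl:value-of select=\"normalize-space(.)\"/></name>"
            + "</xsl:for-each></names></xsl:template>"
            + "</xsl:stylesheet>";

    // Runs the stylesheet over a well-formed XHTML string, returns the result.
    public static String extractNames(String xhtml) throws Exception {
        Transformer transformer = TransformerFactory.newInstance()
                .newTransformer(new StreamSource(new StringReader(STYLESHEET)));
        StringWriter out = new StringWriter();
        transformer.transform(new StreamSource(new StringReader(xhtml)),
                new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        // Sample hCard-style markup standing in for a converted page.
        String xhtml = "<html xmlns=\"http://www.w3.org/1999/xhtml\"><body>"
                + "<div class=\"vcard\"><span class=\"fn\">Jane Doe</span></div>"
                + "</body></html>";
        System.out.println(extractNames(xhtml));
    }
}
```

An XSLT processor rejects input that is not well-formed, so without the
HTML-to-XHTML conversion this step cannot run on ordinary web pages.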