Hi all,

All Xalan developer and users, I wan to do something to push XSLT 2.0
implemention job for Xalan, i know this topic was discussed in mail list for
many many times in recent years, so many guys are expecting it come true.
But due to low activity in Apache Xalan community, no committer to push this
thing, it is really very regretful.

These days, i submited "Implement XSLT 2.0 and XPath 2.0 for Xalan" project
proposal as an GSoC[1] project. Now, i need your feedbacks to improve the
proposal, all the advises, suggestions are welcome :-)

I am not a Xalan committer(alrough i dream to be), but if there is a chance,
i want to try my best to so something for XSLT 2.0 implemention job for
Xalan, many i can not finish all the job in the summer, but i will make it
is available, once wheels start turn, i thing there may be some other guys
can work together with me to finish this big project. Hoping all the
comunity can work together for the better future of Xalan.

[1] http://socghop.appspot.com

=========Proposal details of "Implement XSLT 2.0 and XPath 2.0 for
Xalan"================

Apache Xalan-Java is a powful XSLT processor for XML transforming based on
W3C recommendation XSL Transformations (XSLT) Version 1.0 and XML Path
Language (XPath) Version 1.0, XSL Transformations (XSLT) Version 2.0
specification was published on 23 January 2007,this new version give us more
powful functions, new dat model and serialization etc.



In my daily work, i use XSLT and XPath to parse XML and change XML document
to another format, i did this with Xalan.As we know,Xalan support XSLT
version 1.0 and XPath version 1.0 only.But my XML parse job is so complex,
and usually, i must use some XPath v2.0 functions to finish my parse job.
So, i encountered trouble.



Then I Google keywords with Xalan and XSLT 2.0,i found many developers hope
that Xalan based XSLT 2.0 could become reality besides me. At the same time,
i discuss this topic with open source developers in mail list, so many guys
are interested in implementing complete XSTL 2.0 specification for Xalan. We
discuss this project deeply, and i got many good ideas and advises from the
discussion. And also they told me Xalan lost its most active contributors a
few years ago due to time demands from Real Jobs, and development slowed
down as a result,but they love to see Xalan tackle it.Due to XSLT 2.0 and
XPath 2.0 are large specifications that I imagine could take the developers
years to implement correctly,anyway it need some guy get the ball rolling, i
want to be this guy.



Compared with XSLT 1.0, XSLT 2.0 and XPath 2.0 have some important new
characteristic.



New features in XPath 2.0:

   1. Sequences, in XPath2.0 everything is sequences
   2. For expression,in XPath 2.0,we can use for expression to iterator
every item in the sequence, compute expression value for each item, at last
return the sequence result by connect all the expression values.
   3. Condition expression, XPath 2.0's condition expression can be used to
compute different value on the basis of the condition's true or false value.
   4. Limited expression,limited expression in XPath 2.0 can be used to
judge that whether every item in the sequence satisfy appointed condition,
its value is always true of false. There are two kinds of limited
expression: some expression and every expression.
   5. Datatype,XPath 2.0's datetype is based on XML Schema, it supports all
the basic built-in datatype,such as xs:integer,xs:string and xs:date ect.
   6. Date and time, XSLT 1.0 has not date and time datatype, it must use
string to represent date and time. Due to XPath 2.0 system is based on XML
schema, so there is date and time data type in XPath 2.0.
   7. More functions support,XSLT 2.0 specification has more powful
functions definition.Functions in XPath 2.0 is defined in a special
recommedation "XQuery 1.0 and XPath 2.0 Functions and Operations", functions
in this recommendation specification belong to namespace "
http://www.w3.org/2005/xpath-functions";, this namespace is bound with prefix
"fn".

 It includes these serveral types:

    * Constructor Functions
    * Functions and Operators on Numerics
    * Functions on Strings
    * Functions on anyURI
    * Functions and Operators on Boolean Values
    * Functions and Operators on Durations, Dates and Times
    * Functions Related to QNames
    * Operators on base64Binary and hexBinary
    * Operators on NOTATION
    * Functions and Operators on Nodes
    * Functions and Operators on Sequences
    * Context Functions

There are also many new features in XSLT 2.0:

   1. Group, basic group syntax,group sort,group-adjacent and
group-starting-with etc.
   2. Connotive document node, or we can call it temporary tree
   3. Element result-document, we can use to generate multi output file in
XPath 2.0
   4. Improvement of element value-of
   5. Char mapping,XSLT 2.0 supply more flexible solution for us to handler
specific characters in XML such as '<' and '&'
   6. Custom stylesheet function,in XSLT 2.0, we can use to create our own
functions in stylesheet
   7. Other new features.

In order to implement all the XSLT 2.0 specification,it requires some
moderately basic data structure changes to track Schema types and to handle
Sequences which aren't simply nodesets,and also,i must implements all the
new functions defined in XSLT 2.0,fortunately,there is a open source XPath
2.0 processor (http://wiki.eclipse.org/PsychoPathXPathProcessor) which is at
Eclipse.It is already being used by Xerces-J for XSD 1.1 Assertion
support.Why re-invent the wheel if we can leverage something that already
exists ? PsychoPath already is XML Schema Aware, leverages Xerces-J(
http://xerces.apache.org/xerces2-j) and passes about 99.9% of the Xpath 2.0
test suite. It is currently undergoing testing against the the official W3C
Test Suite.



So, in order to implement all the XSLT 2.0 specification for Xalan, there
are two steps:

   1. Merge Xalan codes with PsychoPath codes,this will help us a lot,once
this job is done, Xalan has implemented all the XPath 2.0 features. There
are some issues to resolve during this merging process,for example:
          * PsychoPath is not "streamy" yet, nor is Xalan, it is a big
challenge
          * In PsychoPath,type informations is not part of current
interfaces,i will have to extend or replace the DTM interfaces to carry the
additional data
          * XSLTC fit in problem. XSLTC runs against the same data model
interfaces (DTM), and In Theory implementing compilation should just be a
matter of generating code equivalent to what the interpreter is doing. In
practice, it's a lot of detail work.
   2. Implement all the other XSLT 2.0 features which were described before
for Xalan.

In my plan, i will hold the whole XSLT 2.0 implemention project as a long
term open source project under Apache, do it step by step, may be as a GSoC
project,i can only implement a subset of this capability i was thinking of
working on,at lease, i will finish the following tasks:

   1. Merge Xalan codes with PsychoPath codes, i will do this first, but
this is not a simple task,may be it will cost me one month or even more to
merge it correctly.
   2. Resolve merge problems between Xalan and PsychoPath
   3. Implement XSLT 2.0 Group feature for Xalan
   4. Implement XSLT 2.0 temporary tree feature for Xalan
   5. Implement Element result-document feature for Xalan

If everything goes well and i have enough time,i will do my best to do more
job. After the summer, i will continue with this project,try to submit my
codes to Xalan as proposed changes and try to become one of Apache
committers,this is my dream.And i know this is a huge project,but where
there is a will there is a success, I love open source, i will do my best to
improve it through my attemp and ability.


Regards

Victor Wang

Reply via email to