ovidiu      02/01/17 10:09:12

  Added:       src/scratchpad/schecoon/scheme sxml.scm
  Log:
  Added. Compiled collection of SXML functions.
  
  Revision  Changes    Path
  1.1                  xml-cocoon2/src/scratchpad/schecoon/scheme/sxml.scm
  
  Index: sxml.scm
  ===================================================================
  ;             XML processing in Scheme
  ;         SXML to XML/HTML conversions and
  ;   SXSLT, XML Stylesheet Language Transformations in Scheme
  ;
  ; Author: Oleg Kiselyov <[EMAIL PROTECTED]>
  ; Original location: http://okmij.org/ftp/Scheme/SXmanip.scm
  ;                    http://okmij.org/ftp/Scheme/SXPath.scm
  ;
  ; Collected by Ovidiu Predescu <[EMAIL PROTECTED]> for Cocoon
  
  
  ;; First some definitions for SISC, which doesn't support
  ;; define-macro, which is used throught the SXML implementation.
  (define-syntax assert
    (syntax-rules ()
     ((_) #t)
     ((_ x ...)
      (and (or x (error "failed assertion ~s" `x)) ...))))
  
  (define (-- n) (- n 1))
  
  ;                     XML processing in Scheme
  ;                  SXPath -- SXML Query Language
  ;
  ; This file is examples-stripped version of Oleg Kiselyov's 
  ; SXPath.scm,v 3.5 2001/01/12 23:20:35 oleg Exp oleg
  ; 
  ; Differences from the original sxpath:
  ;
  ; Function "filter" is renamed to "sxp:filter" for the sake of
  ; compatibility with SRFI-1
  ;
  ; 1. Criterion '*' doesn't accept COMMENT, ENTITY and NAMESPACES nodes
  ;  any more
  ;
  ; 2. node-typeof? may accept a list of names. A node is accepted if its
  ;      name is in this list
  ;
  ; 3. sxpath has an additional rewriting rule for 'or':
  ; '(or@ name1 ... ) will be  
  ; 
  ; 4. Criterion *data* introduced
  ;
  ;         Kirill Lisovsky 
  ;         [EMAIL PROTECTED]
  ;
  ;                                 *  *  *
  ;
  ; SXPath is a query language for SXML, an instance of XML Information
  ; set (Infoset) in the form of s-expressions. See SSAX.scm for the
  ; definition of SXML and more details. SXPath is also a translation into
  ; Scheme of an XML Path Language, XPath:
  ;     http://www.w3.org/TR/xpath
  ; XPath and SXPath describe means of selecting a set of Infoset's items
  ; or their properties.
  ;
  ; To facilitate queries, XPath maps the XML Infoset into an explicit
  ; tree, and introduces important notions of a location path and a
  ; current, context node. A location path denotes a selection of a set of
  ; nodes relative to a context node. Any XPath tree has a distinguished,
  ; root node -- which serves as the context node for absolute location
  ; paths. Location path is recursively defined as a location step joined
  ; with a location path. A location step is a simple query of the
  ; database relative to a context node. A step may include expressions
  ; that further filter the selected set. Each node in the resulting set
  ; is used as a context node for the adjoining location path. The result
  ; of the step is a union of the sets returned by the latter location
  ; paths.
  ;
  ; The SXML representation of the XML Infoset (see SSAX.scm) is rather
  ; suitable for querying as it is. Bowing to the XPath specification,
  ; we will refer to SXML information items as 'Nodes':
  ;     <Node> ::= <Element> | <attributes-coll> | <attrib>
  ;                | "text string" | <PI>
  ; This production can also be described as
  ;     <Node> ::= (name . <Nodeset>) | "text string"
  ; An (ordered) set of nodes is just a list of the constituent nodes:
  ;     <Nodeset> ::= (<Node> ...)
  ; Nodesets, and Nodes other than text strings are both lists. A
  ; <Nodeset> however is either an empty list, or a list whose head is not
  ; a symbol.  A symbol at the head of a node is either an XML name (in
  ; which case it's a tag of an XML element), or an administrative name
  ; such as '@'.  This uniform list representation makes processing rather
  ; simple and elegant, while avoiding confusion. The multi-branch tree
  ; structure formed by the mutually-recursive datatypes <Node> and
  ; <Nodeset> lends itself well to processing by functional languages.
  ;
  ; A location path is in fact a composite query over an XPath tree or
  ; its branch. A singe step is a combination of a projection, selection
  ; or a transitive closure. Multiple steps are combined via join and
  ; union operations. This insight allows us to _elegantly_ implement
  ; XPath as a sequence of projection and filtering primitives --
  ; converters -- joined by _combinators_. Each converter takes a node
  ; and returns a nodeset which is the result of the corresponding query
  ; relative to that node. A converter can also be called on a set of
  ; nodes. In that case it returns a union of the corresponding queries over
  ; each node in the set. The union is easily implemented as a list
  ; append operation as all nodes in a SXML tree are considered
  ; distinct, by XPath conventions. We also preserve the order of the
  ; members in the union. Query combinators are high-order functions:
  ; they take converter(s) (which is a Node|Nodeset -> Nodeset function)
  ; and compose or otherwise combine them. We will be concerned with
  ; only relative location paths [XPath]: an absolute location path is a
  ; relative path applied to the root node.
  ;
  ; Similarly to XPath, SXPath defines full and abbreviated notations
  ; for location paths. In both cases, the abbreviated notation can be
  ; mechanically expanded into the full form by simple rewriting
  ; rules. In case of SXPath the corresponding rules are given as
  ; comments to a sxpath function, below. The regression test suite at
  ; the end of this file shows a representative sample of SXPaths in
  ; both notations, juxtaposed with the corresponding XPath
  ; expressions. Most of the samples are borrowed literally from the
  ; XPath specification, while the others are adjusted for our running
  ; example, tree1.
  ;
  ; $Id: sxml.scm,v 1.1 2002/01/17 18:09:12 ovidiu Exp $
  
  (define (nodeset? x)
    (or (and (pair? x) (not (symbol? (car x)))) (null? x)))
  
  ;-------------------------
  ; Basic converters and applicators
  ; A converter is a function
  ;     type Converter = Node|Nodeset -> Nodeset
  ; A converter can also play a role of a predicate: in that case, if a
  ; converter, applied to a node or a nodeset, yields a non-empty
  ; nodeset, the converter-predicate is deemed satisfied. Throughout
  ; this file a nil nodeset is equivalent to #f in denoting a failure.
  
  
  ; The following function implements a 'Node test' as defined in
  ; Sec. 2.3 of XPath document. A node test is one of the components of a
  ; location step. It is also a converter-predicate in SXPath.
  ;
  ; The function node-typeof? takes a type criterion and returns
  ; a function, which, when applied to a node, will tell if the node satisfies
  ; the test.
  ;     node-typeof? :: Crit -> Node -> Boolean
  ;
  ; The criterion 'crit' is 
  ;  one of the following symbols:
  ;     id              - tests if the Node has the right name (id)
  ;     @               - tests if the Node is an <attributes-list>
  ;     *               - tests if the Node is an <Element>
  ;     *text*          - tests if the Node is a text node
  ;     *data*          - tests if the Node is a data node 
  ;                         (text, number, boolean, etc)
  ;     *PI*            - tests if the Node is a PI node
  ;     *COMMENT*       - tests if the Node is a COMMENT node
  ;     *ENTITY*        - tests if the Node is a ENTITY node
  ;     *any*           - #t for any type of Node
  ;  or list of symbols
  ;     (crit-symbol 
  ;           ...)      - node-typeof? will return if the Node satisfies
  ;                         at least one of the criterions listed.
  ;                         This list of 
  (define (node-typeof? crit)
      (define (present? x)
        ((if (list? crit)
         memq 
         eq?)
         x crit))
    (lambda (node)
      (cond 
        ((pair? node)
         (or
         (present? '*any*)
         ; The line bleow handles id, *PI*, *COMMENT*, etc.
         (present? (car node))
         (and (present? '*)
              (not (memq (car node) 
                         '(@ *PI* *COMMENT* *ENTITY* *NAMESPACES*))))
         (and (present? '*data*) (not (list? node)))
         ))
        ((string? node) 
         (or 
         (present? '*any*)
         (present? '*text*)
         (present? '*data*)))
        (else
        (or
          (present? '*any*)
          (and (present? '*data*) (not (list? node))))))
      ))        
  
  
   ;; The function node-typeof? takes a type criterion and returns a function,
   ;; which, when applied to a node, will tell if the node satisfies
   ;; the test.
   ;;   node-typeof? :: Crit -> Node -> Boolean
   ;;
   ;; The criterion 'crit' is a symbol, one of the following:
   ;;   id              - tests if the Node has the right name (id)
   ;;   @               - tests if the Node is an <attributes-coll>
   ;;   *               - tests if the Node is an <Element>
   ;;   *text*          - tests if the Node is a text node
   ;;   *PI*            - tests if the Node is a PI node
   ;;   *any*           - #t for any type of Node
   ;(define (node-typeof? crit)
   ;  (lambda (node)
   ;    (case crit
   ;      ((*) (and (pair? node) 
   ;            (not (memq (car node) 
   ;                       '(@ *PI* *COMMENT* *ENTITY* *NAMESPACES*)))))
   ;      ((*any*) #t)
   ;      ((*text*) (string? node))
   ;      ((*data*) (not (list? node)))
   ;      (else
   ;    (and (pair? node) 
   ;         ((if (list? crit)
   ;            memq 
   ;            eq?)
   ;          (car node) crit)))
   ;      )))
   ;
   ;(define (node-typeof? crit)
   ;  (lambda (node)
   ;    (case crit
   ;      ((*) (and (pair? node) (not (memq (car node) '(@ *PI*)))))
   ;      ((*any*) #t)
   ;      ((*text*) (string? node))
   ;      (else
   ;       (and (pair? node) (eq? crit (car node))))
   ;)))
  
  
  ; Curried equivalence converter-predicates
  (define (node-eq? other)
    (lambda (node)
      (eq? other node)))
  
  (define (node-equal? other)
    (lambda (node)
      (equal? other node)))
  
  ; node-pos:: N -> Nodeset -> Nodeset, or
  ; node-pos:: N -> Converter
  ; Select the N'th element of a Nodeset and return as a singular Nodeset;
  ; Return an empty nodeset if the Nth element does not exist.
  ; ((node-pos 1) Nodeset) selects the node at the head of the Nodeset,
  ; if exists; ((node-pos 2) Nodeset) selects the Node after that, if
  ; exists.
  ; N can also be a negative number: in that case the node is picked from
  ; the tail of the list.
  ; ((node-pos -1) Nodeset) selects the last node of a non-empty nodeset;
  ; ((node-pos -2) Nodeset) selects the last but one node, if exists.
  
  (define (node-pos n)
    (lambda (nodeset)
      (cond
       ((not (nodeset? nodeset)) '())
       ((null? nodeset) nodeset)
       ((eqv? n 1) (list (car nodeset)))
       ((negative? n) ((node-pos (+ n 1 (length nodeset))) nodeset))
       (else
        (assert (positive? n))
        ((node-pos (-- n)) (cdr nodeset))))))
  
  ; filter:: Converter -> Converter
  ; A filter applicator, which introduces a filtering context. The argument
  ; converter is considered a predicate, with either #f or nil result meaning
  ; failure.
  (define (sxp:filter pred?)
    (lambda (lst)       ; a nodeset or a node (will be converted to a singleton nset)
      (let loop ((lst (if (nodeset? lst) lst (list lst))) (res '()))
        (if (null? lst)
          (reverse res)
          (let ((pred-result (pred? (car lst))))
            (loop (cdr lst)
                  (if (and pred-result (not (null? pred-result)))
                      (cons (car lst) res)
                      res)))))))
  
  ; take-until:: Converter -> Converter, or
  ; take-until:: Pred -> Node|Nodeset -> Nodeset
  ; Given a converter-predicate and a nodeset, apply the predicate to
  ; each element of the nodeset, until the predicate yields anything but #f or
  ; nil. Return the elements of the input nodeset that have been processed
  ; till that moment (that is, which fail the predicate).
  ; take-until is a variation of the filter above: take-until passes
  ; elements of an ordered input set till (but not including) the first
  ; element that satisfies the predicate.
  ; The nodeset returned by ((take-until (not pred)) nset) is a subset -- 
  ; to be more precise, a prefix -- of the nodeset returned by
  ; ((filter pred) nset)
  
  (define (take-until pred?)
    (lambda (lst)       ; a nodeset or a node (will be converted to a singleton nset)
      (let loop ((lst (if (nodeset? lst) lst (list lst))))
        (if (null? lst) lst
          (let ((pred-result (pred? (car lst))))
            (if (and pred-result (not (null? pred-result)))
                '()
                (cons (car lst) (loop (cdr lst)))))
          ))))
  
  
  ; take-after:: Converter -> Converter, or
  ; take-after:: Pred -> Node|Nodeset -> Nodeset
  ; Given a converter-predicate and a nodeset, apply the predicate to
  ; each element of the nodeset, until the predicate yields anything but #f or
  ; nil. Return the elements of the input nodeset that have not been processed:
  ; that is, return the elements of the input nodeset that follow the first
  ; element that satisfied the predicate.
  ; take-after along with take-until partition an input nodeset into three
  ; parts: the first element that satisfies a predicate, all preceding
  ; elements and all following elements.
  
  (define (take-after pred?)
    (lambda (lst)       ; a nodeset or a node (will be converted to a singleton nset)
      (let loop ((lst (if (nodeset? lst) lst (list lst))))
        (if (null? lst) lst
          (let ((pred-result (pred? (car lst))))
            (if (and pred-result (not (null? pred-result)))
                (cdr lst)
                (loop (cdr lst))))
          ))))
  
  ; Apply proc to each element of lst and return the list of results.
  ; if proc returns a nodeset, splice it into the result
  ;
  ; From another point of view, map-union is a function Converter->Converter,
  ; which places an argument-converter in a joining context.
  
  (define (map-union proc lst)
    (if (null? lst) lst
        (let ((proc-res (proc (car lst))))
        ((if (nodeset? proc-res) append cons)
         proc-res (map-union proc (cdr lst))))))
  
  ; node-reverse :: Converter, or
  ; node-reverse:: Node|Nodeset -> Nodeset
  ; Reverses the order of nodes in the nodeset
  ; This basic converter is needed to implement a reverse document order
  ; (see the XPath Recommendation).
  (define node-reverse 
    (lambda (node-or-nodeset)
      (if (not (nodeset? node-or-nodeset)) (list node-or-nodeset)
        (reverse node-or-nodeset))))
  
  ; node-trace:: String -> Converter
  ; (node-trace title) is an identity converter. In addition it prints out
  ; a node or nodeset it is applied to, prefixed with the 'title'.
  ; This converter is very useful for debugging.
  
  (define (node-trace title)
    (lambda (node-or-nodeset)
      (display "\n-->")
      (display title)
      (display " :")
      (pretty-print node-or-nodeset)
      node-or-nodeset))
  
  
  ;-------------------------
  ; Converter combinators
  ;
  ; Combinators are higher-order functions that transmogrify a converter
  ; or glue a sequence of converters into a single, non-trivial
  ; converter. The goal is to arrive at converters that correspond to
  ; XPath location paths.
  ;
  ; From a different point of view, a combinator is a fixed, named
  ; _pattern_ of applying converters. Given below is a complete set of
  ; such patterns that together implement XPath location path
  ; specification. As it turns out, all these combinators can be built
  ; from a small number of basic blocks: regular functional composition,
  ; map-union and filter applicators, and the nodeset union.
  
  
  
  ; select-kids:: Pred -> Node -> Nodeset
  ; Given a Node, return an (ordered) subset its children that satisfy
  ; the Pred (a converter, actually)
  ; select-kids:: Pred -> Nodeset -> Nodeset
  ; The same as above, but select among children of all the nodes in
  ; the Nodeset
  ;
  ; More succinctly, the signature of this function is
  ; select-kids:: Converter -> Converter
  
  (define (select-kids test-pred?)
    (lambda (node)              ; node or node-set
      (cond 
       ((null? node) node)
       ((not (pair? node)) '())   ; No children
       ((symbol? (car node))
        ((sxp:filter test-pred?) (cdr node)))   ; it's a single node
       (else (map-union (select-kids test-pred?) node)))))
  
  
  ; node-self:: Pred -> Node -> Nodeset, or
  ; node-self:: Converter -> Converter
  ; Similar to select-kids but apply to the Node itself rather
  ; than to its children. The resulting Nodeset will contain either one
  ; component, or will be empty (if the Node failed the Pred).
  (define node-self sxp:filter)
  
  
  ; node-join:: [LocPath] -> Node|Nodeset -> Nodeset, or
  ; node-join:: [Converter] -> Converter
  ; join the sequence of location steps or paths as described
  ; in the title comments above.
  (define (node-join . selectors)
    (lambda (nodeset)           ; Nodeset or node
      (let loop ((nodeset nodeset) (selectors selectors))
        (if (null? selectors) nodeset
          (loop 
           (if (nodeset? nodeset)
               (map-union (car selectors) nodeset)
               ((car selectors) nodeset))
           (cdr selectors))))))
  
  
  ; node-reduce:: [LocPath] -> Node|Nodeset -> Nodeset, or
  ; node-reduce:: [Converter] -> Converter
  ; A regular functional composition of converters.
  ; From a different point of view,
  ;    ((apply node-reduce converters) nodeset)
  ; is equivalent to
  ;    (foldl apply nodeset converters)
  ; i.e., folding, or reducing, a list of converters with the nodeset
  ; as a seed.
  (define (node-reduce . converters)
    (lambda (nodeset)           ; Nodeset or node
      (let loop ((nodeset nodeset) (converters converters))
        (if (null? converters) nodeset
          (loop ((car converters) nodeset) (cdr converters))))))
  
  
  ; node-or:: [Converter] -> Converter
  ; This combinator applies all converters to a given node and
  ; produces the union of their results.
  ; This combinator corresponds to a union, '|' operation for XPath
  ; location paths.
  ; (define (node-or . converters)
  ;   (lambda (node-or-nodeset)
  ;     (if (null? converters) node-or-nodeset
  ;     (append 
  ;      ((car converters) node-or-nodeset)
  ;      ((apply node-or (cdr converters)) node-or-nodeset)))))
  ; More optimal implementation follows
  (define (node-or . converters)
    (lambda (node-or-nodeset)
      (let loop ((result '()) (converters converters))
        (if (null? converters) result
          (loop (append result (or ((car converters) node-or-nodeset) '()))
                (cdr converters))))))
  
  
  ; node-closure:: Converter -> Converter
  ; Select all _descendants_ of a node that satisfy a converter-predicate.
  ; This combinator is similar to select-kids but applies to
  ; grand... children as well.
  ; This combinator implements the "descendant::" XPath axis
  ; Conceptually, this combinator can be expressed as
  ; (define (node-closure f)
  ;      (node-or
  ;        (select-kids f)
  ;      (node-reduce (select-kids (node-typeof? '*)) (node-closure f))))
  ; This definition, as written, looks somewhat like a fixpoint, and it
  ; will run forever. It is obvious however that sooner or later
  ; (select-kids (node-typeof? '*)) will return an empty nodeset. At
  ; this point further iterations will no longer affect the result and
  ; can be stopped.
  
  (define (node-closure test-pred?)         
    (lambda (node)              ; Nodeset or node
      (let loop ((parent node) (result '()))
        (if (null? parent) result
          (loop ((select-kids (node-typeof? '*)) parent)
                (append result
                        ((select-kids test-pred?) parent)))
          ))))
  
  ; node-parent:: RootNode -> Converter
  ; (node-parent rootnode) yields a converter that returns a parent of a
  ; node it is applied to. If applied to a nodeset, it returns the list
  ; of parents of nodes in the nodeset. The rootnode does not have
  ; to be the root node of the whole SXML tree -- it may be a root node
  ; of a branch of interest.
  ; Given the notation of Philip Wadler's paper on semantics of XSLT,
  ;  parent(x) = { y | y=subnode*(root), x=subnode(y) }
  ; Therefore, node-parent is not the fundamental converter: it can be
  ; expressed through the existing ones.  Yet node-parent is a rather
  ; convenient converter. It corresponds to a parent:: axis of SXPath.
  ; Note that the parent:: axis can be used with an attribute node as well!
  
  (define (node-parent rootnode)
    (lambda (node)              ; Nodeset or node
      (if (nodeset? node) (map-union (node-parent rootnode) node)
        (let ((pred
               (node-or
                (node-reduce
                 (node-self (node-typeof? '*))
                 (select-kids (node-eq? node)))
                (node-join
                 (select-kids (node-typeof? '@))
                 (select-kids (node-eq? node))))))
          ((node-or
            (node-self pred)
            (node-closure pred))
           rootnode)))))
  
  ;-------------------------
  ; Evaluate an abbreviated SXPath
  ;     sxpath:: AbbrPath -> Converter, or
  ;     sxpath:: AbbrPath -> Node|Nodeset -> Nodeset
  ; AbbrPath is a list. It is translated to the full SXPath according
  ; to the following rewriting rules
  ; (sxpath '()) -> (node-join)
  ; (sxpath '(path-component ...)) ->
  ;             (node-join (sxpath1 path-component) (sxpath '(...)))
  ; (sxpath1 '//) -> (node-or 
  ;                  (node-self (node-typeof? '*any*))
  ;                   (node-closure (node-typeof? '*any*)))
  ; (sxpath1 '(equal? x)) -> (select-kids (node-equal? x))
  ; (sxpath1 '(eq? x))    -> (select-kids (node-eq? x))
  ; (sxpath1 '(or@ ...))  -> (select-kids (node-typeof?
  ;                                          (cdr '(or@ ...))))
  ; (sxpath1 ?symbol)     -> (select-kids (node-typeof? ?symbol))
  ; (sxpath1 procedure)   -> procedure
  ; (sxpath1 '(?symbol ...)) -> (sxpath1 '((?symbol) ...))
  ; (sxpath1 '(path reducer ...)) ->
  ;             (node-reduce (sxpath path) (sxpathr reducer) ...)
  ; (sxpathr number)      -> (node-pos number)
  ; (sxpathr path-filter) -> (filter (sxpath path-filter))
  
  (define (sxpath path)
    (lambda (nodeset)
      (let loop ((nodeset nodeset) (path path))
      (cond
       ((null? path) nodeset)
       ; (or@ ... ) handler
       ((and (list? (car path)) 
           (not (null? (car path)))
           (eq? 'or@ (caar path)))
        (loop ((select-kids (node-typeof? (cdar path))) nodeset)
            (cdr path)))
       ((nodeset? nodeset)
        (map-union (sxpath path) nodeset))
       ((procedure? (car path))
        (loop ((car path) nodeset) (cdr path)))
       ((eq? '// (car path))
        (loop
         ((if (nodeset? nodeset) append cons) nodeset
        ((node-closure (node-typeof? '*any*)) nodeset))
         (cdr path)))
       ((symbol? (car path))
        (loop ((select-kids (node-typeof? (car path))) nodeset)
            (cdr path)))
       ((and (pair? (car path)) (eq? 'equal? (caar path)))
        (loop ((select-kids (apply node-equal? (cdar path))) nodeset)
            (cdr path)))
       ((and (pair? (car path)) (eq? 'eq? (caar path)))
        (loop ((select-kids (apply node-eq? (cdar path))) nodeset)
            (cdr path)))
       ((pair? (car path))
        (let reducer ((nodeset 
                     (if (symbol? (caar path))
                         ((select-kids (node-typeof? (caar path))) nodeset)
                         (loop nodeset (caar path))))
                    (reducing-path (cdar path)))
        (cond
         ((null? reducing-path) (loop nodeset (cdr path)))
         ((number? (car reducing-path))
          (reducer ((node-pos (car reducing-path)) nodeset)
                   (cdr reducing-path)))
         (else
          (reducer ((sxp:filter (sxpath (car reducing-path))) nodeset)
                   (cdr reducing-path))))))
       (else
        (error "Invalid path step: " (car path)))
  ))))
  
  
  ; Pre-order traversal of a tree and creation of a new tree:
  ;     apply-templates:: tree x <templates> -> <new-tree>
  ; where
  ; <templates> ::= (<template> ...)
  ; <template>  ::= (<node-test> <node-test> ... <node-test> . <handler>)
  ; <node-test> ::= an argument to node-typeof? above
  ; <handler>   ::= <tree> -> <new-tree>
  ;
  ; This procedure does a _normal_, pre-order traversal of an SXML
  ; tree.  It walks the tree, checking at each node against the list of
  ; matching templates.
  ; If the match is found (which must be unique, i.e., unambiguous), the
  ; corresponding handler is invoked and given the current node as an
  ; argument.  The result from the handler, which must be a <tree>,
  ; takes place of the current node in the resulting tree.
  ; The name of the function is not accidental: it resembles rather closely
  ; an 'apply-templates' function of XSLT.
  
  (define (apply-templates tree templates)
  
                ; Filter the list of templates. If a template does not
                ; contradict the given node (that is, its head matches
                ; the type of the node), chop off the head and keep the
                ; rest as the result. All contradicting templates are removed.
    (define (filter-templates node templates)
      (cond
       ((null? templates) templates)
       ((not (pair? (car templates)))  ; A good template must be a list
        (filter-templates node (cdr templates)))
       (((node-typeof? (caar templates)) node)
        (cons (cdar templates) (filter-templates node (cdr templates))))
       (else
        (filter-templates node (cdr templates)))))
  
                ; Here <templates> ::= [<template> | <handler>]
                ; If there is a <handler> in the above list, it must
                ; be only one. If found, return it; otherwise, return #f
    (define (find-handler templates)
      (and (pair? templates)
         (cond
          ((procedure? (car templates))
           (if (find-handler (cdr templates))
               (error "ambiguous template match"))
           (car templates))
          (else (find-handler (cdr templates))))))
  
    (let loop ((tree tree) (active-templates '()))
     ;(cout "active-templates: " active-templates nl "tree: " tree nl)
      (if (nodeset? tree)
        (map-union (lambda (a-tree) (loop a-tree active-templates)) tree)
        (let ((still-active-templates 
               (append 
                (filter-templates tree active-templates)
                (filter-templates tree templates))))
          (cond 
           ;((null? still-active-templates) '())
           ((find-handler still-active-templates) =>
            (lambda (handler) (handler tree)))
           ((not (pair? tree)) '())
           (else
            (loop (cdr tree) still-active-templates))))))
  )
  
  ;;                                               
  ;;                            S X M L   T o o l s
  ;;                                               
  ;;                             Swiss Army Knife  
  ;;                              for SXML data    
  ;;                                               
  
  ;; Some of the functions are dealing with normalized
  ;; or minimized form of SXML data.
  ;; Normalized form is defined in SXML Specification.
  ;; Minimized form is just the same, but doesn't require the presence 
  ;; of empty attribute lists.
  
  ;; $Revision: 1.1 $ from $Date: 2002/01/17 18:09:12 $:
  
  ;; Copyright (c) 2000-2001 Kirill Lisovsky
  ;;
  ;; This software is free,
  ;; MIT-style license is attached in the file COPYRIGHTS
  
  ;==============================================================================
  ; Predicates
  
  ; Predicate which returns #t if <obj> is SXML node, otherwise returns #f. 
  (define (sxml:node? obj)      
    (and                            ; The object is SXML node if:
         (list? obj)                ; - it is a list and
         (not (null? obj))          ; - it is not empty
         (symbol? (car obj))        ; - its car is symbol
         ))
  
  ; Predicate which returns #t if <obj> is SXML element, otherwise returns #f. 
  (define (sxml:element? obj)   
    (and                            ; The object is SXML element if:
         (list? obj)                ; - it is a list and
         (not (null? obj))          ; - it is not empty
         (symbol? (car obj))        ; - its car is symbol (name)
         (not (member (car obj) 
             '(@                  ; - it is not a list of attributes, etc.
                *TOP*             
                *PI*
                *COMMENT* 
                *ENTITY*
                *NAMESPACES*)))))
  
  ; Predicate which returns #t if <obj> is SXML element, otherwise returns #f. 
  (define (sxml:attr-list? obj) 
    (and                            ; The object is SXML element if:
         (list? obj)                ; - it is a list and
         (not (null? obj))          ; - it is not empty
         (symbol? (car obj))        ; - its car is symbol (name)
         (eq? (car obj) '@)))       ; - it is a list of attributes, etc.
  
  
  ; Predicate which returns #t if given element <obj> is empty. 
  ; Empty element has no content but may contain some attributes.
  ; It is a counterpart of XML empty-element.
  (define (sxml:empty-element? obj)
    (and (sxml:element? obj)
         (null? (sxml:content obj))))
  
  ; Predicate which returns #t if given element <obj> is void.
  ; Void element has neither content nor attributes, namespaces or comments,
  ; So, the every void element is also the empty one.
  (define (sxml:void-element? obj)
    (and 
      (sxml:element? obj)
      (or 
        (null? (cdr obj)) 
        (and 
        (eq? '@ (cadr obj)) 
        (null? (cdadr obj)) 
        (null? (cddr obj))))))
  
  
  ; Returns #t if the given <obj> is shallow-normalized SXML element.
  ;  This means that this element itself has a list of attributes
  ;  in the second position, but it doesn't  test its nested elements.
  (define (sxml:shallow-normalized? obj)
    (and 
      (sxml:element? obj)
      (not 
        (null? (cdr obj))) 
      (list? (cadr obj)) 
      (eq? (car (cadr obj)) 
         '@)))
  
  
  ;==============================================================================
  ; Accessors  
  
  ; Returns a name of the given element
  ; Sample call: (sxml:name element)
  ; It is introduced for the sake of encapsulation.
  (define sxml:name car)
  
  ; Safe version of sxml:name, which returns #f if the given <obj> is 
  ; not a SXML element.
  ; Otherwise returns its name.
  (define (sxml:element-name obj)
    (if (sxml:element? obj)
      (car obj)
      #f))
  
  ; Safe version of sxml:name, which returns #f if the given <obj> is 
  ; not a SXML node.
  ; Otherwise returns its name.
  (define (sxml:node-name obj)
    (if (sxml:node? obj)
      (car obj)
      #f))
  
  ; Returns the content of given SXML element, representing it
  ; as a list of values and nested elements in document (depth-first) order. 
  ; This list is empty for an empty element.
  ; <obj> is minimized or normalized SXML element.
  (define (sxml:content obj)
    ((select-kids (node-typeof-list? '(* *text*))) obj))
    
  
  ; Returns the list whose car is the list of attributes for given <obj>
  ; and whose cdr is the content of given <obj>
  ; <obj> is minimized or normalized SXML element
  (define (sxml:attr-content obj)
    (let rpt ((cl (cdr obj)) (cn '()))
      (if (null? cl)
         (cons '(@) cn)
        ((lambda (h)
         (if (and (list? h)
                  (eq? (car h) '@))
           (cons h (append cn (cdr cl)))
           (rpt (cdr cl) (append cn (list h))))) 
         (car cl)))))
  
  ; Returns a list of all the children elements for the given <obj>,
  ; <obj> which have to be minimized or normalized SXML element
  (define (sxml:child-elements obj)
    (let rpt ((ns obj)
            (rez '()))
      (if (null? ns)
        rez
        (rpt (cdr ns) 
           (if (sxml:element? (car ns))
             (cons (car ns) rez) 
             rez)))))
           
  ; Returns the list of attributes for the given element.
  ; Analog of ((sxpath '(@ *)) obj)
  ; Empty list is returned if there is no attributes for the element.
  (define (sxml:attr-list obj)
    (let rpt ((cl (cdr obj)))
      (if (null? cl)
        '()
         (if (and (pair? (car cl))
                  (eq? (caar cl) '@))
           (cdar cl)
           (rpt (cdr cl)))) 
         ))
  
  
  ; Accessor for the attribute named <attr-name> for the given SXML element
  ; <obj> which may also be an  attributes-list
  ;
  ; Analog of ((if-car-sxpath '(@ attr-name *text*)) obj)
  ;
  ; It returns: 
  ;    the value of the attribute if the attribute is present and has a value
  ;    #f if there is no such an attribute in the given element
  ;    attribute name (as a string) if the attribute is singular (boolean)
  (define (sxml:attr obj attr-name)
    (cond 
      ((assv attr-name 
           (cond
             ((and (not (null? (cdr obj))) 
                   (list? (cadr obj))
                   (eq? '@ (caadr obj)))
              (cdadr obj))   ; fast track for normalized elements 
             ((eq? '@ (car obj))
             (cdr obj))
             (else
             (sxml:attr-list obj)))) => 
                                           (lambda(x)
                                             (if (null? (cdr x))
                                               (symbol->string attr-name)
                                               (cadr x))))
      (else #f)))
  
  ;==============================================================================
  ; Data modification functions
  ;
  ; Constructors and mutators
  
  ; if the <new-content> is an empty list then the <obj> is transformed 
  ; to an empty element
  ; The resulting SXML element is normalized
  (define (sxml:content! obj new-content)
    (set-cdr! obj (cons 
                         (cons '@ (sxml:attr-list obj)) 
                             new-content))
    obj)
  
  ; Ground function for attribute manipulations
  ; The resulting SXML element is normalized, if <new-attrlist> is empty,
  ; the cadr of <obj> is (@)
  (define (sxml:attrlist! obj new-attrlist)
         (set-cdr! obj 
                 (cons
                        (cons '@ new-attrlist) 
                        (sxml:content obj)))
         obj) 
         
  ; Change name of a SXML element
  (define (sxml:name! obj new-name)
    (set-car! obj new-name)
    obj)
  
  ; Add an attribute <attr-name> with a value <attr-value> for an element 
  ; <obj>
  ; Returns #f if the attribute with given name already exists, 
  ; or the modified SXML node in case of success
  ; The resulting SXML node is normalized
  (define (sxml:add-attr! obj attr-name attr-value)
    (let ((x (sxml:attr-list obj)))
      (cond
        ((null? x)
         (set-cdr! obj (cons 
                        (list '@ (list  attr-name attr-value)) 
                        (sxml:content obj)))
         obj)
        ((assv attr-name x)
         #f)
        (else
        (set-car! (cdr obj) 
                  (append (list '@ (list attr-name attr-value)) x))
        obj))))
  
  ; Set value of the attribute <attr-name> of element <obj> to <attr-value>
  ; Returns #f if there is no such attribute, or the modified SXML element 
  ; in case of success
  ; The resulting SXML element is normalized
  (define (sxml:set-attr! obj attr-name attr-value)
    (let ((x (sxml:attr-list obj)))
       (if (null? x)
         #f
         (cond 
         ((assv attr-name x) => (lambda (y)
                                  (set-car! (cdr y) attr-value) obj))
         (else #f)))))
  
  ; Set the value of the attribute <attr-name> of element <obj> to <attr-value>
  ; If there is no such attribute the new one is added
  ; Returns: the SXML element modified 
  (define (sxml:attr! obj attr-name attr-value)
    ((lambda (x)
       (if (null? x)
         (set-cdr! obj (cons 
                             (list '@ (list attr-name attr-value)) 
                             (sxml:content obj)))
         (cond 
         ((assv attr-name x) => (lambda (y)
                                  (set-car! (cdr y) attr-value)))
         (else (set-car! (cdr obj) 
                         (append (list '@ (list attr-name attr-value)) 
                                x)))))
       obj)
     (sxml:attr-list obj)))
  
  
  ; Appends <new-data> to the content of the <obj>
  ; The resulting SXML element is normalized
  (define (sxml:append-content! obj . new-data)
     (sxml:content! obj
                     (append (sxml:content obj) 
                             new-data))
     obj)
  
  ; Appends <new-data> to the content of the <obj>
  ; The resulting SXML element is normalized
  (define (sxml:append-content1! obj new-data)
    (sxml:content! obj
                     (append (sxml:content obj) 
                             new-data))
    obj)
  
  ; This function inserts new sub-elements in sxml node <obj>
  ; if there is no elements with the same name yet
  ; <element> is a sxml element to insert
  ;
  ; Absolete name: if-insert-node!
  (define (sxml:insert-unique! obj element)
    (cond
      ((null? ((sxpath `(,(sxml:name element))) obj))
            (sxml:append-content! obj element))
      (else #f)))
  
  ; Normalize SXML data structure for the given <obj> and its descendants
  ; Returns: normalized SXML element
  (define (sxml:normalize! obj)
    (if (not (sxml:shallow-normalized? obj))
      ((lambda(x) 
         (if (null? x)
            (set-cdr! obj (cons '(@) (sxml:content obj)))
          (set-cdr! obj (cons (cons '@ x) (sxml:content obj)))
          ))
       (sxml:attr-list obj)))
    (for-each
      sxml:normalize!
      (sxml:child-elements obj))
     obj)
  
  ; Eliminates empty lists of attributes for the given SXML element 
  ; <obj> and its descendants ("minimize" it)
  ; Returns: minimized SXML element
  (define (sxml:squeeze! obj)
    (if (null? (sxml:attr-list obj))
      (set-cdr! obj (sxml:content obj)))
      (for-each 
        sxml:squeeze!
        (sxml:child-elements obj))
      obj)
  
  
  ;==============================================================================
  ; SXPath-related 
  
  ;------------------------------------------------------------------------------
  ; extensions
  
  ; The function node-typeof-list? takes a type criterion and returns
  ; a function, which, when applied to a node, will tell if the node satisfies
  ; the test.
  ;     node-typeof? :: Crit -> Node -> Boolean
  ;
  ; The criterion 'crit' is 
  ;  one of the following symbols:
  ;     id              - tests if the Node has the right name (id)
  ;     @               - tests if the Node is an <attributes-coll>
  ;     *               - tests if the Node is an <Element>
  ;     *text*          - tests if the Node is a text node
  ;     *data*          - tests if the Node is a data node 
  ;                         (text, number, boolean, etc)
  ;     *PI*            - tests if the Node is a PI node
  ;     *any*           - #t for any type of Node
  ;  or list of symbols
  ;     (id ...)        - tests if the Node has the right name, 
  ;                         listed in (id ...) list
  (define (node-typeof-list? crit)
      (define (present? x)
        ((if (list? crit)
         memq 
         eq?)
         x crit))
    (lambda (node)
      (cond 
        ((pair? node)
         (or
         (present? '*any*)
         (present? (car node))
         (and (present? '*)
              (not (memq (car node) 
                         '(@ *PI* *COMMENT* *ENTITY* *NAMESPACES*))))
         (and (present? '*data*) (not (list? node)))
         ))
        ((string? node) 
         (or 
         (present? '*any*)
         (present? '*text*)
         (present? '*data*)))
        (else
        (or
          (present? '*any*)
          (and (present? '*data*) (not (list? node))))))
      ))        
           
  
  
  

----------------------------------------------------------------------
In case of troubles, e-mail:     [EMAIL PROTECTED]
To unsubscribe, e-mail:          [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to