Hi.

I have written down what I was able to figure out on matchers and
selectors. Please comment. Especially, if you happen to have
implemented this stuff :-)

I will XMLize it to match the other docs after the first round of
comments. (no comments, no XML ;-)

Speaking of comments, there are currently two of my patches still
neither commented, rejected nor applied. And I'm sitting on two more...

        Chris.

-- 
C h r i s t i a n       H a u l
[EMAIL PROTECTED]
    fingerprint: 99B0 1D9D 7919 644A 4837  7D73 FEF9 6856 335A 9E08

Using and Implementing Matchers and Selectors



Introduction

Matchers and selectors are sitemap components. They are used to
determine the flow, the other components involved and their ordering
of the request processing. One particular matcher you're probably
familiar with, is the WildcardURIMatcher. That is the one that
determines the (sub-)pipeline in the welcome example. But there are
many more matchers supplied with Cocoon, one matches the requested URI
on regular expressions, others match the client's hostname, existence
of parameters and so on.

There is also a number of selectors supplied with Cocoon. Selectors
test a generally simple boolean expression and allow to select on
roughly the same things as matchers would. There is a selector on the
client's hostname, on the client's browser etc.

To make things even more complicated, actions have very similar
properties. You can nest other sitemap elements in an action and those
are included in the processing only if the action completes
successfully.



So what are the differences?

The basic structure of a selector is that of a case, switch or
if-elseif-...-elseif-else statement in programming languages while
matchers and actions compare more to a if statement. In addition
selectors don't communicate the basis for their decision to the
embedded elements, just select the next part(s) of the
pipeline. Matchers and actions, however, add a new map to the
environment that can be used for the further processing in the
sub pipeline. 

You've already come across this feature on the example sitemap: The
value matched by the WildcardURIMatcher is used to determine the
filename "docs/samples/xsp/{1}.xsp". Here "{1}" represents the value
that is stored in the environmental map with the key "1". The name of
the key is arbitrary and set by the matcher. If you had supplied a
more complex pattern, there would be others. For example <map:match
pattern="*/*/*?user=*"> would result in keys "1", "2", "3", and "4"
being defined, corresponding to the "*"s in the pattern.

Generally, one could say that selectors are better suited if the
decisions has few easily distinguishable cases, the map feature is not
needed and the decision occurs later in the pipeline. Their
implementation should be lightweight and so is their integration in
the compiled sitemap.

Actions should be used to "do" something, not to chose between
different sub pipelines. That should be done only, if an error occurred
and you cannot continue the regular pipeline.

Matchers are often the very first element in a pipeline. They direct
the processing based on more complex decision process. They are
general purpose but more costly than selectors.


Using Matchers

Like every other sitemap component, matchers need to be declared in
the sitemap:

<map:sitemap xmlns:map="http://apache.org/cocoon/sitemap/1.0";>
  <map:components>
   ...

  <map:matchers default="wildcard">
     <map:matcher name="wildcard" 
src="org.apache.cocoon.matching.WildcardURIMatcherFactory"/>
     ...
     <map:matcher name="next-page" 
src="org.apache.cocoon.matching.WildcardParameterValueMatcherFactory">
        <map:parameter name="parameter-name" value="next-state"/>
     </map:matcher>
  </map:matchers>

  ...
  </map:components>

  <map:resources/>
  <map:pipelines/>
</map:sitemap>

Matchers are given a short name (e.g, "wildcard") and of course the
name of the JAVA class that implements the matcher or a matcher
factory. In addition, parameters can be defined as well.

One matcher may be defined as default matcher, so whenever a matcher
appears in the sitemap without explicit type specification it will be
assumed that it is of the default type.

In a pipeline use the matcher like this

<map:sitemap xmlns:map="http://apache.org/cocoon/sitemap/1.0";>
  <map:components/>
  <map:resources/>
  <map:pipelines>
     <map:pipeline>

        <map:match pattern="*">
           <map:generate type="serverpages" src="test/{1}.xsp"/>
           <map:transform src="stylesheets/dynamic-page2html.xsl"/>
           <map:serialize/>
        </map:match>

     </map:pipeline>
  </map:pipelines>
</map:sitemap>

Matchers can be nested:

         <map:match type="sessionstate" pattern="edit*">
            <!-- here you could insert parameters for the above matcher -->
            <map:parameter name="state-key" value="__sessionstate"/>
            <map:match type="next-page" pattern="ok*">
                   <!-- do something here, eg. database updates -->
                   <map:redirect-to resource="simple-page1"/>
            </map:match>
            <map:match type="next-page" pattern="delete*">
                   <!-- do something different here, eg. database deletions -->
                   <map:redirect-to resource="simple-page1"/>
            </map:match>
          </map:match>


Using Selectors

As said above, selectors are very similar to matchers. Again, you need
to declare selectors in the sitemap.xmap

<map:sitemap xmlns:map="http://apache.org/cocoon/sitemap/1.0";>
  <map:components>
   ...
  <map:selectors default="browser">
   <map:selector name="browser" 
src="org.apache.cocoon.selection.BrowserSelectorFactory">
    <browser name="explorer" useragent="MSIE"/>
    <browser name="lynx" useragent="Lynx"/>
    <browser name="netscape" useragent="Mozilla"/>
   </map:selector>
   <map:selector name="coded" src="org.apache.cocoon.selection.CodedSelectorFactory"/>
   <map:selector name="parameter" 
src="org.apache.cocoon.selection.ParameterSelectorFactory"/>
  </map:selectors>

  ...
  </map:components>

  <map:resources/>
  <map:pipelines/>
</map:sitemap>

Their use is a bit different to matchers, though:

<map:sitemap xmlns:map="http://apache.org/cocoon/sitemap/1.0";>
  <map:components/>
  <map:resources/>
  <map:pipelines>
     <map:pipeline>

        <map:match pattern="*">
           <map:generate type="serverpages" src="test/{1}.xsp"/>

           <map:select type="browser">
              <!-- you could insert parameters here as well -->
              <map:when test="explorer">
                 <map:transform src="stylesheets/w3c-2-msie.xsl"/>
              </map:when>
              <map:when test="lynx">
                 <map:transform src="stylesheets/dynamic-page2html-text.xsl"/>
                 <map:serialize/>
              </map:when>
              <map:when test="netscape">
                 <map:transform src="stylesheets/ns4.xsl"/>
              </map:when>
              <map:otherwise>
                 <map:transform src="stylesheets/w3c.xsl"/>
              </map:otherwise>
           </map:select>

           <map:transform src="stylesheets/dynamic-page2html.xsl"/>
           <map:serialize/>
        </map:match>

     </map:pipeline>
  </map:pipelines>
</map:sitemap>

Obviously, this could have been done with matchers as well. Decide on
yourself, what appears clearer to you in a specific situation.



Matchers

Since the basic structure and the assumptions are very similar, we
look first at matchers and matcher factories and point out the
differences to selectors at the end.

Matchers need to implement the org.apache.cocoon.matching.Matcher
interface. See javadocs for more details, see also example matchers
included in the distribution.

If you would like to do global configuration for your matcher, it has
to implement the
org.apache.avalon.framework.configuration.Configurable,
org.apache.avalon.framework.thread.ThreadSafe, and
org.apache.avalon.framework.component.Component interfaces.

(not sure about this, is it really necessary to implement all three of
them?) 


MatcherFactories

Matcher factories generate two distinct parts of source code: a
processed pattern stored in a global variable in the sitemap and a
method used to do the actual matching. Since the global variable can
be of an arbitrary type and it is an argument for the matcher method,
it is, too, configurable.

For each uniquely named matcher the function generateParameterSource
and generateMethodSource are called exactly once, while
generateClassSource is called for every use of the generated matcher
in sitemap pipelines.

Note that you may use the same matcher factory (or the same matcher or
whatever) and declare it with different names. Although they will be
instances of the very same class they would be different instances and
thus another matcher method would be generated, e.g. using different
configuration parameters.

The generated matcher method looks like

  private Map wildcardMatch (int [] pattern, Map objectModel,
                             Parameters parameters) {

    // this has been generated by generateMethodSource ->                            
    HashMap map = new HashMap();
    String uri = XSPRequestHelper.getSitemapURI(objectModel);
    if (uri.startsWith("/"))
      uri = uri.substring(1);
    if (org.apache.cocoon.matching.helpers.WildcardURIMatcher.match (
          map, uri, pattern)) {
      return map;
    } else {
      return null;
    }
    // <- this has been generated by generateMethodSource
  }


The method takes three arguments: the first is the aforementioned by
generateClassSource processed pattern, the current environment
(objectModel), and the parameters given for the corresponding match
element. In the example above for nested matchers, this would be the
"<map:parameter name="state-key" value="__sessionstate"/> ". The
"int []" part of the method signature was generated by
generateParameterSource.

To configure the generated matcher, global configuration parameters
can be used to create different sources. To read global configuration
parameters, dom2 is used, you cannot use the usual avalon classes for
this.

Local configuration parameters are avalon parameters and thus can be
easily read and used with the generated matcher method.

If the matcher returns not null, the match was successful and the
nested sub pipeline is executed. Components in sub pipeline can access
the matching result through the returned map.

The easiest way to write your own matcher would be to base it upon
org.apache.cocoon.matching.WildcardURIMatcherFactory and override the
generateMethodSource method with your own.



Selectors

Selectors and selector factories differ from their matcher counter
parts only in the fact that selectors return boolean values rather
than maps. Thus when a selector returns "true" the nested elements
will be included in the processing, otherwise they are not
included. Since no map is returned, no additional information may be
passed to those elements.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]

Reply via email to