Hello all, just getting up to speed on Cocoon, and finding it all 
quite fascinating. No doubt I'll have many more questions in the 
coming months.

Anyway, today's question is on xpath. I'm trying to customize the 
HTML Generator 'scraper' example to extract bits from a given HTML 
page. Now, I have rudimentary knowledge of xpath syntax, but not 
enough I guess because I'm stuck.

Here's a sample of the HTML to be scraped:

-----------
<table width="100%" border="0">
        <bis t="pr" f="p020326.htm">
<tr>
        <td nowrap align="right" valign="top">
    26 Mar 2002 &nbsp;
    </td>
        <td valign="top">
    Financial Stability Forum holds its seventh meeting 
    (<a href="p020326.htm">Read</a>)
    </td>
</tr>
</bis>

<bis t="pr" f="p020318.htm">
<tr>
        <td nowrap align="right" valign="top">
        [snip]
    </td>
</tr>
</bis>

Etc.
----------

The <bis...> stuff is used by another, non-XML process, but it seemed 
to me it should be a no-brainer to write an xpath argument that would 
pull out between the <bis> </bis> and transform them.

However, it isn't. Can anyone point me in the right general direction 
here?

-----------------------------
Brent Eades, Almonte, Ontario
     http://www.almonte.com
     http://www.bankofcanada.ca


---------------------------------------------------------------------
Please check that your question has not already been answered in the
FAQ before posting. <http://xml.apache.org/cocoon/faqs.html>

To unsubscribe, e-mail: <[EMAIL PROTECTED]>
For additional commands, e-mail: <[EMAIL PROTECTED]>

Reply via email to