Greetings.
Before I try and solve this for the third time I wanted to throw it out to
the group and maybe get leads in the right direction.
What I am trying to do is "parse" HTML code received from a website. First
of all I want to remove all HTML tags and secondly I wanted to write a
function to pull data from an html table based on a row header in the table.
For example:
<HTML>
<TITLE>How to Parse</TITLE>
<BODY BGCOLOR="F0F0F0">
<h1>This is a Table</h1>
<hr>
<TABLE Border ="1" Size="50%">
<TR>
<TD align="right">Boxes</td><td>100</td><td align="left">10</td></tr>
<TR><TD align="right">Shoes</td><td>50</td><td align="left">20</td></tr>
<TR><TD align="right">Socks</td><td>500</td><td align="left">52</td></tr>
</table>
</html>
would yield (one of):
[Boxes 100 10]
[Shoes 50 20]
[Socks 500 52]
(depending on the input parameter)
My first thoughts was that I should be able to this without problem, but I
seem to be having difficulty with the grammar of parse.
Anyhow, anything that could steer me in the right direction would be
appreciated greatly.
Thanks
Brian