On Thu, Apr 10, 2008 at 1:40 PM, Mark Ng <[EMAIL PROTECTED]> wrote:
>  XFN itself is fairly easy to deal with by just throwing pages through
>  tidy and using DOM/SAX/xPath, surely ?  I made a rudimentary parser to
>  do this some time ago.  The code is a little ugly to publish, but I
>  don't mind sharing privately.

Here's a *very* hacky code example from when I just wanted to check my
'me' links - I include it here just to demonstrate how simple XFN can
be and hopefully it's apparent how easy it would be to work up into a
nice objecty system for spidering:

<?php

$url = 'http://ciaranmcnulty.com/';
if($html = @file_get_contents($url)){
        $dom = new DomDocument();
        if(@$dom->loadHtml($html)){
                $xpath = new DomXpath($dom);
                if($nodes = $xpath->query("//a[contains(concat(' ',
normalize-space(@rel), ' '),' me ')]")){
                        foreach($nodes as $node){
                                echo $node->getAttribute('href'), PHP_EOL;
                        }
                }
        }
        else{ echo 'Could not parse HTML', PHP_EOL; }
}
else{  echo 'Could not fetch file', PHP_EOL; }
?>
_______________________________________________
microformats-discuss mailing list
microformats-discuss@microformats.org
http://microformats.org/mailman/listinfo/microformats-discuss

Reply via email to