[uf-discuss] Parsing XFN in PHP

Julian Bond Tue, 08 Apr 2008 06:22:29 -0700

I need some advice about reading rel="me" tags in arbitrary web pages using PHP. I'm intending to use this to help build a lifestream-style function. The basic intent is to cut down the amount of data entry the user has to do. When they give me a MyBlogLog, Friendfeed or Plaxo Pulse page that lists links to their other profile pages, I should be able to avoid asking them for all of them again. So:


- User gives me a URL for one of their profile pages
- Use Curl to collect the source
- Parse the source looking for links with a rel="me"
- Extract an array of Link URL - Link Text
- Do something useful with the array. (???? followed by Profit!)

I've been searching this morning for a PHP library to do the parsing and link extraction, or for PHP examples or an example regex to use in preg_match_all, or something/anything, without success. Since the source data is probably badly written and broken HTML, I don't think I can use XML methods, as all the XML unserialising code I've used barfs on badly formed XML. One possibility, I suppose, is to run it through HTML-Tidy first, but then I run the (admittedly small) risk of HTML-Tidy wiping out some of the links.
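
For illustration, here is a rough sketch of the steps above. It assumes PHP's cURL and DOM extensions are available, and it relies on DOMDocument::loadHTML(), which uses libxml's HTML parser and is generally tolerant of broken markup, unlike the strict XML loaders. The function names fetch_page() and extract_rel_me() are made up for this example and do not come from any existing XFN library; treat it as a starting point rather than a tested solution.

```php
<?php
// Hypothetical helpers for illustration only; not part of any XFN library.

/**
 * Fetch the raw HTML of a profile page with the cURL extension.
 * Returns null if the request fails.
 */
function fetch_page(string $url): ?string
{
    $ch = curl_init($url);
    curl_setopt_array($ch, [
        CURLOPT_RETURNTRANSFER => true, // return the body instead of printing it
        CURLOPT_FOLLOWLOCATION => true, // profile URLs often redirect
        CURLOPT_TIMEOUT        => 10,   // seconds; an arbitrary choice
    ]);
    $html = curl_exec($ch);
    return is_string($html) ? $html : null;
}

/**
 * Extract links carrying rel="me" from possibly malformed HTML.
 * Returns a list of ['url' => ..., 'text' => ...] arrays.
 */
function extract_rel_me(string $html): array
{
    if (trim($html) === '') {
        return []; // loadHTML() rejects empty input
    }

    // Collect parser warnings internally instead of emitting them.
    $previous = libxml_use_internal_errors(true);
    $doc = new DOMDocument();
    $loaded = $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);
    if (!$loaded) {
        return [];
    }

    $links = [];
    foreach ($doc->getElementsByTagName('a') as $a) {
        // rel is a space-separated list of values, e.g. rel="me nofollow".
        $rels = preg_split('/\s+/', strtolower(trim($a->getAttribute('rel'))), -1, PREG_SPLIT_NO_EMPTY);
        if (in_array('me', $rels, true) && $a->hasAttribute('href')) {
            $links[] = [
                'url'  => $a->getAttribute('href'),
                'text' => trim($a->textContent),
            ];
        }
    }
    return $links;
}
```

Usage would be something like `$links = extract_rel_me(fetch_page($url) ?? '');`. Relative hrefs are returned as found, so they would still need resolving against the page URL, and pages that do not declare a character set may need an encoding hint before parsing.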


So what do people use to consume XFN with PHP?

--
Julian Bond  E&MSN: julian_bond at voidstar.com  M: +44 (0)77 5907 2173
Webmaster:          http://www.ecademy.com/      T: +44 (0)192 0412 433
Personal WebLog:    http://www.voidstar.com/     skype:julian.bond?chat
                        Not Tested On Animals
_______________________________________________
microformats-discuss mailing list
microformats-discuss@microformats.org
http://microformats.org/mailman/listinfo/microformats-discuss
