Found this a while back and thought I'd share: https://github.com/scrapinghub/extruct
"extruct is a library for extracting embedded metadata from HTML markup"
>
> - W3C's HTML Microdata
> - embedded JSON-LD
> - Microformat via mf2py
> - Facebook's Open Graph
> - (experimental) RDFa via rdflib
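
For anyone unsure what "embedded JSON-LD" looks like in practice, here is a minimal sketch using only the Python standard library. It is not extruct's implementation or API; the names `JsonLdCollector`, `extract_jsonld` and `SAMPLE_HTML` are made up for illustration, and the sample page is invented. It just pulls the contents of `<script type="application/ld+json">` blocks out of a page and parses them.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect parsed contents of <script type="application/ld+json"> blocks.

    Hypothetical helper for illustration; not part of extruct.
    """

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        # A valueless attribute arrives as None, so fall back to "".
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        # Script bodies can arrive in several chunks; buffer them all.
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            text = "".join(self._buffer).strip()
            if not text:
                return
            try:
                self.items.append(json.loads(text))
            except json.JSONDecodeError:
                # Skip malformed blocks instead of failing the whole page.
                pass


def extract_jsonld(html):
    """Return a list of JSON-LD documents embedded in an HTML string."""
    parser = JsonLdCollector()
    parser.feed(html)
    parser.close()
    return parser.items


# Invented sample page, for demonstration only.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "SoftwareSourceCode", "name": "extruct"}
</script>
</head><body><p>Hello</p></body></html>
"""

if __name__ == "__main__":
    for item in extract_jsonld(SAMPLE_HTML):
        print(item["@type"], item["name"])
```

The other formats in the list (Microdata, Microformats, Open Graph, RDFa) live in element attributes rather than in a script block, so they need a proper DOM walk, which is the work a library like extruct saves you.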