Anything to Triples -- any23

    http://code.google.com/p/any23/

Any23 is an open-source Java library that parses structured data out of various Web document formats, and maps them into the RDF data model. Any23 is based on code that has been developed for the Sindice search engine at DERI. This is the initial release. It supports the following input formats:

  * RDF/XML
  * Turtle (Notation 3)
  * N-Triples
  * RDFa embedded in XHTML and HTML
* Microformats: Adr, Geo, hCalendar, hCard, hListing, hResume, hReview, License, XFN

The focus of this first release is to extract the code from the Sindice codebase, and getting project infrastructure into place. Future versions will focus on a more flexible API, improved performance, more input data formats, higher-quality extraction, and more output formats such as JSON.

Any23 can be downloaded from the project website:
    http://code.google.com/p/any23/

Feedback is very welcome on the project mailing list:
    http://groups.google.com/group/any23-dev

Best,
Richard

Reply via email to