Re: "scheme" attribute of META element

Julian Reschke Sat, 06 Jun 2009 00:42:26 -0700

Ian Hickson wrote:

> > I can't speak for Elliot, but the Web repository connector inside SAP NetWeaver's Knowledge Management has supported RFC 2731-style encoded metadata (as shown above) for many years now.
>
> Could you elaborate on how this tool consumes this data? Any information you may have would be very useful. Could you walk us through an example of how this information gets used? How do the various schemes affect the handling of the metadata? Have you found particular processing is needed to process invalid values? Is the tool's input limited to files generated by one organisation, or does it process input from arbitrary Web sites?


I think I did that already once many months ago.

Anyway.

In the SAP KM system, everything is designed around the concept of resources, which essentially consist of a binary content stream, generic metadata (MIME type, encoding, whatnot), Access Control information (ACLs), versioning information (checked-in/out, version history...), and custom metadata.

Most metadata lives in name/value pairs, where the name is an XML type name (nsuri + localname), and the value can be numbers, strings, XML, ... (and lists of them).

SAP KM resources expose a generic API, which is used by the UI, protocol handlers (HTTP/WebDAV, ICE, web services...), and internal services (search, collaboration, ...).

The implementation of resources varies: they can be based on file shares, database tables, remote content management systems, remote WebDAV servers, LDAP, ... and also generic HTTP servers.

The latter are usually used to pull in read-only information that should be exposed to the internal search system (SAP TREX). The code that implements these resources extracts metadata from well-known HTML elements (title, keywords, ...), using configurable filters, and through the use of RFC 2731 formatted meta elements.
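
To make the extraction step concrete, here is a rough sketch in Python. It is not the SAP KM code; the class and function names are made up for illustration. It assumes only the RFC 2731 convention that <link rel="schema.PREFIX" href="..."> declares a prefix and <meta name="PREFIX.localname" content="..."> carries a value, and it maps each such meta element to an (nsuri, localname) name plus a string value. Prefix matching is case-sensitive here, which is a simplification.

```python
from html.parser import HTMLParser


class Rfc2731Extractor(HTMLParser):
    """Hypothetical helper: collects RFC 2731-style metadata from HTML."""

    def __init__(self):
        super().__init__()
        self.schemas = {}  # prefix -> namespace URI, from <link rel="schema.PREFIX">
        self.raw = []      # (name, content) pairs, from <meta name=... content=...>

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "link":
            rel = a.get("rel") or ""
            if rel.lower().startswith("schema.") and a.get("href"):
                self.schemas[rel[len("schema."):]] = a["href"]
        elif tag == "meta" and a.get("name") and a.get("content") is not None:
            self.raw.append((a["name"], a["content"]))

    def properties(self):
        """Resolve prefixed names; names without a declared prefix are skipped."""
        result = []
        for name, value in self.raw:
            prefix, dot, local = name.partition(".")
            if dot and prefix in self.schemas:
                result.append(((self.schemas[prefix], local), value))
        return result


def extract_properties(html):
    """Return a list of ((nsuri, localname), value) pairs found in the HTML."""
    parser = Rfc2731Extractor()
    parser.feed(html)
    parser.close()
    return parser.properties()


sample = """<html><head><title>Report</title>
<link rel="schema.DC" href="http://purl.org/dc/elements/1.1/">
<meta name="DC.date" content="2009-06-06">
<meta name="keywords" content="metadata, html">
</head><body></body></html>"""

for (nsuri, localname), value in extract_properties(sample):
    print(nsuri, localname, value)
```

Resolution happens only after the whole document has been parsed, so a link element that appears after the meta elements it applies to is still honoured. The plain "keywords" element is skipped by this sketch because it has no prefix; a real connector would handle such well-known elements separately.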

How this information is used in detail depends on the consumers using the KM API, which is hard to predict. Some use cases are decorations in the UI based on additional properties, or support in custom searches.

One of the reasons RFC 2731 support was added specifically was that several companies wanted to expose additional properties in their HTML documents (such as additional document-related dates), and have them be accessible through the services mentioned above.


Back to your question:

> how this information gets used? How do the various schemes affect the
> handling of the metadata? Have you found particular processing is needed

Schemes aren't used (as I said in a later mail), but link/meta and the RFC 2731-style encoding of prefixes are.

> to process invalid values? Is the tool's input limited to files generated
> by one organisation, or does it process input from arbitrary Web sites?

The tool works for generic web resources.

BR, Julian
