On 25.03.2013 13:12, Jordan Mendelson wrote:
Hello,

Has there been any work towards a microformat for datasets like what
you'd find at http://data.gov, http:///commoncrawl.org, etc?

Open data is becoming more common and there is a lot of metadata
surrounding it (url, format of the data, size of dataset, when it was
published, when it was updated, description, sample data,
license/terms of use, contributors, geo (if data relates to an area),
etc and really no way to easily find it outside some very incomplete
directories.

With a microformat, one might actually be able to build a decent
search engine to help people who are searching for datasets for use in
research, commerce, etc.

My organization publishes several hundred TB of web crawl data and at
a recent talk at Strata, someone asked me about a microformat for
datasets. I feel like if there isn't one started yet, one needs to be
started.


Jordan
_______________________________________________
microformats-new mailing list
microformats-new@microformats.org
http://microformats.org/mailman/listinfo/microformats-new

I won't bug you with the "guidelines" on how to come up with new microformats because you can dig that up from the wiki and then collaborate with the community.

If you do end up developing a new microformats for datasets, I suggest to take a look at VoID [1] and DCAT [2] (RDF vocabularies) as well.

[1] http://www.w3.org/TR/void/
[2] http://www.w3.org/TR/vocab-dcat/

-Sarven
_______________________________________________
microformats-new mailing list
microformats-new@microformats.org
http://microformats.org/mailman/listinfo/microformats-new

Reply via email to