Re: [uf-discuss] hoard.it
Thanks. I don't know what Dan did for hoard.it, but our original script treated 'about' or 'circa' as the date plus/minus five years. So 'circa 1800' would be returned as '1795/1805'. For 'before' or 'after', you could return a pair of dates with either the first or second blank, accordingly. This is assuming we encode time periods as per the guidelines in the PNDS application profile: http://www.ukoln.ac.uk/metadata/pns/pndsdcap/ #DctermsTemporalDctermsPeriod Jim On 8 Jul 2008, at 06:02, Bob Jonkman wrote: Sounds great! How does it deal with dates commonly found in genealogy, such as ABT 7 July 1950 or AFT 25 Dec 2000 or BEF Jan 1925? or even ABT 2000 ? --Bob. Jim O'Donnell [EMAIL PROTECTED] http://eatyourgreens.org.uk http://flickr.com/photos/eatyourgreens ___ microformats-discuss mailing list microformats-discuss@microformats.org http://microformats.org/mailman/listinfo/microformats-discuss
Re: [uf-discuss] hoard.it
Sounds great! How does it deal with dates commonly found in genealogy, such as ABT 7 July 1950 or AFT 25 Dec 2000 or BEF Jan 1925? or even ABT 2000 ? --Bob. On 3 Jul 2008 at 23:03, Jim O'Donnell wrote: Hello, This might be of interest to members of this group, as it deals with extracting data from semantic HTML. Prior to this year's Mashed Museum event at the University of Leicester, Dan Zambonini put together a prototype which aggregates data by spidering online museum catalogues: http://hoardit.pbwiki.com/ It's a pretty fantastic demo of how information can be extracted from well-structured HTML, even before you think of putting microformats etc. on top. In particular, it does a pretty good job of figuring out when an object was made: http://feeds.boxuk.com/museums/object_100yrs.php The date parser is based on some code Dan I knocked together at Mashed Museum 2007, which looks at strings like 'late Victorian', 'early 20th Century', '4th January 1853' and so on, and converts them to machine-readable ISO dates. Our original idea, which we never got round to actually implementing, was that this would be useful as a web service - you give it a string, it gives you a machine-parsable representation of that string. The recent discussion here about dates has made me wonder if such a web service woud be useful for microformats parsers. What do others think? Cheers Jim Jim O'Donnell [EMAIL PROTECTED] http://eatyourgreens.org.uk http://flickr.com/photos/eatyourgreens ___ microformats-discuss mailing list microformats-discuss@microformats.org http://microformats.org/mailman/listinfo/microformats-discuss -- -- -- -- Bob Jonkman [EMAIL PROTECTED] http://sobac.com/sobac/ SOBAC Microcomputer Services Voice: +1-519-669-0388 6 James Street, Elmira ON Canada N3B 1L5 Cel: +1-519-635-9413 Software --- Office Business Automation --- Consulting ___ microformats-discuss mailing list microformats-discuss@microformats.org http://microformats.org/mailman/listinfo/microformats-discuss
Re: [uf-discuss] hoard.it
Jim O'Donnell wrote: The recent discussion here about dates has made me wonder if such a web service woud be useful for microformats parsers. What do others think? It seems to me that this type of date extraction might present risks if used by uf parsers to extract date/time from published content (and lead to the people showing up on the wrong date error mentioned in earlier posts). On the other hand, it might be great at the time content is authored, to convert ambiguous natural language dates into unambiguous microformats, as a way to reduce the pain of micro-formatting content (especially it can detect dates in plain text rather than parsing something it knows is a date). Authors could confirm the generated microformats before publishing in a way similar to how Yahoo! shortcuts Wordpress plugin works [1] Guillaume [1] http://lebleu.org/blog/2008/02/09/trying-out-yahoo-shortcuts/ ___ microformats-discuss mailing list microformats-discuss@microformats.org http://microformats.org/mailman/listinfo/microformats-discuss