Re: [uf-discuss] hoard.it

2008-07-09 Thread Jim O'Donnell
Thanks. I don't know what Dan did for hoard.it, but our original  
script treated 'about' or 'circa' as the date plus/minus five years.  
So 'circa 1800' would be returned as '1795/1805'. For 'before' or  
'after', you could return a pair of dates with either the first or  
second blank, accordingly. This is assuming we encode time periods as  
per the guidelines in the PNDS application profile:
http://www.ukoln.ac.uk/metadata/pns/pndsdcap/ 
#DctermsTemporalDctermsPeriod


Jim

On 8 Jul 2008, at 06:02, Bob Jonkman wrote:


Sounds great!  How does it deal with dates commonly found in
genealogy, such as ABT 7 July 1950 or AFT 25 Dec 2000 or BEF
Jan 1925? or even  ABT 2000 ?

--Bob.


Jim O'Donnell
[EMAIL PROTECTED]
http://eatyourgreens.org.uk
http://flickr.com/photos/eatyourgreens



___
microformats-discuss mailing list
microformats-discuss@microformats.org
http://microformats.org/mailman/listinfo/microformats-discuss


Re: [uf-discuss] hoard.it

2008-07-07 Thread Bob Jonkman
Sounds great!  How does it deal with dates commonly found in 
genealogy, such as ABT 7 July 1950 or AFT 25 Dec 2000 or BEF 
Jan 1925? or even  ABT 2000 ?

--Bob.

On 3 Jul 2008 at 23:03, Jim O'Donnell wrote:

 Hello,
 
 This might be of interest to members of this group, as it deals with 
 extracting data from semantic HTML. Prior to this year's Mashed 
 Museum event at the University of Leicester, Dan Zambonini put 
 together a prototype which aggregates data by spidering online museum 
 catalogues: http://hoardit.pbwiki.com/ It's a pretty fantastic demo of
 how information can be extracted from  well-structured HTML, even
 before you think of putting microformats  etc. on top.
 
 In particular, it does a pretty good job of figuring out when an 
 object was made: http://feeds.boxuk.com/museums/object_100yrs.php The
 date parser is based on some code Dan  I knocked together at  Mashed
 Museum 2007, which  looks at strings like 'late Victorian',  'early
 20th Century', '4th January 1853' and so on, and converts them  to
 machine-readable ISO dates.
 
 Our original idea, which we never got round to actually implementing, 
 was that this would be useful as a web service - you give it a 
 string, it gives you a machine-parsable representation of that 
 string. The recent discussion here about dates has made me wonder if 
 such a web service woud be useful for microformats parsers. What do 
 others think?
 
 Cheers
 Jim
 
 Jim O'Donnell
 [EMAIL PROTECTED]
 http://eatyourgreens.org.uk
 http://flickr.com/photos/eatyourgreens
 
 
 
 ___
 microformats-discuss mailing list
 microformats-discuss@microformats.org
 http://microformats.org/mailman/listinfo/microformats-discuss


-- -- -- --
Bob Jonkman [EMAIL PROTECTED] http://sobac.com/sobac/
SOBAC Microcomputer Services  Voice: +1-519-669-0388
6 James Street, Elmira ON  Canada  N3B 1L5  Cel: +1-519-635-9413
Software   ---   Office  Business Automation   ---   Consulting


___
microformats-discuss mailing list
microformats-discuss@microformats.org
http://microformats.org/mailman/listinfo/microformats-discuss


Re: [uf-discuss] hoard.it

2008-07-07 Thread Guillaume Lebleu

Jim O'Donnell wrote:
The recent discussion here about dates has made me wonder if such a 
web service woud be useful for microformats parsers. What do others 
think?
It seems to me that this type of date extraction might present risks if 
used by uf parsers to extract date/time from published content (and lead 
to the people showing up on the wrong date error mentioned in earlier 
posts).


On the other hand, it might be great at the time content is authored, to 
convert ambiguous natural language dates into unambiguous microformats, 
as a way to reduce the pain of micro-formatting content (especially it 
can detect dates in plain text rather than parsing something it knows is 
a date). Authors could confirm the generated microformats before 
publishing in a way similar to how Yahoo! shortcuts Wordpress plugin 
works [1]


Guillaume

[1] http://lebleu.org/blog/2008/02/09/trying-out-yahoo-shortcuts/
___
microformats-discuss mailing list
microformats-discuss@microformats.org
http://microformats.org/mailman/listinfo/microformats-discuss