Re: [whatwg] time

2009-03-16 Thread Jim O'Donnell


On 16 Mar 2009, at 20:10, Tom Duhamel wrote:


It seems that pretty much everyone agrees on this:
- Allow the use of an alternate calendar, but only Gregorian is  
required to be understood by user agents
- We only require the user agent to display dates; they are free to  
do more if they like (conversion, ...) but are not required to.
- Calendar is specified in a new attribute ('calendar' or something  
similar) and the value of 'datetime' attibute is specified in the  
calendar specified by that new attribute
- If 'datetime' attribute is missing, try to interpret the content  
as an ISO date. User agent could print the content as is, or print  
in a more friendly way if desired (in case it was successfully read  
as a valid ISO date).
- If content is missing, print 'datetime' attribute (in a friendly  
way, if desired and set by user, or simply as is if unable to do  
better)
- If both content and datetime are present, print content on page  
and show a representation of the date in datetime with a mechanism  
such as a tool tip


This seems overly complex to me.

Can we follow existing practice from TEI ie. datetime may only be  
Gregorian and no other calendar - calendar (if present) identifies  
the calendar in the original text (analogous to the way the HTML lang  
attribute indicates the language of the enclosed text).


So, a date marked up in a TEI document as
date calendar =julian value=1547-02-2818th Feb. 1546/date
transforms to the following HTML
time calendar=julian value=1547-02-2818th Feb. 1546/date

My reasoning here is that TEI is already in widespread use, authors  
familiar with it will expect the same markup practices in HTML and  
one of the largest uses for historical dates as time elements will  
be transformation of existing TEI documents to HTML.


It seems dangerous, to me, to adopt a whole new standard for  
historical dates in HTML when there is an existing standard in wide use.
Essentially I'm asking that the spec for time mirror the existing  
spec for date to make it compatible with historical texts:

http://www.tei-c.org/Guidelines/P4/html/ref-DATE.html

Regards
Jim

Jim O'Donnell
j...@eatyourgreens.org.uk
http://eatyourgreens.org.uk






Re: [whatwg] time

2009-03-14 Thread Jim O'Donnell


On 13 Mar 2009, at 16:19, David Singer wrote:


Can we drop this topic?  Apart from suggesting
a) that the fully delimited date format be required (extended format);
b) that year  and before be allowed;
c) that parsing the body text as 8601 may be dangerous if it's  
notated the same way but not (possibly proleptic) Gregorian;


I agree, with the addition of time periods denoted by two datetimes  
seperated by a / eg. 1914/1918. This would bring HTML5 in line with  
existing authorship practices in use in TEI. Authors will be able to  
encode any historical date be it Julian, Roman, Mayan, Chinese etc by  
continuing to do what they're already accustomed to doing -  
publishing the machine readable date as a proleptic Gregorian date.


The proposed schemes for alternative calendars seem like a red  
herring to me, since noone in practice encodes machine readable  
versions of alternative calendars when digitising a text.


Regards
Jim

Jim O'Donnell
j...@eatyourgreens.org.uk
http://eatyourgreens.org.uk
http://flickr.com/photos/eatyourgreens
http://twitter.com/pekingspring






Re: [whatwg] time

2009-03-14 Thread Jim O'Donnell


On 13 Mar 2009, at 10:33, Mikko Rantalainen wrote:

This is already a solved problem in the Text Encoding Intiative   
(TEI).
The value of a date/time is encoded in the Gregorian calendar,   
using

ISO8601. The calendar attribute is used to indicate the  calendar of
the original, written date enclosed in the tags.


I'm not sure why the original calendar would need to be indicated  
in the
'calendar' attribute. It does not matter for the 'value' or  
software in

general, if I've understood correctly. If the 'value' is always in
Proleptic Gregorian calendar, it's all the software needs to know.



Hello, the usual use of TEI is encoding historical papers for digital  
preservation eg. digitising large archives of correspondence or  
literature. The calendar attribute exists to preserve semantic  
information in the digital version of a paper document. I was  
interested to know whether there was value in preserving such  
information in HTML also, since digital versions of historic  
documents are published on the web as HTML.


I understand, though, that HTML should not become a grab bag of  
features from other SGML or XML vocabularies so I wouldn't push for a  
calendar attribute on time if it isn't generally useful or if I'm  
the only person that wants it.


Regards
Jim

Jim O'Donnell
j...@eatyourgreens.org.uk
http://eatyourgreens.org.uk






Re: [whatwg] time

2009-03-12 Thread Jim O'Donnell

Hi Robert
On 12 Mar 2009, at 02:53, Robert J Burns wrote:

Since you keep repeating the following example (by copy and paste?)  
I will mention that you have the year wrong in one place or the  
other (1731 and 1732). Dates only diverge by years between Julian  
and Gregorian many millions of years in the future or past or in BC  
if using eras that insert a zero year into the calendar. Since  
we're not even proposing the ability to change eras, we should  
assume the same era for both calendars (in any event that only  
applies to BCE/BC dates).


I used that example since it's the Julian date example used in the  
TEI docs for the date element. Happy to give other examples but  
that one seems to nicely demonstrate historical dates. Just one  
correction. 1 January was not adopted as the start of a new year  
until 1752. Hence Jan, Feb, Mar 1731 in the Julian Calendar are Jan,  
Feb, Mar 1732 in the calendar we use now (Gregorian).


On the issue of the calendar attribute, I suggested that as it's  
already present in TEI-encoded documents. TEI is in wide use by  
archives and libraries to digitise historical text and the calendar  
attribute is familiar to TEI authors. Obviously, the semantic info  
captured by TEI is lost when documents are transformed to HTML in  
order for publication online. I thought it would be useful to retain  
this semantic information in HTML but, as I also said, it isn't  
essential,


Regards
Jim

Jim O'Donnell
j...@eatyourgreens.org.uk
http://eatyourgreens.org.uk
http://flickr.com/photos/eatyourgreens
http://twitter.com/pekingspring






Re: [whatwg] time

2009-03-12 Thread Jim O'Donnell

Hi Andy

On 12 Mar 2009, at 21:46, Andy Mabbett wrote:

In message caaf25cf-1fbe-494c-8361-e6811b6c5...@googlemail.com,  
Geoffrey Sneddon foolist...@googlemail.com writes


Ultimately, why is the Gregorian calendar good enough for the ISO  
but not us? I'm sure plenty of arguments were made to the ISO  
before ISO8601 was published, yet that still supports only the  
Gregorian calendar, having been revised twice since it's original  
publication in 1988. Is there really any need to go beyond what  
ISO 8601 supports?


What were the use-case(s) for ISO8601? If merely the exchange of  
calendar information, it's unlikely that it took account of the pre- 
Gregorian/ BCE/ imprecise situations under consideration here.



ISO8601 is already used by historians to mark up dates pre 1582 or  
BCE, or even uncertain dates. The proleptic Gregorian calendar is  
fine back to something like -1BC, which covers pretty much all of  
recorded history. Uncertain dates of the form we deal with on a day- 
to-day basis in museums, like 'circa 1920' or 'late 20th century' can  
be encoded as '1915/1925' or '1951/2000' using ISO8601.


I think uncertain times encoded as ISO8601 date ranges in the  
datetime attribute will handle any reasonable markup of a historical  
date. Anyone who has a weird use case that falls outside this can  
fall back on RDFa.


Does anyone see a case for identifying the calendar of the date in  
the running text, or is that just me wearing my pedantic, semantic  
history markup hat?


Jim

Jim O'Donnell
j...@eatyourgreens.org.uk
http://eatyourgreens.org.uk
http://flickr.com/photos/eatyourgreens
http://twitter.com/pekingspring






Re: [whatwg] time

2009-03-11 Thread Jim O'Donnell


On 11 Mar 2009, at 04:46, Leif Halvard Silli wrote:

This is already a solved problem in the Text Encoding Intiative  
(TEI). [ ... ] date calendar=Julian value=1732-02-22Feb. 11,  
1731./date [ ... ] We can't change the author's original written  
dates, but it would be useful to normalise documents using the  
Julian calendar to proleptic Gregorian dates.


Yes, the draft needs to clear up the (mis)understanding that time  
requires authors to place Gregorian dates in the original.


If the calendaric meta information should be available to human  
consumers, then then @title seems like a better place. The draft  
could recommend how to use @title for time.



How about something like time calendar=Julian value=1732-02-22  
title=22 February 1732Feb. 11 1731/time, where title and  
calendar are optional?


Regards
Jim

Jim O'Donnell
j...@eatyourgreens.org.uk
http://eatyourgreens.org.uk
http://flickr.com/photos/eatyourgreens
http://twitter.com/pekingspring






Re: [whatwg] time

2009-03-11 Thread Jim O'Donnell


On 11 Mar 2009, at 08:54, Robert J Burns wrote:




Authoring tools can be used to convert from other formats to  
Gregorian.


And in that regard, it should be very relevant to have a calendar  
attribute.


Or reuse the RDFa datatype attribute with new calendar system  
keywords.





I'm not sure that would work, or rather that you would need the  
complexity of RDFa for something as simple as a change of calendar.  
My example of a Julian date in 1732

time calendar=Julian datetime=1732-02-22Feb. 11 1731/time,
(from http://www.tei-c.org/Guidelines/P4/html/ref-DATE.html)
would then be, I think, in RDFa
time datatype=GregorianDate content=1732-02-22Feb. 11 1731/time
since datatype describes the format of the content attribute, which  
is always a proleptic Gregorian date. So you'd lose the semantic  
information that the marked-up date is in the Julian calendar.


I'm proposing that datetime accept ISO8601 dates as per the existing  
W3C datetime profile. ISO8601 dates may only use the Gregorian  
calendar. In addition, it would be useful, but not essential, in  
describing the semantics of historical dates to have an attribute, as  
per TEI, identifying the calendar in use in the text itself. That  
shouldn't add any extra load to datetime parsers - they only need to  
be able to understand Gregorian ISO8601 dates.


Expanding the datetime or RDFa content attributes to contain non- 
Gregorian dates would be an unnecessary headache and, frankly, more  
trouble than it's worth.


Regards
Jim

Jim O'Donnell
j...@eatyourgreens.org.uk
http://eatyourgreens.org.uk
http://flickr.com/photos/eatyourgreens
http://twitter.com/pekingspring






Re: [whatwg] time

2009-03-10 Thread Jim O'Donnell

Hi David,

On 10 Mar 2009, at 17:03, David Singer wrote:



The trouble is, that opens a large can of worms.  Once we step out  
of the Gregorian calendar, we'll get questions about various other  
calendar systems (e.g. Roman ab urbe condita http:// 
en.wikipedia.org/wiki/Ab_urbe_condita, Byzantine Indiction cycles  
http://en.wikipedia.org/wiki/Indiction, and any number of other  
calendar systems from history and in current use).  Then, of  
course, are the systems with a different 'year' (e.g. lunar rather  
than solar).  And if we were to introduce a 'calendar system  
designator', we'd have to talk about how one converted/normalized.




This is already a solved problem in the Text Encoding Intiative  
(TEI). The value of a date/time is encoded in the Gregorian calendar,  
using ISO8601. The calendar attribute is used to indicate the  
calendar of the original, written date enclosed in the tags.

eg. from the TEI docs for dates and times
date calendar=Julian value=1732-02-22Feb. 11, 1731./date
I suggested that the calendar attribute be adopted in HTML5, as it  
would be useful to those of us who mark up historical texts in HTML.  
We can't change the author's original written dates, but it would be  
useful to normalise documents using the Julian calendar to proleptic  
Gregorian dates.


See http://www.tei-c.org/Guidelines/P4/html/ref-DATE.html and the  
associated guidelines on publishing dates and times

http://www.tei-c.org/Guidelines/P4/html/CO.html#CONADA



Adding a range construct to 8601, or having a range construct  
ourselves using 2 8601 dates, seems like something we could ask for  
or do.



8601 already allows ranges. Simply seperate two ISO8601 dates with  
a / eg. 1595-11/1596-02.


Regards
Jim

Jim O'Donnell
j...@eatyourgreens.org.uk
http://eatyourgreens.org.uk
http://flickr.com/photos/eatyourgreens
http://twitter.com/pekingspring






Re: [whatwg] Historic dates in HTML5

2009-03-05 Thread Jim O'Donnell

On 5 Mar 2009, at 15:17, Greg Houston wrote:


Personally, I think it would be an improvement to the datetime
attribute if it was valid for at least - - :

p ... For these events to take place within a three week or so
period is simply impossible. The time
datetime=-0004-03-13eclipse/time cannot be the one written in the
records of Josephus./p


I agree that this would be an improvement, since it would make time  
compatible with hCalendar by using ISO8601 for datetime.


By remarkable serendipity, Paul Tarjan posted a presentation about  
searchmonkey today

http://www.slideshare.net/ptarjan/semantic-searchmonkey
It mentions searching the web by date, where that information is  
available. So I would say a key use case for marking up historic  
dates is searching large archives by date eg. searching the National  
Maritime Museum archive for Elizabethan Navy records dating from 1595.


Wikipedia uses the date-time design pattern, from microformats, to  
mark up historic dates using ISO8601. In addition, TEI is widely used  
by archives and libraries to mark up texts, including ISO8601 dates  
(http://www.tei-c.org/Guidelines/P4/html/ref-DATE.html). Since  
authors are already publishing dates online, surely HTML5 should  
accept all ISO8601 dates rather than a limited subset, which requires  
additional processing on the part of authoring and publishing  
software to filter out valid dates that are invalid HTML5.


Regards
Jim

Jim O'Donnell
j...@eatyourgreens.org.uk
http://eatyourgreens.org.uk
http://flickr.com/photos/eatyourgreens
http://twitter.com/pekingspring



[whatwg] Historic dates in HTML5

2009-03-04 Thread Jim O'Donnell

Hello,

Apologies for coming in late but Bruce Lawson pointed me in the  
direction of this discussion  I had some comments about dates in HTML5.


Lachlan Hunt wrote:
The time element was primarily designed to address use cases involving
contemporary dates.  It doesn't address non-Gregorian calendars or BCE
dates by design, as it is not really meant for historical dates.

Probably the most historical dates that it would really be suitably
optimised for are people's birthdates, which, for people alive today,
don't really extend back beyond the early 20th century, with very few
exceptions.

Has any consideration been given to using the time tag in the same  
manner as the date element from TEI to mark up dates, particularly  
the calendar attribute to indicate non-Gregorian calendars? (see  
http://www.tei-c.org/Guidelines/P4/html/ref-DATE.html)


I ask because one of the sites I look after is the National Maritime  
Museum archive search, which includes ~70,000 records dating from the  
16th century to present. Of those, around 3,500 predate the Gregorian  
calendar and have, presumably, Julian dates: http://is.gd/lFrh
It would be useful to mark up these dates as dates in the HTML  
versions of the catalogue records. However, from what Lachlan Hunt  
has said, it seems the time tag in HTML5 can't be used to do this.  
This then leads to a situation where some dates on the web can be  
marked up, semantically, as dates but others cannot, which seems  
somewhat ridiculous really.


Is there any suitable markup in HTML5 for dates in digitised  
documents from museums, libraries and archives?


Regards
Jim O'Donnell
j...@eatyourgreens.org.uk
http://eatyourgreens.org.uk
http://flickr.com/photos/eatyourgreens
http://twitter.com/pekingspring