Hello Brian

Brian Suda wrote:
2008/8/4, Martin McEvoy <[EMAIL PROTECTED]>:
 There are (in my view) only three real ways to resolve this issue
 ...
 3 Support NLP  (Natural Language Processing)  <span class="duration">3
minutes 23 seconds</span>.

 I personally am in favour of number 3 as I believe it is not too difficult
to build a parser that will process just durations (hours minutes  seconds)
as long as there is an agreed format.

--- any sort of NLP is much harder than you think!  If we are back to
codifying something, either we build it in english (which people would
disagree with) or having an list of all known way to spell, decline
and abbreviate hours in all known human languages. Is is very much a
boiling the oceans solution.

-brian

I disagree (slightly) consider this

   <span class="duration">1 hour 3 minutes 23 seconds</span>

or this

   <span class="duration">1 heures 3 minutes 23 secondes</span>


The parser already knows that this is a duration and the contents are a Numerical value and thus text (words) are striped as they are nothing to do with the value, they are only there for a human to understand would leave us with...

<span class="duration">1  3 23</span>

as long as we we know what format this is supposed to represent , the first number is an hour, the second minutes, and third seconds and this is documented as a decided format then it would be fairly straight forward after that to output any format you like.

Thanks


Martin McEvoy
_______________________________________________
microformats-new mailing list
[email protected]
http://microformats.org/mailman/listinfo/microformats-new

Reply via email to