Re: [whatwg] video feedback
On Thu, 20 Dec 2012, Jer Noble wrote: On Dec 17, 2012, at 4:01 PM, Ian Hickson i...@hixie.ch wrote: Should we add a preciseSeek() method with two arguments that does a seek using the given rational time? This method would be more useful if there were a way to retrieve the media's time scale. Otherwise, the script would have to pick an arbitrary scale value, or provide the correct media scale through other means (such as querying the server hosting the media). Additionally, authors like Rob are going to want to retrieve this precise representation of the currentTime. If rational time values were encapsulated into their own interface, a preciseCurrentTime (or similar) read-write attribute could be used instead. Ok. I assume this is something you (Apple) are interested in implementing; is this something any other browser vendors want to support? If so, I'll be happy to add something along these lines. -- Ian Hickson, http://ln.hixie.ch/
Re: [whatwg] video feedback
On 2012/12/18 9:01, Ian Hickson wrote: On Tue, 2 Oct 2012, Jer Noble wrote: The nature of floating point math makes precise frame navigation difficult, if not impossible. Rob's test is especially hairy, given that each frame has a timing bound of [startTime, endTime), and his test attempts to navigate directly to the startTime of a given frame, a value which gives approximately zero room for error. ... That makes sense. Should we add a preciseSeek() method with two arguments that does a seek using the given rational time? I draw your attention to Don't Store that in a float http://randomascii.wordpress.com/2012/02/13/dont-store-that-in-a-float/ and its suggestion to use a double starting at 2^32 to avoid the issue around precision changing with magnitude as the time increases. Regards -Mark
Re: [whatwg] video feedback
On Thu, 20 Dec 2012, Mark Callow wrote: On 2012/12/18 9:01, Ian Hickson wrote: On Tue, 2 Oct 2012, Jer Noble wrote: The nature of floating point math makes precise frame navigation difficult, if not impossible. Rob's test is especially hairy, given that each frame has a timing bound of [startTime, endTime), and his test attempts to navigate directly to the startTime of a given frame, a value which gives approximately zero room for error. That makes sense. Should we add a preciseSeek() method with two arguments that does a seek using the given rational time? I draw your attention to Don't Store that in a float http://randomascii.wordpress.com/2012/02/13/dont-store-that-in-a-float/ and its suggestion to use a double starting at 2^32 to avoid the issue around precision changing with magnitude as the time increases. Everything in the Web platform already uses doubles. -- Ian Hickson, http://ln.hixie.ch/
Re: [whatwg] video feedback
On 12/20/12 9:54 AM, Ian Hickson wrote: Everything in the Web platform already uses doubles. Except WebGL. And Audio API wave tables, sample rates, AudioParams, PCM data (though thankfully times in Audio API do use doubles). And graphics libraries used to implement canvas, in many cases... I think the only safe claim about everything in the web platform is that it's all different. ;) -Boris
Re: [whatwg] video feedback
On 2012/12/21 2:54, Ian Hickson wrote: On Thu, 20 Dec 2012, Mark Callow wrote: I draw your attention to Don't Store that in a float http://randomascii.wordpress.com/2012/02/13/dont-store-that-in-a-float/ and its suggestion to use a double starting at 2^32 to avoid the issue around precision changing with magnitude as the time increases. Everything in the Web platform already uses doubles. Yes, except as noted by Boris. The important point is the idea of using 2^32 as zero time which means the precision barely changes across the range of time values of interest to games, videos, etc. Regards -Mark
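To make Mark's point concrete, here is a small JavaScript sketch (illustrative only; ulp is a throwaway helper, not a platform API) of how the spacing between adjacent doubles grows with magnitude, and why starting the clock at 2^32 pins it down:

    // Approximate spacing between adjacent double values near x (one "ulp").
    function ulp(x) {
      var exponent = Math.floor(Math.log(x) / Math.LN2);
      return Math.pow(2, exponent - 52);   // doubles carry 52 fraction bits
    }
    ulp(1);                 // ~2.2e-16 s of precision near t = 1 second
    ulp(86400);             // ~1.5e-11 s near t = one day
    ulp(Math.pow(2, 32));   // ~9.5e-7 s, constant across all of [2^32, 2^33)

Since 2^32 seconds is over a century, a clock biased to start at 2^32 never leaves that constant-precision range in practice.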
Re: [whatwg] video feedback
On Fri, 21 Dec 2012, Mark Callow wrote: On 2012/12/21 2:54, Ian Hickson wrote: On Thu, 20 Dec 2012, Mark Callow wrote: I draw your attention to Don't Store that in a float http://randomascii.wordpress.com/2012/02/13/dont-store-that-in-a-float/ and its suggestion to use a double starting at 2^32 to avoid the issue around precision changing with magnitude as the time increases. Everything in the Web platform already uses doubles. Yes, except as noted by Boris. The important point is the idea of using 2^32 as zero time which means the precision barely changes across the range of time values of interest to games, videos, etc. Ah, well, for video that ship has sailed, really. -- Ian Hickson, http://ln.hixie.ch/
Re: [whatwg] video feedback
On Dec 20, 2012, at 7:27 PM, Mark Callow callow.m...@artspark.co.jp wrote: On 2012/12/21 2:54, Ian Hickson wrote: On Thu, 20 Dec 2012, Mark Callow wrote: I draw your attention to Don't Store that in a float http://randomascii.wordpress.com/2012/02/13/dont-store-that-in-a-float/ and its suggestion to use a double starting at 2^32 to avoid the issue around precision changing with magnitude as the time increases. Everything in the Web platform already uses doubles. Yes, except as noted by Boris. The important point is the idea of using 2^32 as zero time which means the precision barely changes across the range of time values of interest to games, videos, etc. I don't believe the frame accuracy problem in question had to do with precision instability, per se. Many of Rob Coenen's frame accuracy issues were found within the first second of video. Admittedly, this is where the available precision is changing most rapidly, but it is also where available precision is greatest by far. An integral rational number has a benefit over even the 2^32 zero time suggestion: for common time scale values[1], it is intrinsically stable over the range of time t=[0..2^43). It has the added benefit of being exactly the representation used by the underlying media engine. On Dec 17, 2012, at 4:01 PM, Ian Hickson i...@hixie.ch wrote: Should we add a preciseSeek() method with two arguments that does a seek using the given rational time? This method would be more useful if there were a way to retrieve the media's time scale. Otherwise, the script would have to pick an arbitrary scale value, or provide the correct media scale through other means (such as querying the server hosting the media). Additionally, authors like Rob are going to want to retrieve this precise representation of the currentTime. If rational time values were encapsulated into their own interface, a preciseCurrentTime (or similar) read-write attribute could be used instead. -Jer [1] E.g., 1001 is a common time scale for 29.97 and 23.976 FPS video.
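For illustration, the kind of API being discussed might look like the following JavaScript sketch. Every name here (preciseSeek, preciseCurrentTime, the rational timeValue/timeScale pair) is hypothetical, taken from the proposals in this thread rather than from any spec or shipping implementation:

    var video = document.querySelector('video');

    // Hypothetical: seek to the 4th frame of 29.97 fps video, expressed as
    // the exact rational time 3003/30000 (timeValue / timeScale).
    video.preciseSeek(3003, 30000);

    // Hypothetical: read the exact position back in the media's own time
    // scale, avoiding the float rounding that plain currentTime is subject to.
    var t = video.preciseCurrentTime;  // e.g. { timeValue: 3003, timeScale: 30000 }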
Re: [whatwg] video feedback
On Tue, 2 Oct 2012, Jer Noble wrote: On Sep 17, 2012, at 12:43 PM, Ian Hickson i...@hixie.ch wrote: On Mon, 9 Jul 2012, adam k wrote: i'm aware that crooked framerates (i.e. the notorious 29.97) were not supported when frame accuracy was implemented. in my tests, 29.97DF timecodes were incorrect by 1 to 3 frames at any given point. will there ever be support for crooked framerate accuracy? i would be more than happy to contribute whatever i can to help test it and make it possible. can someone comment on this? This is a Quality of Implementation issue, basically. I believe there's nothing inherently in the API that would make accuracy to such timecodes impossible. The nature of floating point math makes precise frame navigation difficult, if not impossible. Rob's test is especially hairy, given that each frame has a timing bound of [startTime, endTime), and his test attempts to navigate directly to the startTime of a given frame, a value which gives approximately zero room for error. I'm most familiar with MPEG containers, but I believe the following is also true of the WebM container: times are represented by a rational number, timeValue / timeScale, where both numerator and denominator are unsigned integers. To seek to a particular media time, we must convert a floating-point time value into this rational time format (e.g. when calculating the 4th frame's start time, from 3 * 1/29.97 to 3 * 1001/30000). If there is a floating-point error in the wrong direction (e.g., as above, a numerator of 3002 vs 3003), the end result will not be the frame's startTime, but one 1/timeScale unit before it. We've fixed some frame accuracy bugs in WebKit (and Chromium) by carefully rounding the incoming floating point time value, taking into account the media's time scale, and rounding to the nearest 1/timeScale value. This fixes Rob's precision test, but at the expense of precision. (I.e. in a 30 fps movie, currentTime = 0.99 / 30 will navigate to the second frame, not the first, due to rounding, which is technically incorrect.) This is a common problem, and Apple media frameworks (for example) therefore provide rational time classes which provide enough accuracy for precise navigation (e.g. QTTime, CMTime). Using a floating point number to represent time with any precision is not generally accepted as good practice when these rational time classes are available. That makes sense. Should we add a preciseSeek() method with two arguments that does a seek using the given rational time? -- Ian Hickson, http://ln.hixie.ch/
Re: [whatwg] video feedback
On Sep 17, 2012, at 12:43 PM, Ian Hickson i...@hixie.ch wrote: On Mon, 9 Jul 2012, adam k wrote: i have a 25fps video, h264, with a burned in timecode. it seems to be off by 1 frame when i compare the burned in timecode to the calculated timecode. i'm using rob coenen's test app at http://www.massive-interactive.nl/html5_video/smpte_test_universal.html to load my own video. what's the process here to report issues? please let me know whatever formal or informal steps are required and i'll gladly follow them. Depends on the browser. Which browser? i'm aware that crooked framerates (i.e. the notorious 29.97) were not supported when frame accuracy was implemented. in my tests, 29.97DF timecodes were incorrect by 1 to 3 frames at any given point. will there ever be support for crooked framerate accuracy? i would be more than happy to contribute whatever i can to help test it and make it possible. can someone comment on this? This is a Quality of Implementation issue, basically. I believe there's nothing inherently in the API that would make accuracy to such timecodes impossible. TL;DR: for precise navigation, you need to use a rational time class, rather than a float value. The nature of floating point math makes precise frame navigation difficult, if not impossible. Rob's test is especially hairy, given that each frame has a timing bound of [startTime, endTime), and his test attempts to navigate directly to the startTime of a given frame, a value which gives approximately zero room for error. I'm most familiar with MPEG containers, but I believe the following is also true of the WebM container: times are represented by a rational number, timeValue / timeScale, where both numerator and denominator are unsigned integers. To seek to a particular media time, we must convert a floating-point time value into this rational time format (e.g. when calculating the 4th frame's start time, from 3 * 1/29.97 to 3 * 1001/30000). If there is a floating-point error in the wrong direction (e.g., as above, a numerator of 3002 vs 3003), the end result will not be the frame's startTime, but one 1/timeScale unit before it. We've fixed some frame accuracy bugs in WebKit (and Chromium) by carefully rounding the incoming floating point time value, taking into account the media's time scale, and rounding to the nearest 1/timeScale value. This fixes Rob's precision test, but at the expense of precision. (I.e. in a 30 fps movie, currentTime = 0.99 / 30 will navigate to the second frame, not the first, due to rounding, which is technically incorrect.) This is a common problem, and Apple media frameworks (for example) therefore provide rational time classes which provide enough accuracy for precise navigation (e.g. QTTime, CMTime). Using a floating point number to represent time with any precision is not generally accepted as good practice when these rational time classes are available. -Jer
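A minimal JavaScript sketch of the rounding strategy Jer describes (the real fix lives inside the media engine; this just shows the arithmetic, assuming the media's timeScale is known):

    // Snap a floating-point time (in seconds) to the nearest 1/timeScale unit.
    function toTimeValue(seconds, timeScale) {
      return Math.round(seconds * timeScale);
    }

    var timeScale = 30000;           // 29.97 fps: each frame lasts 1001 units
    var t = 3 * (1001 / timeScale);  // float start time of the 4th frame
    t * timeScale;                   // 3002.9999... or 3003.0000...01 in binary
    toTimeValue(t, timeScale);       // => 3003: lands exactly on the startTime
    toTimeValue(0.99 / 30, 30);      // => 1: Jer's "second frame" example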
Re: [whatwg] video feedback
On Wed, Oct 3, 2012 at 6:41 AM, Jer Noble jer.no...@apple.com wrote: On Sep 17, 2012, at 12:43 PM, Ian Hickson i...@hixie.ch wrote: On Mon, 9 Jul 2012, adam k wrote: i have a 25fps video, h264, with a burned in timecode. it seems to be off by 1 frame when i compare the burned in timecode to the calculated timecode. i'm using rob coenen's test app at http://www.massive-interactive.nl/html5_video/smpte_test_universal.html to load my own video. what's the process here to report issues? please let me know whatever formal or informal steps are required and i'll gladly follow them. Depends on the browser. Which browser? i'm aware that crooked framerates (i.e. the notorious 29.97) were not supported when frame accuracy was implemented. in my tests, 29.97DF timecodes were incorrect by 1 to 3 frames at any given point. will there ever be support for crooked framerate accuracy? i would be more than happy to contribute whatever i can to help test it and make it possible. can someone comment on this? This is a Quality of Implementation issue, basically. I believe there's nothing inherently in the API that would make accuracy to such timecodes impossible. TL;DR: for precise navigation, you need to use a rational time class, rather than a float value. The nature of floating point math makes precise frame navigation difficult, if not impossible. Rob's test is especially hairy, given that each frame has a timing bound of [startTime, endTime), and his test attempts to navigate directly to the startTime of a given frame, a value which gives approximately zero room for error. I'm most familiar with MPEG containers, but I believe the following is also true of the WebM container: times are represented by a rational number, timeValue / timeScale, where both numerator and denominator are unsigned integers. FYI: the Ogg container also uses rational numbers to represent time. To seek to a particular media time, we must convert a floating-point time value into this rational time format (e.g. when calculating the 4th frame's start time, from 3 * 1/29.97 to 3 * 1001/30000). If there is a floating-point error in the wrong direction (e.g., as above, a numerator of 3002 vs 3003), the end result will not be the frame's startTime, but one 1/timeScale unit before it. We've fixed some frame accuracy bugs in WebKit (and Chromium) by carefully rounding the incoming floating point time value, taking into account the media's time scale, and rounding to the nearest 1/timeScale value. This fixes Rob's precision test, but at the expense of precision. (I.e. in a 30 fps movie, currentTime = 0.99 / 30 will navigate to the second frame, not the first, due to rounding, which is technically incorrect.) This is a common problem, and Apple media frameworks (for example) therefore provide rational time classes which provide enough accuracy for precise navigation (e.g. QTTime, CMTime). Using a floating point number to represent time with any precision is not generally accepted as good practice when these rational time classes are available. -Jer
Re: [whatwg] Video feedback
On Thu, 7 Jul 2011, Eric Winkelman wrote: On Thursday, June 02 Ian Hickson wrote: On Fri, 18 Mar 2011, Eric Winkelman wrote: For in-band metadata tracks, there is neither a standard way to represent the type of metadata in the HTMLTrackElement interface nor is there a standard way to represent multiple different types of metadata tracks. There can be a standard way. The idea is that all the types of metadata tracks that browsers will support should be specified so that all browsers can map them the same way. I'm happy to work with anyone interested in writing such a mapping spec, just let me know. I would be very interested in working on this spec. It would be several specs, probably, each focusing on a particular set of metadata in a particular format (e.g. advertising timings in an MPEG wrapper, or whatever). What's the next step? First, research: what formats and metadata streams are you interested in? Who uses them? How are they implemented in producers and (more importantly) consumers today? What are the use cases? Second, describe the problem: make a clear statement of purpose that scopes the effort to provide guidelines to prevent feature creep. Third, listen to implementors: find those that are interested in implementing this particular mapping of metadata to the DOM API, get their input, see what they want. Fourth, implement: make or have someone else make an experimental implementation of a mapping that addresses the problem described in the earlier steps. Fifth, specify: write a specification that describes the mapping described in step two, based on what you've researched in step one and based on the feedback from steps three and four. Sixth, test: update the experimental implementation to fit the spec, get other implementations to implement the spec. Have real users play with it. Seventh, simplify: remove what you don't need. Finally, iterate: repeat all these steps for as long as there's any interest in this mapping, fixing problems, adding new features if they're needed, removing old features that didn't get used or implemented, etc. -- Ian Hickson, http://ln.hixie.ch/
Re: [whatwg] Video feedback
-----Original Message----- From: whatwg-boun...@lists.whatwg.org [mailto:whatwg-boun...@lists.whatwg.org] On Behalf Of Mark Watson Sent: Monday, June 20, 2011 2:29 AM To: Eric Carlson Cc: Silvia Pfeiffer; whatwg Group; Simon Pieters Subject: Re: [whatwg] Video feedback On Jun 9, 2011, at 4:32 PM, Eric Carlson wrote: On Jun 9, 2011, at 12:02 AM, Silvia Pfeiffer wrote: On Thu, Jun 9, 2011 at 4:34 PM, Simon Pieters sim...@opera.com wrote: On Thu, 09 Jun 2011 03:47:49 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: For commercial video providers, the tracks in a live stream change all the time; this is not limited to audio and video tracks but would include text tracks as well. OK, all this indicates to me that we probably want a metadatachanged event to indicate there has been a change and that JS may need to check some of its assumptions. We already have durationchange. Duration is metadata. If we want to support changes to width/height, and the script is interested in when that happens, maybe there should be a dimensionchange event (but what's the use case for changing width/height mid-stream?). Does the spec support changes to text tracks mid-stream? It's not about what the spec supports, but what real-world streams provide. I don't think it makes sense to put an event on every single type of metadata that can change. Most of the time, when you have a stream change, many variables will change together, so a single event is a lot less events to raise. It's an event that signifies that the media framework has reset the video/audio decoding pipeline and loaded a whole bunch of new stuff. You should imagine it as a concatenation of different media resources. And yes, they can have different track constitution and different audio sampling rate (which the audio API will care about) etc etc. In addition, it is possible for a stream to lose or gain an audio track. In this case the dimensions won't change but a script may want to react to the change in audioTracks. The TrackList object has an onchanged event, which I assumed would fire when any of the information in the TrackList changes (e.g. tracks added or removed). But actually the spec doesn't state when this event fires (as far as I could tell - unless it is implied by some general definition of events called onchanged). Should there be some clarification here ? I agree with Silvia, a more generic metadata changed event makes more sense. Yes, and it should support the case in which text tracks are added/removed too. Has there been a bug submitted to add a metadata changed event when video, audio or text tracks are added or deleted from a media resource? Thanks, Bob Lund Also, as Eric (C) pointed out, one of the things which can change is which of several available versions of the content is being rendered (for adaptive bitrate cases). This doesn't necessarily change any of the metadata currently exposed on the video element, but nevertheless it's information that the application may need. It would be nice to expose some kind of identifier for the currently rendered stream and have an event when this changes. I think that a stream-format-supplied identifier would be sufficient. ...Mark eric
Re: [whatwg] Video feedback
On Thursday, June 02 Ian Hickson wrote: On Fri, 18 Mar 2011, Eric Winkelman wrote: For in-band metadata tracks, there is neither a standard way to represent the type of metadata in the HTMLTrackElement interface nor is there a standard way to represent multiple different types of metadata tracks. There can be a standard way. The idea is that all the types of metadata tracks that browsers will support should be specified so that all browsers can map them the same way. I'm happy to work with anyone interested in writing such a mapping spec, just let me know. I would be very interested in working on this spec. CableLabs works with numerous groups delivering content containing a variety of metadata, so we have a good idea what is currently used. We're also working with the groups defining adaptive bit rate delivery protocols about how metadata might be carried. What's the next step? Eric
Re: [whatwg] Video feedback
On Jun 9, 2011, at 4:32 PM, Eric Carlson wrote: On Jun 9, 2011, at 12:02 AM, Silvia Pfeiffer wrote: On Thu, Jun 9, 2011 at 4:34 PM, Simon Pieters sim...@opera.com wrote: On Thu, 09 Jun 2011 03:47:49 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: For commercial video providers, the tracks in a live stream change all the time; this is not limited to audio and video tracks but would include text tracks as well. OK, all this indicates to me that we probably want a metadatachanged event to indicate there has been a change and that JS may need to check some of its assumptions. We already have durationchange. Duration is metadata. If we want to support changes to width/height, and the script is interested in when that happens, maybe there should be a dimensionchange event (but what's the use case for changing width/height mid-stream?). Does the spec support changes to text tracks mid-stream? It's not about what the spec supports, but what real-world streams provide. I don't think it makes sense to put an event on every single type of metadata that can change. Most of the time, when you have a stream change, many variables will change together, so a single event is a lot less events to raise. It's an event that signifies that the media framework has reset the video/audio decoding pipeline and loaded a whole bunch of new stuff. You should imagine it as a concatenation of different media resources. And yes, they can have different track constitution and different audio sampling rate (which the audio API will care about) etc etc. In addition, it is possible for a stream to lose or gain an audio track. In this case the dimensions won't change but a script may want to react to the change in audioTracks. The TrackList object has an onchanged event, which I assumed would fire when any of the information in the TrackList changes (e.g. tracks added or removed). But actually the spec doesn't state when this event fires (as far as I could tell - unless it is implied by some general definition of events called onchanged). Should there be some clarification here ? I agree with Silvia, a more generic metadata changed event makes more sense. Yes, and it should support the case in which text tracks are added/removed too. Also, as Eric (C) pointed out, one of the things which can change is which of several available versions of the content is being rendered (for adaptive bitrate cases). This doesn't necessarily change any of the metadata currently exposed on the video element, but nevertheless it's information that the application may need. It would be nice to expose some kind of identifier for the currently rendered stream and have an event when this changes. I think that a stream-format-supplied identifier would be sufficient. ...Mark eric
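For illustration, script relying on the event being discussed might look like this JavaScript sketch. The 'metadatachange' event name is hypothetical (it is only being proposed in this thread), and audioTracks is the multi-track API under discussion at the time:

    var video = document.querySelector('video');
    video.addEventListener('metadatachange', function () {
      // The decoding pipeline was reset mid-stream: re-check any cached
      // assumptions about dimensions, tracks, sample rates, etc.
      console.log('dimensions:', video.videoWidth, 'x', video.videoHeight);
      console.log('audio tracks:', video.audioTracks.length);
    });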
Re: [whatwg] Video feedback
On Mon, Jun 20, 2011 at 6:29 PM, Mark Watson wats...@netflix.com wrote: On Jun 9, 2011, at 4:32 PM, Eric Carlson wrote: On Jun 9, 2011, at 12:02 AM, Silvia Pfeiffer wrote: On Thu, Jun 9, 2011 at 4:34 PM, Simon Pieters sim...@opera.com wrote: On Thu, 09 Jun 2011 03:47:49 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: For commercial video providers, the tracks in a live stream change all the time; this is not limited to audio and video tracks but would include text tracks as well. OK, all this indicates to me that we probably want a metadatachanged event to indicate there has been a change and that JS may need to check some of its assumptions. We already have durationchange. Duration is metadata. If we want to support changes to width/height, and the script is interested in when that happens, maybe there should be a dimensionchange event (but what's the use case for changing width/height mid-stream?). Does the spec support changes to text tracks mid-stream? It's not about what the spec supports, but what real-world streams provide. I don't think it makes sense to put an event on every single type of metadata that can change. Most of the time, when you have a stream change, many variables will change together, so a single event is a lot less events to raise. It's an event that signifies that the media framework has reset the video/audio decoding pipeline and loaded a whole bunch of new stuff. You should imagine it as a concatenation of different media resources. And yes, they can have different track constitution and different audio sampling rate (which the audio API will care about) etc etc. In addition, it is possible for a stream to lose or gain an audio track. In this case the dimensions won't change but a script may want to react to the change in audioTracks. The TrackList object has an onchanged event, which I assumed would fire when any of the information in the TrackList changes (e.g. tracks added or removed). But actually the spec doesn't state when this event fires (as far as I could tell - unless it is implied by some general definition of events called onchanged). Should there be some clarification here ? I understood that to relate to a change of cues only, since it is on the tracklist. I.e. it's an aggregate event from the oncuechange event of a cue inside the track. I didn't think it would relate to a change of existence of that track. Note that the event is attached to the TrackList, not the TrackList[], so it cannot be raised when a track is added or removed, only when something inside the TrackList changes. I agree with Silvia, a more generic metadata changed event makes more sense. Yes, and it should support the case in which text tracks are added/removed too. Yes, it needs to be an event on the MediaElement. Also, as Eric (C) pointed out, one of the things which can change is which of several available versions of the content is being rendered (for adaptive bitrate cases). This doesn't necessarily change any of the metadata currently exposed on the video element, but nevertheless it's information that the application may need. It would be nice to expose some kind of identifier for the currently rendered stream and have an event when this changes. I think that a stream-format-supplied identifier would be sufficient. I don't know about the adaptive streaming situation. I think that is more about statistics/metrics rather than about change of resource.
All the alternatives in an adaptive streaming resource should provide the same number of tracks and the same video dimensions, just at different bitrate/quality, no? Different video dimensions should be provided through the source element and @media attribute, but within an adaptive stream, the alternatives should be consistent because the target device won't change. I guess this is a discussion for another thread... :-) Cheers, Silvia.
Re: [whatwg] Video feedback
On Jun 20, 2011, at 10:42 AM, Silvia Pfeiffer wrote: On Mon, Jun 20, 2011 at 6:29 PM, Mark Watson wats...@netflix.com wrote: On Jun 9, 2011, at 4:32 PM, Eric Carlson wrote: On Jun 9, 2011, at 12:02 AM, Silvia Pfeiffer wrote: On Thu, Jun 9, 2011 at 4:34 PM, Simon Pieters sim...@opera.com wrote: On Thu, 09 Jun 2011 03:47:49 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: For commercial video providers, the tracks in a live stream change all the time; this is not limited to audio and video tracks but would include text tracks as well. OK, all this indicates to me that we probably want a metadatachanged event to indicate there has been a change and that JS may need to check some of its assumptions. We already have durationchange. Duration is metadata. If we want to support changes to width/height, and the script is interested in when that happens, maybe there should be a dimensionchange event (but what's the use case for changing width/height mid-stream?). Does the spec support changes to text tracks mid-stream? It's not about what the spec supports, but what real-world streams provide. I don't think it makes sense to put an event on every single type of metadata that can change. Most of the time, when you have a stream change, many variables will change together, so a single event is a lot less events to raise. It's an event that signifies that the media framework has reset the video/audio decoding pipeline and loaded a whole bunch of new stuff. You should imagine it as a concatenation of different media resources. And yes, they can have different track constitution and different audio sampling rate (which the audio API will care about) etc etc. In addition, it is possible for a stream to lose or gain an audio track. In this case the dimensions won't change but a script may want to react to the change in audioTracks. The TrackList object has an onchanged event, which I assumed would fire when any of the information in the TrackList changes (e.g. tracks added or removed). But actually the spec doesn't state when this event fires (as far as I could tell - unless it is implied by some general definition of events called onchanged). Should there be some clarification here ? I understood that to relate to a change of cues only, since it is on the tracklist. I.e. it's an aggregate event from the oncuechange event of a cue inside the track. I didn't think it would relate to a change of existence of that track. Note that the event is attached to the TrackList, not the TrackList[], so it cannot be raised when a track is added or removed, only when something inside the TrackList changes. Are we talking about the same thing ? There is no TrackList array and TrackList is only used for audio/video, not text, so I don't understand the comment about cues. I'm talking about http://www.whatwg.org/specs/web-apps/current-work/multipage/the-iframe-element.html#tracklist which is the base class for MultipleTrackList and ExclusiveTrackList used to represent all the audio and video tracks (respectively). One instance of the object represents all the tracks, so I would assume that a change in the number of tracks is a change to this object. I agree with Silvia, a more generic metadata changed event makes more sense. Yes, and it should support the case in which text tracks are added/removed too. Yes, it needs to be an event on the MediaElement.
Also, as Eric (C) pointed out, one of the things which can change is which of several available versions of the content is being rendered (for adaptive bitrate cases). This doesn't necessarily change any of the metadata currently exposed on the video element, but nevertheless it's information that the application may need. It would be nice to expose some kind of identifier for the currently rendered stream and have an event when this changes. I think that a stream-format-supplied identifier would be sufficient. I don't know about the adaptive streaming situation. I think that is more about statistics/metrics rather than about change of resource. All the alternatives in an adaptive streaming resource should provide the same number of tracks and the same video dimensions, just at different bitrate/quality, no? Different video dimensions should be provided through the source element and @media attribute, but within an adaptive stream, the alternatives should be consistent because the target device won't change. I guess this is a discussion for another thread... :-) I think of the different adaptive versions on a per-track basis (i.e. the alternatives are *within* each track), not a bunch of alternatives each of which contains several tracks. Both are possible, of course. It's certainly possible (indeed common) for different bitrate video encodings to have different resolutions - there are video encoding reasons to do this. Of course the aspect ratio should not change and nor should the dimensions on the screen (both would be a little peculiar for the user). Now, the videoWidth and videoHeight attributes of HTMLVideoElement are not the same as the resolution (for a start, they are in CSS pixels, which are square), but I think it quite likely that if the resolution of the video changes then the videoWidth and videoHeight might change. I'd be interested to hear how existing implementations relate resolution to videoWidth and videoHeight.
Re: [whatwg] Video feedback
On Mon, Jun 20, 2011 at 7:31 PM, Mark Watson wats...@netflix.com wrote: The TrackList object has an onchanged event, which I assumed would fire when any of the information in the TrackList changes (e.g. tracks added or removed). But actually the spec doesn't state when this event fires (as far as I could tell - unless it is implied by some general definition of events called onchanged). Should there be some clarification here ? I understood that to relate to a change of cues only, since it is on the tracklist. I.e. it's an aggregate event from the oncuechange event of a cue inside the track. I didn't think it would relate to a change of existence of that track. Note that the event is attached to the TrackList, not the TrackList[], so it cannot be raised when a track is added or removed, only when something inside the TrackList changes. Are we talking about the same thing ? There is no TrackList array and TrackList is only used for audio/video, not text, so I don't understand the comment about cues. I'm talking about http://www.whatwg.org/specs/web-apps/current-work/multipage/the-iframe-element.html#tracklist which is the base class for MultipleTrackList and ExclusiveTrackList used to represent all the audio and video tracks (respectively). One instance of the object represents all the tracks, so I would assume that a change in the number of tracks is a change to this object. Ah yes, you're right: I got confused. It says "Whenever the selected track is changed, the user agent must queue a task to fire a simple event named change at the MultipleTrackList object." This means it fires when the selectedIndex is changed, i.e. the user chooses a different track for rendering. I still don't think it relates to changes in the composition of tracks of a resource. That should be something different and should probably be on the MediaElement and not on the track list to also cover changes in text tracks. Also, as Eric (C) pointed out, one of the things which can change is which of several available versions of the content is being rendered (for adaptive bitrate cases). This doesn't necessarily change any of the metadata currently exposed on the video element, but nevertheless it's information that the application may need. It would be nice to expose some kind of identifier for the currently rendered stream and have an event when this changes. I think that a stream-format-supplied identifier would be sufficient. I don't know about the adaptive streaming situation. I think that is more about statistics/metrics rather than about change of resource. All the alternatives in an adaptive streaming resource should provide the same number of tracks and the same video dimensions, just at different bitrate/quality, no? I think of the different adaptive versions on a per-track basis (i.e. the alternatives are *within* each track), not a bunch of alternatives each of which contains several tracks. Both are possible, of course. It's certainly possible (indeed common) for different bitrate video encodings to have different resolutions - there are video encoding reasons to do this. Of course the aspect ratio should not change and nor should the dimensions on the screen (both would be a little peculiar for the user). Now, the videoWidth and videoHeight attributes of HTMLVideoElement are not the same as the resolution (for a start, they are in CSS pixels, which are square), but I think it quite likely that if the resolution of the video changes then the videoWidth and videoHeight might change.
I'd be interested to hear how existing implementations relate resolution to videoWidth and videoHeight. Well, if videoWidth and videoHeight change and no dimensions on the video are provided through CSS, then surely the video will change size and the display will shrink. That would be a terrible user experience. For that reason I would suggest that such a change not be made in alternative adaptive streams. Different video dimensions should be provided through the source element and @media attribute, but within an adaptive stream, the alternatives should be consistent because the target device won't change. I guess this is a discussion for another thread... :-) Possibly ;-) The device knows much better than the page author what capabilities it has and so what resolutions are suitable for the device. So it is better to provide all the alternatives as a single resource and have the device work out which subset it can support. Or at least, the list should be provided all at the same level - there is no rationale for a hierarchy of alternatives. The way in which HTML deals with different devices and their different capabilities is through media queries. As an author you provide your content with different versions of media-dependent style sheets and content, so that when you view the page with a different device, the capabilities of the device select the right style sheet and
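To spell out the distinction Silvia draws above about the track list 'change' event: under the draft multi-track API being discussed, the only specced notification fires on selection, roughly usable like this sketch (names follow the draft as quoted in this thread, not a shipping implementation):

    var video = document.querySelector('video');
    // Per the draft quoted above, 'change' fires on the MultipleTrackList
    // when a different track is selected for rendering -- not when tracks
    // are added to or removed from the resource mid-stream.
    video.audioTracks.onchange = function () {
      console.log('selection changed; selectedIndex is now',
                  video.audioTracks.selectedIndex);
    };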
Re: [whatwg] Video feedback
On Jun 20, 2011, at 11:52 AM, Silvia Pfeiffer wrote: On Mon, Jun 20, 2011 at 7:31 PM, Mark Watson wats...@netflix.com wrote: The TrackList object has an onchanged event, which I assumed would fire when any of the information in the TrackList changes (e.g. tracks added or removed). But actually the spec doesn't state when this event fires (as far as I could tell - unless it is implied by some general definition of events called onchanged). Should there be some clarification here ? I understood that to relate to a change of cues only, since it is on the tracklist. I.e. it's an aggregate event from the oncuechange event of a cue inside the track. I didn't think it would relate to a change of existence of that track. Note that the event is attached to the TrackList, not the TrackList[], so it cannot be raised when a track is added or removed, only when something inside the TrackList changes. Are we talking about the same thing ? There is no TrackList array and TrackList is only used for audio/video, not text, so I don't understand the comment about cues. I'm talking about http://www.whatwg.org/specs/web-apps/current-work/multipage/the-iframe-element.html#tracklist which is the base class for MultipleTrackList and ExclusiveTrackList used to represent all the audio and video tracks (respectively). One instance of the object represents all the tracks, so I would assume that a change in the number of tracks is a change to this object. Ah yes, you're right: I got confused. It says "Whenever the selected track is changed, the user agent must queue a task to fire a simple event named change at the MultipleTrackList object." This means it fires when the selectedIndex is changed, i.e. the user chooses a different track for rendering. I still don't think it relates to changes in the composition of tracks of a resource. That should be something different and should probably be on the MediaElement and not on the track list to also cover changes in text tracks. Fair enough. Also, as Eric (C) pointed out, one of the things which can change is which of several available versions of the content is being rendered (for adaptive bitrate cases). This doesn't necessarily change any of the metadata currently exposed on the video element, but nevertheless it's information that the application may need. It would be nice to expose some kind of identifier for the currently rendered stream and have an event when this changes. I think that a stream-format-supplied identifier would be sufficient. I don't know about the adaptive streaming situation. I think that is more about statistics/metrics rather than about change of resource. All the alternatives in an adaptive streaming resource should provide the same number of tracks and the same video dimensions, just at different bitrate/quality, no? I think of the different adaptive versions on a per-track basis (i.e. the alternatives are *within* each track), not a bunch of alternatives each of which contains several tracks. Both are possible, of course. It's certainly possible (indeed common) for different bitrate video encodings to have different resolutions - there are video encoding reasons to do this. Of course the aspect ratio should not change and nor should the dimensions on the screen (both would be a little peculiar for the user).
Now, the videoWidth and videoHeight attributes of HTMLVideoElement are not the same as the resolution (for a start, they are in CSS pixels, which are square), but I think it quite likely that if the resolution of the video changes than the videoWidth and videoHeight might change. I'd be interested to hear how existing implementations relate resolution to videoWidth and videoHeight. Well, if videoWidth and videoHeight change and no dimensions on the video are provided through CSS, then surely the video will change size and the display will shrink. That would be a terrible user experience. For that reason I would suggest that such a change not be made in alternative adaptive streams. That seems backwards to me! I would say For that reason I would suggest that dimensions are provided through CSS or through the width and height attributes. Alternatively, we change the specification of the video element to accommodate this aspect of adaptive streaming (for example, the videoWidth and videoHeight could be defined to be based on the highest resolution bitrate being considered.) There are good video encoding reasons for different bitrates to be encoded at different resolutions which are far more important than any reasons not to do either of the above. Different video dimensions should be provided through the source element and @media attribute, but within an adaptive stream, the alternatives should be consistent because the target device won't change. I guess this is a discussion for another thread... :-) Possibly ;-) The device knows much
Re: [whatwg] Video feedback
On Tue, Jun 21, 2011 at 12:07 AM, Mark Watson wats...@netflix.com wrote: On Jun 20, 2011, at 11:52 AM, Silvia Pfeiffer wrote: On Mon, Jun 20, 2011 at 7:31 PM, Mark Watson wats...@netflix.com wrote: The TrackList object has an onchanged event, which I assumed would fire when any of the information in the TrackList changes (e.g. tracks added or removed). But actually the spec doesn't state when this event fires (as far as I could tell - unless it is implied by some general definition of events called onchanged). Should there be some clarification here ? I understood that to relate to a change of cues only, since it is on the tracklist. I.e. it's an aggregate event from the oncuechange event of a cue inside the track. I didn't think it would relate to a change of existence of that track. Note that the event is attached to the TrackList, not the TrackList[], so it cannot be raised when a track is added or removed, only when something inside the TrackList changes. Are we talking about the same thing ? There is no TrackList array and TrackList is only used for audio/video, not text, so I don't understand the comment about cues. I'm talking about http://www.whatwg.org/specs/web-apps/current-work/multipage/the-iframe-element.html#tracklist which is the base class for MultipleTrackList and ExclusiveTrackList used to represent all the audio and video tracks (respectively). One instance of the object represents all the tracks, so I would assume that a change in the number of tracks is a change to this object. Ah yes, you're right: I got confused. It says "Whenever the selected track is changed, the user agent must queue a task to fire a simple event named change at the MultipleTrackList object." This means it fires when the selectedIndex is changed, i.e. the user chooses a different track for rendering. I still don't think it relates to changes in the composition of tracks of a resource. That should be something different and should probably be on the MediaElement and not on the track list to also cover changes in text tracks. Fair enough. Also, as Eric (C) pointed out, one of the things which can change is which of several available versions of the content is being rendered (for adaptive bitrate cases). This doesn't necessarily change any of the metadata currently exposed on the video element, but nevertheless it's information that the application may need. It would be nice to expose some kind of identifier for the currently rendered stream and have an event when this changes. I think that a stream-format-supplied identifier would be sufficient. I don't know about the adaptive streaming situation. I think that is more about statistics/metrics rather than about change of resource. All the alternatives in an adaptive streaming resource should provide the same number of tracks and the same video dimensions, just at different bitrate/quality, no? I think of the different adaptive versions on a per-track basis (i.e. the alternatives are *within* each track), not a bunch of alternatives each of which contains several tracks. Both are possible, of course. It's certainly possible (indeed common) for different bitrate video encodings to have different resolutions - there are video encoding reasons to do this. Of course the aspect ratio should not change and nor should the dimensions on the screen (both would be a little peculiar for the user).
Now, the videoWidth and videoHeight attributes of HTMLVideoElement are not the same as the resolution (for a start, they are in CSS pixels, which are square), but I think it quite likely that if the resolution of the video changes than the videoWidth and videoHeight might change. I'd be interested to hear how existing implementations relate resolution to videoWidth and videoHeight. Well, if videoWidth and videoHeight change and no dimensions on the video are provided through CSS, then surely the video will change size and the display will shrink. That would be a terrible user experience. For that reason I would suggest that such a change not be made in alternative adaptive streams. That seems backwards to me! I would say For that reason I would suggest that dimensions are provided through CSS or through the width and height attributes. Alternatively, we change the specification of the video element to accommodate this aspect of adaptive streaming (for example, the videoWidth and videoHeight could be defined to be based on the highest resolution bitrate being considered.) There are good video encoding reasons for different bitrates to be encoded at different resolutions which are far more important than any reasons not to do either of the above. Different video dimensions should be provided through the source element and @media attribute, but within an adaptive stream, the alternatives should be consistent because the target device won't change. I guess this is a
Re: [whatwg] Video feedback
On Jun 20, 2011, at 5:28 PM, Silvia Pfeiffer wrote: On Tue, Jun 21, 2011 at 12:07 AM, Mark Watson wats...@netflix.com wrote: On Jun 20, 2011, at 11:52 AM, Silvia Pfeiffer wrote: On Mon, Jun 20, 2011 at 7:31 PM, Mark Watson wats...@netflix.com wrote: The TrackList object has an onchanged event, which I assumed would fire when any of the information in the TrackList changes (e.g. tracks added or removed). But actually the spec doesn't state when this event fires (as far as I could tell - unless it is implied by some general definition of events called onchanged). Should there be some clarification here ? I understood that to relate to a change of cues only, since it is on the tracklist. I.e. it's an aggregate event from the oncuechange event of a cue inside the track. I didn't think it would relate to a change of existence of that track. Note that the event is attached to the TrackList, not the TrackList[], so it cannot be raised when a track is added or removed, only when something inside the TrackList changes. Are we talking about the same thing ? There is no TrackList array and TrackList is only used for audio/video, not text, so I don't understand the comment about cues. I'm talking about http://www.whatwg.org/specs/web-apps/current-work/multipage/the-iframe-element.html#tracklist which is the base class for MultipleTrackList and ExclusiveTrackList used to represent all the audio and video tracks (respectively). One instance of the object represents all the tracks, so I would assume that a change in the number of tracks is a change to this object. Ah yes, you're right: I got confused. It says "Whenever the selected track is changed, the user agent must queue a task to fire a simple event named change at the MultipleTrackList object." This means it fires when the selectedIndex is changed, i.e. the user chooses a different track for rendering. I still don't think it relates to changes in the composition of tracks of a resource. That should be something different and should probably be on the MediaElement and not on the track list to also cover changes in text tracks. Fair enough. Also, as Eric (C) pointed out, one of the things which can change is which of several available versions of the content is being rendered (for adaptive bitrate cases). This doesn't necessarily change any of the metadata currently exposed on the video element, but nevertheless it's information that the application may need. It would be nice to expose some kind of identifier for the currently rendered stream and have an event when this changes. I think that a stream-format-supplied identifier would be sufficient. I don't know about the adaptive streaming situation. I think that is more about statistics/metrics rather than about change of resource. All the alternatives in an adaptive streaming resource should provide the same number of tracks and the same video dimensions, just at different bitrate/quality, no? I think of the different adaptive versions on a per-track basis (i.e. the alternatives are *within* each track), not a bunch of alternatives each of which contains several tracks. Both are possible, of course. It's certainly possible (indeed common) for different bitrate video encodings to have different resolutions - there are video encoding reasons to do this. Of course the aspect ratio should not change and nor should the dimensions on the screen (both would be a little peculiar for the user).
Now, the videoWidth and videoHeight attributes of HTMLVideoElement are not the same as the resolution (for a start, they are in CSS pixels, which are square), but I think it quite likely that if the resolution of the video changes than the videoWidth and videoHeight might change. I'd be interested to hear how existing implementations relate resolution to videoWidth and videoHeight. Well, if videoWidth and videoHeight change and no dimensions on the video are provided through CSS, then surely the video will change size and the display will shrink. That would be a terrible user experience. For that reason I would suggest that such a change not be made in alternative adaptive streams. That seems backwards to me! I would say For that reason I would suggest that dimensions are provided through CSS or through the width and height attributes. Alternatively, we change the specification of the video element to accommodate this aspect of adaptive streaming (for example, the videoWidth and videoHeight could be defined to be based on the highest resolution bitrate being considered.) There are good video encoding reasons for different bitrates to be encoded at different resolutions which are far more important than any reasons not to do either of the above. Different video dimensions should be provided through the source element and @media attribute, but within an adaptive stream, the alternatives
Re: [whatwg] Video feedback
On Thu, 09 Jun 2011 03:47:49 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: For commercial video providers, the tracks in a live stream change all the time; this is not limited to audio and video tracks but would include text tracks as well. OK, all this indicates to me that we probably want a metadatachanged event to indicate there has been a change and that JS may need to check some of its assumptions. We already have durationchange. Duration is metadata. If we want to support changes to width/height, and the script is interested in when that happens, maybe there should be a dimensionchange event (but what's the use case for changing width/height mid-stream?). Does the spec support changes to text tracks mid-stream? -- Simon Pieters Opera Software
Re: [whatwg] Video feedback
On Thu, Jun 9, 2011 at 4:34 PM, Simon Pieters sim...@opera.com wrote: On Thu, 09 Jun 2011 03:47:49 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: For commercial video providers, the tracks in a live stream change all the time; this is not limited to audio and video tracks but would include text tracks as well. OK, all this indicates to me that we probably want a metadatachanged event to indicate there has been a change and that JS may need to check some of its assumptions. We already have durationchange. Duration is metadata. If we want to support changes to width/height, and the script is interested in when that happens, maybe there should be a dimensionchange event (but what's the use case for changing width/height mid-stream?). Does the spec support changes to text tracks mid-stream? It's not about what the spec supports, but what real-world streams provide. I don't think it makes sense to put an event on every single type of metadata that can change. Most of the time, when you have a stream change, many variables will change together, so a single event is a lot less events to raise. It's an event that signifies that the media framework has reset the video/audio decoding pipeline and loaded a whole bunch of new stuff. You should imagine it as a concatenation of different media resources. And yes, they can have different track constitution and different audio sampling rate (which the audio API will care about) etc etc. The durationchange is a different type of event. It has not much to do with having a change of a media format, but more one with getting new information that more data is available than previously expected. It's one that allows streaming of long video resources, even if they are just of a single encoding setting. In contrast what we are talking about is that the encoding settings change mid-stream. Cheers, Silvia.
Re: [whatwg] Video feedback
On Jun 9, 2011, at 12:02 AM, Silvia Pfeiffer wrote: On Thu, Jun 9, 2011 at 4:34 PM, Simon Pieters sim...@opera.com wrote: On Thu, 09 Jun 2011 03:47:49 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: For commercial video providers, the tracks in a live stream change all the time; this is not limited to audio and video tracks but would include text tracks as well. OK, all this indicates to me that we probably want a metadatachanged event to indicate there has been a change and that JS may need to check some of its assumptions. We already have durationchange. Duration is metadata. If we want to support changes to width/height, and the script is interested in when that happens, maybe there should be a dimensionchange event (but what's the use case for changing width/height mid-stream?). Does the spec support changes to text tracks mid-stream? It's not about what the spec supports, but what real-world streams provide. I don't think it makes sense to put an event on every single type of metadata that can change. Most of the time, when you have a stream change, many variables will change together, so a single event is a lot less events to raise. It's an event that signifies that the media framework has reset the video/audio decoding pipeline and loaded a whole bunch of new stuff. You should imagine it as a concatenation of different media resources. And yes, they can have different track constitution and different audio sampling rate (which the audio API will care about) etc etc. In addition, it is possible for a stream to lose or gain an audio track. In this case the dimensions won't change but a script may want to react to the change in audioTracks. I agree with Silvia, a more generic metadata changed event makes more sense. eric
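The track half of this was eventually addressed by the audioTracks/videoTracks APIs, whose track lists fire addtrack and removetrack events; a sketch of how a script might react, assuming an implementation that exposes them:

  var v = document.querySelector('video');
  if (v.audioTracks) {
    v.audioTracks.addEventListener('addtrack', function (e) {
      console.log('audio track added: ' + e.track.label);
    });
    v.audioTracks.addEventListener('removetrack', function () {
      console.log('an audio track was removed; re-check assumptions');
    });
  }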
Re: [whatwg] Video feedback
On Wed, 08 Jun 2011 02:46:15 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: On Tue, Jun 7, 2011 at 7:04 PM, Philip Jägenstedt phil...@opera.com wrote: On Sat, 04 Jun 2011 03:39:58 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: On Fri, Jun 3, 2011 at 9:28 AM, Ian Hickson i...@hixie.ch wrote: On Thu, 16 Dec 2010, Silvia Pfeiffer wrote: I do not know how technically the change of stream composition works in MPEG, but in Ogg we have to end a current stream and start a new one to switch compositions. This has been called sequential multiplexing or chaining. In this case, stream setup information is repeated, which would probably lead to creating a new stream handler and possibly a new firing of loadedmetadata. I am not sure how chaining is implemented in browsers. Per spec, chaining isn't currently supported. The closest thing I can find in the spec to this situation is handling a non-fatal error, which causes the unexpected content to be ignored. On Fri, 17 Dec 2010, Eric Winkelman wrote: The short answer for changing stream composition is that there is a Program Map Table (PMT) that is repeated every 100 milliseconds and describes the content of the stream. Depending on the programming, the stream's composition could change entering/exiting every advertisement. If this is something that browser vendors want to support, I can specify how to handle it. Anyone? Icecast streams have chained files, so streaming Ogg to an audio element would hit this problem. There is a bug in FF for this: https://bugzilla.mozilla.org/show_bug.cgi?id=455165 (and a duplicate bug at https://bugzilla.mozilla.org/show_bug.cgi?id=611519). There's also a webkit bug for icecast streaming, which is probably related https://bugs.webkit.org/show_bug.cgi?id=42750 . I'm not sure how Opera is able to deal with icecast streams, but it seems to deal with it. The thing is: you can implement playback and seeking without any further changes to the spec. But then the browser-internal metadata states will change depending on the chunk you're on. Should that also update the exposed metadata in the API then? Probably yes, because otherwise the JS developer may deal with contradictory information. Maybe we need a metadatachange event for this? An Icecast stream is conceptually just one infinite audio stream, even though at the container level it is several chained Ogg streams. duration will be Infinity and currentTime will be constantly increasing. This doesn't seem to be a case where any spec change is needed. Am I missing something? That is all correct. However, because it is a sequence of Ogg streams, there are new Ogg headers in the middle. These new Ogg headers will lead to new metadata loaded in the media framework - e.g. because the new Ogg stream is encoded with a different audio sampling rate and a different video width/height etc. So, therefore, the metadata in the media framework changes. However, what the browser reports to the JS developer doesn't change. Or if it does change, the JS developer is not informed of it because it is a single infinite audio (or video) stream. Thus the question whether we need a new metadatachange event to expose this to the JS developer. It would then also signify that potentially the number of tracks that are available may have changed and other such information. Nothing exposed via the current API would change, AFAICT. I agree that if we start exposing things like sampling rate or want to support arbitrary chained Ogg, then there is a problem. 
-- Philip Jägenstedt Core Developer Opera Software
Re: [whatwg] Video feedback
On Wed, Jun 8, 2011 at 6:14 PM, Philip Jägenstedt phil...@opera.com wrote: On Wed, 08 Jun 2011 02:46:15 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: On Tue, Jun 7, 2011 at 7:04 PM, Philip Jägenstedt phil...@opera.com wrote: On Sat, 04 Jun 2011 03:39:58 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: On Fri, Jun 3, 2011 at 9:28 AM, Ian Hickson i...@hixie.ch wrote: On Thu, 16 Dec 2010, Silvia Pfeiffer wrote: I do not know how technically the change of stream composition works in MPEG, but in Ogg we have to end a current stream and start a new one to switch compositions. This has been called sequential multiplexing or chaining. In this case, stream setup information is repeated, which would probably lead to creating a new steam handler and possibly a new firing of loadedmetadata. I am not sure how chaining is implemented in browsers. Per spec, chaining isn't currently supported. The closest thing I can find in the spec to this situation is handling a non-fatal error, which causes the unexpected content to be ignored. On Fri, 17 Dec 2010, Eric Winkelman wrote: The short answer for changing stream composition is that there is a Program Map Table (PMT) that is repeated every 100 milliseconds and describes the content of the stream. Depending on the programming, the stream's composition could change entering/exiting every advertisement. If this is something that browser vendors want to support, I can specify how to handle it. Anyone? Icecast streams have chained files, so streaming Ogg to an audio element would hit this problem. There is a bug in FF for this: https://bugzilla.mozilla.org/show_bug.cgi?id=455165 (and a duplicate bug at https://bugzilla.mozilla.org/show_bug.cgi?id=611519). There's also a webkit bug for icecast streaming, which is probably related https://bugs.webkit.org/show_bug.cgi?id=42750 . I'm not sure how Opera is able to deal with icecast streams, but it seems to deal with it. The thing is: you can implement playback and seeking without any further changes to the spec. But then the browser-internal metadata states will change depending on the chunk you're on. Should that also update the exposed metadata in the API then? Probably yes, because otherwise the JS developer may deal with contradictory information. Maybe we need a metadatachange event for this? An Icecast stream is conceptually just one infinite audio stream, even though at the container level it is several chained Ogg streams. duration will be Infinity and currentTime will be constantly increasing. This doesn't seem to be a case where any spec change is needed. Am I missing something? That is all correct. However, because it is a sequence of Ogg streams, there are new Ogg headers in the middle. These new Ogg headers will lead to new metadata loaded in the media framework - e.g. because the new Ogg stream is encoded with a different audio sampling rate and a different video width/height etc. So, therefore, the metadata in the media framework changes. However, what the browser reports to the JS developer doesn't change. Or if it does change, the JS developer is not informed of it because it is a single infinite audio (or video) stream. Thus the question whether we need a new metadatachange event to expose this to the JS developer. It would then also signify that potentially the number of tracks that are available may have changed and other such information. Nothing exposed via the current API would change, AFAICT. 
Thus, after a change mid-stream to, say, a smaller video width and height, would the video.videoWidth and video.videoHeight attributes represent the width and height of the previous stream or the current one? I agree that if we start exposing things like sampling rate or want to support arbitrary chained Ogg, then there is a problem. I think we already have a problem with width and height for chained Ogg, and we cannot stop people from putting chained Ogg into the @src. I actually took this discussion away from the MPEG PMT, which is where Eric's question came from, because I don't understand how it works with MPEG. But I can see that it's not just a problem of MPEG, but also of Ogg (and possibly of WebM, which can have multiple Segments). So I think we need a generic solution for it. Cheers, Silvia.
Re: [whatwg] Video feedback
On Wed, 08 Jun 2011 12:35:24 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: On Wed, Jun 8, 2011 at 6:14 PM, Philip Jägenstedt phil...@opera.com wrote: On Wed, 08 Jun 2011 02:46:15 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: That is all correct. However, because it is a sequence of Ogg streams, there are new Ogg headers in the middle. These new Ogg headers will lead to new metadata loaded in the media framework - e.g. because the new Ogg stream is encoded with a different audio sampling rate and a different video width/height etc. So, therefore, the metadata in the media framework changes. However, what the browser reports to the JS developer doesn't change. Or if it does change, the JS developer is not informed of it because it is a single infinite audio (or video) stream. Thus the question whether we need a new metadatachange event to expose this to the JS developer. It would then also signify that potentially the number of tracks that are available may have changed and other such information. Nothing exposed via the current API would change, AFAICT. Thus, after a change mid-stream to, say, a smaller video width and height, would the video.videoWidth and video.videoHeight attributes represent the width and height of the previous stream or the current one? I agree that if we start exposing things like sampling rate or want to support arbitrary chained Ogg, then there is a problem. I think we already have a problem with width and height for chained Ogg and we cannot stop people from putting chained Ogg into the @src. I actually took this discussion away from MPEG PTM, which is where Eric's question came from, because I don't understand how it works with MPEG. But I can see that it's not just a problem of MPEG, but also of Ogg (and possibly of WebM which can have multiple Segments). So, I think we need a generic solution for it. OK, I don't think we disagree. I'm just saying that for Icecast audio streams, there is no problem. As for Ogg and WebM, I'm inclined to say that we just shouldn't support that, unless there's some compelling use case for it. There's also the option of tweaking the muxers so that all the streams are known up-front, even if there won't be any data arriving for them until half-way through the file. I also know nothing about MPEG or the use cases involved, so no opinions there. -- Philip Jägenstedt Core Developer Opera Software
Re: [whatwg] Video feedback
On Wed, Jun 8, 2011 at 9:18 PM, Philip Jägenstedt phil...@opera.com wrote: On Wed, 08 Jun 2011 12:35:24 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: On Wed, Jun 8, 2011 at 6:14 PM, Philip Jägenstedt phil...@opera.com wrote: On Wed, 08 Jun 2011 02:46:15 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: That is all correct. However, because it is a sequence of Ogg streams, there are new Ogg headers in the middle. These new Ogg headers will lead to new metadata loaded in the media framework - e.g. because the new Ogg stream is encoded with a different audio sampling rate and a different video width/height etc. So, therefore, the metadata in the media framework changes. However, what the browser reports to the JS developer doesn't change. Or if it does change, the JS developer is not informed of it because it is a single infinite audio (or video) stream. Thus the question whether we need a new metadatachange event to expose this to the JS developer. It would then also signify that potentially the number of tracks that are available may have changed and other such information. Nothing exposed via the current API would change, AFAICT. Thus, after a change mid-stream to, say, a smaller video width and height, would the video.videoWidth and video.videoHeight attributes represent the width and height of the previous stream or the current one? I agree that if we start exposing things like sampling rate or want to support arbitrary chained Ogg, then there is a problem. I think we already have a problem with width and height for chained Ogg and we cannot stop people from putting chained Ogg into the @src. I actually took this discussion away from MPEG PTM, which is where Eric's question came from, because I don't understand how it works with MPEG. But I can see that it's not just a problem of MPEG, but also of Ogg (and possibly of WebM which can have multiple Segments). So, I think we need a generic solution for it. OK, I don't think we disagree. I'm just saying that for Icecast audio streams, there is no problem. Hmm.. because there is nothing in the API that actually exposes audio metadata? As for Ogg and WebM, I'm inclined to say that we just shouldn't support that, unless there's some compelling use case for it. You know that you can also transmit video with icecast...? Silvia. There's also the option of tweaking the muxers so that all the streams are known up-front, even if there won't be any data arriving for them until half-way through the file. I also know nothing about MPEG or the use cases involved, so no opinions there. -- Philip Jägenstedt Core Developer Opera Software
Re: [whatwg] Video feedback
On Wed, 08 Jun 2011 13:38:18 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: On Wed, Jun 8, 2011 at 9:18 PM, Philip Jägenstedt phil...@opera.com wrote: On Wed, 08 Jun 2011 12:35:24 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: On Wed, Jun 8, 2011 at 6:14 PM, Philip Jägenstedt phil...@opera.com wrote: On Wed, 08 Jun 2011 02:46:15 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: That is all correct. However, because it is a sequence of Ogg streams, there are new Ogg headers in the middle. These new Ogg headers will lead to new metadata loaded in the media framework - e.g. because the new Ogg stream is encoded with a different audio sampling rate and a different video width/height etc. So, therefore, the metadata in the media framework changes. However, what the browser reports to the JS developer doesn't change. Or if it does change, the JS developer is not informed of it because it is a single infinite audio (or video) stream. Thus the question whether we need a new metadatachange event to expose this to the JS developer. It would then also signify that potentially the number of tracks that are available may have changed and other such information. Nothing exposed via the current API would change, AFAICT. Thus, after a change mid-stream to, say, a smaller video width and height, would the video.videoWidth and video.videoHeight attributes represent the width and height of the previous stream or the current one? I agree that if we start exposing things like sampling rate or want to support arbitrary chained Ogg, then there is a problem. I think we already have a problem with width and height for chained Ogg and we cannot stop people from putting chained Ogg into the @src. I actually took this discussion away from MPEG PTM, which is where Eric's question came from, because I don't understand how it works with MPEG. But I can see that it's not just a problem of MPEG, but also of Ogg (and possibly of WebM which can have multiple Segments). So, I think we need a generic solution for it. OK, I don't think we disagree. I'm just saying that for Icecast audio streams, there is no problem. Hmm.. because there is nothing in the API that actually exposes audio metadata? Yes. As for Ogg and WebM, I'm inclined to say that we just shouldn't support that, unless there's some compelling use case for it. You know that you can also transmit video with icecast...? Nope :) I guess that invalidates everything I've said about Icecast. Practically, though, no one is using Icecast to mix audio tracks with audio+video tracks and getting upset that it doesn't work in browsers, right? -- Philip Jägenstedt Core Developer Opera Software
Re: [whatwg] Video feedback
On Jun 8, 2011, at 3:35 AM, Silvia Pfeiffer wrote: Nothing exposed via the current API would change, AFAICT. Thus, after a change mid-stream to, say, a smaller video width and height, would the video.videoWidth and video.videoHeight attributes represent the width and height of the previous stream or the current one? I agree that if we start exposing things like sampling rate or want to support arbitrary chained Ogg, then there is a problem. I think we already have a problem with width and height for chained Ogg and we cannot stop people from putting chained Ogg into the @src. I actually took this discussion away from MPEG PTM, which is where Eric's question came from, because I don't understand how it works with MPEG. But I can see that it's not just a problem of MPEG, but also of Ogg (and possibly of WebM which can have multiple Segments). So, I think we need a generic solution for it. The characteristics of an Apple HTTP live stream can change on the fly. For example if the user's bandwidth to the streaming server changes, the video width and height can change as the stream resolution is switched up or down, or the number of tracks can change when a stream switches from video+audio to audio only. In addition, a server can insert segments with different characteristics into a stream on the fly, eg. inserting an ad or emergency announcement. It is not possible to predict these changes before they occur. eric
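The dimension half of this was later covered by firing a resize event at the media element whenever its intrinsic width/height change; a sketch, assuming an implementation that does this:

  var v = document.querySelector('video');
  v.addEventListener('resize', function () {
    // Fires when the stream switches to a rendition with a different
    // intrinsic size, e.g. after an adaptive bitrate switch.
    console.log('now ' + v.videoWidth + 'x' + v.videoHeight);
  });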
Re: [whatwg] Video feedback
-Original Message- From: whatwg-boun...@lists.whatwg.org [mailto:whatwg- boun...@lists.whatwg.org] On Behalf Of Eric Carlson Sent: Wednesday, June 08, 2011 9:34 AM To: Silvia Pfeiffer; Philip Jägenstedt Cc: whatwg@lists.whatwg.org Subject: Re: [whatwg] Video feedback On Jun 8, 2011, at 3:35 AM, Silvia Pfeiffer wrote: Nothing exposed via the current API would change, AFAICT. Thus, after a change mid-stream to, say, a smaller video width and height, would the video.videoWidth and video.videoHeight attributes represent the width and height of the previous stream or the current one? I agree that if we start exposing things like sampling rate or want to support arbitrary chained Ogg, then there is a problem. I think we already have a problem with width and height for chained Ogg and we cannot stop people from putting chained Ogg into the @src. I actually took this discussion away from MPEG PTM, which is where Eric's question came from, because I don't understand how it works with MPEG. But I can see that it's not just a problem of MPEG, but also of Ogg (and possibly of WebM which can have multiple Segments). So, I think we need a generic solution for it. The characteristics of an Apple HTTP live stream can change on the fly. For example if the user's bandwidth to the streaming server changes, the video width and height can change as the stream resolution is switched up or down, or the number of tracks can change when a stream switches from video+audio to audio only. In addition, a server can insert segments with different characteristics into a stream on the fly, eg. inserting an ad or emergency announcement. It is not possible to predict these changes before they occur. eric For commercial video providers, the tracks in a live stream change all the time; this is not limited to audio and video tracks but would include text tracks as well. Bob Lund
Re: [whatwg] Video feedback
On Thu, Jun 9, 2011 at 1:57 AM, Bob Lund b.l...@cablelabs.com wrote: -Original Message- From: whatwg-boun...@lists.whatwg.org [mailto:whatwg- boun...@lists.whatwg.org] On Behalf Of Eric Carlson Sent: Wednesday, June 08, 2011 9:34 AM To: Silvia Pfeiffer; Philip Jägenstedt Cc: whatwg@lists.whatwg.org Subject: Re: [whatwg] Video feedback On Jun 8, 2011, at 3:35 AM, Silvia Pfeiffer wrote: Nothing exposed via the current API would change, AFAICT. Thus, after a change mid-stream to, say, a smaller video width and height, would the video.videoWidth and video.videoHeight attributes represent the width and height of the previous stream or the current one? I agree that if we start exposing things like sampling rate or want to support arbitrary chained Ogg, then there is a problem. I think we already have a problem with width and height for chained Ogg and we cannot stop people from putting chained Ogg into the @src. I actually took this discussion away from MPEG PTM, which is where Eric's question came from, because I don't understand how it works with MPEG. But I can see that it's not just a problem of MPEG, but also of Ogg (and possibly of WebM which can have multiple Segments). So, I think we need a generic solution for it. The characteristics of an Apple HTTP live stream can change on the fly. For example if the user's bandwidth to the streaming server changes, the video width and height can change as the stream resolution is switched up or down, or the number of tracks can change when a stream switches from video+audio to audio only. In addition, a server can insert segments with different characteristics into a stream on the fly, eg. inserting an ad or emergency announcement. It is not possible to predict these changes before they occur. eric For commercial video providers, the tracks in a live stream change all the time; this is not limited to audio and video tracks but would include text tracks as well. OK, all this indicates to me that we probably want a metadatachanged event to indicate there has been a change and that JS may need to check some of its assumptions. Silvia.
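A sketch of how such an event might be consumed; note that 'metadatachange' here is hypothetical and was never added to the spec:

  var v = document.querySelector('video');
  v.addEventListener('metadatachange', function () { // hypothetical event
    // Re-check everything that was cached at loadedmetadata time:
    // track constitution, intrinsic size, duration, and so on.
    console.log(v.videoWidth, v.videoHeight, v.duration);
  });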
Re: [whatwg] Video feedback
On Sat, 04 Jun 2011 03:39:58 +0200, Silvia Pfeiffer silviapfeiff...@gmail.com wrote: On Fri, Jun 3, 2011 at 9:28 AM, Ian Hickson i...@hixie.ch wrote: On Thu, 16 Dec 2010, Silvia Pfeiffer wrote: I do not know how technically the change of stream composition works in MPEG, but in Ogg we have to end a current stream and start a new one to switch compositions. This has been called sequential multiplexing or chaining. In this case, stream setup information is repeated, which would probably lead to creating a new steam handler and possibly a new firing of loadedmetadata. I am not sure how chaining is implemented in browsers. Per spec, chaining isn't currently supported. The closest thing I can find in the spec to this situation is handling a non-fatal error, which causes the unexpected content to be ignored. On Fri, 17 Dec 2010, Eric Winkelman wrote: The short answer for changing stream composition is that there is a Program Map Table (PMT) that is repeated every 100 milliseconds and describes the content of the stream. Depending on the programming, the stream's composition could change entering/exiting every advertisement. If this is something that browser vendors want to support, I can specify how to handle it. Anyone? Icecast streams have chained files, so streaming Ogg to an audio element would hit this problem. There is a bug in FF for this: https://bugzilla.mozilla.org/show_bug.cgi?id=455165 (and a duplicate bug at https://bugzilla.mozilla.org/show_bug.cgi?id=611519). There's also a webkit bug for icecast streaming, which is probably related https://bugs.webkit.org/show_bug.cgi?id=42750 . I'm not sure how Opera is able to deal with icecast streams, but it seems to deal with it. The thing is: you can implement playback and seeking without any further changes to the spec. But then the browser-internal metadata states will change depending on the chunk you're on. Should that also update the exposed metadata in the API then? Probably yes, because otherwise the JS developer may deal with contradictory information. Maybe we need a metadatachange event for this? An Icecast stream is conceptually just one infinite audio stream, even though at the container level it is several chained Ogg streams. duration will be Infinity and currentTime will be constantly increasing. This doesn't seem to be a case where any spec change is needed. Am I missing something? -- Philip Jägenstedt Core Developer Opera Software
Re: [whatwg] Video feedback
On Fri, 03 Jun 2011 01:28:45 +0200, Ian Hickson i...@hixie.ch wrote: On Fri, 22 Oct 2010, Simon Pieters wrote: Actually it was me, but that's OK :) There was also some discussion about metadata. Language is sometimes necessary for the font engine to pick the right glyph. Could you elaborate on this? My assumption was that we'd just use CSS, which doesn't rely on language for this. It's not in any spec that I'm aware of, but some browsers (including Opera) pick different glyphs depending on the language of the text, which really helps when rendering CJK when you have several CJK fonts on the system. Browsers will already know the language from track srclang, so this would be for external players. How is this problem solved in SRT players today? Not at all, it seems. Both VLC and Totem allow setting the character encoding and font used for subtitles in the (global) preferences menu, so presumably you would change that if the default doesn't work. Font switching seems to mainly be an issue when your system has other default fonts than the text you're reading, and it appears that is rare enough that very little software does anything about it, browsers perhaps being an exception. On Mon, 3 Jan 2011, Philip Jägenstedt wrote: * The bad cue handling is stricter than it should be. After collecting an id, the next line must be a timestamp line. Otherwise, we skip everything until a blank line, so in the following the parser would jump to bad cue on line 2 and skip the whole cue:

1
2
00:00:00.000 --> 00:00:01.000
Bla

This doesn't match what most existing SRT parsers do, as they simply look for timing lines and ignore everything else. If we really need to collect the id instead of ignoring it like everyone else, this should be more robust, so that a valid timing line always begins a new cue. Personally, I'd prefer if it were simply ignored and we used some form of in-cue markup for styling hooks. The IDs are useful for referencing cues from script, so I haven't removed them. I've also left the parsing as is for when neither the first nor second line is a timing line, since that gives us a lot of headroom for future extensions (we can do anything so long as the second line doesn't start with a timestamp and "-->" and another timestamp). In the case of feeding future extensions to current parsers, it's way better fallback behavior to simply ignore the unrecognized second line than to discard the entire cue. The current behavior seems unnecessarily strict and makes the parser more complicated than it needs to be. My preference is to just ignore anything preceding the timing line, but even if we must have IDs it can still be made simpler and more robust than what is currently spec'ed. If we just ignore content until we hit a line that happens to look like a timing line, then we are much more constrained in what we can do in the future. For example, we couldn't introduce a comment block syntax, since any comment containing a timing line wouldn't be ignored. On the other hand, if we keep the syntax as it is now, we can introduce a comment block just by having its first line include a "-->" but not have it match the timestamp syntax, e.g. by having it be "--> COMMENT" or some such. One of us must be confused; do you mean something like this?

1
--> COMMENT
00:00.000 --> 00:01.000
Cue text

Adding this syntax would break the *current* parser, as it would fail in step 39 (Collect WebVTT cue timings and settings) and then skip the rest of the cue. 
If we want any room for extensions along these lines, then multiple lines preceding the timing line must be handled gracefully. Looking at the parser more closely, I don't really see how doing anything more complex than skipping the block entirely would be simpler than what we have now, anyway. I suggest:

* Step 31: Try to collect WebVTT cue timings and settings instead of checking for the substring "-->". If it succeeds, jump to what is now step 40. If it fails, continue at what is now step 32. (This allows adding any syntax as long as it doesn't exactly match a timing line, including "--> COMMENT". As a bonus, one can fail faster when trying to parse an entire timing line rather than doing a substring search for "-->".)

* Step 32: Only set the id line if it's not already set. (Assuming we want the first line to be the id line in future extensions.)

* Step 39: Jump to the new step 31.

In case not every detail is correct, the idea is to first try to match a timing line and to take the first line that is not a timing line (if any) as the id, leaving everything in between open for future syntax changes, even if they use "-->". I think it's fairly important that we handle this. Doubled id lines are an easy mistake to make when copying things around. Silently dropping those cues would be worse than what many existing (line-based, id-ignoring) SRT parsers do. On Sat, 22 Jan 2011,
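A sketch of the proposed strategy in script form; this illustrates the idea rather than the spec's actual parser, and the timing-line pattern is simplified:

  // True if the line parses as a WebVTT timing line (settings ignored).
  function isTimingLine(line) {
    return /^(\d+:)?\d{2}:\d{2}\.\d{3}\s+-->\s+(\d+:)?\d{2}:\d{2}\.\d{3}/.test(line);
  }

  // Given the lines of one cue block, take the first non-timing line as
  // the id and skip any other pre-timing lines, leaving them open for
  // future syntax (comment blocks, doubled ids, etc.).
  function parseCueBlock(lines) {
    var id = null;
    for (var i = 0; i < lines.length; i++) {
      if (isTimingLine(lines[i])) {
        return { id: id, timing: lines[i], text: lines.slice(i + 1).join('\n') };
      }
      if (id === null) id = lines[i];
    }
    return null; // no timing line at all: not a cue
  }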
Re: [whatwg] Video feedback
I'll be replying to WebVTT related stuff in a separate thread. Here just feedback on the other stuff. (Incidentally: why is there details element feedback in here with video? I don't really understand the connection.) On Fri, Jun 3, 2011 at 9:28 AM, Ian Hickson i...@hixie.ch wrote: On Thu, 16 Dec 2010, Silvia Pfeiffer wrote: I do not know how technically the change of stream composition works in MPEG, but in Ogg we have to end a current stream and start a new one to switch compositions. This has been called sequential multiplexing or chaining. In this case, stream setup information is repeated, which would probably lead to creating a new stream handler and possibly a new firing of loadedmetadata. I am not sure how chaining is implemented in browsers. Per spec, chaining isn't currently supported. The closest thing I can find in the spec to this situation is handling a non-fatal error, which causes the unexpected content to be ignored. On Fri, 17 Dec 2010, Eric Winkelman wrote: The short answer for changing stream composition is that there is a Program Map Table (PMT) that is repeated every 100 milliseconds and describes the content of the stream. Depending on the programming, the stream's composition could change entering/exiting every advertisement. If this is something that browser vendors want to support, I can specify how to handle it. Anyone? Icecast streams have chained files, so streaming Ogg to an audio element would hit this problem. There is a bug in FF for this: https://bugzilla.mozilla.org/show_bug.cgi?id=455165 (and a duplicate bug at https://bugzilla.mozilla.org/show_bug.cgi?id=611519). There's also a webkit bug for icecast streaming, which is probably related https://bugs.webkit.org/show_bug.cgi?id=42750 . I'm not sure how Opera is able to deal with icecast streams, but it seems to deal with it. The thing is: you can implement playback and seeking without any further changes to the spec. But then the browser-internal metadata states will change depending on the chunk you're on. Should that also update the exposed metadata in the API then? Probably yes, because otherwise the JS developer may deal with contradictory information. Maybe we need a metadatachange event for this? On Tue, 24 May 2011, Silvia Pfeiffer wrote: Ian and I had a brief conversation recently where I mentioned a problem with extended text descriptions with screen readers (and worse still with braille devices), and the suggestion was that the "paused for user interaction" state of a media element may be the solution. I would like to pick this up and discuss in detail how that would work to confirm my sketchy understanding. *The use case:* In the specification for media elements we have a track kind of "descriptions", which are: Textual descriptions of the video component of the media resource, intended for audio synthesis when the visual component is unavailable (e.g. because the user is interacting with the application without a screen while driving, or because the user is blind). Synthesized as a separate audio track. I'm for now assuming that the synthesis will be done through a screen reader and not through the browser itself, thus making the descriptions available to users as synthesized audio or as braille if the screen reader is set up for a braille device. The textual descriptions are provided as chunks of text with a start and an end time (so-called cues). The cues are processed during video playback as the video's playback time starts to fall within the time frame of the cue. 
Thus, it is expected that the cues are consumed during the cue's time frame and are not present any more when the end time of the cue is reached, so they don't conflict with the video's normal audio. However, on many occasions, it is not possible to consume the cue text in the given time frame. In particular not in the following situations:

1. The screen reader takes longer to read out the cue text than the cue's time frame provides for. This is particularly the case with long cue text, but also when the screen reader's reading rate is slower than what the author of the cue text expected.

2. The braille device is used for reading. Since reading braille is much slower than listening to read-out text, the cue time frame will invariably be too short.

3. The user seeked right into the middle of a cue and thus the time frame that is available for reading out the cue text is shorter than the cue author calculated with.

Correct me if I'm wrong, but it seems that what we need is a way for the screen reader to pause the video element from continuing to play while the screen reader is still busy delivering the cue text. (In a11y talk: what is required is a means to deal with extended descriptions, which extend the timeline of the video.) Once it's finished presenting, it can resume the video element's playback. Is it a requirement that the user be able to use the regular
Re: [whatwg] Video feedback
On Thu, Jun 2, 2011 at 7:28 PM, Ian Hickson i...@hixie.ch wrote: We can add comments pretty easily (e.g. we could say that '<!' starts a comment and '>' ends it -- that's already being ignored by the current parser), if people really need them. But are comments really that useful? Did SRT have problems due to not supporting inline comments? (Or did it support inline comments?) I've only worked with SSA subtitles (fansubbing), where {text in braces} effectively worked as a comment. We used them a lot to communicate between editors on a phrase-by-phrase basis. But for that use case, using hidden spans makes more sense, since you can toggle them on and off to view them inline, etc. Given that, I'd be fine with a comment format that doesn't allow mid-cue comments, if it makes the format simpler. The text on the left is a transcription, the top is a transliteration, and the bottom is a translation. Aren't these three separate text tracks? They're all in the same track, in practice, since media players don't play multiple subtitle tracks. It's true that having them in separate tracks would be better, so they can be disabled individually. This is probably rare enough that it should just be sorted out with scripts, at least to start. It's not clear to me that we need language information to apply proper font selection and word wrapping, since CSS doesn't do it. But it doesn't have to, since HTML does this with @lang. Mixing one CJK language with one non-CJK language seems fine. That should always work, assuming you specify good fonts in the CSS. The font is ultimately in the user's control. I tell Firefox to always use Tahoma for Western text and MS Gothic for Japanese text, ignoring the often ugly site-specified fonts. The only control sites have over my fonts is the language they say the text is (or which the whole page is detected as). The same principle seems to apply for captions. (That's not to say that it's important enough to add yet and I'm fine with punting on this, at least for now. I just don't think specifying fonts is the right solution.) The most straightforward solution would seem to be having @lang be a CSS property; I don't know the rationale for this being done by HTML instead. I don't understand why we can't have good typography for CJK and non-CJK together. Surely there are fonts that get both right? I've never seen a Japanese font that didn't look terrible for English text. Also, I don't want my font selection to be severely limited due to the need to use a single font for both languages, instead of using the right font for the right text. One example of how this can be tricky: at 0:17, a caption on the bottom wraps and takes two lines, which then pushes the line at 0:19 upward (that part's simple enough). If instead the top part had appeared first, the renderer would need to figure out in advance to push it upwards, to make space for the two-line caption underneath it. Otherwise, the captions would be forced to switch places. Right, without lookahead I don't know how you'd solve it. With lookahead things get pretty dicey pretty quickly. The problem is that, at least here, the whole scene is nearly incomprehensible if the top/bottom arrangement isn't maintained. Lacking anything better, I suspect authors would use similar brittle hacks with WebVTT. Anyway, I don't have a simple solution either. I think that, no matter what you do, people will insert line breaks in cues. 
I'd follow the HTML model here: convert newlines to spaces and have a separate, explicit line break like <br> if needed, so people don't manually line-break unless they actually mean to. The line-breaks-are-line-breaks feature is one of the features that originally made SRT seem like a good idea. It still seems like the neatest way of having a line break. But does this matter? Line breaks within a cue are relatively uncommon in my experience (perhaps it's different for other languages), compared to how many people will insert line breaks in a text editor simply to break lines while authoring. If you do this while testing on a large monitor, it's likely to look reasonable when rendered; the brokenness won't show up until it's played in a smaller window. Anyone using a non-programmer's text editor that doesn't handle long lines cleanly is likely to do this. Wrapping lines manually in SRTs also appears to be common (even standard) practice, perhaps due to inadequate line wrapping in SRT renderers. Making line breaks explicit should help keep people from translating this habit to WebVTT. Related to line breaking, should there be an &nbsp; escape? Inserting NBSP characters literally into files is somewhat annoying for authoring, since they're indistinguishable from regular spaces. How common would &nbsp; be? I guess the main cases I've used &nbsp; for don't apply so much to captions, eg. ©&nbsp;2011 (likely to come at the start of a caption, so not likely to be wrapped anyway). We
Re: [whatwg] video feedback
On Feb 9, 2010, at 9:03 PM, Ian Hickson wrote: On Sat, 31 Oct 2009, Brian Campbell wrote: As a multimedia developer, I am wondering about the purpose of the timeupdate event on media elements. Its primary use is keeping the UIs updated (specifically the timers and the scrubber bars). On first glance, it would appear that this event would be useful for synchronizing animations, bullets, captions, UI, and the like. Synchronising accompanying slides and animations won't work that well with an event, since you can't guarantee the timing of the event or anything like that. For anything where we want reliable synchronisation of multiple media, I think we need a more serious solution -- either something like SMIL, or the SMIL subset found in SVG, or some other solution. Yes, but that doesn't exist at the moment, so our current choices are to use timeupdate and to use setInterval(). At 4 timeupdate events per second, it isn't all that useful. I can replace it with setInterval, at whatever rate I want, query the time, and get the synchronization I need, but that makes the timeupdate event seem to be redundant. The important thing with timeupdate is that it also fires whenever the time changes in a significant way, e.g. immediately after a seek, or when reaching the end of the resource, etc. Also, the user agent can start lowering the rate in the face of high CPU load, which makes it more user-friendly than setInterval(). I agree, it is important to be able to reduce the rate in the face of high CPU load, but as currently implemented in WebKit, if you use timeupdate to keep anything in sync with the video, it feels fairly laggy and jerky. This means that for higher quality synchronization, you need to use setInterval, which defeats the purpose of making timeupdate more user friendly. Perhaps this is just a bug I should file against WebKit, as they are choosing an update interval at the extreme end of the allowed range for their default behavior; but I figured that it might make sense to mention a reasonable default value (such as 30 times per second, or once per frame displayed) in the spec, to give some guidance to browser vendors about what authors will be expecting. On Thu, 5 Nov 2009, Brian Campbell wrote: Would something like video firing events for every frame rendered help you out? This would also help fix the canvas over/under painting issue and improve synchronization. Yes, this would be considerably better than what is currently specced. There surely is a better solution than copying data from the video element to a canvas on every frame, for whatever problem that solves. What is the actual use case where you'd do that? This was not my use case (my use case was just synchronizing bullets, slide transitions, and animations to video), but an example I can think of is using this to composite video. Most (if not all) video formats supported by video in the various browsers do not store alpha channel information. In order to composite video against a dynamic background, authors may copy video data to a canvas, then paint transparent to all pixels matching a given color. This use case would clearly be better served by video formats that include alpha information, and implementations that support compositing video over other content, but given that we're having trouble finding any video format at all that the browsers can agree on, this seems to be a long way off, so stop-gap measures may be useful in the interim. 
Compositing video over dynamic content is actually an extremely important use case for rich, interactive multimedia, which I would like to encourage browser vendors to implement, but I'm not even sure where to start, given the situation on formats and codecs. I believe I've seen this discussed in Theora, but never went anywhere, and I don't have any idea how I'd even start getting involved in the MPEG standardization process. On Thu, 5 Nov 2009, Andrew Scherkus wrote: I'll see if we can do something for WebKit based browsers, because today it literally is hardcoded to 250ms for all ports. http://trac.webkit.org/browser/trunk/WebCore/html/HTMLMediaElement.cpp#L1254 Maybe we'll end up firing events based on frame updates for video, and something arbitrary for audio (as it is today). I strongly recommend making the ontimeupdate rate be sensitive to system load, and no faster than one frame per second. I'm assuming that you mean no faster than once per frame? On Fri, 6 Nov 2009, Philip Jägenstedt wrote: We've considered firing it for each frame, but there is one problem. If people expect that it fires once per frame they will probably write scripts which do frame-based animations by moving things n pixels per frame or similar. Some animations are just easier to do this way, so there's no reason to think that people won't do it. This will break horribly if a browser is
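For reference, a sketch of the two synchronisation approaches being compared in this exchange; the cue list is invented for illustration, and backward seeks are ignored for brevity:

  var v = document.querySelector('video');
  var cues = [{ time: 1.0, text: 'first bullet' },
              { time: 3.5, text: 'second bullet' }];
  var next = 0;

  function sync() {
    while (next < cues.length && v.currentTime >= cues[next].time) {
      console.log(cues[next++].text); // show bullet, start animation, etc.
    }
  }

  // Coarse but load-friendly: the firing rate is implementation-defined.
  v.addEventListener('timeupdate', sync);
  // The finer-grained alternative discussed above: poll on a timer.
  // setInterval(sync, 33);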
Re: [whatwg] video feedback
On Feb 10, 2010, at 8:01 AM, Brian Campbell wrote: On Feb 9, 2010, at 9:03 PM, Ian Hickson wrote: On Sat, 31 Oct 2009, Brian Campbell wrote: At 4 timeupdate events per second, it isn't all that useful. I can replace it with setInterval, at whatever rate I want, query the time, and get the synchronization I need, but that makes the timeupdate event seem to be redundant. The important thing with timeupdate is that it also fires whenever the time changes in a significant way, e.g. immediately after a seek, or when reaching the end of the resource, etc. Also, the user agent can start lowering the rate in the face of high CPU load, which makes it more user-friendly than setInterval(). I agree, it is important to be able to reduce the rate in the face of high CPU load, but as currently implemented in WebKit, if you use timeupdate to keep anything in sync with the video, it feels fairly laggy and jerky. This means that for higher quality synchronization, you need to use setInterval, which defeats the purpose of making timeupdate more user friendly. Perhaps this is just a bug I should file against WebKit, as they are choosing an update interval at the extreme end of the allowed range for their default behavior; but I figured that it might make sense to mention a reasonable default value (such as 30 times per second, or once per frame displayed) in the spec, to give some guidance to browser vendors about what authors will be expecting. I disagree that 30 times per second is a reasonable default. I understand that it would be useful for what you want to do, but your use case is not typical. I think most pages won't listen for 'timeupdate' events at all, so instead of making every page incur the extra overhead of waking up, allocating, queueing, and firing an event 30 times per second, WebKit sticks with the minimum frequency the spec mandates, figuring that people like you who need something more can roll their own. On Thu, 5 Nov 2009, Brian Campbell wrote: Would something like video firing events for every frame rendered help you out? This would also help fix the canvas over/under painting issue and improve synchronization. Yes, this would be considerably better than what is currently specced. There surely is a better solution than copying data from the video element to a canvas on every frame, for whatever problem that solves. What is the actual use case where you'd do that? This was not my use case (my use case was just synchronizing bullets, slide transitions, and animations to video), but an example I can think of is using this to composite video. Most (if not all) video formats supported by video in the various browsers do not store alpha channel information. In order to composite video against a dynamic background, authors may copy video data to a canvas, then paint transparent to all pixels matching a given color. This use case would clearly be better served by video formats that include alpha information, and implementations that support compositing video over other content, but given that we're having trouble finding any video format at all that the browsers can agree on, this seems to be a long way off, so stop-gap measures may be useful in the interim. Compositing video over dynamic content is actually an extremely important use case for rich, interactive multimedia, which I would like to encourage browser vendors to implement, but I'm not even sure where to start, given the situation on formats and codecs. 
I believe I've seen this discussed for Theora, but it never went anywhere, and I don't have any idea how I'd even start getting involved in the MPEG standardization process. Have you actually tried this? Rendering video frames to a canvas and processing every pixel from script is *extremely* processor intensive; you are unlikely to get a reasonable frame rate. H.264 does support alpha (see the AVC spec, 2nd edition, section 7.3.2.1.2, Sequence parameter set extension), but we do not support it correctly in WebKit at the moment. *Please* file bugs against WebKit if you would like to see this properly supported. QuickTime movies support alpha for a number of video formats (eg. PNG, Animation, Lossless, etc), you might give that a try. eric
Re: [whatwg] video feedback
On 2/10/10 1:37 PM, Eric Carlson wrote: Have you actually tried this? Rendering video frames to a canvas and processing every pixel from script is *extremely* processor intensive, you are unlikely to get reasonable frame rate. There's a demo that does just this at http://people.mozilla.com/~prouget/demos/green/green.xhtml -Boris
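A minimal sketch of the technique such demos use; the threshold values are arbitrary and a real key needs tuning (and the video must be same-origin for getImageData to work):

  var v = document.querySelector('video');
  var c = document.querySelector('canvas');
  var ctx = c.getContext('2d');

  function paint() {
    if (v.paused || v.ended) return;
    ctx.drawImage(v, 0, 0, c.width, c.height);
    var frame = ctx.getImageData(0, 0, c.width, c.height);
    var d = frame.data; // RGBA bytes
    for (var i = 0; i < d.length; i += 4) {
      // Crude green-screen test on r/g/b at d[i], d[i+1], d[i+2].
      if (d[i] < 100 && d[i + 1] > 150 && d[i + 2] < 100) {
        d[i + 3] = 0; // make the pixel transparent
      }
    }
    ctx.putImageData(frame, 0, 0);
    setTimeout(paint, 16); // roughly once per frame
  }
  v.addEventListener('play', paint);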
Re: [whatwg] video feedback
On Feb 10, 2010, at 1:37 PM, Eric Carlson wrote: On Feb 10, 2010, at 8:01 AM, Brian Campbell wrote: On Feb 9, 2010, at 9:03 PM, Ian Hickson wrote: On Sat, 31 Oct 2009, Brian Campbell wrote: At 4 timeupdate events per second, it isn't all that useful. I can replace it with setInterval, at whatever rate I want, query the time, and get the synchronization I need, but that makes the timeupdate event seem to be redundant. The important thing with timeupdate is that it also fires whenever the time changes in a significant way, e.g. immediately after a seek, or when reaching the end of the resource, etc. Also, the user agent can start lowering the rate in the face of high CPU load, which makes it more user-friendly than setInterval(). I agree, it is important to be able to reduce the rate in the face of high CPU load, but as currently implemented in WebKit, if you use timeupdate to keep anything in sync with the video, it feels fairly laggy and jerky. This means that for higher quality synchronization, you need to use setInterval, which defeats the purpose of making timeupdate more user friendly. Perhaps this is just a bug I should file to WebKit, as they are choosing an update interval at the extreme end of the allowed range for their default behavior; but I figured that it might make sense to mention a reasonable default value (such as 30 times per second, or once per frame displayed) in the spec, to give some guidance to browser vendors about what authors will be expecting. I disagree that 30 times per second is a reasonable default. I understand that it would be useful for what you want to do, but your use case is not a typical. I think most pages won't listen for 'timeupdate' events at all so instead of making every page incur the extra overhead of waking up, allocating, queueing, and firing an event 30 times per second, WebKit sticks with the minimum frequency the spec mandates figuring that people like you that need something more can roll their own. Do browsers fire events for which there are no listeners? It seems like it would be easiest to just not fire these events if no one is listening to them. And as Ian pointed out, just basic video UI can be better served by having at least 10 updates per second, if you want to show time at a resolution of tenths of a second. On Thu, 5 Nov 2009, Brian Campbell wrote: Would something like video firing events for every frame rendered help you out? This would help also fix the canvas over/under painting issue and improve synchronization. Yes, this would be considerably better than what is currently specced. There surely is a better solution than copying data from the video element to a canvas on every frame for whatever the problem that that solves is. What is the actual use case where you'd do that? This was not my use case (my use case was just synchronizing bullets, slide transitions, and animations to video), but an example I can think of is using this to composite video. Most (if not all) video formats supported by video in the various browsers do not store alpha channel information. In order to composite video against a dynamic background, authors may copy video data to a canvas, then paint transparent to all pixels matching a given color. 
This use case would clearly be better served by video formats that include alpha information, and implementations that support compositing video over other content, but given that we're having trouble finding any video format at all that the browsers can agree on, this seems to be a long way off, so stop-gap measures may be useful in the interim. Compositing video over dynamic content is actually an extremely important use case for rich, interactive multimedia, which I would like to encourage browser vendors to implement, but I'm not even sure where to start, given the situation on formats and codecs. I believe I've seen this discussed in Theora, but never went anywhere, and I don't have any idea how I'd even start getting involved in the MPEG standardization process. Have you actually tried this? Rendering video frames to a canvas and processing every pixel from script is *extremely* processor intensive, you are unlikely to get reasonable frame rate. Mozilla has a demo of this working, in Firefox only: https://developer.mozilla.org/samples/video/chroma-key/index.xhtml But no, this isn't something I would consider to be production quality. But perhaps if the WebGL typed arrays catch on, and start being used in more places, you might be able to start doing this with reasonable performance. The H.262 does support alpha (see AVC spec 2nd edition, section 7.3.2.1.2 Sequence parameter set extension), but we do not support it correctly in WebKit at the moment. *Please* file bugs against WebKit if you would like to see this properly supported. QuickTime movies support alpha for
Re: [whatwg] video feedback
On 2/10/10 2:19 PM, Brian Campbell wrote: Do browsers fire events for which there are no listeners? It varies. Gecko, for example, fires image load events no matter what, but only fires mutation events if there are listeners. -Boris
Re: [whatwg] video feedback
On Wed, Feb 10, 2010 at 11:29 AM, Boris Zbarsky bzbar...@mit.edu wrote: On 2/10/10 2:19 PM, Brian Campbell wrote: Do browsers fire events for which there are no listeners? It varies. Gecko, for example, fires image load events no matter what, but only fires mutation events if there are listeners. However, checking for listeners has a non-trivial cost. You have to walk the full parentNode chain and see if any of the parents has a listener. This applies to both bubbling and non-bubbling events due to the capture phase. Also, a feature which requires implementations to optimize for the feature not being used seems like a questionable feature to me. We want people to use the stuff we're creating; there's little point otherwise. / Jonas
Re: [whatwg] video feedback
On Thu, Feb 11, 2010 at 8:19 AM, Brian Campbell lam...@continuation.orgwrote: But no, this isn't something I would consider to be production quality. But perhaps if the WebGL typed arrays catch on, and start being used in more places, you might be able to start doing this with reasonable performance. With WebGL you could do the chroma-key processing on the GPU, and performance should be excellent. In fact you could probably prototype this today in Firefox. Rob -- He was pierced for our transgressions, he was crushed for our iniquities; the punishment that brought us peace was upon him, and by his wounds we are healed. We all, like sheep, have gone astray, each of us has turned to his own way; and the LORD has laid on him the iniquity of us all. [Isaiah 53:5-6]
Re: [whatwg] video feedback
On Wed, Feb 10, 2010 at 4:37 PM, Robert O'Callahan rob...@ocallahan.org wrote: On Thu, Feb 11, 2010 at 8:19 AM, Brian Campbell lam...@continuation.org wrote: But no, this isn't something I would consider to be production quality. But perhaps if the WebGL typed arrays catch on, and start being used in more places, you might be able to start doing this with reasonable performance. With WebGL you could do the chroma-key processing on the GPU, and performance should be excellent. In fact you could probably prototype this today in Firefox. You're not going to get solid professional-quality keying results just by depending on a client-side keying algorithm, even a computationally expensive one, without the ability to perform manual fixups. Being able to manipulate video data on the client is a powerful tool, but it's not necessarily the right tool for every purpose.
Re: [whatwg] video feedback
On Thu, Feb 11, 2010 at 3:01 AM, Brian Campbell lam...@continuation.org wrote: On Feb 9, 2010, at 9:03 PM, Ian Hickson wrote: On Sat, 7 Nov 2009, Silvia Pfeiffer wrote: I use timeupdate to register a callback that will update captions/subtitles. That's only a temporary situation, though, so it shouldn't inform our decision. We should in due course develop much better solutions for captions and time-synchronised animations. The problem is, due to the slow pace of standards and browser development, we can sometimes be stuck with a temporary feature for many years. How long until enough IE users support HTML6 (or whatever standard includes a time-synchronization feature) for it to be usable? 10, 15 years?

Even when we have a standard means of associating captions/subtitles with audio/video, we will still want to allow for overriding the default presentation and doing it all in JavaScript ourselves. I have just been pointed to a cool lyrics demo at http://svg-wow.org/audio/animated-lyrics.html which uses an audio file and essentially a caption file to display the lyrics in sync in SVG. Problem is: they are using setInterval and setTimeout on the audio, and that breaks synchronisation for me, probably because loading the audio over the network takes longer than no time at all. Honestly, you cannot use setInterval for synchronising with a/v. You really need timeupdate.

Maybe one option for pages that need a higher event firing rate than the browser's default is to introduce a JavaScript API that lets it be set to anything between once per frame (25Hz) and every 250ms (4Hz)? I'm just wary of what it may do to the responsiveness of the browser, and whether the browser could refuse if it knew it would kill performance. Cheers, Silvia.
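For illustration, a sketch of the pattern Silvia recommends: driving the display from the media clock via timeupdate, so a slow network load cannot throw it off. The cue data and element ids here are made up:

    // 'song' and 'lyrics' are hypothetical elements; cue times are made up.
    const audio = document.getElementById('song');
    const display = document.getElementById('lyrics');
    const cues = [            // [startTime, endTime, text], in seconds
      [0.0, 2.5, 'First line of the lyrics'],
      [2.5, 5.0, 'Second line of the lyrics'],
    ];
    audio.addEventListener('timeupdate', function () {
      // currentTime, not a wall-clock timer, decides which cue shows.
      const t = audio.currentTime;
      const cue = cues.find(c => t >= c[0] && t < c[1]);
      display.textContent = cue ? cue[2] : '';
    });

Because the handler reads currentTime, it stays correct even at the lowest timeupdate firing rate; a higher rate would only make cue changes less laggy, never wrong.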
Re: [whatwg] video feedback
On Thu, 26 Mar 2009, Matthew Gregan wrote: At 2009-03-25T10:16:32+0000, Ian Hickson wrote: On Fri, 13 Mar 2009, Matthew Gregan wrote: It's possible that neither a 'play' nor 'playing' event will be fired when a media element that has ended playback is played again. When first played, paused is set to false. When played again, playback has ended, so play() seeks to the beginning, but paused does not change (as it's already false), so the substeps that may fire play or playing are not run.

'playing' should fire, though, since the readyState will have dropped down to HAVE_CURRENT_DATA when the clip is ended, and will drop back up to HAVE_FUTURE_DATA after seeking.

Right, so your intention is to interpret it thusly: readyState becomes HAVE_CURRENT_DATA when playback ends because it's not possible for the playback position to advance any further, and thus it's not possible to have data beyond the current playback position (which HAVE_FUTURE_DATA is predicated upon). Makes sense, but can the spec be made clearer about the behaviour in this case? HAVE_FUTURE_DATA talks about advancing *without reverting to HAVE_METADATA*, which doesn't apply in this case because we have all the data available locally.

Clarified.

Based on that interpretation, when the user sets playbackRate to -1 after playback ends, the readyState would change from HAVE_CURRENT_DATA to HAVE_FUTURE_DATA because the current playback position can now advance.

I've made a bunch of changes to fix how things work when the direction of playback is backwards; there were some odd things in the way it was defined before (for example, the previous definition actually had the playback position go infinitely negative and didn't stop at the start of the clip!).

Following this logic, if playbackRate is set to 0 at any time, the readyState becomes HAVE_ENOUGH_DATA, as advancing the playback position by 0 units means the playback position can never overtake the available data before playback ends. Except this case seems to be specially handled by: The playbackRate can be 0.0, in which case the current playback position doesn't move, despite playback not being paused (paused doesn't become true, and the pause event doesn't fire). ...which uses the term move rather than advance, but suggests that advancing the playback position by 0 isn't considered advancing, which seems logical.

I've clarified the uses of advance that I could find. Let me know if the spec is still ambiguous. Thanks! -- Ian Hickson U+1047E)\._.,--,'``.fL http://ln.hixie.ch/ U+263A/, _.. \ _\ ;`._ ,. Things that are impossible just take longer. `._.-(,_..'--(,_..'`-.;.'
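A small harness for observing the behaviour under discussion; it is purely illustrative and just logs the event order and readyState transitions when an ended clip is replayed:

    const v = document.querySelector('video');
    ['play', 'playing', 'ended', 'seeking', 'seeked'].forEach(function (type) {
      v.addEventListener(type, function () {
        console.log(type, 'readyState=' + v.readyState, 'paused=' + v.paused);
      });
    });
    // Replaying after 'ended': per the interpretation above, 'playing'
    // should fire again (readyState dips to HAVE_CURRENT_DATA and
    // recovers), even though paused never changed back to true.
    v.addEventListener('ended', function () { v.play(); });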
Re: [whatwg] video feedback
At 2009-03-25T10:16:32+0000, Ian Hickson wrote: On Fri, 13 Mar 2009, Matthew Gregan wrote: It's possible that neither a 'play' nor 'playing' event will be fired when a media element that has ended playback is played again. When first played, paused is set to false. When played again, playback has ended, so play() seeks to the beginning, but paused does not change (as it's already false), so the substeps that may fire play or playing are not run.

'playing' should fire, though, since the readyState will have dropped down to HAVE_CURRENT_DATA when the clip is ended, and will drop back up to HAVE_FUTURE_DATA after seeking.

Right, so your intention is to interpret it thusly: readyState becomes HAVE_CURRENT_DATA when playback ends because it's not possible for the playback position to advance any further, and thus it's not possible to have data beyond the current playback position (which HAVE_FUTURE_DATA is predicated upon). Makes sense, but can the spec be made clearer about the behaviour in this case? HAVE_FUTURE_DATA talks about advancing *without reverting to HAVE_METADATA*, which doesn't apply in this case because we have all the data available locally. (Also, note that after the seek it'd return directly to HAVE_ENOUGH_DATA in the case I'm talking about, since the media is fully cached. That still requires a 'playing' event to fire, so that's fine.)

Based on that interpretation, when the user sets playbackRate to -1 after playback ends, the readyState would change from HAVE_CURRENT_DATA to HAVE_FUTURE_DATA because the current playback position can now advance. Following this logic, if playbackRate is set to 0 at any time, the readyState becomes HAVE_ENOUGH_DATA, as advancing the playback position by 0 units means the playback position can never overtake the available data before playback ends. Except this case seems to be specially handled by: The playbackRate can be 0.0, in which case the current playback position doesn't move, despite playback not being paused (paused doesn't become true, and the pause event doesn't fire). ...which uses the term move rather than advance, but suggests that advancing the playback position by 0 isn't considered advancing, which seems logical.

Cheers, -mjg -- Matthew Gregan |/ /|kine...@flim.org
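For reference when reading the exchange above, these are the numeric readyState values HTML defines for media elements:

    // readyState constants on HTMLMediaElement (per the HTML spec):
    // HAVE_NOTHING      = 0  (no information about the media resource yet)
    // HAVE_METADATA     = 1  (duration and dimensions known, no frame data)
    // HAVE_CURRENT_DATA = 2  (data for the current position, none beyond it)
    // HAVE_FUTURE_DATA  = 3  (data beyond the current position; can advance)
    // HAVE_ENOUGH_DATA  = 4  (enough data that playback can likely keep up)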