Buffering of audio data before decoding shouldn't matter. If you can share your log, maybe we can find something.
-Ravi

On May 4, 12:43 pm, kiran <[email protected]> wrote:
> Hi Ravi, I have logged and observed your explanation of seek. It
> works perfectly if the real video stream has AAC audio, but not with
> G2COOK audio. I am using the same parser node as well as the same
> OMX component to handle both audio cases in the real video stream;
> the decoder call, decode(), is selected at runtime to be either AAC
> or COOK.
>
> As I explained in my first post, I wonder whether the problem is
> that, in the COOK case, the OMX component buffers 29 audio packets
> inside the decoder (reporting an output buffer size of 0), decodes
> only after the 30th packet arrives, and then sends the output
> corresponding to all 30 packets in that last decode call. Please
> refer to my first post for a more detailed explanation of the issue.
> Any insight into the issue is really appreciated. Thanks.
>
> On May 2, 8:19 pm, RaviY <[email protected]> wrote:
> > On May 1, 12:11 pm, kiran <[email protected]> wrote:
> > > I am porting Real Media on Android. Please let me know the
> > > following:
> > >
> > > 1) Can I share some global variables between the parser
> > > interface file (fileformats/rm/parser/src/irmff.cpp) and
> > > omx_ra_component.cpp? If so, how can I share them?
> >
> > You might have to use the OSCL singleton. But what is the need
> > for this? We should know the use case before trying to make any
> > changes.
> >
> > > 2) After seek, can I drop audio frames before giving them to
> > > the renderer at media_output_inport, based on the targetNPT of
> > > the seek and the actual PTS of the audio packet received after
> > > the seek?
> > >
> > > Here are the details of the issue while porting:
> > >
> > > I have integrated Real Media (RM) into Eclair. In real media
> > > files, the audio codec can be one of two formats: AAC or G2COOK
> > > (also called Real Media 8 Low Bitrate; it is a constant-bitrate
> > > codec, while AAC is a VBR codec).
> > >
> > > During normal playback, AV sync is fine in all real media
> > > files, whether the audio format is AAC or G2COOK. After a seek,
> > > AV sync is still fine in all files with AAC audio, but there is
> > > an AV sync mismatch in all files with G2COOK audio. Most of the
> > > time, audio is heard ahead of video.
> > >
> > > One difference between the two formats: with AAC, an RM audio
> > > packet carries multiple encoded audio frames and can be decoded
> > > independently. With G2COOK, the RM audio packets are
> > > interleaved by a factor of up to 30 packets, and the decoder
> > > needs to buffer all 30 packets before it can deinterleave them
> > > and decode the encoded frames in each packet.
> > >
> > > The first of these 30 packets is called an audio keyframe. So,
> > > after repositioning (seeking) the file, I need to look for the
> > > closest audio keyframe as well as the closest video keyframe
> > > around the seeked position, and return the PTS of these two
> > > packets to the player upon request.
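A minimal C++ sketch of the buffer-29-then-flush-on-the-30th behaviour
described above (all names, the packet-size bound, and the decode stub
are illustrative assumptions, not the actual omx_ra_component.cpp code):

    // Sketch of the COOK superblock buffering described in this thread.
    #include <cstddef>
    #include <cstdint>
    #include <cstring>

    static const int    kPacketsPerBlock = 30;
    static const size_t kMaxPacketBytes  = 1500;  // assumed upper bound

    struct CookSuperblock {
        uint8_t buf[kPacketsPerBlock * kMaxPacketBytes];
        size_t  used;
        int     packets;
    };

    // Stand-in for the real deinterleave + decode step.
    static size_t DeinterleaveAndDecode(const uint8_t* in, size_t inLen,
                                        uint8_t* out, size_t outCap) {
        (void)in; (void)inLen; (void)out; (void)outCap;
        return 0;  // real code would write decoded PCM and return its size
    }

    // Called once per input packet. Always consumes the packet; returns
    // decoded bytes written to out (0 for the first 29 packets).
    static size_t OnCookPacket(CookSuperblock* s,
                               const uint8_t* pkt, size_t pktLen,
                               uint8_t* out, size_t outCap) {
        if (s->used + pktLen > sizeof(s->buf))
            return 0;  // overflow guard; real code would signal an error
        memcpy(s->buf + s->used, pkt, pktLen);
        s->used += pktLen;
        if (++s->packets < kPacketsPerBlock)
            return 0;  // input consumed, no output yet
        size_t n = DeinterleaveAndDecode(s->buf, s->used, out, outCap);
        s->used = 0;
        s->packets = 0;
        return n;      // PCM for the whole superblock at once
    }

As Ravi notes above, the buffering itself shouldn't matter; presumably
what matters is that the output buffer finally carrying the
superblock's PCM is stamped with the correct timestamp.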
> > > One observation on the PTS of the AV keyframe packets after a
> > > seek with COOK audio:
> > >     TargetNPT = 27500 ms
> > >     Audio PTS = 25000 ms
> > >     Video PTS = 20500 ms
> > >
> > > while the PTS of the AV keyframe packets after a seek with AAC
> > > audio is:
> > >     TargetNPT = 27500 ms
> > >     Audio PTS = 27430 ms
> > >     Video PTS = 20500 ms
> > >
> > > So there can be a difference of as much as 6-7 seconds between
> > > the keyframes of the AV streams.
> > >
> > > In omx_ra_component.cpp, whenever I receive an audio packet I
> > > just memcpy it into the decoder's internal memory and report
> > > the input buffer as fully consumed, with a decoded output size
> > > of zero. This is done until I have received 29 packets; for the
> > > last packet of the audio block I again report the input as
> > > fully consumed, but now return the total decoded samples of all
> > > 30 audio packets received so far.
> > >
> > > Besides the AV sync mismatch right after the seek, the same
> > > constant delay persists until EOF; audio always plays ahead.
> > > Any pointers to resolve the issue are highly appreciated.
> > >
> > > Here is an explanation I got from the Helix community regarding
> > > the AV sync issue:
> > >
> > > "In the case of COOK audio, the timestamp of the first packet
> > > (the audio keyframe) of the superblock after a seek will always
> > > be ahead of the playback clock, compared to AAC, where every
> > > packet is a keyframe. The Helix player is aware of this, and
> > > hence, after decoding the entire superblock, the client engine
> > > clips the decoded buffer at the beginning before giving it to
> > > the renderer. The number of frames of decoded audio to clip is
> > > calculated using the sampling rate of the audio, the play
> > > duration of the current decoded superblock, and the difference
> > > between the PTS of the first packet of the superblock and the
> > > actual playback clock."
> > >
> > > PV experts, please let me know how I can drop the decoded audio
> > > before giving it to the renderer in OpenCore: at the OMX
> > > component level, after decoding, I do not have the playback
> > > clock, and hence not the difference between the audio packet's
> > > PTS and the playback clock.
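The clip computation the Helix note describes boils down to something
like the following sketch (names, the millisecond units, and the
clipping direction are assumptions drawn from the numbers above):

    // Sketch of the Helix-style clip computation quoted above.
    #include <cstddef>
    #include <cstdint>

    // Number of decoded samples (per channel) to drop from the start
    // of a decoded superblock whose first packet's PTS lies behind the
    // playback clock, so the rendered audio lines up with the clock.
    static size_t SamplesToClip(uint32_t sampleRateHz,
                                int64_t firstPacketPtsMs,
                                int64_t playbackClockMs) {
        int64_t clipMs = playbackClockMs - firstPacketPtsMs;
        if (clipMs <= 0)
            return 0;  // block starts at or after the clock: keep it all
        return (size_t)(((int64_t)sampleRateHz * clipMs) / 1000);
    }

    // With the COOK numbers observed above: clock at TargetNPT 27500 ms,
    // audio keyframe PTS 25000 ms, 44100 Hz (assumed sample rate) ->
    // SamplesToClip(44100, 25000, 27500) == 110250 samples (2500 ms).

In OpenCore the equivalent clipping is done at the sink node via
SkipMediaTS, as Ravi explains below, which is why the OMX component
itself never needs to see the playback clock.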
>
> > I will try to explain in brief:
> >
> > - The user issues a SetPlaybackRange() request to, say, 20000 ms
> >   (TargetNPT).
> > - Assuming that you have not changed any of the default settings,
> >   the SDK is set up so that it is always desired to seek to a
> >   sync point.
> > - The parser node tries to set the source to this new value.
> >   However, it sees that its two tracks start at different sync
> >   point locations: audio at 18000 ms and video at 15000 ms. Here
> >   the parser node is supposed to reset all the tracks to the
> >   least common NPT time; let's say that is 15000 ms. This is
> >   ActualNPT.
> > - The parser node also needs to report the media timestamp
> >   corresponding to this NPT that it will be generating, called
> >   ActualMediaTS.
> > - The player engine then receives these values and computes the
> >   SkipMediaTS:
> >       SkipMediaTS = ActualMediaTS + (TargetNPT - ActualNPT)
> >       i.e., SkipMediaTS = ActualMediaTS + (20000 - 15000)
> >       i.e., SkipMediaTS = ActualMediaTS + 5000
> > - This SkipMediaTS is sent to the sink node to discard frames up
> >   until that timestamp.
> > - The parser node keeps sending frames from the ActualMediaTS
> >   value, the decoder node keeps decoding them, and the sink node
> >   discards frames until a media frame with timestamp SkipMediaTS
> >   arrives. (A sketch of this computation follows below.)
> >
> > -Ravi
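Ravi's walkthrough, as a small self-contained C++ sketch (the names
and the example ActualMediaTS value are illustrative, not the PV
player engine API):

    // Sketch of the SkipMediaTS computation and the sink-side discard
    // rule described above. Names are hypothetical.
    #include <cstdint>
    #include <cstdio>

    struct SeekResult {
        int64_t actualNptMs;      // least common sync point across tracks
        int64_t actualMediaTsMs;  // media timestamp the parser resumes at
    };

    static int64_t ComputeSkipMediaTs(int64_t targetNptMs,
                                      const SeekResult& r) {
        // Frames between ActualNPT and TargetNPT get decoded, then
        // dropped at the sink.
        return r.actualMediaTsMs + (targetNptMs - r.actualNptMs);
    }

    // Sink-node rule: discard any decoded frame older than SkipMediaTS.
    static bool ShouldDiscard(int64_t frameTsMs, int64_t skipMediaTsMs) {
        return frameTsMs < skipMediaTsMs;
    }

    int main() {
        // Numbers from the walkthrough; an ActualMediaTS of 15000 ms is
        // an assumed example value (it is not given in the thread).
        SeekResult r = { 15000, 15000 };
        int64_t skip = ComputeSkipMediaTs(20000, r);
        std::printf("SkipMediaTS = %lld ms\n", (long long)skip); // 20000
        std::printf("drop 17000? %d\n", ShouldDiscard(17000, skip)); // 1
        return 0;
    }

Note that with COOK audio the parser can only resume at a superblock
boundary, which is presumably why the audio ActualNPT can sit several
seconds away from the target and must be clipped downstream.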
