On Dec 12, 2007, at 10:32 PM, Simon Flannery wrote:
I would like to check my understanding of Early and Late media also
know as Early (or Late) operating mode:
- Using Early media, the inviting user may talk immediately, and the
core network should buffer the media before the invited user (or
users) accept the session
- The received media is buffered until at least the first invited user
accepts the invitation
- The buffered media is also sent to all users that accept the
invitation
- Conversely, Late media mimics a traditional phone call where the
inviting user must wait until at least one inviting user accepts the
session before being allowed to send media
Is my understanding correct? Are any of the dot points in error?
Welcome to one of the most confusing aspects among the many confusing
aspects of SIP that were introduced by our attempts to compensate for
the broken signaling model of ISUP. Personally, I think we
unwittingly introduced a nightmarish level of complexity and should
have just said "no! If you need two stage signaling, finalize the
first stage with a 200 OK and then re-INVITE if the SDP changes for
the second session." But we didn't do that, so we have what we have.
IETF seems to use the terms "early media" and "early session",
although I'm afraid I don't completely understand the nuances despite
having struggled with them for many years. I suspect I don't want to
understand just because the whole thing hurts my delicate
sensibilities. I'm told that RFC 3960 is a useful reference for this
discussion.
Both uses of "early" refer to the transmission of media before
completion of some phase of the signaling. Sometimes this is mean to
be "before completion of the offer/answer phase" (that is, after an
"offer", but before an "answer", and sometimes (as defined in RFC
3960) before the completion of a final response (generally a 200) to
an INVITE request.
RFC 3960 uses the following definition:
Early media refers to media (e.g., audio and video) that is
exchanged
before a particular session is accepted by the called user.
Within a
dialog, early media occurs from the moment the initial INVITE is
sent
until the User Agent Server (UAS) generates a final response. It
may
be unidirectional or bidirectional, and can be generated by the
caller, the callee, or both. Typical examples of early media
generated by the callee are ringing tone and announcements (e.g.,
queuing status). Early media generated by the caller typically
consists of voice commands or dual tone multi-frequency (DTMF) tones
to drive interactive voice response (IVR) systems.
In IETF and general IMS environments (AFAIK), there is no expectation
that early media will be buffered. Anything not received by a
listener is expected to be discarded.
However, OMA's Push to talk Over Cellular (POC) specification does
(or at least did, the last time I looked) allow a predictive mode
where a POC controller (which is for all practical purposes a SIP
B2BUA with media handling) can "answer" on behalf of a POC terminal
with which it has an active relationship. The POC controller then
buffers media while it activates the radio bearer needed to send the
media on to the terminating POC client. Since the POC controller has
effectively "answered" the INVITE request with a terminal response
and media answer, I don't believe this mode qualifies as "early
media" by any of the definitions in use around the IETF.
Good luck in sorting this all out.
--
Dean
_______________________________________________
Sip mailing list https://www1.ietf.org/mailman/listinfo/sip
This list is for NEW development of the core SIP Protocol
Use [EMAIL PROTECTED] for questions on current sip
Use [EMAIL PROTECTED] for new developments on the application of sip