ABSTRACT

In this chapter (RFC 5567), we describe an architectural framework for Media Server (MS) control for conferencing services as a part of the conference framework described in Chapter 2. Unlike 3pcc (see Section 1.3), we consider that the MS is a physically separate entity that is a first step for providing scalability and is not a part of the conference controller or an Application Server. The primary focus has been to define different logical entities that exist within the context of Media Server control. We have specified how the MS, being a network device, processes multimedia streams of audio and/or video in real-time protocol (RTP) streams and the control of RTP streams using the Extended RTP Profile for real-time transport control protocol (RTCP)-based RTP audio-visual Profile Feedback (RTP/AVPF), mixing of incoming media streams, media stream source (for multimedia announcements), media stream processing (e.g., transcoding and dual-tone multi-frequency [DTMF] detection), and media stream sink (for multimedia recordings). Note that we have decomposed all these media functions of the MS in the subsequent Chapters 8 through 15, making the XCON conference systems more scalable. Finally, we cover the appropriate naming conventions and interactions between the MS, the Application Server, and conference participants and their devices.