• 沒有找到結果。

Wireless ATM-based multicode CDMA transport architecture for MPEG-2 video transmission

N/A
N/A
Protected

Academic year: 2021

Share "Wireless ATM-based multicode CDMA transport architecture for MPEG-2 video transmission"

Copied!
18
0
0

加載中.... (立即查看全文)

全文

(1)

Wireless ATM-Based Multicode

CDMA Transport Architecture for

MPEG-2 Video Transmission

PO-RONG CHANG,

MEMBER, IEEE

,

AND

CHIN-FENG LIN

Invited Paper

In this paper, we present a generic ATM-based multicode code division multiple access (CDMA) transport architecture for the Motion Picture Experts Group phase 2 (MPEG-2 ) compressed video services over a bandlimited mobile channel with the em-phasis of wireless asynchronous transfer mode (ATM) cell design, spreading code management, and the impact of CDMA systems on the video services. Such services allow users to share novel MPEG-2 video applications without any geographical restrictions. The ATM technique is especially well suited for variable bit rate (VBR) MPEG-2 video because of its ability to allocate bandwidth on demand to these services. However, since the mobile radio has a limited channel capacity, the overall capacity of the traditional ATM-based CDMA system may not be sufficient to accommodate the MPEG-2 video services requested by the multiple mobile users simultaneously. To tackle this difficulty, a multicode CDMA technique is proposed to provide multiple VBR MPEG-2 video services by varying the number of spreading codes assigned to each MPEG-2 video in order to meet its dynamic throughput requirement. Moreover, both the multipath fading and interference on CDMA radio channels tend to cause significant transmission error and MPEG-2 bit streams are very vulnerable to these errors. Powerful forward error correction (FEC) codes are therefore necessary to protect the video data so that it can be successfully transmitted at acceptable signal power levels. Two separate FEC code schemes are applied to the header and payload of an ATM cell containing MPEG-2 video data, respectively. The ATM cell header is protected by a relatively powerful FEC code to ensure the low average cell loss rate (CLR). On the other hand, the ATM cell payload is encoded for varying degrees of error protection according to both the priority and statistical behavior of the payload data in MPEG-2 videos. An adaptive FEC code combining scheme is proposed to provide the good protection for payload data with the maximization of its code rate in order to minimize the extra bandwidth for FEC overhead. Two typical MPEG-2 test sequences are conducted to evaluate the effectiveness of our system.

Keywords—MPEG video, multicode code division multiple ac-cess (CDMA), wireless ATM.

Manuscript received January 1, 1998; revised March 25, 1998. This work was supported in part by the National Science Council, Taiwan, R.O.C., under Contract NSC 88-2213-E-009-128.

The authors are with the Department of Communication Engineering, National Chiao-Tung University, Hsin-Chu, Taiwan.

Publisher Item Identifier S 0018-9219(99)07477-0.

I. INTRODUCTION

Future third-generation mobile communication [1]–[3] based on the asynchronous transfer mode (ATM) princi-ple is designed to support a wide variety of multimedia services with diverse statistical characteristics and quality of service (QoS) requirements at cell and call levels. Among the various kinds of services, video service is becoming an important component of multimedia com-munications. A number of video coding schemes have been proposed for the compression of video signals that generate a high bit-rate stream via the ATM broad-band networks. A well-known coding scheme called the Motion Picture Expert Group (MPEG) is emerging as a major international standard for multimedia applications [4]. The MPEG specification was developed specifically to allow the transmission of VCR-quality digital images at a data rate of approximately 1–1.5 Mbits/s. IS01172 is also sometimes referred to as MPEG-1. Another specification known as MPEG-2 has been developed to support higher resolution images. The MPEG video compression algorithm relies on two basic techniques: block-based motion compensa-tion for reduccompensa-tion in temporal redundancy and discrete cosine transform (DCT) for reduction in spatial redundancy. MPEG is able to reduce the raw image data by twenty to fiftyfold. Because of the conflicting requirements of random access and highly efficient compression, three main picture types are defined. Intrapictures (I pictures) are coded without reference to other pictures and points for random access. Predictive (P) pictures are coded with reference to previous I- or P-frames. Bidirectionally predictive (B) pictures are coded with reference to a previous I- or P-frame as well as the future I- or P-P-frame. As a result, the MPEG encoder produces a variable bit-rate (VBR) compressed video bit stream in which instantaneous bit rates vary widely with scene content. Moreover, the cor-relation between consecutive pictures is very low since MPEG video generates very bursty traffic that changes dynamically on a picture-by-picture basis depending on

(2)

coding mode. The most interesting standard currently under development is MPEG-4, which offers universal access capabilities in error-prone environments and content-based scalability. Before the MPEG-4 standard has been com-pleted, this paper is aimed at investigating the wireless transmission of the defined benchmark MPEG-2 videos. It is expected to provide the design philosophy of a generic wireless transport architecture for the forthcoming MPEG-4 as well as the other widely used benchmark video codecs, i.e., H.261 and H.263 families.

The ATM technology utilizes the nature of the traffic to allocate effectively the network resources via statistical multiplexing. According to the traffic categories as de-fined in the ATM Forum TM4.0 (ATM service categories), the first class includes all the jitter-tolerant applications [unspecified bit rate (UBR) and available bit rate (ABR) service categories], while the second includes all the real-time applications [constant bit rate (CBR) and VBR service categories]. This paper focuses on the development of the code division multiple access (CDMA) transport ar-chitecture for the VBR video service as well as CBR video service. Wireless ATM has become an important research topic over the last three years [5]–[7]. Such services allow users to share novel multimedia applications without any geographical restrictions. Zhang et al. [8] have investigated an application of wireless ATM techniques to the transmission of MPEG video. However, they did not consider the problem of transmitting multiple high VBR MPEG videos over a narrow-band wireless channel simultaneously. This problem could be ameliorated via multicode CDMA transmission [9]–[11]. In other words, high data transmission rates are achieved by allocating more than one spreading code to a single MPEG video in order to create more than one virtual channel for the MPEG video. As VBR MPEG video traffic is both delay sensitive and has a high degree of burstiness, it is commonly believed that a large number of spreading codes corresponding to the peak source bit rate must be reserved for the video transmission to satisfy its QoS requirements. The reservation of a large number of spreading codes assigned to each MPEG video seems extremely inefficient for the transmission of multiple MPEG videos over a narrow-band radio channel. Taking into consideration that the ratio between the peak and average bit rate for VBR MPEG video is generally high, this implies that a large portion of spreading codes remains unused when the low bit-rate B-frames are transmitted via the multicode CDMA network. To increase the efficiency in the spreading code assignment mechanism, we designed an algorithm which dynamically allocates an appropriate number of codes to each MPEG video on the basis of its actual source rate. Furthermore, in order to avoid the self-interference that a MPEG employing multiple codes may incur, the multiple codes to/from one MPEG should be made orthogonal. This particular spreading coding scheme is called the concatenated orthogonal/pseudonoise (PN) spreading code [10] which is capable of subdividing a high bit-rate MPEG stream into several parallel lower basic bit-rate streams without self-interference. In addition, the

quality of received MPEG video may degrade severely when the ATM cell loss occurs due to the multipath fading, interference, and channel noise. In Section III, an adaptive forward error correction (FEC) code combining scheme has been proposed to eliminate the cell loss according to both the statistical behavior and the importance of data types in MPEG video. The objective of code combining is to maximize its code rate and to provide good error protection with reasonable time delay. Simulation results show that this FEC code combining scheme is able to protect the MPEG video data and achieve the good re-ceived picture quality without a large amount of extra FEC overhead data. Moreover, Zhang and Lee [12] have shown that the error concealment techniques are very effective in improving both objective and subjective quality of received MPEG video via multipath fading channels. Therefore, in Section VII, a modification of both the spatial and temporal error concealment techniques [13]–[18] is applied to recover the received MPEG video with infor-mation loss. Finally, a number of typical MPEG video test sequences are conducted to verify the effectiveness of the proposed ATM-based multicode CDMA system in various aspects.

II. OVERVIEW OFMPEG STANDARD

The MPEG video international standard [4] specifies the coded representation of the video data. The video source coding is based on motion compensated hybrid DCT coding which employs two basic techniques: motion compensation for the reduction of temporal redundancy and DCT transform compression for the reduction of spatial redundancy. In order to achieve the highly efficient com-pression and meet the conflicting requirements of random access, the input video is divided into units of group-of-pictures (GOP’s) consisting of an intra (I) picture, coded without reference to other pictures, and arrangement of predictive (P) pictures, coded with reference to previous (I or P) picture, and bidirectionally predictive (B) pictures, coded with reference to an immediate previous (I or P) picture, as well as an immediate future (P or I) picture. The I picture frame at the beginning of a GOP serves as a basic entry point to facilitate random seek or channel switching and also provides coding robustness to transmission error, but it is coded with only moderate compression to reduce the spatial redundancies. P picture frames are coded more efficiently using motion compensated prediction from a past I or P picture frame and are generally used as a reference for further prediction. B picture frames provide the highest degree of compression, but require both past and future reference pictures for motion compensation. It should be mentioned that B pictures are never used as references for prediction. A GOP is defined by its length which is the distance between I pictures. An example of a GOP in MPEG is presented in Fig. 1, where I, P, and B denote picture encoding in the intrapicture mode, predicted mode, and bidirectionally predicted mode, respectively. The GOP

(3)

Fig. 1. Example of MPEG GOP.

Fig. 2. Logical structure of a wireless ATM cell with outer/inner FEC code combining.

The macroblocks of 16 16 pixels are the basic coding units for the MPEG algorithm. Each macroblock is divided into four blocks, where each block contains 8 8 pixels. The main extension from monochrome video to color is the addition of two 8 8 chrominace blocks to the macroblock. A row of macroblocks that makes up a horizontal strip in the image is called a slice and a number of slices are combined to form a picture. The coding mode of each macroblock within a specific picture depends on its picture type. For I pictures, a DCT is performed on each block. The resulting two-dimensional block of DCT coefficients is quantized and scanned in a zig-zag order to convert it into a one-dimensional string of quantized DCT coefficients. Run-length coding is used for the quantized coefficient data. The predicted pictures (P and B) use motion compensated prediction of the contents of the macroblock based on past or future reference pictures. This prediction is subtracted from the actual data in the current macroblock to form an error signal. The prediction error is coded like the intracoded macroblocks.

III. DESIGN OFWIRELESSATM CELLS FORMPEG VIDEOS

Zhang et al. [8] have proposed a wireless ATM cell model for MPEG-2 videos which is quite similar to Lei’s cell structure [19] with a header of 5 bytes and a payload of 48 bytes for MPEG-2 over wireline ATM networks. However, wireless multipath fading, interference, and noise tend to cause significant ATM cell loss. A single cell loss will result in the severe MPEG image quality degradaion when the cell carries a large amount of payload data. Lin and ElZarki [24] have suggested that the wireless ATM cell should not carry too much data to avoid the loss of large amount of image data in a single cell loss. Furthermore, FEC codes are necessary to protect either ATM cell header or payload data so that it can be successfully transmitted at an acceptable signal power level in order to provide adequate image quality. However, the extra bandwidth for FEC overhead for wireless ATM cells seems critical in the narrow-band wireless networks. This section will investigate the appropriate tradeoff between error control and MPEG video transmission quality.

Table 1

Priority Rank for the Different Data Types Within Each Frame (I, P, B)

Two separate error control coding schemes are applied to the header and payload of a wireless ATM cell il-lustrated in Fig. 2. Usually, an ATM cell consists of a 5-byte header. Raychaudhur [3] showed that the ATM header over a wireless network could be compressed to either 2 or 3 bytes. For simplicity, a 2-byte compressed header will be used in the wireless ATM cell. A shortened Bose–Chaudhuri–Hocquenghem (BCH) code of (31,16) is suggested to protect this header. On the other hand, a concatenated error control coding scheme with unequal error protection (UEP) is applied to the payload in accor-dance with two priority categories, i.e., the priority for each picture frame in the temporal domain and the importance of data types belonging to the frame. There are three priority levels involved in three frame types belonging to the MPEG video at three different time points in the temporal line (domain). For the MPEG bit stream, I-frame (I picture frame) information tends to be the most important data for transmission because the effect of losses on this frame will tend to propagate, until the next I-frame, and then causes no temporal prediction for and B-frames. The P-and B-frames are, respectively, the medium P-and the least important data for transmission since P-frame is usually treated as a reference for the temporal prediction of B-frames. Therefore, there are BCH codes with three different error protection levels, i.e., of (255 215), (255 223), and (255, 231) which are used to protect I-, P-, and B-frames, respectively. These codes are called the “inner” FEC codes according to the definition of a concatenated code composed of convolutional and Reed–Solomon (RS) codes [20]. The outer FEC codes are proposed to protect the high-priority data belonging to each frame twice. By analyzing the data types in each of I-, P-, and B-frames, it is found that there are six priority levels for the data in I-frames and seven priority levels for the data in either P- or B-frame shown in Table 1. For example, for I-frames, the highest priority is given to frame header and the lowest priority is given

(4)

to high frequency AC coefficients. The frame header is usually included in a resource management (RM) cell with a high error protection which is used to provide the current frame information and its resource management status for mobile receivers before transmitting the ATM cells. Hence, the outer FEC codes are proposed to protect the second, third, and fourth highest priority data [Slice header, macroblock (MB) header, and DC values] in an I-frame, and to protect the second, third, fourth, and fifth highest priority data (Slice header, MB header, motion vectors, and DC values) in both P- and B-frames. Neverthless, the remaining low priority data (AC values) are still protected by the inner codes. For the sake of reducing the implementation complexity, a class of shortened BCH codes is chosen for the design of the outer FEC coding. The next section will discuss more details of the code selection principle for outer codes.

A. Code Selection Principle for Outer Codes According to Data Partitioning

The work by Parthasarthy et al. [21], which describes a code selection strategy in the case of a normal RS code scheme for video traffic over wireline ATM networks, is closest to that in this paper. However, in [21], the MPEG video protected by the use of concatenated code according to its priority have not been considered. The objective of code selection is to minimize the outer code overhead and to provide good protection with reasonable time delay. In other words, we choose an appropriate class of outer codes which maximizes the average code rate for a coded ATM cell given by

(1) where denotes the average number of message bit in a coded ATM cell, and and represent the av-erage numbers of bits in a coded header and bits in a coded payload with inner/outer coding, respectively. The maximization of is over all the shortened BCH outer codes. Unfortunately, it is hard to characterize directly according to the VBR characteristics of the high-priority data in each frame of MPEG video. To overcome this difficulty, the expression of (1) can be approximated by a form in the slice manner

Slice

CSlice Code CATMh (2)

where Slice is the average number of message bits in a slice, CSlice denotes the average number of bits in a coded slice using outer coding scheme only. Code and CATMh represent the average numbers of bits in the inner code overheads and bits in the coded ATM headers for a slice, respectively. Let denote the number of message bits in the inner code. In other words, each ATM cell with inner coding can carry message bits per cell. It should be noted that the coded bits generated by outer coding are treated as the message bits for the inner coding. Hence, the average

number of ATM cells for a slice is equal to CSlice cell , where denotes the smallest integer which is greater than As a result, Code

and CATMh where (I-frame) or

223 (P-frame) or 231 (B-frame). Therefore, the expression of (2) becomes

Slice

CSlice CSlice

(3)

where or 223 or 231.

Since Slice is a fixed value independent of outer coding scheme, the maximization of then becomes the mini-mization of CSlice The expression of CSlice for MPEG-2 in 4 : 2 : 0 mode is given by CSlice CSliceh CMBh CDC CDC CDC AC for I-frames (4) or CSlice CSliceh CMBh CMV CDC CDC CDC AC for P- or B-frames (5)

where CSliceh is the average number of bits in a coded slice header using outer coding. CMBh denotes the average number of bits in a coded MB header, and is the average number of MB’s in a slice. CDC denotes the average number of bits in the coded dc values of -block and represents the average number of -blocks in a

slice, where or or CMV is the

average number of bits in a coded motion vector and is the average number of motion vectors in a slice. Note that the above parameters are encoded using outer coding scheme only. Meanwhile, AC denotes the average number of bits belonging to the low priority AC coefficients in a slice without outer coding.

B. Adaptive Code Combining Strategy for Outer Code Selection

Two typical MPEG-2 videos are considered in the design of outer code selection. They are Flower Garden with a high average bit rate of 3 Mbits/frame and Table Tennis with a medium average bit rate of 1.5 Mbits/frame. Figs. 3 and 4 show the data distributions for DC values in -blocks, -blocks, and -blocks belonging to I-frames of Flower Garden and Table Tennis, respectively. Observing those data distributions, it is found that most of the high-priority data are distributed over a small number of bits. From Figs. 3 and 4, the maximum number of bits in DC (DC values in -block) is 20. However, the probability of DC less than 7 bits is 0.825 for Table Tennis or 0.569 for Flower Garden. Moreover, Pr DC bits

(Table Tennis), 0.969 (Flower Garden). In other words, most of ’s are distributed over the range of

(5)

(a)

(b)

(c)

Fig. 3. Data distribution of DC coefficients in (a)Y -block, (b) U-block, and (c) V -block belonging to the I-frame of Table Tennis.

(a)

(b)

(c)

Fig. 4. Data distribution of DC coefficients in (a)Y -block, (b) U-block, and (c) V -block belonging to the I-frame of Flower Garden.

[0,11]. Based on the above observation, an adaptive code combining scheme is proposed to achieve the optimal outer code selection according to the statistical characteristics of MPEG source coding. This combining strategy is quite similar to the concept of Huffman source coding. Two basic shortened BCH codes of (7,4) and (15,7) play an important role in the code combining scheme in order to minimize the code overhead and also to provide good protection with reasonable delay. The following procedure for encoding DC in an I-frame is treated as a typical example for code combining: 1) if DC bits, use BCH(7,4)

to encode DC ; 2) if DC use BCH(15,7)

to encode it; 3) if DC bits, use BCH(7,4)

BCH(15,7) to encode it; 4) if DC bits,

use 2 BCH(15,7) to encode it; 5) if DC bits, use BCH(7,4) 2 BCH(15,7) to encode it; and

6) if DC bits, use 2 BCH(7,4) 2

BCH(15,7) to encode it. The probability for each data range can be found as : 1) Pr DC

(Table Tennis), 0.25 (Flower Garden); 2) Pr DC

0.354; 3) Pr DC

0.400; 4) Pr DC 0.030; 5)

Pr DC ; 6)

Pr DC Therefore,

the average number of bits in the BCH outer-coded DC

belonging to an I-frame becomes CDC for

Table Tennis or 16.5320 for Flower Garden. Similarly, the same BCH outer code combining can be applied to DC values in both the - and -blocks belonging to an

I-frame. Hence, CDC (Table Tennis), 13.188

(Flower Garden), CDC 11.573.

are found to be 88, 22, 22 for either Table Tennis or Flower Garden. A combining of 6 BCH(15,7) is used to encode a fixed 38-bit slice header belonging to an I-frame with high FEC protection. Similarly, a fixed 2-bit MB header is protected by BCH(7,4). As a result, CSlice

and CMBh In addition, equals 22 for both

Table Tennis and Flower Garden, and AC

for Table Tennis or 22 141.772 for Flower Garden. From

(4), CSlice for Table Tennis or 24385.409

for Flower Garden. Moreover, the average code rate for ATM cells generated from an I-frame is computed by (3).

It yields for Table Tennis or 0.7388 for

Flower Garden. In addition, the data distribution for high priority data in either P- or B-frame belonging to either Flower Garden or Table Tennis can be found in [29]. It is found that their high priority data are also distributed over a small number of bits. For P- and B-frames, the same BCH outer code combining scheme is applied to their high priority data according to their data distribution. However, a combining of BCH(15,7) 2 BCH(31,16) is applied to the 38-bit slice header belonging to either P- or B-frame with a medium FEC protection. This leads to CSlice The average code rate for ATM cells generated from either P- or B-frame is found to be

(Table Tennis) or 0.766 65 (Flower Garden), or (Table Tennis) or 0.754 85 (Flower Gar-den). In the near future, a class of rate compatible punctured convolution codes (RCPC’s) as well as turbo codes will be considered in the design of the inner coding for MPEG videos in order to achieve the better FEC code performance, where turbo codes are, in essence, parallel concatenated convolutional codes which have been shown to perform near the capacity limit on fully interleaved fading channels [30]. Moreover, we will concatenate our proposed BCH outer code combining scheme with either RCPC or turbo inner code [31] to further reduce the BER floor.

IV. TRANSMISSION OF MPEG-2 VIDEOS OVERATM-BASED MULTICODE CDMA MOBILE COMMUNICATION NETWORKS

In the near future, the third-generation mobile commu-nication will allow high mobility users to access video information stored at various network sites. The multicode CDMA seems one of the major technologies [1], [9], [10] for supporting video services in the third-generation

(6)

Fig. 5. Overall schematics of multiple MPEG video transmission via ATM-based multicode CDMA system where RM cell containsNk; frame type, and outer code combining type.

system. In this paper, we are trying to propose a generic multicode CDMA system for providing the transmission of multiple MPEG-2 videos via a wireless ATM network. Fig. 5 shows this concept. For the th MPEG source, the system first identifies the frame type at a specific time instant, and the frame is then decomposed into a number of slices. A data priority analysis is applied to each slice for the construction of wireless ATM cells. Moreover, an outer FEC code proposed in Section III is applied to the high-priority payload according to the priority analysis. Moreover, an inner FEC code is used to protect the outer-coded high-priority data twice and the remaining

low-priority data with regard to the importance of the frame type in the temporal domain. Since the traditional CDMA system may not be sufficient to accommodate the transmission of multiple high bit-rate MPEG videos via a narrow-band mobile channel, a multicode CDMA technique is proposed to overcome this difficulty. In the multicode CDMA system, when the th MPEG needs times the basic transmission rate, it converts the original high bit-rate bit stream using a multiplexer or serial-to-parallel converter into basic rate streams, encodes each with a different spreading code, modulates them with a different Walsh modulator, and superimposes them before upconverting for

(7)

mobile transmission. In other words, each spreading code in the multicode CDMA carries a data stream with a basic rate spreading codes in parallel will provide a single user with times the basic rate capability. Note that each MPEG admitted into the system has a primary PN code assigned to it. These PN codes are not orthogonal between different MPEG users. To avoid the self-interference that a MPEG user employing multiple codes may incur, the multiple codes to/from one user should be made orthogonal. If is the primary PN code assigned to the th MPEG with a basic transmission bit rate new concatenated spreading codes, ’s can be derived by

where is the th orthogonal code,

and if It should be mentioned that the

actual value of depends on the frame type, frame period, basic transmission rate, desired cell loss rate (CLR), and total number of bits in all the coded ATM cells. In Section V, a spreading code assignment mechanism is proposed to determine an appropriate value of which is able to achieve both the higher transmission capability and better received picture quality. Due to the orthogonality requirement, the maximum number of orthogonal codes per user is the ratio between the channel chip rate and the Walsh modulator output rate. is termed as the spreading

sequence length. Hence, The above coding

strategy is called the subcode concatenated scheme that orthogonal sequences are concatenated with a PN sequence to increase the randomness of the orthogonal sequence. The binary orthogonal sequences used in this paper were the well-known Walsh–Hadamard (WH) codes which have zero cross correlation at zero time delay. They are used when synchronization of transmission can be maintained. Unfortunately, the multipath fading in a cellular radio environment introduces nonzero time delays that destroy the orthogonality between Walsh–Hadamard (WH) codes. Fong et al. [7] showed that a sufficiently long PN sequence is concatenated with WH codes to randomize and eliminate their unsatisfactory and inhomogeneous behavior at nonzero time delays. The long PN sequence may be chosen as either -sequence or Gold sequence. Before transmitting the ATM cells, an RM cell containing the frame header, the value of , and the resource management information is sent to the desired mobile receiver. Thus, the receiver is able to demodulate the bit streams. Finally, spatial/temporal error concealment techniques are proposed to improve the received image quality.

A. Multicode CDMA Transmitter Model with

Concatenated Orthogonal/PN Spreading Code Scheme

For MPEG videos transmitted over a mobile channel, each of them is divided into parallel data streams (virtual channels) for user at time where a specific WH code is assigned to each virtual channel. The value of is dynamically determined by a code assignment mechanism discussed in the next section. For simplicity,

is assumed to be a constant in the th

time interval i.e.,

for Hence, the total number of codes

assigned to all the users in the new time interval

is where denotes the th intersection

interval of the time intervals for all the MPEG’s, i.e.,

Usually, is a

fixed value called the frame period. Note that WH codes assigned to different users with different PN codes may be identical. It should be mentioned that the above arguments are valid when each user has a continuous MPEG video presentation. However, this assumption may not be true since some users may not have MPEG videos during a specific time interval, i.e., To tackle this difficulty, the value of for this particular interval is set to zero. Therefore, the above argument becomes valid again. For simplicity, we assume that each of the MPEG’s has its corresponding picture frames during The transmitting binary phase shift keying (BPSK) signal of the th data stream (virtual channel) belonging to the th MPEG during the th time interval is expressed as

(6) where is the transmission power of the base station, is the random phase angle, uniformly distributed between zero and introduced by the modulator, and is the data signal which consists of a sequence of rectangular pulses of duration i.e.,

(7)

where and is a unit pulse function of

duration The concatenated spreading code can be expressed as

(8)

where is the concatenated spreading

se-quence which is equal to the product of a PN sese-quence used by the th user and a code sequence assigned to its th virtual channel. The duration of each data bit is while the duration of each chip in the spreading code is The number of chips per bit is where is an integer. The period of the WH code sequence is equal to the processing gain The long PN sequence has a period that is

much greater than Moreover, and are chosen

to be relatively prime so that every possible chip of the PN sequence can occur at the beginning of some data bit. As a result, the total signal transmitted to users is

(8)

B. Mobile Radio Channel Model

The Rayleigh multipath fading model is the general accepted channel model for mobile communications [22]. In this paper, we adopted the Rayleigh fading model for performance analysis in our multicode CDMA system. The channel impulse response for the th virtual channel assigned to the th user is given by

(10) where is the th Rayleigh distributed random path gain, is the th random path phase, uniformly dis-tributed between zero and and is the th uniformly distributed random delay ranging from zero to one data bit period represents the unit impulse function. denotes the number of resolvable multipaths for the th virtual channel assigned to user In addition, it should be mentioned that these channel parameters vary with the transmitter–receiver distance. It may be shown that

and for

since all the parallel virtual channels introduced by the same user are transmitted over the same propagation environment between the transmitter and receiver, and then would have identical channel characteristics, where denotes the variance of a random variable

C. Receiver Model

The received signal at the input to the mobile receiver is given by

(11)

where is the complex envelope of denotes

the real part of complex number, and is a white Gauss-ian noise with two-sided power spectral density

For simplified analysis, the first virtual channel assigned to the first user is chosen as the reference for calculating the probability of error of its data symbol in the th

sampling time interval The receiver

is able to coherently recover the carrier phase and locking to the th path as a reference path between the reference transmitter and its corresponding receiver. All other paths constitute interference. That is, we assume

without loss of generality that and The

envelope of the output of the optimum BPSK demodulator at the th sampling time instant is denoted by

and can be expressed as

(12)

where

(13) is the intra multi-user interference (self-interference) indicating the interference introduced by the other virtual channels belonging to the same reference user, is the intra multipath interference, and denotes the inter multi-user interference. It should be noted that the orthog-onality of the concatenated spreading codes eliminates the intra multi-user interference (self-interference) [10]. This leads

D. Calculation of Average CLR

This wireless ATM cell is transmitted over the wireless CDMA link. Assuming that bit synchronization and cell synchronization have been achieved, the mobile receiver first decodes the ATM header. If the cell is successful, the receiver then decodes the payload. There are two major factors that cause wireless networks to discard cells. One is bit errors in the header due to the multipath fading, interference, and channel noise. The other is buffer overflow in multiplexing or cross-connecting equipment. The cell loss process due to buffer overflow can be approximated by a two-state Markov chain called Gilbert model [23]. Here, for simplicity, we only consider the cell loss due to the bit errors in the header. Usually, a single cell loss may cause a loss of a amount of image data and results in a high degradation of picture quality. Hence, the CLR is regarded as the major performance index used to evaluate the quality of received picture. For the simplified derivation of the CLR, the Gaussian assumption is to take all the self-interference, intra multipath interference, and inter multi-user interference terms as Gaussian noise. Reference [29] has shown that the average value of half the signal-to-noise plus interference power ratio becomes

(14)

where represents the received signal energy per bit via the th path (reference path),

and Note that is always greater than

one. For simplicity, assume that the path gains for all the virtual channels in the same user are independent identically

distributed (i.i.d.), i.e., Thus the value

of is identical to i.e., Proakis [22] showed that the bit error rate (BER) for a receiver with maximal ratio

(9)

combining (MRC) of order can be expressed as a form in terms of

for MRC of order (15)

In (15), for MRC, the quantity represents the aver-age signal-to-noise ratio per combined path, and

Since a BCH code of (31,16,3) is proposed to protect the 16-bit compressed ATM header, the CLR is computed by

(16)

V. FRAMEINTERLEAVINGTECHNIQUES FOR

MULTI-USERTRAFFIC REGULATION

Fig. 6 shows the bit-rate trace which is a bitcount ob-served frame-by-frame in a segment (frames 1-150) in Flower Garden. The main peaks on the 360-kbit range, correspond to I-frames, while the second peaks, on the 220–310-kbit, are P-frames; the frames on the 63-kbit range are B-frames. The bit-rate trace of Table Tennis is presented in Fig. 7. Its I-frames, P-frames, and B-frames are on the range of 193, 116, and 21 kbit, respectively. The total bit count of this two-user video transmission becomes 553 kbit when the I-frames of both Flower Garden and Table Tennis are transmitted to mobile receiver simultaneously. Due to the transmission rate limitation of mobile channel, it is difficult to carry such multiple high bit-rate I-frames over a narrow-band mobile channel within a short frame time

period ( s) simultaneously. On the other

hand, the bit-count of the two-user video transmission becomes extremely low (84 kbit) when both low bit-rate B-frames are transmitted at the same time. This rapidly VBR characteristics seems a weakness to the transmission of multiple MPEG videos via a CBR mobile channel. To tackle this difficulty, a frame interleaving strategy is proposed to avoid the simultaneous appearance of multiple high bit-rate frames and then regulate the total multi-user traffic. An optimal frame interleaving pattern can be found by minimizing the total multi-user traffic subject to an inter-user time delay constraint of In this paper, both the GOP and of each MPEG video are assumed to be identical to 6, where GOP and are the distance between I-frames and the distance between P-frames, respectively.

Two typical MPEG videos of GOP and are

Flower Garden and Table Tennis. To illustrate the concept

Fig. 6. Bit-rate trace for Flower Garden.

of frame interleaving, we consider an example of five MPEG videos of GOP 6 and under a reasonable

inter-MPEG delay constraint of The optimal

frame interleaving pattern for these five MPEG videos is given at the bottom of the page.

In the optimal interleaving pattern, the lowest average bit-rate MPEG video is given to MPEG 1 and the highest average bit-rate MPEG video is assigned to MPEG 5. The th MPEG is delayed by a time interval of frame periods with respect to MPEG 1 in order to avoid the simultaneous appearance of multiple high bit-rate frames, However, due to the delay constraint of two I-frames may appear in both MPEG 1 and MPEG 5 at the same time. Therefore, at steady state (after three frame periods), this leads to six frame combi-nations, (P,B,B,I,P), (B,P,B,B,B), (B,B,P,B,B), (I,B,B,P,I), (B,I,B,B,B), and (B,B,I,B,B), and the frame combination with a maximum five-user bit rate in the proposed frame in-terleaving is (I,B,B,P,I), where the th component represents a specific frame type of the th MPEG, where

There are two high bit-rate I-frames, one medium bit-rate P-frame, and two low bit-rate B-frames belonging to the maximum five-user bit-rate frame combination which are much better than five high bit-rate I-frames that appear in the transmission of five MPEG videos simultaneously. It should be noted that these six frame combinations may be reduced to four frame combinations when these five MPEG videos have an identical bit-rate for each of their I-, P-, and B-frames. The concept of frame interleaving can be extended to a more general case. The optimal frame

Time MPEG-1 MPEG-2 MPEG-3 MPEG-4 MPEG-5

(10)

Fig. 7. Bit-rate trace for Table Tennis.

interleaving of MPEG videos with different GOP and can be obtained by minimizing the following performance index called the effective bandwidth for total multiplexed MPEG video streams [32] using the simulated annealing algorithms:

subject to (17)

where denotes the bit-rate trace (traffic envelope) for the th MPEG video stream,

referred as the time delay arrangement vector, completely specifies the synchronization structure for the streams, is the time delay for the th MPEG stream, and is any positive integer.

VI. DYNAMIC SPREADINGCODEASSIGNMENT FORMULTIPLEMPEG VIDEOSERVICES

The spreading code assignment mechanism is responsible for assigning an appropriate number of WH codes frame by frame for every MPEG video in each frame time period so that the received picture quality of each MPEG video is guaranteed with a CLR value less than the maximum allowable CLR. For example, it is found that the average number of ATM cells for each of I-, P-, and B-frames belonging to both typical MPEG videos Flower Garden and Table Tennis according to the concatenated channel coding

is in the order of For example, the

average number of ATM cells for an I-frame is 1.41 10 for Flower Garden or 9.75 10 for Table Tennis. The maximum allowable cell loss rate is then set to be

in the order of if the allowable

average number of cell losses in a frame is in the order of The picture degradation due to these cell losses can be eliminated by using the spatial and temporal error concealment technique proposed in Section VII. From (16), it is known that the maximum allowable CLR for each frame depends essentially on frame combination, and the number of WH codes assigned to each frame in a specific frame combination. On the contrary, it is desired to obtain the appropriate maximum WH code number assigned to each frame that is able to achieve its maximum allowable CLR. For ease of analysis, it is assumed that all the MPEG videos with an identical medium bit-rate trace are transmitted over a typical indoor multipath fading channel

[25] with and dB. Hence, the

number of WH codes assigned to each frame belonging to any one of the above MPEG videos with an identical rate trace is assumed to be identical. This medium bit-rate MPEG video may be chosen as Table Tennis. In each frame interleaving scheme, the maximum multi-user bit-rate frame combination belonging to the optimal frame interleaving will play an important role in the determination of maximum allowable WH code number assigned to each frame. There is no general method to determine such numbers, only the heuristic method. It is found that the

(11)

maximum -user bit-rate frame combination belonging to the optimal frame interleaving contains only the I- and B-frames when the number of users, is equal to either two or three and the inter-MPEG delay constraint is

In this case, there are one I-frame and B-frames involved in the combination, where Therefore, the CLR is a function of two integer variables, and which are the numbers of WH codes assigned to I- and B-frames belonging to any one MPEG video, respectively. The cell loss rate for I-frame, i.e., due to

interfering B-frames is characterized by the following half the signal-to-noise plus interference power ratio:

(18) In contrast to CLR the cell loss rate for a specific B-frame, i.e., CLR due to one interfering I-frame and

interfering B-frames is characterized by

(19) Next, it is desired to determine the appropriate values for both and which are able to quarantee both CLR and CLR be in their respective maximum allowable ranges. The average number of WH codes assigned to a specific frame can be estimated by

(20)

where denotes the average number of bits in a specific

frame, is set to be s and kbit/s [10]. In

Section III, for Table Tennis, it is found that equals 0.6957 for an I-frame or 0.755 12 for a P-frame or 0.650 94 for a B-frame. In addition, the actual value of is obtained to be 1.9274 10 bits/frame for an I-frame or bits/frame for a P-frame or 2.0737 10 bits/frame for a B-frame. Therefore, the resulting average

WH code numbers become and

that are used to determine the rough estimate for the operating range of maximum allowable WH code number for each frame. From the above discussion and (15)–(17),

it is found that and are able

to guarantee that the CLR’s for both I- and B-frames are in the order of 10 when Moreover, the average number of ATM cells for each frame in Table Tennis is 9.75 10 for an I-frame, or 5.4 10 for a P-frame, or 1.2 10 for a B-frame according to the ATM cell generation method proposed in Section III. This leads to almost one cell loss in each frame. By the similar analysis, and become 45 and 6, respectively, in order to guarantee the reasonable CLR for eithe I- or B-frame when becomes three. When is greater than three, a P-frame will appear in the maximum four-user bit-rate frame combination, i.e., (I,B,B,P) if or two

I-frames and one P-frame appear in the maximum five-user bit-rate frame combination (I,B,B,P,I) if When becomes larger, the total number of WH codes assigned to those frames also becomes larger and then creates larger interference to each other. In order to achieve the reasonable CLR for each frame, it is possible to abandon some low-priority data of the least important frames, (B- or P-frames), so that the picture quality is still maintained. For example, the picture quality for a B-frame in Table Tennis is still at the decoded video peak signal to noise ratio (PSNR) performance of PSNR 33.602, PSNR 39.343, and PSNR 41.254 when 60% of AC coefficients have been abandoned. This results in Similarly, the picture quality for a P-frame is at the performance of PSNR

35.151, PSNR 39.749, and PSNR 41.912 when

40% of AC coefficients have been abandoned. Thus, the value of is decreased from 24 to 17. One may abandon more low-priority data when becomes larger. Therefore, both the desired CLR and picture quality for each frame can be achieved. Note that the above analysis for the case of multiple MPEG video streams with an identical bit-rate trace can be extended to a more general case. For example, the bit-rate traces for multiple MPEG videos may be extremely different. The analysis to the general case can be found in Section VIII.

A. Spreading Code Assignment Mechanism

Since WH codes assigned to an MPEG video would create the interference to the other MPEG videos, this leads to a picture degradation in those MPEG videos. The limited maximum allowable number of WH codes is used to ensure the picture quality of all the MPEG videos within an acceptable level. Meanwhile, more WH codes also provide more virtual channels for transmission of multiple MPEG videos in order to carry all the image data or high-priority data to receivers within a short frame time period as much as possible. The tradeoff between picture quality and transmission capability seems a critical factor in determining an appropriate number of WH codes for each MPEG video. However, there is no general principle to justify this tradeoff. Our approach is based on the heuristic method. First, we should determine the optimal frame interleaving and its maximum multi-user bit-rate frame combination. Then, the maximum allowable WH code number, i.e., for each frame is determined according to the analysis of CLR’s discussed above. Thus, the maximum average message bits transmitted over virtual channels within a frame period is calculated by

Let denote total number of bits in the frame of interest, and let denote the number of bits in the th macroblock belonging to slice

The low-priority data of bits should

be abandoned according to priority bank if is greater than In the meantime, the number of abandoned bits in the th MB belonging to slice is proportional to

that is, Hence all the slices have

been recreated based on those new macroblocks. Thus, an outer/inner code combining scheme is applied to each slice,

(12)

and then to generate a number of coded ATM cells. The actual value of WH code number assigned to the frame is determined by total number of bits in all the coded ATM cells, frame period, and transmission bit rate. More details of assignment mechanism are described in the following algorithm:

Procedure

step 1 [Check whether the system is able to carry

bits/frame within ?]

If go to step 3, otherwise do the following step,

step 2 [Abandon the low-priority data in each MB]

i) Abandon low-priority bits in the th MB be-longing to slice according to the priority bank ii) Re-create each MB with the higher-priority bits:

and

step 3 [Create the coded ATM cells]

i) Rearrange the slices based on the new MB’s ii) Choose the appropriate inner FEC for payload and

add the coded cell header to the ATM cell

iii) By analyzing both the priority and statistical behav-ior of data, data bits of each slice are first packetized in the ATM adaptation layer (AAL), and then packed into the ATM cell according to the outer FEC code combing

step 4 [Spreading code assignment]

i)

total number of bits in all the coded ATM cells (no. of bits in all the coded

ATM cell headers)

(no. of bits in the outer/inner FEC overheads) (21) ii)

number of WH codes assigned to the frame (22)

VII. SPATIAL ANDTEMPORAL ERRORCONCEALMENT

TECHNIQUES FORMPEG-2 VIDEO WITHINFORMATIONLOSS

Zhang and Lee [12] have shown that the MPEG-2 source coding algorithm is very sensitive to multipath fading and interference. For example, a single bit error in the ATM header (cell loss) will often result in loss of a whole image block, which could further cause consecutive block losses. Sometimes, a number of consecutive blocks may be destroyed when a single bit error occurs in either slice header or macroblock header or motion vector even the ATM cell is successful.

The cell discarding may also be caused by network congestion and buffer overflow [23]. Several methods have been proposed to minimize the effect of cell loss on the received image quality. These schemes include FEC coding

with interleaving and error concealment using postpro-cessing methods [13]–[18]. In this paper, we consider the combination of these strategies. The two main cell loss resilience techniques described here are the spatial and temporal error concealment techniques. The spatial error concealment exploits the spatial redundancy in one picture. This technique is proposed for intracoded pictures, i.e., I-frame, where no motion information exists. Temporal error concealment makes use of the temporal redundancy in one sequence. It is devoted to intercoded pictures, i.e., P- and B-frames, because there exists some motion information. It should be mentioned that the spatial/temporal error concealment techniques are also applied to the recovery of information loss even the ATM cells are successful. For instance, a single bit error in the DCT coefficients may damage one block when the DCT coefficients of each block are coded using run-length and variable-length coding. Moreover, a number of subsequent blocks may be destroyed if a single bit error occurs in either macroblock header or slice header.

A. Multiresolution Multidirectional Interpolation for Spatial Error Concealment

Since lost blocks are often correlated with neighboring spatial blocks due to the correlation property of nature images, interpolation and surface fitting can be used to partially recover the lost information [13], [14]. However, such techniques are low-pass filtering in nature and tend to ignore high-frequency information such as edges and contours. Lee et al. [16] have proposed a fuzzy logic approach to recover the lost blocks with high frequency and yield the desired reconstructed picture quality. Meanwhile, this fuzzy loss recovery technique is too complicated for implementation. Therefore, a cost-effective multidirectional interpolation [15] is employed to conceal the lost block with high frequency, where the damaged block is surrounded by eight neighboring blocks of correctly received pixels. A voting classification mechanism operates on the eight neighboring correct blocks and determines which directions characterize the three strongest edges. A spatial interpola-tion is performed to create the recovered block for each direction separately. Those interpolation results are then mixed by averaging to obtain the final recovered block con-taining multiple interpolated edges. The determination of the directions for the strongest edges depends on extending edges presented in the neighboring blocks so that they pass through the damaged block. The direction classification for edges can be performed by Sobel gradient operator which is able to give accurate angle estimates.

The multidirectional interpolation works well when the angular direction of edge is near to one of eight directional categories. However, this method becomes invalid when the edge direction is not close to any directional category. To tackle this difficulty, a multiresolution direction classi-fication is proposed to achieve the more accurate direction estimation. This classification is a two-level windowing process. The first-level process is the original voting mech-anism which provides a coarse estimation for the edge

(13)

direction and then determines the selected nested region between two adjacent direction categories. The second-level process is a fine estimation for the edge direction based on four finer directional categories belonging to the chosen nested region. The difference between any two adjacent finer directional categories is 5.625 In simulation results, we will compare our multiresolution approach with Kwok’s method [15] and show the improvement of our method.

Sometimes, a cell loss may cause a sequence of consecu-tive horizontal block losses when it contains a slice header, or a macroblock header or a sequence of block data. A sliding block iteration method is proposed to conceal the contiguous block loss by using the multiresolution multidi-rectional interpolation block-by-block. First, the damaged left-most block is recovered based on its seven neighboring correct blocks. Next, the recovered block is treated as the correct neighboring block adjacent to the second left-most damaged block and can be applied for multidirectional interpolation. In the same time, the similar recovery process is performed from the right-most damaged block to the left one. The sliding block iteration is terminated when all the damaged blocks have been recovered. The method is also valid for color MPEG videos by applying the iteration to each of -, -, and -blocks, independently. Another method to conceal consecutive block losses is to apply block-interleaving scheme to isolate the block loss. However, it may cause encoding and reconstruction delay [14], [24]. Sometimes, it will become invalid when the consecutive block losses are caused by the bit errors in either slice header or macroblock header.

B. Iterative Pattern Matching Algorithm for Temporal Error Concealment

The general approach for temporal error concealment is the temporal replacement [14], [17], [18]. In temporal replacement, damaged blocks in the current frame are replaced by the spatially corresponding ones in the previ-ously decoded reference frame with motion compensation if motion information is available. Narula and Lim [18] have shown that the estimate of motion vector in the lost block (i.e., macroblock of size 16 16 in MPEG-2) can be obtained by taking the median of all eight adjacent correct motion vectors. It should be mentioned that the motion vector is defined in a macroblock sense. Therefore, the damaged macroblock can be replaced by a correct or previously concealed macroblock in the reference picture based on the estimate. However, this method does not work well when high-frequency information such as edges, contours, and fine textures appear in the macroblock. To tackle this difficulty, an iterative pattern matching algo-rithm is proposed to improve the performance of temporal replacement. First, the damaged macroblock is recovered based on the estimated median motion vector. This leads to a coarse recovered marcroblock. In the spatial domain, the coarse recovered macroblock with 16 16 pixels is further divided into 16 subblocks with each having 4 4 pixels. Since the corner subblocks tend to be more correlated with their three immediate neighboring correct subblocks, each

corner subblock can be combined with its three neighboring correct subblocks to construct an estimated larger block of size 8 8. This estimated block is further replaced by a block in the reference picture which has the minimum mean square error deviation to match the estimated block in the current frame. One can find such a block by using block matching algorithm [14]. The corner subblocks belonging to the best-matching blocks are then treated as the correct subblocks and can be applied for the next pattern matching. Finally, those recovered subblocks are then combined to yield the fine recovered marcoblock. More details of the iterative pattern matching algorithm can be found in [33]. The pattern matching process is valid for the recovery of the damaged macroblocks belonging to a B-frame. For a B-frame, two recovered macroblocks may be obtained by using the pattern matching iteration with respect to the previous and the next reference pictures, respectively. Finally, the damaged macroblock of a B-frame has been concealed by taking the average of these two recovered macroblocks.

The iterative pattern matching can be extended to con-ceal the consecutive lost macroblocks [33]. The recovery process is performed in the right direction and left direction simultaneously until all the damaged macroblocks have been recovered.

VIII. SIMULATION RESULTS

The channel model described in this paper is identical to Model-I employed in [25]. In this model, the variances of the Rayleigh path gains are the same for all MPEG users and equal to 14.2 dB (0.038). Additionally, the maximum number of resolved multipaths for all users is equal to 4, i.e., The channel noise is set to be

dB. The multicode CDMA system has a transmission rate

of kb/s, a processing gain of and

a bandwidth of 10 MHz [10]. Its corresponding PN and orthogonal spreading sequences are chosen as the Gold sequence with a period equal to and the WH code with a period of 128. For CDMA receiver, it has an MRC of order For the interleaving scheme, the outercoded data are interleaved over 215 bits for I-frames (223 bits for P-frames or 231 bits for B-frames), and the inner-coded data plus the coded ATM header data are interleaved over 20 cells.

To evaluate the performance of multiple MPEG video transmission via multicode CDMA systems, a high bit-rate Flower Garden and a medium bit-bit-rate Table Tennis are considered as a typical example in the simulation. From section V, it is found that there are five frame combinations, i.e., (I,B), (B,I), (P,B), (B,P), and (B,B) in the optimal frame interleaving pattern, where the first and second components belong to Table Tennis and Flower Garden, respectively. Thus, (B,I) is the maximum two-user bit-rate frame combination. The average WH code numbers for an I-frame in Flower Garden and for a B-frame in

Table Tennis are found to be and

(14)

Fig. 8. CLR for an I-frame versus a varying WH code number NIdue to the interfering B-frame with eitherNB= 13 or 5.

method proposed in Section III with

and For the second maximum

two-user bit-rate frame combination, i.e., (I,B), the average WH code numbers for an I-frame in Table Tennis and

for a B-frame in Flower Garden are and

respectively when their corresponding code

rates are and

Fig. 8 shows a comparison between the simulation values and Gaussian approximation for average cell loss rate for an I-frame, CLR with a varying WH code number, due to the interfering B-frame with either or

when Note that Fig. 8 highlights a generic representation for CLR where the values of and represent, respectively, the I- and B-frames belonging to either Table Tennis or Flower Garden. CLR at a point represents the average CLR with

a value of 6.27 10 for an I-frame in

Table Tennis due to the interfering B-frame in Flower Garden. On the other hand, CLR at

denotes the average CLR with a value of 2.688

10 for an I-frame in Flower Garden due

to the interfering B-frame in Table Tennis. Thus, the average number of cell losses in an I-frame (cell loss number per I-frame) belonging to Table Tennis

is equal to 0.611 since the average

number of ATM cells generated from its I-frame (ATM cell number per I-frame) is found to be 975. The average cell loss number per I-frame for Flower Garden equals

3.79 since its average ATM cell

number per I-frame is 1410. On the contrary, Fig. 9 shows the average CLR for a B-frame with either or due to the interfering I-frame with a varying In

contrast to Fig. 8, CLR at represents

the average CLR with a value of 2.0 10 for a B-frame in Flower Garden due to the interfering I-frame in Table Tennis. Therefore, the average cell loss number per B-frame for Flower Garden equals 0.6 since its

Fig. 9. CLR for a B-frame with eitherNB= 13 or 5 due to the

interfering I-frame with a varying WH codeNI:

average ATM cell number per B-frame is 300. Similarly, the average cell loss number per B-frame for Table Tennis

equals 1.622 since CLR at is 1.0385

10 and its average ATM cell number per B-frame is 120. Based on the above analysis, it is seen that the average number of ATM cell losses in either I- or B-frames belonging to either Flower Garden or Table Tennis is in the order of As a result, the appropriate maximum allowable numbers of WH codes assigned to I- and

B-frames could be chosen as and

respectively, by considering the compromise between the above two cases.

Considering the third maximum two-user bit-rate frame combination, i.e., (B,P), and the fourth maximum two-user bit-rate frame combination, i.e., (P,B) according to the similar analysis, their maximum allowable WH code

numbers can be chosen as and

since the average WH code numbers for a P-frame and for a B-frame in the highest bit-rate Flower Garden are,

respectively, and It is found

that the average cell loss number per frame for a P-frame in Flower Garden and for a B-frame in Table Tennis are equal

to and

respectively. According to above analysis, these maximum allowable WH code numbers for I- , P-, and B-frames may be directly chosen as the WH code numbers derived from their respective average bit-rates in the highest average bit-rate MPEG video stream. Meanwhile, actually, these maximum allowable WH numbers could be obtained in accordance with the highest peak bit rate for each frame in the highest average bit-rate MPEG video over the presentation time period. This would result in the larger value of maximum allowable WH number for each frame. More WH codes are able to provide more virtual channels for the transmission of multiple MPEG videos in order to carry all the picture data to receivers within a short time period. However, the larger maximum allowable WH

(15)

number also creates the larger CLR for each picture frame and then degrades the picture quality further. Moreover, a large portion of WH codes remains unused when multiple low bit-rate B-frames are transmitted over the multicode CDMA channels. For the case of maximum allowable WH numbers in the average bit-rate manner, it is suggested that an amount of low-priority data in a picture frame with higher peak bit-rate should be abandoned according to the spreading code assignment mechanism in order to achieve the reasonable CLR when its bit-rate is greater than the average bit-rate at a specific time instant. For example, the picture quality for a B-frame in Flower Garden is maintained at the performance of PSNR

PSNR and PSNR when 60%

of AC coefficients have been abandoned. Thus, the value of is reduced from 13 to 7. Similarly,

becomes three when 60% of AC coefficients have been abandoned. Its picture quality is still maintained at the

performance of PSNR PSNR and

PSNR Thus, the value of is reduced

from 13 to 7. From the above discussion, it ensures that our spreading code assignment mechanism is able to achieve each picture frame with a reasonable average ATM cell loss number per frame proportional to when the maximum allowable WH code numbers in the average bit-rate manner are treated as the input to the assignment mechanism. As a result, both the cell loss number per frame and received picture quality performance for each picture frame with higher peak bit-rate are still achieved at acceptable levels when the low-priority data in each macroblock of these frames have been abandoned according to our assignment mechanism.

The second major factor in picture degradation is caused by the transmission error in the payload containing MPEG video data; even the delivery of its associated ATM cell is successful. The impact of transmission error in MPEG videos seems very significant in the picture quality since both the DCT coefficients and macroblock headers are encoded using run-length and VLC. In this paper, we make a modification for the original MPEG codec developed by ISO/IEC [27] to localize the error in a picture frame. For example, a DCT coefficient is lost if a single bit error occurs in that coefficient. A loss of the complete macroblock occurs when a single bit error appears in its header. Moreover, a slice of subsequent macroblock may be destroyed when a bit error occurs in a slice header. For P- and B-frames, a bit error in a motion vector may result in the corruption of this macroblock and the following predicted macroblocks since motion vectors are differentially encoded for the intercoded macroblocks in the same slice. The adaptive FEC code combining has been proposed to protect the payload in accordance with the importance of data types in MPEG videos. An extra more powerful outer FEC code is applied to the more valuable information, i.e., slice header, macroblock header, and DC coefficients in I-frames, or slice header, macroblock header, motion vectors, and DC coefficients in either P-or B-frames, thereby spending mP-ore channel coding bits

Fig. 10. Received I-frame (frame 54) in Flower Garden due to the interfering B-frame (frame 55) in Table Tennis.

on it. The higher frequency DCT coefficients are visually less important than the DC coefficients and have very little impact on video quality.

Our system has been evaluated over 150 frames of both Flower Garden and Table Tennis. We particularly choose the worst case of a cell loss containing the slice header and a bit error in the macroblock header when the ATM cell is successful. The worse case of the maximum two-user bit-rate frame combination, (B,I), occurs in the fifty-fouth frame (I-frame) in Flower Garden and the fifty-fifth frame (B-frame) in Table Tennis. Fig. 10 shows the received I-frame (I-frame 54) in Flower Garden with six lost areas. Observing Fig. 10 from top to bottom, it is found that the second, fourth, and sixth lost areas (black areas) are caused by the three cell losses, and the first, third, and fifth lost areas are caused by the bit errors in the payload when its ATM cell is successful. Especially, the cell loss containing a slice header results in the loss of a slice of subsequent macroblocks in the second area. The third lost area is caused by a bit error in macroblock header. The damage rates for DC and AC coefficients are found to be 0.1515 and 13.15%, respectively. The first and fifth areas are lost when both the DC and AC coefficients have been damaged simultaneously. Moreover, from Fig. 10, it is seen that the remaining area in the picture seems visually good even 13.15% of AC coefficients have been damaged. As a result, this verifies the effectiveness of the adaptive FEC code combining for high priority data (macroblock header and DC coefficients). Fig. 11 illustrates the received B-frame (frame 55) in Table Tennis with three lost areas, where the first lost area is caused by a cell loss containing a slice header and the other two lost areas are caused by the bit errors in the payload (macroblock header and motion vectors). DC coefficients did not have any damage. The damage rate for AC coefficients is 6.867%. Fig. 12 shows the recovered I-frame (frame 54) in Flower Garden by the multiresolution multidirectional interpolation techniques. The picture quality of Fig. 12 is improved at the

perfor-mance of PSNR , PSNR ,

and PSNR . On the other hand, Fig. 13

(16)

Fig. 11. Received B-frame (frame 55) in Table Tennis due to the interfering I-frame (frame 54) in Flower Garden.

Fig. 12. Recovered I-frame (frame 54) in Flower Garden by multiresolution multidirectional interpolation.

by the iterative pattern matching method. The picture

quality performance of Fig. 13 is PSNR ,

PSNR , and PSNR . It is

found that the received I-frame (frame 54) in Table Tennis and the received B-frame (frame 53) in Flower Garden belong to the second maximum two-user bit-rate frame combination, (I,B) [29]. There are four lost areas for each of received I- and B-frames, where two of them are caused by the loss in their cells. For the I-frame in Table Tennis, two macroblock headers (second and fourth lost areas) have been damaged when their associated ATM cells are successful. The damage rates for its DC and AC coefficients are equal to 0.0758 and 2.575%, respectively. However, they have not been damaged simultaneously. On the other hand, there are a damaged macroblock header (first lost area), a damaged motion vector (third lost area), and no damage in DC coefficients for the B-frame in the Flower Garden. The damage rate for its AC coefficients is 8.490%. Reference [29] has shown that the PSNR’s of both the recovered frames are PSNR 32.620 167, PSNR 35.651 696, and PSNR 39.462 460 for the recovered I-frame, and PSNR 21.847 149, PSNR 31.926 815, and PSNR 27.735 929 for the recovered B-frame. For the worst case of the third maximum two-user bit-rate frame combination, (B,P), it has been found that the received

Fig. 13. Recovered B-frame (frame 55) in Table Tennis by iter-ative matching method.

P-frame (frame 57) in Flower Garden and the received B-frame (B-frame 58) in Table Tennis have the identical number of lost areas, where three lost areas appear in each figure. There are two cell losses (second and third lost areas), a damaged motion vector (first lost area) and no damage in both the macroblock headers and DC coefficients belonging to the P-frame in Flower Garden. The damage rate for its AC coefficients is 2.56%. In contrast to Flower Garden, there are a single cell loss (second lost area), two damaged macroblock headers (first and third lost areas) and no dam-age in the motion vectors for the B-frame in Table Tennis. The damage rates for its DC and AC coefficients are 0.0515 and 5.1%, respectively. Only the iterative pattern matching method is applied to either received P- or B-frame. The recovered P- and B-frames are improved at the

perfor-mance of PSNR 21.077 045, PSNR 30.912 437,

and PSNR 27.098 217, and PSNR 31.538 477,

PSNR 33.142 929, and PSNR 36.408 348, respec-tively [29]. Here, we ignore both the fourth maximum two-user bit-rate and minimum two-user bit-rate frame combinations, i.e., (P,B) and (B,B) which seem less im-portant in the evaluation of our system. Finally, it should be mentioned that the reference pictures used to recover either P- or B-frame by using the iterative pattern matching method are either the previously recovered I- or P-frame or both.

IX. CONCLUSION ANDFUTURERESEARCHWORK

A MPEG-2 video transmission over ATM-based multi-code CDMA channels is investigated in this paper. Two separate FEC code schemes are applied to the header and payload of a ATM cell, respectively. A shortened BCH(31,16) code is proposed to effectively protect the ATM cell header in order to ensure the average number of ATM cell losses in a frame be in the the order of Moreover, an adaptive FEC code combining is used to provide the good protection for the payload data. Especially, an additional outer FEC code is proposed to increase the error protection capability for the high priority data in the payload with the maximization of its code rate. The spreading code assignment mechanism provides the

數據

Fig. 1. Example of MPEG GOP.
Fig. 4. Data distribution of DC coefficients in (a) Y -block, (b) U-block, and (c) V -block belonging to the I-frame of Flower Garden.
Fig. 5. Overall schematics of multiple MPEG video transmission via ATM-based multicode CDMA system where RM cell contains N k ; frame type, and outer code combining type.
Fig. 6 shows the bit-rate trace which is a bitcount ob- ob-served frame-by-frame in a segment (frames 1-150) in Flower Garden
+5

參考文獻

相關文件

In Section 3, the shift and scale argument from [2] is applied to show how each quantitative Landis theorem follows from the corresponding order-of-vanishing estimate.. A number

Now, nearly all of the current flows through wire S since it has a much lower resistance than the light bulb. The light bulb does not glow because the current flowing through it

(ii) “The dismissal of any teacher who is employed in the school – (a) to occupy a teacher post in the establishment of staff provided for in the code of aid for primary

(ii) “The dismissal of any teacher who is employed in the school – (a) to occupy a teacher post in the establishment of staff provided for in the code of aid for primary

For the proposed algorithm, we establish a global convergence estimate in terms of the objective value, and moreover present a dual application to the standard SCLP, which leads to

Moreover, for the merit functions induced by them for the second- order cone complementarity problem (SOCCP), we provide a condition for each sta- tionary point to be a solution of

experiment may be said to exist only in order to give the facts a chance of disproving the

request even if the header is absent), O (optional), T (the header should be included in the request if a stream-based transport is used), C (the presence of the header depends on