• 沒有找到結果。

JOINT SOURCE-CHANNEL RATE CONTROL FOR MULTIVIEW VIDEO STREAMING OVER ERASURE CHANNELS

N/A
N/A
Protected

Academic year: 2022

Share "JOINT SOURCE-CHANNEL RATE CONTROL FOR MULTIVIEW VIDEO STREAMING OVER ERASURE CHANNELS"

Copied!
5
0
0

加載中.... (立即查看全文)

全文

(1)

JOINT SOURCE-CHANNEL RATE CONTROL FOR MULTIVIEW VIDEO STREAMING OVER ERASURE CHANNELS

Weiliang Xu, Junni Zou

School of Communication Engineering Shanghai University, China

Hongkai Xiong

Department of Electronic Engineering Shanghai Jiao Tong University, China

ABSTRACT

To enable robust multiview video streaming over current wireless broadcast infrastructures, this paper proposes a joint source and channel coding optimization framework for video streaming based upon multiview video coding technique.

It utilizes random linear coding (RLC) with ladder-shaped global coding coefficient matrix (GCM) for unequal erasure protection. An accurate expression of the decoding proba- bility for RLC with ladder-shaped GCM is derived. A fast two-layer local search algorithm is developed to solve the source and channel coding rate allocation problem, where the source and channel rate of each layer can be adjusted independently for the overall video quality. Simulation re- sults demonstrate significant video quality improvement to existing rate allocation schemes.

Index Terms— Interactive multiview video streaming, scheduling, fairness, asymmetric bargaining game

1. INTRODUCTION

Multiview video (MVV) consists of multiple video sec- quences that are captured simultaneously by multiple cam- eras from different viewpoints. The state of the art multi- view video coding (MVC) standard [1] compresses MVV efficiently by exploiting both temporal and interview re- dundancies. It requires to transmit the entire set of multiview sequences to the receiver thus is very suitable for video broad- cast applications such as cable TV broadcast. In this paper, we consider MVC content distribution in current wireless broadcast system, where MVC stream is broadcast from the server to multiple receivers with diverse channel conditions.

Wireless communication is prone to transmission errors due to attenuation, fading and interference [2]. A class of rate- less codes, such as random linear coding (RLC) [3], have been proved to have the ability to provide erasure protection with simple implementation. One challenge for RLC is that a full- rank global coding coefficient matrix (GCM) is required to successfully decode all source packets. As RLC is operated at

The work has been partially supported by the NSFC grants No.

61472234 and No. 61271211.

intermediate nodes, it is very difficult to guarantee a full-rank GCM receivers. To reduce the impact of the GCM’s rank defi- ciency on video transmission, RLC with ladder-shaped GCM [4] [5] is proposed. It improves the decodability of video data at receivers by enabling partial decoding of a block.

Since different packets may contribute differently to the decrease of video distortion, RLC with unequal error pro- tection (UEP) schemes [5]-[8] have been proposed recently.

UEP RLC provides better protection for most important pack- ets, and thus guarantee their higher decoding probabilities.

Joint source-channel coding (JSCC) is an effective method for robust video transmission over wireless lossy channels. The channel coding in the existing JSCC works on MVV stream- ing is designed on the basis of either forward error correction techniques [9] [10], or RLC with generalized GCM [8] [11].

In this paper, we consider a joint source-channel coding prob- lem for MVV broadcast, in which views are classified into dif- ferent layers in terms of MVC prediction structure, and RLC with ladder-shaped GCM is employed to provide unequal era- sure protection. Our main contributions are summarized as follows:

• We propose a joint source and channel coding rate optimization framework that utilizes RLC with ladder- shaped GCM for unequal erasure protection. Con- strained by the channel capacity, the source and chan- nel rate allocation problem is modeled as an aggregate video quality maximization problem.

• We derive an accurate expression of the decoding prob- ability for RLC with ladder-shaped GCM, in which the linear dependency of the encoding vectors for the re- dundant packets, ignored in [4], is completely consid- ered.

• We develop a fast two-layer local search algorithm to solve the proposed JSCC rate allocation problem. Un- like the existing local search algorithms, it can adjust the source rate and channel rate of each layer indepen- dently, so as to be more close to the global optimum.

The remainder of this paper is organized as follows: Sec.

II provides an accurate expression of the decoding probabil- ity for RLC with ladder-shaped GCM. Sec. III formulates



‹,((( ,&,3

(2)

the JSCC optimization problem, and then presents a fast two- layer local search algorithm. Simulation results are discussed in Sec. IV. Finally, Sec. V concludes the paper.

2. DECODING PROBABILITY FORMULATION We adopt ladder-shaped GCM as our channel coding scheme to cope with the rank deficiency problem in RLC with gen- eralized GCM. Assume that the source message is classified intoL layers according to their contributions to video quality.

LetS = (s1, . . . sL) represents the number of source pack- ets in each layer. The structure of the ladder-shaped GCM is shown in Fig. 1, where each submatrixMi contains global encoding vectors corresponding to thei-th layer, the white area is filled by zeros. The first part ofMihavesi rows are all linearly independent, and the second part havekirows for generating the redundant packets. LetK = (k1, . . . kL) de- note the number of redundant packets, then the total number of the encoded packets for all the layers can be represented as N = (n1, . . . nL), where ni= si+ki. The UEP coding strat- egy includes the firstl layers’ source packets into the encoded packets of all subsequent layers. The decoding probability of thel-th layer is non-zero only if: (i) for the first l layers, the number of the received encoded packets are at least equal to the number of the source packets, or (ii) any higher layers can be decoded with a probability greater than zero.

0

0

0

V

V

V

N

N

N

6

6XEPDWUL[ IRU VRXUFH OD\HU  6XEPDWUL[ IRU VRXUFH OD\HU  6XEPDWUL[ IRU VRXUFH OD\HU  6XEPDWUL[ IRU UHGXQGDQW 6XEPDWUL[ IRU UHGXQGDQW 6XEPDWUL[ IRU UHGXQGDQW

Fig. 1. The structure of the ladder-shaped GCM Further, let A = (α1, . . . αL) represent the number of successfully received encoded packets associated with each layer. The source message is transmitted to the receiver at a packet erasure ratep. Let P1:lbe the decoding probability of the firstl layers regardless of the impact of their higher layers.

ForL = 1 and l = 1, the least number of received encoded packets iss1, thenP1:1is given by

P1:1(n1, s1) =

n1



α1=s1

PR(n1, α1) · PF(s1, α1) (1)

where

PR(n1, α1) =

 n1 α1



(1 − p)α1pn1−α1 (2) represents the probability of receivingα1 encoded packets whilen1 encoded packets are generated at the source node.

Here,PF(s1, α1) is the probability that the GCM can reach the rank ofs1when receivingα1encoded packets.

In [4],PF(s1, α1) is regarded as 1 if α1 ≥ s1. However, even if the firstsiencoding vectors of layeri are linearly inde- pendent, the encoding vectors of the redundant packets may be linearly dependent. It means that the value ofPF(s1, α1) may be less than 1 even if we haveα1≥ s1.

Let PD(d, k) represent the probability that d out of k redundant packets are linearly independent for Galois field GF (2q) [12], then we have

PD(d, k) =

d−1

i=0

 1 − 1

qk−i



(3) For the worst case, ifα1 ≤ k1 all the received packets are redundant packets, otherwise k1 redundant packets are in- cluded in the received packets. The minimum probability of PF(s1, α1) is

PFmin(s1, α1) =

⎧⎨

PD(s1, α1), α1≤ k1

PD(n1− α1, k1), k1< α1< n1

1, α1= n1

(4) We now investigate the case thatL > 1 and l ≤ L. Let αminl denote the least number of received encoded packets for P1:lto be a non-zero value,Sl= l

i=1siandAl= l

i=1αi

are the number of source packets and received encoded pack- ets comprising the firstl layers. Clearly, for l = 1, we have αmin1 = S1. For the case with 2 ≤ l ≤ L, αminl can be obtained using the the following recursion:

αminl = sl+ max(αminl−1− αl−1, 0) (5) Thus, the minimum decoding probability of receiving the first l layers but not higher can be written as

P1,lmin(N, S) 

n1



α1=0

· · ·

nl−1

αl−1=0 nl



αlminl αminl+1−1

αl+1=0

· · ·

αminL−1 αL=0

L

i=1

 ni

αi



(1 − p)αipni−αi



· PFmin(S, A) (6) where the first line counts the events of receiving the possible number of encoded packets to decode the firstl layers, while the second line lists all the possible cases that the subsequent layersl + 1, . . . , L can not be decoded. We seek PFmin(S, A) an extension ofPFmin(s1, α1) and have

PFmin(S, A)  PFmin(Sl, Al)

=

⎧⎨

PD(Sl, Al), Al≤ Kl

PD(Nl− Al, Kl), Kl< Al< Nl

1, Al= Nl

(7)



(3)

Therefore, the average decoding probability can be calcu- lated by

P¯1,l(N, S) = P1,lmin(N, S) + P1,lmax(N, S)

2 (8)

whereP1,lmax(N, S) is the corresponding maximum value that can be obtained by replacingPFmin(Sl, Al) with 1 in (6).

3. JOINT SOURCE AND CHANNEL CODING At the server side, the input MVV is encoded using MVC.

The resulting bit stream is packetized in equal length and clas- sified into multiple layers of different importance according to the prediction structure. RLC with ladder-shaped GCM is employed to generate redundant packets. Then, the encoded packets are broadcast to the receivers. At the receiver side, Gaussian elimination is used for RLC decoding, followed by a MVC decoder. In this work, we aim at the overall video quality by optimal source and channel rate allocation strategy.

3.1. Optimal Source and Channel Rate Allocation We utilize RLC to encode the MVV at group of pictures (GOP) level. DenoteS = {s1, . . . , sL} the number of equal length source packets included in one GOP for each view, and N = {n1, . . . , nL} the corresponding number of encoded packets generated at the server includingK = {k1, . . . , kL} redundant packets. Assume the size of a source packet isb bits, and a GOP is transmitted int seconds, then the source rate of viewi can be calculated as

ri= b · si

t , i = 1, . . . , L. (9) The stream of encoded packets then is broadcast to M receiver classes characterized with their packet erasure rates p = (p1, . . . pM). Denote ξi the fraction of receiver popula- tion associated with classi and M

i=1ξi = 1. We can for- mulate the source and channel rate allocation as a constrained optimization problem:

maxN,S

1

M i=1

ξiQi(N, S, pi)

s.t.

L i=1

ni≤ Nc

(10)

where the source and channel rate are jointly optimized for maximizing the average video quality with respect to different receiver classes. Qi is the video quality of thei-th receiver class that can be computed by

Qi(N, S, pi) =

L l=1

P¯1,l(N, S, pi) · Q1:l(S) (11)

where ¯P1,l(N, S, pi) represents the average decoding proba- bility obtained by (8) for the packet erasure rate equal topi, andQ1:l(S) denotes the sum video quality of the first l views which can be computed via expressions derived in [13]. The constraint condition means that the overall data rate should not exceed a given available bandwidth, andNc is the maxi- mum number of packets can be transmitted on the channel.

3.2. Two-layer Local Search Algorithm

Due to high computational complexity of full search, local search becomes the alternative for the problem in (10). The existing local search algorithms often transform the joint source and channel rate allocation to a single variable opti- mization problem with several coding rate combinations [15]

or keep the maximum rate for each layer fixed [16], resulting in the missing of too many possible combinations of source and channel rate. In this paper, we propose a two-layer local search algorithm, in which the source rate and channel rate can be adjusted independently.

The two-layer local search algorithm starts from an initial source distribution andSLis equal to the maximum valueNc. For the inner-layer search, the optimization is essentially the same as channel coding with a given vector of source packets.

Set an initial distribution of the encoded packets in terms of the fraction of source packets for each layer and the maximal number of packets to be transmitted. Every step the neigh- bors of multiple feasible distribution vectors are examined by (10). If any neighbor has a higher average video quality, it is included in the set of feasible distribution vectors. This procedure is repeated until the set of feasible distribution vec- tors is∅, then we get a possible optimal pair of source and channel rate allocation. In the outer layer, the same process is conducted to search the neighbors of the source distribution.

After these two stages, we achieve a solution for the case thatSLis equal toNc. Since lower source rate can increase the decoding probability while lead to a worse video quality, there exits a tradeoff between these two metrics. We next attempt to reduce the overall source rate and find the global optimal solution.

4. SIMULATION RESULTS

We encode the “Ballroom” sequence with 640x480 resolution by using H.264/AVC reference software. The sequence con- sists of three views. The GOP size is 8 and the frame rate is 30 fps. The source packets are classified into three layers organized along the decrease of their importance. The view encoding order is 0-2-1, where view 0 has the highest im- portance for it is intra-coded. The packet size including the network coding header is set to 1500 bytes and the field size isGF (28) [17].

Fig. 2 shows the comparison of our RLC with ladder- shaped GCM and expanding-window RLC presented in [8].



(4)

It is observed that RLC with ladder-shaped GCM has a good robustness against packet loss. When the packet erasure rate increases from 0.02 to 0.14, the average video quality only ex- periences 1.41 dB performance degradation and still reaches 33.12 dB. A better performance is always achieved compared with expanding-window RLC due to a higher decoding prob- ability of RLC with ladder-shaped GCM.

      





















3DFNHW(UDVXUH5DWH

3615 G%

5/&ZLWKODGGHUíVKDSHG*&0 ([SDQGLQJíZLQGRZ5/&

Fig. 2. Average video quality vs. packet erasure rate Suppose there are two receiver classes with the packet era- sure rate of 0.02 and 0.14. Consider two cases that the frac- tion of receiver population associated with class 1 isξ1= 0.1 andξ1 = 0.9. Fig. 3 compares the source rates of these two cases with the available bandwidth. We can find that, given the same bandwidth, higher source rate is achieved whenξ1= 0.9 for the reason that the receivers with a better channel con- ditions, they can achieve a high decoding probability with less redundant packets.

      























%DQGZLGWK NESV

6RXUFHUDWH NESV

ξ ξ 

ξ ξ 

Fig. 3. Source rate vs. bandwidth for ξ1= 0.1 and ξ1= 0.9 Fig. 4 shows the average quality of each view, in which the proposed JSCC scheme is employed or just channel cod- ing (CC) is adopted. The video quality achieved by decoding of view 0, 2 and 1 is 32.74 dB, 32.96 dB and 32.72 dB, re- spectively. The packet erasure rates is the same andξ1= 0.5.

We can see that JSCC significantly outperforms CC. When the bandwidth is 500 kbps, CC achieves a higher quality of view 0 than JSCC, but it fails to deliver the other two views as it allocates the whole rate to layer 1 for the purpose of de- coding view 0 with a high probability. Similar result can also be found when the bandwidth is 700 kps.

Fig. 4. Average video quality of each view vs. bandwidth for JSCC and CC

Finally, we compare the proposed two-layer search algo- rithm with the full search and the method that allocates an equal code rate for all the layers. The reference algorithm em- ploys local search only once to search different distributions of source packets, while the number of the encoded packets is determined by the code rate that is iterated by a small step size. The packet erasure rates and the distribution of receiver population remain unchanged. It is observed in Fig. 5 that all these three algorithms achieve higher average quality along with the increasing bandwidth. Our two-layer local search algorithm gives a better performance than the scheme that allocates equal code rate to each layer and the performance degradation is negligible compared with full search.

      

















%DQGZLGWK

3615 G%

(TXDOFRGHUDWH 3URSRVHGDOJRULWKP )XOOVHDUFK

Fig. 5. PSNR comparison of three different algorithms

5. CONCLUSION

This paper proposed a framework of broadcasting robust MVV over wireless erasure channels. Our objective is to maximized the aggregate video quality of all the receivers by a joint source and channel coding rate optimization. We adopted RLC with ladder-shaped GCM for unequal erasure protection and presented an accurate expression of its decod- ing probability. Also, a two-layer local search algorithm was developed to find a solution in polynomial time. Simulation results showed that the proposed framework can maximize the overall video quality. Future work would be expanded to more complicated wireless multicast system and take the view popularity among the receivers into consideration.



(5)

6. REFERENCES

[1] A. Vetro, T. Wiegand, and G. J. Sullivan, “Overview of the stereo and multiview video coding extensions of the H.264/MPEG-4 AVC standard,” Proc. IEEE, vol. 99, no.

4, pp. 626-642, Apr. 2011.

[2] H. Zhihai and X. Hongkai, “Transmission distortion analysis for real-time video encoding and streaming over wireless networks,” IEEE Trans. Circuits Syst.

Video Technol., vol. 16, pp. 1051-1062, 2006.

[3] D. Lun, M. Medard, R. Koetter, and M. Effros, “On cod- ing for reliable communication over packet networks,”

Physical Commun., vol. 1, no. 1, pp. 22-30, 2008.

[4] H. Wang, S. Xiao, C. Kuo, “Random linear network coding with ladder-shaped global coding matrix for ro- bust video transmission,” Journal of Visual Communica- tion and Image Representation, vol. 22, no. 3, pp. 203- 212, 2011.

[5] X. Zhang, C. Jing, F. Tang, et al., “Joint redundant and random network coding for robust video transmission over lossy networks,” Mobile Information Systems, no.

8, pp. 213-230, 2012.

[6] D. Vukobratovi´c and V. Stankovi´c, “Unequal error pro- tection random linear coding for erasure channels,” in IEEE Trans. Communications, vol. 60, pp. 1243-1252, May 2012.

[7] N. Thomos, J. Chakareski, and P. Frossard, “Priori- tized distributed video delivery with randomized net- work coding,” in IEEE Trans. Multimedia, vol. 13, pp.

776-787, Aug. 2011.

[8] J. Chakareski, V. Velisavljevic, and V. Stankovic,

“View-Popularity-Driven Joint Source and Channel Coding of View and Rate Scalable Multi-View Video,”

in IEEE Journal of Selected Topics in Signal Processing, 2015.

[9] J. Liu, Y. Liu, S. Ci, and R. Yao, “3d visual experi- ence oriented crosslayer optimized scalable texture plus depth based 3d video streaming over wireless networks,”

in Journal of Visual Communication and Image Repre- sentation, vol. 25, no.5, July 2014, pp. 1209-1221.

[10] A. Vosoughi, P. Cosman, and L. B. Milstein, “Joint source-channel coding and unequal error protection for video plus depth,” in IEEE Signal Processing Letters, vol. 22, August 2014, pp. 31-34.

[11] L. Toni, N. Thomos, and P. Frossard, “Interactive free viewpoint video streaming using prioritized network coding,” in International Workshop on Multimedia Sig- nal Processing, Pula, Italy, Oct. 2013.

[12] O. Trullols-Cruces, J. Barcelo-Ordinas, and M. Fiore,

“Exact Decoding Probability Under Random Linear Network Coding,” IEEE Commun. Lett., vol. 15, no. 1, pp. 67-69, Jan. 2011.

[13] A. Fiandrotti, J. Chakareski, and P. Frossard,

“Popularity-based rate allocation in multiview-video,”

in Proc. Conf. Vis. Commun. Image Process., pp. 1-7, Jul. 2010.

[14] A. Tassi, I. Chatzigeorgiou and D.Vukobratovi´c,

“Resource-Allocation Frameworks for Network-Coded Layered Multimedia Multicast Services,” IEEE J. Sel.

Areas Commun., vol. 33, no. 2, pp. 141-155, Feb. 2015.

[15] N. Ramzan, A. Amira, and C. Grecos, “Efficient trans- mission of multiview video over unreliable channels,”

in IEEE International Conference on Image Processing (ICIP), pp. 1885-1889, Sep. 2013.

[16] Stankovic V, Hamzaoui R and Xiong Z, “Real-time er- ror protection of embedded codes for packet erasure and fading channels,” in IEEE Trans. Circuits Syst. Video Tech., vol. 14, no. 8, pp. 1064-1072, 2004.

[17] P. A. Chou, Y. Wu, and K. Jain, “Practical network cod- ing,” in Proc. 2003 Allerton Conf. Commun., Control, Comput.



數據

Fig. 4. Average video quality of each view vs. bandwidth for JSCC and CC

參考文獻

相關文件

The aim of the competition is to offer students a platform to express creatively through writing poetry in English. It also provides schools with a channel to

Then, we tested the influence of θ for the rate of convergence of Algorithm 4.1, by using this algorithm with α = 15 and four different θ to solve a test ex- ample generated as

In Paper I, we presented a comprehensive analysis that took into account the extended source surface brightness distribution, interacting galaxy lenses, and the presence of dust

The Model-Driven Simulation (MDS) derives performance information based on the application model by analyzing the data flow, working set, cache utilization, work- load, degree

An OFDM signal offers an advantage in a channel that has a frequency selective fading response.. As we can see, when we lay an OFDM signal spectrum against the

n Logical channel number and media information (RTP payload type). n Far endpoint responds with Open Logical

• When the coherence bandwidth is low, but we need to use high data rate (high signal bandwidth). • Channel is unknown

• When the coherence bandwidth is low, but we need to use high data rate (high signal bandwidth). • Channel is unknown