Design strategy for three-dimensional subband filter banks

(1)

DESIGN STRATEGY FOR THREE-DIMENSIONAL SUBBAND FILTER BANKS

Po-Cheng Wu, Liang-Gee Chen, Yeong-Kang Lai, and

Tsung-Han Tsai

Lab.

332, Department

of

Electrical Engineering

National Taiwan University

Taipei, Taiwan,

R.O.C.

E-mail:

pcwu8video.ee.ntu.edu.tw

ABSTRACT

Since three-dimensional (3-D) subband coding has been introduced, most researches on 3-D subband coding perform temporal filtering first. In this paper, how- ever, we investigate the best permutation strategy for temporal, vertical, and horizontal filtering to minimize the requirement of delay elements and find that the results are opposite to our expectation.

1. INTRODUCTION

Recently, there has been rapid progress in the area of multirate digital signal processing. Applications of multirate systems include subband coding of video and audio signals, fast transforms using digital filter banks, wavelet analysis, and many others [1]-[3]. One of the most important applications of multirate systems is subband coding (SBC). Since it was introduced by Cro- chiere et al. [4] in 1976, subband coding has been an effective coding approach for video and audio applications [5]-[7]. Subband coding employs a filter bank

for splitting the input signal, so the filter banks are es- sential t o subband coding. Because filter banks usually deal with a large number of data, high speed computing hardware is indispensable for subband coding systems. Subband coding was first extended to three dimensional (temporal, vertical, and horizontal) filtering, i.e.,

3-D

subband coding, by Karlsson and Vetterli 281. S- ince then, most researches which deal with 3-D subband coding perform temporal filtering first, e.g., [8]- [13]. Since 3-D subband coding requires a large number

of delay elements to store the intermediate data, delay elements dominate the hardware cost in this coding scheme. Therefore, the most important task remains how t o minimize the requirement of delay elements. In this paper, we investigate the best permutation strategy for temporal, vertical, and horizontal filtering in different conditions in order t o minimize the required number of delay elements.

0-7803-3258-X/96/$5.00 0 1996 IEEE 605

2. DESIGN STRATEGY

Let us assume the video format is M lines x N pixels x P frames/sec, and the numbers of filter taps for tem- poral, vertical, and horizontal filtering are x + 1, y

+

1, and z

+

1, respectively. Thus, the required numbers

of delay elements for temporal, vertical, and horizontal filtering are MNx, Ny, and z, respectively. In sub- band coding, we usually employ the FIR direct form filter. The advantage of employing the FIR direct for- m is that the lowpass filter and the highpass filter can share the same delay elements.

2.1. E i g h t - N l Bands

We will first consider 3-D subband coding with eight- full bands as shown in Fig. 1. From Fig. 1, we find that by employing the FIR direct form filters, stage

1 requires one set of dela,y elements; stage 2 requires two sets of delay elements; stage 3 requires four sets of delay elements. The required numbers of delay elements for different permutations of temporal, vertical, and horizontal filtering are listed in Table 1. Note that the filtering results are identical even though the filtering permutations are different. From Table l , we find that for eight-full-band splitting, the cascade s-

trategy, which requires the minimum number of delay elements, is horizontal, vertical, followed by temporal filtering, i.e., (H, V, T).

2.2. Four-Reduced Bands

In order to achieve a high compression ratio in 3-D subband coding, we often discard the four higher bands as

shown in Fig. 2. Thus, from Fig. 2, we find that stage 1 requires one set of delay elements; stage 2 requires two sets of delay elements; stage 3 requires three sets

of delay elements. The reiquired numbers of delay elements for different permutations are listed in Table 2. From Table 2, we find that, for four-reduced-band splitting, the cascade strategy which minimizes the delay

(2)

I

S

Figure 1: 3-D subband coding with eight-full bands.

I

Permutations

IJ

Stage 1 ( X I )

I

Stage 2 ( x 2 )

I

Stage 3 ( x 4 )

I

Total

I

Table 1: Comparison of the required delay elements for different filtering permutations in 3-D subband coding with eight-full bands.

elements is horizontal, vertical, followed by temporal filtering, i.e., (H, V, T).

2.3. Finite Wordlength

E

Finally, if we further consider the finite wordlength effect t o enhance the accuracy of subband coding, we assume that the wordlengths of delay elements in stage 1, stage 2, and stage 3 are a, b, and c bits, respectively. The relation between a, b, and c is usually a

5

b

5

c. Now, we still look at 3-D subband coding with four-reduced bands. The required numbers of delay elements for different permutations are listed in Ta-

ble 3. From Table 3, we are unable to distinguish which permutation strategy is the best choice. Therefore, we make the following assumptions: the video format is M = 512 lines and N = 1024 pixels; the temporal filter has x

+

1 = 2 taps (in order to minimize the delay time); the vertical filter has y

+

l = 8 taps; the horizontal filter has 2;

+

1 = 16 taps; the wordlengths of delay elements in stage 1, stage 2, and stage 3 are

a = 8 , b = 9, and c = 10 bits, respectively. After care- ful calculation, we find that if we consider the finite wordlength effect in the above conditions, the cascade strategy which minimizes the delay elements is vertical, horizontal, followed by temporal filtering, i.e., (V, H,

TI.

3. CONCLUSI

From above discussion, we find that the results are opposite to our expectation. Performing the temporal filtering first is not the best strategy. On the contrary, placing the temporal filter at final stage will save the delay elements. We also find that the best permutation strategy for temporal, vertical, and horizontal filtering depends on different conditions. Finally, for the syn-

thesis filter banks or a higher number of filtering stages in 3-D subband coding, by employing the same scheme, we can also derive the best permutation strategy which minimizes the requirement of delay elements.

(3)

Input

Permutations T l V l H

1

Stage 1 (

x

1) Stage 2 ( ~ 2 ) Stage 3 ( ~ 3 ) Total

M N x NY z

I

H

L

T l H l V

I

M N x z

\

/

\

/

3 bands

1 band

N u l 2

Temp.

L

Temp.

H

I I V T H V H T H T V H V T

Figure 2: 3-D subband coding with four-reduced bands;.

I I, NY M N x / 2 z

NY z

2 NY/2

z

M N x / 2

Table 2: Comparison of the required delay elements for different filtering permutations in 3-D subband coding with four-reduced bands.

4. REFERENCES

R. E. Crochiere and L. R. Rabiner, Multirate Digi- tal Signal Processing. Englewood Cliffs, NJ: Pren-

tice Hall, 1983.

P. P. Vaidyanathan, Multirate Systems and Filter

Banks. Englewood Cliffs, NJ: Prentice Hall, 1993.

N. J . Fliege, Multirate Digital Signal Processing. New York: Wiley, 1994.

R. E. Crochiere, S. A. Webber, and J. L. Flanagan, “Digital coding of speech in subbands,” Bell Syst.

Tech. J., vol. 55, pp. 1069-1085, Oct. 1976.

J. W. Woods and S. D. O’Neil, “Subband coding of images,” IEEE Trans. Acoust., Speech, Sig-

nal Processing, vol. ASSP-34, pp. 1278-1288, Oct.

1986.

H.

Gharavi, “Subband coding algorithms for video applications: videophone to HDTV-conferencing,”

IEEE Trans. Circuits Syst. Video Technol., vol. 1,

no. 2, pp. 174-183, June 1991.

[7] E.

B.

Richardson and N. S . Jayant, “Subband coding with adaptive prediction for 56 kbits/s audio,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 6!31-696, Aug. 1986.

[8] G. Karlsson and

M.

Vetterli, “Three dimensional sub-band coding of video,” in Proc. IEEE Int.

Conf. Acoust., Speech, Signal Processing, 1988, p-

p. 1100-1103.

[9] J. Hartung, “Architecture for real-time implemen- tation of three-dimlensional subband video coding,” in Proc. IEEE Int. Symp. Circuits Syst., vol.

3, 1992, pp. 225-228.

[lo] K. N. Ngan and W. L. Chooi, “Very low bit rate video coding using ;ID subband approach,” IEEE

Trans. Circuits Syst,, Video Technol., vol. 4, no. 3,

pp. 309-316, June 1!994.

(4)

Permutations

11

Stage 1 ( X I )

I

Stage 2 ( x 2 )

1

Stage 3 ( x 3 )

I

Total

I

Table 3: Comparison of the required delay elements for different filtering permutations considering the finite wordlength effect in 3-D subband coding with four-reduced bands.

[ll] C. I. Podilchuk, N. S. Jayant, and N. Farvard-

m, “Three-dimensional subband coding of video,”

IEEE Trans. Image Processing, vol. 4, no. 2, pp.

125-139, Feb. 1995.

[12] F. Fan, S. Simon, I. Bruyland, W. Zhu, B.

D.

Canne, and M. V. Bladel, “A method for hier- archical subband HDTV splitting,” IEEE Trans.

Circuits Syst. Video Technol., vol. 5, no. 3, pp.

225-230, June 1995.

[13] C. H . Chou and C. W. Chen, “A perceptually op- timized 3-D subband codec for video communica tion over wireless channels,” IEEE Trans. Circuits

Syst. Video Technol., vol. 6, no. 2, pp. 143-156,

Apr. 1996.