A Wavelet VQ Image Compression System
全文
(2) LL band. Source Image. 3 Stage DWT. Index. SQ. HL band x 3. HL band 21-D vector. LH band x 3. LH band 21-D vector. Vector Formation. HH band x 3. Encoded LL band. SOC. HH band 21-D vector. Encoded HL band. 21-D Crossband VQ. Encoded LH band. Arithmetic Coding. Bit stream. Encoded HH band. (a) Encoder Encoded LL band. Bit stream. Inverse Arithmetic Coding. Encoded HL band Encoded LH band Encoded HH band. LL band. Inverse SOC. HL band x 3. Inverse 21-D Crossband VQ. Reconstructed 21-D vector. Coefficients Reconstruction. LH band x 3. Reconstructed Image. Inverse 3 Stage DWT. HH band x 3. (b) Decoder Fig. 1. The block diagram of the proposed compression system. technique. In this work, we modify SOC for lossy coding of the quantized lowest-band coefficients. The quantization is performed with 8-bit uniform quantizer, thus the quantized coefficient index ranges from 0 to 255. The search sequence of SOC is shown in Fig. 2. First, we assume that the block marked with X is the current coefficient index to be encoded. The sequence of search is in a clockwise manner. The first coefficient index to be searched is marked with A, then B, C, D, E, and so on. The coefficient index that has not been encoded (i.e. coefficient index with dashed-line in Fig. 2) is skipped in the search process. The searching process will continue until finding same coefficient index or the number of "miss-matched coefficient index" is greater than a maximum number m. If there is no index matched, then select the index with the minimum distortion from the m different indexes. If the minimum distortion is less than a predefined threshold Tsoc, send flag bit “1” followed by SOC index j to decoder; otherwise, send flag bit “0” followed by coefficient index Pi to the decoder. The SOC output symbols of the lowest band are encoded with arithmetic coding to further improve the compression efficiency. The SOC data consist of three types: (a) binary flag bits, (b) SOC index and (c) coefficient index. The binary flag is encoded by context-based arithmetic. K. L. E. F. G. 179. 179. 179. 179. 177. J. D. A. B. H. 179. 179. 179. 183. 182. I. C. X. 179. 177. 182. 179. 183. 182. 179. 179. 183. 179. 179. 179. 179. 179. 179. Fig. 2. Search sequence of SOC SOC. 21-D VQ of HL band. 21-D VQ of LH band. 21-D VQ of HH band. Fig. 3. Same orientation crossband vector encoding (CAE) [11]. SOC index is encoded by adaptive arithmetic coding [12] with probability table containing m symbols. Coefficient index is encoded by adaptive arithmetic coding with 256 symbols..
(3) 2.3. Crossband Vectors Quantization. Average performance of different LL band coding scheme with filter size = 4 x 4 30. ∑C j =5. 2 i, j. (1). 27. 0.24. The flag bit “0” is sent to the decoder followed by index k. It is noted that the energy measure defined in Eq. (1) is not uniform for all coefficients. Obviously, the coefficients in lower bands contribute more than those in higher bands. The definition is to consider the fact that the coefficients of lower bands are more significant than those of higher bands. We refer to the definition as weighted energy measure. The most popular measure of distortion between two vectors is the squared Euclidean distance. In this paper, a weighted distortion measure for two vectors is proposed, which is defined as j =4. 1 ( C i , j − Vk , j ) 2 ∑ 4 j =1. 1 j = 20 ∑ (C i , j − Vk , j ) 2 16 j =5. 0.25. 0.26. 0.27. 0.28. 0.29. 0.3. 0.31. 0.32. 0.33. 0.34. bpp LL band coded by SOC. LL band coded by JPEG. Fig. 4. Experimental results of lowest band coded by SOC and JPEG Comparision of different distortion and energy measure with m = 32, Tsoc = 0 and filter size 4 x 4 30 29.5 29 28.5 28. If Ei less than a predefined energy threshold Te, send flag bit “1” to decoder. Otherwise Ci is mapped into an index k, which point to the closest (least distortion) codeword V k in the codebook.. d (Ci ,Vk ) = (Ci , 0 − Vk , 0 ) 2 +. 28. 26.5. PSNR. 1 1 Ei = C + ∑ Ci2, j + 4 j =1 16 2 i ,0. j = 20. 28.5. 27.5. Ci ∈ R 21 is computed by. j=4. +. 29. 27.5 0.2. 0.22. 0.24. 0.26. 0.28. 0.3. 0.32. 0.34. 0.36. 0.38. 0.4. bpp Weighted distortion and weight energy measure Weighted distortion and uniform energy measure. MSE distortion and weight energy measure MSE distortion and uniform energy measure. Fig. 5. Experimental results of general distortion and energy measure, weighted distortion and energy measure Experimental result of proposed method and JPEG 30 29.8 29.6 29.4 PSNR. each input vector. 29.5. PSNR. Vectors that are put into the VQ can be formed by coefficients from different subbands. In this work, a crossband vector is formed from Ci and its descendants, as shown in Fig. 3. We refer to this as a “same orientation” crossband vector. There are three types of 21-D vectors that are grouped together from three difficult orientations: HL, LH and HH bands, as shown in Fig. 3. The quantization process is described below. In encoding, wavelet coefficients to be encoded are first divided into a set of non-overlapping vectors. Then the energy, Ei, of. 29.2 29 28.8 28.6 28.4 28.2 0.24. 0.25. 0.26. 0.27. 0.28. 0.29. 0.3. 0.31. 0.32. bpp Proposed method with filter size 4 x 4. JPEG. Fig. 6. Experimental results of proposed method and JPEG. 3. Experimental Results. (2). The distortion measure gives larger weights to lower-frequency bands, whereas smaller weights to higher-frequency bands, which is similar to the definition in Eq. (1). The novel distortion measure procedure improves the rate-distortion performance significantly, as compared the squared Euclidean distance. The crossband VQ outputs consist of (a) binary flag bits and (b) 21-D VQ index. The binary flag is encoded by context-based arithmetic encoding (CAE). VQ Index is encoded by adaptive arithmetic coding with probability table containing 256 symbols.. In this work, the bit rate is used to measure the co mpression efficiency; PSNR (peak signal-to-noise ratio) and reconstructed pictures are employed to evaluate the objective and subjective quality, respectively. We used a three-stage DWT with Daubechies filter. The filter size of 4 x 4 is chosen because it gives better subjective quality. The codebook with size N = 256 is obtained by LBG algorithm with random initialization [13]. The training set consists of wavelet coefficients of 20 monochrome images. The vector dimension is 21, and the weighted distortion measure, defined in Eq. (2), is used to generate the codebook. The test images include.
(4) Fig. 7. Reconstructed image of JPEG, bpp = 0.25, PSNR = 29.15dB. Fig. 8. Reconstructed image of the proposed method, bpp = 0.23, PSNR = 29.4dB Pepper, Lake (inside training) and Lena (outside training). The PSNR and bit rate listed in the following are the average of the three test images. To demonstrate the efficiency of the modified SOC, we adopted MSOC and JPEG to code the lowest band, and higher bands are coded with the same crossband VQ. The result in Fig. 4 indicates that the modified SOC performs better than JPEG. Fig. 5 shows the objective qualities of different distortion measure and energy measure. It is seen that the proposed weighted distortion measure and weighted energy measure achieves about 1 dB gain over the general distortion measure. Fig. 6 co mp ares the rate-d isto rtio n performances with the proposed method and JPEG. It is seen that the performance of the proposed method is better than JPEG. The comparison of. Fig. 9. Reconstructed image of Pepper with JPEG, bpp = 0.25, PSNR = 30.13dB. Fig. 10. Reconstructed image of Pepper with proposed method, bpp = 0.24, PSNR = 31.06dB. subjective quality is demonstrated in Fig. 7 and Fig. 8. The results indicate that the proposed method yields less blocky effect at the lower bit rate. Fig. 9-10 show the reconstructed Pepper image with the proposed method and JPEG, respectively. It is seen again that the proposed method provides better performance than the conventional JPEG.. 4. Conclusions In this paper, we have presented an efficient wavelet VQ technique for the compression of images. It employs several novel schemes including a modified SOC for coding the lowest band, a crossband VQ with weighted energy measure and weighted distortion measure for the quantization of higher bands, and several.
(5) arithmetic coders for further coding the SOC outputs and VQ indexes. The results indicate that the proposed technique achieves better rate-distortion performance and visual quality than JPEG.. References [1]. [2]. [3]. [4]. [5]. [6]. A. S. Lewis and G. Knowles, "Image compression using the 2-D wavelet transform," IEEE Trans. on Image Processing, vol. 1, no. 2, pp. 244-250, Apr. 1992. H. Gharavi and A. Tabatabai, "Subband coding of monochrome and color image," IEEE Trans. on Circuits and Syst., vol. 35, no. 2, pp. 207-214, Feb. 1988. N. Mohsenian and N. Nasrabadi, “Edge-based subband VQ techniques for images and video,” IEEE Trans. on Circuit and Systems for Video Technolo., vol 4, no. 1, pp. 53-67, Feb. 1994. P. C. Cosman, R. M. Gary and M. Vetterli, "Vector quantization of image subbands: a survey," IEEE Trans. on Image Processing, vol. 5, no. 2, pp. 202-225, Feb. 1996. J. M. Shapiro, "Embedded image coding using zerotrees of wavelets coefficients," IEEE Trans. on Signal Processing, vol. 41, no. 12, pp. 3445-3462, Dec. 1993. A. Said and W. A. Pearlman, "A new, fast, and efficient image codec based on set partitioning in hierarchical trees," IEEE Trans. on Circuit and Systems for Video Technolo., vol. 6, no. 3, pp. 243-250, June 1996.. [7]. Y. Huh, J. J. Hwang, and K. R. Rao, "Classified wavelet transform coding of images using two-channel conjugate vector quantization," in Proc. ICIP-94, Austin, TX, vol. 3, pp. 363-367, Nov. 1994. [8] R. M. Gray and A. Gersho, Vector Quantization and Signal Compression, Kluwer Academic Publishers, 1992. [9] Y. Huh, J. J. Hwang, and K. R. Rao, "Block wavelet transform coding of images using classified vector quantization," IEEE Trans. on Circuit and Systems for Video Technolo., vol. 5, no. 1, pp. 63-67, Feb. 1995. [10] C. H. Hsieh and J. C. Tsai, "Lossless compression of VQ index with search-order coding," IEEE Trans. on Image processing, vol. 5, no. 11, pp. 1579-1582, Nov. 1996. [11] N. Brady, F. Bossen and N. Murphy, "Context-based arithmetic encoding of 2D shape sequences," IEEE Image Processing, 1997. Proceedings International Conference, vol. 1 , 1997 , pp. 29-32. [12] I. H. Witten, R. M. Neal, and I. G. Cleary, “Arithmetic coding for data compression,” Commun. ACM, vol. 30, pp. 520-540, June 1987. [13] Y. Linde, A. Buzo, and R. M. Gray, "An algorithm for vector quantizer design," IEEE Trans. Commun., vol. COM-28, pp. 84-95, Jan. 1980..
(6)
數據
相關文件
` Sustainable tourism is tourism attempting to make a low impact on the environment and local culture, while helping to generate future employment for local people.. The
In this paper, a novel subspace projection technique, called Basis-emphasized Non-negative Matrix Factorization with Wavelet Transform (wBNMF), is proposed to represent
在雲中街文創聚落中營運中的「凹凸 咖啡館」是利用當時遺留下的建築群
3.16 Career-oriented studies provide courses alongside other school subjects and learning experiences in the senior secondary curriculum. They have been included in the
This paper presents (i) a review of item selection algorithms from Robbins–Monro to Fred Lord; (ii) the establishment of a large sample foundation for Fred Lord’s maximum
Huang, A nonmonotone smoothing-type algorithm for solv- ing a system of equalities and inequalities, Journal of Computational and Applied Mathematics, vol. Hao, A new
The Hilbert space of an orbifold field theory [6] is decomposed into twisted sectors H g , that are labelled by the conjugacy classes [g] of the orbifold group, in our case
In this study, we compute the band structures for three types of photonic structures. The first one is a modified simple cubic lattice consisting of dielectric spheres on the