The Study for Image/Video Compression and Error Recovery (I)


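National Science Council, Executive Yuan: Research Project Interim Progress Report

The Study for Image/Video Compression and Error Recovery (1/3)

Project type: Individual project

Project number: NSC94-2213-E-011-041-

Project period: August 1, 2005 to July 31, 2006 (ROC years 94 to 95)

Executing institution: Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology

Principal investigator: Kuo-Liang Chung (鍾國亮)

Report type: Condensed report

Availability: This project is open to public inquiry

August 30, 2006 (ROC year 95)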


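National Science Council, Executive Yuan: Research Project Results Report

The Study for Image/Video Compression and Error Recovery (1/3)

Project number: NSC 94-2213-E-011-041-

Project period: August 1, 2005 to July 31, 2006 (ROC years 94 to 95)

Principal investigator: Professor Kuo-Liang Chung (鍾國亮), Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology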

Chinese Abstract

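Image and video compression and error recovery are an important part of multimedia transmission technology; the goal is to transmit large volumes of image and video data over limited network bandwidth. With the rapid development of multimedia audio-visual communication in recent years, interest in image and video compression has grown markedly. These topics are also among the core research problems of our laboratory, and the principal investigator and research team have accumulated considerable experience and results in these areas over the past several years. In this year's project, the team completed the following main results.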

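(1) Texture- and multiple-template-based lossless compression of halftone images [1]: We propose a block- and texture-based training mechanism that builds, from a variety of texture features, the best templates required by context-based arithmetic coding. When compressing a halftone image, we first partition the input image into sub-blocks and run texture detection on each sub-block to decide which template is best for it. Experimental results show that the proposed texture- and multiple-template-based algorithm effectively improves compression performance: compared with the JBIG standard [2], the BACIC algorithm [3], and the PACIC algorithm [4], the compression improvement ratios are 24%, 19.4%, and 17.6%, respectively. In addition, our algorithm reaches a compression ratio on par with the FACIC algorithm [5] while needing only 1% of its execution time to compress an image. (This work is currently under second-round review at IEEE Trans. Image Processing.)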

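(2) Hybrid error concealment algorithm based on an adaptive estimation scheme [6]: Based on a new adaptive estimation scheme, we propose an efficient hybrid error concealment algorithm. Using information from the blocks neighboring a corrupted block, each corrupted block is classified into one of three types. Depending on the type, the adaptive estimation scheme applies Bézier surface estimation, first-order plane estimation, or centroid-of-major-cluster estimation to conceal the corrupted block effectively. On six test video sequences, experimental results show that the proposed hybrid algorithm improves both video quality and execution time at different error rates. (This work has been accepted for oral presentation at the 2006 CVGIP conference and is under second-round review at J. Visual Communication and Image Representation.)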

English Abstract

Image and video compression and error concealment are core issues in multimedia technology. Multimedia applications continue to grow in number and importance, and interest in image and video compression has increased significantly as a result. These two topics have a wide range of applications, and our research group has already obtained several efficient results on them. The results of this project are summarized below.

(1) Texture- and multiple-template-based algorithm for lossless compression of error-diffused images [1]: We first present a novel block- and texture-based approach that trains a set of multiple templates according to the most representative texture features. Based on the trained templates, we then present an efficient texture- and multiple-template-based (TM-based) algorithm for lossless compression of error-diffused images. In the TM-based algorithm, the input image is divided into blocks, and for each block the best template is adaptively selected from the trained set according to the texture feature of that block. Experimental results demonstrate that the compression improvement ratios of the proposed algorithm over JBIG [2], the BACIC algorithm [3], and the PACIC algorithm [4] are 24%, 19.4%, and 17.6%, respectively. In addition, the previous FACIC algorithm [5] requires 109.131 seconds of encoding time on average, whereas the proposed algorithm takes 0.995 seconds, and the average compression ratio of the TM-based algorithm is competitive with that of the FACIC algorithm.

(2) Efficient hybrid error concealment algorithm based on adaptive estimation scheme [6]: Video transmission plays an important role in multimedia communication. Because transmission errors occur, robust video transmission has become increasingly important in providing better quality of service. Based on our proposed adaptive estimation scheme, we present an efficient hybrid error concealment algorithm for robust video transmission. Using the information of the macroblocks (MBs) neighboring a corrupted MB, the corrupted MBs are classified into three types. According to the type of the corrupted MB, the adaptive estimation scheme adopts Bézier surface estimation, first-order plane estimation, or centroid-of-major-cluster estimation to conceal the corrupted MB efficiently. On six test video sequences, experimental results demonstrate that the proposed hybrid error concealment algorithm improves both video quality and execution time at different loss rates.

Project Background, Results, and Discussion

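Image and video compression and error recovery are an important part of multimedia storage and transmission technology; the goal is to transmit large volumes of image and video data over limited network bandwidth. Compression reduces the bandwidth needed to transmit images and video, while error recovery corrects errors that occur during transmission, so the two are closely related.

Because document images are usually rendered as halftones, halftone image compression is important in document image processing. Many compression algorithms have been proposed [2,3,4,5]. The methods in [2,3,4] encode faster but compress less well than the method in [5], whereas the method in [5] is so complex that it requires a large amount of encoding time. An algorithm that is both fast and achieves a high compression ratio is therefore important.

Video transmission plays an important role in multimedia communication, and because transmission errors occur, robust video transmission has become increasingly important for providing better quality of service. Recently, Zheng and Chaum [7] proposed an error concealment algorithm based on first-order plane estimation, and their experimental results show that first-order plane estimation conceals errors well. However, their algorithm does not apply different concealment strategies to different types of corrupted blocks, such as high-motion and low-motion blocks, which prevents its efficiency from improving further. To address this, the principal investigator and research team drew on the experience and results accumulated over the past several years, invested considerable effort in the following topics, and completed the results below.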

(1) Texture- and multiple-template-based lossless compression of halftone images [1]:

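Halftone image compression is an important part of document image processing. Most previously proposed compression algorithms use context-based arithmetic coding together with a coding template to classify the data, and compress it by lowering the entropy. To obtain the best coding templates, we propose a block- and texture-based training mechanism that builds the best templates required by context-based arithmetic coding. Using block-based coding, we partition the input image into sub-blocks and detect the texture characteristics of each sub-block to determine its best template. Because the algorithm assigns the best coding template to each kind of image characteristic, it effectively improves compression performance. Experimental results show that, compared with the JBIG standard [2], the BACIC algorithm [3], and the PACIC algorithm [4], the proposed texture- and multiple-template-based algorithm achieves compression improvement ratios of 24%, 19.4%, and 17.6%, respectively. In addition, our algorithm reaches a compression ratio on par with the FACIC algorithm [5] while needing only 1% of its execution time to compress an image. (This work is currently under second-round review at IEEE Trans. Image Processing.)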
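The idea of choosing a context template per block can be illustrated with a minimal Python sketch. It is not the report's method: the report picks templates through texture detection against trained templates, which is not specified here, so the sketch substitutes a brute-force comparison of empirical conditional entropy (a proxy for what an ideal context-based arithmetic coder would spend). The function names, the two-pixel templates `LEFT` and `ABOVE`, and the toy image are all assumptions made for illustration.

```python
import math
from collections import Counter

def context_entropy(block, template):
    """Empirical conditional entropy (bits/pixel) of a bi-level block.

    `template` lists causal (dy, dx) offsets; the neighbor bits at those
    offsets form the context. Pixels outside the block count as 0.
    """
    h, w = len(block), len(block[0])
    pair_counts = Counter()     # (context, pixel) -> occurrences
    context_counts = Counter()  # context -> occurrences
    for y in range(h):
        for x in range(w):
            ctx = 0
            for dy, dx in template:
                yy, xx = y + dy, x + dx
                bit = block[yy][xx] if 0 <= yy < h and 0 <= xx < w else 0
                ctx = (ctx << 1) | bit
            pair_counts[(ctx, block[y][x])] += 1
            context_counts[ctx] += 1
    bits = 0.0
    for (ctx, _pixel), n in pair_counts.items():
        bits -= n * math.log2(n / context_counts[ctx])
    return bits / (h * w)

def select_templates(image, block_size, templates):
    """Map each block's top-left corner to the index of its cheapest template."""
    choice = {}
    for by in range(0, len(image), block_size):
        for bx in range(0, len(image[0]), block_size):
            block = [row[bx:bx + block_size] for row in image[by:by + block_size]]
            costs = [context_entropy(block, t) for t in templates]
            choice[(by, bx)] = costs.index(min(costs))
    return choice

# Hypothetical two-pixel templates: pixels to the left vs. pixels above.
LEFT = [(0, -1), (0, -2)]
ABOVE = [(-1, 0), (-2, 0)]

# Toy image: the left block varies only along x, the right block only along y.
pattern = [0, 1, 1, 0, 1, 0, 0, 0, 1, 1, 1, 0, 1, 0, 0, 1]
image = [[pattern[x] for x in range(16)] + [pattern[y]] * 16 for y in range(16)]

choice = select_templates(image, 16, [LEFT, ABOVE])
print(choice)
```

In the toy image the left block is predicted best from the pixels above and the right block from the pixels to the left, so the two blocks receive different templates. A trained, texture-driven selection as described in the report would avoid evaluating every template on every block.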

(2) Hybrid error concealment algorithm based on an adaptive estimation scheme [6]:

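Because transmission errors occur, robust video transmission has become increasingly important for providing better quality of service. Recently, Zheng and Chaum [7] proposed an error concealment algorithm based on first-order plane estimation. Although their experimental results show that first-order plane estimation conceals errors well, the algorithm does not apply different concealment strategies to different block types, which prevents its efficiency from improving further. We therefore use a new adaptive estimation scheme to build an efficient hybrid error concealment algorithm that improves image quality. For each corrupted block, we analyze the motion vector information of its neighboring blocks and classify the corrupted block into one of three types. Depending on the type, the block is concealed using Bézier surface estimation, first-order plane estimation, or centroid-of-major-cluster estimation. On six test video sequences, experimental results show that the proposed hybrid algorithm improves both video quality and execution time at different error rates. (This work has been accepted for oral presentation at the 2006 CVGIP conference and is under second-round review at J. Visual Communication and Image Representation.)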
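As an illustration of one of the three estimators, the following Python sketch shows first-order plane estimation under a stated assumption: each motion vector component is modeled as a plane v = a*x + b*y + c fitted by least squares to the neighboring macroblocks, and the plane is evaluated at the position of the corrupted macroblock. The function names, the coordinate convention (neighbors at unit offsets around the origin), and the fallback to the mean for degenerate layouts are hypothetical. The three-type classification and the Bézier surface and major-cluster estimators are not specified in this report and are not modeled.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def fit_plane(samples):
    """Least-squares fit of v = a*x + b*y + c to a list of (x, y, v)."""
    n = len(samples)
    if n == 0:
        raise ValueError("need at least one neighboring sample")
    sx = sum(x for x, _, _ in samples)
    sy = sum(y for _, y, _ in samples)
    sv = sum(v for _, _, v in samples)
    sxx = sum(x * x for x, _, _ in samples)
    syy = sum(y * y for _, y, _ in samples)
    sxy = sum(x * y for x, y, _ in samples)
    sxv = sum(x * v for x, _, v in samples)
    syv = sum(y * v for _, y, v in samples)
    normal = [[sxx, sxy, sx], [sxy, syy, sy], [sx, sy, n]]
    rhs = [sxv, syv, sv]
    d = det3(normal)
    if abs(d) < 1e-12:
        # Degenerate layout (e.g. collinear neighbors): use the mean instead.
        return 0.0, 0.0, sv / n
    coeffs = []
    for col in range(3):  # Cramer's rule, one unknown per column
        replaced = [row[:] for row in normal]
        for r in range(3):
            replaced[r][col] = rhs[r]
        coeffs.append(det3(replaced) / d)
    return tuple(coeffs)

def conceal_motion_vector(neighbors, pos=(0, 0)):
    """Estimate the lost motion vector at `pos`.

    `neighbors` is a list of ((x, y), (mv_x, mv_y)) for intact macroblocks.
    """
    planes = [fit_plane([(x, y, mv[k]) for (x, y), mv in neighbors])
              for k in (0, 1)]
    return tuple(a * pos[0] + b * pos[1] + c for a, b, c in planes)

# Four neighbors (left, right, top, bottom) of a corrupted macroblock at (0, 0).
neighbors = [((-1, 0), (3, 1)), ((1, 0), (7, 1)),
             ((0, -1), (2, 1)), ((0, 1), (8, 1))]
estimate = conceal_motion_vector(neighbors)
print(estimate)
```

In a hybrid scheme such as the one described above, a routine like this would be called only for the block type to which plane estimation is assigned; the other two types would dispatch to their own estimators.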

Project Self-Evaluation

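The results and report content of this project largely match the items in the original proposal, and the sub-topics were completed satisfactorily. Results have been obtained for every sub-topic, and the corresponding papers are under review at well-known international journals. The results contribute substantially to both research and implementation in image and video compression. Beyond their academic value, in practical applications they both improve execution performance and provide better image quality. Through the research and implementation work in this project, participants gained an in-depth understanding of image and video compression and error recovery techniques and combined theory with practical application, which should strengthen their future research and development capability and, in turn, national competitiveness.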

References

[1] Y. H. Huang and K. L. Chung, “Texture- and multiple-template-based algorithm for lossless compression of error-diffused images,” IEEE Trans. Image Processing, revised.

[2] Coded representation of picture and audio information: progressive bi-level image compression, ISO/IEC Int. Std. 11544, 1993.

[3] M. D. Reavy and C. G. Boncelet, “An algorithm for compression of bilevel images,” IEEE Trans. Image Processing, vol. 10, pp. 669-676, May 2001.

[4] C. S. Lee and H. Park, “Near-lossless/lossless compression of error-diffused images using a two-pass approach,” IEEE Trans. Image Processing, vol. 12, pp. 170-175, Feb. 2003.

[5] B. Martins and S. Forchhammer, “Tree coding of bilevel image,” IEEE Trans. Image Processing, vol. 7, pp. 517-528, Apr. 1998.

[6] K. L. Chung, T. H. Huang, and P. H. Liao, “Efficient hybrid error concealment algorithm based on adaptive estimation scheme,” J. Visual Communication and Image Representation, revised.

[7] J. Zheng and L. P. Chaum, “Error concealment algorithm for H.26L using first-order plane estimation,” IEEE Trans. Multimedia, vol. 6, pp. 801-805, Dec. 2004.