影像處理及圖形模式的幾個基本問題之演算法設計及實作(I)

(1)

※※※※※※※※※※※※※※※※※※※※※※※※

※ ※

※（

影像處理及圖形模式的五個基本問題之演算法設計及實作

）※

※ ※

※※※※※※※※※※※※※※※※※※※※※※※※

計畫類別：□個別型計畫 □整合型計畫計畫編號：NSC 89－2218－E－011－017

執行期間： 89 年 8 月 1 日至 90 年 7 月 31 日計畫主持人：鍾國亮教授

共同主持人：鍾國亮教授

本成果報告包括以下應繳交之附件：

□赴國外出差或研習心得報告一份

□赴大陸地區出差或研習心得報告一份

□出席國際學術會議心得報告及發表之論文各一份

□國際合作研究計畫國外研究報告書一份

執行單位：國立台灣科技大學資管系暨資工研究所

中華民國 90 年 7 月 31 日

ü

(2)

行政院國家科學委員會專題研究計畫成果報告

影像處理及圖形模式的五個基本問題之演算法設計及實作

The Algorithms Design and Implementations for Solving Five Problems in Image Processing and Graphical Models

計畫編號：NSC 89-2218-E-011-017 執行期限：89 年 8 月 1 日至 90 年 7 月 31 日

主持人：鍾國亮教授國立台灣科技大學資管系暨資工研究所

中文摘要

由於人類對資訊需求的擴大和網路技術及應用的成長，影像處理及圖形模式這一領域也隨著不斷的發展。雖然目前的科技在這類議題上有一些進展，且處理圖像的軟硬體也有大幅度的進步，但在新的要求，乃至於新的問題上，我們對影像處理及圖形模式上的一些問題 (如偵測線、找出對稱軸、找出圖點之中間點及直接在壓縮圖像上計算等)，仍希望能處理得更快更好，或者能擴大以前的成果至新的問題上。所以本計劃打算就相關於影像處理及圖形模式上一些問題提出更快的解決方法或擴大以前的結果至新的應用領域上。本研究小組在今年度計畫中完成下列主要研究成果。

(1) 哈克轉換的成果[1][2][3]：架構在一個 affine 轉換，我們計劃把用斜截參數哈克測線的記憶空間轉成會被全部用到的空間。經過我們的 affine

轉換後，我們將先提出一個修正的哈克轉換。然後應用遞增方式來改進第一個演算法而提出只需少量乘法的第二個哈克轉換演算法。最後將改進第二個演算法提出第三個演算法，使其主要只需整數運算而已。測線結果即將刊登於 Real-Time Imaging 期刊、測圓結果即將刊登於 Computer Vision and Image Understanding 期刊、測橢圓結果目前正在 IEEE Trans. Image Processing 期刊複審中。

(2) 求加權下圖像一個點集合的中間點(Median)[4]：依曼哈頓距離度量，

首先提出一個在平面下點集合 S 的中間點，其中Ｓ中各點有其自已的加權值。這個演算法要花 O(|R(S)|)的時間。其中 |R(S)| 是指圍住 S 的最小長方形區域之面積。接著應用稀疏距陣表示方式的技巧，用 O(|S|)的時間來找其中間點。此結果已刊登於 Information Processing Letters 期刊。

(3) 在四分樹 (quadtree) 及可明暗 (shading) 為基礎的壓縮影像上，設

(3)

的壓縮影像上，當 N×N 影像四分樹分壓縮成 K×K 時，應用計算上的一些週期性質，提出一個 O(K²logK＋N²)執行時間的快速傅利葉轉換(FFT)。這個快速傅利葉轉換除非是 K=N，否則將比傳統在四分樹壓縮影像上的 FFT 還快。

在此我們所提的方法，除了具有四分樹的壓縮效果外，還可降低以往四分樹方法時對影像所造成的區塊狀的顯示情況(block effect)。此結果已刊登於 Information Processing Letters 期刊。

(4) 在子計畫(3)的壓縮影像中，設計快速的離散餘弦轉換 (DCT) 演算法 [6] ：架構在四分樹格式及使用 Gouraud 明暗法的壓縮影像上，當 N×

N 影像四分樹壓縮成 K×K 時，應用計算上的一些週期性質，在這種壓縮影像上提出一個二維的離散餘弦轉換 (DCT)演算法。這個 DCT 只要 O(K2logK

＋N2) 執行時間。除非是 K=N，不然我們的演算法將比傳統 DCT 應用在 N×N 影像所要執行時間還快。此結果即將刊登於 Information Processing Letters 期刊。

英文摘要

Owing to the great demand for information and the growth at the techniques and applications of network, the field of imaging processing and

technology in this field has some developments in graphics and image processing, we still need to handle some old problems as good as possible or to solve new problems stimulated by the previous results. In this project, we plan to propose better solutions for some existing basic problems such as Hough transform, symmetry detection, finding median, orthogonal transforms on compressed image, and so on. Here, we want to handle the following five research issues which are important in imaging processing and graphical models. In this project, we do the research and have the results as shown below:

(1) Hough transform: Based on affine transformation, we plan to transfer the memory space in the slope--intercept--based Hough transform, which is used for detecting lines, into a fully utilized memory space. Following the affine transformation, we will first present a modified Hough transform. Then, using the incremental approach, we will improve the first algorithm such that only few multiplications are involved in the second algorithm.

Finally, applying a simple sign testing technique, we will improve the second algorithm such that such that the third algorithm mainly concerns integer operations. The main results are listed below:

(4)

1.”A New Randomized Algorithm for Detecting Lines”, will be published in Real-Time Imaging.

2.”An Efficient Randomized Algorithm for Detecting Circles”, will be published in Computer Vision and Image Understanding.

3.”A Novel and Efficient Randomized Algorithm for Detecting Ellipses”, has been submitted to IEEE Trans. Image Processing.

(2) Finding medians of weighted discrete points: According to the Manhattan metric, we will first present an algorithm for finding the medians of the given discrete points set S on Z², where each point is associated with any weight. The algorithm takes O(|R(S)|) time, where

|R(S)| denotes the area of the smallest rectangle containing S. Then, we will use the sparse matrix representation technique to find the medians of S in O(|S|) time. The main result has been published in Information Processing Letters.

(3) Fast Fourier transform on images in quadtree and shading format: Based on the quadtree format and the Gouraud shading approach, suppose one image with size N×N has been partitioned into K × K squares.

Employing some periodic properties, we plan to present a fast Fourier transform (FFT) algorithm which runs in O(K²logK ＋ N²) time. Our FFT

algorithm will be faster than the conventional FFT algorithm unless K=N. Besides the compression effect, our approach can alleviate the block effect. The main result has been published in Information Processing Letters.

(4) Fast Discrete Cosine on images in quadtree and shading format: Based on the quadtree and the Gouraud shading approach, suppose one image with size N ×N has been partitioned into K ×K squares. Employing some derived periodic properties, we will present a novel algorithm for computing the 2--D Discrete Cosine transform (DCT) on this compressed image domain. Our algorithm takes O(K²logK ＋ N²) time and is faster than the conventional DCT algorithm on the original N ×N image unless K=N. The main result will be published in Information Processing Letters.

計畫緣由、結果與討論

影像處理這一領域一直不斷的發展，

且處理圖像的軟硬體也有大幅度的進步，但在新的要求，乃至於新的問題上，我們對影像處理及圖形模式上的一些問題 (如偵測線、找出對稱軸、找出圖點之中間點及直接在壓縮圖像上計算等)，仍希望能處理得更快更好。

本計畫中的五個子題的成果將對於哈克轉換、對稱軸偵測、求點集合之中

(5)

本計畫之完成可對國內在影像及圖形模式的問題處理上的學術研究之提升乃至國家科技之發展，皆有深遠的影響。本研究小組吾人針對此基礎且實用的結果進行深入研究並完成下列研究成果。

(1) 哈克轉換的成果：在哈克轉換投票過程中，應用之前的投票計算值到之後的投票計算上及改變計算上較慢的運算成計算上較快的運算，以改進為了求過某點之所有參數值時所花費的大量計算時間，進而達到可加快哈克轉換的測線工作。以往的哈克轉換雖然有效，但其所需的計算空間和時間也很大。其中在空間的使用上，可發現在執行時所規劃給定的參數空間陣列中，很多是不必要而浪費掉的。

因此利用隨機取點的方式來投票，加快哈克轉換的計算時間，以使實務上更能使用哈克轉換來快速有效的偵測出特定的圖形。

(2) 求加權下圖像一個點集合的中間點 (Median) ：在曼哈頓度量 (Manhattan metric)平面上兩點之距離下，擴大 Lungo 求 S 點集合中間點方法到有加權的情況上。即 S 中各點可依其重要性給定不同的加權值，然後在這加權值下求 S 點集合中間點。

同時把求這加權式中間點的所需的計算時間，改進到線性 O(|S|) 時間，即和 S 點集合的個數有關，而不是 O(|R(S)|) 的和其所散佈在平面上長方形區域的範圍大小有關。

計出快速傅利葉轉換(FFT)：利用計算上的週期特性，推導並設計出一個各區塊有重疊的四分樹及區塊內有明暗處理的壓縮圖像上之快速 FFT 演算法。當 N×N 影像壓縮成 K×K 時，其計算時間為 O(K²logK+N²)，除非 K=N 否則比以前要 O(N²logN) 的時間更快，

且效果也比以前的好。

(4) 在子計劃(3)的壓縮影像中，設計快速的離散餘弦轉換(DCT)演算法：

利用計算上的週期特性及三角幾何的化約方法，推導並設計出一個各區塊有重疊的四分樹及區塊有明暗處理的壓縮圖像上之快速 DCT 演算法。當 N

×N 影像壓縮成 K×K 時，其計算時間為 O(K²logK+N²)，除非 K=N 否則比以前在四分樹壓縮影像上之演算法所需之 O(N²logN) 的時間更快，且效果也比以前的好。

計畫自評

本計畫的成果與報告內容和原計畫的申請項目大致相同，計畫之分項子題達成的狀況亦很理想。各個計畫子題的成果均已完成，其中各個項目大多已登刊於國外著名期刊，餘者亦於國外著名期刊接受覆審中。本計畫所提及之各項成果預期對於影像處理、圖形模式及影像壓縮等研究領域的完整性有很大的貢獻；除了在學術界提供一些對影像及圖形模式之問題處理上的新解決方法外，在業界也提供相關問題上可更快更省空間的方式。所以

(6)

本計算之完成可對國內在影像及圖形模式的問題處理上的學術研究之提升乃至國家科技之發展，皆有深遠的影響。經由本計畫的研究與實作，讓參與的人員深入瞭解影像處理的重要性，更促使參與人員體會到影像處理及影像壓縮等研究主題在學術及工業界的相互結合，如此結合理論與實際應用，提昇參與人員今後之研發能力進而提昇國家競爭力。

參考文獻

[1] T. C. Chen and K. L. Chung, “A New Randomized Algorithm for Detecting Lines,”to appear in Real-Time Imaging.

[2] T. C. Chen and K. L. Chung, ”An Efficient Randomized Algorithm for Detecting Circles,”to appear in Computer Vision and Image Understanding.

[3] 目前投往 IEEE Trans. Image Processing 期刊複審中.

[4] K. L. Chung, On finding medians of weighted discrete points, Information Processing Letters,74(3-4)(2000) pp.

103-106.

[5] K. L. Chung and W. M. Yan, An efficient algorithm for the Fourier transform on a compressed image in restricted quadtree and shading format, Information Processing Letters,79(1)(2001) pp. 1-5.

[6] 即將在 Information Processing

Letters 期刊上刊登。