多聲源方位偵測與聲源數量估測結果

第五章多聲源方位偵測與聲源數量估算演算法

5.4 多聲源方位偵測與聲源數量估測結果

實驗是在正常吵雜的環境中進行，使用 8 個麥克風構成的麥克風陣列，麥克風陣列如下圖十九所示，首先是單聲源的情況，下表三代表單聲源的情況經由

圖十九、麥克風陣列 Experimental Conditions Experimental

Results Experimental Results Source SNR

(dB) Correct

Experimental Conditions Experimental

Results Experimental Results Source SNR

(dB) Correct

出之估算方法，同時存在雙聲源的時候依舊可以找出各別的方位以及估算出聲源數量，接者觀察同時有四個聲源存在於空間中的時候估算的結果，如下表五所示，

Experimental Conditions Experimental

Results Experimental Results Source SNR

(dB) Correct Angle

第六章未來展望

本論文中我們提出了一套利用分散式的麥克風陣列在只有麥克風座標以及麥克風收集到的聲音資訊下對多聲源方位以及聲源數量估測的方法，透過這樣的方法我們可以處裡同時存在多個聲源的情況，並且估算出實際空間中存在的聲源數量，由於麥克風可以是分散式的不需要有特定的形狀，所以更能符合各種應用，也因為此聲源方位偵測的方法不需要知道聲源數量，因此更能適用實際的空間狀態。

目前使用的麥克風陣列有 8 個麥克風，根據第四章的分析可以知道，越多麥克風對於誤差的降低是有幫助的，而誤差的降低不僅是代表角度估測正確性的提升同時意味著聲源的數量可以更多，未來可以應用在人數的估計以及輔助保全攝影機監控雜亂的會場方面，另外可以應用在人機介面上，做到多人同時與機器人互動。

Reference:

[1]. P. R. Roth, “Effective measurements using digital signal analysis,” IEEE Spectrum, vol. 8, pp. 62-70, Apr. 1971.

[2]. G.. C. Carter, A. H. Nuttall, and P. G. Cable, “The smoothed coherence transform(SCOT),” Naval Underwater Systems Center, New London Lab., New London Lab., New London, CT, Tech. Memo TC-159-72, Aug. 8, 1972.

[3]. G.. C. Carter, A. H. Nuttall, and P. G. Cable, “The smoothed coherence transform,”

Proc. IEEE (Lett.), vol. 61, pp. 1497-1498, Oct. 1973.

[4]. C. H. Knapp and G. C. Carter, “The generalized correlation method for estimation of time delay,” IEEE Trans. Acoust., Speech, Signal Processing, ASSP-24(4):320-327, Aug. 1976.

[5]. B. Champagne, S. Bedard, and A. Stephenne, “Performance of Time-Delay Estimation in the Presence of Room Reverberation,” IEEE Trans. of Speech and Audio Processing, vol. 4, no. 2, March 1996.

[6]. A. Stephenne, and B. Champagne,” Cepstral prefiltering for time delay estimation in reverberant environments,” ICASSP-95., vol. 5, May 1995.

[7]. M. S. Brandstein, H. F. Silverman, “A Robust Method for Speech Signal Time-Delay Estimaiton in Reverberant Rooms,“ICASSP-97, vol. 1, April 1997.

[8]. M. Omologo and P. Svaizer, “Use of the Cross-power-Spectrum Phase in Acoustic Event Location,” IEEE Trans. of Speech and Audio Processing, vol. 5, no. 3, May 1997.

[9]. Hu, J., Su, T.M., Cheng, C.C., Liu, W.H., and Wu, T.I, "A self-calibrated speaker tracking system using both audio and video data", IEEE Conference on Control Applications, Sep, 2002.

[10]. Hu, J., Cheng, C.C., Liu, W.H., Su, T.M., "A Speaker Tracking System with Distance Estimation Using Microphone Array", IEEE/ASME International Conference on Advanced Manufacturing Technologies and Education, Aug, 2002.

[11]. M. Xinyu and C. L. Nikias, “Joint Estimaiton of Time Delay and Frequency Delay in Impulsive Noise Using Fractional Lower Order Statistics,” IEEE Trans. of Signal Processing, vol. 44, no. 11, Nov. 1996.

[12]. P. G. Georgiou and P. Tsakalides, “Alpha-Stable Modeling of Noise and Robust Time-Delay Estimation in the Presence of Impulsive Noise,” IEEE Trans. of Multimedia, vol. 1, no. 3 Sep. 1999.

[13]. C.L. Nikas and M. Shao, Signal Processing with Alpha-Stable Distributions and Applications. New York: Wiley, 1995.

[14]. M. Wax, T. J. Shan, and T. Kailath. “ Spatio-temporal spectral analysis by eigenstructure methods,” IEEE Trans. Acoust., Speech, Signal Processing, vol ASSP-32, pp. 817–827, Aug 1984.

[15]. H. Wang and M. Kaveh, “Coherent signal subspace processing for detection and estimation of angle of arrival of multiple wideband sources,” IEEE Trans. Acoust., Speech, Signal Processing, vol ASSP-33, pp. 823–831, Aug. 1985.

[16]. G. Bienvenu, “Eigensystem properties of the sampled space correlation matrix,”

Proc. IEEE, ICASSP, Boston MA, 1983, pp. 332–335.

[17]. K. M. Buckley and L. J. Griffiths, “Eigenstructure based broadband source location estimation,” in Proc. IEEE ICASSP, Tokyo, Japan, 1986, pp. 1869–1872.

[18]. M. A. Doron, A. J. Weiss, and H. Messer, “Maximum likelihood direction finding of wideband sources,” IEEE Trans. Signal Processing, vol. 41, pp. 411–414, Jan 1993.

[19]. Y.Bresler and A.Macovski, “Exact maximum likelihood parameter estimation of superimposed exponential signals in noise,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 1081-1089, Oct. 1986.

[20]. M. Agarwal and S. Prasad, “DOA estimation of wideband sources using a harmonic source model and uniform linear array,” IEEE Trans. Signal Processing, vol.

47, pp. 619-629, Mar. 1999.

[21]. H. Messer, “The potential performance gain in using spectral information in passive detection/localization of wideband sources,” IEEE Trans. Signal Processing, vol. 43, pp. 2964-2974, Dec. 1995.

[22]. M. Agrawal and S. Prasad, “Broadband doa estimation using spatial-only modeling of array data,” IEEE Trans. Signal Processing, vol. 48, pp. 663-670, Mar.

2000.

[23]. J.-H Lee, Y.-M Chen, and C.-C Yeh, “A covariance approximation method for near-field direction finding using a uniform linear array,” IEEE Trans. Signal

Processing, vol. 43, pp. 1293-1298, May 1995.

[24]. R. O. Schmidt, “Multiple Emitter Location and Signal Parameter Estimation”, IEEE Trans. Antennas and Propag., vol. AP-34, no. 3, pp.276-280,March 1986

[25]. K. Yao, R. E. Hudson, C. W. Reed, D. Chen, and F. Lorenzelli, “Blind beamforming on a randomly distributed sensor array system,” IEEE J. Select. Areas Commun., vol. 16, pp. 1555-1567, Oct. 1998

[26]. D. Arthur, S. Vassilvitskii: "k-means++ The Advantages of Careful Seeding"

2007 Symposium on Discrete Algorithms (SODA).

在文檔中多聲源方位偵測與聲源數量估算 (頁 42-47)

第五章 多聲源方位偵測與聲源數量估算演算法

5.4 多聲源方位偵測與聲源數量估測結果

第六章 未來展望

Reference:

第五章多聲源方位偵測與聲源數量估算演算法

第六章未來展望