第五章 多聲源方位偵測與聲源數量估算演算法
5.4 多聲源方位偵測與聲源數量估測結果
實驗是在正常吵雜的環境中進行,使用 8 個麥克風構成的麥克風陣列,麥克 風陣列如下圖十九所示,首先是單聲源的情況,下表三代表單聲源的情況經由
圖 十九、麥克風陣列 Experimental Conditions Experimental
Results Experimental Results Source SNR
(dB) Correct
Experimental Conditions Experimental
Results Experimental Results Source SNR
(dB) Correct
出之估算方法,同時存在雙聲源的時候依舊可以找出各別的方位以及估算出聲源 數量,接者觀察同時有四個聲源存在於空間中的時候估算的結果,如下表五所示,
Experimental Conditions Experimental
Results Experimental Results Source SNR
(dB) Correct Angle
第六章 未來展望
本論文中我們提出了一套利用分散式的麥克風陣列在只有麥克風座標以及 麥克風收集到的聲音資訊下對多聲源方位以及聲源數量估測的方法,透過這樣的 方法我們可以處裡同時存在多個聲源的情況,並且估算出實際空間中存在的聲源 數量,由於麥克風可以是分散式的不需要有特定的形狀,所以更能符合各種應 用,也因為此聲源方位偵測的方法不需要知道聲源數量,因此更能適用實際的空 間狀態。
目前使用的麥克風陣列有 8 個麥克風,根據第四章的分析可以知道,越多麥 克風對於誤差的降低是有幫助的,而誤差的降低不僅是代表角度估測正確性的提 升同時意味著聲源的數量可以更多,未來可以應用在人數的估計以及輔助保全攝 影機監控雜亂的會場方面,另外可以應用在人機介面上,做到多人同時與機器人 互動。
Reference:
[1]. P. R. Roth, “Effective measurements using digital signal analysis,” IEEE Spectrum, vol. 8, pp. 62-70, Apr. 1971.
[2]. G.. C. Carter, A. H. Nuttall, and P. G. Cable, “The smoothed coherence transform(SCOT),” Naval Underwater Systems Center, New London Lab., New London Lab., New London, CT, Tech. Memo TC-159-72, Aug. 8, 1972.
[3]. G.. C. Carter, A. H. Nuttall, and P. G. Cable, “The smoothed coherence transform,”
Proc. IEEE (Lett.), vol. 61, pp. 1497-1498, Oct. 1973.
[4]. C. H. Knapp and G. C. Carter, “The generalized correlation method for estimation of time delay,” IEEE Trans. Acoust., Speech, Signal Processing, ASSP-24(4):320-327, Aug. 1976.
[5]. B. Champagne, S. Bedard, and A. Stephenne, “Performance of Time-Delay Estimation in the Presence of Room Reverberation,” IEEE Trans. of Speech and Audio Processing, vol. 4, no. 2, March 1996.
[6]. A. Stephenne, and B. Champagne,” Cepstral prefiltering for time delay estimation in reverberant environments,” ICASSP-95., vol. 5, May 1995.
[7]. M. S. Brandstein, H. F. Silverman, “A Robust Method for Speech Signal Time-Delay Estimaiton in Reverberant Rooms,“ICASSP-97, vol. 1, April 1997.
[8]. M. Omologo and P. Svaizer, “Use of the Cross-power-Spectrum Phase in Acoustic Event Location,” IEEE Trans. of Speech and Audio Processing, vol. 5, no. 3, May 1997.
[9]. Hu, J., Su, T.M., Cheng, C.C., Liu, W.H., and Wu, T.I, "A self-calibrated speaker tracking system using both audio and video data", IEEE Conference on Control Applications, Sep, 2002.
[10]. Hu, J., Cheng, C.C., Liu, W.H., Su, T.M., "A Speaker Tracking System with Distance Estimation Using Microphone Array", IEEE/ASME International Conference on Advanced Manufacturing Technologies and Education, Aug, 2002.
[11]. M. Xinyu and C. L. Nikias, “Joint Estimaiton of Time Delay and Frequency Delay in Impulsive Noise Using Fractional Lower Order Statistics,” IEEE Trans. of Signal Processing, vol. 44, no. 11, Nov. 1996.
[12]. P. G. Georgiou and P. Tsakalides, “Alpha-Stable Modeling of Noise and Robust Time-Delay Estimation in the Presence of Impulsive Noise,” IEEE Trans. of Multimedia, vol. 1, no. 3 Sep. 1999.
[13]. C.L. Nikas and M. Shao, Signal Processing with Alpha-Stable Distributions and Applications. New York: Wiley, 1995.
[14]. M. Wax, T. J. Shan, and T. Kailath. “ Spatio-temporal spectral analysis by eigenstructure methods,” IEEE Trans. Acoust., Speech, Signal Processing, vol ASSP-32, pp. 817–827, Aug 1984.
[15]. H. Wang and M. Kaveh, “Coherent signal subspace processing for detection and estimation of angle of arrival of multiple wideband sources,” IEEE Trans. Acoust., Speech, Signal Processing, vol ASSP-33, pp. 823–831, Aug. 1985.
[16]. G. Bienvenu, “Eigensystem properties of the sampled space correlation matrix,”
Proc. IEEE, ICASSP, Boston MA, 1983, pp. 332–335.
[17]. K. M. Buckley and L. J. Griffiths, “Eigenstructure based broadband source location estimation,” in Proc. IEEE ICASSP, Tokyo, Japan, 1986, pp. 1869–1872.
[18]. M. A. Doron, A. J. Weiss, and H. Messer, “Maximum likelihood direction finding of wideband sources,” IEEE Trans. Signal Processing, vol. 41, pp. 411–414, Jan 1993.
[19]. Y.Bresler and A.Macovski, “Exact maximum likelihood parameter estimation of superimposed exponential signals in noise,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 1081-1089, Oct. 1986.
[20]. M. Agarwal and S. Prasad, “DOA estimation of wideband sources using a harmonic source model and uniform linear array,” IEEE Trans. Signal Processing, vol.
47, pp. 619-629, Mar. 1999.
[21]. H. Messer, “The potential performance gain in using spectral information in passive detection/localization of wideband sources,” IEEE Trans. Signal Processing, vol. 43, pp. 2964-2974, Dec. 1995.
[22]. M. Agrawal and S. Prasad, “Broadband doa estimation using spatial-only modeling of array data,” IEEE Trans. Signal Processing, vol. 48, pp. 663-670, Mar.
2000.
[23]. J.-H Lee, Y.-M Chen, and C.-C Yeh, “A covariance approximation method for near-field direction finding using a uniform linear array,” IEEE Trans. Signal
Processing, vol. 43, pp. 1293-1298, May 1995.
[24]. R. O. Schmidt, “Multiple Emitter Location and Signal Parameter Estimation”, IEEE Trans. Antennas and Propag., vol. AP-34, no. 3, pp.276-280,March 1986
[25]. K. Yao, R. E. Hudson, C. W. Reed, D. Chen, and F. Lorenzelli, “Blind beamforming on a randomly distributed sensor array system,” IEEE J. Select. Areas Commun., vol. 16, pp. 1555-1567, Oct. 1998
[26]. D. Arthur, S. Vassilvitskii: "k-means++ The Advantages of Careful Seeding"
2007 Symposium on Discrete Algorithms (SODA).