• 沒有找到結果。

第五章 結論

5.2 未來研究

為了讓本研究能更適用於實際應用上,可針對以下方向做改進:

投影片切換偵測的部分:

(1) 改善 DTW 無法區分兩張版面結構相似的影像,而少抓到換頁的地方。

(2) 目前僅使用 1fps 的方式偵測投影片切換,可針對處理需要增加(減少),

以達正確率或速度的提升。

(3) 可考慮處理換頁時使用特效的情況,與攝影機大幅移動時的事件。

教學重點探勘的部分:

(1) 利用影像、音訊探勘所得結果推論與教學方式之間的關係。

(2) 可定義數種講者使用的手勢,以更準確的過濾非有意義手勢的部分。

44

參考文獻

[1] S. Ammouri, and G. A. Bilodeau, “Face and Hands Detection and Tracking Applied to the Monitoring of Medication Intake,” Canadian Conference on

Computer and Robot Vision, pp. 147-154, Canadian, May 2008.

[2] C. Cotsaces, N. Nikolaidis, and I. Pitas, “Video Shot Detection and Condensed Representation a review,” IEEE Signal Processing Magazine, vol. 23, no. 2, pp.

28-37, Mar. 2006.

[3] H. Fang, J. Jiang, and Y. Feng, “A Fuzzy Logic Approach for Detection of Video Shot Boundaries,” Pattern Recognition, vol. 39, no. 11, pp. 2092-2100, Nov.

2006.

[4] A. M. Ferman, A. M. Tekalp, and R. Mehrotra, “Robust Color Histogram Descriptors for Video Segment Retrieval and Identification,” IEEE Trans. On

Image Processing, vol. 11, no. 5, pp. 497-508, May 2002.

[5] C. Fredembach, M. Schroder, and S. Susstrunk, “Eigenregions for Image Classification,” IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.

26, no. 12, pp. 1645-1649, Dec. 2004.

[6] X. Gao, and X. Tang, “Unsupervised Video-Shot Segmentation and Model-Free Anchorperson Detection for News Video Story Parsing,” IEEE Trans. on Circuits

and Systems for Video Technology, vol. 12, no. 9, pp.765-776, Sept. 2002.

[7] U. Gargi, R. Kasturi, and S. H. Strayer, “Performance Characterization of Video-Shot-Change Detection Methods,” IEEE Trans. on Circuits and Systems

for Video Technology, vol. 10, no. 1, pp.1-13, Feb. 2000.

[8] Y. Gong, “An Accurate and Robust Method for Detecting Video Shot Boundaries,” Proceedings of IEEE International Conference on Multimedia

Computing and Systems, vol. 1, pp. 850-854, July 1999.

[9] R. C. Gonzalez, R. E. Woods, “Digital Image Processing,” Prentice-Hall second

edition, 2002.

[10] C. Grana, and R. Cucchiara, “Linear Transition Detection as a Unified Shot Detection Approach,” IEEE Trans. on Circuits and Systems for Video Technology,

vol. 17,no. 4, pp. 483-489, Apr. 2007.

[11] J. Ha, R. M. Haralick, and I. T. Phillips, “Recursive X-Y Cut Using Bounding Boxes of Connected Components,” Proceedings of the Third International

Conference on Document Analysis and Recognition, vol. 2, pp. 952-955, Aug.

1995.

[12] O. Ikeda, “Estimation of Speaking Speed for Faster Face Detection in Video-Footage,” International Conference on Multimedia and Expo, pp. 442-445, July 2005.

[13] T. Kikukawa, and S. Kawafuchi, “Development of An Automatic Summary Editing System for the Audio Visual Resources,” IEICE Trans., vol. J75-A, no. 2, pp. 204-212, 1992.

[14] R.A. Kirsch, “Computer Determination of the Constituent Structure of Biological Images,” Computers in Biomedical Research, vol. 4, pp. 315-328, 1971.

[15] I. Koprinska, and S. Carrato, “Temporal Video Segmentation: A Survey,” Signal

Processing: Image Communication, vol. 16, pp. 477-500, Jan. 2001.

[16] C. M. Li, Y. S. Li, S. H. Wang, and X. Q. Zhang, “Moving Human Body Detection in Video Sequences,” Proceedings of the Sixth International

Conference on Machine Learning and Cybernetics, vol. 4, pp. 2188-2192, Aug.

2007.

46

[17] L. Liang, Y. Liu, H. Lu, X. Xue, and Y. P. Tan, “Enhanced Shot Boundary Detection Using Video Text Information,” IEEE Trans. on Consumer Electronics,

vol. 51, no. 2, pp. 580-588, May 2005.

[18] H. C. Liu, and G. Zick, “Automatic Determination of Scene Changes in MPEG Compressed Video,” IEEE International Symposium on Circuits and Systems, vol.

1, pp. 764-767, May 1995.

[19] A. Nagasaka and Y. Tanaka, “Automatic Video Indexing and Full-Video Search for Object Appearances,” Proceeding of IFIP Second Workshop Conf. on Visual

Database System II, Budapest, Hunary, pp.113-127, 1992.

[20] W. Niblack, “An Introduction to Image Processing,” Prentice-Hall, Englewood

Cliffs, NJ, pp. 115-116, 1986.

[21] N. Otsu, “A Threshold Selection Method from Gray-Level Histogram,” IEEE

Trans. on Systems, Man, and Cybernetics, vol. 9, no. 1, pp. 62-66, Jan. 1979.

[22] T. Peng, K. Zhao, and B. Li, “Video Abrupt Transition Detection Based on K-L Transform,” IEEE International Conference on Image and Graphics, pp.

845-848, Aug. 2007.

[23] M. Piccardi, “Background Subtraction Techniques: a Review,” IEEE

International Conference on Systems, Man and Cybernetics, vol. 4, pp.

3099-3104, 2004.

[24] S. Salvador, and P. Chan, “Toward Accurate Dynamic Time Warping in Linear Time and Space,” Intelligent Data Analysis, vol. 11, pp. 561-580, Oct.2007.

[25] B. Shahraray, “Scene Change Detection and Content-based Sampling of Video Sequences,” Proceeding of IS&T/SPIE conference on Digital Video

CompressionAlgorithms and Technologies, vol. 2419, pp. 2-13, 1995.

[26] K. W. Sze, K. M. Lam, and G. Qiu, “A New Key Frame Representation for Video Segment Retrieval,” IEEE Trans. on Circuits and Systems for Video Technology,

vol.15, no. 9, pp. 1148-1155, Sept. 2005.

[27] K. W. Sze, K. M. Lam, and G. Qiu, “An Optimal Key Frame Representation for Video Shot Retrieval,” Proceedings of IEEE International Symposium on

Intelligent Multimedia, Video and Speech Processing, pp. 270-273, Oct. 2004.

[28] F. Wang , C. W. Ngo ,and T. C. Pong, “Structuring Low-Quality Videotaped Lectures for Cross-Reference Browsing by Video Text Analysis,” Pattern

Recognition, vol. 41, no. 10, pp. 3257-3269, Oct. 2008.

[29] X. Yi, and N. Ling, “Fast Pixel-Based Video Scene Change Detection,” in

Proceeding IEEE Int. Symp. on Circuits and Systems, pp. 3443-3446, May 2005.

[30] J. Yuan, H. Wang, L. Xiao, W. Zheng, J. Li, F. Lin, and B. Zhang, “A Formal Study of Shot Boundary Detection,” IEEE Trans. on Circuits and Systems for

Video Technology, vol. 17, no. 2, pp.168-186, Feb. 2007.

[31] R. Zabith, J. Miler, and K. Mai, “A Feature-based Algorithm for Detecting and Classifying Production Effects,” ACM Journal of Multimedia Systems, vol. 7, no.

2, pp.119-128, 1999.

[32] H. J. Zhang, A. Kankanhalli, and S. W. Smoliar, “Automatic Partitioning of Full-motion Video,” ACM Journal of Multimedia Systems, vol.1, no. 1, pp. 10-28, 1993.

[33] Y. Zhuangt, Y. Rui, T. S. Huang, and S. Mehrotra, “Adaptive Key Frame Extraction using Unsupervised Clustering,” Proceeding of IEEE International

Conference on Image Processing, vol. 1, pp. 866-870, Oct. 1998.

[34] 王小川, “語音訊號處理,”

全華科技圖書股份有限公司

, 2004

48

附錄ㄧ

影像中出現遮蔽物之偵測結果與方法比較之相似程度圖,橫軸皆為時間(單位為 秒);縱軸為相似程度的分數(介於負 1 與正 1 之間),綠點表示沒有發生投影片 換頁的情況,紅點表示有發生投影片換頁的情況。

only Kirsch Niblack + ER Correction

( II – a )

( II – b )

( II – c )

( II – d )

( II – e )

( II – f )

( II – g )

( II – h )

( II – i )

( II – j )

相關文件