未來展望

第五章、結論與未來展望

5.2. 未來展望

本論文目前提出之辨識系統，藉由時序分析，能夠對情緒做更細緻之描述與辨識，但其仍有幾點問題必須解決。

特徵點擷取方面，本研究所使用之 AAM 演算法，對於人臉情緒形變過大之表情無法準確擷取人臉特徵，主要原因在於其與樣板影像差別過大所致，或許可藉由建立更豐的資料庫，或加入局部特徵做改善。另外，就運算速度而言，AAM 建立人臉模型之速度仍有加強之空間，若前述之 AAM 準確性有所改善，可透過影像追蹤之方式，每次計算形變模型與紋理模型時，以前一次之人臉模型為基礎，

迭代計算此次之人臉模型參數，降低迭代次數與運算量。

情緒辨識方面，本研究雖然加入時序對情緒做分析，但其仍以靜態情緒辨識為主，對話情境之動態情緒辨識則未能達到其辨識效能，主因仍在於運算速度與時序分析之完善性。在系統運算量納入考量情況下，透過更深入之時序分析，與人臉變化頻率分析，設法擷取人臉表情關鍵之辨識特徵，可以做為未來之探討方向，以求更自然、更廣範之人機互動為目標。

參考文獻

[1] A. Sharkey and N. Sharkey, “Children, and Elderly, and Interactive Robots Anthropomorphism and Deception in Robot Care and Companionship,” IEEE

Robotics & Automation Magazine, vol. 18, no. 1, pp. 32-38, 2011.

[2] “PARO,” available: www.parorobots.com.

[3] “Sony-AIBO,” available: www.sony-aibo.co.uk.

[4] “Gecko System - Mobile Robot Solution for Safety, Security and Service,”

available: www.geckosystems.com/markets/CareBot.php.

[5] “Robosoft,” available: www.robosoft.com/robotic-solutions/healthcare/kompai /index.html.

[6] D. Hanson, “Hanson Robotics Inc.,” available: www.hansonrobotics.com.

[7] A. Vinciarelli, M. Pantic, D. Heylen, C. Pelachaud, I. Poggi, F. D. Errico and M.

Schroeder, “Bridging the Gap between Social Animal and Unsocial Machine: A Survey of Social Signal Processing,” IEEE Trans. Affective Computing, no. 1, vol. 3, pp. 69-87, 2012.

[8] G. S. Shergill, A. Sarrafzadeh, O. Diegel and A. Shekar, “Computerized Sales Assistants: The Application of Computer Technology to Measure Consumer Interest – A Conceptual Framework,” Journal of Electronic Commerce Research, vol. 9, no. 2, pp. 176-191, 2008.

[9] S. Gregory, “Spy on Sports Fans,” TIME Ideas, 2013, available:

http://ideas.time.com/2013/03/14/10-big-ideas/slide/spy-on-sports-fans/.

[10] B. T. Horowitz, “Cybercare: Will Robots Help the Elderly to Live at Home for Longer?” Scientific American, June 21, 2010.

[11] M. Swangnetr and D. B. Kaber, “Emotional State Classification in Patient-Robot

Interaction Using Wavelet Analysis and Statistics-Based Feature Selection,”

IEEE Trans. Human-Machine Systems, vol. 1, no. 43, pp. 63-75, 2013.

[12] “Aethon,” available: www.aethon.com.

[13] A. A. Salah and T. Gevers (eds.), “Computer Analysis of Human Behavior,”

Springer, 2011, chapter 10.

[14] P. Ekman and W. V. Friesen, “Unmasking the Face: A Guide to Recognizing Emotion from Facial Clues,” Prentice Hall, New Jersey, 1975.

[15] J. A. Russell, “A Circumplex Model of Affect,” Journal of Personality & Social

Psychology, vol. 39, no. 6, pp. 1161–1178, 1980.

[16] W. Gu, C. Xiang, Y. V. Venkatesh, D. Huang and H. Lin, “Facial Expression Recognition Using Radial Encoding of Local Gabor Features and Classifier Synthesis,” Pattern Recognition, vol.45, no.1, pp.80-91, 2012.

[17] M. Song, D. Tao, Z. Liu, X. Li and M. Zhou, “Image Ratio Features for Facial Expression Recognition Application,” IEEE Trans. System Man and Cybernetics

Part B-Cybernetics, vol. 42, no. 3, pp. 779-788, 2010.

[18] K. T. Song, M. J. Han and J. W. Hong, "Online Learning Design of an Image-Based Facial Expression Recognition System," To appear in Intelligent

Service Robotics, Vol. 3, No. 3, pp. 151-162, 2010..

[19] K. T. Song and S. C. Chien, “Facial Expression Recognition Based on Mixture of Basic Expression and Intensities,” in Proc. IEEE Int. Conf. Systems, Man, and

Cybernetics, Seoul, South Korea, pp. 3123-3128, 2012.

[20] C. M. Whissell, “The Dictionary of Affect in Language,” Emotion: Theory,

Research and Experience, New York: Academic Press, 1989.

[21] R. E. Thayer, “The Biopsychology of Mood and Arousal,” New York: Oxford

University Press, 1989.

[22] I. Hupont, E. Cerezo and S. Baldassarri, “Sensing Facial Emotions in A Continuous 2D Affective Space,” IEEE Int. Conf. Systems, Man and Cybernetics, Istanbul, Turkey, pp. 2045-2051, 2010.

[23] P. Yang, Q. Liu, X. Cui and D. N. Metaxas, “Facial Expression Recognition Using Encoded Dynamic Features,” IEEE Conf. Computer Vision and Pattern

Recognition, Anchorage, AK, USA, pp.1-8, 2008.

[24] S. Hommel and U. Handmann, “AAM Based Continuous Facial Expression Recognition for Face Image Sequences,” in Proc. IEEE Int. Symp.

Computational Intelligence and Informatics, Budapest, Hungary, pp. 189-194,

2011.

[25] Y. Zhang and Q. Ji, "Active and Dynamic Information Fusion for Facial Expression Understanding from Image Sequences," IEEE Trans. Pattern

Analysis and Machine Intelligent, vol. 27, no. 5, pp. 699-714, 2005.

[26] M. A. Nicolaou, H. Gunes and M. Pantic, “Output-Associative RVM Regression for Dimensional and Continuous Emotion Prediction,” in Proc. IEEE Int. Conf.

Automatic Face & Gesture Recognition, Santa Barbara, CA, USA, pp. 21-24,

2011.

[27] N. Cristian and J. S. Taylor, “An Introduction to Support Vector Machines and Other Kernel-based Learning Methods,” New York: Cambridge University Press, 2000.

[28] M. E. Tipping, “Sparse Bayesian Learning and the Relevance Vector Machine,”

Journal of Machine Learning Research, vol.1, no.3, pp. 211–244, 2001.

[29] A. Ethem, “Introduction to Machine Learning,” Second Edition. The MIT Press,

Cambridge, Massachusetts, London, England, 2010.

[30] P. Viola and M. Jones, “Rapid Object Detection Using a Boosted Cascade of

Simple Features,” IEEE Computer Society Conf. Computer Vision and Pattern

Recognition, Kauai, HI, pp. 511- 518, 2001.

[31] R. Lienhart and J. Maydt, “An Extended Set of Haar-Like Features for Rapid Object Detection,” in Proc. Int. Conf. Image Processing, Rochester, NY, USA, pp. 900-903, 2002.

[32] T. F. Cootes, G. J. Edwards and C. J. Taylor, “Active Appearance Models.” in

Proc. European Conf. Computer Vision, Springer, Berlin, pp 484–498, 1998.

[33] T. F. Cootes, G. J. Edwards and C. J. Taylor, “Active Appearance Models,” IEEE

Transactions on Pattern Analysis and Machine Intelligence, Vol. 23, No. 6, pp.

681-685, 2001.

[34] T. F. Cootes and C. J. Taylor, “Technical Report: Statistical Models of Appearance for Computer Vision,” The University of Manchester School of

Medicine, 2004.

[35] A. McAndrew, “Introduction to Digital Image Processing with Matlab,”

Thomson Course Technology, 2004.

[36] 陳奕彣, 人臉辨識及表情辨識之整合設計, 碩士論文, 國立交通大學電機與 控制工程學系, 2010.

[37] I. Matthews and S. Baker, “Active Appearance Models Revisited,” Int. Journal

of Computer Vision, vol. 60, no.2, pp.135-164, 2004.

[38] C. Goodall, “Procrustes Methods in the Statistical Analysis of Shape,” Journal of

the Royal Statistical Society B, vol 53, no.2, pp.285-339, 1991.

[39] J.R. Shewchuk, “Triangle: Engineering a 2D Quality Mesh Generator and Delaunay Triangulator,” First Workshop on Applied Computational Geometry,

Proceedings, Philadelphia, pp. 124-133, 1996.

[40] P. Ekman and W.V. Friesen, “Facial Action Coding System (FACS): A Technique

for the Measurement of Facial Movement,” Palo Alto, Calif: Consulting

Psychologists Press, 1978.

[41] P. Lucey, J. F. Cohn, T. Kanade, J. Saragih, Z. Ambadar and I. Matthews, “The Extended Cohn-Kanade Dataset (CK+): A Complete Dataset for Action Unit and Emotion-Specified Expression,” IEEE Computer Society Conf. Computer Vision

and Pattern Recognition Workshops, San Francisco, CA, USA, pp. 94-101, 2010.

[42] M. Frieddman and A. Kandel, “Introduction to Pattern Recognition,” World

Scientific, 1999.

[43] D. E. King, “Dlib-ml: A Machine Learning Toolkit,” Journal of Machine

Learning Research 10, pp. 1755-1758, 2009.

[44] D. J. C. MacKay, “The Evidence Framework Applied to Classification Networks,”

Neural Comput., vol. 4, no. 5, pp. 720–736, 1992.

[45] L. Tierney and J. B. Kadane, “Accurate Approximations for Posterior Moments and Marginal Densities,” Journal of the American Statistical Association, vol. 81, no. 393, pp.82-86, 1986.

[46] D. J. C. MacKay, “Bayesian Interpolation,” Neural Comput., vol. 4, no. 3, pp.

415–447, 1992a.

[47] F. Melgani and L. Bruzzone, “Classiﬁcation of Hyperspectral Remote Sensing Images with Support Vector Machines,” IEEE Trans. Geoscience and Remote

Sensing Symposium, vol. 42, no. 8, pp. 1778–1790, 2004.

[48] T. Hastie and R. Tibshirani, “Classification by Pairwise Coupling,” Annals of

Statisitcs, vol. 26, no. 2, pp. 451-471, 1998.

[49] S. Kullback and R. A. Leibler, “On Information and Sufficiency,” Annals of

Mathematical Statistics, vol. 22, no.1, pp. 79-80, 1951.

[50] Y. H. Yang, Y. F. Su, Y. Ch. Lin and H. Chen, “Music Emotion Recognition: the

Role of Individuality,” in Proc. the International workshop on Human-centered

Multimedia, Augsburg, Bavaria, Germany, pp.13-22, 2007.

[51] M. J. Han, C. H. Lin, and K. T. Song, “Robotic emotional expression generation based-on mood transition and personality model,” IEEE Trans. Cybernetics, 2012.

[52] J. A. Russell and M. Bullock, “Multidimensional Scaling of Emotional Facial Expressions: Similarity from Preschoolers to Adults,” Journal of Personality and

Social Psychology, vol. 48, no. 5, pp. 1290–1298, 1985.

[53] S. Jain, H. Changbo and J. K. Aggarwal , “Facial Expression Recognition with Temporal Modeling of Shapes,” IEEE Int. Conf. Computer Vision Workshops, Barcelona, Spain, pp. 1642-1649, 2011.

[54] E. Silva, C. Esparza and Y. Mejia , “POEM-based Facial Expression Recognition, a New Approach,” Image, Signal Processing, and Artificial Vision, Antioquia, Colombia, pp. 162-167, 2012.

100

附錄一、基本人臉情緒混合比例問卷調查樣張

受測者基本資料：

性別: 女年齡:28

問卷說明：

請依序填入各影像所含基本情緒 Neutral(中性, Ne), Anger(生氣, An), Disgust(憎惡、厭惡, Di), Fear(害怕, Fe), Happy(高興, Ha), Sadness(傷心, Sa), Surprise(驚訝, Su)之百分比，百分比加總必須為 100%。

例如:

中性: 20 生氣: 40 厭惡: 10 害怕: 15 高興: 0 傷心: 15 驚訝: 0

S002 S026

指出 S002 影像所包含基本情緒比例之百分比指出 S026 影像所包含基本情緒比例之百分比

中性: 0 生氣:70 厭惡:30 害怕: 0 中性: 40 生氣: 30 厭惡: 30 害怕:0

高興: 0 傷心: 0 驚訝:0 高興: 0 傷心: 0 驚訝: 0

101

S074 S015

指出 S074 影像所包含基本情緒比例之百分比指出 S015 影像所包含基本情緒比例之百分比

中性: 90 生氣: 0 厭惡: 0 害怕: 0 中性: 70 生氣: 0 厭惡:0 害怕: 0

高興: 0 傷心: 0 驚訝: 10 高興: 30 傷心: 0 驚訝: 0

S013 S028

指出 S013 影像所包含基本情緒比例之百分比指出 S028 影像所包含基本情緒比例之百分比

中性: 20 生氣: 0 厭惡: 60 害怕: 0 中性: 30 生氣: 0 厭惡: 0 害怕: 10

高興: 0 傷心: 0 驚訝: 20 高興: 0 傷心: 0 驚訝: 60

S073 S009

指出 S073 影像所包含基本情緒比例之百分比指出 S009 影像所包含基本情緒比例之百分比

中性: 10 生氣: 0 厭惡:0 害怕: 20 中性: 10 生氣: 0 厭惡: 20 害怕: 0

高興: 0 傷心: 0 驚訝: 70 高興: 0 傷心: 70 驚訝: 0

102

S025 S004

指出 S025 影像所包含基本情緒比例之百分比指出 S004 影像所包含基本情緒比例之百分比

中性: 80 生氣: 10 厭惡: 10 害怕: 0 中性: 0 生氣: 40 厭惡: 60 害怕: 0

高興: 0 傷心: 0 驚訝: 0 高興: 0 傷心: 0 驚訝: 0

在文檔中基於時間強化設計之情緒辨識方法 (頁 103-112)

第五章、 結論與未來展望