第五章、 結論與未來展望
5.2. 未來展望
本論文目前提出之辨識系統,藉由時序分析,能夠對情緒做更細緻之描述與 辨識,但其仍有幾點問題必須解決。
特徵點擷取方面,本研究所使用之 AAM 演算法,對於人臉情緒形變過大之 表情無法準確擷取人臉特徵,主要原因在於其與樣板影像差別過大所致,或許可 藉由建立更豐的資料庫,或加入局部特徵做改善。另外,就運算速度而言,AAM 建立人臉模型之速度仍有加強之空間,若前述之 AAM 準確性有所改善,可透過 影像追蹤之方式,每次計算形變模型與紋理模型時,以前一次之人臉模型為基礎,
迭代計算此次之人臉模型參數,降低迭代次數與運算量。
情緒辨識方面,本研究雖然加入時序對情緒做分析,但其仍以靜態情緒辨識 為主,對話情境之動態情緒辨識則未能達到其辨識效能,主因仍在於運算速度與 時序分析之完善性。在系統運算量納入考量情況下,透過更深入之時序分析,與 人臉變化頻率分析,設法擷取人臉表情關鍵之辨識特徵,可以做為未來之探討方 向,以求更自然、更廣範之人機互動為目標。
94
參考文獻
[1] A. Sharkey and N. Sharkey, “Children, and Elderly, and Interactive Robots Anthropomorphism and Deception in Robot Care and Companionship,” IEEE
Robotics & Automation Magazine, vol. 18, no. 1, pp. 32-38, 2011.
[2] “PARO,” available: www.parorobots.com.
[3] “Sony-AIBO,” available: www.sony-aibo.co.uk.
[4] “Gecko System - Mobile Robot Solution for Safety, Security and Service,”
available: www.geckosystems.com/markets/CareBot.php.
[5] “Robosoft,” available: www.robosoft.com/robotic-solutions/healthcare/kompai /index.html.
[6] D. Hanson, “Hanson Robotics Inc.,” available: www.hansonrobotics.com.
[7] A. Vinciarelli, M. Pantic, D. Heylen, C. Pelachaud, I. Poggi, F. D. Errico and M.
Schroeder, “Bridging the Gap between Social Animal and Unsocial Machine: A Survey of Social Signal Processing,” IEEE Trans. Affective Computing, no. 1, vol. 3, pp. 69-87, 2012.
[8] G. S. Shergill, A. Sarrafzadeh, O. Diegel and A. Shekar, “Computerized Sales Assistants: The Application of Computer Technology to Measure Consumer Interest – A Conceptual Framework,” Journal of Electronic Commerce Research, vol. 9, no. 2, pp. 176-191, 2008.
[9] S. Gregory, “Spy on Sports Fans,” TIME Ideas, 2013, available:
http://ideas.time.com/2013/03/14/10-big-ideas/slide/spy-on-sports-fans/.
[10] B. T. Horowitz, “Cybercare: Will Robots Help the Elderly to Live at Home for Longer?” Scientific American, June 21, 2010.
[11] M. Swangnetr and D. B. Kaber, “Emotional State Classification in Patient-Robot
95
Interaction Using Wavelet Analysis and Statistics-Based Feature Selection,”
IEEE Trans. Human-Machine Systems, vol. 1, no. 43, pp. 63-75, 2013.
[12] “Aethon,” available: www.aethon.com.
[13] A. A. Salah and T. Gevers (eds.), “Computer Analysis of Human Behavior,”
Springer, 2011, chapter 10.
[14] P. Ekman and W. V. Friesen, “Unmasking the Face: A Guide to Recognizing Emotion from Facial Clues,” Prentice Hall, New Jersey, 1975.
[15] J. A. Russell, “A Circumplex Model of Affect,” Journal of Personality & Social
Psychology, vol. 39, no. 6, pp. 1161–1178, 1980.
[16] W. Gu, C. Xiang, Y. V. Venkatesh, D. Huang and H. Lin, “Facial Expression Recognition Using Radial Encoding of Local Gabor Features and Classifier Synthesis,” Pattern Recognition, vol.45, no.1, pp.80-91, 2012.
[17] M. Song, D. Tao, Z. Liu, X. Li and M. Zhou, “Image Ratio Features for Facial Expression Recognition Application,” IEEE Trans. System Man and Cybernetics
Part B-Cybernetics, vol. 42, no. 3, pp. 779-788, 2010.
[18] K. T. Song, M. J. Han and J. W. Hong, "Online Learning Design of an Image-Based Facial Expression Recognition System," To appear in Intelligent
Service Robotics, Vol. 3, No. 3, pp. 151-162, 2010..
[19] K. T. Song and S. C. Chien, “Facial Expression Recognition Based on Mixture of Basic Expression and Intensities,” in Proc. IEEE Int. Conf. Systems, Man, and
Cybernetics, Seoul, South Korea, pp. 3123-3128, 2012.
[20] C. M. Whissell, “The Dictionary of Affect in Language,” Emotion: Theory,
Research and Experience, New York: Academic Press, 1989.
[21] R. E. Thayer, “The Biopsychology of Mood and Arousal,” New York: Oxford
University Press, 1989.
96
[22] I. Hupont, E. Cerezo and S. Baldassarri, “Sensing Facial Emotions in A Continuous 2D Affective Space,” IEEE Int. Conf. Systems, Man and Cybernetics, Istanbul, Turkey, pp. 2045-2051, 2010.
[23] P. Yang, Q. Liu, X. Cui and D. N. Metaxas, “Facial Expression Recognition Using Encoded Dynamic Features,” IEEE Conf. Computer Vision and Pattern
Recognition, Anchorage, AK, USA, pp.1-8, 2008.
[24] S. Hommel and U. Handmann, “AAM Based Continuous Facial Expression Recognition for Face Image Sequences,” in Proc. IEEE Int. Symp.
Computational Intelligence and Informatics, Budapest, Hungary, pp. 189-194,
2011.
[25] Y. Zhang and Q. Ji, "Active and Dynamic Information Fusion for Facial Expression Understanding from Image Sequences," IEEE Trans. Pattern
Analysis and Machine Intelligent, vol. 27, no. 5, pp. 699-714, 2005.
[26] M. A. Nicolaou, H. Gunes and M. Pantic, “Output-Associative RVM Regression for Dimensional and Continuous Emotion Prediction,” in Proc. IEEE Int. Conf.
Automatic Face & Gesture Recognition, Santa Barbara, CA, USA, pp. 21-24,
2011.
[27] N. Cristian and J. S. Taylor, “An Introduction to Support Vector Machines and Other Kernel-based Learning Methods,” New York: Cambridge University Press, 2000.
[28] M. E. Tipping, “Sparse Bayesian Learning and the Relevance Vector Machine,”
Journal of Machine Learning Research, vol.1, no.3, pp. 211–244, 2001.
[29] A. Ethem, “Introduction to Machine Learning,” Second Edition. The MIT Press,
Cambridge, Massachusetts, London, England, 2010.
[30] P. Viola and M. Jones, “Rapid Object Detection Using a Boosted Cascade of
97
Simple Features,” IEEE Computer Society Conf. Computer Vision and Pattern
Recognition, Kauai, HI, pp. 511- 518, 2001.
[31] R. Lienhart and J. Maydt, “An Extended Set of Haar-Like Features for Rapid Object Detection,” in Proc. Int. Conf. Image Processing, Rochester, NY, USA, pp. 900-903, 2002.
[32] T. F. Cootes, G. J. Edwards and C. J. Taylor, “Active Appearance Models.” in
Proc. European Conf. Computer Vision, Springer, Berlin, pp 484–498, 1998.
[33] T. F. Cootes, G. J. Edwards and C. J. Taylor, “Active Appearance Models,” IEEE
Transactions on Pattern Analysis and Machine Intelligence, Vol. 23, No. 6, pp.
681-685, 2001.
[34] T. F. Cootes and C. J. Taylor, “Technical Report: Statistical Models of Appearance for Computer Vision,” The University of Manchester School of
Medicine, 2004.
[35] A. McAndrew, “Introduction to Digital Image Processing with Matlab,”
Thomson Course Technology, 2004.
[36] 陳奕彣, 人臉辨識及表情辨識之整合設計, 碩士論文, 國立交通大學電機與 控制工程學系, 2010.
[37] I. Matthews and S. Baker, “Active Appearance Models Revisited,” Int. Journal
of Computer Vision, vol. 60, no.2, pp.135-164, 2004.
[38] C. Goodall, “Procrustes Methods in the Statistical Analysis of Shape,” Journal of
the Royal Statistical Society B, vol 53, no.2, pp.285-339, 1991.
[39] J.R. Shewchuk, “Triangle: Engineering a 2D Quality Mesh Generator and Delaunay Triangulator,” First Workshop on Applied Computational Geometry,
Proceedings, Philadelphia, pp. 124-133, 1996.
[40] P. Ekman and W.V. Friesen, “Facial Action Coding System (FACS): A Technique
98
for the Measurement of Facial Movement,” Palo Alto, Calif: Consulting
Psychologists Press, 1978.
[41] P. Lucey, J. F. Cohn, T. Kanade, J. Saragih, Z. Ambadar and I. Matthews, “The Extended Cohn-Kanade Dataset (CK+): A Complete Dataset for Action Unit and Emotion-Specified Expression,” IEEE Computer Society Conf. Computer Vision
and Pattern Recognition Workshops, San Francisco, CA, USA, pp. 94-101, 2010.
[42] M. Frieddman and A. Kandel, “Introduction to Pattern Recognition,” World
Scientific, 1999.
[43] D. E. King, “Dlib-ml: A Machine Learning Toolkit,” Journal of Machine
Learning Research 10, pp. 1755-1758, 2009.
[44] D. J. C. MacKay, “The Evidence Framework Applied to Classification Networks,”
Neural Comput., vol. 4, no. 5, pp. 720–736, 1992.
[45] L. Tierney and J. B. Kadane, “Accurate Approximations for Posterior Moments and Marginal Densities,” Journal of the American Statistical Association, vol. 81, no. 393, pp.82-86, 1986.
[46] D. J. C. MacKay, “Bayesian Interpolation,” Neural Comput., vol. 4, no. 3, pp.
415–447, 1992a.
[47] F. Melgani and L. Bruzzone, “Classification of Hyperspectral Remote Sensing Images with Support Vector Machines,” IEEE Trans. Geoscience and Remote
Sensing Symposium, vol. 42, no. 8, pp. 1778–1790, 2004.
[48] T. Hastie and R. Tibshirani, “Classification by Pairwise Coupling,” Annals of
Statisitcs, vol. 26, no. 2, pp. 451-471, 1998.
[49] S. Kullback and R. A. Leibler, “On Information and Sufficiency,” Annals of
Mathematical Statistics, vol. 22, no.1, pp. 79-80, 1951.
[50] Y. H. Yang, Y. F. Su, Y. Ch. Lin and H. Chen, “Music Emotion Recognition: the
99
Role of Individuality,” in Proc. the International workshop on Human-centered
Multimedia, Augsburg, Bavaria, Germany, pp.13-22, 2007.
[51] M. J. Han, C. H. Lin, and K. T. Song, “Robotic emotional expression generation based-on mood transition and personality model,” IEEE Trans. Cybernetics, 2012.
[52] J. A. Russell and M. Bullock, “Multidimensional Scaling of Emotional Facial Expressions: Similarity from Preschoolers to Adults,” Journal of Personality and
Social Psychology, vol. 48, no. 5, pp. 1290–1298, 1985.
[53] S. Jain, H. Changbo and J. K. Aggarwal , “Facial Expression Recognition with Temporal Modeling of Shapes,” IEEE Int. Conf. Computer Vision Workshops, Barcelona, Spain, pp. 1642-1649, 2011.
[54] E. Silva, C. Esparza and Y. Mejia , “POEM-based Facial Expression Recognition, a New Approach,” Image, Signal Processing, and Artificial Vision, Antioquia, Colombia, pp. 162-167, 2012.
100
附錄一、基本人臉情緒混合比例問卷調查樣張
受測者基本資料:
性別: 女 年齡:28
問卷說明:
請依序填入各影像所含基本情緒 Neutral(中性, Ne), Anger(生氣, An), Disgust(憎 惡、厭惡, Di), Fear(害怕, Fe), Happy(高興, Ha), Sadness(傷心, Sa), Surprise(驚訝, Su)之百分比,百分比加總必須為 100%。
例如:
中性: 20 生氣: 40 厭惡: 10 害怕: 15 高興: 0 傷心: 15 驚訝: 0
S002 S026
指出 S002 影像所包含基本情緒比例之百分比 指出 S026 影像所包含基本情緒比例之百分比
中性: 0 生氣:70 厭惡:30 害怕: 0 中性: 40 生氣: 30 厭惡: 30 害怕:0
高興: 0 傷心: 0 驚訝:0 高興: 0 傷心: 0 驚訝: 0
101
S074 S015
指出 S074 影像所包含基本情緒比例之百分比 指出 S015 影像所包含基本情緒比例之百分比
中性: 90 生氣: 0 厭惡: 0 害怕: 0 中性: 70 生氣: 0 厭惡:0 害怕: 0
高興: 0 傷心: 0 驚訝: 10 高興: 30 傷心: 0 驚訝: 0
S013 S028
指出 S013 影像所包含基本情緒比例之百分比 指出 S028 影像所包含基本情緒比例之百分比
中性: 20 生氣: 0 厭惡: 60 害怕: 0 中性: 30 生氣: 0 厭惡: 0 害怕: 10
高興: 0 傷心: 0 驚訝: 20 高興: 0 傷心: 0 驚訝: 60
S073 S009
指出 S073 影像所包含基本情緒比例之百分比 指出 S009 影像所包含基本情緒比例之百分比
中性: 10 生氣: 0 厭惡:0 害怕: 20 中性: 10 生氣: 0 厭惡: 20 害怕: 0
高興: 0 傷心: 0 驚訝: 70 高興: 0 傷心: 70 驚訝: 0
102
S025 S004
指出 S025 影像所包含基本情緒比例之百分比 指出 S004 影像所包含基本情緒比例之百分比
中性: 80 生氣: 10 厭惡: 10 害怕: 0 中性: 0 生氣: 40 厭惡: 60 害怕: 0
高興: 0 傷心: 0 驚訝: 0 高興: 0 傷心: 0 驚訝: 0