未來展望

第五章結論

第二節未來展望

本論文為人工建構資料庫，在收集資料時發現資料來源單一(只有從巴哈論壇收集)、由於沒有加入回饋功能，故資料庫無法自動新增詞彙以及產品量其實也還算少。未來可以架伺服器建立資料庫，利用爬蟲的方式從各大論壇如批踢踢實業坊 (PTT) (https://www.ptt.cc/bbs/hotboards.html) 和伊莉論壇 (eyny) (http://www68.eyny.com/index.php)等抓取相關評論並自動學習後，將評論之摘要及產品推薦整合成一個網頁供使用者參考討論。

在評論分析方面，由於論壇性質所以幾乎沒有反諷、釣魚等句子。比較句的分析上雖然只有用 Breadth-First Search 和找尋標記為 Head Na 兩種方式初步與意見詞配對，但是仍不夠完善，例如例句中的「比 PVC 本體還精緻的機槍登場」，

本論文的處理為「機槍+精緻」，但對於「PVC 本體」的極性並未處理。未來可以朝查詢反諷語句之真義及比較句中比較詞 A 和比較詞 B 的極性分群等。

參考文獻

Agarwal, Basant, and Namita Mittal. Categorical probability proportion difference (CPPD): A feature selection method for sentiment classification. Proceedings of the 2nd Workshop on Sentiment Analysis where AI meets Psychology (SAAIP 2012), COLING. 2012.

Agerri, Rodrigo and Bermudez, Josu, and Rigau, German. 2014. Ixa pipeline: Efficient and ready to use multilingual nlp tools. In Proceedings of the 9th Language Resources and Evaluation Conference (LREC2014), pages 26–31, Reykjavik, Iceland, May.

ALTER(アルター)：https://alter-web.jp/

Amazon.cp.jp：https://www.amazon.co.jp/

Baccianella, S. and Esuli, A. and Sebastiani, F. 2010. Senti- WordNet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In Seventh conference on International Language Resources and Evaluation (LREC-2010), Malta., volume 25.

Brown, Peter F and Desouza, Peter V and Mercer, Robert L and Vincent Pietra, J Della and Lai, Jenifer C. 1992. Classbased n-gram models of natural language.

Computational linguistics, 18(4):467–479. Rodrigo Agerri, Josu Bermudez, and German Rigau. 2014. Ixa pipeline: Efficient and ready to use multilingual nlp tools. In Proceedings of the 9th Language Resources and Evaluation Conference (LREC2014), pages 26–31, Reykjavik, Iceland, May.

Carletta, J. (1996). "Assessing Agreement on Classification Tasks: the Kappa Statistic," Computational linguistics, 22(2), pp. 249-254.

Church, K.W., Hanks, P.: Word association norms, mutual information, and lexicography. Computational Linguistics 16(1) (1990) 22–29

Clark, Alexander. 2003. Combining distributional and morphological information for part of speech induction. In Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics-Volume 1, pages 59–66.

cLayz（クレイズ）：http://clayz-online.com/

De Clercq, O., Van de Kauter, M., Lefever, E., & Hoste, V. (2015). Applying hybrid terminology extraction to aspect-based sentiment analysis. In International

Workshop on Semantic Evaluation (SemEval 2015) (pp. 719-724). Association for

Computational Linguistics.

Garcıa-Pablos, A., Cuadros, M., & Rigau, G. (2015). V3: unsupervised aspect based sentiment analysis for SemEval-2015 Task 12. SemEval-2015, 714–718.

goo 辞書：https://dictionary.goo.ne.jp/

GSC(GOOD SMILE COMPANY)：http://www.goodsmile.info/zh/

Hall, Mark and Frank, Eibe and Holmes, Geoffrey and Pfahringer, Bernhard and Peter Reutemann and Ian H. Witten. 2009. The WEKA data mining software: an update.

SIGKDD Explor. Newsl., 11(1):10–18, november.

Hu, M. and Liu, B. 2004. Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 168–177.

Jiménez-Zafra, S. M., Martınez-Cámara, E., Martın-Valdivia, M. T., & Urena-López, L.

A. (2015). SINAI: Syntactic approach for Aspect Based Sentiment Analysis. SemEval-2015, 730–735.

Koppula, A. R., Pallelra, R. R., Repaka, R., & Movva, V. S. (2015).

UMDuluth-CS8761-12: A Novel Machine Learning Approach for Aspect Based Sentiment Analysis. SemEval-2015, 742–747.

KOTOBUKIYA | 株式会社壽屋コトブキヤ：http://www.kotobukiya.co.jp/

Ku, L.-W. and Chen, H.-H. 2007. Mining Opinions from the Web: Beyond Relevance Retrieval. Journal of American Society for Information Science and Technology, Special Issue on Mining Web Resources for Enhancing Information Retrieval, 58(12), 1838-1850.

Liu, Bing and Hu, Minqing and Cheng, Junsheng. 2005. Opinion Observer: Analyzing

and Comparing Opinions on the Web. In Proceedings of the 14th International World Wide Web conference (WWW-2005). Chiba, Japan.

Liu, Kang and Xu, Liheng and Zhao, Jun 2014. Extracting Opinion Targets and Opinion Words from Online Reviews with Graph Co-ranking Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics

Liu, L., Lei, M., & Wang, H. (2013). Combining domain-specific sentiment lexicon with hownet for chinese sentiment analysis. Journal of Computers, 8(4), 878-883.

Lu, Bin and Ott, Myle and Cardie, Claire, and Tsou, Benjamin K. 2011. Multi-aspect sentiment analysis with topic models. In Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on, pages 81–88. IEEE.

McCallum, Andrew Kachites. 2002. MALLET: A Machine Learning for Language Toolkit.

Mikolov, Tomas and Sutskever, Ilya and Chen, Kai and Corrado, Greg S and Dean, Jeff. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems, pages 3111–3119.

Miller, George A. 1995. Wordnet: a lexical database forenglish. Communications of the ACM, 38(11):39–41.

Nielsen, Finn A° rup. 2011. A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs. In Proceedings, 1st Workshop on Making Sense of Microposts (#MSM2011): Big things come in small packages. pp: 93-98. Greece.

Pontiki M., Galanis D., Papageorgiou H., Manandhar S., & Androutsopoulos I.(2015, June). Semeval-2015 task 12: Aspect Based Sentiment Analysis. In Proceedings

of the 9th international workshop on semantic evaluation (SemEval 2015) (pp.

486-495).

PTT：https://www.ptt.cc/bbs/hotboards.html

Saias, J. (2015, June). Sentiue: Target and aspect based sentiment analysis in semeval-2015 task 12. Association for Computational Linguistics.

San Vicente, I., Saralegi, X., Agerri, R., & Sebastián, D. S. (2015, June). Elixa: A modular and flexible absa platform. In Proceedings of the 9th International

Workshop on Semantic Evaluation (SemEval 2015) (pp. 748-752).

SIGLEX(Special Interest Group on the Lexicon)：http://alt.qcri.org/semeval2015/

Stone P. and Dunphy, D. and Smith, M. and Ogilvie, D. 1966. The General Inquirer: A Computer Approach to Content Analysis. Cambridge (MA): MIT Press.

Weblio 日中中日辞典：http://cjjc.weblio.jp/

Wilson, Theresa and Wiebe, Janyce and Hoffmann, Paul. 2005. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, HLT ’05, pages 347–354, Stroudsburg, PA, USA.

𝑖𝑥𝑎 - 𝑝𝑖𝑝𝑒 - 𝑛𝑒𝑟𝑐 Named Entity Recognition system. Available from ：

https://github.com/ixa-ehu/ixa-pipe-nerc

中研院中文剖析系統：http://parser.iis.sinica.edu.tw/

中研院中文斷詞系統：http://ckipsvr.iis.sinica.edu.tw/

分群範例【風華の開箱】 ALTER 未聞花名本間芽衣子：

https://forum.gamer.com.tw/Co.php?bsn=60036&sn=237462

巴哈姆特電玩資訊站：https://www.gamer.com.tw/

陳傳生，2014“使用廣義知網於情感詞彙之極性分析研究”，國立師範大學資訊工程研究所碩士論文。

臉書社團「 PVC_Figure 人型討論分享社」：

https://www.facebook.com/groups/figure.hot/

附錄

1. 特徵詞分群

分群於身體之詞彙(65 個詞)

大腿小腿肉小辮子心手

手指手指甲手掌手臂右手

左手皮皮膚耳朵肉肉

肉體肌膚呆毛屁屁屁股

秀髮肚臍身體乳量股間

股溝前髮指甲美腿面相

香肩馬尾眼神眼睛脖子

麻花捲單腳短髮腋微乳

微笑腰腳腳指甲腳趾

腳趾腿裸足嘴型嘴唇

皺紋膚質膝髮尾髮絲

頭頭皮屑頭髮臉臉蛋

臉部臉頰雙手關節辮子

分群於整體之詞彙(135 個詞)

人少女方式水手水準

牛鬼主題凹凸感凹陷感可愛型

本體正面白色立體感份量

光澤共通點成品曲線色

圖騰壽屋槍槍托槍超

漆磁鐵窩網襪緊身衣

舞台劍噴嘴墜飾樣子

澎澎裙皺褶緞帶緞帶花蓮蓬頭

蝙蝠蝴蝶結輪胎鞋上鞋子

髮飾齒輪機身機槍機關槍

燈燈籠燒痕興趣褲

褲子褲襪貓耳超讚頭紗蕾絲

顆粒點鎖扣鬆緊帶爆表

繩結霧襪子鐵絲襯裙

大衣

2. 意見詞分群

分群於正向之詞彙(118 個詞)

一流一等一一覽無遺大膽不錯

分明仔細充分出色可圈可點

可愛正點生動用心光滑

全新好好看好棒成功

有意思有趣自然呆呆呈現出來

均勻完美完整良好到位

固定性感明確明顯物超所值

花俏表現出來亮便宜俐落

搶眼楚楚可憐滑緊緊緊

蒼白颯爽親民還好還原

舊難得騷霸氣豪棒

彎曲

分群於模糊之詞彙(34 個詞)

一樣不行化反光扎眼

如此吹了相當相關胖次

真不少高到高熱掉淋淋

深透明這麼這樣就是了

發光黑傻巴實對

緊繃凜黝黑爆難怪

飄逸飄飄變質

分群於負向之詞彙(36 個詞)

不佳少可怕可惜失真

失敗失望危險多餘色移

怪怪怪怪異突兀胖

凌亂害臊差恐怖草率

馬虎乾瘦崩崩壞減少

硬微妙腫詭異誇張

慘遜色僵硬難難看

糟糕

分群於偏負之詞彙(12 個詞)

平扁扁單一短貴

圓塌塌塌溢色溢

違和過頭

在文檔中利用剖析樹結構探討論壇評論之特徵與意見詞配對關係 (頁 97-109)

第五章 結論

第二節 未來展望