• 沒有找到結果。

目前只對拍賣商品的標題進行文件自動分類,對於只有品牌名稱或是中文字 數較少的標題,會增加自動分類的困難度,這時候如果可以對拍賣商品資訊中的商 品圖片找出特徵,以商品標題搭配商品圖片的方式進行自動分類,以提升分類的準 確性,還可以找出標題與圖片不符合的商品。

另外對於在使用 WordNet 必須將中文翻譯成英文,在將英文詞彙消除歧義,

如果能加強中翻英的部分,勢必能找到更正確的語意,但因為語言文化的差異拍賣 商品的關鍵詞不一定都能翻譯成英文。而 WordNet 本身就是一個本體論的架構具 有分類能力,如果可以運用 WordNet 自身的分類架構進行語意消歧義,例如商品 都是實體的物品而對於抽象的語意進行過濾,在挑選中文詞彙語意的時候可以更 精準。

參考文獻

[1] 電子商務經營模式, http://zh.wikipedia.org/zh-tw/電子商務經營模式.

[2] Salton, G. "Automatic text processing : the Transformation, Analysis, and Retrieval of Information by Computer." Mass. : Addison-Wesley, 1988.

[3] 杜海倫. "以標題進行新聞自動分類." 國立清華大學, 1999.

[12] Cover, T. and J. "Thomas, Elements of information theory." Wiley, 1991.

[13] Hovold, J. "Naive Bayes Spam Filtering Using Word-Position-Based Attributes."

Proceedings of Conference on Email and Anti-Spam, 2005.

[14] Vapnik, V. "The Nature of Statistical Learning Theory." NY Springer, 1995.

[15] Salton, G. "Automatic text processing : the Transformation, Analysis, and Retrieval of Information by Computer." Mass. : Addison-Wesley, 1988.

[16] Duda, R. O. and P. E. Hart. "Pattern Classification and Scene Analysis." Wiley New York, 1973.

[17] James, M. "Classification algorithms." John Wiley & Sons Inc, 1985. Document Classification." Journal of the American Society for Information Science, Vol.31, No.6, 396-402, 1980.

[23] 王稔智、張俊盛. "「適應性文件分類系統」." 第十四屆計算語言學研討會論 文集, 99-121, 2001.

[24] Tseng, Y-H. "Fast co-occurrence thesaurus construction for Chinese news." IEEE International Conference on System, Man, and Cybernetics, Vol.2, 853-858, 2001 [25] Yiming Yang, Jan O. Pedersen. "A Comparative Study on Feature Selection in Text

Categorization." Proceeding of 14th International Conference on Machine Learning, 1997.

[26] Joachims, T. "Text categorization with Support Vector Machines: Learning with many relevant features." In Proceedings of ECML-98, 137-142, 1998.

[27] Larkey, L. and Croft, W. B. "Combining Classifiers in Text Categorization"

Proceedings of the 19th International Conference on Research and Development Information Retrieval (SIGIR96), Zurich, Switzerland, 289-297, 1996.

[28] 詹欣逸. "利用 wordnet 判斷字詞包含關係─應用於動態階層文件分群." 國立 中央大學, 2013.

[29] G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. J. Miller. "Introduction to WordNet: An On-line Lexcial Database.", International journal of lexicography,

[30] 謝靜婷. "半自動建立中文 wordnet 之研究." 國立清華大學, 2002. Domains." In Emerging Trends in Engineering and Technology, 2008. ICETET '08.

First International Conference on, 1187-1191, 2008.

[35] Haisheng, Li, Tian Yun, Ye Ben and Cai Qiang. "Comparison of Current Semantic Similarity Methods in Wordnet." In Computer Application and System Modeling (ICCASM), 2010 International Conference on, 4, V4-408-V4-411, 2010.

[36] Ahsaee, M. G., M. Naghibzadeh and S. E. Yasrebi. "Using Wordnet to Determine Semantic Similarity of Words." In Telecommunications (IST), 2010 5th International Symposium on, 1019-1027, 2010.

[37] Peng-Yuan, Liu, Zhao Tie-Jun and Yu Xiao-Feng. "Application-Oriented Comparison and Evaluation of Six Semantic Similarity Measures Based on Wordnet." In Machine Learning and Cybernetics, 2006 International Conference on, 2605-2610, 2006.

[38] Young-Bum, Kim and Kim Yu-Seop. "Latent Semantic Kernels for Wordnet:

Transforming a Tree-Like Structure into a Matrix." In Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on, 76-80, 2008.

[39] Chua, Stephanie and Narayanan Kulathuramaiyer. "Semantic Feature Selection Using Wordnet." In Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence, 166-172: IEEE Computer Society, 2004.

[40] Yang, C-H. and S-J. Ker. "Considerations of Linking WordNet with MRD." In Proceedings of the 19th international conference on Computational linguistics, Vol.1, 1-7, 2002.

[41] Chen, Hsin-Hsi, Chi-Ching Lin and Wen-Cheng Lin. "Construction of a Chinese-English Wordnet and Its Application to Clir." In Proceedings of the fifth international workshop on on Information retrieval with Asian languages, 189-196.

Hong Kong, China: ACM, 2000.

[42] 中央研究院中文斷詞系統, http://ckipsvr.iis.sinica.edu.tw/.

[43] 馬偉雲. 未知詞擷取作法, http://ckipsvr.iis.sinica.edu.tw/uwe.htm. 2004.

[44] Farbrizio, S. "Machine Learning in Automated Text Categorization." ACM Computing Surveys, Vol.34, No.1, 1-47, 2002.

[45] Dong, J., H. Cao, P. Liu and L. Ren "Bayesian Chinese Spam Filter Based on Crossed N-gram." Proceedings of the 6th International Conference on Intelligent Systems Design and Applications (ISDA), 103-108, 2006.

[46] Salton, G. "Automatic Text Processing ." Mass. : Addison-Wesley, 1989.

[47] Belur V. Dasarathy. "Nearest Neighbor(NN) Norms : Pattern Classification Techniques." Ieee Computer Society, 1990.

[48] Baker, L. D., and McCallum, A. K. "Distributional Clustering of Words for Text Classification." In Proceedings of SIGIR-98, 96-103, 1998.

[49] Libsvm, http://www.csie.ntu.edu.tw/~cjlin/libsvm/.

[50] 林銘達. "以影像分割和語義 WordNet 為基礎的自動圖像註解法." 國立屏東 商業技術學院, 2013.

[51] 錯別字, http://zh.wikipedia.org/zh-tw/錯別字.

附錄一 中研院平衡語料庫詞類標記集

Vt VF VF1, VF2 /*動作謂賓動詞*/

相關文件