5.1 研究總結
標籤分析中,本論文研究發現在我們設定的四種向量組合中,利用標籤(Tag) 來輔助分群,Tags as words 是最好組合的方式,優於其他三種組合方式。Words only 和 Tags only 特徵數都較少,無法達到好的效果,Words+tags 將標籤和摘 要的比例分開,沒有集中,效果比 Tags as words 差一點。在這部分實驗中可看 出 Constrained-PLSA 加入些許標記的資訊,很明顯的優於其他分群法,些許的 標記資訊,可以有效的提升效能。
在 Semi-supervised Leaning 實驗中,本論文所提出的方法,將背景知識 (Background Knowledge),加入 Constrained-PLSA 演算法,能有效的提升效能 且 穩 定 的 效 果 。 在 實 驗 結 果 中 , 在 兩 個 類 別 和 四 個 類 別 的 情 況 下 , Constrained-PLSA 都能達到很好的效果,與其它方法做比較,大約都高 10%左右 的效能。Constrained-PLSA 只給予些許的標記(Labeled)資訊,使分群的效果達 到不錯的水準,讓系統能夠有效的應用於文件管理應用、資訊擷取等應用上。
5.2 未來研究
本論文之 Constrained-PLSA 在大部分的情況下都有不錯的效能,但在 Reuters 實驗中發現種子數 3%至 5%的情況下略輸 tSVM,但效果還是有一定的水 準,未來可以研究 Constrained-PLSA,要怎麼修改演算法,讓本論文之方法能 夠在這個部分也比 tSVM 效能好。
參考文獻
[1] T. Joachims, “Text categorization with suport vector machines:
Learning with many relevant features," in Proceedings of the 10th European Conference on Machine Learning. London, UK: Springer-Verlag, 1998, pp. 137–142. [Online]. Available:
http://portal.acm.org/citation.cfm?id=645326.649721
[2] A. McCallum and K. Nigam, “A comparison of event models for naive bayes text classification," in IN AAAI-98 WORKSHOP ON LEARNING FOR TEXT CATEGORIZATION. AAAI Press, 1998, pp. 41–48.
[3] R. E. Schapire and Y. Singer, “Boostexter: A boosting-based system for text categorization," Machine Learning, vol. 39, no. 2/3, pp.
135–168, 2000.
[4] A. Blum and T. Mitchell, “Combining labeled and unlabeled data with co-training," in Proceedings of the eleventh annual conference on Computational learning theory, ser. COLT’ 98. New York, NY, USA: ACM, 1998, pp. 92–100. [Online]. Available:
http://doi.acm.org/10.1145/279943.279962
[5] W. Wang and Z.-H. Zhou, “A new analysis of co-training," in Proceedings of the 27th International Conference on Machine Learning (ICML-10), J. F¨urnkranz and T. Joachims, Eds. Haifa, Israel:
Omnipress, June 2010, pp. 1135–1142. [Online]. Available:
http://www.icml2010.org/papers/275.pdf
[6] K. Nigam, A. K. McCallum, S. Thrun, and T. Mitchell, “Text classification from labeled and unlabeled documents using em," Mach.
Learn., vol. 39, pp. 103–134, May 2000. [Online]. Available:
http://portal.acm.org/citation.cfm?id=347709.347724
[7] T. Joachims, “Transductive inference for text classification using support vector machines," in Proceedings of the Sixteenth International Conference on Machine Learning, ser. ICML ’99. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 1999, pp.
200–209. [Online]. Available:
http://portal.acm.org/citation.cfm?id=645528.657646
[8] A. Blum and S. Chawla, “Learning from labeled and unlabeled data using graph mincuts," in Proceedings of the Eighteenth International Conference on Machine Learning, ser. ICML ’01. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2001, pp. 19–26. [Online].
Available: http://portal.acm.org/citation.cfm?id=645530.757779
43
[9] A. B. Goldberg and X. Zhu, “Seeing stars when there aren’t many stars:
graph-based semi-supervised learning for sentiment categorization,"
in Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing, ser. TextGraphs-1. Stroudsburg, PA, USA:
Association for Computational Linguistics, 2006, pp. 45–52.
[Online]. Available:
http://portal.acm.org/citation.cfm?id=1654758.1654769
[10] K. Wagstaff, C. Cardie, S. Rogers, and S. Schr¨odl, “Constrained k-means clustering with background knowledge," in Proceedings of the Eighteenth International Conference on Machine Learning, ser.
ICML ’01. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2001, pp. 577–584. [Online]. Available:
http://portal.acm.org/citation.cfm?id=645530.655669
[11] S. Basu, A. Banerjee, and R. J. Mooney, “Semi-supervised clustering by seeding," in Proceedings of the Nineteenth International Conference on Machine Learning, ser. ICML ’02. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2002, pp. 27–34. [Online].
Available: http://portal.acm.org/citation.cfm?id=645531.656012 [12] S. Basu, M. Bilenko, and R. J. Mooney, “A probabilistic framework
for semi-supervised clustering," in Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, ser. KDD ’04. New York, NY, USA: ACM, 2004, pp. 59–68.
[Online]. Available: http://doi.acm.org/10.1145/1014052.1014062 [13] K. Zhou, X. Gui-Rong, Q. Yang, and Y. Yu, “Learning with positive
and unlabeled examples using topicsensitive plsa," IEEE Trans. on Knowl. and Data Eng., vol. 22, pp. 46–58, January 2010. [Online].
Available: http://dx.doi.org/10.1109/TKDE.2009.56
[14] W. Li and A. McCallum, “Semi-supervised sequence modeling with syntactic topic models," in Proceedings of the 20th national conference on Artificial intelligence - Volume 2. AAAI Press, 2005, pp. 813–818. [Online]. Available:
http://portal.acm.org/citation.cfm?id=1619410.1619463
[15] A. Carlson, J. Betteridge, R. C. Wang, E. R. Hruschka, Jr., and T.
M. Mitchell, “Coupled semi-supervised learning for information extraction," in Proceedings of the third ACM international conference on Web search and data mining, ser. WSDM ’10. New York, NY, USA: ACM, 2010, pp. 101–110. [Online]. Available:
http://doi.acm.org/10.1145/1718487.1718501
[16] S. X. Yu and J. Shi, “Grouping with bias," in NIPS, 2001, pp.
1327–1334.
[17] R. Nock and F. Nielsen, “Grouping with bias revisited," in Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition, ser. CVPR’04. Washington, DC, USA:
IEEE Computer Society, 2004, pp. 460–465. [Online]. Available:
http://portal.acm.org/citation.cfm?id=1896300.1896368
[18] T. Joachims, “Transductive learning via spectral graph partitioning," in Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003), 2003, pp. 290–297.
[19] X. Zhu, “Semi-supervised learning literature survey," Computer Sciences, University of Wisconsin-Madison, Tech. Rep. 1530, 2005.
[20] J. Shi and J. Malik, “Normalized cuts and image segmentation," IEEE Trans. Pattern Anal. Mach. Intell., vol. 22, no. 8, pp. 888–905, 2000.
[21] A. Y. Ng, M. I. Jordan, and Y.Weiss, “On spectral clustering:
Analysis and an algorithm," in Advances in Neural Information Processing Systems 14. MIT Press, 2001, pp. 849–856.
[22] T. Hofmann, “Probabilistic latent semantic analysis," in Proc. of Uncertainty in Artificial Intelligence, UAI’99, 1999. [Online].
Available: citeseer.ist.psu.edu/hofmann99probabilistic.html
[23] T.Hofmann, “Unsupervised learning by probabilistic latent semantic analysis," Mach. Learn., vol. 42, no. 1-2, pp. 177–196, 2001.
[24] T. Hofmann, J. Puzicha, and M. I. Jordan, “Learning from dyadic data,"
in Proceedings of the 1998 conference on Advances in neural information processing systems II. Cambridge, MA, USA: MIT Press, 1999, pp. 466–472.
[25] S. Zhong, “Semi-supervised model-based document clustering: A comparative study," Mach. Learn., vol. 65, pp. 3–29, October 2006.
[Online]. Available:
http://portal.acm.org/citation.cfm?id=1164582.1164590
[26] T. Finley and T. Joachims, “Supervised clustering with support vector machines," in Proceedings of the 22nd international conference on Machine learning, ser. ICML ’05. New York, NY, USA:
ACM, 2005, pp. 217–224. [Online]. Available:
http://doi.acm.org/10.1145/1102351.1102379
[27] J. Wang, S. Wu, H. Q. Vu, and G. Li, “Text document clustering with metric learning," in Proceeding of the 33rd international ACM SIGIR
45
conference on Research and development in information retrieval, ser.
SIGIR ’10. New York, NY, USA: ACM, 2010, pp. 783–784. [Online].
Available: http://doi.acm.org/10.1145/1835449.1835614
[28] A. P. Dempster, N. M. Laird, and D. B. Rubin, “Maximum likelihood from incomplete data via the em algorithm," JOURNAL OF THE ROYAL STATISTICAL SOCIETY, SERIES B, vol. 39, no. 1, pp. 1–38, 1977.
[Online]. Available:
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.133.4884 [29] D.M.Blei, A.Y. Ng, and M. I. Jordan, “Latent Dirichlet Allocation"
in Journal of Machine Learning Research, 2003, pp. 993-1002
[30] D. Ramage, P. Heymann, C. D. Manning, and H. Garcia-Molina,
“Clustering the Tagged Web" in Second ACM International Conference on Web Search and Data Mining (WSFM), 2009