• 沒有找到結果。

Chapter 5 Experiment and Evaluation 53

5.5 User Study

5.5.2 User Study Result

We recruited 8 subjects for our user study, and they rated 211 web pages totally. We apply half-life utility measure to evaluate the performances of three different types of profiles. The rating r in half-life utility measure can be from 1 to 5 according to subjects’ ratings, and the maximum achievable utilityHUimax is gained by setting the

5.5. USER STUDY 71

Figure 5.10: Evaluation Result of User Study ratings of useri’s all items to 5.

We useHU@n to view the average performances among all subjects’ top-n item only, and we show the results includingHU@1, HU@3, HU@5, and HU@10. From the results in Fig. 5.10, the most similar item measured by the baseline method showed the best performance, which means the subjects gave the ratings averagely higher than the top-1 items measured by semantic based profiles. The utilities of semantic tag-based profiles tag-based on ConceptNet are the best amongHU@3, HU@5, and HU@10.

The utilities of semantic tag-based profiles based on WordNet are a little lower than the utilities of the baseline method in HU@3 and HU@5, but it becomes better in HU@10.

Chapter 6

Conclusion

In this thesis we proposed semantic based user profiles enriching the original tag-based user profiles by tag concepts. Each tag concept represents a common concept by the core tag and the set of semantic similar tags. We also proposed the similrity mea-sure for semantic tag-based user profiles which eliminates the deficiency of measuring similarity between tag-based user profiles. By applying cosine similarity in measuring the similarity between two distinct tags, we only get zero. But by applying the same method between two tag concepts, we can get the similarity if the two concepts are overlapped. By the similarity measure, we also can find out similar users or identify items a user has interests in.

Based on a user’s resource collection and associated sets of tags on social me-dia sites, we could construct the semantic tag-based user profile containing the set of tag concepts to represent the user’s interests. We introduced three semantic resources, WordNet and ConceptNet and Google snippets, with the associated approaches to

mea-73

sure semantic similarities between tags. We represented how to construct a tag concept from a tag by spreading activation with semantic similarities, and then we constructed a semantic tag-based user profile by a set of tag concepts from a user’s resource col-lection with associated tags.

From empirical evaluation, we showed the performances of the semantic tag-based user profiles based on WordNet, ConceptNet, and Google snippets all were better than the performance of the tag-based user profile with the data set consisting 20,578 users and 80,000 bookmarks by 5-fold cross validation. From the result of user study, se-mantic tag-based user profiles based on ConceptNet show the best utility excluding considering top 1 only.

6.1 Summary of Contributions

• We proposed a semantic similarity measure for tag-based profiles with appropri-ate properties, and this measure eliminappropri-ates the deficiency of measuring similarity by cosine similarity.

• We provided an insight into how the semantic tag-based profile of a user can be constructed from tags associated with the user’s social media collection, and the semantic relations preserved in the profile could reflect the user’s interests as the concepts.

• We proposed tag concepts capturing semantic relations between tags, and se-mantic similarities between tags could be measured based on different sese-mantic

6.2. FUTURE WORK 75

resources to represent different meanings. In this thesis we utilized WordNet, ConceptNet, and Google snippets to measure semantic similarity.

6.2 Future Work

According to our definition of semantic tag-based profiles, we can construct different profiles based on different approaches and semantic resources. However, it is pos-sible to combine different semantic resources with associated similarity measures to construct one semantic tag-based profile revealing better performance. Based on the same tag with different semantic resources, we may construct tag concepts including distinct set of tags and associated weights. Thus, combining all tag concepts into one is an important issue to do in the future.

The problem about how to filter dissimilar tags in a tag concept is also a research issue. Further, if we can confirm dissimilar tags when measuring the similarity between tag concepts, we can obtain more accurate semantic similarity between tag concepts and between semantic tag-based profiles probably.

In this thesis, we construct semantic tag-base profiles based on tag-based profiles which tag weights are measured by a simple approach. However, tag weights can be determined by different approaches for different circumstances. For example, we can consider temporal factor and add more tag weights on the set of tags used recently.

And we can combine those factors with our proposed solutions for different purposes.

[1] G. Adomavicius and A. Tuzhilin. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering, 17(6):734–749, 2005.

[2] M. Ames and M. Naaman. Why we tag: motivations for annotation in mobile and on-line media. In CHI ’07: Proceedings of the SIGCHI conference on Human factors in computing systems, pages 971–980, New York, NY, USA, 2007. ACM.

[3] J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence, pages 43–52, 1998.

[4] M. J. Carman, M. Baillie, and F. Crestani. Tag data and personalized information re-trieval. In SSM ’08: Proceeding of the 2008 ACM workshop on Search in social media, pages 27–34, New York, NY, USA, 2008. ACM.

[5] A. M. Collins and E. F. Loftus. A spreading-activation theory of semantic processing.

Psychological Review, 82(6):407 – 428, 1975.

[6] S. A. Golder and B. A. Huberman. Usage patterns of collaborative tagging systems.

Journal of Information Science, 32(2):198–208, 2006.

[7] H. Halpin, V. Robu, and H. Shepherd. The complex dynamics of collaborative tagging. In 76

BIBLIOGRAPHY 77

WWW ’07: Proceedings of the 16th international conference on World Wide Web, pages 211–220, New York, NY, USA, 2007. ACM.

[8] C. Havasi, R. Speer, and J. Alonso. Conceptnet 3: a flexible, multilingual semantic network for common sense knowledge. In Recent Advances in Natural Language Pro-cessing, Borovets, Bulgaria, September 2007.

[9] Y.-C. Huang. Tag-based profile presentation with semantic relationship, June 2008.

[10] C.-C. Hung. Tag-based user profiling for social media recommendation, June 2008.

[11] R. J¨aschke, L. Marinho, A. Hotho, L. Schmidt-Thieme, and G. Stumme. Tag recommen-dations in folksonomies. In PKDD ’07: Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases, pages 506–514, Berlin, Heidelberg, 2007. Springer-Verlag.

[12] J. J. Jiang and D. W. Conrath. Semantic similarity based on corpus statistics and lexical taxonomy. In International Conference Research on Computational Linguistics, pages 9008+, September 1997.

[13] B. Krulwich. Lifestyle finder: Intelligent user profiling using large-scale demographic data. AI Magazine, 18(2):37–45, 1997.

[14] X. Li, L. Guo, and Y. E. Zhao. Tag-based social interest discovery. In WWW ’08: Pro-ceedings of the 17th international conference on World Wide Web, pages 675–684, New York, NY, USA, 2008. ACM.

[15] Y. Li, Z. A. Bandar, and D. McLean. An approach for measuring semantic similarity between words using multiple information sources. IEEE Transactions on Knowledge and Data Engineering, 15(4):871–882, 2003.

[16] H. Liu and P. Singh. Conceptnet — a practical commonsense reasoning tool-kit. BT Technology Journal, 22(4):211–226, 2004.

[17] A. Maedche and S. Staab. Measuring similarity between ontologies. In EKAW ’02: Pro-ceedings of the 13th International Conference on Knowledge Engineering and Knowl-edge Management. Ontologies and the Semantic Web, pages 251–263, London, UK, 2002. Springer-Verlag.

[18] C. man Au Yeung, N. Gibbins, and N. Shadbolt. A study of user profile generation from folksonomies. In Proceedings of the WWW 2008 Workshop on Social Web and Knowledge Management, 2008.

[19] D. L. Medin, R. L. Goldstone, and D. Gentner. Respects for similarity. Psychological Review, 100:254–278, 1993.

[20] E. Michlmayr and S. Cayzer. Learning User Profiles from Tagging Data and Leveraging them for Personal(ized) Information Access. In WWW ’07: Proceedings of the Workshop on Tagging and Metadata for Social Information Organization, 16th International World Wide Web Conference, May 2007.

[21] P. Mika. Ontologies are us: A unified model of social networks and semantics. In ISWC

’05: Proceedings of the 4th International Semantic Web Conference, volume 3729 of Lecture Notes in Computer Science, pages 522–536, Galway, Ireland, 2005. Springer.

[22] G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. J. Miller. Introduction to wordnet: An on-line lexical database. Journal of Lexicography, 3(4):235–244, 1990.

[23] G. A. Miller and W. G. Charles. Contextual correlates of semantic similarity. Language and Cognitive Processes, 6(1):1–28, 1991.

[24] M. F. Porter. An algorithm for suffix stripping. Program, 14(3):130–137, 1980.

[25] P. Resnik. Using information content to evaluate semantic similarity in a taxonomy. In IJCAI ’95: Proceedings of the 14th International Joint Conference on Artificial Intelli-gence, pages 448–453, 1995.

BIBLIOGRAPHY 79

[26] H. Rubenstein and J. B. Goodenough. Contextual correlates of synonymy. Commun.

ACM, 8(10):627–633, 1965.

[27] M. Sahami and T. D. Heilman. A web-based kernel function for measuring the similarity of short text snippets. In WWW ’06: Proceedings of the 15th international conference on World Wide Web, pages 377–386, New York, NY, USA, 2006. ACM.

[28] G. Salton. Introduction to Modern Information Retrieval. McGraw-Hill, Inc., New York, NY, USA, 1983.

[29] B. Sarwar, G. Karypis, J. Konstan, and J. Reidl. Item-based collaborative filtering recom-mendation algorithms. In WWW ’01: Proceedings of the 10th international conference on World Wide Web, pages 285–295, New York, NY, USA, 2001. ACM.

[30] R. Speer, C. Havasi, and H. Lieberman. Analogyspace: Reducing the dimensionality of common sense knowledge. In D. Fox and C. P. Gomes, editors, AAAI ’08: Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, pages 548–553. AAAI Press, 2008.

[31] A. Tversky. Features of similarity. Psychological Review, 84:327–352, 1977.

[32] G. Varelas, E. Voutsakis, P. Raftopoulou, E. G. Petrakis, and E. E. Milios. Semantic sim-ilarity methods in wordnet and their application to information retrieval on the web. In WIDM ’05: Proceedings of the 7th annual ACM international workshop on Web infor-mation and data management, pages 10–16, New York, NY, USA, 2005. ACM.

[33] Z. Wu and M. Palmer. Verbs semantics and lexical selection. In Proceedings of the 32nd annual meeting on Association for Computational Linguistics, pages 133–138, Morris-town, NJ, USA, 1994. Association for Computational Linguistics.

[34] D. Zeng and H. Li. How useful are tags? – an empirical analysis of collaborative tagging for web page recommendation. In PAISI, PACCF and SOCO ’08: Proceedings of the

80

相關文件