• 沒有找到結果。

Literature Survey

5.3 Future Work

There is still much work that should be done in the future. This section outlines future work that could extend the applicability and performance of the proposed summarization framework, including:

(1) The employment of natural language processing (NLP) techniques:

This thesis does not have much NLP techniques to help understand and analyze the input texts. We suppose that the use of NLP techniques, for example, information extraction, sentence parsing, lexical chains, co-reference chains, etc., can directly benefit the identification of text entities and their relationships, and hence leads to better understanding of texts and content selection.

(2) The utilization of domain knowledge and external resources:

This thesis uses no domain knowledge in the summarization process since it targets at general-purpose summarization of documents in public domains. Such a summarization framework would apparently not work well in all domains. It is expected that domain knowledge and external resources, such as patient record and disease information in medical domains, and terminologies in financial and sports areas, should improve the analysis of texts in particular domains.

(3) The consideration of language properties:

This thesis performs no deep linguistic analysis particular to languages and thus the proposed framework can be practically applied to documents in any languages, for which the decomposition of texts into word units in preprocessing can be done. It is believed that the framework is applicable to

multilingual/cross-lingual multidocument summarization as well, provided that machine translation modules are available, or the relevance of text portions in different languages can be determined.

(4) The investigation of new similarity measures:

This thesis exploits the cosine similarity metric (in vector space model and in latent semantic space) to measure the relations between each pair of sentences, as well as the relevance between a sentence and the query. We expect that more advanced techniques of assessing similarity, which incorporate word semantics and relations, will be easily integrated in the summarization model. In addition, using words or phrases with similar meanings to expand the user query will obviously profit the identification of query-biased sentences.

(5) The exploration of new surface-level features:

This thesis examines a subset of surface-level features and various combinations of them to determine the informativeness of sentences (or the likelihood of sentences of being part of the summary) in summarization.

Nevertheless, it is worth studying to discover other effective features, to identify the effect of a feature to summarization, as well as to investigate the relations between different features for feature selection.

(6) The application of machine learning techniques:

This thesis combines different features in an unsupervised manner to yield a sentence scoring function, for which parameters are tuned empirically. As more and more standard collections for training and test on evaluation of

summarization methods have been established recently, we intend to apply machine learning techniques to automatically learn an effective sentence scoring model from training data.

(7) The improvement of the summary quality:

This thesis adopts the most common technique in summarization, namely sentence extraction, to create extractive summaries. However, this strategy does not guarantee good summary quality in terms of coherence, cohesion, and overall organization, even though it may include good content in the summaries.

Fortunately, techniques to improve the quality of summaries, such as, information fusion and reformulation by natural language generation to produce abstractive summaries, passage simplification/compression to remove parts of, for example, a sentence without disturbing its understandability or underlying meaning, information ordering to yield coherent summaries, and anaphora resolution and time annotation to produce summaries with good readability, have proven successful in some degree.

(8) The use of different strategies for different types of the input document clusters:

This thesis uses the same strategy to deal with different types of the input document clusters. Such a single strategy has shown promising in evaluation.

However, we believe that a first step to examine the types of the document clusters, and then to process the documents using different strategies will probably generate better summaries. For instance, news articles can be classified into on the same event, on topically related but different events, natural disaster, biography, etc., for which different summarization strategies should be decided.

(9) The enhancement of visualization:

This thesis does not provide visualization of summaries. Obviously, it could be beneficial to the user by presenting visual information related to the content in summaries. The following gives some visualization examples in news summarization: the visualization of the spatial information, indicated in the summary, on a geographical map; a visual summary with the x-axis representing the timeline, the y-axis representing the location, and the (x, y) point labeled with keywords of news events, linking to the corresponding text summary.

(10) The addition of user interaction mechanisms:

This thesis only provides the user with simple controls, such as the length of the summary, and the summary type in generic and query-focused, over the summarization process. One shortage of such a system is the lack of dynamic response to the user’s need. Therefore, future work will add user interaction mechanisms into the proposed summarization framework. For instance, the linking of a summary sentence to the original document or to the most relevant sentences in the documents; the zoom-in and zoom-out of topics of a summary in the hierarchical structure; the control to obtain preferred summaries by relevance feedback of user interests.

Bibliography

[1] Afantenos, S., Karkaletsis, V., & Stamatopoulos, P. (2005). Summarization from medical documents: a survey. Artificial Intelligence in Medicine, 33(2), 157-177.

[2] Allan, J., Wade, C., & Bolivar, A. (2003). Retrieval and novelty detection at the sentence level. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp.

314-321). Toronto, ON, Canada.

[3] Amigó, E., Gonzalo, J., Peinado, V., Peñas, A., & Verdejo, F. (2004). An empirical study of information synthesis tasks. In Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (pp. 207-214).

Barcelona, Spain.

[4] Anderson, J. R. (1983). A spreading activation theory of memory. Journal of Verbal Learning and Verbal Behavior, 22, 261-295.

[5] Aone, C., Okurowski, M. E., & Gorlinsky, J. (1998). Trainable, scalable summarization using robust NLP and machine learning. In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics (pp. 62-66). Montreal, QC, Canada.

[6] Azzam, S., Humphreys, K., & Gaizauskas, R. (1999). Using coreference chains for text summarization. In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics (pp. 77-84). College Park, MD, USA.

[7] Barzilay, R., & Elhadad, M. (1997). Using lexical chains for text summarization.

In Proceedings of the ACL’97/EACL’97 workshop on intelligent scalable text summarization (pp. 10-17). Madrid, Spain.

[8] Barzilay, R., Elhadad, N., & McKeown, K. R. (2002). Inferring strategies for sentence ordering in multidocument news summarization. Journal of Artificial Intelligence Research, 17, 35-55.

[9] Barzilay, R., McKeown, K. R., & Elhadad, M. (1999). Information fusion in the context of multi-document summarization. In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics (pp. 550-557). College Park, MD, USA.

[10] Berger, A., & Mittal, V. O. (2000). Query-relevant summarization using FAQs. In Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (pp. 294-301). Hong Kong, China.

[11] Bergler, S., Witte, R., Li, Z., Khalife, M., Chen, Y., Doandes, M., &

Andreevskaia, A. (2004). Multi-ERSS and ERSS 2004. In Proceedings of the DUC 2004. Boston, MA, USA.

[12] Blair-Goldensohn, S. (2005). From definitions to complex topics: Columbia University at DUC 2005. In Proceedings of the DUC 2005. Vancouver, BC,

Canada.

[13] Bollen, J., Vandesompel, H., & Rocha, L. M. (1999). Mining associative relations from website logs and their applications to context-dependent retrieval using spreading activation. In Proceedings of the Workshop on Organizing Web Space. Berkeley, CA, USA.

[14] Boros, E., Kentor, P. B., & Neu, D. J. (2001). A clustering based approach to creating multi-document summaries. In Proceedings of the DUC 2001. New Orleans, LA, USA.

[15] Brunn, M., Chali, Y., & Pinchak, C. J. (2001). Text summarization using lexical chains. In Proceedings of the DUC 2001. New Orleans, LA, USA.

[16] Borko, H., & Bernier, C. (1975). Abstracting concepts and methods. Academic Press, NY: New York.

[17] Bosma, W. (2005). Query-Based Summarization Using Rhetorical Structure Theory. In Proceedings of the 15th Meeting of Computational Linguistics in the Netherlands (pp. 29-44). Leiden, Netherlands.

[18] Brin, S., & Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7), 107-117.

[19] Caid, W. R., Dumais, S. T., & Gallant, S. I. (1995). Learned vector space models for information retrieval. Information Processing & Management, 31(3), 419-429.

[20] Carbonell, J., & Goldstein, J. (1998). The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 335-336). Melbourne, Australia.

[21] Chen, H.-H., Kuo, J.-J., Huang, S.-J., Lin, C.-J., & Wang, H.-C. (2003). A summarization system for Chinese news from multiple sources. Journal of American Society for Information Science and Technology, 54(13), 1224-1236.

[22] Chen, Y.-M., Wang, X.-L., & Liu, B.-Q. (2005). Multi-document summarization based on lexical chains. In Proceedings of the 4th International Conference on Machine Learning and Cybernetics (pp. 1937-1942). Guangzhou, China.

[23] Collins, A. M., & Loftus, E. F. (1975). A spreading-activation theory of semantic processing. Psychological Review, 82(6), 407-428.

[24] Conroy, J. M., Schlesinger, J. D., & O’Leary, D. P. (2006). Topic-focused multi-document summarization using an approximate oracle score. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (COLING/ACL 2006) (pp. 152-159). Sydney, Australia.

[25] Crammer, K., and Singer, Y. (2002). PRanking with ranking. Advances in Neural Information Processing Systems, 14, 641-647.

[26] Cremmins, E. T. (1996). The art of abstracting. Information Resources Press, VA:

Arlington.

[27] D’Avanzo, E., & Magnini, B. (2005). A keyphrase-based approach to summarization: The LAKE system at DUC-2005. In Proceedings of the DUC 2005. Vancouver, BC, Canada.

[28] Dang, H. T. (2005). Overview of DUC 2005. In Proceedings of the DUC 2005.

Vancouver, BC, Canada.

[29] Daniel, N., Radev, D., & Allison, T. (2003). Sub-event based multi-document summarization. In Proceedings of the HLT-NAACL’03 Workshop on Text Summarization (pp. 9-16). Edmonton, AB, Canada.

[30] Daumé III, H., Echihabi, A., Marcu, D., Munteanu, D. S., & Soricut, R. (2002).

GLEANS : A generator of logical extracts and abstracts for nice summaries. In Proceedings of the DUC 2002. Philadelphia, PA, USA.

[31] Daumé III, H., & Marcu, D. (2006). Bayesian Query-Focused Summarization. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (pp.

305-312). Sydney, Australia.

[32] Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., & Harshman, R.

(1990). Indexing by latent semantic analysis, Journal of the American Society for Information Science, 41(6), 391-407.

[33] DeJong, G. F. (1982). An overview of the FRUMP system. In W. G. Lehnert, &

M. H. Ringle (Eds.), Strategies for natural language processing. Hillsdale, NJ:

Lawrence Erlbaum.

[34] DUC (Document Understanding Conferences): <http://duc.nist.gov/>.

[35] Dumais, S. T., Landauer, T. K., & Littman, M. L. (1996). Automatic cross-linguistic information retrieval using latent semantic indexing. In Proceedings of SIGIR’96 Workshop on Cross-Linguistic Information Retrieval (pp. 16-23). Zurich, Switzerland.

[36] Edmundson, H. P. (1969). New methods in automatic extracting. Journal of the ACM, 16(2), 264-285.

[37] Elhadad, N. & McKeown, K. R. (2001). Towards generating patient specific summaries of medical articles. In Proceedings of the NAACL 2001 Workshop on Automatic Summarization. Pittsburgh, PA, USA.

[38] Erkan, G. (2006). Using biased random walks for focused summarization. In Proceedings of the DUC 2006. Brooklyn, NY, USA.

[39] Erkan, G., & Radev, D. R. (2004). LexRank: Graph-based lexical centrality as salience in text summarization. Journal of Artificial Intelligence Research, 22, 457-479.

[40] Fish, S., & Roark, B. (2006). Query-focused summarization by supervised sentence ranking and skewed word distributions. In Proceedings of the DUC 2006. Brooklyn, NY, USA.

[41] Fuentes, M., Alfonseca, E., & Rodríguez, H. (2007). Support vector machines for query-focused summarization trained and evaluated on Pyramid data. In

Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007) (pp. 57-60). Prague, Czech Republic.

[42] Fum, D., Guida, G., & Tasso, C. (1985). Evaluating importance: a step towards text summarization. In Proceedings of the 9th International Joint Conference on Artificial Intelligence (pp. 840-844). Los Angeles, CA, USA.

[43] Ge, J., Huang, X., & Wu, L. (2003). Approaches to event-focused summarization based on named entities and query words. In Proceedings of the DUC 2003.

Edmonton, AB, Canada.

[44] Goldstein, J., Mittal, V., Carbonell, J., & Kantrowitz, M. (2000). Multi-document summarization by sentence extraction. In Proceedings of NAACL-ANLP 2000 Workshop on Automatic Summarization (pp. 40-48). Seattle, WA, USA.

[45] Gong, Y., & Liu, X. (2001). Generic text summarization using relevance measure and latent semantic analysis. In Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 19-25). New Orleans, LA, USA.

[46] Gupta, S., Nenkova, A., & Jurafsky, D. (2007). Measuring importance and query relevance in topic-focused multi-document summarization. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007) (pp. 193-196). Prague, Czech Republic.

[47] Hachey, B., Murray, G., & Reitter, D. (2005). The Embra system at DUC 2005:

Query-oriented multi-document summarization with a very large latent Semantic space. In Proceedings of the DUC 2005. Vancouver, BC, Canada.

[48] Hahn, U., & Mani, I. (2000). The challenges of automatic summarization.

Computer, 33(11), 29-36.

[49] Harabagiu, S., & Lacatusu, F. (2005). Topic themes for multi-document summarization. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp.

202-209). Salvador, Brazil.

[50] Harabagiu, S. M., & Maiorano, S. J. (2000). Acquisition of linguistic patterns for knowledge-based information extraction. In Proceedings of the 2nd International Conference on Language Resources & Evaluation. Athens, Greece.

[51] Harabagiu, S. M., & Lacatusu, F. (2002). Generating single and multi-document summaries with GIXTexter. In Proceedings of the DUC 2002. Philadelphia, PA, USA.

[52] Harnly, A., Nenkova, A., Passonneau, R., & Rambow, O. (2005). Automation of summary evaluation by the Pyramid method. In Proceedings of 2005 International Conference on Recent Advances in Natural Language Processing.

Borovets, Bulgaria.

[53] Hatzivassiloglou, V., Klavans, J. L., Holcombe, M. L., Barzilay, R., Kan, M.-Y.,

& McKeown, K. R. (2001). SimFinder: a flexible clustering tool for summarization. In Proceedings of NAACL Workshop on Automatic Summarization (pp. 41-49). Pittsburgh, PA, USA.

[54] Hirao, T., Takeuchi, K., Isozaki, H., Sasaki, Y., & Maeda, E. (2002).

NTT/NAIST’s Text Summarization Systems for TSC-2. In Proceedings of the 3rd NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering (pp. 13-18). Tokyo, Japan.

[55] Hovy, E., & Lin, C.-Y. (1997). Automated text summarization in SUMMARIST.

In Proceedings of the ACL’97/EACL’97 Workshop on Intelligent Scalable Text Summarization (pp. 18-24). Madrid, Spain.

[56] Hovy, E., Lin, C.-Y., & Zhou, L. (2005). A BE-based multidocument summarizer with query interpretation. In Proceedings of the DUC 2005. Vancouver, BC, Canada.

[57] Hovy, E., & Marcu, D. (1998). Tutorial on automated text summarization.

Presented at COLING-ACL’98. Montreal, QC, Canada.

[58] Huang, Z., Chen, H., & Zeng, D. (2004). Applying associative retrieval techniques to alleviate the sparsity problem in collaborative filtering. ACM Transactions on Information Systems, 22(1), 116-142.

[59] Jagarlamudi, J., Pingali, P., & Varma, V. (2005). A relevance-based language modeling approach to DUC 2005. In Proceedings of the DUC 2005. Vancouver, BC, Canada.

[60] Jagarlamudi, J., Pingali, P., & Varma, V. (2006). Query independent sentence scoring approach to DUC 2006. In Proceedings of the DUC 2006. Brooklyn, NY, USA.

[61] Kan, M.-Y., & Klavans, J. L. (2002). Using librarian techniques in automatic text summarization for information retrieval. In Proceedings of the 2nd ACM/IEEE-CS Joint Conference on Digital Libraries (pp. 36-45). Portland, OR, USA.

[62] Kan, M.-Y., McKeown, K. R., & Klavans, J. L. (2001). Domain-specific informative and indicative summarization for information retrieval. In Proceedings of the DUC 2001. New Orleans, LA, USA.

[63] Kan, M.-Y., McKeown, K. R., & Klavans, J. L. (2001). Applying natural language generation to indicative summarization. In Proceedings of the 8th European Workshop on Natural Language Generation (pp. 1-9). Toulouse, France.

[64] Kleinberg, J. M. (1999). Authoritative sources in a hyperlinked environment.

Journal of the ACM, 46(5), 604-632.

[65] Kuo, J.-J., Wung, H.-C., Lin, C.-J., & Chen, H.-H. (2002). Multi-document summarization using informative words and its evaluation with a QA system. In Proceedings of the 3rd International Conference on Intelligent Text Processing and Computational Linguistics (Lecture Notes in Computer Science, 2276) (pp.

391-401). Mexico City, Mexico.

[66] Kupiec, J., Pedersen, J., & Chen, F. (1995). A trainable document summarizer. In Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 68-73). Seattle, WA,

USA.

[67] Lacatusu, F., Hickl, A., Roberts, K., Shi, Y., Bensley, J., Rink, B., Wang, P., &

Taylor, L. (2006). LCC’s GISTexter at DUC 2006: Multi-strategy multi-document summarization. In Proceedings of the DUC 2006. Brooklyn, NY, USA.

[68] Lavrenko, V., & Croft, W. B. (2001). Relevance-based language models. In Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2001) (pp. 120-127).

New Orleans, LA, USA.

[69] Lehnert, W. G. (1982). Plot units: a narrative summarization strategy. In W. G.

Lehnert, & M. H. Ringle (Eds.), Strategies for natural language processing (pp.

375-412). Hillsdale, NJ: Lawrence Erlbaum.

[70] Lenci, A., Bartolini, R., Calzolari, N., Agua, A., Busemann, S., Cartier, E., Chevreau, K., & Coch, J. (2002). Multilingual summarization by integrating linguistic resources in the MLIS-MUSI project. In Proceedings of the 3rd International Conference on Language Resources and Evaluation (pp.

1464-1471). Canary Islands, Spain.

[71] Leuski, A., Lin, C.-Y., & Hovy, E. (2003). iNeATS: Interactive Multi-document Summarization. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (pp. 125-128). Sapporo, Japan.

[72] Li, W., Li, W., Li, B., Chen, Q., & Wu, M. (2005). The Hong Kong Polytechnic University at DUC2005. In Proceedings of the DUC 2005. Vancouver, BC, Canada.

[73] Li, S., Ouyang, Y., Sun, B., Guo, Z. (2006). Peking University at DUC 2006. In Proceedings of the DUC 2006. Brooklyn, NY, USA.

[74] Li, S., Ouyang, Y., Wang, W., & Sun, B. (2007). Multi-document summarization using support vector regression. In Proceedings of the DUC 2007. Rochester, NY, USA.

[75] Li, J., Sun, L., Kit, C., Webster, J. (2007). A query-focused multi-document summarizer based on lexical chains. In Proceedings of the DUC 2007. Rochester, NY, USA.

[76] Lin, C.-Y. (1999). Training a selection function for extraction. In Proceedings of the 8th International Conference on Information and Knowledge Management (pp. 55-62). Kansas City, MO, USA.

[77] Lin, C.-Y., & Hovy, E. (2002). From single to multi-document summarization: a prototype system and its evaluation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (pp. 457-464). Philadelphia, PA, USA.

[78] Lin, C.-Y., & Hovy, E. (2002). NeATS in DUC 2002. In Proceedings of the DUC 2002. Philadelphia, PA, USA.

[79] Lin, C.-Y., & Hovy, E. (2003). Automatic evaluation of summaries using N-gram co-occurrence statistics. In Proceedings of the 3rd International Conference on

Human Language Technology Research and 3rd Meeting of the North American Chapter of the Association for Computational Linguistics (pp. 71-78). Edmonton, AB, Canada.

[80] Lin, C.-Y., & Hovy, E. (2003). The potential and limitations of automatic sentence extraction for summarization. In Proceedings of the HLT-NAACL 03 on Text Summarization Workshop (pp. 73-80). Edmonton, Canada.

[81] Littman, M. L., Dumais, S. T., Landauer, T. K. (1996). Automatic cross-language information retrieval using latent semantic indexing. In Proceedings of SIGIR’96 Workshop on Cross-Linguistic Information Retrieval (pp. 16-23). Zurich, Switzerland.

[82] Luhn, H. P. (1958). The automatic creation of literature abstracts. IBM Journal of Research and Development, 2(2), 159-165.

[83] Lund, K., & Burgess, C. (1996). Producing high-dimensional semantic spaces from lexical co-occurrence. Behavior Research Methods, Instrumentation, and Computers, 28, 203-208.

[84] Maña-López, M. J., Buenaga, M. D., & Gómez-Hidalgo, J. M. (2004).

Multidocument summarization: An added value to clustering in interactive retrieval. ACM Transactions on Information Systems, 22(2), 215-241.

[85] Mani, I. (2001). Automatic Summarization. Amsterdam, Netherlands: John Benjamins Pub Co.

[86] Mani, I., & Bloedorn, E. (1999). Summarizing similarities and differences among related documents. Information Retrieval, 1(1-2), 35-67.

[87] Mani, I., & Maybury, M. T. (Eds.). (1999). Advances in automatic text summarization. Cambridge, MA: The MIT Press.

[88] Mann, W. C., & Thompson, S. A. (1988). Rhetorical structure theory: Toward a function theory of text organization, Text, 8(3), 243-281.

[89] Marcu, D. (2000). The theory and practice of discourse parsing and summarization. Cambridge, MA: The MIT Press.

[90] Mayeng, S. H., & Jang, D. (1999). Development and evaluation of a statistically based document summarization system. In I. Mani & M. T. Maybury (Eds.), Advances in automatic text summarization (pp. 61–70). Cambridge, MA: The MIT Press.

[91] McDonald, D. M., & Chen, H. (2006). Summary in context: Searching versus browsing. ACM Transactions on Information Systems, 24(1), 111-141.

[92] McKeown, K., Barzilay, R., Chen, J., Elson, D., Evans, D., Klavans, J., Nenkova, A., Schiffman, B., & Sigelman, S. (2003). Columbia’s Newsblaster: New features and future directions. In Proceedings of the 3rd International Conference on Human Language Technology Research and 3rd Meeting of the North American Chapter of the Association for Computational Linguistics (pp.

15-16). Edmonton, AB, Canada.

[93] McKeown, K. R., Barzilay, R., Evans, D., Hatzivassiloglou, V., Klavans, J. L.,

Nenkova, A., Sable, C., Schiffman, B., & Sigelman, S. (2002). Tracking and summarizing news on a daily basis with Columbia’s Newsblaster. In Proceedings of the 2nd International Conference on Human Language Technology Research (pp. 280-285). San Diego, CA, USA.

[94] McKeown, K. R., Chang, S.-F., Cimino, J., Feiner, S., Friedman, C., Gravano, L., Hatzivassiloglou, V., Johnson, S., Jordan, A. D., Klavans, J. L., Kushniruk, A., Patel, V., & Teufel, S. (2001). PERSIVAL, a system for personalized search and summarization over multimedia healthcare information. In Proceedings of the 1st ACM/IEEE-CS Joint Conference on Digital Libraries (pp. 331-340). Roanoke, VA, USA.

[95] McKeown, K., Hirschberg, J., Galley, M., & Maskey, S. (2005). From text to speech summarization. In Proceedings of the 30th International Conference on Acoustics, Speech, and Signal Processing (pp. 997-1000). Philadelphia, PA, USA.

[96] McKeown, K. R., Klavans, J. L., Hatzivassiloglou, V., Barzilay, R., & Eskin, E.

(1999). Towards multidocument summarization by reformulation: progress and prospects. In Proceedings of the 16th National Conference on Artificial Intelligence (pp. 453-460). Orlando, FL, USA.

[97] McKeown, K., & Radev, D. R. (1995). Generating summaries of multiple news articles. In Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 74-82). Seattle, WA, USA.

[98] Mihalcea, R. (2004). Graph-based ranking algorithms for sentence extraction, applied to text summarization. In Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (pp. 170-173). Barcelona, Spain.

[99] Mihalcea, R., Corley, C., & Strapparava, C. (2006). Corpus-based and knowledge-based measures of text semantic similarity. In Proceedings of the 21st National Conference on Artificial Intelligence. Boston, MA, USA.

[100] Mihalcea, R., & Tarau, P. (2005). An algorithm for language independent single and multiple document summarization. In Proceedings of the 2nd International Joint Conference on Natural Language Processing (pp. 19-24).

Jeju Island, Korea.

[101] Moens, M.-F., Uyttendaele, C., & Dumortier, J. (1999). Abstracting of legal cases: The potential of clustering based on the selection of representative objects.

Journal of the American Society for Information Science, 50(2), 151-161.

[102] Noble, B., & Daniel, J. W. (1988). Applied linear algebra. Englewood Cliffs, NJ: Prentice Hall.

[103] Paice, C. D. (1990). Constructing literature abstracts by computer:

Techniques and prospects. Information Processing & Management, 26(1), 171-186.

[104] Papineni, K., Roukos, S., Ward, T., & Zhu, W.-J. (2001). BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (pp. 311-318).

Philadelphia, PA, USA.

[105] Pirolli, P., Pitkow, J., Rao, R. (1996). Silk from a sow’s ear: extracting usable structures from the Web. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (pp. 118-125). Vancouver, BC, Canada.

[106] Quillian, M. R. (1968). Semantic memory. In M. Minsky (Ed.), Semantic Information Processing (pp. 227-270). Cambridge, MA: The MIT Press.

[107] Radev, D. (2001). Tutorial: Text summarization. Presented at the ACM SIGIR 2001. New Orleans, LA, USA.

[108] Radev, D. R., Blair-Goldensohn, S., Zhang, Z., & Raghavan, R. S. (2001).

NewsInEssence: A system for domain-independent, real-time news clustering and multi-document summarization. In Proceedings of the 1st International Conference on Human Language Technology Research. San Diego, CA, USA.

[109] Radev, D. R., Fan, W. & Zhang, Z. (2001). WebInEssence: A personalized Web-based multi-document summarization and recommendation system. In Proceedings of NAACL 2001 Workshop on Automatic Summarization. Pittsburgh,

[109] Radev, D. R., Fan, W. & Zhang, Z. (2001). WebInEssence: A personalized Web-based multi-document summarization and recommendation system. In Proceedings of NAACL 2001 Workshop on Automatic Summarization. Pittsburgh,

相關文件