Total 16 Attribute-specifying bundles 9
7.5 Suggestions for Future Research
The present study has identified spoken and news bundles in Chinese, and the identification method can be applied to other text types in Chinese, such as academic texts (e.g., journal articles, university textbooks) and online discussions. We will gain more insights into the role of lexical bundles in fulfilling the communicative needs of different text types. Additionally, a variationist perspective can be adopted to compare lexical bundles used in a wide range of Chinese-speaking communities.
Some aspects of lexical bundles have not been adequately addressed, even in English. An intriguing issue is the relationship between the position and the function of lexical bundles (Cortes and Csomay 2007, Ansari and Molavi 2013) (see Section 2.2). The present study has examined the general use of Chinese bundles in news, and a further study can focus on lexical bundles in the leads (i.e., the first one or two sentences) of news reports to investigate how lexical bundles are used to package the most important information of a news report and draw more attention from the reader.
Further psycholinguistic investigations are also needed to explore the storage and processing of lexical bundles in the mental lexicon, so that more converging evidence for the usage-based model (e.g., Bybee 2007) can accumulate. For example, the common lexical bundle I don’t know is argued to be a storage/processing unit, and corpus data show that I don’t occurs far more frequently than don’t know (Bybee 2007). How the probabilistic analysis is run in the brain to closely associate the three words I, don’t, and know remains unsolved (Tremblay et al. 2009, Gries 2013). Before we can arrive at a convincing answer, various comprehension and production experiments (e.g., grammaticality judgment tasks, gap-filling tasks, dictation tasks, self-paced reading tasks) would be necessary. Besides, it would be interesting to assess whether the frequency effect can predict the subjects’ performance more reliably than other quantitative measures (e.g., dispersion measures, internal association measures).
There are other more practical aspects of lexical bundles. Although lexical bundles are seen as semantically transparent, many studies have revealed that there is a gap between native and non-native speakers’ use of lexical bundles (see Section 2.2).
The gap may arise from the fact that some lexical bundles have undergone semantic changes and developed non-compositional meanings. Therefore, more empirical
studies are needed to investigate the role of semantic transparency in the acquisition of lexical bundles by Chinese second language learners. The results will have crucial pedagogical implications for which bundles need to be taught explicitly and how to teach lexical bundles. A corpus-based study investigating how lexical bundles in Chinese are translated would also be useful (Ji 2010). A Chinese bundle may be simply translated into its equivalent bundle in another language. Such an equivalent may not always be available, though; we may wonder what strategies expert translators usually adopt. The results of translation studies on lexical bundles will have important implications for the training of novice translators and the development of machine translation. Another intriguing direction is to adopt advanced techniques in the field of data science to examine the idiosyncratic use of lexical bundles on social networking sites. This may help to identify the linguistic patterns of different communities.
There are various kinds of multi-word expressions (see Section 2.1), and lexical bundles are just one of them. A better understanding of the relationship between lexical bundles and other kinds of multi-word units in Chinese needs to be developed.
A lexical bundle may be a fragment of a longer lexico-syntactic frame. For instance, the lexical bundle de guocheng zhong ‘DE process middle; (in) the process of’ is part of the frame ‘zai + event + de guocheng zhong’. Some lexical bundles may be regarded as the actual realization of a more abstract construction. For instance, many referential expressions are derived from the construction ‘number/demonstrative + classifier/quantifier + noun’. A close examination of each lexical bundle and a clear delineation of the relationship between lexical bundles and other types of multi-word units may herald an effective method to systematically identify constructions and lexico-syntactic frames in Chinese.
References
Aijmer, Karin. 2008. “So er I just sort I dunno I think it’s just because…”: A corpus study of I don’t know and dunno in learners’ spoken English. In: Andreas H.
Jucker, Daniel Schreier and Marianne Hundt (eds.), Corpora: Pragmatics and Discourse, 151-168. Amsterdam: Rodopi.
Aijmer, Karin and Altenberg, Bengt. 1996. Introduction. In: Karin Aijmer, Bengt Altenberg and Mats Johansson (eds.), Languages in Contrast: Papers from a Symposium on Text-based Cross-linguistic Studies, 11-16. Lund: Lund University Press.
Altenberg, Bengt. 1998. On the phraseology of spoken English: The evidence of recurrent word-combinations. In: A. P. Cowie (ed.), Phraseology: Theory, Analysis, and Applications, 101-122. Oxford: Oxford University Press.
Altenberg, Bengt and Eeg-Olofsson, Mats. 1990. Phraseology in spoken English:
Presentation of a project. In: Jan Aarts and Willem Meijs (eds.), Theory and Practice in Corpus Linguistics, 1-26. Amsterdam: Rodopi.
Ansari, Siamak and Molavi, Arezoo. 2013. Textual positions of lexical bundles across newspaper genres. International Research Journal of Applied and Basic Sciences 4(9): 2484-2490.
Baayen, R. Harald. 2003. Probabilistic approaches to morphology. In Rens Bod, Jennifer Hay and Stefanie Jannedy (eds.), Probabilistic Linguistics, 229-288.
Cambridge: MIT Press.
Baroni, Marco and Evert, Stefan. 2008. Statistical methods for corpus exploitation. In:
Anke Lüdeling and Merja Kytö (eds.), Corpus Linguistics: An International Handbook, 777-803. Berlin: Mouton de Gruyter.
Biber, Douglas. 2009. A corpus-driven approach to formulaic language in English:
Multi-word patterns in speech and writing. International Journal of Corpus
Linguistics 14(3): 275-311.
Biber, Douglas and Barbieri, Federica. 2007. Lexical bundles in university spoken and written registers. English for Specific Purposes 26(3): 263-286.
Biber, Douglas, Conrad, Susan and Cortes, Viviana. 2003. Lexical bundles in speech and writing: An initial taxonomy. In: Andrew Wilson, Paul Rayson and Tony McEnery (eds.), Corpus Linguistics by the Lune: A Festschrift for Geoffrey Leech, 71-92. Frankfurt am Main: Peter Lang.
Biber, Douglas, Conrad, Susan and Cortes, Viviana. 2004. If you look at…: Lexical bundles in university teaching and textbooks. Applied Linguistics 25(3):
371-405.
Biber, Douglas and Finegan, Edward. 1989. Styles of stance in English: Lexical and grammatical marking of evidentiality and affect. Text 9(1): 93-124.
Biber, Douglas, Johansson, Stig, Leech, Geoffrey, Conrad, Susan and Finegan, Edward. 1999. Longman Grammar of Spoken and Written English. London and New York: Longman.
Biq, Yung-O. 2004. Construction, reanalysis, and stance: ‘V yi ge N’ and variations in Mandarin Chinese. Journal of Pragmatics 36(9): 1637-1654.
Biq, Yung-O. 2015. Adverbs. In: William S-Y. Wang and Chaofen Sun (eds.), The Oxford Handbook of Chinese Linguistics, 414-428. Oxford: Oxford University Press.
Bolander, Maria. 1989. Prefabs, patterns and rules in interaction? Formulaic speech in adult learners’ L2 Swedish. In: Kenneth Hyltenstam and Loraine K. Obler (eds.), Bilingualism across the Lifespan: Aspect of Acquisition, Maturity and Loss, 73-86. Cambridge: Cambridge University Press.
Bolinger, Dwight. 1961. Syntactic blends and other matters. Language 37(3):
366-381.
Bowers, Jeffrey S., Davis, Colin J. and Hanley, Derek A. 2005. Automatic semantic activation of embedded words: Is there a “hat” in “that”? Journal of Memory and Language 52(1): 131-143.
Brooks, Patricia and Tomasello, Michael. 1999. How children constrain their argument structure constructions. Language 75(4): 720-738.
Butler, Chris. 1997. Repeated word combinations in spoken and written text: Some implications for Functional Grammar. In: Chris Butler, John Connolly, Richard A.
Gatward and Roel M. Vismans (eds.), A Fund of Ideas: Recent Development in Functional Grammar, 60-77. Amsterdam: Institute for Functional Research into Language and Language Use.
Bybee, Joan L. 1985. Morphology: A Study of the Relation between Meaning and Form. Amsterdam and Philadelphia: John Benjamins.
Bybee, Joan. 2007. Frequency of Use and the Organization of Language. Oxford:
Oxford University Press.
Bybee, Joan L. and Scheibman, Joanne. 1999. The effect of usage on degrees of constituency: The reduction of don’t in English. Linguistics 37(4): 575-596.
Cao, Juan and Zhou, Jing-ye. 2004. A new method for calculating relativity of Chinese strings. Journal of Chinese Information Processing 18(4): 55-59. (曹 娟、周經野。2004。一種計算漢字串之間相關程度的新方法。《中文信息學 報》,第18 期第 4 卷,頁 55-59。)
Carroll, John B. 1970. An alternative to Juilland’s usage coefficient for lexical frequencies and a proposal for a standard frequency index. Computer Studies in the Humanities and Verbal Behavior 3(2): 61-65.
Chafe, Wallace L. 1968. Idiomaticity as an anomaly in the Chomskyan paradigm.
Foundations of Language 4: 109-125.
Chafe, Wallace L. 1985. Linguistic differences produced by differences between
speaking and writing. In: David R. Olson, Nancy Torrance and Angela Hildyard (eds.), Literacy, Language, and Learning: The Nature and Consequences of Reading and Writing, 105-123. Cambridge: Cambridge University Press.
Chen, Lin. 2010. An investigation of lexical bundles in ESP textbooks and electrical engineering introductory textbooks. In: David Wood (ed.), Perspectives on Formulaic Language: Acquisition and Communication, 107-125. London and New York: Continuum.
Chen, Ru and Zhu, Xiaoya. 2012. Huayu Changyong Juxing yu Jiegou 330 ‘Common Chinese Patterns 330’. Taipei: Bookman. (陳如、朱曉亞。2012。《華語常用句
型與結構330》。臺北:書林。)
Chen, Yu-Hua and Baker, Paul. 2010. Lexical bundles in L1 and L2 academic writing.
Language Learning & Technology 14(2): 30-49.
Conklin, Kathy and Schmitt, Norbert. 2008. Formulaic sequences: Are they processed more quickly than nonformulaic language by native and nonnative speakers?
Applied Linguistics 29(1): 72-89.
Conrad, Susan, and Biber, Douglas. 2004. The frequency and use of lexical bundles in conversation and academic prose. Lexicographica 20: 56-71.
Cortes, Viviana. 2002. Lexical bundles in freshman composition. In: Randi Reppen, Susan M. Fitzmaurice and Douglas Biber (eds.), Using Corpora to Explore Linguistic Variation, 131-145. Amsterdam and Philadelphia: John Benjamins.
Cortes, Viviana. 2004. Lexical bundles in published and student disciplinary writing:
Examples from history and biology. English for Specific Purposes 23(4):
397-423.
Cortes, Viviana. 2008. A comparative analysis of lexical bundles in academic history writing in English and Spanish. Corpora 3(1): 43-57.
Cortes, Viviana and Csomay, Eniko. 2007. Positioning lexical bundles in university
lectures. In: Mari Carmen Campoy and María José Luzón (eds.), Spoken Corpora in Applied Linguistics, 57-76. Frankfurt am Main: Peter Lang.
Coulmas, Florian. 1981. Conversational Routine. The Hague: Mouton.
Cowie, A. P. 1981. The treatment of collocations and idioms in learners’ dictionaries.
Applied Linguistics 2(3): 223-235.
Cowie, A. P. 1994. Phraseology. In: Ronald E. Asher (ed.), The Encyclopedia of Language and Linguistics, 3168-3171. Oxford: Pergamon.
Cowie, A. P. 1998. Phraseological dictionaries: Some east-west comparisons. In: A. P.
Cowie (ed.), Phraseology: Theory, Analysis, and Applications, 209-228. Oxford:
Oxford University Press.
Coxhead, Averil. 2000. A new academic word list. TESOL Quarterly 34(2): 213-238.
Crystal, David. 2008. A Dictionary of Linguistics and Phonetics (the sixth edition).
Oxford: Blackwell.
Cui, Xiliang. 2006. Hanyu Shouyu yu Zhongguo Renwen Shijie ‘Chinese Idioms and Chinese Humanistic Inner World’. Beijing: Beijing Language and Culture University Press. (崔希亮。2006。《漢語熟語與中國人文世界》。北京:北京 語言大學出版社。)
Culpeper, Jonathan and Kytö, Merja. 2002. Lexical bundles in Early Modern English dialogues: A window into the speech-related language of the past. In: Teresa Fanego, Belén Méndez-Naya and Elena Seoane (eds.), Sounds, Words, Texts, and Change, 45-63. Amsterdam and Philadelphia: John Benjamins.
De Cock, Sylvie. 1998. A recurrent word combination approach to the study of formulae in the speech of native and non-native speakers of English.
International Journal of Corpus Linguistics 3(1): 59-80.
Dolch, E. W. 1936. A basic sight vocabulary. The Elementary School Journal 36(6):
456-460.
Dooley, Robert A. and Levinsohn, Stephen H. 2001. Analyzing Discourse: A Manual of Basic Concept. SIL International.
Ellis, Nick C. 1996. Sequencing in SLA: Phonological memory, chunking, and points of order. Studies in Second Language Acquisition 18(1): 91-126.
Ellis, Nick C. 1998. Emergentism, connectionism and language learning. Language Learning 48(4): 631-664.
Ellis, Nick C. 2002. Frequency effects in language processing: A review with implications for theories of implicit and explicit language acquisition. Studies in Second Language Acquisition 24(2): 143-188.
Ellis, Nick C. 2003. Construction, chunking, and connectionism: The emergence of second language structure. In: Catherine J. Doughty and Michael H. Long (eds.), The Handbook of Second Language Acquisition, 63-103. Oxford: Blackwell.
Ellis, Nick C., Simpson-Vlach, Rita and Maynard, Carson. 2008. Formulaic language in native and second language speakers: Psycholinguistics, corpus linguistics, and TESOL. TESOL Quarterly 42(3): 375-396.
Erman, Britt and Warren, Beatrice. 2000. The idiom principle and the open choice principle. Text & Talk 20(1): 29-62.
Feng, Shengli. 1997. Prosodically determined word-formation in Mandarin Chinese.
Social Science in China 18(4): 120-137.
Fillmore, Charles J. 1978. On the organization of semantic information in the lexicon.
In: Donka Farkas, Wesley M. Jacobsen and Karol W. Todrys (eds.), Papers from the Parasession on the Lexicon, 148-173. Chicago: Chicago Linguistic Society.
Firth, John R. 1957. Papers in Linguistics 1934-1951. Oxford: Oxford University Press.
Flowerdew, Lynne. 2012. Corpora and Language Education. New York: Palgrave Macmillan.
Foster, Pauline. 2001. Rules and routines: A consideration of their role in the task-based language production of native and non-native speakers. In: Martin Bygate, Peter Skehan and Merrill Swain (eds.), Researching Pedagogic Tasks:
Second Language Learning, Teaching, and Testing, 75-93. London and New York: Longman.
Fu, Huaiqing. 1985. Xiandai Hanyu Cihui ‘Modern Chinese Lexicon’. Beijing:
Peking University Press. (符淮青。1985。《現代漢語詞匯》。北京:北京大學 出版社。)
Goldberg, Adele E., Casenhiser, Devin and Sethuraman, Nitya. 2004. Learning argument structure generalizations. Cognitive Linguistics 14(3): 289-316.
Granger, Sylviane and Paquot, Magali. 2008. Disentangling the phraseological web.
In: Sylviane Granger and Fanny Meunier (eds.), Phraseology: An Interdisciplinary Perspective, 27-50. Amsterdam and Philadelphia: John Benjamins.
Gries, Stefan Th. 2008a. Phraseology and linguistic theory: A brief survey. In:
Sylviane Granger and Fanny Meunier (eds.), Phraseology: An Interdisciplinary Perspective, 3-25. Amsterdam and Philadelphia: John Benjamins.
Gries, Stefan Th. 2008b. Dispersion and adjusted frequencies in corpora.
International Journal of Corpus Linguistics 13(4): 403-437.
Gries, Stefan Th. 2009. Quantitative Corpus Linguistics with R: A Practical Introduction. London: Routledge.
Gries, Stefan Th. 2013. 50-something years of work on collocations: What is or should be next… International Journal of Corpus Linguistics 18(1): 137-165.
Gries, Stefan Th. 2014. Frequency tables: Tests, effect sizes, and explorations. In:
Dylan Glynn and Justyna A. Robinson (eds.), Corpus Methods for Semantics:
Quantitative Studies in Polysemy and Synonymy, 365-389. Amsterdam and
Philadelphia: John Benjamins.
Han, Ke-song, Wang, Yong-cheng and Chen, Gui-lin. 2000. Research on fast high-frequency strings extracting and statistics algorithm with no thesaurus.
Journal of Chinese Information Processing 15(2): 23-30. (韓客松、王永成、陳 桂林。2000。無詞典高頻字串快速提取和統計算法研究。《中文信息學報》,
第15 期第 2 卷,頁 23-30。)
Hoey, Michael and O’Donnell, Matthew Brook. 2008. Lexicography, grammar, and textual position. Journal of Lexicography 21(3): 293-309.
Hopper, Paul J. 1991. On some principles of grammaticalization. In: Elizabeth C.
Traugott and Bernd Heine (eds.), Approaches to Grammaticalization, 17-35.
Amsterdam: John Benjamins.
Howarth, Peter. 1998. The phraseology of learners’ academic writing. In: A. P. Cowie (ed.), Phraseology: Theory, Analysis, and Applications, 161-186. Oxford: Oxford University Press.
Hsieh, Shu-Kai. 2015. DeepLEX: A multi-dimensional Chinese lexical resource.
Paper presented at the CKIP 30th Anniversary Workshop on Chinese Lexicon.
Hu, Liang and Tang, Xuri. 2014. Multiword expression extraction based on word relativity. International Journal of Knowledge and Language Processing 5(1):
27-40.
Huang, Borong and Liao, Xudong. 2002. Xiandai Hanyu ‘Modern Chinese’ (the third edition). Beijing: Higher Education Press. (黃伯榮、廖序東。2002。《現代漢語》
增訂三版。北京:高等教育出版社。)
Huang, C.-T. James, Li, Y.-H. Audrey and Li, Yafei. 2009. The Syntax of Chinese.
Cambridge: Cambridge University Press.
Hunston, Susan. 2002. Corpora in Applied Linguistics. Cambridge: Cambridge University Press.
Hyland, Ken. 2008. As can be seen: Lexical bundles and disciplinary variation.
English for Specific Purposes 27(1): 4-21.
Hyland, Ken. 2012. Bundles in academic discourse. Annual Review of Applied Linguistics 32: 150-169.
Ji, Meng. 2010. Phraseology in Corpus-based Translation Studies. Frankfurt am Main:
Peter Lang.
Jiang, Nan and Nekrasova, Tatiana M. 2007. The processing of formulaic sequences by second language speakers. The Modern Language Journal 91(3): 433-445.
Johansson, Stig. 2007. Seeing Through Multilingual Corpora: On the Use of Corpora in Contrastive Studies. Amsterdam and Philadelphia: John Benjamins.
Johnstone, Barbara. 2002. Discourse Analysis. Malden: Blackwell.
Kim, YouJin. 2009. Korean lexical bundles in conversation and academic texts.
Corpora 4(2): 135-165.
Kopaczyk, Joanna. 2012. Applications of the lexical bundles method in historical corpus research. In: Piotr Pęzik (ed.), Corpus Data Across Languages and Disciplines, 83-95. Frankfurt am Main: Peter Lang.
Kopaczyk, Joanna. 2013. The Legal Language of Scottish Burghs: Standardization and Lexical Bundles 1380-1560. Oxford: Oxford University Press.
Lambrecht, Knud. 1984. Formulaicity, frame semantics, and pragmatics in German binomial expressions. Language 60(4): 753-796.
Leńko-Szymańska, Agnieszka. 2014. The acquisition of formulaic language by EFL learners: A cross-sectional and cross-linguistic perspective. International Journal of Corpus Linguistics 19(2): 225-251.
Li, Charles N. and Thompson, Sandra A. 1981. Mandarin Chinese: A Functional Reference Grammar. Berkeley: University of California Press.
Li, Ming-Yi. 2014. The use of formulaic discourse markers in CSL learners’ writing
from the aspect of textual function. Journal of Chinese Language Teaching 11(2):
31-57. (李明懿。2014。語篇功能視角之華語非母語學習者寫作使用定式篇
章標記語分析。《華語文教學研究》,第11 期第 2 卷,頁 31-57。)
Libben, Gary. 2005. Everything is psycholinguistic: Material and methodological considerations in the study of compound processing. Canadian Journal of Linguistics 50: 267-283.
Lien, Chinfa. 1989. Antonymous quadrinominals in Chinese. Journal of Chinese Linguistics 17(2): 263-306.
Lien, Chinfa. 2000. Goucixue wenti tansuo ‘Exploring morphological issues’.
Chinese Studies 18: 61-78. (連金發。2000。構詞學問題探索。《漢學研究》, 第18 期,頁 61-78。)
Lu, Shuxiang. 1979. Hanyu Yufa Fenxi Wenti ‘Issues about Chinese Grammar’.
Beijing: The Commercial Press. (呂叔湘。1979。《漢語語法分析問題》。北京:
商務印書館。)
Makkai, Adam. 1972. Idiom Structure in English. The Hague: Mouton.
Manes, Joan and Wolfson, Nessa. 1981. The compliment formula. In: Florian Coulmas (ed.), Conversational Routine, 115-132. The Hague: Mouton.
McCarthy, Michael. 2001. Issues in Applied Linguistics. Cambridge: Cambridge University Press.
McEnery, Tony and Hardie, Andrew. 2012. Corpus Linguistics: Method, Theory and Practice. Cambridge: Cambridge University Press.
McEnery, Tony, Xiao, Richard and Tono, Yukio. 2006. Corpus-based Language Studies. London: Routledge.
Mel’čuk, Igor. 1998. Collocations and lexical functions. In: A. P. Cowie (ed.), Phraseology: Theory, Analysis, and Applications, 23-53. Oxford: Oxford University Press.
Meunier, Fanny. 2012. Formulaic language and language teaching. Annual Review of Applied Linguistics 32: 111-129.
Nattinger, James. 1988. Some current trends in vocabulary teaching. In: Ronald Carter and Michael McCarthy (eds.), Vocabulary and Language Teaching, 62-82.
London and New York: Longman.
Nekrasova, Tatiana M. 2009. English L1 and L2 speakers’ knowledge of lexical bundles. Language Learning 59(3): 647-686.
Nesi, Hilary and Basturkmen, Helen. 2006. Lexical bundles and discourse signalling in academic lectures. International Journal of Corpus Linguistics 11(3):
283-304.
O’Donnell, Matthew Brook, Römer, Ute and Ellis, Nick C. 2013. The development of formulaic sequences in first and second language writing. International Journal of Corpus Linguistics 18(1): 83-108.
O’Keeffe, Anne, McCarthy, Michael and Carter, Ronald. 2007. From Corpus to Classroom: Language Use and Language Teaching. Cambridge: Cambridge University Press.
Oppenheim, Nancy. 2000. The importance of recurrent sequences for nonnative speaker fluency and cognition. In: Heidi Riggenbach (ed.), Perspectives on Fluency, 220-240. Ann Arbor: University of Michigan Press.
Partington, Alan. 1998. Patterns and Meanings: Using Corpora for English Language Research and Teaching. Amsterdam and Philadelphia: John Benjamins.
Partington, Alan and Morley, John. 2004. From frequency to ideology: Investigating word and cluster/bundle frequency in political debate. In: Barbara Lewandowska-Tomaszczyk (ed.), Practical Applications in Language and Computers, 179-192. Frankfurt am Main: Peter Lang.
Pawley, Andrew and Syder, Frances Hodgetts. 1983. Two puzzles for linguistic theory:
Nativelike selection and nativelike fluency. In: Jack C. Richards and Richard W.
Schmidt (eds.), Language and Communication, 191-225. London and New York:
Longman.
Saffran, Jennifer R., Aslin, Richard N. and Newport, Elissa L. 1996. Statistical learning by 8-month-old infants. Science 274: 1926-1928.
Saffran, Jennifer R. and Wilson, Diana P. 2003. From syllabus to syntax: Multilevel statistical learning by 12-month old infants. Infancy 4(2): 273-284.
Salazar, Danica. 2014. Lexical Bundles in Native and Non-native Scientific Writing:
Applying a Corpus-based Study to Language Teaching. Amsterdam and Philadelphia: John Benjamins.
Scheibman, Joanne. 2000. I dunno but: A usage-based account of the phonological
Scheibman, Joanne. 2000. I dunno but: A usage-based account of the phonological