Two Faces of One Suffix: Some Thoughts on Using Corpora in Usage-Based Studies of Word-Formation

Gabrijela Buljan

Abstract


This paper compares the semantic profile of a single multifunctional derivational suffix derived from data obtained in two general digital corpora of Croatian. The primary motivation is to explore whether our verdicts about the semantics of affixes may depend on the corpus selected as the source of empirical material. The issue is of vital importance, especially for those studying word formation from a usage-based perspective. If grammar is construed as the cognitive organization of our experience with language (Bybee 2006) and if we turn to large, general digital corpora for evidence of this experience, we must be aware that examining different corpora may lead to different hypotheses about users’ internalized grammar. The here-presented semantic analysis of the Croatian nominal suffix -ar(a) in the more controlled Croatian National Corpus v3.0 and the liberal web-based corpus hrWaC v2.2 yielded conspicuously different results about its dominant function. This does not mean that similar discrepancies would necessarily be observed with other affixes, and it most certainly does not negate the value of corpora in studying word formation. However, such results do caution us against generalizing corpus-relative findings into some general “truth” about the affixes studied.

Keywords


usage-based approach; word-formation; general corpus; semantic structure

Full Text:

PDF

References


Ameka, Felix K. (2006), ˝Interjections˝, in: Keith Brown (ed.), Encyclopedia of language & linguistics. 2nd edn., Elsevier, Oxford, 743–746.

Aronoff, Mark (1976), Word-formation in Generative Grammar, MIT Press, Cambridge Mass.

Baayen, Harald (1992), ˝Quantitative aspects of morphological productivity˝, in: Geert Booij, Jaap van Marle (eds.), Yearbook of morphology 1992, Kluwer, Dordrecht, 109–149.

Baayen, Harald (1994), “Derivational productivity and text typology”, Journal of Quantitative Linguistics 1(1), 16–34. https://doi.org/10.1080/0929617 9408589996

Baayen, Harald (2009), ˝Corpus linguistics in morphology: Morphological productivity˝, in: Anke Lüdeling, Merja Kytö (eds.), Corpus linguistics. An international handbook, Volume 2, De Gruyter, Berlin, 899–919.

Baayen, Harald, Rochelle Lieber (1991), ˝Productivity and English derivation: a corpus-based study˝, Linguistics, 29, 801–844.

Baayen, Harald, Antoinette Renouf (1996), ˝Chronicling the Times: productive lexical innovations in an English newspaper˝, Language, 72, 69–96.

Babić, Stjepan (2002), Tvorba riječi u hrvatskome književnome jeziku, 3rd im- proved edn., HAZU; Nakladni zavod Globus, Zagreb

Barić, Eugenija, Mijo Lončarić, Dragica Malić, Slavko Pavešić, Mirko Peti, Vesna Zečević, Marija Znika (1997), Hrvatska gramatika. 2nd revised edn, Školska knjiga, Zagreb

Barlow, Michael, Suzanne Kemmer (eds.) (2000), Usage-based models of language, CSLI, Stanford

Bauer, Laurie (1983), English word formation, CUP, Cambridge

Bauer, Laurie (1997), ˝Evaluative morphology: In search of universals˝, Studies in Language. International Journal sponsored by the Foundation “Foundations of Language”, 21(3), 533–575.

Bauer, Laurie (2001), Morphological productivity, CUP, Cambridge

Biber, Douglas, Susan Conrad (2009), Register, genre, and style, CUP, Cambridge

Blumenthal-Dramé, Alice (2012), Entrenchment in usage-based theories: What corpus data do and do not reveal about the mind, De Gruyter, Berlin

Bogunović, Irena, Jasmina Jelčić Čolakovac, Mirjana Borucinsky (2022), ˝The Database of English words and their Croatian equivalents˝, [Baza]

Brdar, Mario (2016), ˝Why Modrić and Real rather than Real and Modrić? On the order of proper names under coordination˝, Jezikoslovlje, 17(1-2), 377–395.

Buljan, Gabrijela (2023a), ˝Značenja i oblici hrvatskog sufiksa -AR-A: korpusna studija˝, Fluminensia, 35(1), 27–59.

Buljan, Gabrijela (2023b), ˝Neke misli o nastanku augmentativnog/eva- luativnih značenja hrvatskog sufiksa –ara˝, Suvremena lingvistika, 49(95), 1–27.

Buljan, Gabrijela (2024), Aspects of innovation in Croatian word-formation: A corpus-based study of suffixes -ara, -ana and -stan, Faculty of Humanities and Social Sciences, Osijek

Bybee, Joan L. (2006), ˝From usage to grammar: The mind’s response to repetition˝, Language, 82(4), 711–733.

Bybee, Joan L. (2010), Language, usage and cognition, CUP, Cambridge

Clark, Eve (1982), ˝A case study of innovation in the child’s lexicon˝, in: Eric Wanner & Lila R. Gleitman (eds.), Language acquisition: The state of the art, CUP, Cambridge, MA, 390–425.

Costa, Marcella (2017), ˝Augmentatives in Italian and German: From contrastive analysis to translation˝, in: Maria Napoli, Miriam Ravetto (eds.), Exploring intensification: Synchronic, diachronic and cross-linguistic perspectives, John Benjamins, Amsterdam, 353–370.

Daničić, Đuro, Matija Valjavec, Pero Budmani, Tomo Maretić, Stjepan Mu- sulin, Slavko Pavešić, eds. (1880–1976), Rječnik hrvatskoga ili srpskoga jezika, JAZU, Zagreb

Divjak, Dagmar, Catherine L. Caldwell-Harris (2015), ˝Frequency and entre- nchment˝, in: Eva Dabrowska, Dagmar Divjak (eds.), Handbook of Cognitive Linguistics, De Gruyter, Berlin, 53–75.

Dokulil, Miloš (1968), ˝Zur Theorie der Wortbildung˝, Wissenschaftliche Zeit- schrift der Karl-Marx-Universität, Gesellschafts- und Sprachwissenschaftliche Reihe, 17(2-3), 203–211.

Dressler, Wolfgang U. (2000), ˝Extragrammatical vs. marginal morphology˝, in: Ursula Doleschal, Ana M. Thornton (eds.), Extragrammatical and marginal morphology, Lincom, München, 1–10.

Dressler, Wolfgang U., Lavinia Merlini Barbaresi (1994), Morphopragmatics: diminutives and intensifiers in Italian, German and other languages, De Gruyter, Berlin

Egbert, Jesse, Douglas Biber, Mark Davies (2015), ˝Developing a bottom-up, user-based method of web register classification˝, Journal of the Association for Information Science and Technology, 66(9), 1817–1831.

Ellis, Nick E. (2006), ˝Language acquisition as rational contingency learning˝, Applied Linguistics, 27(1), 1–24.

Ellis, Nick C. (2012), ˝Frequency-based accounts of SLA˝, in: Susan Gass, Alison Mackey (eds.), Handbook of Second Language Acquisition, Routledge, London & New York, 193–210.

Filko, Matea (2020), Unutarleksičke i međuleksičke strukture imeničkoga dijela hrvatskoga leksika, Doktorski rad, Filozofski fakultet u Zagrebu, Zagreb

Gaeta, Livio (2015), ˝Evaluative morphology and sociolinguistic variation˝, in: Nicola Grandi, Lívia Körtvélyessy (eds.), Edinburgh Handbook of Evaluative Morphology, Edinburgh University Press, Edinburgh, 121–133.

Gaeta, Livio, Davide Ricca (2015), ˝Productivity˝, in: Peter O. Müller, Inge- borg Ohnheiser, Susan Olsen, Franz Rainer (eds.), Word-formation. An international handbook of the languages of Europe. Volume 2, De Gruyter, Berlin, 842–858.

Giltrow, Janet, Dieter Stein, eds. (2009), Genres in the Internet. Issues in the theory of genre, John Benjamins, Amsterdam/Philadelphia

Givón, Talmy (1979), On understanding grammar, Academic Press, New York

Grandi, Nicola, Lívia Körtvélyessy (2015), ˝Introduction: why evaluative morphology?˝, in: Nicola Grandi, Lívia Körtvélyessy (eds.), Edinburgh handbook of evaluative morphology, Edinburgh University Press, Edinburgh, 3–21.

Hohenhaus, Peter (2005), ˝Lexicalization and institutionalization˝, in: Pavol Štekauer, Lieber Rochelle (eds.), Handbook of Word-Formation, Springer Verlag, Dordrecht, 353–373.

Hopper, Paul, Sandra A. Thompson (1980), ˝Transitivity in grammar and discourse˝, Language, 56, 251–299.

Hummel, Martin (2015), ˝The semantics and pragmatics of Romance evaluative suffixes˝, in: Peter O. Müller, Ingeborg Ohnheiser, Susan Olsen, Franz Rainer (eds.), Word-Formation. An International Handbook of the Languages of Europe. Volume 2, De Gruyter, Berlin, 1528–1545.

Jojić, Ljiljana, Ranko Matasović (eds.) (2002–2004), Hrvatski enciklopedijski rječnik (HER), vols. 1–12, EPH d.o.o. and Novi Liber, Zagreb

Kendall, Tyler (2011), ˝Corpora from a sociolinguistic perspective˝, Revista Brasileira de Linguística Aplicada, 11(2), 361–389.

Kiršova, Mirjana (1999), Nomina loci u savremenom srpskom jeziku, Univerzitet Crne Gore, Podgorica

Košćak, Nikola (2018), Šrajbenzi spiku: Stilovi hrvatske žargonske i žargonizirane proze 1990-ih i 2000-ih, Stilistika.org, Zagreb

Körtvélyessy, Lívia (2009), ˝Productivity and creativity in word-formation: A sociolinguistics perspective˝, Onomasiology Online, 10, 1–22.

Kuzman, Taja, Nikola Ljubešić (2023), ˝Automatic genre identification: a sur- vey˝, Language Resources and Evaluation. 10.1007/s10579-023-09695-8

Ljubešić, Nikola, Filip Klubička (2014), ˝{bs,hr,sr}WaC - Web Corpora of Bosnian, Croatian and Serbian˝, Proceedings of the 9th Web as Corpus Workshop (WaC-9)”, Gothenburg, Sweden: Association for Computational Linguistics, 29–35.

Ljubešić, Nikola, Taja Kuzman (2023), ˝CLASSLA-web: Comparable Web Corpora of South Slavic Languages Enriched with Linguistic and Genre Annotation˝, Machine Learning and Knowledge Extraction, 5, 1149–1175.

Mattiello, Elisa (2008), An introduction to English slang: A description of its morphology, semantics and sociology, Polimetrica, Milan

Mel’čuk, Igor (1932), Aspects of the theory of morphology, De Gruyter, Berlin

Mengel, Swetlana (2009), ˝Wortbildungsbedeutung˝, in: Sebastian Kempgen, Peter Kosta, Tilman Berger, Karl Gutschmidt (eds.), The Slavic Languages/An International Handbook of their structure, their history and their investigation. Band 1/Volume 1, De Gruyter, Berlin, 775–781.

Mikić Čolić, Ana (2021), Neologizmi u hrvatskome jeziku, Filozofski fakultet, Osijek

Miller, Gary D. (2014), English lexicogenesis, OUP, Oxford

Milosavljević, Stefan, Boban Arsenijević (2022), ˝What differentiates Serbo-Croatian verbal theme vowels: content or markedness?˝, Glossa: a journal of general linguistics, 7(1), https://www.glossa-journal.org/article/id/8535/

Munat, Judith (2007), ˝Lexical creativity as a marker of style in science fiction and children’s literature˝, in: Judith Munat (ed.), Lexical creativity, texts and contexts, John Benjamins, Amsterdam, 163–185.

Plag, Ingo (1999), Morphological productivity. Structural constraints on English derivation, De Gruyter, Berlin

Plag, Ingo, Christiane Dalton-Puffer, Harald Baayen (1999), ˝Morphological productivity across speech and writing˝, English Language and Linguistics, 3(2), 209–228.

Pounder, Amanda (2000), Process and paradigms in word-formation morphology, De Gruyter, Berlin

Säily, Tanja (2011), ˝Variation in morphological productivity in the BNC: Sociolinguistic and methodological considerations˝, Corpus Linguistics and

Linguistic Theory, 7(1), 119–141.

Schmid, Hans-Jörg (ed.) (2017), Entrenchment and the psychology of language learning: How we reorganize and adapt linguistic knowledge, De Gruyter, Berlin

Schultink, Henk (1961), ˝Produktiviteit als morfologisch fenomeen˝, Forum der Letteren, 2, 110–125.

Silić, Josip, Ivo Pranjković (2007), Gramatika hrvatskoga jezika za gimnazije i visoka učilišta, Školska knjiga, Zagreb

Simonović, Marko, Predrag Kovačević (2022), ˝Possessive, kind and not so kind: the different uses of the adjectival -ov in Serbo-Croatian˝, Annual Review of the Faculty of Philosophy, 47(3), 87–109.

Tadić, Marko (2005), ˝Developing the Croatian National Corpus and beyond˝, in: Peter Grzybek, (ed.), Contributions to the science of text and language, Springer, Dordrecht, 295–300.

Tadić, Marko (2009), ˝New version of the Croatian National Corpus˝, in: Dana Hlaváčková, Aleš Horák, Klara Osolsobě, Pavel Rychlý (eds.), After half a century of Slavonic natural language processing, Masaryk University, Brno, 199–205.

Taylor, John R. (2015), ˝Word-formation in cognitive grammar˝, in: Peter O. Müller, Ingeborg Ohnheiser, Susan Olsen, Franz Rainer (eds.), Word-formation. An international handbook of the languages of Europe. Volume 1, De Gruyter, Berlin, 145–158.

Žanić, Ivo (2010), ˝Purgerinjosi, tovarinjosi i leginjice – tvorbene inovacije u hrvatskim vernakularima˝, in: Mario Brdar, Marija Omazić, Višnja Pavičić Takač, Tanja Gradečak-Erdeljić, Gabrijela Buljan (eds.), Prostor i vrijeme u jeziku: jezik u prostoru i vremenu, HDPL - Filozofski fakultet Sveučilišta J. J. Strossmayera, Zagreb/Osijek, 155-164.




DOI: https://doi.org/10.51558/2490-3647.2024.9.2.647

Refbacks

  • There are currently no refbacks.


ISSN: 2490-3604 (print) ● ISSN: 2490-3647 (online)

Društvene i humanističke studije - DHS is under the Creative Commons licence.