Corpus frequency as a criterion for grading lexical collocations


This article examines the validity of using frequency as a criterion for assigning Spanish collocations to successive proficiency levels. To that end, four aspects were studied: (i) the opinions of teachers of Spanish as a foreign/second language on the criteria that should be used to grade vocabulary; (ii) the grading proposals these teachers make; (iii) their intuitions about collocational frequency; and (iv) the degree of difficulty that collocations graded at different levels pose to learners. The data for these four dimensions were collected through questionnaires administered to both teachers and learners of Spanish. The results suggest that teachers regard frequency as an important criterion for grading vocabulary and that their proposals coincide to a considerable extent with frequency-based grading. The findings on teachers' intuitions about frequency, and on the correlation between frequency-based grading of collocations and collocation difficulty, are less conclusive.

collocations; teaching of Spanish as a foreign/second language; grading; frequency




