Proceedings from the CALS conference 2014
Edited By Kristina Cergol Kovačević and Sanda Lucija Udier
Learner corpus of Croatian as a second and foreign language
Years of experience in teaching Croatian as a foreign language have clearly indicated the need for a lexical resource that overcomes the limitations of existing bilingual dictionaries and manuals, enabling optimal presentation of the Croatian language to non-native speakers.
The aim of this study is to describe the development methodology of the first learner corpus of Croatian as a second and foreign language, the corpus that will be used for producing new lexical resources aimed at non-native speakers of the Croatian language. The learner corpus is based on written texts and audio recordings of foreign students attending Croaticum – Center for Croatian as a Second and Foreign Language at the Faculty of Humanities and Social Sciences, University of Zagreb.
The learner corpus consists of two sub-corpora: a corpus of written texts systematically described by detailed metadata (gender, age, nationality, mother tongue, bilingual or multilingual competence, language competence of parents, etc.) and a corpus of recordings of read-aloud and spontaneous speech. Our hypothesis is that computer processing of our learner corpus should provide the in-depth analysis of learners’ language (description of their interlanguage and deviations from the standard). Computer processing will result in the extraction of important linguistic patterns, due to different levels of corpus annotation and enrichment: morphosyntactic tagging and error tagging. The research will provide insight into the interlanguage of learners and enable language instructors to better understand the real needs of their students.
Research on collected authentic natural...
You are not authenticated to view the full text of this chapter or article.
This site requires a subscription or purchase to access the full text of books or journals.
Do you have any questions? Contact us.Or login to access all content.