The compilation of the spoken sub-corpus for the Tartu corpus of Estonian learner English

Rahusaar, Anne

The compilation of the spoken sub-corpus for the Tartu corpus of Estonian learner English

Files

Rahusaar_Anne_MA_Thesis.pdf (1002.09 KB)

Date

2019

Authors

Rahusaar, Anne

Publisher

Tartu Ülikool

Abstract

Computer learner corpora (CLC) are electronic stored collections of either written or spoken texts which are produced by learners of a language, as a foreign language (Granger 2004: 124). Computer Learner Corpus research is a fairly new and growing discipline. The problem with most corpora which have been compiled so far is that they are not publicly available, thus they are not accessible for researchers outside of the specific corpus team. In Estonia, studying learner language and using corpora has become more and more popular during the recent years, yet there is still much to learn about the Estonian learners of English. There have been some studies about written learner corpora but studying and compiling a spoken learner corpus is not a common practice yet, mainly because compiling a spoken corpus is a more time-consuming process.

Keywords

inglise keel, korpused (keelet.), kõne, pragmaatika

URI

http://hdl.handle.net/10062/63917

Collections

Inglise filoloogia magistritööd – Master's theses

Full item page

The compilation of the spoken sub-corpus for the Tartu corpus of Estonian learner English

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections