Enhancing and extending corpora and corpora tools for learning and teaching



Valoriser et développer les outils autour des corpus dans une perspective didactique / Enhancing and extending corpora and corpora tools for learning and teaching

Mardi 27 mai / Tuesday, May 27th

Salle/Room 205, Site Rabelais, UJF Valence, France


Programme

9h30 – Speed-dating : Présentations/Presentations

10h – Présentation et discussion autour du livre/Presentation and discussion about the book « Des documents authentiques aux corpus. Démarches pour l’apprentissage des langues », Boulton et Tyne (2014). Discussion autour de l’abondance de matières exploitables dans les corpus et de leur sous-exploitation dans l’enseignement des langues/Including the abundance of exploitable corpus material and its general under-use in language teaching.

Conférencier/Speaker: Alex Boulton

11h – Présentation de la plate-forme Chamilo : comment l’utiliser pour les corpus ?/Presentation of the Chamilo platform: how can it be used for corpora? Suivi d’une discussion en français et en anglais/Followed by a discussion in French and English.

Jérémie Grépiloux et Hubert Borderiou (SIMSU)

13h30 – Pedagogical uses of corpora: theories and practices/Utilisations pédagogiques des corpus : théories et pratiques. 20-minute presentation followed by a group discussion.

Conférencier/Speaker: Pascual Pérez-Paredes

14h30 – Speed-dating : Consultation en ligne des corpus/Consulting on-line corpora: showing and viewing corpora in the computer lab.

16h – Bilan de la journée et projets/Summary of the day and projects

Cristelle Cavalla and Laura Hartwell

Inscriptions (gratuites et obligatoires)/Mandatory free registration:

https://docs.google.com/forms/d/118xpaiTACRMW5KA5ja92oEGJqZ5Q6BUmqfVmSPq41U0/viewform

Logistics: Sylvain Perraud, Sylvain.Perraud@gmail.com (Compte rendu/minutes)

Contacts: Cristelle.Cavalla@univ-paris3.fr, Laura.Hartwell@ujf-grenoble.fr

References

SACODEYL : http://www.um.es/sacodeyl/

Chamilo : http://www.chamilo.org/fr

Scientext : http://scientext.msh-alpes.fr/scientext-site-en/spip.php?article9

EmoBase/EmoProf : http://emolex.u-grenoble3.fr/emoBase/

Full-text data for the two largest BYU corpora

I have received this through the CORPORA List:
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::

At http://corpus.byu.edu/full-text/ you can now download full-text data for the two largest BYU corpora:

Corpus of Contemporary American English (COCA). 440 million words of downloadable text; the largest, most up-to-date, publicly available corpus of English that is balanced for genre (spoken, fiction, magazine, newspaper, and academic).

The Corpus of Global Web-Based English (GloWbE). 1.8 billion words of downloadable text, divided into groups from twenty different English-speaking countries (US, UK, Canada, Australia, India, etc.). About 60% of the texts come from blogs, which capture very informal language.

With this full-text data, you will have the actual corpora on your computer, and you can search the data in any way that you’d like. You can generate your own frequency data, collocates, n-grams, or concordance lines; you can search by word, lemma, and part of speech; and you can carry out complex syntactic and semantic searches offline. You can even modify the lexicon and sources tables to search the corpora in ways that are not possible via the standard web interfaces.

The data comes in three different formats (see samples): data for relational databases (info), word/lemma/PoS (vertical), and linear text (horizontal). When you purchase the data, you purchase the rights to any and all of these formats.
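As an illustration of what working with the full-text data offline might look like, here is a minimal Python sketch that builds a lemma frequency list from a file in the word/lemma/PoS (vertical) format. The file name, tab separation, and column order are assumptions made for the example, not the documented layout of the BYU downloads.

    # Minimal sketch: lemma frequency list from a vertical-format corpus file.
    # Assumed layout (illustrative only): one token per line,
    # tab-separated columns in the order word <TAB> lemma <TAB> PoS.
    from collections import Counter

    def lemma_frequencies(path, pos_prefix=None):
        """Count lemmas, optionally restricted to a PoS tag prefix (e.g. "nn")."""
        counts = Counter()
        with open(path, encoding="utf-8") as handle:
            for line in handle:
                parts = line.rstrip("\n").split("\t")
                if len(parts) < 3:
                    continue  # skip blank or malformed lines
                lemma, pos = parts[1], parts[2]
                if pos_prefix and not pos.lower().startswith(pos_prefix):
                    continue
                counts[lemma.lower()] += 1
        return counts

    if __name__ == "__main__":
        # Hypothetical file name; point this at an actual downloaded file.
        frequencies = lemma_frequencies("coca_sample_vertical.txt", pos_prefix="nn")
        for lemma, count in frequencies.most_common(20):
            print(lemma, count)

The same loop extends naturally to collocates or n-grams by keeping a window of preceding tokens instead of counting single lemmas.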


Reading concordances is not a trivial task

The methodological transfer from the CL research area to the applied ring of language learning and teaching underwent no adaptation, and thus learners were presented with the same tools, corpora and analytical tasks as well-trained and professional linguists.

[…]

Reading concordances is, by no means, a trivial task. Sinclair (1991) recommends a complex procedure which involves five distinct stages. Let us review very briefly what they entail. The first stage is that of initiation: learners look to the left and to the right of the nodes and determine the dominant pattern. Then, learners are prompted to interpret and hypothesize about what it is that these words have in common. Thirdly comes the consolidation stage, where students corroborate their hypothesis by looking more closely at its variations. After this, the findings have to be reported and, finally, a new round of observations starts. Although typically reduced in language classrooms, this procedure is common in the possibilities scenario and certainly characterises the so-called bottom-up approach (Mishan, 2004: 223).

A recent analysis (Kreyer, 2008) deconstructs the idea of corpus competence into different skills, namely interpreting corpus data, knowledge about corpus design, knowledge about resources on the Internet, some linguistic background, knowledge about how to use concordances and, finally, some corpus linguistics background. This is a positive effort in the right direction, as the author admits the need to create the conditions for the use of corpora in the language classroom; in other words, Kreyer recognizes that pedagogic mediation is necessary if we want to turn the corpus into a learning tool. Notwithstanding, the challenges are significant.

Pérez-Paredes, P. (2010). Corpus Linguistics and Language Education in Perspective: Appropriation and the Possibilities Scenario. In T. Harris & M. Moreno Jaén (Eds.), Corpus Linguistics in Language Teaching (pp. 53-73). Peter Lang.
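To make the initiation stage concrete, looking to the left and to the right of the node, here is a minimal KWIC concordance sketch in Python. The tokenisation, window size, and alignment are illustrative assumptions rather than anything prescribed by the chapter.

    import re

    def kwic(text, node, window=5):
        """Return simple KWIC lines: left context, [node], right context."""
        tokens = re.findall(r"\w+|[^\w\s]", text.lower())  # naive tokenisation
        lines = []
        for i, token in enumerate(tokens):
            if token == node.lower():
                left = " ".join(tokens[max(0, i - window):i])
                right = " ".join(tokens[i + 1:i + 1 + window])
                lines.append(f"{left:>40}  [{token}]  {right}")
        return lines

    if __name__ == "__main__":
        sample = ("The evidence suggests that learners need training, "
                  "because reading evidence in concordance lines is hard.")
        for line in kwic(sample, "evidence", window=4):
            print(line)

Sorting such lines by the first word to the left or to the right of the node is one simple way to make the dominant pattern easier to spot.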


Extracting n word phrases in large texts

This is a summary of resources posted on the [Corpora-List] in early 2014.

CMU-Cambridge Statistical Language Modeling toolkit

http://mi.eng.cam.ac.uk/~prc14/toolkit.html

Sketch Engine

http://www.sketchengine.co.uk/documentation/wiki/SkE/NGrams

Laurence Anthony’s AntConc

http://www.antlab.sci.waseda.ac.jp/software.html

kfNgram

http://www.kwicfinder.com/kfNgram/kfNgramHelp.html

Colibri

Software for the extraction of n-grams as well as patterns that are not consecutive (skipgrams). The software is written in C++ for speed and memory efficiency, but comes with a Python binding for use from Python scripts, and it also ships a standalone command-line tool. A minimal, tool-independent sketch of the two notions is given at the end of this list.

https://github.com/proycon/colibri-core

http://proycon.github.io/colibri-core/doc/

Maarten van Gompel

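As mentioned above, here is a minimal Python sketch of consecutive n-grams and single-gap skipgrams over a token list. The whitespace tokenisation and the gap marker ("{*}") are illustrative assumptions and are not tied to any of the tools listed.

    from collections import Counter

    def ngrams(tokens, n):
        """All consecutive n-grams in a token list."""
        return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

    def skipgrams(tokens, n=3):
        """Skipgrams of length n with exactly one internal token replaced by a gap."""
        grams = []
        for gram in ngrams(tokens, n):
            for skip in range(1, n - 1):  # never skip the first or last position
                grams.append(gram[:skip] + ("{*}",) + gram[skip + 1:])
        return grams

    if __name__ == "__main__":
        tokens = "to be or not to be that is the question".lower().split()
        print(Counter(ngrams(tokens, 2)).most_common(3))
        print(Counter(skipgrams(tokens, 3)).most_common(3))

Dedicated tools such as those listed above remain the better choice for large texts; the point of the sketch is only to show what is being counted.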