SACODEYL XML corpora

SACODEYL focuses on spoken interviews with British, French, German, Italian, Lithuanian, Rumanian and Spanish teenagers between 13 and 18 years of age. The interview transcripts are stored in online corpora pedagogically annotated and enriched for language learning and teaching purposes.

SACODEYL adopts a “small pedagogic corpus” approach. Each of the seven corpora – English, French, German, Italian, Lithuanian, Rumanian and Spanish – contains 20 to 25 video-recorded interviews of about 10 minutes each.

To ensure thematic comparability, a common set of interview questions was used, covering a wide range of topics including: personal information, home and family, present and past living routines, hobbies and interests, holidays, school and education, job experiences, plans for the future, open discussion topics.

A SACODEYL corpus consists of orthographical interview transcripts in XML format. Each transcript is structured on the basis of short thematic sections and annotated with regard to pedagogically relevant characteristics, e.g. topic, grammatical and lexical properties, discourse markers and CEF level.

For pedagogic support, access to various web-based enrichment resources is provided via direct links and resource sheets. This includes in particular

  • the interview sound & video files,
  • ready-made online learning modules created with Telos Language Partner,
  • suggestions for explorative & communicative activities in a classroom and/or Moodle-based e-learning environment.

You can download the SACODEYL XML files here.

SACODEYL ENGLISH

SACODEYL FRENCH

SACODEYL GERMAN

SACODEYL LITHUANIAN

SACODEYL ROMANIAN

SACODEYL SPANISH

References

Kohn Kurt (2012). Pedagogic Corpora for Content and Language Integrated Learning. Insights from the BACKBONE Project. The Eurocall Review 20/2, Sept 2012.

Pérez-Paredes, P., & Alcaraz-Calero, J. M. (2009). Developing annotation solutions for online Data Driven LearningReCALL21(01), 55.

Pérez-Paredes, P. (2019). The pedagogic advantage of teenage corpora for secondary school learners. In: P. Crosthwaite (ed.) Data Driven Learning for the Next Generation: Corpora and DDL for Pre-tertiary Learners. London: Routledge, pp.67-87.

Copyright

The SACODEYL web site, video interviews and corpora are Copyright © 2008, Universidad de Murcia (Spain); the SACODEYL learning resources and documentation are Copyright © 2008 by the project partners mentioned in the credits. The SACODEYL web site and the SACODEYL videos are provided as an integral part of the SACODEYL system and their use is only envisaged in the context of the SACODEYL Search facility.

SACODEYL by SACODEYL – Universidad de Murcia is licensed under a Creative Commons Reconocimiento-No comercial-Sin obras derivadas 3.0 Unported License.
Based on a work at www.um.es.

Permissions beyond the scope of this license may be available at http://www.um.es/sacodeyl/en/pages/license.htm.

For other uses, please contact sacodeyl@um.es.

Permission from copyright holders is needed to create, among others, course materials or Web sites. You need to obtain permission when you use SACODEYL products in a way that infringe on the exclusive rights granted to the copyright holders.