MA of L2 learner English

Corpus Linguistics 2015, University of Lancaster, 21-24 July

IMG_20150722_083955

Yu Yuan:
“Exploring the variation in world Learner Englishes: A multidimensional analysis of L2 written corpora”

109 features included in the analysis

RQ:

Can Biber’s model be extended?

How do features co-occur in learner English?

 

Data

ICLE 1.0 (Granger, 2002)

SWEECL 2.0 (Wen & Wang, 2008)

 

Tools

MA tagger Nini (2014) Manual here. Software (Windows) here.

Stanford Corenlp

R

Pythin scripts

 

Method

Kaisser’s criteria + Scree test for Factor Analysis

 

Results

10 dimensions stand out

Dimensions are largely epistemological, rhetorical and syntactical.

 

1.6 billion word Hansard Corpus available

 

Through the corpora list & Prof. Mark Davies

::::::::::::::::::::::::::::::::::::

We are pleased to announce the release of the 1.6 billion word Hansard Corpus . The corpus is part of the SAMUELS project and has been funded by the AHRC (UK).

The Hansard Corpus contains 1.6 billion words from 7.6 million speeches in the British Parliament from 1803-2005. The corpus is semantically tagged, which allows for powerful meaning-based searches. In addition, users can create “virtual corpora” by speaker, time period, House of Parliament, and party in power, and compare across these corpora.

As with all of the other BYU corpora, the corpus allows queries by part of speech, lemma, synonym, customized word lists, and by section of the corpus (e.g. which words or phrases appear in one time period much more than in another). In terms of visualization, it allows users to view frequency listings (matching words and phrases), chart displays (overall frequency by time period), collocates (including comparisons between collocates of contrasting node words), and re-sortable concordance lines.

The end result is a corpus that will be of value not only to linguists (as the largest structured corpus of historical British English from the 1800s-1900s), but hopefully to historians, political scientists, and others as well.

http://www.hansard-corpus.org

============================================

Mark Davies
Professor of Linguistics / Brigham Young University
http://davies-linguistics.byu.edu/

European Journal of Applied Linguistics invites submissions

The European Journal of Applied Linguistics (EuJAL) focuses on the particular concerns of applied linguistics in European contexts, both by addressing problems that are typically relevant for the linguistic situation in Europe, from those on the level of the EU as a pan-national body down to the level of the individual, and by examining topics broached by or discussed in European applied linguistics in particular. In addition to resulting from an epistemological stance, EuJAL is a logical outcome of the regionalization policy of the Association Internationale de Linguistique Appliquée (AILA), supporting the societies’ commitment to regionalization by focusing on the European language space and by giving applied linguists from this regional context an adequate forum. EuJAL is part of the joint activities of the European AILA affiliates.

Extended Java WordNet Library Updated

Through the Linkedin computational linguistics group

:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::

extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format. The library features support for:

* browsing, creating, editing and writing dictionaries
* encodings, including UTF-8
* Java generics
* huge dictionaries
* instance dictionaries and static singletone dictionary
* Maven
* ewn: command-line tool to browse, create and edit dictionaries in WordNet format
* thread-safety
* and more

# About Current Release

The 1.9 release contains these improvements:

– IO rewritten to use block reads
– synchronization removed or replaced with finer-grained blocks
– parsing rewritten to decrease memory reallocations
– memory use decreased for ASCII resource dictionaries: char[] -> byte[]
– reading speed improved: resource random walk ~2.7x, file random walk ~43.8x, file iteration ~11.9x
– upgrade to Java 7
– bugs fixed

# Acknowledgements

This release is supported by YourKit Open Source License and
[YourKit Java Profiler](https://www.yourkit.com/java/profiler/index.jsp)

# Resources

* [Homepage](http://extjwnl.sourceforge.net/)
* [Download](http://sourceforge.net/projects/extjwnl/files)
* [Documentation](https://github.com/extjwnl/extjwnl/wiki)
* [Mailing Lists](http://lists.sourceforge.net/lists/listinfo/extjwnl-announce)
* [Forums](http://sourceforge.net/projects/extjwnl/forums/)
* [Source Code](https://github.com/extjwnl/extjwnl/)
* [Issues](https://github.com/extjwnl/extjwnl/issues)

Researching Language Learner Interactions Online: From Social Media to MOOCs

The 2015 CALICO Monograph: Researching Language Learner Interactions Online: From Social Media to MOOCs edited by Ed Dixon and Michael Thomas is now available.

 

Ch. 1
Edward Dixon
Michael Thomas

Introduction
Ch. 2 Dana Milstein Pancake People, Throwaway Culture, and En Media Res Practices: A New Era of Distance Foreign Language Learning

Ch. 3 Alice Chik English Language Teaching Apps: Reconceptualizing Learners, Parents, and Teachers

Ch. 4
Timothy Lewis
Anna Comas-Quinn
Mirjam Hauck

Clustering, Collaboration, and Community: Sociality at Work in a cMOOC

Ch. 5 Fernando Rubio The Role of Interaction in MOOCs and Traditional Technology-Enhanced Language Courses

Ch. 6
Edward Dixon
Carolin Fuchs

Face to Face, Online, or MOOC–How the Format Impacts Content, Objectives, Assignments, and Assessments

Ch. 7
Vickie Karasic
Anu Vedantham

Video Creation Tools for Language Learning: Lessons Learned

Ch. 8
Michael Thomas

Researching Machinima in Project-Based Language Learning: Learner-Generated Content in the CAMELOT Project

Ch. 9 Yuki Akiyama Task-Based Investigations of Learner Perceptions: Affordances of Video-Based eTandem Learning

Ch. 10 Ilona Vandergriff Exercising Learner Agency in Forum Interactions in a Profesionally Moderated Language Learning Networking Site

Ch. 11 Motoko I. Christensen
Mark Christensen Language Learner Interaction in Social Network Site Virtual Worlds

Ch. 12 Geraldine Blattner
Amanda Dalola
Lara Lomicka Tweetsmarts: A Pragmatic Analysis of Well Known Native French Speaker Tweeters

Ch. 13 Theresa Schenker Telecollaboration for Novice Language Learners–Negotiation of Meaning in Text Chats between Nonnative and Native Speakers

Ch. 14 Giulia Messina Dahlberg
Sangeeta Bagga-Gupta Learning On-The-Go in Institutional Telecollaboration: Anthropological Perspectives on the Boundaries of Digital Spaces

Ch. 15 Marie-Thérèse Batardière Examining Cognitive Presence in Students’ Asynchronous Online Discussions

Ch. 16 Kelsey D. White Orientations and Access to German-Speaking Communities in Virtual Environments

Ch. 17 Megan Case Language Students’ Personal Learning Environments Through an Activity Theory Lens

Ch. 18 Bonnie Youngs
Sarah Moss-Horwitz
Elizabeth Snyder Educational Data Mining for Elementary French On-line: A Descriptive Study

Ch. 19 Stephanie Link
Zhi Li Understanding Online Interaction Through Learning Analytics: Defining a Theory-Based Research Agenda