CFP Language Resources and Evaluation Journal, entitled “Under-resourced Languages, Collaborative Approaches and Linked Open Data: Resources, Methods and Applications”

Submissions for a Special Issue of the Language Resources and Evaluation Journal, entitled “Under-resourced Languages, Collaborative Approaches and Linked Open Data: Resources, Methods and Applications”.

Important: More detailed information will be made available in September 2014. For more information please contact the guest editors.

PRELIMINARY SUBMISSION DATE: November 15, 2014

CALL FOR SUBMISSIONS
Under-resourced languages are generally described as languages that suffer from a chronic lack of available resources, from human, financial, and time resources to linguistic ones (language data and language technology), and often also experience the fragmentation of efforts in resource development. This situation is exacerbated by the realization that as technology progresses and the demand for localised languages services over digital devices increases, the divide between adequately- and under-resourced languages keeps widening. Given that most of the world’s almost 7000 languages are not adequately resourced, much work needs to be done in order to support their existence in the digital age.

Although the destiny of a language is primarily determined by its native speakers and broader cultural context, the technological development of an under-resourced language offers such a language a strategic opportunity to have the same “digital dignity”, “digital identity” and “digital longevity” as large, well-developed languages on the Web.

The Linked (Open) Data framework and the emerging Linguistic Linked (Open) Data infrastructure offer novel opportunities for under-resourced languages. On the one hand, Linked Data offers ways of exposing existing high quality, albeit small, language resources in the Semantic Web and, on the other hand, allows for the development of new state-of-the-art resources without necessarily having to rely on the availability of sophisticated language processing support.

This special issue arises from the imperative to maintain cultural and language diversity and from the basic right of all communities, languages, and cultures to be “first class citizens” in an age driven by information, knowledge and understanding. In this spirit, this special issue focuses on three strategic approaches to augment the development of resources for under-resourced languages to achieve a level potentially comparable to well-resourced, technologically advanced languages, viz. a) using the crowd and collaborative platforms; b) using technologies of interoperability with well-developed languages; and c) using Semantic Web technologies and, more specifically, Linked Data.

We invite original contributions, not published before and not under consideration for publication elsewhere, that address one or more of the following questions by means of one or more of the three approaches mentioned above:

• How can collaborative approaches and technologies be fruitfully applied to the accelerated development and sharing of high quality resources for under-resourced languages?

• How can such resources be best stored, exposed and accessed by end users and applications?

• How can small language resources be re-used efficiently and effectively, reach larger audiences and be integrated into applications?

• How can multilingual and cross-lingual interoperability of language resources, methods and applications be supported, also between languages that belong to different language families?

• How can existing language resource infrastructures be scaled to thousands of languages?

• How can research on and resource development for under-resourced languages benefit from current advances in semantic and semantic web technologies, and specifically the Linked Data framework?

GUEST EDITORS
Laurette Pretorius – University of South Africa, South Africa (pretol AT unisa DOT ac DOT za)
Claudia Soria – CNR-ILC, Italy (claudia.soria AT ilc DOT cnr DOT it)

SCIENTIFIC COMMITTEE
Sabine Bartsch, Technische Universität Darmstadt, Germany
Delphine Bernhard, LILPA, Strasbourg University, France
Peter Bouda, CIDLeS – Interdisciplinary Centre for Social and Language Documentation, Portugal
Paul Buitelaar, Insight Centre for Data Analytics, NUIG, Ireland
Steve Cassidy, Macquarie University, Australia
Christian Chiarcos, Frankfurt University, Germany
Thierry Declerck, DFKI GmbH, Language Technology Lab, Germany
Mikel Forcada, University of Alicante, Spain
Dafydd Gibbon, Bielefeld University, Germany
Yoshihiko Hayashi, Graduate School of Language and Culture, Osaka University, Japan
Sebastian Hellmann, Leipzig University, Germany
Simon Krek, Jožef Stefan Institute, Slovenia
Tobias Kuhn, ETH, Zurich, Switzerland
Joseph Mariani, LIMSI-CNRS & IMMI, France
John McCrae, Bielefeld University, Germany
Steven Moran, Universität Zürich, Switzerland
Kellen Parker, National Tsing Hua University, China
Patrick Paroubek, LIMSI-CNRS, France
Taher Pilehvar, “La Sapienza” Rome University, Italy
Maria Pilar Perea i Sabater, Universitat de Barcelona, Spain
Laurette Pretorius, University of South Africa, South Africa
Leonel Ruiz Miyares, Centro de Linguistica Aplicada (CLA), Cuba
Kevin Scannell, St. Louis University, USA
Ulrich Schäfer, Technical University of Applied Sciences Amberg-Weiden, Bavaria, Germany
Claudia Soria, CNR-ILC, Italy
Nick Thieberger, University of Melbourne, Australia
Eveline Wandl-Vogt, Austrian Academy of Sciences, ICLTT, Austria
Michael Zock, LIF-CNRS, France

Multi-Dimensional Analysis, 25 years on. A tribute to Douglas Biber

Edited by Tony Berber Sardinha and Marcia Veirano Pinto
São Paulo Catholic University
Approximately a quarter of a century ago, the Multi-Dimensional (MD) approach—one of the most powerful (and controversial) methods in Corpus Linguistics—saw its first book-length treatment. In its eleven chapters, this volume presents all new contributions covering a wide range of written and spoken registers, such as movies, music, magazine texts, student writing, social media, letters to the editor, and reports, in different languages (English, Spanish, Portuguese) and contexts (engineering, journalism, the classroom, the entertainment industry, the Internet, etc.). The book also includes a personal account of the development of the method by its creator, Doug Biber, an introduction to MD statistics, as well as an application of MD analysis to corpus design. The book should be essential reading to anyone with an interest in how texts, genres, and registers are used in society, what their lexis and grammar look like, and how they are interrelated.
[Studies in Corpus Linguistics, 60]  2014.  xxxviii, 328 pp.

CFP Current Work in Corpus Linguistics: Working with Traditionally-conceived Corpora and Beyond

 
Current Work in Corpus Linguistics: 
Working with Traditionally-conceived Corpora and Beyond

Welcome to Valladolid for the CILC2015, the seventh edition of the International Conference on Corpus Linguistics organized by AELINCO, the Spanish Association of Corpus Linguistics and hosted this year by the Department of English and the International Centre for Lexicography (University of de Valladolid, Spain). The University of Valladolid is a public university located in North-west Spain. Established in the 13th Century, it is one of the oldest universities in the world. It offers over 100 degrees, 50 masters and doctorate programmes, boasting a broad network of international relations, research centres as well as sports and cultural facilities together with a rich architectural and documentary heritage.

The conference will be held at the Palacio de Congresos “Conde Ansúrez” at the heart of Valladolid from 5 to 7 March 2015. The Conference will focus on current work in Corpus Linguistics as well as future developments of the discipline, especially in connection with the coming of age of Big Data.

Abstracts must be sent in English or Spanish and must not exceed 550 words. The time allotted for the presentation is 20 minutes (with 5 minutes for discussion). Abstracts for poster sessions will also be accepted.

The abstract proposals should be related to one of the conference panels outlined by AELINCO. Each panel has a director who shall communicate the acceptance or rejection of the proposals after the evaluation by experts.

The first circular provides information about the submission of proposals.

The deadline for submitting proposals is 1 December 2014.

For further information:

http://aelinco-en.blogs.uva.es/  

Using Sketch Engine: some how-to resources

General SKE Youtube Channel

Brief overview

Creating your corpus and data upload


Finding metadata for source documents in the Sketch Engine


Concordancer


Using the Expert option: ‘Context’ in the Sketch Engine

Thinning down your concordance output

Using Sketch Difference for lexical disambiguation

Taller sobre herramientas de análisis textual: La herramienta Sketch EngineFacultad de Filología, Universidad Complutense de Madrid, Junio 2011 (última revisión noviembre 2011)

Organizado por:

Doaa Samy (doaa_samy@hotmail.com) (doaasamy@cu.edu.eg)

Ana Fernández Pampillón Cesteros (apampi@filol.ucm.es)

Jorge Arús Hita (jarus@filol.ucm.es)

Receptive and productive mastery of different text varieties

Receptive mastery of different text varieties increases access to information, while productive mastery increases the ability to participate in varying communities. And if you cannot analyze a variety that is new to you, you cannot help yourself or others learn to master it.  Biber & Conrad (2009:4)

Biber, D. & Conrad, S. 2009.
Register, genre and style. CUP.