Free ngram databases from COW14 web corpora

From the corpora list

::::::::::::::::::::::::::::::

We are pleased to announce the release of the first very large ngram databases derived from the giga-token COW14 web corpora. They are completely free (CC-BY) and can be downloaded without registration. We have applied no frequency thresholds whatsoever. In addition to the counted ngram lists, we offer raw versions so that everybody can create their own version. The raw ngrams also contain additional information (crawl year, top-level domain, country geolocation).

There are also English dependency bigrams (based on Malt parses) containing words, their heads, and the dependency relation between them.

For end-users, there are also word and lemma frequency lists with some convenient frequency measures, optionally with a frequency threshold of 10 (smaller files, easier handling).

——————————————————————–

LICENSE AND REFERENCES

License Creative Commons Attribution 4.0 International
References http://corporafromtheweb.org/category/cow-citation/

Please tell us whenever you publish work based on COW:
https://webcorpora.org/publication/

DOWNLOAD

http://hpsg.fu-berlin.de/cow/ngrams/
http://hpsg.fu-berlin.de/cow/frequencies/

ORIGIN AND ORIGINAL CORPUS SIZES

The ngrams are derived from the COW14AX sentence-shuffled corpora.

Information http://corporafromtheweb.org/category/corpora/
Interface https://webcorpora.org/

English 9,578,828,861 tokens (International)
German 11,660,894,000 tokens (AT, CH, DE)
Spanish 3,680,794,644 tokens (International)
Swedish 4,842,753,707 tokens (FI, SV)

FREQUENCY LISTS

Languages English, German, Spanish, Swedish
Versions Lemma, Lemma + POS, Word, Word + POS
Thresholds no threshold; raw frequency > 9
Measures raw frequency, absolute rank, frequency per million, log-frequency per million, frequency band
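
To illustrate how some of the measures listed above relate to each other, here is a minimal Python sketch. It is not the COW tooling and does not read the published files. The function name frequency_measures, the min_freq parameter, the base-10 logarithm and the simple ordinal handling of ties are all assumptions made for illustration. The frequency band is left out because its definition is not given in this announcement.

```python
import math
from collections import Counter


def frequency_measures(tokens, min_freq=1):
    """Return one row per word with raw frequency, rank and per-million values.

    Hypothetical helper for illustration only; the released lists may
    define ranks, ties and logarithms differently.
    """
    counts = Counter(tokens)
    total = sum(counts.values())  # corpus size in tokens

    # Sort by descending frequency, then alphabetically, so ranks are stable.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))

    rows = []
    for rank, (word, freq) in enumerate(ordered, start=1):
        if freq < min_freq:
            # Thresholded version: drop rare items but keep absolute ranks.
            continue
        per_million = freq / total * 1_000_000
        rows.append({
            "word": word,
            "freq": freq,
            "rank": rank,
            "per_million": per_million,
            "log_per_million": math.log10(per_million),  # base 10 is an assumption
        })
    return rows


if __name__ == "__main__":
    for row in frequency_measures("the cat saw the dog".split()):
        print(row)
```

Calling frequency_measures(tokens, min_freq=10) mimics the thresholded lists (raw frequency > 9), with ranks and per-million values still computed against the full, unthresholded counts.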

NGRAMS

N 1 .. 5
Languages English, German, Spanish, Swedish
Versions Raw, Word, Word + POS, Lemma (except Swedish)

DEPENDENCY BIGRAMS

Languages English (German soon, maybe Swedish)
Versions Raw, Word, Word + POS, Lemma, Lemma + POS

CFP International Journal of Bilingual Education and Bilingualism: Special Issue 2017

Through the AESLA mail-list

::::::::::::::::::::::::::::::::::::::::::

 

CALL FOR PAPERS

International Journal of Bilingual Education and Bilingualism: Special Issue 2017

As guest editors (Yolanda Ruiz de Zarobe and Roy Lyster) of a Special Issue of the International Journal of Bilingual Education and Bilingualism, we invite you to submit proposals on the following topic:

Instructional practices and teacher development in Content and Language Integrated Learning (CLIL)

The aim of this Journal is to be thoroughly international in nature. It disseminates high-quality research, theoretical advances, and international developments related to initiatives in bilingualism and bilingual education. Each year the International Journal of Bilingual Education and Bilingualism devotes two of its issues to Special Issues.

Previous Special Issues have tended to receive remarkable praise, particularly as they focus on one issue and often provide a major step forward in the study of a particular

This Special Issue on CLIL seeks:

• To promote theoretical and applied research conducted in the context of CLIL and other content-based programs such as immersion.

• To disseminate information about best practices in content-based instruction.

• To provide a truly international exchange on how CLIL pedagogy is applied in a wide

Authors are invited to submit proposals focusing on instructional practices and teacher development in CLIL at any educational level and in any educational setting. Both state-of-the-art articles and empirical studies are welcomed. Manuscripts submitted should be original, not under review by any other publication and not published

– Deadline for 200-250 word abstracts: 15th September 2015. Proposals should be submitted by email attachment to the co-editors at yolanda.ruizdezarobe@ehu.es and

They should contain the author’s name, affiliation and e-mail address.

– Notification of acceptance/rejection: 1st November 2015. Please note that selection of the proposal does not always guarantee publication.

– Deadline for full papers (no longer than 7,000 words including notes and references): 15th February 2016. Each article will receive two independent and anonymous

 

For further information on the journal’s submission guidelines please visit:

http://www.tandf.co.uk/journals/rbeb

 

CFP Posters on late-breaking results June 15 deadline

Through the corpora list

:::::::::::::::::::::::::::::::::
CORPUS LINGUISTICS 2015

The CL2015 organising committee is pleased to issue a call for posters on late-breaking results on any of the topics in the conference’s scope. By “late-breaking” we mean research which was not at a sufficiently advanced stage for an abstract submission to be made in the main submission cycle, but which has now reached that point.

We anticipate that the research in question will still be in its earliest phases. “Late-breaking results” include – but are not necessarily limited to – pilot study results, corpus creation activities currently in hand, newly-developed software, and so on.

· Abstracts should be 400-750 words in length. They must be formatted using the conference stylesheet (available to download from http://ucrel.lancs.ac.uk/cl2015/call.php).

· We especially encourage submission of abstracts from early-career researchers, including postgraduate research students and postdoctoral researchers.

· Abstracts which were previously submitted for the January deadline, and not accepted, are NOT eligible to be resubmitted.

· Abstracts should be submitted by email to cl2015@lancaster.ac.uk by 15th June 2015.

· As with all presentations, at least one author of any late-submission poster must attend the conference.
For more details see http://ucrel.lancs.ac.uk/cl2015

An archive copy of the previously-circulated CL2015 Call for Participation may be found here: http://ucrel.lancs.ac.uk/cl2015/doc/CL2015-CallParticipation.pdf

Andrew Hardie, Tony McEnery, Amanda Potts, Vaclav Brezina, and Paul Rayson
The CL2015 Organising Committee

“Culture & Technology” – European Summer University in Digital Humanities

Through the corpora list

:::::::::::::::::::::::::::::::::::

“Culture & Technology” – European Summer University in Digital Humanities (ESU DH C & T) 28th of July – 07th of August 2015, University of Leipzig http://www.culingtec.uni-leipzig.de/ESU_C_T/

As the application phase closes soon (31st of May 2015) we would like to draw your attention (again) to the various types of support which are available for participants of the European Summer School (see: http://www.culingtec.uni-leipzig.de/ESU_C_T/node/480):
The German Academic Exchange Service (DAAD) offers very generous support to up to 17 alumni / alumnae of German universities. Former Erasmus students and students / researchers of Universities of Applied Science, Art or Music Schools also qualify as alumni / alumnae as long as they have spent a total of 3 months at academic institutions in Germany.
The University of Leipzig, through its International Centre, makes available up to 10 bursaries for members of its Eastern European partner universities.
CLARIN-DE makes available up to 13 fellowships which cover tuition fees. If funding allows, an allowance of up to € 200 will be granted to cover costs of living.
The Electronic Textual Cultures Lab at the University of Victoria (etcl), in conjunction with the Digital Humanities Summer Institute, offers up to 5 tuition fellowships for international graduate students and postdoctoral fellows.
As ESU DH C & T is a member of the International Digital Humanities Training Network, courses taken at the Summer University are eligible for transfer credit towards the University of Victoria Graduate Certificate in DH (http://english.uvic.ca/graduate/digital_humanities.html).

The Summer University takes place across 11 whole days. The intensive programme consists of workshops, public lectures, regular project presentations, a poster session and a panel discussion. The workshop programme is composed of the following thematic strands:
XML-TEI encoding, structuring and rendering
Methods and Tools for the Corpus Annotation of Historical and Contemporary Written Texts
Comparing Corpora
Spoken Language and Multimodal Corpora
Python
Basic Statistics and Visualization with R
Stylometry
Open Greek and Latin
Digital Editions and Editorial Theory: Historical Texts and Documents
Spatial Analysis in the Humanities
Building Thematic Research Collections with Drupal
Introduction to Project Management
Each workshop consists of a total of 16 sessions or 32 week-hours. The number of participants in each workshop is limited to 10. Workshops are structured in such a way that participants can either take the two blocks of one workshop or two blocks from different workshops.

Descriptions of all workshops can be found in at least two languages at http://www.culingtec.uni-leipzig.de/ESU_C_T/node/481. Short bios of most workshop leaders are available in at least two languages at http://www.culingtec.uni-leipzig.de/ESU_C_T/node/488.

Applications are considered on a rolling basis. The selection of participants is made by the Scientific Committee together with the experts who lead the workshops.

Participation fees are the same as last year.

The Summer University is directed at 60 participants from all over Europe and beyond. It aims to bring together (doctoral) students, young scholars and academics from the Arts and Humanities, Library Sciences, Social Sciences, Engineering and Computer Sciences as equal partners in an interdisciplinary exchange of knowledge and experience in a multilingual and multicultural context, and thus to create the conditions for future project-based cooperation and network-building across the borders of disciplines, countries and cultures.

The Summer University seeks to offer a space for the discussion and acquisition of new knowledge, skills and competences in those computer technologies which play a central role in Humanities Computing and which increasingly shape the work done in the Humanities and Cultural Sciences, as well as in publishing, libraries, and archives, to name only some of the most important areas. The Summer University aims at integrating these activities into the broader context of the Digital Humanities, which pose questions about the consequences and implications of the application of computational methods and tools to cultural artefacts of all kinds.

In all this the Summer University aims at confronting the so-called Gender Divide, i.e. the under-representation of women in the domain of Information and Communication Technologies (ICT) in Germany and Europe. But, instead of strengthening the hard sciences as such by following the way taken by so many measures which focus on the so-called STEM disciplines and try to convince women of the attractiveness and importance of Computer Science or Engineering, the Summer University relies on the challenges that the Humanities with their complex data and their wealth of women represent for Computer Science and Engineering and the further development of the latter, on the overcoming of the borders between the so-called hard and soft sciences and on the integration of Humanities, Computer Science and Engineering.

As the Summer University is dedicated not only to the acquisition of knowledge and skills, but also wants to foster community building and networking across disciplines, languages and cultures, countries and continents, the programme of the Summer School also features communal coffee breaks, communal lunches in the refectory of the university, and a rich cultural programme (thematic guided tours, visits to archives, museums and exhibitions, and communal dinners in different parts of Leipzig).

For all relevant information please consult the Web-Portal of the European Summer School in Digital Humanities “Culture & Technology”: http://www.culingtec.uni-leipzig.de/ESU_C_T/ which will be continually updated and integrated with more information as soon as it becomes available.

With best regards, Elisabeth Burr

Prof. Dr. Elisabeth Burr
Französische / frankophone und italienische Sprachwissenschaft
Institut für Romanistik
Universität Leipzig
Beethovenstr. 15
D-04107 Leipzig
http://www.uni-leipzig.de/~burr

Intensive language courses @umnoticias

Servicio de Idiomas de la Universidad de Murcia: INTENSIVE COURSES, JULY 2015

Intensive courses for JULY 2015.

– PRE-REGISTRATION: 20 MAY to 3 JUNE 2015
– ENROLMENT: 17 to 23 JUNE 2015 (depending on access group)
– Course dates: 29 JUNE to 24 JULY 2015

PRE-REGISTRATION is compulsory for everyone (CONTINUING SIDI STUDENTS and NEW STUDENTS). To PRE-REGISTER ONLINE, click the following link and then select the language you want: https://lince.um.es/lince/servlet/um.lince.ControlCursosLince?opcion=idiomas&idioma=I017

More information at http://www.um.es/idiomas/