Pérez-Paredes

All things corpus & applied linguistics (a bit of guitar too)

Corpora and metadata

Lou Burnard:

[…] it is no exaggeration to say that without metadata, corpus linguistics would be virtually impossible. Why? Because corpus linguistics is an empirical science, in which the investigator seeks to identify patterns of linguistic behaviour by inspection and analysis of naturally occurring samples of language. A typical corpus analysis will therefore gather together many examples of linguistic usage, each taken out of the context in which it originally occurred, like a laboratory specimen. Metadata can restore that context by supplying information about it, thus enabling us to relate the specimen to its original habitat. Furthermore, since language corpora are constructed from pre-existing pieces of language, questions of accuracy and authenticity are all but inevitable when using them: without metadata, the investigator has no way of answering such questions. Without metadata, the investigator has nothing but disconnected words of unknowable provenance or authenticity[1].


[1] URL: http://users.ox.ac.uk/~lou/wip/metadata.html
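
Burnard's point about relating each specimen to its original habitat can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration only: the two toy texts, the metadata fields (title, year, genre), and the helper name concordance are hypothetical and come neither from the BNC nor from Burnard's text. Each concordance hit simply carries the metadata record of the text it was taken from, so the decontextualised example can still be traced back to its source.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextMetadata:
    """Hypothetical metadata record for one corpus text."""
    text_id: str
    title: str
    year: int
    genre: str


# Toy corpus and metadata, invented for this illustration.
TEXTS = {
    "t1": "The corpus was compiled from newspapers.",
    "t2": "We built a small corpus of learner essays.",
}
METADATA = {
    "t1": TextMetadata("t1", "Sample news report", 1995, "news"),
    "t2": TextMetadata("t2", "Sample research note", 2010, "academic"),
}


def concordance(word, texts, metadata, window=3):
    """Return (left context, keyword, right context, metadata) for each hit."""
    hits = []
    for text_id, text in texts.items():
        tokens = text.split()
        for i, token in enumerate(tokens):
            # Compare case-insensitively, ignoring trailing punctuation.
            if token.lower().strip(".,;:!?") == word.lower():
                left = " ".join(tokens[max(0, i - window):i])
                right = " ".join(tokens[i + 1:i + 1 + window])
                hits.append((left, token, right, metadata[text_id]))
    return hits


if __name__ == "__main__":
    for left, keyword, right, meta in concordance("corpus", TEXTS, METADATA):
        print(f"{left} [{keyword}] {right}  <- {meta.title} ({meta.year}, {meta.genre})")
```

Without the last element of each tuple, the hits would be exactly the "disconnected words of unknowable provenance" that the quotation warns about.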

References: Burnard, Lou; Aston, Guy (1998). The BNC handbook: exploring the British National Corpus. Edinburgh: Edinburgh University Press.

Posted on 29th December 2019. Author: perezparedes. Categories: corpus linguistics. Tags: Metadata.