#corpusMOOC Corpus Linguistics: Method, Analysis, Interpretation starts Sept 29


This free MOOC offers a practical introduction to the methodology of corpus linguistics for researchers in the social sciences and humanities. It is an 8-week course run by Lancaster University.

More information here.



Language Processing with Perl and Prolog, 2nd edition @Springer

By Pierre Nugues
Published by Springer

This book has a companion website at http://ilppp.cs.lth.se/
and can be ordered from Springer: http://www.springer.com/978-3-642-41463-3
or Amazon: http://www.amazon.com/Language-Processing-Perl-Prolog-Implementation/dp/364241463X/

This book teaches the principles of natural language processing. It first covers practical linguistic issues such as encoding and annotation schemes, defining words, tokens and parts of speech, and morphology, as well as key concepts in machine learning, such as entropy, regression, and classification, which are used throughout the book. It then details the language-processing functions involved: part-of-speech tagging using rules and stochastic techniques; using Prolog to write phrase-structure grammars; syntactic formalisms and constituent and dependency parsing techniques; semantics, predicate logic, and lexical semantics; and analysis of discourse and applications in dialogue systems. A key feature of the book is the author’s hands-on approach throughout, with sample code in Prolog and Perl, extensive exercises, and a detailed introduction to Prolog. The reader is supported with a companion website that contains teaching slides, programs, and additional material.

The second edition is a complete revision of the techniques presented in the first edition, reflecting advances in the field. The author redesigned or updated all the chapters, added two new ones, and considerably expanded the sections on machine-learning techniques.

1 An Overview of Language Processing
2 Corpus Processing Tools
3 Encoding and Annotation Schemes
4 Topics in Information Theory and Machine Learning
5 Counting Words
6 Words, Parts of Speech, and Morphology
7 Part-of-Speech Tagging Using Rules
8 Part-of-Speech Tagging Using Statistical Techniques
9 Phrase-Structure Grammars in Prolog
10 Partial Parsing
11 Syntactic Formalisms
12 Constituent Parsing
13 Dependency Parsing
14 Semantics and Predicate Logic
15 Lexical Semantics
16 Discourse
17 Dialogue
Appendix: An Introduction to Prolog