Language Technology:
Machine Learning of Language
UIA / UA
Linguistics, Department of Germanic languages and literature
guest lecturer: Antal van den
Bosch (antalb@uvt.nl)
February - May 2004
Fridays, 13.30 - 17.00
Description
In the past two decades, many developers of natural language processing
(NLP) systems have started to use probabilistic and machine learning
methods for the automatic construction of NLP systems. Circumstantial
reasons for this evolution are the growing availability of annotated
language data, and the increased capacities of computers, but the main
reason is that in many NLP areas these methods have alleviated the
knowledge acquisition bottleneck that plagued NLP before, and have
resulted in practical and accurate applications. On the other
hand, due to their dependence on data, they have introduced a data
acquisition bottleneck.
The course gives an overview of this recent evolution, and handles the
following aspects of it:
-
historical motivation and background: pre-Chomskyan
linguistics, pattern recognition, artificial intelligence
-
applications: morpho-phonology, syntax, semantics,
discourse
-
linguistic issues: memory, modularity, representation
-
data issues: scaling, automated/automatic annotation
Schedule
- [February 20 2004]
- [February 27 2004]
- [March 5 2004]
- [March 12 2004]
- [March 19 2004]
- [March 26 2004]
- [April 2 2004]
- [April 23 2004]
- [April 30 2004]
Links
Relevant literature
-
Jurafsky, D., and Martin, J. (2000). Speech and language
processing: An Introduction to Natural Language Processing,
Computational Linguistics, and Speech Recognition. Prentice-Hall.
-
Mitchell, Tom (1997). Machine
Learning. McGraw Hill.
-
Manning, C., and Schütze, H. (1999). Foundations of
Statistical Natural Language Processing. MIT Press.
-
Witten, I., and Frank, E. WEKA
Tutorial.