| |
|
|
|
|
|
MBT: Memory-based tagger generation and tagging
MBT is a memory-based tagger-generator and
tagger in one. The tagger-generator part can generate a
sequence tagger on the basis of a training set of tagged sequences;
the tagger part can tag new sequences. MBT can, for instance,
be used to generate part-of-speech taggers or chunkers for natural
language processing. It has also been used for named-entity recognition, information extraction in domain-specific texts, and disfluency chunking in transcribed speech.
|
|
|
|
|
|
|
|
|
|
Features
- Tagger generation: tagged text in, tagger out
- Optional feedback loop: feed previous tag decision back to input of next decision
- Easily customizable feature representation
- Allows user-provided features
- Automatic generation of separate sub-taggers for known words and unknown words
- Can make use of full algorithmic parameters of TiMBL
- NEW: server mode is now available though a separate package, MbtServer.
- Debian, Ubuntu, RPM, and Fink packages available
Documents and reference
- MBT
v.3.2 is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public
License as published by the Free
Software Foundation.
- Reference
Guide (15 pages, 110 kB PDF); Daelemans, W.,
Zavrel, J., Van den Bosch, A., and Van der Sloot, K. (2010). MBT:
Memory-Based Tagger, version 3.2, Reference Guide. ILK Technical
Report Series 10-04.
- Recent Advances in
Memory-Based Part-of-Speech Tagging. Jakub Zavrel and Walter
Daelemans. in: Actas del VI Simposio Internacional de Comunicacion
Social, Santiago de Cuba, pp. 590-597, 1999. ILK pub:
ILK-9903.
-
MBT: A Memory-Based Part of Speech Tagger-Generator. Walter
Daelemans, Jakub Zavrel, Peter Berck and Steven Gillis. in: E. Ejerhed
and I. Dagan (eds.) Proceedings of the Fourth Workshop on Very Large
Corpora, Copenhagen, Denmark, 14-27, 1996.
- Part-of-Speech
Tagging for Dutch with MBT, a Memory-based Tagger
Generator. Walter Daelemans, Jakub Zavrel, Peter Berck, in:
Congresboek van de Interdisciplinaire Onderzoeksconferentie
Informatiewetenchap 1996, TU Delft.
- Memory-Based Language
Processing. Daelemans, W., and Van den Bosch,
A. (2005). Cambridge, UK: Cambridge University Press
|
|
|
|
|
|
|
|
|
|
|
|
MBT download and links
Consult these installation
instructions for details on how to install this software if you are using a Debian, Ubuntu, or Fedora-based
system. If you want to build the code from source yourself, download
MBT server
functionality is now in a separate
package, MbtServer. If
previously you used MBT in server mode, you will now need to install
and run MbtServer:
An installation of MBT assumes installed versions of
TiMBL, version 6.4 or
higher, and TimblServer,
version 1.3 or higher. See:
|
|
|
|
|
|
|
|