MBT: Memory-Based Tagger
MBT: Memory-based tagger generation and tagging

MBT is a memory-based tagger-generator and tagger in one. The tagger-generator part can generate a sequence tagger on the basis of a training set of tagged sequences; the tagger part can tag new sequences. MBT can, for instance, be used to generate part-of-speech taggers or chunkers for natural language processing. It has also been used for named-entity recognition, information extraction in domain-specific texts, and disfluency chunking in transcribed speech.

Features
  • Tagger generation: tagged text in, tagger out
  • Optional feedback loop: feed previous tag decision back to input of next decision
  • Easily customizable feature representation
  • Allows user-provided features
  • Automatic generation of separate sub-taggers for known words and unknown words
  • Can make use of full algorithmic parameters of TiMBL
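
The features above can be illustrated with a toy sketch in Python. This is not MBT's implementation (MBT delegates classification to TiMBL); it is a minimal nearest-neighbour tagger written under stated assumptions. All function names (`train`, `tag`, `classify`, `known_features`, `unknown_features`) and the feature choices (previous tag, focus word, next word; two-letter suffix and capitalization for unknown words) are hypothetical and chosen only for illustration. It shows tagged text going in and a tagger coming out, the previous tag decision being fed back into the next decision, and separate instance memories for known and unknown words.

```python
from collections import Counter


def overlap(a, b):
    """Count the feature positions on which two instances agree."""
    return sum(x == y for x, y in zip(a, b))


def classify(memory, features):
    """Return the most frequent tag among the best-matching stored instances."""
    best = max(overlap(f, features) for f, _ in memory)
    votes = Counter(t for f, t in memory if overlap(f, features) == best)
    return votes.most_common(1)[0][0]


def known_features(words, i, prev_tag):
    """Assumed features for known words: previous tag, focus word, next word."""
    nxt = words[i + 1] if i + 1 < len(words) else "</s>"
    return (prev_tag, words[i], nxt)


def unknown_features(words, i, prev_tag):
    """Assumed features for unknown words: previous tag, suffix, capitalization."""
    w = words[i]
    return (prev_tag, w[-2:], w[:1].isupper())


def train(tagged_sentences):
    """Tagged text in, tagger out: store every training instance in memory."""
    lexicon, known, unknown = set(), [], []
    for sent in tagged_sentences:
        words = [w for w, _ in sent]
        prev = "<s>"
        for i, (w, t) in enumerate(sent):
            lexicon.add(w)
            known.append((known_features(words, i, prev), t))
            unknown.append((unknown_features(words, i, prev), t))
            prev = t  # gold tag serves as left context during training
    return lexicon, known, unknown


def tag(model, words):
    """Tag left to right, feeding each predicted tag into the next decision."""
    lexicon, known, unknown = model
    prev, out = "<s>", []
    for i, w in enumerate(words):
        if w in lexicon:
            t = classify(known, known_features(words, i, prev))
        else:
            t = classify(unknown, unknown_features(words, i, prev))
        out.append(t)
        prev = t  # feedback loop
    return out


if __name__ == "__main__":
    data = [
        [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
        [("a", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    ]
    model = train(data)
    print(tag(model, ["the", "bird", "sleeps"]))  # "bird" is an unknown word
```

The sketch uses a plain overlap count with a majority vote over the nearest instances; the distance metrics, feature weighting, and other algorithmic parameters that TiMBL offers are deliberately left out.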
Documents and reference

MBT resources and links

Installing MBT requires an installed version of TiMBL, version 6.1.2 or higher. MBT will not work with earlier versions.

  • TiMBL: Tilburg Memory-Based Learner
  • MBSP: demo of memory-based English shallow parsing, including Mbt
  • CGN Tagger-Lemmatizer: demo of Dutch PoS tagging with the CGN tag set, including Mbt
  • Kiswahili PoS tagger: demo using Mbt, at aflat.org

Registration

If you want to be informed about future MBT releases, please send an email to timbl@uvt.nl. We will not use your email address for any other purpose.

Further information

Walter Daelemans
CLiPS, Computational Linguistics and Psycholinguistics Research Center
University of Antwerp

Antal van den Bosch
ILK, Induction of Linguistic Knowledge Research Group
Tilburg University

Archived versions

1.0    2.0    2.0.1    3.0    3.1    3.1.2    3.1.3

Antal.vdnBosch@uvt.nl | Last update: Mon Dec 14 2009