Timpute 0.3 (c) ILK 2007-2008 Timpute is distributed under the GNU General Public Licence (see the file LICENSE) Timpute : TiMBL based data imputation Induction of Linguistic Knowledge Research Group, Tilburg University & Naturalis National Museum of Natural History, Leiden, The Netherlands http://ilk.uvt.nl/timpute This is the second release of Timpute TiMBL 6 is required to operate this software. The latest version can be downloaded from http://ilk.uvt.nl/timbl The TiMBL executable should also be available in your PATH. Timpute is a perl package based on the TiMBL software that self-corrects the contents of each cell in a database based on the rest of the database. Timpute is essentially a wrapper that processes the database and passes it piece by piece to TiMBL, whose output is parsed into a csv file again. The easiest method to invoke Timpute is have it run with default settings and replace every cell in every column with its corrections. The required file format at this time is comma seperated values, with the first row containing column names, and every cell contained within doublequotes. To run Timpute on the sample file included, invoke ./timpute.pl -f reptile.csv -o reptile_timputed.csv -p or to run on the xml sample file invoke ./timpute.pl --xml --xml_config reptile.conf.xml -f reptile.xml -o reptile_timputed.xml The command above specifies the input file and the output file, which will contain altered data cells changed by Timpute. The -p option specifies that Timpute should replace the contents of a cell if Timpute disagrees with the original value. More options are listed by typing ./timpute.pl --help This is very much a beta version and as such may contain bugs or improperly working features. Comments or bug reports are welcome at : s.j.hunt@uvt.nl