| |
|
|
| |
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2010
Contextual factors for finding similar experts
Author(s): Katja Hofmann, Krisztian Balog, Toine Bogers, and Maarten de Rijke
Reference: Journal for the American Society for Information Science and Technology (early view)
[doi]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2009
Putting the t where it belongs: Solving a confusion problem in Dutch
Author(s): Herman Stehouwer and Antal van den Bosch
Reference: In S. Verberne, H. van Halteren, and P.-A. Coppen (Eds.), Computational Linguistics in the Netherlands 2007: Selected Papers from the 18th CLIN Meeting, January 22, 2009, Groningen, pp. 21-36.
[pdf]
Memory-based machine translation and language modeling
Author(s): Antal van den Bosch and Peter Berck
Reference: The Prague Bulletin of Mathematical Linguistics No. 91, pp. 17-26.
[pdf]
Using language modeling for spam detection in social reference manager websites
Author(s): Toine Bogers and Antal van den Bosch
Reference: In R. Aly, C. Hauff, I. den Hamer, D. Hiemstra,
T. Huibers, and F. de Jong (Eds.), Proceedings of the 9th
Belgian-Dutch Information Retrieval Workshop (DIR 2009), pp 87-94.
[pdf]
A semantic relatedness metric based on free link structure
Author(s): Sander Wubben and Antal van den Bosch
Reference: In H.C. Bunt, V. Petukhova, and S. Wubben (Eds.), Proceedings of the Eighth International Conference on Computational Semantics (IWCS-8), pp. 355-359.
[pdf]
Comparing alternative data-driven ontological vistas of natural history
Instance-driven discovery of ontological relation labels
Author(s): Marieke van Erp, Antal van den Bosch, Sander Wubben, and Steve Hunt
Reference: In Proceedings of the EACL 2009 Workshop on Language Technology and Resources for Cultural Heritage, Social Sciences, Humanities, and Education (LaTeCH-SHELT&R 2009), Athens, Greece, pp. 60-68.
[pdf]
Clustering and matching headlines for automatic paraphrase acquisition
Author(s): Sander Wubben, Antal van den Bosch, Emiel Krahmer, and Erwin Marsi
Reference: In Proceedings of the 12th European Workshop on Natural Language Generation (ENLG 2009), Athens, Greece, pp. 122-125.
[pdf]
Language models for contextual error detection and correction
Author(s): Herman Stehouwer and Menno van Zaanen
Reference: In Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference, Athens, Greece, pp. 41-48.
[pdf]
Design and evaluation of a university-wide expert search engine
Author(s): Ruud Liebregts and Toine Bogers
Reference: In Proceedings of the 31st European Conference on Information Retrieval (ECIR 2009), vol. 5478 of Lecture Notes in Computer Science, Toulouse, France, pp. 587-594, Springer Verlag.
[pdf]
Making a clean sweep of cultural heritage
A constraint satisfaction approach to machine translation
Author(s): Sander Canisius and Antal van den Bosch
Reference: In H. Somers and L. Màrquez (Eds.), Proceedings of the 13th Annual Conference of the European Association for Machine Translation (EAMT-2009), pp. 182-189.
[pdf]
Joint memory-based learning of syntactic and semantic dependencies in multiple languages
Author(s): Roser Morante, Vincent Van Asch, and Antal van den Bosch
Reference: In Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL): Shared Task, pp. 25-30. Stroudsburg, PA: Association for Computational Linguistics.
[pdf]
Digital discoveries in museums, libraries, and archives: Computer
science meets cultural heritage
Author(s): Antal van den Bosch, Jaap van den Herik, and Paul Doorenbosch
Reference: Interdisciplinary Science Review, 34:2-3, pp. 129-138.
[pdf]
Weaving a new fabric of natural history
Author(s): Antal van den Bosch, Piroska Lendvai, Marieke van Erp, Steve Hunt, Marian van der Meij, and René Dekker
Reference: Interdisciplinary Science Review, 34:2-3, pp. 206-223.
[pdf]
Parallel identification of the spelling variants in corpora
Author(s): Martin Reynaert
References: In Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data 2009 (AND-2009), Barcelona, Spain, pp. 77--84.
[pdf]
Dependency parsing and semantic role labeling as a single task
Extending memory-based machine translation to phrases
Dependency relations as source context in phrase-based SMT
Author(s): Rejwanul Haque, Sudip Kumar Naskar, Antal van den Bosch, and Andy Way
Reference: In Proceedings of PACLIC 23: the 23rd Pacific Asia Conference on Language, Information and Computation, Hong Kong, China, pp.170-179.
[pdf]
Token merging in language model-based confusible disambiguation
Author(s): Herman Stehouwer and Menno van Zaanen
Reference: In T. Calders, K. Tuyls, and M. Pechinizkiy (Eds.), Proceedings of the 21st Benelux Conference on Artificial Intelligence (BNAIC-2009), pp. 241-248.
[pdf]
Collaborative and content-based filtering for item recommendation on social bookmarking websites
Author(s): Toine Bogers and Antal van den Bosch
Reference: In D. Jannach, W. Geyer, J. Freyne, S. S. Anand,
C. Dugan, B. Mobasher, and A. Kobsa (Eds.), Proceedings of the ACM
RecSys '09 workshop on Recommender Systems and the Social Web,
pp. 9-16, October 2009.
[pdf]
Feature construction for memory-based semantic role labeling of Catalan and Spanish
Author(s): Roser Morante and Antal van den Bosch
Reference: In N. Nicolov, G. Angelova, and R. Mitkov (Eds.), Recent Advances in Natural Language Processing V, Current Issues in Linguistic Theory 309, pp. 131-142. Amsterdam, the Netherlands: John Benjamins.
[publisher's webpage]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2008
Analysis of joint inference strategies for the semantic role labeling of Spanish and Catalan
Author(s): Mihai Surdeanu, Roser Morante, and Lluís Màrquez
Reference: In A. Gelbukh (Ed.), Proceedings of the Computational Linguistics and Intelligent Text Processing 9th International Conference, CICLing 2008. Lecture Notes in Computer Science Vol. 4919/2008, Berlin / Heidelberg: Springer, pp. 206-218.
[pdf]
Alignment-based expansion of textual database fields
Author(s): Piroska Lendvai
Reference: In A. Gelbukh (Ed.), Proceedings of the Computational Linguistics and Intelligent Text Processing 9th International Conference, CICLing 2008. Lecture Notes in Computer Science Vol. 4919/2008, Berlin / Heidelberg: Springer, pp. 522-531.
[pdf, first page]
Non-interactive OCR post-correction for giga-scale digitization projects
Author(s): Martin Reynaert
Reference: In A. Gelbukh (Ed.), Proceedings of the Computational Linguistics and Intelligent Text Processing 9th International Conference, CICLing 2008. Lecture Notes in Computer Science Vol. 4919/2008, Berlin / Heidelberg: Springer, pp. 617-630.
[pdf, first page - corrected, post-publication version]
A modular approach to learning Dutch co-reference
Author(s): Véronique Hoste and Antal van den Bosch
Reference: In C. Johansson (Ed.), Proceedings from the First Bergen Workshop on Anaphora Resolution (WAR I), Bergen, Norway, pp. 51-75.
[pdf]
Using citation analysis for expert retrieval in workgroups
Author(s): Toine Bogers, Klaas Kox, and Antal van den Bosch
Reference: In E. Hoenkamp, M. de Cock, and V. Hoste (Eds.), Proceedings of the 8th Belgian-Dutch Information Retrieval Workshop (DIR 2008), pp 21-28. Maastricht, April 2008.
[pdf]
Experiments with an ensemble of Spanish dependency parsers
Author(s): Roser Morante
Reference: Procesamiento del Lenguaje Natural, Revista no. 40, pp. 59-66.
[pdf]
Semantic role labeling tools trained on the Cast3LB-CoNNL-SemRol Corpus
Author(s): Roser Morante
Reference: In Proceedings of the Sixth International Language Resources and Evaluation (LREC'08). Marrakech, Morocco, 2008.
[pdf]
From D-Coi to SoNaR: A reference corpus for Dutch
Author(s): Nelleke Oostdijk, Martin Reynaert, Paola Monachesi, Gertjan Van Noord, Roeland Ordelman, Ineke Schuurman and Vincent Vandeghinste
Reference: In Proceedings of the Sixth International Language Resources and Evaluation (LREC'08). Marrakech, Morocco, 2008.
[pdf]
All, and only, the errors: More complete and consistent spelling and OCR-error correction evaluation
Author(s): Martin Reynaert
Reference: In Proceedings of the Sixth International Language Resources and Evaluation (LREC'08). Marrakech,Morocco, 2008.
[pdf]
From field notes towards a knowledge base
Author(s): Piroska Lendvai and Steve Hunt
Reference: In Proceedings of the Sixth International Language Resources and Evaluation (LREC'08). Marrakech,Morocco, 2008.
[pdf]
A Personalized Recommender System for Writing in the Internet Age
Author(s): Mari Carmen Puerta Melguizo, Olga Munoz Ramos, Lou Boves, Toine Bogers, and Antal van den Bosch
Reference: In Proceedings of the LREC 2008 Workshop on Natural Language Processing Resources, Algorithms, and Tools for Authoring Aid. Marrakech, Morocco, 2008.
Integrating Contextual Factors into Topic-Centric Retrieval Methods for Finding Similar Experts
Author(s): Katja Hofmann, Krisztian Balog, Toine Bogers, and Maarten de Rijke
Reference: In Proceedings of the SIGIR 2008 Workshop on Future Challenges in Expert Retrieval, pp 29-36. Singapore, Singapore, July 2008.
[pdf]
Efficient Context-Sensitive Word Completion for Mobile Devices
Author(s): Antal van den Bosch and Toine Bogers
Reference: In MobileHCI 2008: Proceedings of the 10th International Conference on Human-Computer Interaction with Mobile Devices and Services, IOP-MMI special track, pp 465-470. Amsterdam, The Netherlands, September 2008.
[pdf]
Using Language Models for Spam Detection in Social Bookmarking
Author(s): Toine Bogers and Antal van den Bosch
Reference: In Proceedings of 2008 ECML/PKDD Discovery Challenge Workshop, pp 1-12. Antwerp, Belgium, September 2008.
[pdf]
Recommending Scientific Articles using CiteULike
Author(s): Toine Bogers and Antal van den Bosch
Reference: In RecSys '08: Proceedings of the 2008 ACM Conference on Recommender Systems, pp 287-290, ACM Press, October 2008.
[pdf]
Learning the Scope of Negation in Biomedical Texts
Author(s): Roser Morante, Anthony Liekens, and Walter Daelemans
Reference: In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pp. 715-724.
[pdf]
Preparing archeological reports for intelligent retrieval
Author(s): Hans Paijmans and Sander Wubben
Reference: In Posluschny, K. Lambers, & I. Herzog (Eds.) Layers of Perception. Proceedings of the 35th International Conference on Computer Applications and Quantitative Methods in Archaology, Bonn: Dr. Rudolf Habelt GmbH.
[pdf]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2007
A memory-based classification approach to marker-based EBMT
Author(s): Antal van den Bosch, Nicolas Stroppa, and Andy Way
Reference: In F. Van Eynde, V. Vandeghinste, and I. Schuurman (Eds.), Proceedings of the METIS-II Workshop on New Approaches to Machine Translation, 63-72. January 11, 2007, Leuven, Belgium.
[pdf]
Memory Based Learning and the interpretation of Numbers in archaeological Reports
Author(s): Hans Paijmans and Sander Wubben
Reference: In M.F. Moens, T. Tuytelaars, & A.P. de Vries (Eds.), Proceedings of the 7th Dutch Belgian Information Retrieval Workshop (DIR2007), pp. 51-56, Leuven, Belgium
[pdf]
Learning to segment and label semi-structured documents with little or no supervision
Author(s): Sander Canisius and Caroline Sporleder
Reference: In P. Adriaans, M. van Someren, and S. Katrenko (Eds.), Proceedings of the 18th BENELEARN Conference. May 14, 2007, Amsterdam, The Netherlands.
[pdf]
Superlinear parallelisation of the k-nearest neighbor classifier
Author(s): Antal van den Bosch and Ko van der Sloot
Reference: In P. Adriaans, M. van Someren, and S. Katrenko (Eds.), Proceedings of the 18th BENELEARN Conference. May 14, 2007, Amsterdam, The Netherlands.
[pdf]
Automatic techniques for generating and correcting cultural heritage collection metadata
Author(s): Antal van den Bosch, Caroline Sporleder, Marieke van
Erp, and Steve Hunt Reference: In Proceedings of Digital
Humanities 2007, the 19th Joint International Conference of the
Association for Computers and the Humanities and the Association for
Literary and Linguistic Computing, University of Illinois at Urbana-Champaign, Illinois, US, pp. 223-224. [online
abstract]
Retrieving lost information from textual databases: Rediscovering expeditions from an animal specimen database
Author(s): Marieke van Erp
Reference: In Proceedings of the Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2007), Prague, Czech Republic, pp. 17-24.
[pdf, bib]
Bootstrapping information extraction from field books
Author(s): Sander Canisius and Caroline Sporleder
Reference: In Proceedings of the 2007 Joint Conference on
Empirical Methods in Natural Language Processing and Computational
Natural Language Learning (EMNLP-CoNLL), Prague, Czech Republic, pp. 827-836.
[pdf, bib]
A constraint satisfaction approach to dependency parsing
Author(s): Sander Canisius and Erik Tjong Kim Sang
Reference: In Proceedings of the CoNLL Shared Task
Session of EMNLP-CoNLL 2007, Prague, Czech Republic, pp. 1124-1128.
[pdf, bib]
ILK: Machine learning of semantic relations with shallow features and almost no data
ILK2: Semantic role labeling of Catalan and Spanish using TiMBL
Author(s): Roser Morante and Bertjan Busser
Reference: In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, pp. 183-186.
[pdf, bib]
What a proactive recommendation system needs: Relevance, non-intrusiveness, and a new long-term memory
Author(s): Mari-Carmen Puerta Melguizo, Toine Bogers, Anita Deshpande, Lou Boves, and Antal van den Bosch
Reference: In Proceedings of the 9th International Conference on Enterprise Information Systems (ICEIS 2007), Funchal, Madeira.
[pdf]
Broad expertise retrieval for sparse data environments
Author(s): Krisztian Balog, Toine Bogers, Leif Azzopardi, Maarten de Rijke, and Antal van den Bosch
Reference: In SIGIR'07: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, the Netherlands, pp. 551-558.
[pdf]
Token-based chunking of turn-internal dialogue act sequences
Author(s): Piroska Lendvai and Jeroen Geertzen
Reference: In Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, Antwerp, Belgium, pp. 174-181.
[pdf]
Memory-based semantic role labeling of Catalan and Spanish
Author(s): Roser Morante and Antal van den Bosch
Reference: In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-2007), Borovets, Bulgaria, pp. 388-394.
[pdf]
Recompiling a knowledge-based dependency parser into memory
Author(s): Sander Canisius and Antal van den Bosch
Reference: In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-2007), Borovets, Bulgaria, pp. 104-108.
[pdf]
Arabic computational morphology: Knowledge-based and empirical methods
Memory-based morphological analysis and part-of-speech tagging of Arabic
Author(s): Antal van den Bosch, Erwin Marsi, and Abdelhadi Soudi
Reference: In Soudi, A., Van den Bosch, A., and Neumann, G. (Eds), Arabic computational morphology: Knowledge-based and empirical methods, Chapter 11, pp. 203-219. Berlin: Springer.
[pdf of preprint]
Comparing and evaluating information retrieval algorithms for news recommendation
Author(s): Toine Bogers and Antal van den Bosch
Reference: In Proceedings of the 2007 ACM Conference on Recommender Systems, Minneapolis, MN, pp. 141-144, ACM Press
[pdf]
Open Boek: A system for the extraction of numeric data from archeological reports
Author(s): Hans Paijmans and Sander Wubben
Reference: In Proceedings of the UK e-Science 2007 All Hands Meeting, Nottingham, UK
[pdf]
Superlinear parallelization of k-nearest neighbor retrieval
Author(s): Antal van den Bosch and Ko van der Sloot
Reference: In M. Dastani and E. de Jong (Eds.), Proceedings of the 19th Belgian-Dutch Artificial Intelligence Conference (BNAIC-2007), Utrecht, The Netherlands, pp. 65-72.
[pdf]
Exploiting source similarity for SMT using context-informed features
Author(s): Nicolas Stroppa, Antal van den Bosch, and Andy Way
Reference: In A. Way
and B. Gawronska (Eds.), Proceedings of the 11th International Conference on Theoretical Issues in Machine Translation (TMI 2007), Skövde, Sweden, pp. 231-240.
[pdf]
A pilot study for semantic role labeling in a Dutch corpus
Author(s): Gerwert Stevens, Paola Monachesi, and Antal van den Bosch
Reference: In P. Dirix, I. Schuurman, V. Vandeghinste, and F. Van Eynde (Eds.), Computational Linguistics in the Netherlands: Selected Papers from the Seventeenth CLIN Meeting, Leuven, Belgium, pp. 99-114.
[preprint pdf]
An efficient memory-based morpho-syntactic tagger and parser for Dutch
Author(s): Antal van den Bosch, Bertjan Busser, Sander Canisius, and Walter Daelemans
Reference: In P. Dirix, I. Schuurman, V. Vandeghinste, and F. Van Eynde (Eds.), Computational Linguistics in the Netherlands: Selected Papers from the Seventeenth CLIN Meeting, Leuven, Belgium, pp. 99-114.
[preprint pdf]
Dat gebeurd mei niet: Computationele modellen voor verwarbare homofonen
Author(s): Walter Daelemans and Antal van den Bosch
Reference: In: Dominiek Sandra, Rita Rymenans, Pol Cuvelier, & Peter van Petegem (Eds), Tussen taal, spelling en onderwijs: Essays bij het emeritaat van Frans Daems. Gent: Academia Press, pp. 199-210, 2007
[pdf]
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2006
Authoritative re-ranking in fusing authorship-based subcollection
search results
Author(s): Toine Bogers and Antal van den
Bosch
Reference: In F. de Jong and W. Kraaij (Eds.), Proceedings of the Sixth Belgian-Dutch Information Retrieval Workshop, DIR-2006, pp 49-55. Enschede: Neslia Paniculata.
[pdf]
Constraint satisfaction inference: Non-probabilistic global inference for sequence labelling
Spotting the 'odd-one-out': Data-driven error detection and correction
in textual databases
Correcting 'wrong-column' errors in text databases
Authoritative re-ranking of search results
Author(s): Toine Bogers and Antal van den
Bosch
Reference: In Proceedings of the 28th European
Conference on Information Retrieval (ECIR 2006), vol. 3936 of
Lecture Notes in Computer Science, pp. 519-522. Springer Verlag, April
2006.
[pdf]
Identifying named entities in text databases from the natural history domain
Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
Author(s): Antal van den Bosch, Ineke Schuurman, and Vincent Vandeghinste
Reference: In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC-06), Genoa, Italy, 2006.
[pdf]
Corpus-induced corpus cleanup
Author(s): Martin Reynaert
Reference: In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC-06), Genoa, Italy, 2006.
[pdf; slides (containing more results)]
Dependency parsing by inference over high-recall dependency predictions
Improved morpho-phonological sequence processing with constraint satisfaction inference
Author(s): Antal van den Bosch and Sander Canisius
Reference: In Proceedings of the Eighth Meeting of the ACL Special Interest Group in Computational Phonology, SIGPHON '06, June 2006, New York City, NY.
[pdf]
All-word prediction as the ultimate confusible disambiguation
Author(s): Antal van den Bosch
Reference: In Proceedings of the HLT-NAACL Workshop on Computationally hard problems and joint inference in speech and language processing, June 2006, New York City, NY.
[pdf]
Broad Coverage Paragraph Segmentation across Languages and Domains
Author(s): Caroline Sporleder and Mirella Lapata
Reference: ACM Transactions in Speech and Language Processing, 3:2, 1-35, July 2006.
[pdf]
A rule-based approach for process discovery: Dealing with noise and imbalance in process logs
Author(s): Laura Maruster, Ton Weijters, Wil van der Aalst, and
Antal van den Bosch
Reference: Data Mining and Knowledge Discovery,
13, pp. 67-87, 2006.
[preprint pdf]
Spelling space: A computational test bed for phonological and morphological changes in Dutch spelling
Bootstrapping multilingual geographical gazetteers from corpora
Author(s): Marieke van
Erp Reference: In J. Huitink & S. Katrenko (Eds.),
Proceedings of the 11th ESSLLI Student Session, Malaga, Spain,
31 July - 11 August 2006, pp. 192-202. [pdf]
Discrete versus probabilistic sequence classifiers for domain-specific entity chunking
Expertise classification: Collaborative classification vs. automatic extraction
Author(s): Toine Bogers, Willem Thoonen, and Antal van den
Bosch
Reference: In Proceedings of the 17th Annual ASIS&T SIG/CR workshop on Social Classification, Austin, TX.
[pdf]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2005
Memory-based language processing
Local classification and global estimation: Explorations of the k-nearest neighbor algorithm
Author(s): Iris Hendrickx
Reference: Ph.D. Thesis, Tilburg University, November 2005
[pdf]
Improving sequence segmentation learning by predicting trigrams
Author(s): Antal van den Bosch and Walter Daelemans
Reference: In Proceedings of the Ninth Conference on
Natural Language Learning, CoNLL-2005, June 29-30, 2005, Ann
Arbor, MI, pp. 80-87. [pdf]
Applying spelling error correction techniques for improving semantic role labelling
Memory-based morphological analysis generation and part-of-speech
tagging of Arabic
Author(s): Erwin Marsi, Antal van den Bosch, and Abdelhadi Soudi
Reference: In Proceedings of the ACL Workshop on
Computational Approaches to Semitic Languages, June 29, 2005, Ann
Arbor, MI. [pdf]
Robust ASR lattice representation types in pragma-semantic processing
of spoken input
Rule meta-learning for trigram-based sequence processing
Memory-based understanding of user utterances in a spoken dialogue system: Effects of feature selection and co-learning
Author(s): Antal van den Bosch
Reference: In Workshop Proceedings of the 6th International Conference on Case-Based Reasoning, Chicago, August 2005, pp. 85-94.
[pdf]
Designing an active learning based system for corpus annotation
Author(s): Bertjan Busser and Roser Morante
Reference: In Proceedings of the XXI Congresso de la
Sociedad Espanola para el Procesamiento del Lenguaje Natural,
SEPLN-2005, Granada, Spain, pp. 375-381.
[pdf]
Discourse chunking and its application to sentence compression
Author(s): Caroline Sporleder and Mirella Lapata
Reference: In Proceedings of the 2005 Human Language
Technology Conference and the Conference on Empirical Methods in
Natural Language Processing, HLT/EMNLP-05), Vancouver, Canada.
[pdf]
Exploiting linguistic cues to classify rhetorical relations
Author(s): Caroline Sporleder and Alex Lascarides
Reference: In Proceedings of Recent Advances in Natural
Language Processing (RANLP-05), pp. 532-539, Borovets, Bulgaria
[pdf]
Hybrid algorithms for instance-based classification
Author(s): Iris Hendrickx and Antal van den Bosch
Reference: In J. Gama, R. Camacho, P. Brazdil, A. Jorge,
and L. Torgo (Eds.), Machine Learning: ECML 2005: 16th European
Conference on Machine Learning, Porto, Portugal, October 3-7,
2005. Lecture Notes in Computer Science 3720. Berlin: Springer Verlag,
pp. 158-169.
[pdf]
Taxonómia felismerése dokumentumszerkezetbõl
Author(s): Piroska Lendvai
Reference: In: Proceedings of Computational Linguistics in Hungary Conference (Magyar Szamítógépes Nyelvészeti Konferencia, MSZNY-2005), Szeged, Hungary, 2005. pp. 88-95.
[pdf]
Conceptual taxonomy identification in medical documents
Author(s): Piroska Lendvai
Reference: In:
Proceedings of The Second International Workshop on Knowledge
Discovery and Ontologies (KDO-2005), held within ECML/PKDD, Porto,
Portugal, 2005. pp. 31-38.
[pdf]
Scalable classification-based word prediction and confusible correction
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2004
Using rule-induction techniques to model pronunciation variation in Dutch
Author(s): Veronique Hoste, Walter Daelemans and Steven Gillis
Reference: Computer Speech and Language 18:1, pp. 1-24.
[pdf]
Memory-based semantic role labeling: Optimizing features, algorithm, and output
Optionality in evaluating prosody prediction
Author(s): Erwin Marsi Reference: In Proceedings of
5th ISCA Speech Synthesis Research Workshop, Pittsburgh,
USA, 2004.
[pdf]
GAMBL, genetic algorithm optimization of memory-based WSD
Author(s): Bart Decadt, Veronique Hoste, Walter Daelemans, and
Antal van den Bosch
Reference: In: R. Mihalcea and P. Edmonds (eds.),
Proceedings of the Third International Workshop on the Evaluation of
Systems for the Semantic Analysis of Text (Senseval-3), Barcelona,
Spain, July 2004, pages 108-112.
[pdf]
Text induced spelling correction
Author(s): Martin Reynaert
Reference: In: Proceedings of the 20th International
Conference on Computational Linguistics (COLING 2004), August 2004,
Geneva, Switzerland.
[pdf]
Multilingual text induced spelling correction
Author(s): Martin Reynaert
Reference: In: Proceedings of the COLING 2004 Workshop on Multilingual Linguistic Resources, August 2004, Geneva, Switzerland.
[pdf]
Feature transformation through rule induction: a case study with the
k-NN classifier
Author(s): Antal van den Bosch
Reference: In J. Fürnkrantz (Ed.), Proceedings of
the ECML/PKDD 2004 Workshop on Advances in Inductive Rule
Learning, Pisa, Italy, September 2004, pp. 1-16.
[pdf]
Memory-based robust interpretation of recognised speech
Maximum-entropy parameter estimation for the k-NN modified
value-difference kernel
Author(s): Iris Hendrickx and Antal van den Bosch
Reference: In R. Verbrugge, N. Taatgen, and L. Schomaker
(Eds.), Proceedings of the 16th Belgian-Dutch Conference on
Artificial Intelligence, Groningen, The Netherlands, pp.
[pdf]
Wrapped progressive sampling search for optimizing learning algorithm
parameters
Author(s): Antal van den Bosch
Reference: In R. Verbrugge, N. Taatgen, and L. Schomaker
(Eds.), Proceedings of the 16th Belgian-Dutch Conference on
Artificial Intelligence, Groningen, The Netherlands, pp.
[pdf]
FINT: Find Images aNd Text
Author(s): Menno van Zaanen and Guido de Croon
Reference: Working Notes of the Workshop of the Cross-Language Evaluation Forum, Bath, UK
[pdf]
Learning compound boundaries for Afrikaans spelling checking
Author(s): Gerhard van Huyssteen and Menno van Zaanen
Reference: In Proceedings of the Workshop on International Proofing Tools and Language Technologies, Patras, Greece, July 2004.
[pdf]
A multilingual parallel parsed corpus as gold standard for grammatical inference evaluation
Author(s): Menno van Zaanen, Andrew Roberts, and Eric Atwell
Reference: In Proceedings of The Amazing Utility of Parallel and Comparable Corpora Workshop, held at the 4th International Conference on Language Resources and Evaluation, Lisbon, Portugal, May 2004.
[pdf]
Introduction to the special issue on grammar induction
Author(s): Pieter Adriaans, Henning Fernau, Colin de la Higuera, and Menno van Zaanen
Reference: Grammars: Journal of Mathematical Research on Formal and Natural Languages. Special issue on Grammar induction. 2004.
[pdf]
Automatic sentence simplification for subtitling in Dutch and English
Author(s): Walter Daelemans, Anja Höthker, and Erik Tjong Kim Sang
Reference: In: M.T. Lino e.a. (eds.), Proceedings of the 4th
International Conference on Language Resources and Evaluation, pages
1045-1048, 2004.
[pdf]
Verb classification: Machine learning experiments in classifying verbs into semantic classes
Author(s): Bart Decadt and Walter Daelemans
Reference: In L. Guthrie e.a. (eds.), Proceedings of the
LREC 2004 Workshop "Beyond Named Entity Recognition - Semantic
Labelling for NLP Tasks", pages 25-30.
[pdf]
Evaluation and adaptation of the Celex Dutch morphological database
Author(s): Tom Laureys, Guy De Pauw, Hugo Van Hamme, Walter Daelemans, and Dirk Van Compernolle
Reference: In: M.T. Lino e.a. (Eds.), Proceedings of the 4th
International Conference on Language Resources and Evaluation, pages
1247-1250, 2004.
[pdf]
Multimodal multilingual resources in the subtitling process
Author(s): Stelios Piperidis, Iason Demiros, Prokopis
Prokopidis, Peter Vanroose, Anja Höthker, Walter Daelemans, Elsa
Sklavounou, Manos Konstantinou, and Yannis Karavidas
Reference: In: M.T. Lino e.a. (Eds.), Proceedings of the 4th
International Language Resources and Evaluation Conference (LREC 2004),
Lisbon.
[pdf]
Unsupervised text mining for ontology extraction: An evaluation of statistical measures
Author(s): Marie-Laure Reinberger and Walter Daelemans
Reference: In M.T. Lino e.a. (Eds.), Proceeding of the 4th
International Language Resources and Evaluation Conference (LREC 2004),
May 2004, Lisbon, pp. 491-494.
[pdf]
Using a parallel transcript/subtitle corpus for sentence compression
Author(s): Vincent Vandeghinste and Erik Tjong Kim Sang
Reference: In: M.T. Lino e.a. (eds.), Proceedings of the 4th
International Conference on Language Resources and Evaluation, pages
231-234, 2004.
[pdf]
A memory-based shallow parser for spoken Dutch
Author(s): Sander Canisius and Antal van den Bosch
Reference: In B. Decadt, G. De Pauw, and V. Hoste (Eds.),
Selected papers from the Thirteenth Computational Linguistics in
the Netherlands Meeting, Antwerp, Belgium, pp. 31-45.
[pdf]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2003
Learning PP attachment for filtering prosodic phrasing
Learning to identify fragmented words in spoken discourse
Author(s): Piroska Lendvai Reference: In:
Proceedings of EACL-03 Student Research Workshop, Budapest,
2003, pp. 25-32.
[pdf]
Machine learning for shallow interpretation of user utterances in
spoken dialogue systems
Author(s): Piroska Lendvai, Antal van den
Bosch and Emiel Krahmer
Reference: In: Proceedings of the EACL-03 Workshop on
Dialogue Systems: Interaction, Adaptation and Styles of
Management. Budapest, 2003. pages 69-78.
[pdf] - note: corrected version!
Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition
Author(s): Erik F. Tjong Kim Sang and Fien De Meulder
Reference: In: Proceedings of CoNLL-2003, Edmonton, Canada,
2003, pp. 142-147.
[pdf]
Memory-based one-step named-entity recognition: Effects of seed list features, classifier stacking, and unannotated data
Author(s): Iris Hendrickx and Antal van den Bosch
Reference: In: Proceedings of CoNLL-2003, the Seventh Conference on Natural Language Learning, Edmonton, Canada,
2003, pp. 176-179.
[pdf]
Memory-based named entity recognition using unannotated data
Author(s): Fien De Meulder and Walter Daelemans
Reference: In: Proceedings of CoNLL-2003, the Seventh
Conference on Natural Language Learning, Edmonton, Canada, 2003, pp. 208-211.
[pdf]
Learning to predict pitch accents and prosodic boundaries in Dutch
Process discovery for evaluating dialogue strategies
Author(s): Piroska Lendvai and Laura Maruster
Reference: In: Proceedings of the ISCA Workshop on Error
Handling in Spoken Dialogue Systems. Chateau d'Oex-Vaud, Switzerland,
2003. pages 119-122.
[pdf]
Memory-based disfluency chunking
A machine learning approach to understand business processes
Author(s): Laura Maruster
Reference: Ph.D. Thesis, Eindhoven Technical University.
Promotors: Prof. dr. ir. J.C. Wortmann, prof. dr. W.M.P. Daelemans.
August 27, 2003.
Feature-rich memory-based classification for shallow NLP and
information extraction
Author(s): Jakub Zavrel and Walter Daelemans
Reference: In Jurgen Franke, Gholamreza Nakhaeizadeh and
Ingrid Renz (Eds.), Text Mining,
Theoretical Aspects and Applications, Springer Physica-Verlag, pp. 33-54
[pdf]
Combined optimization of feature selection and algorithm parameter
interaction in machine learning of language
Author(s): Walter Daelemans, Véronique Hoste, Fien De
Meulder and Bart Naudts
Reference: Proceedings of the 14th European Conference on
Machine Learning (ECML-2003), Lecture Notes in Computer Science 2837,
Springer-Verlag, Cavtat-Dubrovnik, Croatia, pp. 84-95
[pdf]
Information extraction via double classification
Author(s): An De Sitter and Walter Daelemans
Reference: In: Proceedings of the International Workshop
on Adaptive Text Extraction and Mining. Catvat-Dubrovnik, Croatia,
66-73, September 2003. [Also: Department of Mathematics and Computer
Science, University of Antwerp, Technical Report 2003-06.]
[pdf]
Is shallow parsing useful for the unsupervised learning of semantic
clusters?
Author(s): Marie-Laure Reinberger and Walter Daelemans
Reference: In: Proceedings of the 4th Conference on
Intelligent Text Processing and Computational Linguistics (CICLing
2003), Mexico City, Mexico, Lecture Notes in Computer Science 2588, Springer Verlag, 2003,
pp. 304-313.
[pdf]
Mining for lexons: applying unsupervised learning methods to create ontology bases
Author(s): Marie-Laure Reinberger, Peter Spyns, Walter Daelemans, and Robert Meersman
Reference: In: Robert Meersman, Zahir Tari, and
Douglas Schmidt (Eds.) On the Move to Meaningful Internet Systems
2003: CoopIS, DOA, and ODBASE, Lecture Notes in Computer Science 2888,
Springer-Verlag, Catania, Italy, 803-819.
[pdf]
Workflow mining: A survey of issues and approaches
Author(s): W.M.P. van der Aalst, B.F. van Dongen, J. Herbst, L. Maruster, G. Schimm, and A.J.M.M. Weijters
Reference: Data and Knowledge Engineering, 47:2, pp. 237-267.
Various uses of a spelling checker project: Practical experiences, teaching, and learning
Author(s): Menno van Zaanen and Gerhard van Huyssteen
Reference: Southern African Linguistics and Applied Language
Studies Journal, special issue on "Language Technology in Southern
Africa: resources and applications"
[pdf]
Alignment-based learning versus data-oriented parsing
Author(s): Menno van Zaanen
Reference: In Rens Bod, Remko Scha, and Khalil Sima'an (Eds.), Data-oriented parsing, Chapter 20.
[pdf]
A memory-based approach to meter induction
Author(s): Menno van Zaanen, Rens Bod, and Henkjan Honing
Reference: In Proceedings of the 5th Triennial Conference of the European Society for the Cognitive Sciences of Music (ESCOM), Hanover, Germany, September 2003.
[pdf]
A spellchecker for Afrikaans, based on morphological analysis
Author(s): Gerhard van Huyssteen and Menno van Zaanen
Reference: In Proceedings of the 6th International Terminology in Advanced Management Applications Conference (TAMA), Pretoria, South-Africa, February 2003.
[pdf]
Proceedings of the Seventh Conference on Natural Language Learning
Author(s): Walter Daelemans and Miles Osborne (Eds.)
Reference: Proceedings of the Seventh Conference on Natural Language Learning. Edmonton, Canada, ACL, vii + 212 pages, ISBN 1-932432-08-6.
[online proceedings]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2002
Memory-based grammatical relation finding
Shallow parsing on the basis of words only: A case study
Author(s): Antal van den Bosch and Sabine Buchholz.
Reference: In Proceedings of the 40th
Meeting of the Association for Computational Linguistics (ACL'02).
corrects the proceedings version on three points: (1) two words in the
abstract, (2) the title of section 4, and (3) a corrected reference to
Eisner 1996) [pdf]
Dutch word sense disambiguation: Optimizing the localness of context
Evaluating the results of a memory-based word-expert approach to
unrestricted word sense disambiguation
Improving machine-learned detection of miscommunications in human-machine dialogues through informed data splitting
Multi-feature error detection in spoken dialogue systems
Introduction to the CoNLL-2002 shared task: Language-independent named entity recognition
Memory-based named entity recognition
Logistic-based patient grouping for multi-disciplinary treatment
Process Mining: Discovering Direct Successors in Process Logs
Author(s): Laura Maruster, Ton Weijters, Wil van der Aalst, and
Antal van den Bosch Reference: In S. Lange, K. Satoh,
C.H. Smith (Eds.): Discovery Science, 5th International
Conference. Lecture Notes in Computer Science 2534. Berlin:
Springer, pp. 364-373, 2002.
[ps]
Parameter optimization for machine learning of word sense disambiguation
Combining information sources for memory-based pitch accent placement
Expanding k-NN analogy with instance families
Author(s): Antal van den Bosch
Reference: In R. Skousen, D. Lonsdale, and D. Parkinson (Eds.),
Analogical Modeling: An exemplar-based approach to
language. Amsterdam: John Benjamins, pp. 209-224.
[pdf of preprint
]
A comparison of analogical modeling of language to memory-based language processing
Author(s): Walter Daelemans
Reference: In R. Skousen, D. Lonsdale, and D. Parkinson (Eds.),
Analogical Modeling: An exemplar-based approach to
language. Amsterdam: John Benjamins, pp. 157-179.
[pdf of preprint]
Evaluation of machine learning methods for natural language processing tasks
Author(s): Walter Daelemans and Véronique Hoste
Reference: In Proceedings of
LREC-2002, the third International Language Resources and Evaluation
Conference, Las Palmas, Spain, 755-760
[pdf]
A field survey for establishing priorities in the development of HLT
resources for Dutch
Author(s): Binnenpoorte, D., F. De Vriend, J. Sturm, W. Daelemans,
H. Strik, C. Cucchiarini
Reference: In Proceedings of
LREC-2002, the third International Language Resources and Evaluation
Conference, Las Palmas, Spain, 1862-1866
[pdf]
Review of: Knowledge and learning in language, Charles D. Yang
Author(s): Walter Daelemans
Reference: Glot International, 6:5, 137-142
[pdf of preprint]
Memory-based phoneme-to-grapheme conversion
Author(s): Bart Decadt, Jacques Duchateau, Walter Daelemans, and Patrick
Wambacq
Reference: In M. Theune, A. Nijholt, and H. Hondrop (Eds.), Computational Linguistics in the
Netherlands 2001. Selected Papers from the Twelfth CLIN Meeting,
Amsterdam - New York: Rodopi, pp. 47-61
[ps]
Transcription of out-of-vocabulary words in large vocabulary
speech recognition based on phoneme-to-grapheme conversion
Author(s): Bart Decadt, Jacques Duchateau, Walter Daelemans, and Patrick
Wambacq
Reference: In Proceedings of ICASSP-02, the International Conference on Acoustics,
Speech and Signal Processing, Volume I, Orlando, USA, pp. 861-864
[ps]
A named entity recognition system for Dutch
Author(s): Fien De Meulder,Walter Daelemans, and Véronique Hoste
Reference: In: M. Theune,
A. Nijholt, and H. Hondrop (Eds.), Computational Linguistics in the
Netherlands 2001. Selected Papers from the Twelfth CLIN Meeting,
Amsterdam - New York: Rodopi, 77-88
[ps]
Special issue on machine learning approaches to shallow parsing
Author(s): James Hammerton, Miles Osborne, Susan Armstrong, and Walter Daelemans (Eds.)
Reference: Journal of Machine Learning Research, 2, pp. 551-719
[webpage]
Introduction to the special issue on machine learning approaches to shallow parsing
Author(s): James Hammerton, Miles Osborne, Susan Armstrong, and Walter Daelemans (Eds.)
Reference: Journal of Machine Learning Research, 2, pp. 551-558
[ps]
Where do syllables come from?
Author(s): E. Martens, W. Daelemans, S. Gillis, and H. Taelman
Reference: In: W. Gray and C. Schunn (Eds.) Proceedings of
the Twenty-Fourth Annual Conference of the Cognitive Science Society,
Fairfax, Virginia, George Mason University, pp. 657-664
[pdf]
Dutch HLT resources: From BLARK to priority lists
Author(s): H. Strik, W. Daelemans, D. Binnenpoorte, J. Sturm, F. De
Vriend, C. Cucchiarini
Reference: In: Proceedings of ICSLP-2002, Denver, USA, pp. 1549-1552
[pdf]
Actieplan voor het Nederlands in de taal- en spraaktechnologie: Prioriteiten voor basisvoorzieningen
Author(s): Walter Daelemans and Helmer Strik
Reference: Report for the Nederlandse Taalunie, 165 pages,
2002. (Action Plan for Dutch in Language and Speech technology:
Priorities for Basic Resources).
[webpage]
Proceedings of the Sixth Conference on Natural Language Learning (CoNLL-2002)
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2001
Detecting problematic turns in human-machine interactions: Rule-induction versus
memory-based learning approaches
Author(s): Antal van den Bosch, Emiel Krahmer, and Marc Swerts.
Reference: In Proceedings of the 39th Meeting of the
Association for Computational Linguistics (ACL'01). New Brunswick, NJ:
ACL, pp. 499-506, 2001.
[pdf]
Improving accuracy in word class tagging through combination of
machine learning systems
Review of "Learnability in Optimality Theory"
Author(s): Walter Daelemans.
Reference: Computational Linguistics, 27:2, pp. 316-317, 2001
[ps]
SHAPAQA: Shallow parsing for question answering on the World Wide Web
Author(s): Sabine Buchholz, Walter Daelemans.
Reference: In Proceedings of Euroconference Recent Advances
in Natural Language Processing (RANLP), Tsigov Chark, Bulgaria,
5-7 September, pp. 47-51, 2001.
[pdf]
Complex answers: A case study using a WWW question answering system
Using grammatical relations, answer frequencies and the World Wide Web for TREC question answering
Author(s): Sabine Buchholz.
Reference: Notebook paper for TREC 2001. NIST, 2001. Official proceedings paper to appear later.
[pdf]
TreeTalk: Memory-based word phonemisation
Combining a self-organising map with memory-based learning
Memory-based clause identification
Transforming a chunker to a parser
Introduction to the CoNLL-2001 shared task: Clause identification
Automatic discovery of workflow models from hospital data
Robust data oriented parsing for speech-understanding
Author(s): Khalil Sima'an
Reference: In Proceedings of the International
Workshop on Parsing Technologies (IWPT'01), Beijing, China, October 2001.
[ps]
Building a tree-bank of modern Hebrew text
Author(s): Khalil Sima'an, A. Itai, Y. Winter, A. Altman, and N. Nativ
Reference: In Beatrice Daille and Laurent Romary (eds.),
Special Issue on Natural Language Processing
and Corpus Linguistics, Journal Traitement Automatique des Langues
(t.a.l.), 2001
[ps]
Enhancing the robustness of data oriented parsing for speech-understanding
Author(s): Khalil Sima'an
Reference: In Proceedings of the Natural Language Processing
Pacific Rim Symposium (NLPRS'01), Tokyo, Japan, November 2001.
[ps]
Predicting phrase breaks with memory-based learning
Author(s): Bertjan Busser, Walter Daelemans, Antal van den
Bosch Reference: In Proceedings 4th ISCA Tutorial and
Research Workshop on Speech Synthesis. Perthshire Scotland, August
29th - September 1st, 2001.
[ps]
Dutch word sense disambiguation: Data and preliminary results
Author(s): Iris Hendrickx, Antal van den
Bosch Reference: In Proceedings of SENSEVAL-2. Toulouse, France.
[pdf]
Phoneme-to-grapheme conversion for out-of-vocabulary words in large
vocabulary speech recognition
Author(s): Bart Decadt, Jacques Duchateau, Walter Daelemans, Patrick Wambacq
Reference: In: ASRU 2001 Automatic Speech
Recognition and Understanding Workshop Proceedings. IEEE CD-rom
(IEEE Catalog Number 01EX544, ISBN 0-7803-7343-X), Trento, Italy, 9-13
December, 4 pages
[ps]
Phoneme-to-grapheme conversion for out-of-vocabulary words in speech recognition
Author(s): Bart Decadt and Walter Daelemans
Reference: ATRANOS Deliverable
WP2, March 31, 15 pages
[ps]
Optimizing phoneme-to-grapheme conversion for out-of-vocabulary words in speech
recognition
Author(s): Bart Decadt and Walter Daelemans
Reference: ATRANOS Deliverable
WP2, September 30, 15 pages
[ps]
Classifier optimization and combination in the English all words task
Author(s): Véronique Hoste, Anne Kool, and Walter Daelemans
Reference: In:
Judita Preiss and David Yarowsky (Eds.), Proceedings of
SENSEVAL-2. Second International Workshop on Evaluating Word Sense
Disambiguation Systems. New Brunswick: ACL, pp. 83-86
[ps]
Strengthening the Dutch Human Language Technology Infrastructure
Author(s): Catia Cucchiarini, Walter Daelemans, Helmer Strik
Reference: The ELRA Newsletter, 6:4, pp. 3-7
[ps]
Proceedings CLIN-2000
Editors: Walter Daelemans, Khalil Sima'an, Jorn Veenstra, Jakub Zavrel
Reference: Computational Linguistics in the Netherlands 2000.
Selected Papers from the Eleventh CLIN Meeting. Amsterdam - New York:
Rodopi. ISBN 90-420-1257-9. 2001.
Proceedings of the Fifth Conference on Natural Language Learning
(CoNLL-2001)
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2000
Integrating seed names and n-grams for a named entity list and
classifier
Learning statistically neutral tasks without expert guidance
Unpacking multi-valued
symbolic features and classes in memory-based language learning
Author(s): Antal van den Bosch and Jakub Zavrel
Referemce: In P. Langley (Ed.), Proceedings of the
Seventeenth International Conference on Machine Learning, pp.
1055-1062. San Francisco, CA: Morgan Kaufmann, 2000.
[abs, ps]
Memory-Based Word Sense Disambiguation
Author(s): Jorn Veenstra, Antal van den Bosch, Sabine Buchholz, Walter Daelemans, Jakub Zavrel
Reference: Computers and the Humanities, special issue on Senseval, Word Sense Disambiguations, Ed. Adam Kilgarriff and Martha Palmer,
34:1-2, 2000
[abs,
ps of preprint]
A distributed, yet
symbolic model of text-to-speech processing
Author(s): Antal van den Bosch and Walter Daelemans
Reference: In P. Broeder and
J.M.J. Murre (Eds.), Models of Language Acquisition: inductive and
deductive approaches. Oxford University Press, 76-99, 2000.
[abs, ps]
Lazy learning: A comparison of natural and machine learning of
stress
Author(s): Steven Gillis, Walter Daelemans, and
Gert Durieux.
Reference: In: P. Broeder and J.M.J. Murre
(Eds.), Models of Language Acquisition: inductive and deductive
approaches . Oxford University Press, 76-99, 2000.
[ps]
Inductive lexica
Author(s): Walter Daelemans and Gert Durieux.
Reference: In: Van Eynde, F. and D. Gibbon
(eds.) Lexicon Development for speech and language
processing, Kluwer Academic Publishers, 115-139, 2000.
[ ps]
(preprint)
Bootstrapping a tagged corpus through combination of existing
heterogeneous taggers
Author(s): Jakub Zavrel and Walter Daelemans
Reference: In: Proceedings of the second
international conference on language resources and evaluation
(LREC-2000) , Athens, Greece, 17-20, 2000
[ps]
Part of speech tagging and lemmatisation for the Spoken Dutch
Corpus
Author(s): Frank Van Eynde, Jakub Zavrel, and
Walter Daelemans
Reference: In: Proceedings of the second
international conference on language resources and evaluation
(LREC-2000) , Athens, Greece, 1427-1434, 2000
[ps]
Lemmatisation and morphosyntactic annotation for the spoken dutch
corpus
Author(s): Frank Van Eynde, Jakub Zavrel, and
Walter Daelemans
Reference: In: Paola Monachesi (Ed.), Computational Linguistics in the Netherlands
1999. Utrecht, OTS, pp. 73-84
[ps]
Diverse classifiers for NLP disambiguation tasks. Comparisons,
optimization, combination, and evolution
Author(s): Jakub Zavrel, Sven Degroeve, Anne Kool, Walter Daelemans, Kristina Jokinen
Reference: In: Jokinen et al. (Eds.),
TWLT 18. Learning to Behave. CEvoLE 2, Ieper, Belgium, pp. 201-221
[ps]
Meta-learning for phonemic annotation of corpora
Author(s): Veronique Hoste, Walter Daelemans, Erik Tjong Kim Sang,
Steven Gillis
Reference: In: Proceedings 17th
international conference on Machine Learning , Stanford, 375-382,
2000
[ps]
A Memory-Based Alternative for Connectionist Shift-Reduce Parsing
Using induced rules as complex features in memory-based language learning
Author(s): Antal van den Bosch
Reference: In Proceedings of the Fourth Conference on
Computational Natural Language Learning and of the Second Learning
Language in Logic Workshop (CoNLL/LLL). New Brunswick, NJ: ACL,
pp.73-78.
[ps]
Tree-gram parsing: Lexical dependencies and structural relations
Author(s): Khalil Sima'an
Reference: In Proceedings of the 38th Meeting of the Association for Computational Linguistics (ACL'00). New Brunswick, NJ: ACL, pp. 53-60.
[ps]
Efficient parsing of domain language
Author(s): Khalil Sima'an
Reference: In Proceedings of the Twelfth Belgium-Netherlands Conference on Artificial Intelligence (BNAIC'00). Tilburg: ILK / Infolab, pp. 199-206.
[ps]
Machine learning for modeling Dutch pronunciation variation
Author(s): Véronique Hoste, Steven Gillis, and Walter Daelemans
Reference: In: Paola
Monachesi (Ed.), Computational Linguistics in the Netherlands
1999. Selected Papers from the Tenth CLIN Meeting, pp. 73-83
[ps]
Comparing bagging
and boosting for natural language processing tasks: a typicality
approach
Author(s): Véronique Hoste and Walter Daelemans
Reference: In: Ad Feelders (Ed.), Proceedings of the Tenth
Belgian-Dutch Conference on Machine Learning (Benelearn
2000), pp. 101-108
[ps]
Meta-learning for phonemic annotation of corpora
Author(s): Véronique Hoste, Walter Daelemans, Erik Tjong Kim Sang, and Steven Gillis
Reference: In: Proceedings of ICML-2000, Stanford University,
CA, USA, pp. 375-382
[ps]
A rule induction approach to modeling regional pronunciation variation
Author(s): Véronique Hoste, Walter Daelemans, and Steven Gillis
Reference: In: Proceedings of COLING 2000, Saarbrücken, Germany. San Francisco: Morgan Kaufman Publishers, 2000, pp. 327-333
[ps]
Genetic algorithms for feature relevance assignment in memory-based language
processing
Author(s): Anne Kool, Walter Daelemans, and Jakub Zavrel
Reference: In: Proceedings of CoNLL-2000.
[ps]
Simultaneous feature
selection and parameter optimization for memory-based natural language
processing.
Author(s): Anne Kool, Jakub Zavrel, and Walter Daelemans
Reference: In: Ad Feelders (Ed.), Proceedings of BENELEARN 2000, Tilburg, pp. 93-100, 2000
[ps]
Applying System Combination to Base Noun Phrase Identification
Author(s): Erik Tjong Kim Sang, Walter Daelemans, Hervé
Déjean, Rob Koeling, Yuval Krymolowski, Vasin Punyakanok and
Dan Roth
Reference: In: Proceedings of COLING 2000, Saarbrücken, Germany.
[ps]
Hij drinkt niet altijd "t" en ik drink er soms wél: Bronnen
van hardnekkige werkwoordfouten in het Nederlands
Author(s): D. Sandra, S. Frisson, G. Durieux, W. Daelemans, and
S. Gillis
Reference: In: S. Gillis, J. Nuyts, and J. Taeldeman (Eds.), Met taal om de tuin
geleid. Wilrijk: Universitaire Instelling Antwerpen
[ps]
Zelflerende systemen als instrument voor de taalkunde en de taaltechnologie
Author(s): Walter Daelemans, Guy De Pauw, Gert Durieux, Steven Gillis, Véronique Hoste, and Erik Tjong Kim Sang
Reference: In: S. Gillis, J. Nuyts, and
J. Taeldeman (Eds.), Met taal om de tuin geleid. Wilrijk:
Universitaire Instelling Antwerpen
[ps]
The role of algorithm bias
vs information source in learning algorithms for morphosyntactic
disambiguation
Author(s): Guy De Pauw and Walter Daelemans
Reference: In: Proceedings of the Fourth Conference on Computational
Language Learning (CoNLL-2000), Lissabon, Portugal, pp. 19-24
[ps]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1999
Forgetting exceptions is harmful in language learning
Recent Advances in Memory-Based Part-of-Speech Tagging
Interpreting knowledge representations in BP-SOM
Memory-Based Text Chunking
Author(s): Jorn Veenstra.
Reference: to appear in: Proceedings of ACAI, Chania Greece, 1999.
[ps]
Toward an exemplar-based computational model for cognitive grammar
Author(s): Walter Daelemans.
Reference: In Johan van der Auwera, Frank Durieux, and Ludo
Lejeune (Eds.) English as a Human Language. To honour Louis
Goossens. Munchen: LINCOM Europa, 73-82, 1998.
[abs, ps]
Memory-Based Shallow Parsing
Cascaded Grammatical Relation Assignment
Memory-based morphological analysis
Author(s): Antal van den Bosch and Walter Daelemans.
Reference: In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, ACL'99, University of Maryland, USA, June 20-26, 1999, pp. 285-292.
[abs, pdf]
Instance-family abstraction in memory-based language learning
Author(s): Antal van den Bosch.
Reference: In: I. Bratko and S. Dzeroski (Eds.), Machine Learning: Proceedings of the Sixteenth International Conference, ICML'99, Bled, Slovenia, June 27-30, 1999, pp. 39-48.
[abs, ps]
Machine learning of word pronunciation: the case against
abstraction
Memory-based language processing
Careful abstraction from instance families in memory-based language
learning
Machine Learning Approaches
Author(s): Walter Daelemans.
Reference: In Hans van Halteren (Ed.)
Syntactic Wordclass Tagging.
Kluwer Academic Publishers, 285-304, 1999.
[ps]
On the Arbitrariness of Lexical Categories
Author(s): Gert Durieux, Walter Daelemans and Steven Gillis
Reference: Van Eynde et al. (eds.) Computational Linguistics in the Netherlands 1998
Amsterdam: Rodopi, pp. 19-36, 1999.
[ps]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1998
Modularity in inductively-learned word pronunciation systems
Do not forget: Full memory in memory-based learning of word pronunciation
Rapid development of NLP modules with memory-based learning
Author(s): Walter Daelemans, Antal van den Bosch, Jakub Zavrel,
Jorn Veenstra, Sabine Buchholz, and Bertjan Busser.
Reference: In Proceedings of ELSNET in Wonderland, pp.
105-113. Utrecht: ELSNET, 1998. Also in R. Basili and M.T. Pazienza (Eds.),
ECML-98 TANLPS Workshop Notes, Technische Universitaet Chemnitz,
1998, pp. 1-17.
[abs,ps]
Interpretable neural networks with BP-SOM
Toward inductive lexicons: a case study
Author(s): Walter Daelemans, Gert Durieux, and Antal van den Bosch.
Reference: In: P. Velardi (ed.), Proceedings LREC Workshop
on Adapting Lexical and Corpus Resources to Sublanguages and Applications,
Granada, Spain, pp. 29-35, 1998.
[abs,ps]
A connectionist model for bootstrap learning of syllabic structure
Author(s): Jean Vroomen, Antal van den Bosch, and Beatrice De Gelder.
Reference: Language and Cognitive Processes, Special
issue on Language Acquisition and Connectionionism (Ed. K. Plunkett), 13:2/3,
pp. 193-220.
Fast NP chunking using memory-based learning techniques
Author(s) Jorn Veenstra.
Reference: In
F. Verdenius and W. van den Broek (Eds), Proceedings of Benelearn
1998, Wageningen, the Netherlands, pp. 71-79, 1998.
[abs,ps] .
Improving data driven wordclass tagging by system combination
Distinguishing complements from adjuncts using memory-based learning
Author(s): Sabine Buchholz.
Reference: In B. Keller (Ed.), Proceedings of the
ESSLLI-98 Workshop on Automated Acquisition of Syntax and Parsing,
pp.41-48.
[abs,ps]
TreeTalk-D: a machine learning approach to Dutch word
pronunciation
Author(s): Bertjan Busser.
Reference: In P. Sojka, V. Matousek, K. Pala, and I. Kopecek
(Eds.) (1998) Proceedings TSD Conference, pp. 3-8, Masaryk University,
Czech Republic.
[abs,ps,demo]
Unsupervised learning of subcategorisation information and its
application in a parsing subtask
Author(s): Sabine Buchholz.
Reference: In H. La Poutre and H.J. van den Herik (Eds.) (1998),
Proceedings of the Tenth Netherlands/Belgium Conference on Artificial
Intelligence (NAIC'98), CWI, Amsterdam, pp. 7-16.
[abs,ps]
A connectionist model for bootstrap learning of syllabic structure
Author(s): Jean Vroomen, Antal van den Bosch, and Beatrice De Gelder.
Reference: Language and Cognitive Processes, Special
issue on Language Acquisition and Connectionionism (Ed. K. Plunkett), 13:2/3,
pp. 193-220.
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1997
IGTree: Using Trees for Compression and Classification in Lazy Learning
Algorithms
Memory-Based Learning: Using Similarity for Smoothing
Resolving PP Attachment Ambiguities with Memory-Based Learning
Author(s): Jakub Zavrel, Walter Daelemans and Jorn Veenstra
Reference: Proc. of the workshop on Computational Natural Language
Learning (CoNLL'97), edited by: Mark Ellison, Madrid, 11 July 1997
[abs,
ps]
TreeTalk-D: a Machine Learning Approach to Dutch Grapheme-to-Phoneme Conversion
Workshop Notes on Empirical Learning of Natural Language Processing Tasks.
Author(s): Walter Daelemans, Ton Weijters, and Antal van den
Bosch (eds.)
Reference: In M. van Someren and G. Widmer
(eds.) Machine Learning: ECML-97, Lecture Notes in Artificial
Intelligence 1224, Berlin: Springer, 337-344, 1997.
[electronic workshop notes]
Empirical Learning of Natural Language Processing Tasks.
Author(s): Walter Daelemans, Antal van den Bosch, and Ton Weijters.
Reference: M. van Someren and G. Widmer (eds.) Machine Learning:
ECML-97, Lecture Notes in Artificial Intelligence 1224, Berlin: Springer,
337-344, 1997.
[abs,
ps]
Data Mining as a Method for Linguistic Analysis: Dutch Diminutives
Author(s): Walter Daelemans, Peter Berck, and Steven Gillis.
Reference: Folia Linguistica , XXXI/1-2, 57-75, 1997.
[abs,
ps]
A feature-relevance heuristic for indexing and compressing large
case bases
Author(s): Walter Daelemans, Antal van den Bosch, and Jakub Zavrel.
Reference: M. van Someren and G. Widmer (eds.) 9th European
Conference on Machine Learning - Poster Papers. Prague: Laboratory of
Intelligent Systems, 29-38, 1997.
[abs,
ps]
Skousen's Analogical Modeling algorithm: A comparison with Lazy Learning
Author(s): Walter Daelemans, Steven Gillis, and Gert Durieux
Reference: D. Jones and H. Somers (eds.) New Methods in Language
Processing., London:
University College Press, 3-15, 1997.
[abs,
ps]
Learning to pronounce written words. A study in inductive language learning
Author(s): Antal van den Bosch.
Reference: Ph.D. Thesis, Universiteit Maastricht, The Netherlands.
Cadier en Keer: Phidippides, 1997.
[ps]
Automatic phonetic transcription of words based on sparse data
Author(s): Maria Wolters and Antal van den Bosch.
Reference: In Walter Daelemans, Antal van den Bosch, and Ton Weijters (Eds.), Workshop notes of ECML/MLnet Familiarization Workshop on
Empirical Learning of Natural Language Processing Tasks, April 1997,
Prague, Czech Republic, pp. 61-70, 1997.
[abs] [ps]
When small disjuncts abound, try lazy learning: A case study
Intelligible neural networks with BP-SOM
Avoiding overfitting with BP-SOM
Behavioural aspects of BP-SOM
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1996
Language-Independent Data-Oriented Grapheme-to-Phoneme Conversion
Abstraction Considered Harmful: Lazy Learning of Language Processing.
Author(s): Walter Daelemans
Reference: van den Herik, J. and T. Weijters (eds.) Benelearn-96.
Proceedings of the 6th Belgian-Dutch Conference on Machine Learning.
MATRIKS: Maastricht, The Netherlands, 3-12, 1996.
-
[ps]
Morphological Analysis as Classification: an Inductive-Learning Approach.
Unsupervised Discovery of Phonological Categories through Supervised Learning
of Morphological Rules
Author(s): Walter Daelemans, Peter Berck and Steven Gillis.
Reference: Proceedings of the 16th International Conference
on Computational Linguistics (COLING-96), Copenhagen, Denmark, 95-100,
1996.
-
[abs,
ps]
Artificial Intelligence Models of Language Processing
MBT: A Memory-Based Part of Speech Tagger-Generator
Author(s): Walter Daelemans, Jakub Zavrel, Peter Berck and Steven
Gillis.
Reference: E. Ejerhed and I. Dagan (eds.) Proceedings
of the Fourth Workshop on Very Large Corpora, Copenhagen, Denmark,
14-27, 1996.
-
[abs,
ps]
Stretching the limits of learning without modules
Author(s): Antal van den Bosch and Ton Weijters
Reference: In M. van der Heyden, J. Mrsic-Flögel
and K. Weigl (Eds.), Proceedings of the HELNET International
Workshop on Neural Networks, Vol. I/II, Amsterdam: VU University
Press, pp. 177-185.
An inductive-learning approach to morphological analysis
Author(s): Antal van den Bosch, Walter Daelemans, and Ton
Weijters Reference: In Durieux, G., Daelemans, W., and
Gillis, S. (Eds.), Papers from the Sixth Computational Linguistics
in the Netherlands Meeting, December 1996, University of Antwerp,
Belgium, pp. 213-230.
Avoiding overfitting in BP-SOM
Author(s): Ton Weijters, Antal van den Bosch, Eric Postma, and Jaap van den Herik
Reference: Avoiding overfitting in BP-SOM. In H.J. van den
Herik and A. Weij\-ters (Eds.), Proceedings of BENELEARN-96,
Maastricht, pp. 157-166.
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1995
A computational model of P&P: Dresher and Kaye (1990) revisited.
Author(s): Steven Gillis, Gert Durieux, Walter Daelemans.
Reference: M. Verrips & F. Wijnen (eds.) Approaches to
Parameter Setting. Amsterdam Studies in Child Language Development,
vol 4, 135-173, 1995.
-
[abs,
ps]
The profit of learning exceptions.
Scaling effects with greedy and lazy machine-learning
algorithms
Author(s): Antal van den Bosch, Ton Weijters, and Jaap van den Herik
Reference: In Proceedings of the Seventh Dutch Conference on
AI, NAIC-95, Erasmus University, Rotterdam, pp. 211-218.
Connectionism
Author(s): Ton Weijters and Antal van den Bosch
Reference: In Verschueren, J., Östman, J.O., and
Blommaert, J. (Eds.), Handbook of Pragmatics: Learning Manual,
pp. 165-171. Amsterdam: Benjamins.
Linguistics as data mining: Dutch diminutives
Memory-based lexical acquisition and processing.
Author(s): Walter Daelemans
Reference: P. Steffens (ed.) Machine Translation and the
Lexicon, Springer Lecture Notes in Artificial Intelligence 898, 85-98,
1995.
-
[abs,
ps]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1994
Measuring the Complexity of Writing Systems
Default inheritance in an object-oriented representation of linguistic
Categories.
Author(s): Walter Daelemans and Koen De Smedt
Reference: International Journal Human-Computer Studies
41, 149-177, 1994.
A language-independent, data-oriented architecture for grapheme-to-phoneme
conversion.
Are children 'lazy learners'? A comparison of natural and machine learning
of Stress.
Author(s): Steven Gillis, Walter Daelemans and Gert Durieux
Reference: Ram, A. and Eiselt, K. (eds.) Proceedings of the
Sixteenth Annual Conference of the Cognitive Science Society, Georgia
Institute of Technology, Atlanta, USA, Hillsdale: Lawrence Erlbaum Associates,
369-374, 1994.
-
[abs,
ps]
Skousen's Analogical modeling algorithm: a comparison with lazy learning
Author(s): Walter Daelemans, Steven Gillis and Gert Durieux.
Reference: Jones, D. (ed.) Proceedings of the International
Conference on New Methods in Language Processing (NeMLaP), UMIST: Manchester, 1-7, 1994.
-
[abs,
ps]
The acquisition of stress: a data-oriented approach.
Author(s): Walter Daelemans, Steven Gillis and Gert Durieux.
Reference: Computational Linguistics 20 (3), special
issue on Computational Phonology (Steven Bird guest ed.), 421-451, 1994.
-
[abs]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1993
Learnability and markedness: Dutch stress assignment
Author(s): Steven Gillis, Walter Daelemans, Gert Durieux and Antal
van den Bosch.
Reference: Proceedings of the Fifteenth Annual Conference
of the Cognitive Science Society, Boulder Colorado, USA, Hillsdale:
Lawrence Erlbaum Associates, 452-457, 1993.
-
[abs,
ps]
Tabtalk: Reusability in data-oriented grapheme-to-phoneme conversion.
Data-oriented methods for grapheme-to-phoneme conversion
Learnability and markedness in data-driven acquisition of stress
Author(s): Walter Daelemans, Steven Gillis, Gert Durieux and Antal
van den Bosch.
Reference: T. Mark Ellison and James M. Scobbie (eds) Computational
Phonology. Edinburgh Working Papers in Cognitive Science 8, 1993, 157-178.
-
[abs,
ps]
A data-driven approach to stress acquisition
Author(s): Walter Daelemans, Antal van den Bosch, Steven
Gillis, and Gert Durieux Reference: In P. Adriaans (Ed.),
ECML-93 Workshop Notes on Machine Learning Techniques and Text
Analysis, Vienna, pp. 15-24.
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2008
Analysis of joint inference strategies for the semantic role labeling of Spanish and Catalan
Author(s): Mihai Surdeanu, Roser Morante, and Lluís Màrquez
Reference: In A. Gelbukh (Ed.), Proceedings of the Computational Linguistics and Intelligent Text Processing 9th International Conference, CICLing 2008. Lecture Notes in Computer Science Vol. 4919/2008, Berlin / Heidelberg: Springer, pp. 206-218.
[pdf]
Alignment-based expansion of textual database fields
Author(s): Piroska Lendvai
Reference: In A. Gelbukh (Ed.), Proceedings of the Computational Linguistics and Intelligent Text Processing 9th International Conference, CICLing 2008. Lecture Notes in Computer Science Vol. 4919/2008, Berlin / Heidelberg: Springer, pp. 522-531.
[pdf, first page]
Non-interactive OCR post-correction for giga-scale digitization projects
Author(s): Martin Reynaert
Reference: In A. Gelbukh (Ed.), Proceedings of the Computational Linguistics and Intelligent Text Processing 9th International Conference, CICLing 2008. Lecture Notes in Computer Science Vol. 4919/2008, Berlin / Heidelberg: Springer, pp. 617-630.
[pdf, first page - corrected, post-publication version]
A modular approach to learning Dutch co-reference
Author(s): Véronique Hoste and Antal van den Bosch
Reference: In C. Johansson (Ed.), Proceedings from the First Bergen Workshop on Anaphora Resolution (WAR I), Bergen, Norway, pp. 51-75.
[pdf]
Using citation analysis for expert retrieval in workgroups
Author(s): Toine Bogers, Klaas Kox, and Antal van den Bosch
Reference: In E. Hoenkamp, M. de Cock, and V. Hoste (Eds.), Proceedings of the 8th Belgian-Dutch Information Retrieval Workshop (DIR 2008), pp 21-28. Maastricht, April 2008.
[pdf]
Experiments with an ensemble of Spanish dependency parsers
Author(s): Roser Morante
Reference: Procesamiento del Lenguaje Natural, Revista no. 40, pp. 59-66.
[pdf]
Semantic role labeling tools trained on the Cast3LB-CoNNL-SemRol Corpus
Author(s): Roser Morante
Reference: In Proceedings of the Sixth International Language Resources and Evaluation (LREC'08). Marrakech, Morocco, 2008.
[pdf]
From D-Coi to SoNaR: A reference corpus for Dutch
Author(s): Nelleke Oostdijk, Martin Reynaert, Paola Monachesi, Gertjan Van Noord, Roeland Ordelman, Ineke Schuurman and Vincent Vandeghinste
Reference: In Proceedings of the Sixth International Language Resources and Evaluation (LREC'08). Marrakech, Morocco, 2008.
[pdf]
All, and only, the errors: More complete and consistent spelling and OCR-error correction evaluation
Author(s): Martin Reynaert
Reference: In Proceedings of the Sixth International Language Resources and Evaluation (LREC'08). Marrakech,Morocco, 2008.
[pdf]
From field notes towards a knowledge base
Author(s): Piroska Lendvai and Steve Hunt
Reference: In Proceedings of the Sixth International Language Resources and Evaluation (LREC'08). Marrakech,Morocco, 2008.
[pdf]
A Personalized Recommender System for Writing in the Internet Age
Author(s): Mari Carmen Puerta Melguizo, Olga Munoz Ramos, Lou Boves, Toine Bogers, and Antal van den Bosch
Reference: In Proceedings of the LREC 2008 Workshop on Natural Language Processing Resources, Algorithms, and Tools for Authoring Aid. Marrakech, Morocco, 2008.
Integrating Contextual Factors into Topic-Centric Retrieval Methods for Finding Similar Experts
Author(s): Katja Hofmann, Krisztian Balog, Toine Bogers, and Maarten de Rijke
Reference: In Proceedings of the SIGIR 2008 Workshop on Future Challenges in Expert Retrieval, pp 29-36. Singapore, Singapore, July 2008.
[pdf]
Efficient Context-Sensitive Word Completion for Mobile Devices
Author(s): Antal van den Bosch and Toine Bogers
Reference: In MobileHCI 2008: Proceedings of the 10th International Conference on Human-Computer Interaction with Mobile Devices and Services, IOP-MMI special track, pp 465-470. Amsterdam, The Netherlands, September 2008.
[pdf]
Using Language Models for Spam Detection in Social Bookmarking
Author(s): Toine Bogers and Antal van den Bosch
Reference: In Proceedings of 2008 ECML/PKDD Discovery Challenge Workshop, pp 1-12. Antwerp, Belgium, September 2008.
[pdf]
Recommending Scientific Articles using CiteULike
Author(s): Toine Bogers and Antal van den Bosch
Reference: In RecSys '08: Proceedings of the 2008 ACM Conference on Recommender Systems, pp 287-290, ACM Press, October 2008.
[pdf]
Learning the Scope of Negation in Biomedical Texts
Author(s): Roser Morante, Anthony Liekens, and Walter Daelemans
Reference: In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pp. 715-724.
[pdf]
Preparing archeological reports for intelligent retrieval
Author(s): Hans Paijmans and Sander Wubben
Reference: In Posluschny, K. Lambers, & I. Herzog (Eds.) Layers of Perception. Proceedings of the 35th International Conference on Computer Applications and Quantitative Methods in Archaology, Bonn: Dr. Rudolf Habelt GmbH.
[pdf]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2007
A memory-based classification approach to marker-based EBMT
Author(s): Antal van den Bosch, Nicolas Stroppa, and Andy Way
Reference: In F. Van Eynde, V. Vandeghinste, and I. Schuurman (Eds.), Proceedings of the METIS-II Workshop on New Approaches to Machine Translation, 63-72. January 11, 2007, Leuven, Belgium.
[pdf]
Memory Based Learning and the interpretation of Numbers in archaeological Reports
Author(s): Hans Paijmans and Sander Wubben
Reference: In M.F. Moens, T. Tuytelaars, & A.P. de Vries (Eds.), Proceedings of the 7th Dutch Belgian Information Retrieval Workshop (DIR2007), pp. 51-56, Leuven, Belgium
[pdf]
Learning to segment and label semi-structured documents with little or no supervision
Author(s): Sander Canisius and Caroline Sporleder
Reference: In P. Adriaans, M. van Someren, and S. Katrenko (Eds.), Proceedings of the 18th BENELEARN Conference. May 14, 2007, Amsterdam, The Netherlands.
[pdf]
Superlinear parallelisation of the k-nearest neighbor classifier
Author(s): Antal van den Bosch and Ko van der Sloot
Reference: In P. Adriaans, M. van Someren, and S. Katrenko (Eds.), Proceedings of the 18th BENELEARN Conference. May 14, 2007, Amsterdam, The Netherlands.
[pdf]
Automatic techniques for generating and correcting cultural heritage collection metadata
Author(s): Antal van den Bosch, Caroline Sporleder, Marieke van
Erp, and Steve Hunt Reference: In Proceedings of Digital
Humanities 2007, the 19th Joint International Conference of the
Association for Computers and the Humanities and the Association for
Literary and Linguistic Computing, University of Illinois at Urbana-Champaign, Illinois, US, pp. 223-224. [online
abstract]
Retrieving lost information from textual databases: Rediscovering expeditions from an animal specimen database
Author(s): Marieke van Erp
Reference: In Proceedings of the Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2007), Prague, Czech Republic, pp. 17-24.
[pdf, bib]
Bootstrapping information extraction from field books
Author(s): Sander Canisius and Caroline Sporleder
Reference: In Proceedings of the 2007 Joint Conference on
Empirical Methods in Natural Language Processing and Computational
Natural Language Learning (EMNLP-CoNLL), Prague, Czech Republic, pp. 827-836.
[pdf, bib]
A constraint satisfaction approach to dependency parsing
Author(s): Sander Canisius and Erik Tjong Kim Sang
Reference: In Proceedings of the CoNLL Shared Task
Session of EMNLP-CoNLL 2007, Prague, Czech Republic, pp. 1124-1128.
[pdf, bib]
ILK: Machine learning of semantic relations with shallow features and almost no data
ILK2: Semantic role labeling of Catalan and Spanish using TiMBL
Author(s): Roser Morante and Bertjan Busser
Reference: In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, pp. 183-186.
[pdf, bib]
What a proactive recommendation system needs: Relevance, non-intrusiveness, and a new long-term memory
Author(s): Mari-Carmen Puerta Melguizo, Toine Bogers, Anita Deshpande, Lou Boves, and Antal van den Bosch
Reference: In Proceedings of the 9th International Conference on Enterprise Information Systems (ICEIS 2007), Funchal, Madeira.
[pdf]
Broad expertise retrieval for sparse data environments
Author(s): Krisztian Balog, Toine Bogers, Leif Azzopardi, Maarten de Rijke, and Antal van den Bosch
Reference: In SIGIR'07: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, the Netherlands, pp. 551-558.
[pdf]
Token-based chunking of turn-internal dialogue act sequences
Author(s): Piroska Lendvai and Jeroen Geertzen
Reference: In Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, Antwerp, Belgium, pp. 174-181.
[pdf]
Memory-based semantic role labeling of Catalan and Spanish
Author(s): Roser Morante and Antal van den Bosch
Reference: In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-2007), Borovets, Bulgaria, pp. 388-394.
[pdf]
Recompiling a knowledge-based dependency parser into memory
Author(s): Sander Canisius and Antal van den Bosch
Reference: In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-2007), Borovets, Bulgaria, pp. 104-108.
[pdf]
Arabic computational morphology: Knowledge-based and empirical methods
Memory-based morphological analysis and part-of-speech tagging of Arabic
Author(s): Antal van den Bosch, Erwin Marsi, and Abdelhadi Soudi
Reference: In Soudi, A., Van den Bosch, A., and Neumann, G. (Eds), Arabic computational morphology: Knowledge-based and empirical methods, Chapter 11, pp. 203-219. Berlin: Springer.
[pdf of preprint]
Comparing and evaluating information retrieval algorithms for news recommendation
Author(s): Toine Bogers and Antal van den Bosch
Reference: In Proceedings of the 2007 ACM Conference on Recommender Systems, Minneapolis, MN, pp. 141-144, ACM Press
[pdf]
Open Boek: A system for the extraction of numeric data from archeological reports
Author(s): Hans Paijmans and Sander Wubben
Reference: In Proceedings of the UK e-Science 2007 All Hands Meeting, Nottingham, UK
[pdf]
Superlinear parallelization of k-nearest neighbor retrieval
Author(s): Antal van den Bosch and Ko van der Sloot
Reference: In M. Dastani and E. de Jong (Eds.), Proceedings of the 19th Belgian-Dutch Artificial Intelligence Conference (BNAIC-2007), Utrecht, The Netherlands, pp. 65-72.
[pdf]
Exploiting source similarity for SMT using context-informed features
Author(s): Nicolas Stroppa, Antal van den Bosch, and Andy Way
Reference: In A. Way
and B. Gawronska (Eds.), Proceedings of the 11th International Conference on Theoretical Issues in Machine Translation (TMI 2007), Skövde, Sweden, pp. 231-240.
[pdf]
A pilot study for semantic role labeling in a Dutch corpus
Author(s): Gerwert Stevens, Paola Monachesi, and Antal van den Bosch
Reference: In P. Dirix, I. Schuurman, V. Vandeghinste, and F. Van Eynde (Eds.), Computational Linguistics in the Netherlands: Selected Papers from the Seventeenth CLIN Meeting, Leuven, Belgium, pp. 99-114.
[preprint pdf]
An efficient memory-based morpho-syntactic tagger and parser for Dutch
Author(s): Antal van den Bosch, Bertjan Busser, Sander Canisius, and Walter Daelemans
Reference: In P. Dirix, I. Schuurman, V. Vandeghinste, and F. Van Eynde (Eds.), Computational Linguistics in the Netherlands: Selected Papers from the Seventeenth CLIN Meeting, Leuven, Belgium, pp. 99-114.
[preprint pdf]
Dat gebeurd mei niet: Computationele modellen voor verwarbare homofonen
Author(s): Walter Daelemans and Antal van den Bosch
Reference: In: Dominiek Sandra, Rita Rymenans, Pol Cuvelier, & Peter van Petegem (Eds), Tussen taal, spelling en onderwijs: Essays bij het emeritaat van Frans Daems. Gent: Academia Press, pp. 199-210, 2007
[pdf]
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2006
Authoritative re-ranking in fusing authorship-based subcollection
search results
Author(s): Toine Bogers and Antal van den
Bosch
Reference: In F. de Jong and W. Kraaij (Eds.), Proceedings of the Sixth Belgian-Dutch Information Retrieval Workshop, DIR-2006, pp 49-55. Enschede: Neslia Paniculata.
[pdf]
Constraint satisfaction inference: Non-probabilistic global inference for sequence labelling
Spotting the 'odd-one-out': Data-driven error detection and correction
in textual databases
Correcting 'wrong-column' errors in text databases
Authoritative re-ranking of search results
Author(s): Toine Bogers and Antal van den
Bosch
Reference: In Proceedings of the 28th European
Conference on Information Retrieval (ECIR 2006), vol. 3936 of
Lecture Notes in Computer Science, pp. 519-522. Springer Verlag, April
2006.
[pdf]
Identifying named entities in text databases from the natural history domain
Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
Author(s): Antal van den Bosch, Ineke Schuurman, and Vincent Vandeghinste
Reference: In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC-06), Genoa, Italy, 2006.
[pdf]
Corpus-induced corpus cleanup
Author(s): Martin Reynaert
Reference: In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC-06), Genoa, Italy, 2006.
[pdf; slides (containing more results)]
Dependency parsing by inference over high-recall dependency predictions
Improved morpho-phonological sequence processing with constraint satisfaction inference
Author(s): Antal van den Bosch and Sander Canisius
Reference: In Proceedings of the Eighth Meeting of the ACL Special Interest Group in Computational Phonology, SIGPHON '06, June 2006, New York City, NY.
[pdf]
All-word prediction as the ultimate confusible disambiguation
Author(s): Antal van den Bosch
Reference: In Proceedings of the HLT-NAACL Workshop on Computationally hard problems and joint inference in speech and language processing, June 2006, New York City, NY.
[pdf]
Broad Coverage Paragraph Segmentation across Languages and Domains
Author(s): Caroline Sporleder and Mirella Lapata
Reference: ACM Transactions in Speech and Language Processing, 3:2, 1-35, July 2006.
[pdf]
A rule-based approach for process discovery: Dealing with noise and imbalance in process logs
Author(s): Laura Maruster, Ton Weijters, Wil van der Aalst, and
Antal van den Bosch
Reference: Data Mining and Knowledge Discovery,
13, pp. 67-87, 2006.
[preprint pdf]
Spelling space: A computational test bed for phonological and morphological changes in Dutch spelling
Bootstrapping multilingual geographical gazetteers from corpora
Author(s): Marieke van
Erp Reference: In J. Huitink & S. Katrenko (Eds.),
Proceedings of the 11th ESSLLI Student Session, Malaga, Spain,
31 July - 11 August 2006, pp. 192-202. [pdf]
Discrete versus probabilistic sequence classifiers for domain-specific entity chunking
Expertise classification: Collaborative classification vs. automatic extraction
Author(s): Toine Bogers, Willem Thoonen, and Antal van den
Bosch
Reference: In Proceedings of the 17th Annual ASIS&T SIG/CR workshop on Social Classification, Austin, TX.
[pdf]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2005
Memory-based language processing
Local classification and global estimation: Explorations of the k-nearest neighbor algorithm
Author(s): Iris Hendrickx
Reference: Ph.D. Thesis, Tilburg University, November 2005
[pdf]
Improving sequence segmentation learning by predicting trigrams
Author(s): Antal van den Bosch and Walter Daelemans
Reference: In Proceedings of the Ninth Conference on
Natural Language Learning, CoNLL-2005, June 29-30, 2005, Ann
Arbor, MI, pp. 80-87. [pdf]
Applying spelling error correction techniques for improving semantic role labelling
Memory-based morphological analysis generation and part-of-speech
tagging of Arabic
Author(s): Erwin Marsi, Antal van den Bosch, and Abdelhadi Soudi
Reference: In Proceedings of the ACL Workshop on
Computational Approaches to Semitic Languages, June 29, 2005, Ann
Arbor, MI. [pdf]
Robust ASR lattice representation types in pragma-semantic processing
of spoken input
Rule meta-learning for trigram-based sequence processing
Memory-based understanding of user utterances in a spoken dialogue system: Effects of feature selection and co-learning
Author(s): Antal van den Bosch
Reference: In Workshop Proceedings of the 6th International Conference on Case-Based Reasoning, Chicago, August 2005, pp. 85-94.
[pdf]
Designing an active learning based system for corpus annotation
Author(s): Bertjan Busser and Roser Morante
Reference: In Proceedings of the XXI Congresso de la
Sociedad Espanola para el Procesamiento del Lenguaje Natural,
SEPLN-2005, Granada, Spain, pp. 375-381.
[pdf]
Discourse chunking and its application to sentence compression
Author(s): Caroline Sporleder and Mirella Lapata
Reference: In Proceedings of the 2005 Human Language
Technology Conference and the Conference on Empirical Methods in
Natural Language Processing, HLT/EMNLP-05), Vancouver, Canada.
[pdf]
Exploiting linguistic cues to classify rhetorical relations
Author(s): Caroline Sporleder and Alex Lascarides
Reference: In Proceedings of Recent Advances in Natural
Language Processing (RANLP-05), pp. 532-539, Borovets, Bulgaria
[pdf]
Hybrid algorithms for instance-based classification
Author(s): Iris Hendrickx and Antal van den Bosch
Reference: In J. Gama, R. Camacho, P. Brazdil, A. Jorge,
and L. Torgo (Eds.), Machine Learning: ECML 2005: 16th European
Conference on Machine Learning, Porto, Portugal, October 3-7,
2005. Lecture Notes in Computer Science 3720. Berlin: Springer Verlag,
pp. 158-169.
[pdf]
Taxonómia felismerése dokumentumszerkezetbõl
Author(s): Piroska Lendvai
Reference: In: Proceedings of Computational Linguistics in Hungary Conference (Magyar Szamítógépes Nyelvészeti Konferencia, MSZNY-2005), Szeged, Hungary, 2005. pp. 88-95.
[pdf]
Conceptual taxonomy identification in medical documents
Author(s): Piroska Lendvai
Reference: In:
Proceedings of The Second International Workshop on Knowledge
Discovery and Ontologies (KDO-2005), held within ECML/PKDD, Porto,
Portugal, 2005. pp. 31-38.
[pdf]
Scalable classification-based word prediction and confusible correction
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2004
Using rule-induction techniques to model pronunciation variation in Dutch
Author(s): Veronique Hoste, Walter Daelemans and Steven Gillis
Reference: Computer Speech and Language 18:1, pp. 1-24.
[pdf]
Memory-based semantic role labeling: Optimizing features, algorithm, and output
Optionality in evaluating prosody prediction
Author(s): Erwin Marsi Reference: In Proceedings of
5th ISCA Speech Synthesis Research Workshop, Pittsburgh,
USA, 2004.
[pdf]
GAMBL, genetic algorithm optimization of memory-based WSD
Author(s): Bart Decadt, Veronique Hoste, Walter Daelemans, and
Antal van den Bosch
Reference: In: R. Mihalcea and P. Edmonds (eds.),
Proceedings of the Third International Workshop on the Evaluation of
Systems for the Semantic Analysis of Text (Senseval-3), Barcelona,
Spain, July 2004, pages 108-112.
[pdf]
Text induced spelling correction
Author(s): Martin Reynaert
Reference: In: Proceedings of the 20th International
Conference on Computational Linguistics (COLING 2004), August 2004,
Geneva, Switzerland.
[pdf]
Multilingual text induced spelling correction
Author(s): Martin Reynaert
Reference: In: Proceedings of the COLING 2004 Workshop on Multilingual Linguistic Resources, August 2004, Geneva, Switzerland.
[pdf]
Feature transformation through rule induction: a case study with the
k-NN classifier
Author(s): Antal van den Bosch
Reference: In J. Fürnkrantz (Ed.), Proceedings of
the ECML/PKDD 2004 Workshop on Advances in Inductive Rule
Learning, Pisa, Italy, September 2004, pp. 1-16.
[pdf]
Memory-based robust interpretation of recognised speech
Maximum-entropy parameter estimation for the k-NN modified
value-difference kernel
Author(s): Iris Hendrickx and Antal van den Bosch
Reference: In R. Verbrugge, N. Taatgen, and L. Schomaker
(Eds.), Proceedings of the 16th Belgian-Dutch Conference on
Artificial Intelligence, Groningen, The Netherlands, pp.
[pdf]
Wrapped progressive sampling search for optimizing learning algorithm
parameters
Author(s): Antal van den Bosch
Reference: In R. Verbrugge, N. Taatgen, and L. Schomaker
(Eds.), Proceedings of the 16th Belgian-Dutch Conference on
Artificial Intelligence, Groningen, The Netherlands, pp.
[pdf]
FINT: Find Images aNd Text
Author(s): Menno van Zaanen and Guido de Croon
Reference: Working Notes of the Workshop of the Cross-Language Evaluation Forum, Bath, UK
[pdf]
Learning compound boundaries for Afrikaans spelling checking
Author(s): Gerhard van Huyssteen and Menno van Zaanen
Reference: In Proceedings of the Workshop on International Proofing Tools and Language Technologies, Patras, Greece, July 2004.
[pdf]
A multilingual parallel parsed corpus as gold standard for grammatical inference evaluation
Author(s): Menno van Zaanen, Andrew Roberts, and Eric Atwell
Reference: In Proceedings of The Amazing Utility of Parallel and Comparable Corpora Workshop, held at the 4th International Conference on Language Resources and Evaluation, Lisbon, Portugal, May 2004.
[pdf]
Introduction to the special issue on grammar induction
Author(s): Pieter Adriaans, Henning Fernau, Colin de la Higuera, and Menno van Zaanen
Reference: Grammars: Journal of Mathematical Research on Formal and Natural Languages. Special issue on Grammar induction. 2004.
[pdf]
Automatic sentence simplification for subtitling in Dutch and English
Author(s): Walter Daelemans, Anja Höthker, and Erik Tjong Kim Sang
Reference: In: M.T. Lino e.a. (eds.), Proceedings of the 4th
International Conference on Language Resources and Evaluation, pages
1045-1048, 2004.
[pdf]
Verb classification: Machine learning experiments in classifying verbs into semantic classes
Author(s): Bart Decadt and Walter Daelemans
Reference: In L. Guthrie e.a. (eds.), Proceedings of the
LREC 2004 Workshop "Beyond Named Entity Recognition - Semantic
Labelling for NLP Tasks", pages 25-30.
[pdf]
Evaluation and adaptation of the Celex Dutch morphological database
Author(s): Tom Laureys, Guy De Pauw, Hugo Van Hamme, Walter Daelemans, and Dirk Van Compernolle
Reference: In: M.T. Lino e.a. (Eds.), Proceedings of the 4th
International Conference on Language Resources and Evaluation, pages
1247-1250, 2004.
[pdf]
Multimodal multilingual resources in the subtitling process
Author(s): Stelios Piperidis, Iason Demiros, Prokopis
Prokopidis, Peter Vanroose, Anja Höthker, Walter Daelemans, Elsa
Sklavounou, Manos Konstantinou, and Yannis Karavidas
Reference: In: M.T. Lino e.a. (Eds.), Proceedings of the 4th
International Language Resources and Evaluation Conference (LREC 2004),
Lisbon.
[pdf]
Unsupervised text mining for ontology extraction: An evaluation of statistical measures
Author(s): Marie-Laure Reinberger and Walter Daelemans
Reference: In M.T. Lino e.a. (Eds.), Proceeding of the 4th
International Language Resources and Evaluation Conference (LREC 2004),
May 2004, Lisbon, pp. 491-494.
[pdf]
Using a parallel transcript/subtitle corpus for sentence compression
Author(s): Vincent Vandeghinste and Erik Tjong Kim Sang
Reference: In: M.T. Lino e.a. (eds.), Proceedings of the 4th
International Conference on Language Resources and Evaluation, pages
231-234, 2004.
[pdf]
A memory-based shallow parser for spoken Dutch
Author(s): Sander Canisius and Antal van den Bosch
Reference: In B. Decadt, G. De Pauw, and V. Hoste (Eds.),
Selected papers from the Thirteenth Computational Linguistics in
the Netherlands Meeting, Antwerp, Belgium, pp. 31-45.
[pdf]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2003
Learning PP attachment for filtering prosodic phrasing
Learning to identify fragmented words in spoken discourse
Author(s): Piroska Lendvai Reference: In:
Proceedings of EACL-03 Student Research Workshop, Budapest,
2003, pp. 25-32.
[pdf]
Machine learning for shallow interpretation of user utterances in
spoken dialogue systems
Author(s): Piroska Lendvai, Antal van den
Bosch and Emiel Krahmer
Reference: In: Proceedings of the EACL-03 Workshop on
Dialogue Systems: Interaction, Adaptation and Styles of
Management. Budapest, 2003. pages 69-78.
[pdf] - note: corrected version!
Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition
Author(s): Erik F. Tjong Kim Sang and Fien De Meulder
Reference: In: Proceedings of CoNLL-2003, Edmonton, Canada,
2003, pp. 142-147.
[pdf]
Memory-based one-step named-entity recognition: Effects of seed list features, classifier stacking, and unannotated data
Author(s): Iris Hendrickx and Antal van den Bosch
Reference: In: Proceedings of CoNLL-2003, the Seventh Conference on Natural Language Learning, Edmonton, Canada,
2003, pp. 176-179.
[pdf]
Memory-based named entity recognition using unannotated data
Author(s): Fien De Meulder and Walter Daelemans
Reference: In: Proceedings of CoNLL-2003, the Seventh
Conference on Natural Language Learning, Edmonton, Canada, 2003, pp. 208-211.
[pdf]
Learning to predict pitch accents and prosodic boundaries in Dutch
Process discovery for evaluating dialogue strategies
Author(s): Piroska Lendvai and Laura Maruster
Reference: In: Proceedings of the ISCA Workshop on Error
Handling in Spoken Dialogue Systems. Chateau d'Oex-Vaud, Switzerland,
2003. pages 119-122.
[pdf]
Memory-based disfluency chunking
A machine learning approach to understand business processes
Author(s): Laura Maruster
Reference: Ph.D. Thesis, Eindhoven Technical University.
Promotors: Prof. dr. ir. J.C. Wortmann, prof. dr. W.M.P. Daelemans.
August 27, 2003.
Feature-rich memory-based classification for shallow NLP and
information extraction
Author(s): Jakub Zavrel and Walter Daelemans
Reference: In Jurgen Franke, Gholamreza Nakhaeizadeh and
Ingrid Renz (Eds.), Text Mining,
Theoretical Aspects and Applications, Springer Physica-Verlag, pp. 33-54
[pdf]
Combined optimization of feature selection and algorithm parameter
interaction in machine learning of language
Author(s): Walter Daelemans, Véronique Hoste, Fien De
Meulder and Bart Naudts
Reference: Proceedings of the 14th European Conference on
Machine Learning (ECML-2003), Lecture Notes in Computer Science 2837,
Springer-Verlag, Cavtat-Dubrovnik, Croatia, pp. 84-95
[pdf]
Information extraction via double classification
Author(s): An De Sitter and Walter Daelemans
Reference: In: Proceedings of the International Workshop
on Adaptive Text Extraction and Mining. Catvat-Dubrovnik, Croatia,
66-73, September 2003. [Also: Department of Mathematics and Computer
Science, University of Antwerp, Technical Report 2003-06.]
[pdf]
Is shallow parsing useful for the unsupervised learning of semantic
clusters?
Author(s): Marie-Laure Reinberger and Walter Daelemans
Reference: In: Proceedings of the 4th Conference on
Intelligent Text Processing and Computational Linguistics (CICLing
2003), Mexico City, Mexico, Lecture Notes in Computer Science 2588, Springer Verlag, 2003,
pp. 304-313.
[pdf]
Mining for lexons: applying unsupervised learning methods to create ontology bases
Author(s): Marie-Laure Reinberger, Peter Spyns, Walter Daelemans, and Robert Meersman
Reference: In: Robert Meersman, Zahir Tari, and
Douglas Schmidt (Eds.) On the Move to Meaningful Internet Systems
2003: CoopIS, DOA, and ODBASE, Lecture Notes in Computer Science 2888,
Springer-Verlag, Catania, Italy, 803-819.
[pdf]
Workflow mining: A survey of issues and approaches
Author(s): W.M.P. van der Aalst, B.F. van Dongen, J. Herbst, L. Maruster, G. Schimm, and A.J.M.M. Weijters
Reference: Data and Knowledge Engineering, 47:2, pp. 237-267.
Various uses of a spelling checker project: Practical experiences, teaching, and learning
Author(s): Menno van Zaanen and Gerhard van Huyssteen
Reference: Southern African Linguistics and Applied Language
Studies Journal, special issue on "Language Technology in Southern
Africa: resources and applications"
[pdf]
Alignment-based learning versus data-oriented parsing
Author(s): Menno van Zaanen
Reference: In Rens Bod, Remko Scha, and Khalil Sima'an (Eds.), Data-oriented parsing, Chapter 20.
[pdf]
A memory-based approach to meter induction
Author(s): Menno van Zaanen, Rens Bod, and Henkjan Honing
Reference: In Proceedings of the 5th Triennial Conference of the European Society for the Cognitive Sciences of Music (ESCOM), Hanover, Germany, September 2003.
[pdf]
A spellchecker for Afrikaans, based on morphological analysis
Author(s): Gerhard van Huyssteen and Menno van Zaanen
Reference: In Proceedings of the 6th International Terminology in Advanced Management Applications Conference (TAMA), Pretoria, South-Africa, February 2003.
[pdf]
Proceedings of the Seventh Conference on Natural Language Learning
Author(s): Walter Daelemans and Miles Osborne (Eds.)
Reference: Proceedings of the Seventh Conference on Natural Language Learning. Edmonton, Canada, ACL, vii + 212 pages, ISBN 1-932432-08-6.
[online proceedings]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2002
Memory-based grammatical relation finding
Shallow parsing on the basis of words only: A case study
Author(s): Antal van den Bosch and Sabine Buchholz.
Reference: In Proceedings of the 40th
Meeting of the Association for Computational Linguistics (ACL'02).
corrects the proceedings version on three points: (1) two words in the
abstract, (2) the title of section 4, and (3) a corrected reference to
Eisner 1996) [pdf]
Dutch word sense disambiguation: Optimizing the localness of context
Evaluating the results of a memory-based word-expert approach to
unrestricted word sense disambiguation
Improving machine-learned detection of miscommunications in human-machine dialogues through informed data splitting
Multi-feature error detection in spoken dialogue systems
Introduction to the CoNLL-2002 shared task: Language-independent named entity recognition
Memory-based named entity recognition
Logistic-based patient grouping for multi-disciplinary treatment
Process Mining: Discovering Direct Successors in Process Logs
Author(s): Laura Maruster, Ton Weijters, Wil van der Aalst, and
Antal van den Bosch Reference: In S. Lange, K. Satoh,
C.H. Smith (Eds.): Discovery Science, 5th International
Conference. Lecture Notes in Computer Science 2534. Berlin:
Springer, pp. 364-373, 2002.
[ps]
Parameter optimization for machine learning of word sense disambiguation
Combining information sources for memory-based pitch accent placement
Expanding k-NN analogy with instance families
Author(s): Antal van den Bosch
Reference: In R. Skousen, D. Lonsdale, and D. Parkinson (Eds.),
Analogical Modeling: An exemplar-based approach to
language. Amsterdam: John Benjamins, pp. 209-224.
[pdf of preprint
]
A comparison of analogical modeling of language to memory-based language processing
Author(s): Walter Daelemans
Reference: In R. Skousen, D. Lonsdale, and D. Parkinson (Eds.),
Analogical Modeling: An exemplar-based approach to
language. Amsterdam: John Benjamins, pp. 157-179.
[pdf of preprint]
Evaluation of machine learning methods for natural language processing tasks
Author(s): Walter Daelemans and Véronique Hoste
Reference: In Proceedings of
LREC-2002, the third International Language Resources and Evaluation
Conference, Las Palmas, Spain, 755-760
[pdf]
A field survey for establishing priorities in the development of HLT
resources for Dutch
Author(s): Binnenpoorte, D., F. De Vriend, J. Sturm, W. Daelemans,
H. Strik, C. Cucchiarini
Reference: In Proceedings of
LREC-2002, the third International Language Resources and Evaluation
Conference, Las Palmas, Spain, 1862-1866
[pdf]
Review of: Knowledge and learning in language, Charles D. Yang
Author(s): Walter Daelemans
Reference: Glot International, 6:5, 137-142
[pdf of preprint]
Memory-based phoneme-to-grapheme conversion
Author(s): Bart Decadt, Jacques Duchateau, Walter Daelemans, and Patrick
Wambacq
Reference: In M. Theune, A. Nijholt, and H. Hondrop (Eds.), Computational Linguistics in the
Netherlands 2001. Selected Papers from the Twelfth CLIN Meeting,
Amsterdam - New York: Rodopi, pp. 47-61
[ps]
Transcription of out-of-vocabulary words in large vocabulary
speech recognition based on phoneme-to-grapheme conversion
Author(s): Bart Decadt, Jacques Duchateau, Walter Daelemans, and Patrick
Wambacq
Reference: In Proceedings of ICASSP-02, the International Conference on Acoustics,
Speech and Signal Processing, Volume I, Orlando, USA, pp. 861-864
[ps]
A named entity recognition system for Dutch
Author(s): Fien De Meulder,Walter Daelemans, and Véronique Hoste
Reference: In: M. Theune,
A. Nijholt, and H. Hondrop (Eds.), Computational Linguistics in the
Netherlands 2001. Selected Papers from the Twelfth CLIN Meeting,
Amsterdam - New York: Rodopi, 77-88
[ps]
Special issue on machine learning approaches to shallow parsing
Author(s): James Hammerton, Miles Osborne, Susan Armstrong, and Walter Daelemans (Eds.)
Reference: Journal of Machine Learning Research, 2, pp. 551-719
[webpage]
Introduction to the special issue on machine learning approaches to shallow parsing
Author(s): James Hammerton, Miles Osborne, Susan Armstrong, and Walter Daelemans (Eds.)
Reference: Journal of Machine Learning Research, 2, pp. 551-558
[ps]
Where do syllables come from?
Author(s): E. Martens, W. Daelemans, S. Gillis, and H. Taelman
Reference: In: W. Gray and C. Schunn (Eds.) Proceedings of
the Twenty-Fourth Annual Conference of the Cognitive Science Society,
Fairfax, Virginia, George Mason University, pp. 657-664
[pdf]
Dutch HLT resources: From BLARK to priority lists
Author(s): H. Strik, W. Daelemans, D. Binnenpoorte, J. Sturm, F. De
Vriend, C. Cucchiarini
Reference: In: Proceedings of ICSLP-2002, Denver, USA, pp. 1549-1552
[pdf]
Actieplan voor het Nederlands in de taal- en spraaktechnologie: Prioriteiten voor basisvoorzieningen
Author(s): Walter Daelemans and Helmer Strik
Reference: Report for the Nederlandse Taalunie, 165 pages,
2002. (Action Plan for Dutch in Language and Speech technology:
Priorities for Basic Resources).
[webpage]
Proceedings of the Sixth Conference on Natural Language Learning (CoNLL-2002)
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2001
Detecting problematic turns in human-machine interactions: Rule-induction versus
memory-based learning approaches
Author(s): Antal van den Bosch, Emiel Krahmer, and Marc Swerts.
Reference: In Proceedings of the 39th Meeting of the
Association for Computational Linguistics (ACL'01). New Brunswick, NJ:
ACL, pp. 499-506, 2001.
[pdf]
Improving accuracy in word class tagging through combination of
machine learning systems
Review of "Learnability in Optimality Theory"
Author(s): Walter Daelemans.
Reference: Computational Linguistics, 27:2, pp. 316-317, 2001
[ps]
SHAPAQA: Shallow parsing for question answering on the World Wide Web
Author(s): Sabine Buchholz, Walter Daelemans.
Reference: In Proceedings of Euroconference Recent Advances
in Natural Language Processing (RANLP), Tsigov Chark, Bulgaria,
5-7 September, pp. 47-51, 2001.
[pdf]
Complex answers: A case study using a WWW question answering system
Using grammatical relations, answer frequencies and the World Wide Web for TREC question answering
Author(s): Sabine Buchholz.
Reference: Notebook paper for TREC 2001. NIST, 2001. Official proceedings paper to appear later.
[pdf]
TreeTalk: Memory-based word phonemisation
Combining a self-organising map with memory-based learning
Memory-based clause identification
Transforming a chunker to a parser
Introduction to the CoNLL-2001 shared task: Clause identification
Automatic discovery of workflow models from hospital data
Robust data oriented parsing for speech-understanding
Author(s): Khalil Sima'an
Reference: In Proceedings of the International
Workshop on Parsing Technologies (IWPT'01), Beijing, China, October 2001.
[ps]
Building a tree-bank of modern Hebrew text
Author(s): Khalil Sima'an, A. Itai, Y. Winter, A. Altman, and N. Nativ
Reference: In Beatrice Daille and Laurent Romary (eds.),
Special Issue on Natural Language Processing
and Corpus Linguistics, Journal Traitement Automatique des Langues
(t.a.l.), 2001
[ps]
Enhancing the robustness of data oriented parsing for speech-understanding
Author(s): Khalil Sima'an
Reference: In Proceedings of the Natural Language Processing
Pacific Rim Symposium (NLPRS'01), Tokyo, Japan, November 2001.
[ps]
Predicting phrase breaks with memory-based learning
Author(s): Bertjan Busser, Walter Daelemans, Antal van den
Bosch Reference: In Proceedings 4th ISCA Tutorial and
Research Workshop on Speech Synthesis. Perthshire Scotland, August
29th - September 1st, 2001.
[ps]
Dutch word sense disambiguation: Data and preliminary results
Author(s): Iris Hendrickx, Antal van den
Bosch Reference: In Proceedings of SENSEVAL-2. Toulouse, France.
[pdf]
Phoneme-to-grapheme conversion for out-of-vocabulary words in large
vocabulary speech recognition
Author(s): Bart Decadt, Jacques Duchateau, Walter Daelemans, Patrick Wambacq
Reference: In: ASRU 2001 Automatic Speech
Recognition and Understanding Workshop Proceedings. IEEE CD-rom
(IEEE Catalog Number 01EX544, ISBN 0-7803-7343-X), Trento, Italy, 9-13
December, 4 pages
[ps]
Phoneme-to-grapheme conversion for out-of-vocabulary words in speech recognition
Author(s): Bart Decadt and Walter Daelemans
Reference: ATRANOS Deliverable
WP2, March 31, 15 pages
[ps]
Optimizing phoneme-to-grapheme conversion for out-of-vocabulary words in speech
recognition
Author(s): Bart Decadt and Walter Daelemans
Reference: ATRANOS Deliverable
WP2, September 30, 15 pages
[ps]
Classifier optimization and combination in the English all words task
Author(s): Véronique Hoste, Anne Kool, and Walter Daelemans
Reference: In:
Judita Preiss and David Yarowsky (Eds.), Proceedings of
SENSEVAL-2. Second International Workshop on Evaluating Word Sense
Disambiguation Systems. New Brunswick: ACL, pp. 83-86
[ps]
Strengthening the Dutch Human Language Technology Infrastructure
Author(s): Catia Cucchiarini, Walter Daelemans, Helmer Strik
Reference: The ELRA Newsletter, 6:4, pp. 3-7
[ps]
Proceedings CLIN-2000
Editors: Walter Daelemans, Khalil Sima'an, Jorn Veenstra, Jakub Zavrel
Reference: Computational Linguistics in the Netherlands 2000.
Selected Papers from the Eleventh CLIN Meeting. Amsterdam - New York:
Rodopi. ISBN 90-420-1257-9. 2001.
Proceedings of the Fifth Conference on Natural Language Learning
(CoNLL-2001)
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
2000
Integrating seed names and n-grams for a named entity list and
classifier
Learning statistically neutral tasks without expert guidance
Unpacking multi-valued
symbolic features and classes in memory-based language learning
Author(s): Antal van den Bosch and Jakub Zavrel
Referemce: In P. Langley (Ed.), Proceedings of the
Seventeenth International Conference on Machine Learning, pp.
1055-1062. San Francisco, CA: Morgan Kaufmann, 2000.
[abs, ps]
Memory-Based Word Sense Disambiguation
Author(s): Jorn Veenstra, Antal van den Bosch, Sabine Buchholz, Walter Daelemans, Jakub Zavrel
Reference: Computers and the Humanities, special issue on Senseval, Word Sense Disambiguations, Ed. Adam Kilgarriff and Martha Palmer,
34:1-2, 2000
[abs,
ps of preprint]
A distributed, yet
symbolic model of text-to-speech processing
Author(s): Antal van den Bosch and Walter Daelemans
Reference: In P. Broeder and
J.M.J. Murre (Eds.), Models of Language Acquisition: inductive and
deductive approaches. Oxford University Press, 76-99, 2000.
[abs, ps]
Lazy learning: A comparison of natural and machine learning of
stress
Author(s): Steven Gillis, Walter Daelemans, and
Gert Durieux.
Reference: In: P. Broeder and J.M.J. Murre
(Eds.), Models of Language Acquisition: inductive and deductive
approaches . Oxford University Press, 76-99, 2000.
[ps]
Inductive lexica
Author(s): Walter Daelemans and Gert Durieux.
Reference: In: Van Eynde, F. and D. Gibbon
(eds.) Lexicon Development for speech and language
processing, Kluwer Academic Publishers, 115-139, 2000.
[ ps]
(preprint)
Bootstrapping a tagged corpus through combination of existing
heterogeneous taggers
Author(s): Jakub Zavrel and Walter Daelemans
Reference: In: Proceedings of the second
international conference on language resources and evaluation
(LREC-2000) , Athens, Greece, 17-20, 2000
[ps]
Part of speech tagging and lemmatisation for the Spoken Dutch
Corpus
Author(s): Frank Van Eynde, Jakub Zavrel, and
Walter Daelemans
Reference: In: Proceedings of the second
international conference on language resources and evaluation
(LREC-2000) , Athens, Greece, 1427-1434, 2000
[ps]
Lemmatisation and morphosyntactic annotation for the spoken dutch
corpus
Author(s): Frank Van Eynde, Jakub Zavrel, and
Walter Daelemans
Reference: In: Paola Monachesi (Ed.), Computational Linguistics in the Netherlands
1999. Utrecht, OTS, pp. 73-84
[ps]
Diverse classifiers for NLP disambiguation tasks. Comparisons,
optimization, combination, and evolution
Author(s): Jakub Zavrel, Sven Degroeve, Anne Kool, Walter Daelemans, Kristina Jokinen
Reference: In: Jokinen et al. (Eds.),
TWLT 18. Learning to Behave. CEvoLE 2, Ieper, Belgium, pp. 201-221
[ps]
Meta-learning for phonemic annotation of corpora
Author(s): Veronique Hoste, Walter Daelemans, Erik Tjong Kim Sang,
Steven Gillis
Reference: In: Proceedings 17th
international conference on Machine Learning , Stanford, 375-382,
2000
[ps]
A Memory-Based Alternative for Connectionist Shift-Reduce Parsing
Using induced rules as complex features in memory-based language learning
Author(s): Antal van den Bosch
Reference: In Proceedings of the Fourth Conference on
Computational Natural Language Learning and of the Second Learning
Language in Logic Workshop (CoNLL/LLL). New Brunswick, NJ: ACL,
pp.73-78.
[ps]
Tree-gram parsing: Lexical dependencies and structural relations
Author(s): Khalil Sima'an
Reference: In Proceedings of the 38th Meeting of the Association for Computational Linguistics (ACL'00). New Brunswick, NJ: ACL, pp. 53-60.
[ps]
Efficient parsing of domain language
Author(s): Khalil Sima'an
Reference: In Proceedings of the Twelfth Belgium-Netherlands Conference on Artificial Intelligence (BNAIC'00). Tilburg: ILK / Infolab, pp. 199-206.
[ps]
Machine learning for modeling Dutch pronunciation variation
Author(s): Véronique Hoste, Steven Gillis, and Walter Daelemans
Reference: In: Paola
Monachesi (Ed.), Computational Linguistics in the Netherlands
1999. Selected Papers from the Tenth CLIN Meeting, pp. 73-83
[ps]
Comparing bagging
and boosting for natural language processing tasks: a typicality
approach
Author(s): Véronique Hoste and Walter Daelemans
Reference: In: Ad Feelders (Ed.), Proceedings of the Tenth
Belgian-Dutch Conference on Machine Learning (Benelearn
2000), pp. 101-108
[ps]
Meta-learning for phonemic annotation of corpora
Author(s): Véronique Hoste, Walter Daelemans, Erik Tjong Kim Sang, and Steven Gillis
Reference: In: Proceedings of ICML-2000, Stanford University,
CA, USA, pp. 375-382
[ps]
A rule induction approach to modeling regional pronunciation variation
Author(s): Véronique Hoste, Walter Daelemans, and Steven Gillis
Reference: In: Proceedings of COLING 2000, Saarbrücken, Germany. San Francisco: Morgan Kaufman Publishers, 2000, pp. 327-333
[ps]
Genetic algorithms for feature relevance assignment in memory-based language
processing
Author(s): Anne Kool, Walter Daelemans, and Jakub Zavrel
Reference: In: Proceedings of CoNLL-2000.
[ps]
Simultaneous feature
selection and parameter optimization for memory-based natural language
processing.
Author(s): Anne Kool, Jakub Zavrel, and Walter Daelemans
Reference: In: Ad Feelders (Ed.), Proceedings of BENELEARN 2000, Tilburg, pp. 93-100, 2000
[ps]
Applying System Combination to Base Noun Phrase Identification
Author(s): Erik Tjong Kim Sang, Walter Daelemans, Hervé
Déjean, Rob Koeling, Yuval Krymolowski, Vasin Punyakanok and
Dan Roth
Reference: In: Proceedings of COLING 2000, Saarbrücken, Germany.
[ps]
Hij drinkt niet altijd "t" en ik drink er soms wél: Bronnen
van hardnekkige werkwoordfouten in het Nederlands
Author(s): D. Sandra, S. Frisson, G. Durieux, W. Daelemans, and
S. Gillis
Reference: In: S. Gillis, J. Nuyts, and J. Taeldeman (Eds.), Met taal om de tuin
geleid. Wilrijk: Universitaire Instelling Antwerpen
[ps]
Zelflerende systemen als instrument voor de taalkunde en de taaltechnologie
Author(s): Walter Daelemans, Guy De Pauw, Gert Durieux, Steven Gillis, Véronique Hoste, and Erik Tjong Kim Sang
Reference: In: S. Gillis, J. Nuyts, and
J. Taeldeman (Eds.), Met taal om de tuin geleid. Wilrijk:
Universitaire Instelling Antwerpen
[ps]
The role of algorithm bias
vs information source in learning algorithms for morphosyntactic
disambiguation
Author(s): Guy De Pauw and Walter Daelemans
Reference: In: Proceedings of the Fourth Conference on Computational
Language Learning (CoNLL-2000), Lissabon, Portugal, pp. 19-24
[ps]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1999
Forgetting exceptions is harmful in language learning
Recent Advances in Memory-Based Part-of-Speech Tagging
Interpreting knowledge representations in BP-SOM
Memory-Based Text Chunking
Author(s): Jorn Veenstra.
Reference: to appear in: Proceedings of ACAI, Chania Greece, 1999.
[ps]
Toward an exemplar-based computational model for cognitive grammar
Author(s): Walter Daelemans.
Reference: In Johan van der Auwera, Frank Durieux, and Ludo
Lejeune (Eds.) English as a Human Language. To honour Louis
Goossens. Munchen: LINCOM Europa, 73-82, 1998.
[abs, ps]
Memory-Based Shallow Parsing
Cascaded Grammatical Relation Assignment
Memory-based morphological analysis
Author(s): Antal van den Bosch and Walter Daelemans.
Reference: In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, ACL'99, University of Maryland, USA, June 20-26, 1999, pp. 285-292.
[abs, pdf]
Instance-family abstraction in memory-based language learning
Author(s): Antal van den Bosch.
Reference: In: I. Bratko and S. Dzeroski (Eds.), Machine Learning: Proceedings of the Sixteenth International Conference, ICML'99, Bled, Slovenia, June 27-30, 1999, pp. 39-48.
[abs, ps]
Machine learning of word pronunciation: the case against
abstraction
Memory-based language processing
Careful abstraction from instance families in memory-based language
learning
Machine Learning Approaches
Author(s): Walter Daelemans.
Reference: In Hans van Halteren (Ed.)
Syntactic Wordclass Tagging.
Kluwer Academic Publishers, 285-304, 1999.
[ps]
On the Arbitrariness of Lexical Categories
Author(s): Gert Durieux, Walter Daelemans and Steven Gillis
Reference: Van Eynde et al. (eds.) Computational Linguistics in the Netherlands 1998
Amsterdam: Rodopi, pp. 19-36, 1999.
[ps]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1998
Modularity in inductively-learned word pronunciation systems
Do not forget: Full memory in memory-based learning of word pronunciation
Rapid development of NLP modules with memory-based learning
Author(s): Walter Daelemans, Antal van den Bosch, Jakub Zavrel,
Jorn Veenstra, Sabine Buchholz, and Bertjan Busser.
Reference: In Proceedings of ELSNET in Wonderland, pp.
105-113. Utrecht: ELSNET, 1998. Also in R. Basili and M.T. Pazienza (Eds.),
ECML-98 TANLPS Workshop Notes, Technische Universitaet Chemnitz,
1998, pp. 1-17.
[abs,ps]
Interpretable neural networks with BP-SOM
Toward inductive lexicons: a case study
Author(s): Walter Daelemans, Gert Durieux, and Antal van den Bosch.
Reference: In: P. Velardi (ed.), Proceedings LREC Workshop
on Adapting Lexical and Corpus Resources to Sublanguages and Applications,
Granada, Spain, pp. 29-35, 1998.
[abs,ps]
A connectionist model for bootstrap learning of syllabic structure
Author(s): Jean Vroomen, Antal van den Bosch, and Beatrice De Gelder.
Reference: Language and Cognitive Processes, Special
issue on Language Acquisition and Connectionionism (Ed. K. Plunkett), 13:2/3,
pp. 193-220.
Fast NP chunking using memory-based learning techniques
Author(s) Jorn Veenstra.
Reference: In
F. Verdenius and W. van den Broek (Eds), Proceedings of Benelearn
1998, Wageningen, the Netherlands, pp. 71-79, 1998.
[abs,ps] .
Improving data driven wordclass tagging by system combination
Distinguishing complements from adjuncts using memory-based learning
Author(s): Sabine Buchholz.
Reference: In B. Keller (Ed.), Proceedings of the
ESSLLI-98 Workshop on Automated Acquisition of Syntax and Parsing,
pp.41-48.
[abs,ps]
TreeTalk-D: a machine learning approach to Dutch word
pronunciation
Author(s): Bertjan Busser.
Reference: In P. Sojka, V. Matousek, K. Pala, and I. Kopecek
(Eds.) (1998) Proceedings TSD Conference, pp. 3-8, Masaryk University,
Czech Republic.
[abs,ps,demo]
Unsupervised learning of subcategorisation information and its
application in a parsing subtask
Author(s): Sabine Buchholz.
Reference: In H. La Poutre and H.J. van den Herik (Eds.) (1998),
Proceedings of the Tenth Netherlands/Belgium Conference on Artificial
Intelligence (NAIC'98), CWI, Amsterdam, pp. 7-16.
[abs,ps]
A connectionist model for bootstrap learning of syllabic structure
Author(s): Jean Vroomen, Antal van den Bosch, and Beatrice De Gelder.
Reference: Language and Cognitive Processes, Special
issue on Language Acquisition and Connectionionism (Ed. K. Plunkett), 13:2/3,
pp. 193-220.
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1997
IGTree: Using Trees for Compression and Classification in Lazy Learning
Algorithms
Memory-Based Learning: Using Similarity for Smoothing
Resolving PP Attachment Ambiguities with Memory-Based Learning
Author(s): Jakub Zavrel, Walter Daelemans and Jorn Veenstra
Reference: Proc. of the workshop on Computational Natural Language
Learning (CoNLL'97), edited by: Mark Ellison, Madrid, 11 July 1997
[abs,
ps]
TreeTalk-D: a Machine Learning Approach to Dutch Grapheme-to-Phoneme Conversion
Workshop Notes on Empirical Learning of Natural Language Processing Tasks.
Author(s): Walter Daelemans, Ton Weijters, and Antal van den
Bosch (eds.)
Reference: In M. van Someren and G. Widmer
(eds.) Machine Learning: ECML-97, Lecture Notes in Artificial
Intelligence 1224, Berlin: Springer, 337-344, 1997.
[electronic workshop notes]
Empirical Learning of Natural Language Processing Tasks.
Author(s): Walter Daelemans, Antal van den Bosch, and Ton Weijters.
Reference: M. van Someren and G. Widmer (eds.) Machine Learning:
ECML-97, Lecture Notes in Artificial Intelligence 1224, Berlin: Springer,
337-344, 1997.
[abs,
ps]
Data Mining as a Method for Linguistic Analysis: Dutch Diminutives
Author(s): Walter Daelemans, Peter Berck, and Steven Gillis.
Reference: Folia Linguistica , XXXI/1-2, 57-75, 1997.
[abs,
ps]
A feature-relevance heuristic for indexing and compressing large
case bases
Author(s): Walter Daelemans, Antal van den Bosch, and Jakub Zavrel.
Reference: M. van Someren and G. Widmer (eds.) 9th European
Conference on Machine Learning - Poster Papers. Prague: Laboratory of
Intelligent Systems, 29-38, 1997.
[abs,
ps]
Skousen's Analogical Modeling algorithm: A comparison with Lazy Learning
Author(s): Walter Daelemans, Steven Gillis, and Gert Durieux
Reference: D. Jones and H. Somers (eds.) New Methods in Language
Processing., London:
University College Press, 3-15, 1997.
[abs,
ps]
Learning to pronounce written words. A study in inductive language learning
Author(s): Antal van den Bosch.
Reference: Ph.D. Thesis, Universiteit Maastricht, The Netherlands.
Cadier en Keer: Phidippides, 1997.
[ps]
Automatic phonetic transcription of words based on sparse data
Author(s): Maria Wolters and Antal van den Bosch.
Reference: In Walter Daelemans, Antal van den Bosch, and Ton Weijters (Eds.), Workshop notes of ECML/MLnet Familiarization Workshop on
Empirical Learning of Natural Language Processing Tasks, April 1997,
Prague, Czech Republic, pp. 61-70, 1997.
[abs] [ps]
When small disjuncts abound, try lazy learning: A case study
Intelligible neural networks with BP-SOM
Avoiding overfitting with BP-SOM
Behavioural aspects of BP-SOM
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1996
Language-Independent Data-Oriented Grapheme-to-Phoneme Conversion
Abstraction Considered Harmful: Lazy Learning of Language Processing.
Author(s): Walter Daelemans
Reference: van den Herik, J. and T. Weijters (eds.) Benelearn-96.
Proceedings of the 6th Belgian-Dutch Conference on Machine Learning.
MATRIKS: Maastricht, The Netherlands, 3-12, 1996.
-
[ps]
Morphological Analysis as Classification: an Inductive-Learning Approach.
Unsupervised Discovery of Phonological Categories through Supervised Learning
of Morphological Rules
Author(s): Walter Daelemans, Peter Berck and Steven Gillis.
Reference: Proceedings of the 16th International Conference
on Computational Linguistics (COLING-96), Copenhagen, Denmark, 95-100,
1996.
-
[abs,
ps]
Artificial Intelligence Models of Language Processing
MBT: A Memory-Based Part of Speech Tagger-Generator
Author(s): Walter Daelemans, Jakub Zavrel, Peter Berck and Steven
Gillis.
Reference: E. Ejerhed and I. Dagan (eds.) Proceedings
of the Fourth Workshop on Very Large Corpora, Copenhagen, Denmark,
14-27, 1996.
-
[abs,
ps]
Stretching the limits of learning without modules
Author(s): Antal van den Bosch and Ton Weijters
Reference: In M. van der Heyden, J. Mrsic-Flögel
and K. Weigl (Eds.), Proceedings of the HELNET International
Workshop on Neural Networks, Vol. I/II, Amsterdam: VU University
Press, pp. 177-185.
An inductive-learning approach to morphological analysis
Author(s): Antal van den Bosch, Walter Daelemans, and Ton
Weijters Reference: In Durieux, G., Daelemans, W., and
Gillis, S. (Eds.), Papers from the Sixth Computational Linguistics
in the Netherlands Meeting, December 1996, University of Antwerp,
Belgium, pp. 213-230.
Avoiding overfitting in BP-SOM
Author(s): Ton Weijters, Antal van den Bosch, Eric Postma, and Jaap van den Herik
Reference: Avoiding overfitting in BP-SOM. In H.J. van den
Herik and A. Weij\-ters (Eds.), Proceedings of BENELEARN-96,
Maastricht, pp. 157-166.
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1995
A computational model of P&P: Dresher and Kaye (1990) revisited.
Author(s): Steven Gillis, Gert Durieux, Walter Daelemans.
Reference: M. Verrips & F. Wijnen (eds.) Approaches to
Parameter Setting. Amsterdam Studies in Child Language Development,
vol 4, 135-173, 1995.
-
[abs,
ps]
The profit of learning exceptions.
Scaling effects with greedy and lazy machine-learning
algorithms
Author(s): Antal van den Bosch, Ton Weijters, and Jaap van den Herik
Reference: In Proceedings of the Seventh Dutch Conference on
AI, NAIC-95, Erasmus University, Rotterdam, pp. 211-218.
Connectionism
Author(s): Ton Weijters and Antal van den Bosch
Reference: In Verschueren, J., Östman, J.O., and
Blommaert, J. (Eds.), Handbook of Pragmatics: Learning Manual,
pp. 165-171. Amsterdam: Benjamins.
Linguistics as data mining: Dutch diminutives
Memory-based lexical acquisition and processing.
Author(s): Walter Daelemans
Reference: P. Steffens (ed.) Machine Translation and the
Lexicon, Springer Lecture Notes in Artificial Intelligence 898, 85-98,
1995.
-
[abs,
ps]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1994
Measuring the Complexity of Writing Systems
Default inheritance in an object-oriented representation of linguistic
Categories.
Author(s): Walter Daelemans and Koen De Smedt
Reference: International Journal Human-Computer Studies
41, 149-177, 1994.
A language-independent, data-oriented architecture for grapheme-to-phoneme
conversion.
Are children 'lazy learners'? A comparison of natural and machine learning
of Stress.
Author(s): Steven Gillis, Walter Daelemans and Gert Durieux
Reference: Ram, A. and Eiselt, K. (eds.) Proceedings of the
Sixteenth Annual Conference of the Cognitive Science Society, Georgia
Institute of Technology, Atlanta, USA, Hillsdale: Lawrence Erlbaum Associates,
369-374, 1994.
-
[abs,
ps]
Skousen's Analogical modeling algorithm: a comparison with lazy learning
Author(s): Walter Daelemans, Steven Gillis and Gert Durieux.
Reference: Jones, D. (ed.) Proceedings of the International
Conference on New Methods in Language Processing (NeMLaP), UMIST: Manchester, 1-7, 1994.
-
[abs,
ps]
The acquisition of stress: a data-oriented approach.
Author(s): Walter Daelemans, Steven Gillis and Gert Durieux.
Reference: Computational Linguistics 20 (3), special
issue on Computational Phonology (Steven Bird guest ed.), 421-451, 1994.
-
[abs]
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1993
Learnability and markedness: Dutch stress assignment
Author(s): Steven Gillis, Walter Daelemans, Gert Durieux and Antal
van den Bosch.
Reference: Proceedings of the Fifteenth Annual Conference
of the Cognitive Science Society, Boulder Colorado, USA, Hillsdale:
Lawrence Erlbaum Associates, 452-457, 1993.
-
[abs,
ps]
Tabtalk: Reusability in data-oriented grapheme-to-phoneme conversion.
Data-oriented methods for grapheme-to-phoneme conversion
Learnability and markedness in data-driven acquisition of stress
Author(s): Walter Daelemans, Steven Gillis, Gert Durieux and Antal
van den Bosch.
Reference: T. Mark Ellison and James M. Scobbie (eds) Computational
Phonology. Edinburgh Working Papers in Cognitive Science 8, 1993, 157-178.
-
[abs,
ps]
A data-driven approach to stress acquisition
Author(s): Walter Daelemans, Antal van den Bosch, Steven
Gillis, and Gert Durieux Reference: In P. Adriaans (Ed.),
ECML-93 Workshop Notes on Machine Learning Techniques and Text
Analysis, Vienna, pp. 15-24.
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
1992
Generalization performance of backpropagation learning on a syllabification
task
Author(s): Walter Daelemans and Antal van den Bosch.
Reference: M.F.J. Drossaers and A. Nijholt (eds.) Connectionism
and Natural Language Processing. Proceedings Third Twente Workshop on Language
Technology, 27-38, 1992.
-
[abs,
ps]
A neural network for hyphenation
Author(s): Walter Daelemans and Antal van den Bosch
Reference: In: I. Aleksander and J. Taylor (Eds.), Artificial Neural Networks 2, Vol. 2. Amsterdam: North-Holland, pp. 1647-1650.
Linguistic pattern matching capabilities of connectionist networks
Author(s): Antal van den Bosch and Walter Daelemans
Reference: In: J. van Eijk and
W. Meyer Viol (Eds.), Proceedings of the Computational
Linguistics in the Netherlands meeting, CLIN-1991. Utrecht: OTS, pp. 40-53
(also appeared in W. Daelemans and D. Powers (Eds.), Proceedings
First SHOE Workshop, Tilburg: ITK, pp. 183-196.)
Exploring artificial learning algorithms: Learning to stress Dutch simplex words.
Author(s): Steven Gillis, Gert Durieux, Walter Daelemans, and Antal van den Bosch
Reference: Antwerp Papers in Linguistics 71.
|
|
|
|
|
|
|
| |
Last update: Tue Mar 9 2010
|
|
| |
|