bla bla bla bla bla
Roser Morante - academic web page
Links biomedical text mining

Home
Home

Research
Research

Publications
Publications

Activities
Activities

Links
Links

Links
Links biomedical text mining



General NER Taggers Parsers PPI Annotation
Search Abbreviations Corpora Databases Research Literature
Data Integration Event Extraction Knowledge discovery

Molecular Biology of the Cell by Bruce Alberts, Alexander Johnson, Julian Lewis, Martin Raff, Keith Roberts, and Peter Walter, 2002

Artificial Intelligence and Molecular Biology Edited by Lawrence Hunter, 1993


General Resources

Bio-NLP tools linked in the Biocreative web page

Compendium of BioNLP Resources by Martin Krallinger

Corpora for biomedical NLP, bookmarks by Kevin Cohen, Biomedical Text Mining Group, Center for Computational Pharmacology University of Colorado Health Sciences Center

BioNLP Resources by Alex Morgan

BioNLP Benchmarks by Jörg Hakenberg

Resources for Biomedical Terminology and Ontology by Mark A. Mandel

BioNLP UIMA Component Repository


Named Entity Taggers

BANNER CRF based, developed by Arizona State University, Dept. of Computer Science and Dept. of Biomedical Informatics

ABNER CRF based, by Burr Settles, Department of Computer Sciences, University of Wisconsin-Madison

LINGPIPE Information extraction and data mining tools, developed by Alias-i, Inc.

JNET JULIE Lab Named Entity Tagger

MetaMap Portal, program developed by Dr. Alan Aronson at the National Library of Medicine (NLM) to map biomedical text to the UMLS Metathesaurus

OrganismTagger Organism Tagger by Semantic Software Lab


Tokenizers - taggers

GENIA tagger Tagger developed at the Tsujii Laboratory of the University of Tokyo, Japan

SPECIALIST Text Tools developed by the Lexical Systems Group, NLM, NIH, USA


Parsers

David McClosky's self trained biomedical parser, Brown University, USA

ENJU Syntactic parser developed at the Tsujii Laboratory of the University of Tokyo, Japan

LRDEP A dpendency parser developed by Kenji Sagae at the Tsujii Laboratory of the University of Tokyo, Japan

Parsed MEDLINE developed by the Tsujii Laboratory of the University of Tokyo, Japan


Other tools

BioSimplify, Sentence simplification for biomedical texts, Department of Biomedical Informatics, Arizona State University


Data Integration

Hanalyzer: a 3R System. Center for Computational Pharmacology, University of Colorado

Arrowsmith: linking documents, disciplines, investigators and databases, University of Illinois at Chicago

PPI - Information Extraction systems

PPI-learning with all-dependency-paths kernel by Bioinformatics Laboratory, Turku Center for Computer Science, University of Turku, Finland

AKANE++PPI Information extraction system for PPI developed at the Tsujii Laboratory of the University of Tokyo, Japan


Event Extraction systems / Semantic role labelers

Stanford Biomedical Event Parser (SBEP) by Bioinformatics Laboratory, The Stanford Natural Language Processing Group, USA

BioKIT semantic role labeler by the NUS Natural Language Processing Group

Annotation Tools

KNOWTATOR General-purpose text annotation tool, CCP Center for Computational Pharmacology, University of Colorado Health Sciences, USA

MMAX2 General-purpose text annotation tool, EML Research gGmbH, Germany

XConc SuiteXML based tools for corpus annotation, Tsujii Lab, Japan

CALLISTO General-purpose text annotation tool, Mitre Corporation, USA


Search Tools

PolySearch Extracts relations between two types of entities. University of Alberta, Canada.

Entrez. Web application that finds information about biomedical entities in heterogeneous databases. NCBI, USA.

CoPub. Detection of co-occurring biomedical concepts in abstracts. CDD, Center of Molecular and Biomolecular Informatics, Nijmegen, The Netherlands

iHOP. Retrieving collections of co-mentioned proteins. CNB-CSIC, Madrid, Spain

eUtils. Tools that provide access to Entrez data. NCBI, USA

MiMIr2. Search of protein interactions. NIH Natioanl Center for Integrative Biomedical Informatics, USA

GIN. System for browsing articles and molecule interaction information. CLAIR Group, University of Michigan, USA

MedEvi Search engine for biomedical concepts. EMBL-EBI, Cambridge, UK

GoPubMed. Shows the most qualifying concepts in GO and MeSH for a search. Technische Universitaet Dresden, Germany

EBIMed. Web application that finds associations between biomedical entities. EMBL-European Bioinformatics Institute, UK.

MedGene. Web application that retrieves biomedical relationships based on the co-citations of all Medline records. Harvard University, USA.

LitMiner. Literature mining tool based on co-citations. Helmholtz Zentrum, Munich, Germany.

Ali Baba. Tool that visualizes the result of searching biomedical relations in abstracts. Humboldt University, Germany.

BioText. Search engine. University of California, Berkeley, USA.

MedstractPlus. Mines relations from Medline. Brandeis University, Waltham, USA.


Knowledge discovery

Bitola. Biomedical discovery support system.


Abbreviation - acronym finders

ARGH Biomedical Acronym Resolver provided by the eTBLAST team, UT Southwestern, USA

JACRO JULIE Lab Acronym Annotator


Corpora

Corpora linked in the Biocreative web page

Becorpus Biocaster Event Corpus and Tools by Nigel Collier

Links by Martin Krallinger, Universidad Autónoma de Mdrid, Spain

ART Corpus Aberystwyth University, UK

PennBioIE CYP LDC

PennBioIE Oncology LDC

BioScope University of Szeged, Hungary

GENIA University of Tokyo, Japan

GENIA EVENT University of Tokyo, Japan

DEP GENIA Institute of Computational Linguistics at the University of Zurich, Switzerland. The GENIA corpus parsed with the Pro3Gres parser

GENIA TreeBank 1.0 University of Tokyo, Japan

GREC corpus, NaCTeM.

BioInfer Bio Information Extraction Resource, University of Turku, Finland

PPI corpora, University of Turku, Finland

PASBio, predicate-argument structures, National Institute of Informatics, Tokyo, Japan

Parsed MEDLINE developed by the Tsujii Laboratory of the University of Tokyo, Japan

Hedge classification dataset , by B. Medlock, NLIP Group, University of Cambridge, UK

AIMED PPI corpus , by R. Bunescu et al. 2005, Department of Computer Science, University of Texas, USA

Yapex Training and test data for the protein tagger Yapex.

Linnaeus corpus of species names.

Corpora for NLPby Jörg Hakenberg.

Meta-knowledge GENIA corpusby NACTEM


Databases

OMIM Compendium of human genes and genetic phenotypes, Johns Hopkins University School of Medicine

Links to biomedical databases from the Health Sciences Library, University of Buffalo

Links to biomedical databases from Israel Science and Technology

Links to Molecular Biology Databases from the Introduction to Bioinformatics Course by L. Hunter

UMLS Unified Medical Language System

MeSH Medical Subject Headings, U.S. National Library of Medicine

MeSH HUGO Gene Nomenclature Committee


Research groups/centers - projects

HITRL, Sydney, Australia

NICTA, Melbourne, Australia

CLAIR, Department of Electrial Engineering and Computer Science, University of Michigan, USA

Tsujii Laboratory, Graduate School of Information Science and Technology, The University of Tokyo, Japan

GRIB, Research Unit on Biomedical Informatics, Barcelona, Spain

Bioinformatics Laboratory, Turku Center for Computer Science, University of Turku, Finland

Biomedical Text Mining Group, Center for Computational Pharmacology University of Colorado Health Sciences Center, USA

BioText University of Berkeley, USA

NaCTeM National Centre for Text Mining, School of Computer Science, University of Manchester, UK

Biomine Department of Computer Science, University of Helsinki, Finland

Mining the Bibliome Based at the IRCS, University of Pennsylvania, USA

Lexical Systems Group located within the Cognitive Science Branch of the Lister Hill Center for Biomedical Communications, USA

BioSemantics Based at the Center for Human and Clinical Genetics of the Leiden University Medical Center, The Netherlands

Projects

BioCaster

BioDiscourseRelation

OntoGene Institute of Computational Linguistics, University of Zurich, Switzerland

PASBio


Literature

Selected Bibliography for Text Mining for Biomedicine by Sophia Ananiadou

BLIMP Biomedical LIterature (and text) Mining Publications, Queen's University, Canada

Researchers

Nigel Collier