| login | register |
| C | r | i | t | i | c | a | l | A | s | s | e | s | s | m | e | n | t | o | f | I | n | f | o | r | |||
| m | a | t | i | o | n | E | x | t | r | a | c | t | i | o | n | i | n | B | i | o | l | o | g | y |
| News | About | Events | Tasks | Resources |
BioCreative II.5BioCreative II.5 workshop recordings (Events) [2010-01-14]Various talk recordings from the BioCreative II.5 workshop held from Oct 7-9, 2009, in Madrid. Note that to view these files, you need a special viewer (Windows) or Quicktime 7 (no later!) plugin (OS X), which you need to download from here and install, too.
Downloads
CorporaBioCreative II.5 Elsevier corpus (Resources) [2009-12-18]We are pleased to announce that Elsevier B.V. has granted us the privilege of providing the corpus of FEBS Letters articles used during BioCreative II.5 to the scientific community. The official announcement of the corpus' availability was published on the FEBS homepage. The corpus contains 1190 articles, mostly from 2007 and 2008, both in machine-readable XML format and the UTF-8 special format used during the challenge, as distributed via the BioCreative Meta-Server. All annotations (i.e., the gold standard) used during BioCreative II.5 are contained within the package. Additionally, an archive containing all UniProt 15.0 accession-taxonomic ID mappings as well as a list of clusters of homonym ortholog proteins in UniProt 15.0 can be downloaded here. The clusters were established from UniRef50 r15.0 clusters, intersected with all clusters extracted by using case-insensitive matching of UniProt names (all names available per record, but excluding one-letter names and purely numerical "names"). The taxonomy mapping file can be used in conjunctions with the evaluation library, while the homonym ortholog clusters are provided as reference (limited clusters relevant for each the training and test set only are provided directly through the corpus). Corpus overview:
We would like to express our gratitude to Elsevier for granting us the rights to keep providing this significant collection of articles and to the MINT database curators for contributing the annotations. DownloadsBioCreative II.5Evaluation library (Resources) [2009-12-17]This is the final version of the BioCreative evaluation library including a command line tool to use it; current version: 2.0a1. This is the first release candidate and it is possible that you might encounter a bug or that some functionality still will be improved after the initial feedback. If you have reason to believe that there is a problem with the tool or the library, or any other questions related to it, please contact the author, Florian Leitner. This library is used to evaluate the results of BioCreative II.5 with regard to the official BC II.5 evaluation function. The evaluation score is calculated from the AUC (area under curve) of the interpolated precision/recall (iP/R) curve, macro-averaged for IPT ant INT results. The library provides various additional performance calculations which can be generated through the command line tool (see below and the tool's help and documention). In addition, if you wish to use the library directly, please consult the inline documentation. You will need to have a working version of Python 2.5 (or 2.6, 2.7) installed to use this package. It imports only on the standard libraries part of any Python base distribution as long as you do not want to use the plotting functionality. In this case, you need to install matplotlib, too. To run the evaluation after installing the library (see the included README.txt file), you can call it from the command line:
The The tool allows you to explore your results in more detail than just the official evaluation function. By default, it gives you a detailed overview of evaluation results, including recall, precision, and F-score of your data, and all values are reported both micro-, and macro-averaged (the official evaluation function is the macro-averaged AUC iP/R score), except for the ACT task, where there is no macro/micro-averaging, but instead provides calculations for specificity, sensitivity, accuracy, and Matthew's Correlation Coefficient in addition to the AUC iP/R score. The main arguments when using the library with the command line tool (bc-evaluate) are:
You can download and install the ready made source packages for all operating systems. Please have a look at the README file for instructions on how to install this library. DownloadsBioCreative IIIAnnouncement (Events) [2009-12-08]The 3rd Critical Assessment for Information Extraction in Biology challenge, BioCreative III is a community-wide effort for evaluating text mining and information extraction systems applied to the biomedical domain. The BioCreative III workshop, to be held in September 2010, will bring together stakeholders from the biocuration community with researchers from text mining and natural language processing applied to the biomedical literature. BioCreative III will have three tasks:
BackgroundBioCreative arose out the needs of working biologists, biological curators and bioinformaticians to access the wealth of information in the literature, and to link this information to biological databases, using standard ontologies and controlled vocabularies. BioCreative focuses on comparison of methods and community assessment of scientific progress. Previous BioCreative challenges have attracted considerable interest not only in the bio text mining community, but also in the bioinformatics and biological database domains, resulting in two special journal issues and useful data resources for the development of biomedical text mining systems [1][2]. BioCreative has been organized through collaborations between text mining groups, biological database curators and bioinformatics researchers. BioCreative III (2010) and BioCreative IV (2012) will be funded in part by the US National Science Foundation, with an explicit focus on developing (interactive) applications to meet the needs of end users, especially curators. BioCreative III Structure and TimetableBioCreative III will begin in January 2010 and will culminate in the BioCreative III workshop, September 13-15, 2010 in Bethesda, Maryland, USA. It will consist of three tracks:
References
BioCreative II.5
Workshop group foto (Resources) [2009-10-30]The group foto taken of all participants during the workshop infront of the CNIO.
Downloads |
|
|
Content © 2008
CNIO
|