Knowledge-based Gene Symbol Disambiguation
2008 (English)In: Second International Workshop on Data and Text Mining in Bioinformatics,2008, New York: ACM , 2008, 73-76 p.Conference paper (Refereed)
Since there is no standard naming convention for genes and gene products, gene symbol disambiguation (GSD) has become a big challenge when mining biomedical literature. Several GSD methods have been proposed based on Medline references to genes. However, nowadays gene databases, e.g. Entrez Gene, provide plenty of information about genes, and many biomedical ontologies, e.g. UMLS Metathesaurus and Semantic Network, have been developed. These knowledge sources could be used for disambiguation, in this paper we propose a method which relies on information about gene candidates from gene databases, contexts of gene symbols and biomedical ontologies. We implement our method, and evaluate the performance of the implementation using BioCreAtIvE II data sets.
Place, publisher, year, edition, pages
New York: ACM , 2008. 73-76 p.
IdentifiersURN: urn:nbn:se:liu:diva-42969DOI: 10.1145/1458449.1458466Local ID: 70300ISBN: 978-1-60558-251-1OAI: oai:DiVA.org:liu-42969DiVA: diva2:263826
Second International Workshop on Data and Text Mining in Bioinformatics,2008