liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Corpus construction based on Ontological domain knowledge
Linköping University, Department of Computer and Information Science.
Linköping University, Department of Computer and Information Science.
2011 (English)Independent thesis Advanced level (degree of Master (Two Years)), 30 credits / 45 HE creditsStudent thesis
Abstract [en]

The purpose of this thesis is to contribute a corpus for sentence level interpretation of biomedical language. The available corpora for the biomedical domain are small in terms of amount of text and predicates. Besides that these corpora are developed rather intuitively. In this effort which we call BioOntoFN, we created a corpus from the domain knowledge provided by an ontology. By doing this we believe that we can provide a rough set of rules to create corpora from ontologies. Besides that we also designed an annotation tool specifically for building our corpus. We built a corpus for biological transport events. The ontology we used is the piece of Gene Ontology pertaining to transport, the term transport GO: 0006810 and all of its child concepts, which could be called a sub-ontology. The annotation of the corpus follows the rules of FrameNet and the output is annotated text that is in an XML format similar to that of FrameNet. The text for the corpus is taken from abstracts of MEDLINE articles. The annotation tool is a GUI created using Java.

Place, publisher, year, edition, pages
2011. , 62 p.
Keyword [en]
Text mining, Biomedical text mining, Natural Language Processing
National Category
Medical Engineering
Identifiers
URN: urn:nbn:se:liu:diva-71851ISRN: LITH-IDA-EX-2011/044-SEOAI: oai:DiVA.org:liu-71851DiVA: diva2:454618
Subject / course
Computer and information science at the Institute of Technology
Presentation
Charles Babbage, B-Huset, Linkoping (English)
Uppsok
Technology
Supervisors
Examiners
Available from: 2011-11-09 Created: 2011-11-07 Last updated: 2011-11-09Bibliographically approved

Open Access in DiVA

BioOntoFN(1395 kB)326 downloads
File information
File name FULLTEXT01.pdfFile size 1395 kBChecksum SHA-512
658779a63a095b98f65d0aa121b89001802c6096bc22df796948bf138212929764dde7c8977266857a8e0758169e91d25f79e1460afe2b9e9e774314046ee510
Type fulltextMimetype application/pdf

By organisation
Department of Computer and Information Science
Medical Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 326 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 285 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf