Semantic Analysis Of Multi Meaning Words Using Machine Learning And Knowledge Representation
Linköping University, Department of Computer and Information Science.
2011 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 30 credits / 45 HE credits. Student thesis
Abstract [en]

The present thesis addresses machine learning in a domain of natural-language phrases that are names of universities. It describes two approaches to this problem and a software implementation that made it possible to evaluate and compare them.

In general terms, the system's task is to learn to 'understand' the significance of the various components of a university name, such as the city or region where the university is located, the scientific disciplines that are studied there, or the name of a famous person that may be part of the university name. A concrete test of whether the system has acquired this understanding is whether it can compose a plausible university name given some components that should occur in the name.

To achieve this capability, our system learns the structure of the university names available in a given data set, i.e. it acquires a grammar for the microlanguage of university names. One of the challenges is that the system may encounter ambiguities due to multi-meaning words. This problem is addressed using a small ontology that is created during the training phase.
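As a rough illustration of what acquiring a grammar for university names and then composing a name could look like, the following minimal sketch counts slot patterns in a few labelled names and fills the most frequent matching pattern. It is not the thesis's implementation (which is written in C): the slot labels (`CITY`, `PERSON`, `FIELD`, `HEAD`, `OF`), the training names, and the functions `learn_patterns` and `compose` are all assumptions made for this example, and the ontology-based ambiguity resolution is not modelled.

```python
from collections import Counter

# Hypothetical training data: each name is a list of (token, slot) pairs.
TRAINING = [
    [("Linköping", "CITY"), ("University", "HEAD")],
    [("Uppsala", "CITY"), ("University", "HEAD")],
    [("University", "HEAD"), ("of", "OF"), ("Oslo", "CITY")],
    [("Chalmers", "PERSON"), ("University", "HEAD"),
     ("of", "OF"), ("Technology", "FIELD")],
]

# Slots that are always filled by the same word (an assumption of this sketch).
FIXED = {"HEAD": "University", "OF": "of"}


def learn_patterns(examples):
    """Count how often each slot sequence occurs in the training names."""
    return Counter(tuple(slot for _, slot in name) for name in examples)


def compose(patterns, components):
    """Fill the most frequent pattern whose open slots match the components.

    `components` maps slot labels to words, e.g. {"CITY": "Lund"}.
    Returns None when no learned pattern uses exactly those slots.
    """
    for pattern, _count in patterns.most_common():
        open_slots = sorted(s for s in pattern if s not in FIXED)
        if open_slots == sorted(components):
            return " ".join(FIXED.get(s) or components[s] for s in pattern)
    return None


patterns = learn_patterns(TRAINING)
print(compose(patterns, {"CITY": "Lund"}))  # prints "Lund University"
```

The pattern counts stand in for a learned grammar; a real system would also have to decide which slot an unseen or ambiguous word belongs to, which is the part the abstract assigns to the ontology.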

Both domain knowledge and grammatical knowledge are represented using decision trees, which are an efficient method for concept learning. Besides inductive inference, their role is to partition the data set into a hierarchical structure which is used for resolving ambiguities.
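Decision-tree learners commonly choose the attribute to split on by information gain, which is based on Shannon entropy. The sketch below shows that standard calculation only; it is not the modified entropy parameter that the thesis defines, and the attribute names (`position`, `slot`) and the sample rows are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(rows, attribute, target):
    """Reduction in entropy of `target` after partitioning rows by `attribute`."""
    base = entropy([row[target] for row in rows])
    groups = {}
    for row in rows:
        groups.setdefault(row[attribute], []).append(row[target])
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return base - remainder


# Hypothetical rows: where a token occurs in the name, and the slot it fills.
ROWS = [
    {"position": "first", "slot": "CITY"},
    {"position": "first", "slot": "CITY"},
    {"position": "last", "slot": "FIELD"},
    {"position": "last", "slot": "FIELD"},
]

print(information_gain(ROWS, "position", "slot"))
```

In this toy data the `position` attribute separates the two slots perfectly, so splitting on it removes all uncertainty about the slot; a tree builder would pick the attribute with the highest gain at each node.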

The present report also defines some modifications to the definitions of parameters, for example a parameter for entropy, which enable the system to deal with cognitive uncertainties. Our method for automatic syntax acquisition, ADIOS, is an unsupervised learning method. This method is described and discussed here, including a report on the outcome of the tests using our data set.

The software used in this project was implemented in C.

Place, publisher, year, edition, pages
2011. 74 p.
Keyword [en]
Machine Learning, Supervised Learning, Unsupervised Learning
National Category
Computer Science
URN: urn:nbn:se:liu:diva-70086
ISRN: LiU/IDA-EX-A- -11/011- -SE
OAI: diva2:435402
Subject / course
Master's programme in Computer Science
2011-04-04, 11:44 (English)
Available from: 2011-08-25. Created: 2011-08-18. Last updated: 2011-08-25. Bibliographically approved.
