liu.seSearch for publications in DiVA
ReferencesLink to record
Permanent link

Direct link
Semantic Analysis Of Multi Meaning Words Using Machine Learning And Knowledge Representation
2011 (English)Independent thesis Advanced level (degree of Master (Two Years)), 30 credits / 45 HE creditsStudentuppsats (Examensarbete)
Abstract [en]

The present thesis addresses machine learning in a domain of naturallanguage phrases that are names of universities. It describes two approaches to this problem and a software implementation that has made it possible to evaluate them and to compare them.

In general terms, the system's task is to learn to 'understand' the significance of the various components of a university name, such as the city or region where the university is located, the scienti c disciplines that are studied there, or the name of a famous person which may be part of the university name. A concrete test for whether the system has acquired this understanding is when it is able to compose a plausible university name given some components that should occur in the name.

In order to achieve this capability, our system learns the structure of available names of some universities in a given data set, i.e. it acquires a grammar for the microlanguage of university names. One of the challenges is that the system may encounter ambiguities due to multi meaning words. This problem is addressed using a small ontology that is created during the training phase.

Both domain knowledge and grammatical knowledge is represented using decision trees, which is an ecient method for concept learning. Besides for inductive inference, their role is to partition the data set into a hierarchical structure which is used for resolving ambiguities.

The present report also de nes some modi cations in the de nitions of parameters, for example a parameter for entropy, which enable the system to deal with cognitive uncertainties. Our method for automatic syntax acquisition, ADIOS, is an unsupervised learning method. This method is described and discussed here, including a report on the outcome of the tests using our data set.

The software that has been implemented and used in this project has been implemented in C.

Place, publisher, year, pages
2011. 74 p.
Keyword [en]
Machine Learning, Supervised Learning, Unsupervised Learning
National Category
Computer Science
Identifiers
urn:nbn:se:liu:diva-70086 (URN)LiU/IDA-EX-A- -11/011- -SE (ISRN)oai:DiVA.org:liu-70086 (OAI)
Subject / course
Master's programme in Computer Science
Presentation
2011-04-04, 11:44 (English)
Uppsok
Technology
Supervisors
Examiners
Available from2011-08-25 Created:2011-08-18 Last updated:2011-08-25Bibliographically approved

Open Access in DiVA

fulltext(10643 kB)140 downloads
File information
File name FULLTEXT01.pdfFile size 10643 kBChecksum SHA-512
485427ed7bebac1e7bd833ae76e733317a65312cb25d5233652046461d0f0830d76ac98f07250ffbdc316f9d5437143a817353a29ea108c564bbf4a774fa1181
Typ fulltextMimetype application/pdf

Search in DiVA

By author/editor
Alirezaie, Marjan
By organisation
Department of Computer and Information Science
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Totalt: 140 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available
Totalt: 159 hits
ReferencesLink to record
Permanent link

Direct link