liu.seSearch for publications in DiVA
Change search
Link to record
Permanent link

Direct link
BETA
Publications (10 of 80) Show all publications
Santini, M., Strandqvist, W., Nyström, M., Alirezai, M. & Jönsson, A. (2018). Can We Quantify Domainhood?: Exploring Measures to Assess Domain-Specificity in Web Corpora. In: Elloumi M. et al. (Ed.), Communications in Computer and Information Science, vol 903. Springer, Cham: . Paper presented at Database and Expert Systems Applications. DEXA 2018.. , 903
Open this publication in new window or tab >>Can We Quantify Domainhood?: Exploring Measures to Assess Domain-Specificity in Web Corpora
Show others...
2018 (English)In: Communications in Computer and Information Science, vol 903. Springer, Cham / [ed] Elloumi M. et al., 2018, Vol. 903Conference paper, Published paper (Refereed)
Abstract [en]

Web corpora are a cornerstone of modern Language Technology. Corpora built from the web are convenient because their creation is fast and inexpensive. Several studies have been carried out to assess the representativeness of general-purpose web corpora by comparing them to traditional corpora. Less attention has been paid to assess the representativeness of specialized or domain-specific web corpora. In this paper, we focus on the assessment of domain representativeness of web corpora and we claim that it is possible to assess the degree of domain-specificity, or domainhood, of web corpora. We present a case study where we explore the effectiveness of different measures - namely the Mann-Withney-Wilcoxon Test, Kendall correlation coefficient, Kullback–Leibler divergence, log-likelihood and burstiness - to gauge domainhood. Our findings indicate that burstiness is the most suitable measure to single out domain-specific words from a specialized corpus and to allow for the quantification of domainhood.

Series
Communications in Computer and Information Science, ISSN 1865-0929, E-ISSN 1865-0937 ; 903
National Category
Language Technology (Computational Linguistics)
Identifiers
urn:nbn:se:liu:diva-151423 (URN)10.1007/978-3-319-99133-7_17 (DOI)978-3-319-99132-0 (ISBN)978-3-319-99133-7 (ISBN)
Conference
Database and Expert Systems Applications. DEXA 2018.
Available from: 2018-09-20 Created: 2018-09-20 Last updated: 2018-09-25
Santini, M., Jönsson, A., Nyström, M. & Alireza, M. (2017). A Web Corpus for eCare: Collection, Lay Annotation and Learning - First Results. In: : . Paper presented at 2nd International Workshop on Language Technologies and Applications (LTA'17), Prague, Czech Republic, 3-6 September, 2017.
Open this publication in new window or tab >>A Web Corpus for eCare: Collection, Lay Annotation and Learning - First Results
2017 (English)Conference paper, Published paper (Refereed)
Abstract [en]

In this position paper, we put forward two claims: 1) it is possible to design a dynamic and extensible corpus without running the risk of getting into scalability problems; 2) it is possible to devise noise-resistant Language Technology applications without affecting performance. To support our claims, we describe the design, construction and limitations of a very specialized medical web corpus, called eCare_Sv_01, and we present two experiments on lay-specialized text classification. eCare_Sv_01 is a small corpus of web documents written in Swedish. The corpus contains documents about chronic diseases. The sublanguage used in each document has been labelled as “lay” or “specialized” by a lay annotator. The corpus is designed as a flexible text resource, where additional medical documents will be appended over time. Experiments show that the lay-specialized labels assigned by the lay annotator are reliably learned by standard classifiers. More specifically, Experiment 1 shows that scalability is not an issue when increasing the size of the datasets to be learned from 156 up to 801 documents. Experiment 2 shows that lay-specialized labels can be learned regardless of the large amount of disturbing factors, such as machine translated documents or low-quality texts that are numerous in the corpus

National Category
Language Technology (Computational Linguistics)
Identifiers
urn:nbn:se:liu:diva-141054 (URN)
Conference
2nd International Workshop on Language Technologies and Applications (LTA'17), Prague, Czech Republic, 3-6 September, 2017
Available from: 2017-09-21 Created: 2017-09-21 Last updated: 2018-01-13Bibliographically approved
Johansson, R. & Jönsson, A. (2017). Consider Clojure: A modern Lisp that runs on Java and Javascript hosts. In: : . Paper presented at PROCEEDINGS OF THE 12’TH SWECOG CONFERENCE, GÖTEBORG, OCTOBER 6-7, 2016.
Open this publication in new window or tab >>Consider Clojure: A modern Lisp that runs on Java and Javascript hosts
2017 (English)Conference paper, Oral presentation only (Refereed)
Series
Studies in Informatics, ISSN 1653-2325 ; 2
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-141055 (URN)978-91-983667-0-9 (ISBN)
Conference
PROCEEDINGS OF THE 12’TH SWECOG CONFERENCE, GÖTEBORG, OCTOBER 6-7, 2016
Available from: 2017-09-21 Created: 2017-09-21 Last updated: 2018-01-13
Loutfi, A., Jönsson, A., Karlsson, L., Lind, L., Linden, M., Pecora, F. & Voigt, T. (2017). Ecare@Home: A Distributed Research Environment on Semantic Interoperabilit. In: Mobyen Uddid Ahmed, Shahina Begum, Wasim Raad (Ed.), Internet of Things Technologies for HealthCare: (pp. 3-8). Springer
Open this publication in new window or tab >>Ecare@Home: A Distributed Research Environment on Semantic Interoperabilit
Show others...
2017 (English)In: Internet of Things Technologies for HealthCare / [ed] Mobyen Uddid Ahmed, Shahina Begum, Wasim Raad, Springer, 2017, p. 3-8Chapter in book (Refereed)
Abstract [en]

This paper presents the motivation and challenges to developing semantic interoperability for an internet of things network that is used in the context of home based care. The paper describes a research environment which examines these challenges and illustrates the motivation through a scenario whereby a network of devices in the home is used to provide high-level information about elderly patients by leveraging from techniques in context awareness, automated reasoning, and configuration planning.

Place, publisher, year, edition, pages
Springer, 2017
Series
Institute for Computer Sciences, Social Informatics and Telecommunications Engineering. Lecture Notes, ISSN 1867-8211, E-ISSN 1867-822X
Keywords
Semantic interoperability, Configuration planning, Health and care, Internet of Things
National Category
Other Medical Engineering
Identifiers
urn:nbn:se:liu:diva-141052 (URN)10.1007/978-3-319-51234-1_1 (DOI)978-3-319-51233-4 (ISBN)978-3-319-51234-1 (ISBN)
Available from: 2017-09-21 Created: 2017-09-21 Last updated: 2017-09-28Bibliographically approved
Santini, M. & Jönsson, A. (2017). E-care@home: Towards a better communication between patients and doctors using Language Technology. In: : . Paper presented at Medicinteknikdagarna, Västerås, Sweden, October 10-11, 2017.
Open this publication in new window or tab >>E-care@home: Towards a better communication between patients and doctors using Language Technology
2017 (English)Conference paper, Oral presentation only (Refereed)
National Category
Language Technology (Computational Linguistics)
Identifiers
urn:nbn:se:liu:diva-141058 (URN)
Conference
Medicinteknikdagarna, Västerås, Sweden, October 10-11, 2017
Available from: 2017-09-21 Created: 2017-09-21 Last updated: 2018-01-13Bibliographically approved
Falkenjack, J., Rennes, E., Fahlborg, D., Johansson, V. & Jönsson, A. (2017). Services for text simplification and analysis. In: Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa: . Paper presented at 21st Nordic Conference on Computational Linguistics, NoDaLiDa, Wallenberg Conference Center, Gothenburg, Sweden, May 23-24, 2017.
Open this publication in new window or tab >>Services for text simplification and analysis
Show others...
2017 (Swedish)In: Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, 2017Conference paper, Published paper (Refereed)
National Category
Language Technology (Computational Linguistics)
Identifiers
urn:nbn:se:liu:diva-141053 (URN)978-91-7685-601-7 (ISBN)
Conference
21st Nordic Conference on Computational Linguistics, NoDaLiDa, Wallenberg Conference Center, Gothenburg, Sweden, May 23-24, 2017
Available from: 2017-09-21 Created: 2017-09-21 Last updated: 2018-01-13Bibliographically approved
Heimann Mühlenbock, K., Johansson Kokkinakis, S., Liberg, C., af Geijerstam, Å., Wiksten Folkeryd, J., Jönsson, A., . . . Falkenjack, J. (2015). A multivariate model for classifying texts’ readability. In: Beáta Megyesi (Ed.), Proceedings of the 20th Nordic Conference of Computational Linguistics: . Paper presented at The 20th Nordic Conference of Computational Linguistics, NODALIDA 2015 (pp. 257-261).
Open this publication in new window or tab >>A multivariate model for classifying texts’ readability
Show others...
2015 (English)In: Proceedings of the 20th Nordic Conference of Computational Linguistics / [ed] Beáta Megyesi, 2015, p. 257-261Conference paper, Published paper (Refereed)
Abstract [en]

We report on results from using the multivariate readability model SVIT to classify texts into various levels. We investigate how the language features integrated in the SVIT model can be transformed to values on known criteria like vocabulary, grammatical fluency and propositional knowledge. Such text criteria, sensitive to content, readability and genre in combination with the profile of a student’s reading ability form the base to individually adapted texts. The procedure of levelling texts into different stages of complexity is presented along with results from the first cycle of tests conducted on 8th grade students. The results show that SVIT can be used to classify texts into different complexity levels.

National Category
Language Technology (Computational Linguistics)
Identifiers
urn:nbn:se:liu:diva-117611 (URN)978-91-7519-098-3 (ISBN)
Conference
The 20th Nordic Conference of Computational Linguistics, NODALIDA 2015
Funder
Marcus and Amalia Wallenberg Foundation
Available from: 2015-05-06 Created: 2015-05-06 Last updated: 2018-01-11Bibliographically approved
Rennes, E. & Jönsson, A. (2015). A Tool for Automatic Simplification of Swedish Texts. In: Proceedings of the 20th Nordic Conference of Computational Linguistics (NoDaLiDa-2015), Vilnius, Lithuania: . Paper presented at 20th Nordic Conference of Computational Linguistics (pp. 317-320).
Open this publication in new window or tab >>A Tool for Automatic Simplification of Swedish Texts
2015 (English)In: Proceedings of the 20th Nordic Conference of Computational Linguistics (NoDaLiDa-2015), Vilnius, Lithuania, 2015, p. 317-320Conference paper, Published paper (Refereed)
Abstract [en]

We present a rule based automatic text simplification tool for Swedish. The tool is designed to facilitate experimentation with various simplification techniques. The architecture of the tool is inspired by and partly built on a previous text simplification tool for Swedish, CogFLUX. New functionality, new operation types, and new simplification operations were added.

National Category
Language Technology (Computational Linguistics)
Identifiers
urn:nbn:se:liu:diva-128185 (URN)978-91-7519-098-3 (ISBN)
Conference
20th Nordic Conference of Computational Linguistics
Available from: 2016-05-20 Created: 2016-05-20 Last updated: 2018-01-10Bibliographically approved
Kanebrant, E., Heimann Mühlenbock, K., Johansson Kokkinakis, S., Jönsson, A., Liberg, C., Af Geijerstam, Å., . . . Falkenjack, J. (2015). T-MASTER -- A tool for assessing students' reading abilities. In: Proceedings of the 7th International Conference on Computer Supported Education (CSEDU 2015), Lisbon, Portugal: . Paper presented at 7th International Conference on Computer Supported Education (CSEDU 2015) (pp. 220-227). SciTePress
Open this publication in new window or tab >>T-MASTER -- A tool for assessing students' reading abilities
Show others...
2015 (English)In: Proceedings of the 7th International Conference on Computer Supported Education (CSEDU 2015), Lisbon, Portugal, SciTePress, 2015, p. 220-227Conference paper, Published paper (Refereed)
Abstract [en]

We present T-MASTER, a tool for assessing students’ reading skills on a variety of dimensions. T-MASTER uses sophisticated measures for assessing a student’s reading comprehension and vocabulary understanding. Texts are selected based on their difficulty using novel readability measures and tests are created based on the texts. The results are analyzed in T-MASTER, and the numerical results are mapped to textual descriptions that describe the student’s reading abilities on the dimensions being analysed. These results are presented to the teacher in a form that is easily comprehensible, and lends itself to inspection of each individual student’s results.

Place, publisher, year, edition, pages
SciTePress, 2015
Keywords
Reading assessment, vaocabulary assessment, teacher and student feedback
National Category
Didactics Pedagogical Work Pedagogy Specific Languages
Identifiers
urn:nbn:se:liu:diva-128186 (URN)10.5220/0005410902200227 (DOI)978-989-758-108-3 (ISBN)
Conference
7th International Conference on Computer Supported Education (CSEDU 2015)
Available from: 2016-05-20 Created: 2016-05-20 Last updated: 2018-01-10
Saifullah, M., Balkenius, C. & Jönsson, A. (2014). A biologically based model for recognition of 2-D occluded patterns. Cognitive Processing, 15(1), 13-28
Open this publication in new window or tab >>A biologically based model for recognition of 2-D occluded patterns
2014 (English)In: Cognitive Processing, ISSN 1612-4782, E-ISSN 1612-4790, Vol. 15, no 1, p. 13-28Article in journal (Refereed) Published
Abstract [en]

In this work, we present a biologically inspired model for recognition of occluded patterns. The general architecture of the model is based on the two visual information processing pathways of the human visual system, i.e. the ventral and the dorsal pathways. The proposed hierarchically structured model consists of three parallel processing channels. The main channel learns invariant representations of the input patterns and is responsible for pattern recognition task. But, it is limited to process one pattern at a time. The direct channel represents the biologically based direct connection from the lower to the higher processing level in the human visual cortex. It computes rapid top-down pattern-specific cues to modulate processing in the other two channels. The spatial channel mimics the dorsal pathway of the visual cortex. It generates a combined saliency map of the input patterns and, later, segments the part of the map representing the occluded pattern. This segmentation process is based on our hypothesis that the dorsal pathway, in addition to encoding spatial properties, encodes the shape representations of the patterns as well. The lateral interaction between the main and the spatial channels at appropriate processing levels and top-down, pattern-specific modulation of the these two channels by the direct channel strengthen the locations and features representing the occluded pattern. Consequently, occluded patterns become focus of attention in the ventral channel and also the pattern selected for further processing along this channel for final recognition.

Place, publisher, year, edition, pages
Springer Berlin/Heidelberg, 2014
Keywords
Attention; Interactive process; Neural network model; Occluded patterns; Segmentation and recognition; Vision
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:liu:diva-110529 (URN)10.1007/s10339-013-0578-9 (DOI)000346037800002 ()24122414 (PubMedID)2-s2.0-84893388441 (Scopus ID)
Available from: 2014-09-14 Created: 2014-09-12 Last updated: 2018-01-11Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-4899-588X

Search in DiVA

Show all publications