LiU Electronic Press
Author:
Eklund, Robert (Linköping University, Department of Computer and Information Science, NLPLAB - Natural Language Processing Laboratory) (Linköping University, The Institute of Technology)
Title:
Disfluency in Swedish human–human and human–machine travel booking dialogues
Department:
Linköping University, Department of Computer and Information Science, NLPLAB - Natural Language Processing Laboratory
Linköping University, The Institute of Technology
Responsible org.:
Linköping University, Department of Computer and Information Science
Publication type:
Doctoral thesis, monograph (Other academic)
Language:
English
Place of publ.:
Linköping
Publisher:
Linköping University Electronic Press
Pages:
388
Series:
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524; 882
Year of publ.:
2004
URI:
urn:nbn:se:liu:diva-5012
Permanent link:
http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-5012
ISBN:
91-7373-966-9
Subject category:
Language Technology (Computational Linguistics)
SVEP category:
Computational linguistics
Keywords(en) :
Speech, Speech disorders, Speech intelligibility, Speech perception, Computer linguistics, phonetic, Man-computer-interaction
Abstract(en) :

This thesis studies disfluency in spontaneous Swedish speech, i.e., the occurrence of hesitation phenomena like eh, öh, truncated words, repetitions and repairs, mispronunciations and so on. The thesis is divided into three parts:

PART I provides the background, covering scientific, personal and industrial–academic aspects, in the Tuning in quotes, the Preamble and the Introduction (chapter 1).

PART II consists of one chapter only, chapter 2, which dives into the etiology of disfluency. Consequently, it describes previous research on disfluencies, including areas that are not the main focus of the present tome, such as stuttering, psychotherapy, philosophy, neurology, discourse perspectives, speech production, application-driven perspectives, cognitive aspects, and so on. A discussion of terminology and definitions is also provided. The goal of this chapter is to provide as broad a picture as possible of the phenomenon of disfluency, and of how these differing perspectives relate to each other.

PART III describes the linguistic data studied and analyzed in this thesis, with the following structure:

Chapter 3 describes how the speech data were collected, and for what reason. Sum totals of the data and the post-processing method are also described.

Chapter 4 describes how the data were transcribed, annotated and analyzed. The labeling method is described in detail, as is the method employed to do frequency counts.

Chapter 5 presents the analysis and results for all different categories of disfluencies. Besides general frequency and distribution of the different types of disfluencies, both inter- and intra-corpus results are presented, as are co-occurrences of different types of disfluencies. Also, inter- and intra-speaker differences are discussed.

Chapter 6 discusses the results, mainly in light of previous research. Reasons for the observed frequencies and distribution are proposed, as is their relation to language typology, along with syntactic, morphological and phonetic reasons for the observed phenomena. Future work is also envisaged: work that is possible on the present data set, work that is possible on the present data set given extended labeling, and work that I think should be carried out but for which the present data set fails, in one way or another, to meet the requirements.

Appendices 1–4 list the sum total of all data analyzed in this thesis (apart from Tok Pisin data). Appendix 5 provides an example of a full human–computer dialogue.

Note:
The electronic version is a corrected version of the printed dissertation, in which typos and phrasing have been corrected. A list of the corrections is provided in the errata.
Public defence:
2004-06-04, Visionen, Hus B, Campus Valla, Linköpings universitet, Linköping, 13:15 (English)
Degree:
Doctor of Philosophy (PhD)
Supervisor:
Ahrenberg, Lars (Linköping University, Department of Computer and Information Science, NLPLAB - Natural Language Processing Laboratory) (Linköping University, The Institute of Technology)
Available from:
2004-06-26
Created:
2004-06-26
Last updated:
2013-10-17
Statistics:
1191 hits
FILE INFORMATION
File size:
3929 kb
Mimetype:
application/pdf
Type:
fulltext
Statistics:
1476 hits
File size:
189 kb
Mimetype:
application/pdf
Type:
errata
Statistics:
77 hits