liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns
Linköping University, Department of Computer and Information Science, Artificial Intelligence and Integrated Computer Systems. Linköping University, Faculty of Science & Engineering. Uppsala Univ, Sweden.
Univ Lyon, France.
Univ Gothenburg, Sweden.
2021 (English)In: Linguistics Vanguard, E-ISSN 2199-174X, Vol. 7, no 1, article id 20200048Article in journal (Refereed) Published
Abstract [en]

This study conducts an experimental evaluation of two hypotheses about the contributions of formal and semantic features to the grammatical gender assignment of nouns. One of the hypotheses (Corbett and Fraser 2000) claims that semantic features dominate formal ones. The other hypothesis, formulated within the optimal gender assignment theory (Rice 2006), states that form and semantics contribute equally. Both hypotheses claim that the combination of formal and semantic features yields the most accurate gender identification. In this paper, we operationalize and test these hypotheses by trying to predict grammatical gender using only character-based embeddings (that capture only formal features), only context-based embeddings (that capture only semantic features) and the combination of both. We performed the experiment using data from three languages with different gender systems (French, German and Russian). Formal features are a significantly better predictor of gender than semantic ones, and the difference in prediction accuracy is very large. Overall, formal features are also significantly better than the combination of form and semantics, but the difference is very small and the results for this comparison are not entirely consistent across languages.

Place, publisher, year, edition, pages
WALTER DE GRUYTER GMBH , 2021. Vol. 7, no 1, article id 20200048
Keywords [en]
formal features; gender; neural networks; semantics; word embeddings
National Category
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:liu:diva-182245DOI: 10.1515/lingvan-2020-0048ISI: 000733308700005OAI: oai:DiVA.org:liu-182245DiVA, id: diva2:1626979
Note

Funding Agencies|IDEXLYON Fellowship Grant [16-IDEX-0005]; University of Lyon Grant NSCO ED 476 [ANR-10-LABX-0081]; French National Research AgencyFrench National Research Agency (ANR) [ANR-11-IDEX-0007]

Available from: 2022-01-12 Created: 2022-01-12 Last updated: 2022-01-12

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Search in DiVA

By author/editor
Basirat, Ali
By organisation
Artificial Intelligence and Integrated Computer SystemsFaculty of Science & Engineering
In the same journal
Linguistics Vanguard
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 77 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf