Alignment-based profiling of Europarl data in an English-Swedish parallel corpus
Linköpings universitet, Institutionen för datavetenskap, NLPLAB - Laboratoriet för databehandling av naturligt språk. Linköpings universitet, Tekniska högskolan. (HCS)
2010 (English) In: Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10) / [ed] Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner and Daniel Tapias, Paris, France: European Language Resources Association (ELRA), 2010, pp. 3398-3404. Conference paper, Published paper (Refereed)
Abstract [en]

This paper profiles the Europarl part of an English-Swedish parallel corpus and compares it with three other subcorpora of the same parallel corpus. We first describe our method for comparison, which is based on alignments at both the token level and the structural level. Although two of the other subcorpora contain fiction, the Europarl part is found to have the highest proportion of many types of restructurings, including additions, deletions and long-distance reorderings. We explain this by the fact that the majority of Europarl segments are parallel translations.

Place, publisher, year, edition, pages
Paris, France: European Language Resources Association (ELRA), 2010. pp. 3398-3404
Keywords [en]
parallel corpora, profiling, translation, English, Swedish
HSV category
Identifiers
URN: urn:nbn:se:liu:diva-60039
ISI: 000356879508030
ISBN: 2-9517408-6-7 (print)
OAI: oai:DiVA.org:liu-60039
DiVA, id: diva2:354794
Conference
7th International Conference on Language Resources and Evaluation (LREC)
Available from: 2010-10-05. Created: 2010-10-04. Last updated: 2018-01-12. Bibliographically approved.

Open Access in DiVA

Full text (418 kB), 204 downloads
File information
File: FULLTEXT01.pdf. File size: 418 kB. Checksum (SHA-512):
7c9d6708234586911bebce1d9a45ab34cb16998499791977f1ab10957488d3bcf4141054db87e45e00844232ad3e22eed5496fd7e359e726d3fe48c5ef0100b5
Type: fulltext. Mimetype: application/pdf

Author

Ahrenberg, Lars
