Error detection for post-editing rule-based machine translation

Valotkaite, Justina

Publicação

Error detection for post-editing rule-based machine translation

2012Dissertação de mestrado

datacite.subject.fos	Humanidades::Outras Humanidades	pt_PT
dc.contributor.advisor	Specia, Lucia
dc.contributor.advisor	Orasan, Constantin
dc.contributor.advisor	Baptista, Jorge
dc.contributor.author	Valotkaite, Justina
dc.date.accessioned	2018-12-05T17:14:39Z
dc.date.available	2018-12-05T17:14:39Z
dc.date.issued	2012
dc.date.submitted	2012
dc.description.abstract	The increasing role of Post-editing (PE) as a way of improving Machine Translation (MT) output and a faster alternative to translating from scratch among translators has lately attracted researchers’ attention. A number of recent studies have proposed various attempts to facilitate this task, especially for the outputs of Statistical Machine Translation (SMT). However, little attention in the field has been given to Rule-based Machine Translation (RBMT). In this dissertation an effort was made to provide support for the PE task through Error Detection (ED). A deep linguistic error analysis was done in a sample of English sentences in two text domains translated from Portuguese by two RBMT systems. The hypothesis is that automatically identifying and highlighting errors in translations can help to perform the PE task faster, make it more efficient and less tedious. As RBMT systems tend to make repetitive, systematic mistakes translators are forced to post-edit the same mistakes which makes their task monotonous and frustrating. In order to solve this problem, a set of 40 contrastive rules was designed tackling various linguistic phenomena on the basis of the translation errors identified in the error analysis. By applying this linguistic approach the project aimed at demonstrating that one can have a rule-based system working on the basis of designed rules which could help to detect and highlight translation errors in the RBMT output. The rules were verified by performing an experimental error analysis on a new data set whose results revealed that their coverage was 98.21%. The implementation results demonstrated a successful performance of the system. In addition, the results of a psycholinguistic experiment performed with human translators confirmed that having highlighted errors is useful as this can help translators perform the postediting task up to 12 seconds per error faster and improve their efficiency by minimizing the number of missed errors.	pt_PT
dc.identifier.uri	http://hdl.handle.net/10400.1/11064
dc.language.iso	eng	pt_PT
dc.subject	Error classification	pt_PT
dc.subject	Error detection	pt_PT
dc.subject	Error analysis	pt_PT
dc.subject	Rule-based machine translation	pt_PT
dc.subject	Post-editing	pt_PT
dc.title	Error detection for post-editing rule-based machine translation	pt_PT
dc.type	master thesis
dspace.entity.type	Publication
rcaap.rights	restrictedAccess	pt_PT
rcaap.type	masterThesis	pt_PT
thesis.degree.grantor	Universidade do Algarve. Faculdade de Ciências Humanas e Sociais
thesis.degree.level	Mestre
thesis.degree.name	Mestrado em Processamento de Linguagem Natural & Tecnologia da Linguagem	pt_PT

Ficheiros

Principais

A mostrar 1 - 1 de 1

Nome:: Justina.Thesis.pdf
Tamanho:: 2.05 MB
Formato:: Adobe Portable Document Format

Ver/Abrir

Licença

A mostrar 1 - 1 de 1

Nome:: license.txt
Tamanho:: 3.41 KB
Formato:: Item-specific license agreed upon to submission
Descrição:

Ver/Abrir

Coleções

UA01-Teses
FCH1-Teses