Knjižnica Filozofskog fakulteta
Sveučilišta u Zagrebu
Faculty of Humanities and Social Sciences Institutional Repository

Statistical machine translation system and computational domain adaptation

Downloads

Downloads per month over past year

Dunđer, Ivan. (2015). Statistical machine translation system and computational domain adaptation. PhD Thesis. Filozofski fakultet u Zagrebu, Department of Information Science
Department of Information Science > Chair of social-humanistic informatics.
(Poslijediplomski doktorski studij informacijskih i komunikacijskih znanosti) [mentor Seljan, Sanja].

[img]
Preview
PDF (Croatian)
Download (2MB) | Preview

Abstract

Phrase-based statistical machine translation is one of possible automatic machine translation approaches. This work proposes methods for increasing the quality of machine translation by adapting certain parameters in the statistical machine translation model. The idea was to build phrase-based statistical machine translation systems for Croatian and English language. The systems were be trained for two directions, on two domains, on parallel corpora of different sizes and characteristics for Croatian-English and English-Croatian language pair, after which the tuning procedure was conducted. Afterwards, hybrid systems which combine features of both domains were investigated. Thereby the direct impact of domain adaptation on the quality of automatic machine translation of Croatian language was explored, whereas new findings can be utilised for building new systems. Automatic and human evaluation of machine translations were carried out, while obtained results were compared with results obtained from applying existing statistical machine translation web services.

Item Type: PhD Thesis
Uncontrolled Keywords: statistical machine translation, domain adaptation, automatic evaluation of machine translation quality, human evaluation, machine translation system ranking, computational natural language processing, language technologies, information sciences
Subjects: Information sciences > Social-humanistic informatics
Departments: Department of Information Science
Department of Information Science > Chair of social-humanistic informatics
Supervisor: Seljan, Sanja
Additional Information: Poslijediplomski doktorski studij informacijskih i komunikacijskih znanosti
Date Deposited: 24 Feb 2017 14:15
Last Modified: 09 Mar 2017 13:10
URI: http://darhiv.ffzg.unizg.hr/id/eprint/8222

Actions (login required)

View Item View Item