Brkić, Marija and Vičić, Tomislav and Seljan, Sanja. (2009). Evaluation of the Statistical Machine Translation Service for Croatian-English. In: 2nd International Conference “The Future of Information Sciences: INFuture2009 – Digital Resources and Knowledge Sharing”, 4-6 November 2009, Zagreb, Croatia.
|
PDF
(English)
Download (208kB) | Preview |
Abstract
Much thought has been given in an endeavour to formalize the translation process. As a result, various approaches to MT (machine translation) were taken. With the exception of statistical translation, all approaches require cooperation between language and computer science experts. Most of the models use various hybrid approaches. Statistical translation approach is completely language independent if we disregard the fact that it requires huge parallel corpus that needs to be split into sentences and words. This paper compares and discusses state-of-the-art statistical machine translation (SMT) models and evaluation methods. Results of statistically-based Google Translate tool for Croatian-English translations are presented and multilevel analysis is given. Three different types of texts are manually evaluated and results are analysed by the χ2-test.
Item Type: | Published conference work (Lecture) |
---|---|
Uncontrolled Keywords: | SMT (statistical machine translation), online, Google Translate, MT, Croatian-English, manual evaluation, fluency, adequacy, χ2-test |
Subjects: | Information sciences > Social-humanistic informatics Information sciences > Natural language processing, lexicography and encyclopedic science Linguistics |
Departments: | Department of Information Science |
Date Deposited: | 24 Feb 2017 09:43 |
Last Modified: | 24 Feb 2017 09:43 |
URI: | http://darhiv.ffzg.unizg.hr/id/eprint/8031 |
Actions (login required)
View Item |