{"id":133,"date":"2014-11-20T00:25:00","date_gmt":"2014-11-20T05:25:00","guid":{"rendered":"https:\/\/www.ata-divisions.org\/JLD\/wp\/2014\/11\/20\/ata-2014-is-machine-translation-your-friend-or-foe-challenges-for-english-japanese-translators\/"},"modified":"2014-11-20T00:25:00","modified_gmt":"2014-11-20T05:25:00","slug":"ata-2014-is-machine-translation-your-friend-or-foe-challenges-for-english-japanese-translators","status":"publish","type":"post","link":"https:\/\/www.ata-divisions.org\/JLD\/2014\/11\/20\/ata-2014-is-machine-translation-your-friend-or-foe-challenges-for-english-japanese-translators\/","title":{"rendered":"ATA 2014: Is Machine Translation Your Friend or Foe? Challenges for English-Japanese Translators"},"content":{"rendered":"<div>Presented by Dr. Takako Aikawa, Sr. Lecturer in Japanese, MIT Global Studies and Languages<\/div>\n<div><o:p><\/o:p><\/div>\n<div>Summarized by Kazumasa Aoyama, Park IP Translations<\/div>\n<div><\/div>\n<div>In her excellent presentation, Dr. Aikawa discussed the use of machine translation (MT) in English-to-Japanese translation. She gave answers to the question with many examples: \u201cWhy English-to-Japanese translation is so challenging for MT?\u201d&nbsp; She also showed us how human translators can help improve the quality of MT.<br \/><a name='more'><\/a><o:p><\/o:p><\/div>\n<div><\/div>\n<div>As an introduction, she outlined the history of MT and its advances, from rule-based MT to statistical MT (SMT), and explained how an SMT system works. 
SMT has two major advantages over rule-based MT: it is scalable and sustainable.<o:p><\/o:p><\/div>\n<div><\/div>\n<div>She then showed us the challenges that MT faces when translating from one language to another, particularly from English to Japanese.<o:p><\/o:p><\/div>\n<div><\/div>\n<div>She listed the following open problems for MT in general:<o:p><\/o:p><\/div>\n<div style=\"mso-list: l3 level1 lfo1; text-indent: -.25in;\"><!--[if !supportLists]-->1.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Lexical ambiguity: Many words have different meanings depending on the context.<o:p><\/o:p><\/div>\n<div style=\"mso-list: l3 level1 lfo1; text-indent: -.25in;\"><!--[if !supportLists]-->2.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Syntactic ambiguity: A sentence can be syntactically interpreted in more than one way.<o:p><\/o:p><\/div>\n<div style=\"mso-list: l3 level1 lfo1; text-indent: -.25in;\"><!--[if !supportLists]-->3.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Idioms<o:p><\/o:p><\/div>\n<div style=\"mso-list: l3 level1 lfo1; text-indent: -.25in;\"><!--[if !supportLists]-->4.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Pronoun resolution: MT has to determine what a pronoun refers to, as well as its gender and number.<o:p><\/o:p><\/div>\n<div><\/div>\n<div>She then answered the question: the quality of English-to-Japanese MT\/SMT is so poor because<o:p><\/o:p><\/div>\n<div>Japanese and English are so different from each other:<o:p><\/o:p><\/div>\n<div style=\"mso-list: l4 level1 lfo2; text-indent: -.25in;\"><!--[if !supportLists]-->1.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Word order (SVO (English) vs. 
SOV (Japanese))<o:p><\/o:p><\/div>\n<div style=\"mso-list: l4 level1 lfo2; text-indent: -.25in;\"><!--[if !supportLists]-->2.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Case markers. Japanese uses case markers such as <span lang=\"JA\" style=\"font-family: &quot;ms mincho&quot;; mso-ascii-font-family: &quot;Times New Roman&quot;; mso-fareast-font-family: &quot;MS Mincho&quot;; mso-fareast-theme-font: minor-fareast; mso-hansi-font-family: &quot;Times New Roman&quot;;\">\u304c<\/span> and <span lang=\"JA\" style=\"font-family: &quot;ms mincho&quot;; mso-ascii-font-family: &quot;Times New Roman&quot;; mso-fareast-font-family: &quot;MS Mincho&quot;; mso-fareast-theme-font: minor-fareast; mso-hansi-font-family: &quot;Times New Roman&quot;;\">\u3092<\/span>, so the word order of a Japanese sentence is not as critical as in English (free word order). For example, <span lang=\"JA\" style=\"font-family: &quot;ms mincho&quot;; mso-ascii-font-family: &quot;Times New Roman&quot;; mso-fareast-font-family: &quot;MS Mincho&quot;; mso-fareast-theme-font: minor-fareast; mso-hansi-font-family: &quot;Times New Roman&quot;;\">\u592a\u90ce<u>\u304c<\/u>\u305d\u306e\u30ea\u30f3\u30b4<u>\u3092<\/u>\u98df\u3079\u305f<\/span> and <span lang=\"JA\" style=\"font-family: &quot;ms mincho&quot;; mso-ascii-font-family: &quot;Times New Roman&quot;; mso-fareast-font-family: &quot;MS Mincho&quot;; mso-fareast-theme-font: minor-fareast; mso-hansi-font-family: &quot;Times New Roman&quot;;\">\u305d\u306e\u30ea\u30f3\u30b4<u>\u3092<\/u>\u592a\u90ce<u>\u304c<\/u>\u98df\u3079\u305f<\/span> are both acceptable Japanese sentences with almost the same meaning.<o:p><\/o:p><\/div>\n<div style=\"mso-list: l4 level1 lfo2; text-indent: -.25in;\"><!--[if !supportLists]-->3.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Postpositions. 
A single English preposition may need to be translated into different Japanese postpositions depending on the context:<br \/>Taro ate the apple <u>at<\/u> school\/<u>at <\/u>3 pm.<br \/><span lang=\"JA\" style=\"font-family: &quot;ms mincho&quot;; mso-ascii-font-family: &quot;Times New Roman&quot;; mso-fareast-font-family: &quot;MS Mincho&quot;; mso-fareast-theme-font: minor-fareast; mso-hansi-font-family: &quot;Times New Roman&quot;;\">\u592a\u90ce\u304c\u305d\u306e\u6797\u6a8e\u3092\u5b66\u6821<u>\u3067<\/u>\uff0f\u5348\u5f8c\uff13\u6642<u>\u306b<\/u>\u98df\u3079\u305f\u3002<\/span><br \/>Taro will come <u>by<\/u> train\/<u>by<\/u> noon.<br \/><span lang=\"JA\" style=\"font-family: &quot;ms mincho&quot;; mso-ascii-font-family: &quot;Times New Roman&quot;; mso-fareast-font-family: &quot;MS Mincho&quot;; mso-fareast-theme-font: minor-fareast; mso-hansi-font-family: &quot;Times New Roman&quot;;\">\u592a\u90ce\u306f\u96fb\u8eca<u>\u3067<\/u>\uff0f\u6b63\u5348<u>\u307e\u3067\u306b<\/u>\u6765\u307e\u3059\u3002<\/span><o:p><\/o:p><\/div>\n<div style=\"mso-list: l4 level1 lfo2; text-indent: -.25in;\"><!--[if !supportLists]-->4.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Pronouns. \u201cPronoun resolution requires the understanding of a given context! 
But MT is still at a sentence level.\u201d<o:p><\/o:p><\/div>\n<div style=\"mso-list: l4 level1 lfo2; text-indent: -.25in;\"><!--[if !supportLists]-->5.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Japanese counters (<span lang=\"JA\" style=\"font-family: &quot;ms mincho&quot;; mso-ascii-font-family: &quot;Times New Roman&quot;; mso-fareast-font-family: &quot;MS Mincho&quot;; mso-fareast-theme-font: minor-fareast; mso-hansi-font-family: &quot;Times New Roman&quot;;\">\u672c<\/span>, <span lang=\"JA\" style=\"font-family: &quot;ms mincho&quot;; mso-ascii-font-family: &quot;Times New Roman&quot;; mso-fareast-font-family: &quot;MS Mincho&quot;; mso-fareast-theme-font: minor-fareast; mso-hansi-font-family: &quot;Times New Roman&quot;;\">\u4eba<\/span>, <span lang=\"JA\" style=\"font-family: &quot;ms mincho&quot;; mso-ascii-font-family: &quot;Times New Roman&quot;; mso-fareast-font-family: &quot;MS Mincho&quot;; mso-fareast-theme-font: minor-fareast; mso-hansi-font-family: &quot;Times New Roman&quot;;\">\u518a<\/span>, etc.)<o:p><\/o:p><\/div>\n<div style=\"mso-list: l4 level1 lfo2; text-indent: -.25in;\"><!--[if !supportLists]-->6.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Word-breaking. While English uses white space to indicate word boundaries, Japanese does not, so it needs a word-breaker for NLP\/MT-related tasks.<o:p><\/o:p><\/div>\n<div><\/div>\n<div>She then discussed how human translators can improve the quality of MT for English-to-Japanese translation. 
She listed three ways.<o:p><\/o:p><\/div>\n<div style=\"mso-list: l2 level1 lfo3; text-indent: -.25in;\"><!--[if !supportLists]-->1.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Training data<br \/>\u201cThe more the training data, the better the quality of an SMT system,\u201d and \u201cthe cleaner the training data, the better the quality of an SMT system.\u201d<o:p><\/o:p><\/div>\n<div style=\"mso-list: l2 level1 lfo3; text-indent: -.25in;\"><!--[if !supportLists]-->2.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Controlled English. Using controlled\/simplified English helps MT produce better-quality translations.<o:p><\/o:p><\/div>\n<div style=\"mso-list: l2 level1 lfo3; text-indent: -.25in;\"><!--[if !supportLists]-->3.<span style=\"font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Post-editing. Post-editing \u201cis the process of improving a machine-generated translation with a minimum of manual labor.\u201d (<a href=\"https:\/\/en.wikipedia.org\/wiki\/Postediting\">https:\/\/en.wikipedia.org\/wiki\/Postediting<\/a>)<o:p><\/o:p><\/div>\n<div style=\"margin-left: .25in;\"><\/div>\n<div>She emphasized the importance of human post-editing in our workflow when translating English to Japanese with MT.<o:p><\/o:p><\/div>\n<div><\/div>\n<div>Machine Translation Post-editing Guidelines<o:p><\/o:p><\/div>\n<div><a href=\"https:\/\/evaluation.taus.net\/resources\/guidelines\/post-editing\/machine-translation-post-editing-guidelines\">https:\/\/evaluation.taus.net\/resources\/guidelines\/post-editing\/machine-translation-post-editing-guidelines<\/a><o:p><\/o:p><\/div>\n<div><\/div>\n<div>Skills for Post-editing<o:p><\/o:p><\/div>\n<div><u>(<\/u><a href=\"https:\/\/www.qt21.eu\/launchpad\/sites\/default\/files\/QTLaunchPad%20Post-Editing%20Webinar.pdf\">http<\/a><a 
href=\"https:\/\/www.qt21.eu\/launchpad\/sites\/default\/files\/QTLaunchPad%20Post-Editing%20Webinar.pdf\">:\/\/www.qt21.eu\/launchpad\/sites\/default\/files\/QTLaunchPad%20Post-Editing%20Webinar.pdf<\/a>)<o:p><\/o:p><\/div>\n<div style=\"margin-left: .5in; mso-list: l1 level1 lfo4; tab-stops: list .5in; text-indent: -.25in;\"><!--[if !supportLists]--><span style=\"font-family: &quot;arial&quot; , &quot;sans-serif&quot;; mso-fareast-font-family: Arial;\">\u2022<span style=\"font-family: &quot;times new roman&quot;; font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><\/span><!--[endif]-->Excellent word-processing and editing skills; ability to work and make corrections directly on screen<o:p><\/o:p><\/div>\n<div style=\"margin-left: .5in; mso-list: l0 level1 lfo5; tab-stops: list .5in; text-indent: -.25in;\"><!--[if !supportLists]--><span style=\"font-family: &quot;arial&quot; , &quot;sans-serif&quot;; mso-fareast-font-family: Arial;\">\u2022<span style=\"font-family: &quot;times new roman&quot;; font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><\/span><!--[endif]-->General knowledge of the problems and challenges faced by MT<o:p><\/o:p><\/div>\n<div style=\"margin-left: .5in; mso-list: l0 level1 lfo5; tab-stops: list .5in; text-indent: -.25in;\"><!--[if !supportLists]--><span style=\"font-family: &quot;arial&quot; , &quot;sans-serif&quot;; mso-fareast-font-family: Arial;\">\u2022<span style=\"font-family: &quot;times new roman&quot;; font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><\/span><!--[endif]-->Specific knowledge of the weaknesses of the particular MT system<o:p><\/o:p><\/div>\n<div style=\"margin-left: .5in; mso-list: l0 level1 lfo5; tab-stops: list .5in; text-indent: -.25in;\"><!--[if !supportLists]--><span style=\"font-family: &quot;arial&quot; , &quot;sans-serif&quot;; mso-fareast-font-family: Arial;\">\u2022<span style=\"font-family: &quot;times new roman&quot;; 
font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><\/span><!--[endif]-->Knowledge of source and target languages<o:p><\/o:p><\/div>\n<div style=\"margin-left: .5in; mso-list: l0 level1 lfo5; tab-stops: list .5in; text-indent: -.25in;\"><!--[if !supportLists]--><span style=\"font-family: &quot;arial&quot; , &quot;sans-serif&quot;; mso-fareast-font-family: Arial;\">\u2022<span style=\"font-family: &quot;times new roman&quot;; font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><\/span><!--[endif]-->Quick in making decisions as to what and how to correct (or ignore errors)<o:p><\/o:p><\/div>\n<div style=\"margin-left: .5in; mso-list: l0 level1 lfo5; tab-stops: list .5in; text-indent: -.25in;\"><!--[if !supportLists]--><span style=\"font-family: &quot;arial&quot; , &quot;sans-serif&quot;; mso-fareast-font-family: Arial;\">\u2022<span style=\"font-family: &quot;times new roman&quot;; font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><\/span><!--[endif]-->Ability to balance PE speed and cost with respect to required quality<o:p><\/o:p><\/div>\n<div style=\"margin-left: .5in; mso-list: l0 level1 lfo5; tab-stops: list .5in; text-indent: -.25in;\"><!--[if !supportLists]--><span style=\"font-family: &quot;arial&quot; , &quot;sans-serif&quot;; mso-fareast-font-family: Arial;\">\u2022<span style=\"font-family: &quot;times new roman&quot;; font-size: 7pt; font-stretch: normal;\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><\/span><!--[endif]-->Ability to adapt to different specifications required for each job<o:p><\/o:p><\/div>\n<div><\/div>\n<div>Here, she answered one of our big questions: Can SMT replace human translators?<o:p><\/o:p><\/div>\n<div><\/div>\n<div>Her answer was, \u201cNo. 
Instead, it will create a new field of translation.\u201d<o:p><\/o:p><\/div>\n<div><\/div>\n<div>Finally, she answered her first question: Machine translation is our FRIEND, if we use it appropriately and wisely.<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Presented by Dr. Takako Aikawa, Sr. Lecturer in Japanese, MIT Global Studies and Languages Summarized by Kazumasa Aoyama, Park IP Translations In her excellent presentation, Dr. Aikawa discussed the use of machine translation (MT) in English-to-Japanese translation. She gave answers to the question with many examples: \u201cWhy English-to-Japanese translation is so challenging for MT?\u201d&nbsp; She [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_genesis_hide_title":false,"_genesis_hide_breadcrumbs":false,"_genesis_hide_singular_image":false,"_genesis_hide_footer_widgets":false,"_genesis_custom_body_class":"","_genesis_custom_post_class":"","_genesis_layout":"","footnotes":""},"categories":[23,32,22],"tags":[],"_links":{"self":[{"href":"https:\/\/www.ata-divisions.org\/JLD\/wp-json\/wp\/v2\/posts\/133"}],"collection":[{"href":"https:\/\/www.ata-divisions.org\/JLD\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ata-divisions.org\/JLD\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ata-divisions.org\/JLD\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ata-divisions.org\/JLD\/wp-json\/wp\/v2\/comments?post=133"}],"version-history":[{"count":0,"href":"https:\/\/www.ata-divisions.org\/JLD\/wp-json\/wp\/v2\/posts\/133\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.ata-divisions.org\/JLD\/wp-json\/wp\/v2\/media?parent=133"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ata-divisions.org\/JLD\/wp-json\/wp\/v2\/categories?post=133"},{"taxonomy":"post_tag","embeddable":true,"
href":"https:\/\/www.ata-divisions.org\/JLD\/wp-json\/wp\/v2\/tags?post=133"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}