{"id":554,"date":"2011-06-05T10:46:27","date_gmt":"2011-06-05T09:46:27","guid":{"rendered":"http:\/\/wasaty.pl\/blog\/?p=554"},"modified":"2011-10-23T15:37:49","modified_gmt":"2011-10-23T14:37:49","slug":"memoq-automation-auto-translatables","status":"publish","type":"post","link":"https:\/\/wasaty.pl\/blog\/2011\/06\/05\/memoq-automation-auto-translatables\/","title":{"rendered":"MemoQ automation &#8211; auto-translatables"},"content":{"rendered":"<div style=\"float: right; margin-left: 10px;\"><a href=\"https:\/\/twitter.com\/share\" class=\"twitter-share-button\" data-via=\"Wasaty\" data-count=\"vertical\" data-url=\"https:\/\/wasaty.pl\/blog\/2011\/06\/05\/memoq-automation-auto-translatables\/\">Tweet<\/a><\/div>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-511\" title=\"memoq-logo\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/memoq-logo.png\" alt=\"\" width=\"80\" height=\"80\" \/>MemoQ is a fast, easy to use and very powerful CAT tool. However, in the default configuration it does have two features I don&#8217;t like: the QA module often gives warnings concerning numbers (&#8220;invalid number format in source\/target prevents strict matching&#8221;), and the software does not allow to copy numerical values from source segments by keyboard shortcuts, like in Trados. Both these problems can be addressed by simple operations: turning on the auto-translateables and modifying the QA rules.<\/p>\n<p><!--more-->Let&#8217;s start with turning on &#8220;automatic&#8221; insertion of numbers. The feature works by detecting numerical values in the source text, and if they are in a proper format of given language, offering the same numerical value formatted for target language on the translation results pane. Why the format change? Because different languages use different formatting rules for numbers. For example &#8220;<strong>10,189.53<\/strong>&#8221; in English should be written as &#8220;<strong>10 189,53<\/strong>&#8221; in Polish.<strong> <\/strong> The modified values can be inserted exactly the same way as other elements from the translation results pane, e.g. by double clicking or entering <strong>Ctrl-hit number<\/strong> from the list.<\/p>\n<p>The procedure of enabling this function will look differently if we want to turn on auto-translatables for current project (and only for this one), or in case we would like to turn them for all new projects.<\/p>\n<p>Let&#8217;s start with enabling automatic number &#8220;translation&#8221; for <strong>current project<\/strong> &#8212; say to check, how this works.<\/p>\n<ol>\n<li>Open\/create a project.<\/li>\n<li>In the <strong>Settings<\/strong> panel of the current project click <strong>Auto-translation rules<\/strong>. The list of available national standards will be displayed.<\/li>\n<li>Choose your target language (in case of Polish one can use Hungarian instead). The name of active rule set is displayed with bold font, exactly as for other resources. Below you can see the example of the numerical values recognized by default auto-translation rules.\n<p><div id=\"attachment_502\" style=\"width: 509px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_standard.png\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-502\" class=\"size-full wp-image-502\" title=\"mq_autotrans_standard\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_standard.png\" alt=\"\" width=\"509\" height=\"330\" srcset=\"https:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_standard.png 509w, https:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_standard-300x194.png 300w\" sizes=\"auto, (max-width: 509px) 100vw, 509px\" \/><\/a><p id=\"caption-attachment-502\" class=\"wp-caption-text\">Example of numerical values recognized by default auto-translation rules. The numbers separated by hyphens are recognized individually.<\/p><\/div><\/li>\n<\/ol>\n<p>This will definitely simplify the translation of texts with lots of numbers, especially if they are in &#8220;standard&#8221; format. On the other hand, this will not help much if the text we are dealing with contains a lot of non-standard numbers, e.g. catalog numbers where numbers are grouped with letters, numbers separated by hyphens or references to sub-points of some documentation or legal texts. As you can see above, not all of them will be recognized.<\/p>\n<p>Fortunately, auto-translation rules can be easily defined using regular expressions. To make my life easier, I&#8217;ve defined myself the group of four rules for recognizing non-standard numbers:<\/p>\n<ol>\n<li>Letters and numbers combined (e.g. catalog numbers). Example: N665, Alpha9beta<\/li>\n<li>Combined text and numbers separated by hyphen (unlimited number groups, e.g. catalog numbers, regulations). Example: IEC60001-11-1, CAP1283-34<\/li>\n<li>Groups of numbers separated by at least two dots (e.g. software version numbers, sub-subparagraph numbers, legal texts). Example: 654.123.456, 84.12.79.1<\/li>\n<li>Groups of numbers separated by at least two hyphens (e.g. catalog numbers). Example:6-234-957, 543-21-8<\/li>\n<li><strong>ADDED<\/strong>: Rules for recognizing email addresses and URLs (http and ftp). If you need to localize the URLs, just remove the rules starting with &#8220;http&#8221;.<\/li>\n<\/ol>\n<p>To use my rules you have to download <a href=\"http:\/\/wasaty.pl\/Stuff\/Non-standard%20numbers.mqres\"><strong>this file<\/strong><\/a> and in auto-translation settings (see point 2) use <strong>Import new<\/strong> command and select the downloaded file (<strong>Non-standard numbers.mqres<\/strong>). Then select it for the current project. Examples of recognized number groups are shown on the picture below.<\/p>\n<div id=\"attachment_504\" style=\"width: 510px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_custom.png\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-504\" class=\"size-full wp-image-504\" title=\"mq_autotrans_custom\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_custom.png\" alt=\"\" width=\"510\" height=\"328\" srcset=\"https:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_custom.png 510w, https:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_custom-300x192.png 300w\" sizes=\"auto, (max-width: 510px) 100vw, 510px\" \/><\/a><p id=\"caption-attachment-504\" class=\"wp-caption-text\">Numbers groups recognized by \u201eNon-standard numbers\u201d rule set.<\/p><\/div>\n<p>The numbers with one dot are not recognized to avoid a conflict with standard rules, and groups with only one hyphen are not recognized because they usually (but not always) have to be treated separately in Polish (ie. 45-69 should be formatted as 45 \u2013 69). If we activate both rules groups (standard for target language and my non-standard numbers), the result for my example file will look like this:<\/p>\n<div id=\"attachment_505\" style=\"width: 510px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_both.png\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-505\" class=\"size-full wp-image-505\" title=\"mq_autotrans_both\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_both.png\" alt=\"\" width=\"510\" height=\"330\" srcset=\"https:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_both.png 510w, https:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_both-300x194.png 300w\" sizes=\"auto, (max-width: 510px) 100vw, 510px\" \/><\/a><p id=\"caption-attachment-505\" class=\"wp-caption-text\">Active rules &quot;Numbers-PolishGroup&quot; and &quot;Non-standard numbers&quot;.<\/p><\/div>\n<p>If the auto-translation rules are meeting your expectations, you can activate them for <strong>all newly created projects<\/strong>. To do so:<\/p>\n<ol>\n<li>In the <strong>Tools<\/strong> menu select <strong>Options<\/strong>. The <strong>Default resources<\/strong> is the first active card.<\/li>\n<li>From the list on the <strong>Default resources<\/strong> pane click <strong>Auto-translation rules<\/strong>. Select rules, you want to activate for new projects. In my case this is <strong>Numbers-PolishGroup<\/strong> and (optional) <strong>Non-standard numbers<\/strong>.<\/li>\n<li>Click <strong>OK<\/strong>.<\/li>\n<\/ol>\n<p>It is worth knowing that auto-translatables can be used for many other things besides numbers. For example, if there are HTML tags in the Word or Excel file, we can detect them in the source segments and insert them by keyboard shortcut. In this case you have to use rules editor: enter &#8220;(\\&lt;.+?\\&gt;)&#8221; in the <strong>Auto-translation rules<\/strong> and &#8220;$1&#8221; in the <strong>Replace order rules<\/strong> field. Similar rules can be created e.g. for consistently marked text strings which are not to be translated. Auto-translation rules can be used also for automatic unit conversion &#8211; this subject is described in the memoQ help file.<\/p>\n<p>Now, the use of auto-translation rules can simplify our job, but there are still &#8220;non-standard number format&#8221; alerts we have to deal with. How to remove them\/reduce their number?<\/p>\n<p><strong>For the current project<\/strong>:<\/p>\n<ol>\n<li>In the project <strong>Settings<\/strong> panel select <strong>QA settings<\/strong>. Since the default settings cannot be modified, you have to clone them using <strong>Clone\/use new<\/strong> command in the lower right part of the window. Enter the meaningful name, e.g. <strong>Default-corrected numbers<\/strong>.<\/li>\n<li>Select new QA rules and click <strong>Edit<\/strong> command.<\/li>\n<li>In the <strong>Edit QA settings<\/strong> window select <strong>Check auto-translatables<\/strong> option.<br \/>\n<a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_03.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-506\" title=\"mq_autotrans_03\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_03-300x238.png\" alt=\"\" width=\"300\" height=\"238\" srcset=\"https:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_03-300x238.png 300w, https:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_03.png 694w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><br \/>\n<\/a><\/li>\n<li>Click the <strong>Numbers<\/strong> tab of the <strong>Edit QA settings<\/strong> window.<\/li>\n<li><del><strong>Deselect<\/strong> all checking options<\/del>. EDIT: Deselect all checkmarks except the one for <strong>Verify that numbers are matched on the target side<\/strong> (leave that one checked, see Denis Hay comment below post).<br \/>\n<a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_04.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-507\" title=\"mq_autotrans_04\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_04-300x238.png\" alt=\"\" width=\"300\" height=\"238\" srcset=\"https:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_04-300x238.png 300w, https:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/mq_autotrans_04.png 694w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/li>\n<\/ol>\n<p>With these settings memoQ will no longer verify the source\/target numbers using internal rules over which we don&#8217;t have any control, <strong>but<\/strong> auto-translatables verification won&#8217;t let us miss any incorrect number in the target text. Depending on the type of text and target language rules, we can drastically reduce or even eliminate the &#8220;unknown format&#8221; warnings. Some will still be there, though &#8211; if there will be e.g. &#8220;point 6.8&#8221; or &#8220;version 12.1&#8221;, we&#8217;ll still have warnings there (at least for Polish, where the dot should be changed to comma).<\/p>\n<p><strong>To apply the above change in QA settings to all new projects:<\/strong><\/p>\n<ol>\n<li>From the <strong>Tools<\/strong> menu select <strong>Options<\/strong>. The <strong>Default resources<\/strong> is the first active card.<\/li>\n<li>From the list on the <strong>Default resources<\/strong> pane click <strong>QA settings.<\/strong><\/li>\n<li>Select modified QA settings (e.g. <strong>Default-corrected numbers<\/strong>).<\/li>\n<li>Click <strong>OK<\/strong>.<\/li>\n<\/ol>\n<p>The modified rules will be active as defaults for all new memoQ projects.<\/p>\n<p><strong>EDIT:<\/strong><\/p>\n<p>Added rules for recognizing email addresses and URLs (http and ftp). If you need to localize the URLs, just remove the rules starting with &#8220;http&#8221;.<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Tweet MemoQ is a fast, easy to use and very powerful CAT tool. However, in the default configuration it does have two features I don&#8217;t like: the QA module often gives warnings concerning numbers (&#8220;invalid number format in source\/target prevents strict matching&#8221;), and the software does not allow to copy numerical values from source segments &hellip; <\/p>\n<p><a class=\"more-link btn\" href=\"https:\/\/wasaty.pl\/blog\/2011\/06\/05\/memoq-automation-auto-translatables\/\">Continue reading<\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3,6],"tags":[9,11,20,24,25],"class_list":["post-554","post","type-post","status-publish","format-standard","hentry","category-narzedzia","category-wskazowki","tag-cat","tag-english","tag-memoq","tag-porady","tag-qa","item-wrap"],"_links":{"self":[{"href":"https:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts\/554","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/comments?post=554"}],"version-history":[{"count":8,"href":"https:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts\/554\/revisions"}],"predecessor-version":[{"id":564,"href":"https:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts\/554\/revisions\/564"}],"wp:attachment":[{"href":"https:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/media?parent=554"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/categories?post=554"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/tags?post=554"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}