{"id":411,"date":"2010-07-14T20:37:48","date_gmt":"2010-07-14T18:37:48","guid":{"rendered":"http:\/\/hell.pl\/wasaty\/blog\/?p=410"},"modified":"2011-05-29T17:10:02","modified_gmt":"2011-05-29T16:10:02","slug":"translating-wordfast-txml-in-memoq","status":"publish","type":"post","link":"http:\/\/wasaty.pl\/blog\/2010\/07\/14\/translating-wordfast-txml-in-memoq\/","title":{"rendered":"Translating WordFast TXML in memoQ"},"content":{"rendered":"<div style=\"float: right; margin-left: 10px;\"><a href=\"https:\/\/twitter.com\/share\" class=\"twitter-share-button\" data-via=\"Wasaty\" data-count=\"vertical\" data-url=\"http:\/\/wasaty.pl\/blog\/2010\/07\/14\/translating-wordfast-txml-in-memoq\/\">Tweet<\/a><\/div>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignleft size-full wp-image-400\" title=\"memoq-wf\" src=\"http:\/\/hell.pl\/wasaty\/blog\/wp-content\/uploads\/2010\/06\/memoq-wf.png\" alt=\"memoq-wf\" width=\"80\" height=\"81\" \/> WordFast as a Word add-on (currently version Classic) was always quite popular &#8211; it was not expensive, simple and fully compatible with Trados. However, last year the company released a completely new version &#8211; WordFast Pro is written from the ground up in Java, which makes it possible to use it on different operating systems (Windows, Linux, MacOS). The software works in table layout typical for all new CAT tools and is&#8230; quite specific. It does have a lot of fans, unfortunately I don\u2019t like it (especially the terrible way it handles insertion of tags). Luckily, we don\u2019t have to translate .txml files using WordFast, it can be done with memoQ. Below you\u2019ll find the procedure step-by-step.<br \/>\n<!--more--><br \/>\nRecently I\u2019m getting a lot of translations in WordFast Pro files. The first translation I did using WF demo and I didn\u2019t liked it, so when I got the next ones, I looked for a way to process them in a much more comfortable environment, that is memoQ. As it came out, it isn\u2019t so hard.<\/p>\n<p>What do we need:<\/p>\n<ul>\n<li>WordFast Pro demo (not necessary, but recommended)<\/li>\n<li>Excel\/Calc<\/li>\n<li>memoQ<\/li>\n<li>Word\/Writer (optional)<\/li>\n<\/ul>\n<p>I have divided the procedure into two parts: preparation of txml file and preparation of wordfast TM. Let\u2019s start with the file preparation. If we don\u2019t have a WordFast, it can be downloaded from <a href=\"http:\/\/www.wordfast.com\/store_download.html\">here<\/a>.<\/p>\n<ol>\n<li>Open the file to translate (.txml) in the WordFast.<\/li>\n<div id=\"attachment_392\" style=\"width: 540px\" class=\"wp-caption aligncenter\"><img aria-describedby=\"caption-attachment-392\" decoding=\"async\" loading=\"lazy\" class=\"size-large wp-image-392\" title=\"01\" src=\"http:\/\/hell.pl\/wasaty\/blog\/wp-content\/uploads\/2010\/06\/01-540x405.png\" alt=\"WordFast Pro window. Use Ctrl-Alt-Ins to copy all source language segments to target segments.\" width=\"540\" height=\"405\" \/><p id=\"caption-attachment-392\" class=\"wp-caption-text\">WordFast Pro window. Use Ctrl-Alt-Ins to copy all source language segments to target segments.<\/p><\/div>\n<li>Use <strong>Ctrl-Alt-Ins<\/strong> to copy all source segments contents into target segments.<\/li>\n<li> Save the file, close WordFast (repeat the procedure for additional files if necessary). If, for some reason, you don\u2019t want or can\u2019t use the WordFast demo, you can use the search\/replace procedure described <a href=\"http:\/\/blog.albatrossolutions.com\/tent-cat-tools\/translating-wordfast-txml-files-in-memoq\">here<\/a>.<\/li>\n<li>Download <a href=\"http:\/\/hell.pl\/wasaty\/Stuff\/XMLConverter%23WordFast.mqres\">this filter file<\/a> and save it on your disc.<\/li>\n<li>Start the <a href=\"http:\/\/hell.pl\/wasaty\/blog\/2010\/03\/21\/nowa-nadzieja-memoq-czyli-wschodzaca-gwiazda-catow\/\">memoQ<\/a>.<\/li>\n<li>From the <strong>Tools<\/strong> menu choose <strong>Resource console<\/strong>, then <strong>Filter configurations<\/strong> and <strong>Import new<\/strong> from the left pane. Select the file saved in step 4.<\/li>\n<li>Create new project, in the Add document window select Add document as. You\u2019ll have to switch file type filter in the lower part of the window from <strong>All supported files<\/strong> to <strong>All files (*.*)<\/strong>. Select the txml file(s) saved in step 3.<\/li>\n<li>In the <strong>Document import settings<\/strong> click the yellow folder icon (<strong>Load filter configuration<\/strong>) and select <strong>WordFast<\/strong> filter imported in step 6. The filter works by hiding the original source segments (&lt;source&gt;), displaying for edition target segments (&lt;target&gt;) and defining handling of WordFast tags. See picture below.\n<div id=\"attachment_393\" style=\"width: 540px\" class=\"wp-caption aligncenter\"><img aria-describedby=\"caption-attachment-393\" decoding=\"async\" loading=\"lazy\" class=\"size-large wp-image-393\" title=\"06\" src=\"http:\/\/hell.pl\/wasaty\/blog\/wp-content\/uploads\/2010\/06\/06-540x355.png\" alt=\"TXML file imported into memoQ. Source segments (&lt;source&gt;) you can see in the preview pane below are hidden, you can translate only &lt;target&gt; segments.\" width=\"540\" height=\"355\" \/><p id=\"caption-attachment-393\" class=\"wp-caption-text\">TXML file imported to memoQ. Source segments (&lt;source&gt;) you can see in the preview pane below are hidden, you can translate only &lt;target&gt; segments.<\/p><\/div>\n<p>Tip: WordFast tags, displayed natively as consecutive numbers {1}, {2}, {3}, in memoQ are displayed as \u201cinline\u201d tags. By default you can insert them using <strong>F9<\/strong> key, but in my there are more convenient key combinations, for example <strong>Ctrl-Alt-Down<\/strong>, like in Trados. To modify keyboard settings choose <strong>Tools<\/strong> &gt; <strong>Resource console<\/strong> &gt; <strong>Keyboard shortcuts<\/strong> &gt; <strong>Clone<\/strong> (for the Default) &gt; <strong>Edit<\/strong>.<\/li>\n<li>After translation export the finished file using <strong>Export<\/strong> command. Excellent QA features of memoQ won\u2019t allow you to export files with tags mismatch and will show you the affected segments (if any).<\/li>\n<li>Just to be extra sure you can open the translated file in WordFast, to check if everything is OK.<\/li>\n<\/ol>\n<div id=\"attachment_394\" style=\"width: 540px\" class=\"wp-caption aligncenter\"> <img aria-describedby=\"caption-attachment-394\" decoding=\"async\" loading=\"lazy\" class=\"size-large wp-image-394\" title=\"07\" src=\"http:\/\/hell.pl\/wasaty\/blog\/wp-content\/uploads\/2010\/06\/07-540x405.png\" alt=\"The finished translation in WordFast The percentage of TM substitutions are shown.\" width=\"540\" height=\"405\" \/><p id=\"caption-attachment-394\" class=\"wp-caption-text\">Finished translation in WordFast. The percentage of TM substitutions are shown.<\/p><\/div>\n<p>The procedure may seem complicated, especially at first, but after the software configuration, the preparation and file opening wont\u2019 take much more time than in case of any other file format. The situations becomes a bit more complicated when along with TXML files client sends us a WordFast TM file with the .txt extension. To use this TM in memoQ we have to prepare it by removing unnecessary data and tags.<\/p>\n<ol>\n<li>Open the TM file in Word (or OpenOffice Writer) (option). In the WF TM tags are encoded as <strong>&amp;tA;<\/strong>, <strong>&amp;tB;<\/strong>, <strong>&amp;tC;<\/strong>, etc. It is possible to remove them all at once using Word. Open <strong>Search &amp; Replace<\/strong> window (Ctrl-H), click <strong>More &gt;&gt;<\/strong> button and select <strong>Use wildcards<\/strong>. Enter <strong>&amp;t[A-Z];<\/strong> in the <strong>Find<\/strong> field. Leave the Replace field empty. Click <strong>Replace all<\/strong> button. Now <strong>deselect<\/strong> <strong>Use wildcards<\/strong> checkbox and replace <strong>&amp;&#8217;A9;<\/strong> with \u00a9, and <strong>&amp;&#8217;AE;<\/strong> with \u00ae (of course, if there are such strings in your TM. Save as text file using UTF-8 encoding.<\/li>\n<li>Start Excel (or OpenOffice Calc), import the TM file by selecting <strong>Data<\/strong> &gt; <strong>External data<\/strong> &gt; <strong>From text<\/strong>. Choose <strong>Tab<\/strong> as a column separator and import the first row checkbox.\n<p><div id=\"attachment_395\" style=\"width: 540px\" class=\"wp-caption aligncenter\"><img aria-describedby=\"caption-attachment-395\" decoding=\"async\" loading=\"lazy\" class=\"size-large wp-image-395\" title=\"02\" src=\"http:\/\/hell.pl\/wasaty\/blog\/wp-content\/uploads\/2010\/06\/02-540x412.png\" alt=\"Where to find a text import command.\" width=\"540\" height=\"412\" \/><p id=\"caption-attachment-395\" class=\"wp-caption-text\">Where to find a text import command.<\/p><\/div><\/li>\n<li>Remove all columns except the ones with source and target texts. In the example above we have to remove columns A, B, C, D, F, H and I, leaving only E and G. As an alternative we can leave also column B, with the translators ID.\n<p><div id=\"attachment_396\" style=\"width: 540px\" class=\"wp-caption aligncenter\"><img aria-describedby=\"caption-attachment-396\" decoding=\"async\" loading=\"lazy\" class=\"size-large wp-image-396\" title=\"03\" src=\"http:\/\/hell.pl\/wasaty\/blog\/wp-content\/uploads\/2010\/06\/03-540x206.png\" alt=\"WordFast TM file correctly imported into Excel.\" width=\"540\" height=\"206\" \/><p id=\"caption-attachment-396\" class=\"wp-caption-text\">WordFast TM file correctly imported into Excel.<\/p><\/div><\/li>\n<li>Enter the language identifiers into first row of source and target columns. If necessary, we can add columns with additional information, like the translation author, domain, client, etc.\n<p><div id=\"attachment_397\" style=\"width: 540px\" class=\"wp-caption aligncenter\"><img aria-describedby=\"caption-attachment-397\" decoding=\"async\" loading=\"lazy\" class=\"size-large wp-image-397\" title=\"04\" src=\"http:\/\/hell.pl\/wasaty\/blog\/wp-content\/uploads\/2010\/06\/04-540x180.png\" alt=\"File with unnecessary data removed and additional column added (in this case - translation agency name).\" width=\"540\" height=\"180\" \/><p id=\"caption-attachment-397\" class=\"wp-caption-text\">File with unnecessary data removed and additional column added (in this case - translation agency name).<\/p><\/div><\/li>\n<li>If step 1 was skipped, you have to remove tags &#8211; unfortunately, Excel requires removing them one by one, i.e. you have to enter manually all possible letter combinations (see step 1). Save the file: select <strong>round office button<\/strong> &gt; <strong>Save as<\/strong> &gt; <strong>Other formats<\/strong> &gt; <strong>Unicode text<\/strong>. Close Excel.<\/li>\n<li>In memoQ select an existing memory or create a new one. Choose the <strong>Import from TMX\/CSV<\/strong> command. Select the file saved in step 5.<\/li>\n<li>In the <strong>Translation memory CSV import settings<\/strong> window select proper settings:\n<ul>\n<li>File encoding &#8211; UTF-8<\/li>\n<li>Delimiter &#8211; Tab<\/li>\n<li>First row is header<\/li>\n<\/ul>\n<p>Select proper fields for <strong>Import as source segment<\/strong> (in this case, EN-US) and <strong>Import as target segment<\/strong> (in this case, PL). If there are additional fields (rows) to import, select them and choose proper option in the <strong>Import as other field<\/strong> field.<img decoding=\"async\" loading=\"lazy\" class=\"size-large wp-image-403\" title=\"05\" src=\"http:\/\/hell.pl\/wasaty\/blog\/wp-content\/uploads\/2010\/06\/05-540x444.png\" alt=\"Proper settings for CSV file import into TM in memoQ.\" width=\"540\" height=\"444\" \/><\/li>\n<li>Voila.<\/li>\n<\/ol>\n<p>If there are a lot of segments with tags in the WordFast TM, you\u2019ll get several percent lower concordance matches in memoQ than in WordFast, but you\u2019ll still gain by working in much more comfortable environment without having to buy just another CAT tool. Very similar procedure can be used to translate WordFast TXML files in Trados Studio 2009 &#8211; you can define an xml import file there as well.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>WordFast as a Word add-on (currently version Classic) was always quite popular &#8211; it was not expensive, simple and fully compatible with Trados. However, last year the company released a completely new version &#8211; WordFast Pro is written from the ground up in Java, which makes it possible to use it on different operating systems (Windows, Linux, MacOS). The software works in table layout typical for all new CAT tools and is&#8230; quite specific. It does have a lot of fans, unfortunately I don\u2019t like it (especially the terrible way it handles insertion of tags). Luckily, we don\u2019t have to translate .txml files using WordFast, it can be done with memoQ. Below you\u2019ll find the procedure step-by-step.<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3,6],"tags":[11,47,24,33],"_links":{"self":[{"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts\/411"}],"collection":[{"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/comments?post=411"}],"version-history":[{"count":1,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts\/411\/revisions"}],"predecessor-version":[{"id":514,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts\/411\/revisions\/514"}],"wp:attachment":[{"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/media?parent=411"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/categories?post=411"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/tags?post=411"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}