{"id":680,"date":"2012-09-27T19:41:09","date_gmt":"2012-09-27T18:41:09","guid":{"rendered":"http:\/\/wasaty.pl\/blog\/?p=680"},"modified":"2012-09-27T19:42:37","modified_gmt":"2012-09-27T18:42:37","slug":"dealing-with-sajan-tms-exports","status":"publish","type":"post","link":"http:\/\/wasaty.pl\/blog\/2012\/09\/27\/dealing-with-sajan-tms-exports\/","title":{"rendered":"Dealing with Sajan TMS exports"},"content":{"rendered":"<div style=\"float: right; margin-left: 10px;\"><a href=\"https:\/\/twitter.com\/share\" class=\"twitter-share-button\" data-via=\"Wasaty\" data-count=\"vertical\" data-url=\"http:\/\/wasaty.pl\/blog\/2012\/09\/27\/dealing-with-sajan-tms-exports\/\">Tweet<\/a><\/div>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-511\" title=\"memoq-logo\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2011\/05\/memoq-logo.png\" alt=\"\" width=\"80\" height=\"80\" \/>One of my end clients recently introduced Sajan TM management system. As a result, instead of Trados TMs and TTX files which were a standard previously, I began to receive jobs as an Excel files, containing source segments and pre-translation (from 80% matches up) in separate columns. Additionally there&#8217;s a column with TM match percentage and some additional info\/comments. And the Excel has a macro for propagating exact repetitions. Great. My task is to translate what&#8217;s missing and make sure the 100% matches strictly follow a mandatory glossary (in a separate Excel file). The general attitude was &#8220;we have a great new system, deal with it&#8221;. So I did. And since I&#8217;m sure I&#8217;m not the only one, I decided to share my solution.<\/p>\n<p><!--more--><\/p>\n<p>Excel is a great tool for spreadsheets, but it really sucks for text editing. And checking for correct translation of glossary terms in Excel would take an eternity. Also formatting tags are just text, easy to miss or damage. It was obvious that I have to find a workaround for working with these files in a decent CAT environment. Fortunately the task came out to be quite simple and preparation takes just about 5-15 minutes, depending on number of sheets &#8211; because every chapter of a file for translation is inserted into a separateExcel sheet.<\/p>\n<p>Take a look at the Sajan-exported file (click for full size):<\/p>\n<p><a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Excel_project_1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-681\" title=\"Excel_Sajan_1\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Excel_project_1-300x153.png\" alt=\"\" width=\"300\" height=\"153\" srcset=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Excel_project_1-300x153.png 300w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Excel_project_1-1024x522.png 1024w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Excel_project_1.png 1727w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/p>\n<p>If only 100% matches were pre-translated, I would just copy source and translation into a separate file, create a TM using<a href=\"https:\/\/plus.google.com\/101952942609793410875\/posts\/TkmcQCg2arw\"> this tool<\/a>, then copied source column into target and imported selected columns for translation. However, with non-100% matches the task was more complicated&#8230; or so I thought. Eventually I found an easier and less time-consuming solution. The procedure was designed to import text into memoQ, but should work with any tool accepting Trados bilingual DOC format.<\/p>\n<h3>The procedure:<\/h3>\n<ol>\n<li>Create a backup copy of the source file.<\/li>\n<li>Select and copy to new Excel file columns <strong>Match %<\/strong>, <strong>Basis<\/strong>, <strong>Source<\/strong> and <strong>Target<\/strong> from all Excel sheets.<\/li>\n<li>Remove <strong>Basis<\/strong> column.<\/li>\n<li>Move column <strong>Match %<\/strong> between <strong>Source<\/strong> and <strong>Target<\/strong> (see picture below).<br \/>\n<a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Excel_sajan_2.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-682\" title=\"Excel_sajan_2\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Excel_sajan_2-300x229.png\" alt=\"\" width=\"300\" height=\"229\" srcset=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Excel_sajan_2-300x229.png 300w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Excel_sajan_2.png 800w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/li>\n<li>Select all three columns and copy them (<strong>Ctrl-C<\/strong>).<\/li>\n<li>Open Word.<\/li>\n<li>Click on a triangle below <strong>Paste<\/strong> icon and select <strong>Paste special<\/strong>.<br \/>\n<a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-683\" title=\"Sajan_word_1\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_1.png\" alt=\"\" width=\"359\" height=\"207\" srcset=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_1.png 359w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_1-300x172.png 300w\" sizes=\"auto, (max-width: 359px) 100vw, 359px\" \/><\/a><\/li>\n<li>Select <strong>Non-formatted Unicode text<\/strong>. After pasting, turn on the option to display hidden characters.<br \/>\n<a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_2.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-684\" title=\"Sajan_word_2\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_2-300x267.png\" alt=\"\" width=\"300\" height=\"267\" srcset=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_2-300x267.png 300w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_2-1024x911.png 1024w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_2.png 1067w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/li>\n<li>Insert an empty line before pasted text by pressing <strong>Enter<\/strong> at the beginning of the first line.<\/li>\n<li>Press <strong>Ctrl-H<\/strong> to open <strong>Search &amp; Replace<\/strong> dialog.<\/li>\n<li>Click<strong> More<\/strong> to display additional dialog options.<\/li>\n<li>Select <strong>Use wildcards<\/strong> check box.<\/li>\n<li>In the <strong>Find<\/strong> field enter: <strong>^9([0-9]*)^9<\/strong><\/li>\n<li>In the <strong>Replace with<\/strong> field enter: <strong>&lt;}\\1{&gt;<\/strong><\/li>\n<li>With cursor in the <strong>Replace with<\/strong> field click <strong>Format &gt; Style<\/strong> and select <strong>tw4winMark<\/strong>.<br \/>\n<em>If you don&#8217;t have this style on a style list, open any bilingual Trados .doc or .rtf file and copy a line or two at the beginning of your file. Then try again &#8211; tw4winMark style will be on styles list.<\/em><\/li>\n<li>Click Replace all.<\/li>\n<li>In the <strong>Find<\/strong> field enter: <strong>^13<\/strong><\/li>\n<li>In the <strong>Replace with<\/strong> field enter: <strong>&lt;0}^13{0&gt;<\/strong><br \/>\n<em>The field must still have<strong> tw4winMark<\/strong> style.<\/em><\/li>\n<li>Click <strong>Replace all<\/strong>.<\/li>\n<li>Remove empty first line and any additional text pasted in step 15 (if any).<br \/>\n<a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_3.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-686\" title=\"Sajan_word_3\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_3-300x267.png\" alt=\"\" width=\"300\" height=\"267\" srcset=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_3-300x267.png 300w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_3-1024x911.png 1024w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_word_3.png 1067w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/li>\n<li>Save the file as Word 97-2003 (.doc) or .rtf file.<\/li>\n<li>Start memoQ (or use other CAT tool).<\/li>\n<li>Create a project in memoQ, add document saved in step 22.<br \/>\n<a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_memoQ_1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-690\" title=\"Sajan_memoQ_1\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_memoQ_1-300x160.png\" alt=\"\" width=\"300\" height=\"160\" srcset=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_memoQ_1-300x160.png 300w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_memoQ_1-1024x546.png 1024w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_memoQ_1.png 1073w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/li>\n<li>Open the file for translation, then click <strong>Format &gt; Run regex tagger<\/strong>.<\/li>\n<li>In the <strong>Regular expression<\/strong> field enter: {.*?}<\/li>\n<li>Click <strong>Add<\/strong>.<\/li>\n<li>Click <strong>Run tagger now<\/strong> (click <strong>OK<\/strong> in the warning dialog).<br \/>\n<a href=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_memoQ_2.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-687\" title=\"Sajan_memoQ_2\" src=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_memoQ_2-300x159.png\" alt=\"\" width=\"300\" height=\"159\" srcset=\"http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_memoQ_2-300x159.png 300w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_memoQ_2-1024x542.png 1024w, http:\/\/wasaty.pl\/blog\/wp-content\/uploads\/2012\/09\/Sajan_memoQ_2.png 1075w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/li>\n<li>Translate\/review, importing Excel glossary into a project term base.<br \/>\n<em>If no review of 100% matches is needed, you can sort the file by match percentage, select all 100% matches and confirm them in one go by pressing <strong>Ctrl-Enter<\/strong>.<\/em><\/li>\n<li>When finished, open original Excel file again.<\/li>\n<li>For all sheets select <strong>Source<\/strong> column, copy to <strong>Target<\/strong> column. Save.<\/li>\n<li>Open the Excel file in memoQ importing only target columns for each sheet (<strong>Add document as<\/strong>).<\/li>\n<li>Pre-translate file, perform QA.<\/li>\n<li>Export finished file.<\/li>\n<\/ol>\n<p>Of course the procedure creates some additional overhead, but at the same time it makes it possible to work with received files in a decent environment with all the features of a modern CAT tool, including QA on tags. Of course you can also save the file as XLIFF to perform an additional QA in an external tool, like Xbench or CheckMate.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Tweet One of my end clients recently introduced Sajan TM management system. As a result, instead of Trados TMs and TTX files which were a standard previously, I began to receive jobs as an Excel files, containing source segments and pre-translation (from 80% matches up) in separate columns. Additionally there&#8217;s a column with TM match &hellip; <\/p>\n<p><a class=\"more-link btn\" href=\"http:\/\/wasaty.pl\/blog\/2012\/09\/27\/dealing-with-sajan-tms-exports\/\">Continue reading<\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3,6],"tags":[9,20,24],"class_list":["post-680","post","type-post","status-publish","format-standard","hentry","category-narzedzia","category-wskazowki","tag-cat","tag-memoq","tag-porady","item-wrap"],"_links":{"self":[{"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts\/680","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/comments?post=680"}],"version-history":[{"count":8,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts\/680\/revisions"}],"predecessor-version":[{"id":694,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/posts\/680\/revisions\/694"}],"wp:attachment":[{"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/media?parent=680"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/categories?post=680"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/wasaty.pl\/blog\/wp-json\/wp\/v2\/tags?post=680"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}