Any advice on how to separate, in a Word file, source paragraphs followed sequentially by their translation

Question

Hello, 
 I wonder if anyone may have a suggestion to solve this problem I'm facing: 
 I have received a large Word file in which a paragraph in the source language is followed by the corresponding translation, followed by another paragraph in the source language which is followed by its translation, so on and so forth. That's the only document available. 
 Out of this file we need to create a translation memory with the translations provided within. 
 Using Alignment with the file as it is would be beyond messy, I think. 
 Besides brute force, is there by any chance some way of separating/extracting the source language from the translations? 
 
 Thank you in advance for any advice/suggestion you may have. Gilberto

Kelly Edward · Accepted Answer

Sounds too easy. 
 Any sample file?

Paul Filkin · Answer

Gilberto V.V. 
 Just my opinion, but I think you're overthinking this. By the time you've been through hiding text in the source files you could have aligned them quite easily. Based on your example I don't think it's so messy: 
 
 The table realigns itself as you work through it and because your document is formatted in the way it is it's pretty simple to see where you are... even if you don't have a clue what it means (like me!). 
 I think I can see where Kelly was coming from, but as your file is not really formatted paragraph by paragraph, as you initially said, I think an automated solution would not be so easy. It requires too much checking because sometimes it's several paragraphs before the text changes from one language to another. If it was consistent then I guess a macro could be used to pick out the paragraph markers and separate the texts. Probably... although I hate to try and second-guess Kelly... he was thinking along these lines.
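
 For the consistent case only, the "pick out the paragraph markers" idea can be sketched in a few lines. This is a minimal Python sketch rather than a Word macro, and it assumes the paragraphs have already been exported as a plain list of strings in document order and that they alternate strictly source, target, source, target. The function name `split_alternating` is made up for illustration.

```python
def split_alternating(paragraphs):
    """Pair up paragraphs that strictly alternate source, target, source, ..."""
    # Drop empty paragraphs so stray blank lines do not shift the alternation.
    cleaned = [p.strip() for p in paragraphs if p.strip()]
    # An odd count means the file cannot be a clean source/target alternation.
    if len(cleaned) % 2 != 0:
        raise ValueError("odd number of paragraphs: file does not alternate cleanly")
    sources = cleaned[0::2]  # 1st, 3rd, 5th, ... paragraph
    targets = cleaned[1::2]  # 2nd, 4th, 6th, ... paragraph
    return list(zip(sources, targets))


pairs = split_alternating(["Good morning.", "Buenos días.", "", "Thank you.", "Gracias."])
for source, target in pairs:
    print(source, "|", target)
```

 As soon as two paragraphs in the same language follow each other, this pairs the wrong texts without complaining (it only catches an odd total), which is exactly why the result would still need checking by eye.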

Alison Field · Answer

Hi Gilberto V.V. 
 Create 2 copies of the Word file, named to indicate first English and second Spanish. 
 In the English document use Find and Replace as follows: 
 Ctrl+H - opens 'Find and Replace' to the Replace tab. 
 Find what: > Format > Language > Spanish (selecting the version of Spanish your document has) 
 and 
 Replace with: > Format > Language > English (selecting the version of English your document has) 
 Leave the 'Find what' text box itself blank (only the language format is set on it) and in the 'Replace with' box, type ^p 
 Then click 'Replace All' 
 This runs a Find and Replace on the Spanish text, replacing it with a paragraph mark, which should then leave each English entry beginning on a new line. 
 Finally, highlight the whole document and double-click on the language title on the bottom bar, which opens the Language dialog where you can 'Mark selected text' as English. Then click OK. 
 Repeat the process in the second file to delete the English text fully and make the whole document Spanish. 
 Then you should be able to use Alignment to produce an SDLXLIFF. 
 You can then check this in the Studio Editor with a new TM added so you can confirm each segment as you check it. Or simply import the SDLXLIFF to a new TM. 
 You may have to use 'trial and error' to make the process work better depending on the textual content. 
 See here for a description of Translation Alignment: www.trados.com/solutions/translation-alignment/ 
 Let us know if this works OK, 
 All the best, 
 Ali
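
 The same language-based separation can be sketched outside Word. This is only an illustration in Python and it rests on assumptions not confirmed in this thread: that each paragraph's proofing language is marked correctly, that it has already been extracted as a `(language, text)` pair in document order (the extraction itself is not shown), and that English is the source and Spanish the target. The function name `pair_language_blocks` and the `"en"`/`"es"` tags are hypothetical.

```python
from itertools import groupby


def pair_language_blocks(tagged, source_lang="en", target_lang="es"):
    """Merge consecutive same-language paragraphs, then pair source and target blocks."""
    # Collapse each run of same-language paragraphs into one block of text.
    blocks = [
        (lang, "\n".join(text for _, text in group))
        for lang, group in groupby(tagged, key=lambda item: item[0])
    ]
    if len(blocks) % 2 != 0:
        raise ValueError("unpaired block: a source or target text is missing")
    pairs = []
    for (lang_a, source), (lang_b, target) in zip(blocks[0::2], blocks[1::2]):
        # Every pair must be a source block followed by a target block.
        if (lang_a, lang_b) != (source_lang, target_lang):
            raise ValueError(f"unexpected language order: {lang_a}, {lang_b}")
        pairs.append((source, target))
    return pairs


tagged = [
    ("en", "First paragraph."),
    ("en", "Second paragraph."),
    ("es", "Primer párrafo."),
    ("es", "Segundo párrafo."),
    ("en", "Third paragraph."),
    ("es", "Tercer párrafo."),
]
for source, target in pair_language_blocks(tagged):
    print(repr(source), "->", repr(target))
```

 Because runs of the same language are merged before pairing, this copes with several source paragraphs followed by several target paragraphs, but it is only as good as the language marking in the document, so the pairs still need checking before they go into a TM.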
