Any advice on how to separate, in a Word file, source paragraphs followed sequentially by their translation

Hello,

I wonder if anyone may have a suggestion to solve this problem I’m facing:

I have received a large Word file in which a paragraph in the source language is followed by the corresponding translation, followed by another paragraph in the source language and its translation, and so on. That’s the only document available.

Out of this file we need to create a translation memory with the translations provided within.

Using Alignment with the file as it is would be beyond messy, I think.

Besides brute force, is there by any chance some way of separating/extracting the source language from the translations?

Thank you in advance for any advice/suggestion you may have.

Gilberto

  • In that case maybe you start a different way. It will require some manual work, but might give you what you need.

    First, make sure ALL non-printing characters are displayed, including hidden text. Then go through the document, select the English paragraphs and press CTRL+SHIFT+H. This formats the text as hidden. You can of course use any other text attribute that does not change the formatting; highlighting the text would be one option. When done, save the document. Now search for the text attribute you used, replace with ^&^t, and then use the other part I suggested before. When done, remove the text attribute you added. If you used "hidden", run a search and replace from "hidden" to "not hidden". For this replacement, both the search and the replace fields are simply left empty.

    I understand this is a lot of work, but at the end of the day this could give you what you need, as creating a TMX from a table is a piece of cake.
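
    For readers who prefer a script for that last step, here is a minimal sketch of turning source/target pairs into TMX. It assumes the pairs have already been exported as plain text with one pair per line, separated by a tab. The function names, the `creationtool` value and the language codes are placeholders chosen for illustration, not something given in this thread.

```python
import xml.etree.ElementTree as ET

# ElementTree serialises this qualified name as the reserved xml:lang attribute.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def parse_tab_rows(text):
    """Split tab-separated lines into (source, target) pairs, skipping blank lines."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            continue
        source, sep, target = line.partition("\t")
        if not sep:
            raise ValueError(f"no tab separator in line: {line!r}")
        rows.append((source.strip(), target.strip()))
    return rows


def rows_to_tmx(rows, src_lang, tgt_lang):
    """Build a minimal TMX 1.4 document (as a string) from (source, target) pairs."""
    tmx = ET.Element("tmx", version="1.4")
    ET.SubElement(tmx, "header", {
        "creationtool": "example-script",  # hypothetical tool name
        "creationtoolversion": "0.1",
        "segtype": "paragraph",
        "o-tmf": "plain-text",
        "adminlang": "en-US",
        "srclang": src_lang,
        "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")
    for source, target in rows:
        tu = ET.SubElement(body, "tu")
        for lang, segment in ((src_lang, source), (tgt_lang, target)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = segment
    return ET.tostring(tmx, encoding="unicode")


if __name__ == "__main__":
    # The language codes below are assumptions; use the ones for your language pair.
    sample = "Hello\tHallo\nGood morning\tGuten Morgen\n"
    print(rows_to_tmx(parse_tab_rows(sample), "en-US", "de-DE"))
```

    The resulting file should be checked in your CAT tool before it is imported into a translation memory, since any pair that was mismatched in the table will be mismatched in the TMX as well.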

    _________________________________________________________

    When asking for help here, please be as accurate as possible. Please always remember to give the exact version of product used and all possible error messages received. The better you describe your problem, the better help you will get.

    Want to learn more about Trados Studio? Visit the Community Hub. Have a good idea to make Trados Studio better? Publish it here.

  • Thanks again, Jerzy!

    I'll try that. Yes, it'll be a lot of work but I trust it will be worthwhile.
    Your help and suggestions are very much appreciated!

    Gilberto

  • Yes, a lot of work. But certainly less than trying to align the files afterwards.


  • Just my opinion, but I think you're overthinking this. By the time you've been through hiding text in the source files you could have aligned them quite easily. Based on your example I don't think it's so messy:

    The table realigns itself as you work through it and because your document is formatted in the way it is it's pretty simple to see where you are... even if you don't have a clue what it means (like me!).

    I think I can see where Kelly was coming from, but as your document is not really formatted paragraph by paragraph as you initially said, I think an automated solution would not be so easy. It requires too much checking because sometimes it's several paragraphs before the text changes from one language to another. If it were consistent, then I guess a macro could be used to pick out the paragraph markers and separate the texts. Probably... although I hate to try and second guess Kelly... he was thinking along these lines.
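
    To illustrate the consistent case only, here is a rough sketch in Python rather than a Word macro. It assumes the paragraphs have already been extracted as a list of strings and that they alternate strictly between source and translation, which, as noted above, is not true of the document in question. The function name is made up for this example.

```python
def pair_alternating(paragraphs):
    """Pair strictly alternating paragraphs as (source, translation) tuples.

    Assumes paragraph 1 is source, paragraph 2 its translation, and so on.
    Empty paragraphs are ignored.
    """
    cleaned = [p.strip() for p in paragraphs if p.strip()]
    if len(cleaned) % 2 != 0:
        # An odd count means the strict alternation assumption is broken somewhere.
        raise ValueError(f"expected an even number of paragraphs, got {len(cleaned)}")
    return list(zip(cleaned[0::2], cleaned[1::2]))


if __name__ == "__main__":
    sample = ["First source paragraph.", "Its translation.", "",
              "Second source paragraph.", "Its translation too."]
    for source, translation in pair_alternating(sample):
        print(source, "->", translation)
```

    As soon as one language runs over several paragraphs before the other takes over, the pairs drift out of step, which is exactly why this needs so much checking on an inconsistent file.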

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub
