How to extract text from a specific column of several tables in a Word document?

Hello! I have an MS Word file with several tables in it. The file also contains paragraphs that don't need to be translated/extracted.

I only need to extract the text in the columns with the "Translation" header of each table. Is there a way to extract only this text automatically?

Here is an image to clarify my case.

Screenshot of an MS Word document showing two tables with headers ID, Original, and Translation. Text outside the tables reads 'This is a paragraph that doesn't need to be translated.' and 'This is another paragraph that doesn't need to be translated.'

Thanks for any help!



Generated Image Alt-Text
[edited by: Trados AI at 11:51 AM (GMT 0) on 29 Feb 2024]
emoji
Parents Reply
  •  

    The original file contains 133 pages with over 50 tables. Considering this, I believe it would be quicker to manually select, copy, and paste the text from the "Translation" columns into a txt file, rather than selecting the text that doesn't need to be translated and marking them all as hidden.

    Probably quicker to do it in Trados Studio.  You can easily filter on the tables, copy source to target for all f them, then just translate the ones that are in the Translation column.  I reckon that would be faster and save a lot of messing around.

    Another more technical solution, if you are that way inclined, would be to unzip the docx, take out the underlying XML and create a custom XML filetype to handle the translation column only.

    But frankly, apart from the enjoyment of tinkering with a file like that to find a more satisfying way to do it, I really think just filtering is the fastest at the end of the day.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
Children