How to extract text from a specific column of several tables in a Word document?

Hello! I have an MS Word file with several tables in it. The file also contains paragraphs that don't need to be translated/extracted.

I only need to extract the text in the columns with the "Translation" header of each table. Is there a way to extract only this text automatically?

Here is an image to clarify my case.

[Image: Screenshot of an MS Word document showing two tables with headers ID, Original, and Translation. Text outside the tables reads 'This is a paragraph that doesn't need to be translated.' and 'This is another paragraph that doesn't need to be translated.']

Thanks for any help!



[edited by: Trados AI at 11:51 AM (GMT 0) on 29 Feb 2024]
  •  

    Not in Word itself, no. But if you hide the columns you don't want translated using Word's hidden text attribute, the only text extracted for translation will be the column you do want. Hidden text is not extracted by default.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • Hello,

    To extract text from a specific column of several tables in a Word document, you can use the Microsoft Word filter settings in Trados Studio. Here's how you can do it:

    Step 1: Open Trados Studio and go to "File" > "Options".

    Step 2: In the Options window, navigate to "File Types".

    Step 3: Find and select "Microsoft Word 2007-2019".

    Step 4: Click on "Common".

    Step 5: In the "Processing" section, check the box for "Process tables as text".

    Step 6: Click "OK" to save your changes.

    Now, when you open your Word document in Trados Studio, the text in the tables will be extracted and ready for translation. Please note that this will extract all the text in the tables. If you want to extract text only from specific columns, you might need to manually select the text in those columns for translation.

    I hope this helps! If you have any other questions, feel free to ask.

    Best regards,

    RWS Community AI

  •  

    I suggest you ignore the Trados AI in this case... it's hallucinating! That's both good and bad: good because we can address it, bad because we don't want any hallucinations in the first place! But it will improve over time.

    Paul Filkin | RWS Group

  • Thanks, Paul, for your prompt response. It's a good idea.

    The original file contains 133 pages with over 50 tables. Considering this, I believe it would be quicker to manually select, copy, and paste the text from the "Translation" columns into a txt file, rather than selecting the text that doesn't need to be translated and marking them all as hidden.

    Thanks anyway, and have a wonderful day!

  • Yes, totally. But I cannot find the "reject answer" option in its answer.

  •  

    I already rejected it.

    Paul Filkin | RWS Group

  •  

    "The original file contains 133 pages with over 50 tables. Considering this, I believe it would be quicker to manually select, copy, and paste the text from the "Translation" columns into a txt file, rather than selecting the text that doesn't need to be translated and marking them all as hidden."

    It's probably quicker to do it in Trados Studio. You can easily filter on the tables, copy source to target for all of them, then just translate the ones that are in the Translation column. I reckon that would be faster and save a lot of messing around.

    Another more technical solution, if you are that way inclined, would be to unzip the docx, take out the underlying XML and create a custom XML filetype to handle the translation column only.

    But frankly, apart from the enjoyment of tinkering with a file like that to find a more satisfying way to do it, I really think just filtering is the fastest at the end of the day.
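
For anyone who does want to tinker with the underlying XML, here is a minimal Python sketch of the "unzip the docx" idea, using only the standard library. It is not a Trados Studio file type and not an official method, just a way to pull the text out for inspection. It assumes that the first row of each table is the header row, that the header cell reads exactly "Translation", and that the tables have no merged cells or nested tables. The names `cell_text` and `extract_column` are made up for this example.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by word/document.xml inside a .docx
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def cell_text(cell):
    """Return the text of a table cell, one line per paragraph."""
    paragraphs = []
    for p in cell.iter(W + "p"):
        # A paragraph's text is spread over one or more w:t elements
        paragraphs.append("".join(t.text or "" for t in p.iter(W + "t")))
    return "\n".join(paragraphs).strip()


def extract_column(docx_path, header="Translation"):
    """Collect the text under `header` from every table that has it.

    Assumption: the first row of each table is the header row and
    there are no merged cells (these would shift the column index).
    """
    with zipfile.ZipFile(docx_path) as z:
        root = ET.fromstring(z.read("word/document.xml"))

    results = []
    for tbl in root.iter(W + "tbl"):
        rows = tbl.findall(W + "tr")
        if not rows:
            continue
        headers = [cell_text(c) for c in rows[0].findall(W + "tc")]
        if header not in headers:
            continue  # table without the wanted column: skip it
        idx = headers.index(header)
        for row in rows[1:]:
            cells = row.findall(W + "tc")
            if idx < len(cells):
                results.append(cell_text(cells[idx]))
    return results
```

Calling `extract_column("myfile.docx")` (the file name is a placeholder) returns a list of strings that can be written to a txt file. Note that this only reads the text out; it does nothing to put translations back into the Word document, so filtering in Trados Studio is still the simpler route if you need the translated file back.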

    Paul Filkin | RWS Group

  • Thank you for the wonderful suggestions! I appreciate them very much!
