How to exclude/include certain elements from the source file and therefore from the word count in Trados Studio 2022

Hi Trados Studdio Team,

I would like to know how to exclude/include certain elements (such as numbers, hyperlinks, alt text, etc.) from the word count and I would also like to know how to lock segments and exclude them from the word count.

Below you'll find a screenshot of the Word file I want to exlude hyperlinks from:

Screenshot of a Word document with a list of hyperlinks related to refugee organizations, each hyperlink is labeled with the organization's name.

Now, I try to do this in two different ways:

1.  I go to File Type Identifier > Microsoft Word 2007-2019 > Common > and I unchecked the box  "Extract hyperlink". However, the links were not extracted when I finish creating the project and they still appear in the wordcount
Trados Studio interface showing the translation of the Word document with hyperlinks still included, indicating that the 'Extract hyperlink' option did not work as intended.

1.  I go to File Type Identifier > Microsoft Word 2007-2019 > Embedded content > Enable embedded content processing > Extract in all paragraphs > Tag definition rules > Add... and here is where I no longer know how to proceed

Trados Studio dialog box for adding or editing an embedded content rule with fields for 'Start Tag' and 'End Tag' and options for 'Translate' and 'Formatting'.


Thank you!



Generated Image Alt-Text
[edited by: Trados AI at 3:58 PM (GMT 1) on 4 Apr 2024]
emoji
Parents
  •   

    I tried to exclude text from translation, and it worked quite well for me:

    Screenshot of a Word document with text and a Word Count dialog box showing statistics including 1 page, 81 words, and 432 characters with spaces.

    Using the Word filetype you mention, I arrive here:

    Screenshot of Trados Studio interface highlighting text segments with numerals, hyperlinks, and names excluded from translation.

    Screenshot of Trados Studio file details showing 6 segments, 69 words, 307 characters, and 11 tags with a character-to-word ratio of 4.45.

    Whatever is and is not counted... Trados counts far less, which makes me believe it does NOT count content in tags (would not make much sense, think of formats like IDML...)

    However, word and character count is surprisingly unstandardized: https://multifarious.filkin.com/2022/07/30/character-counts/

    I achieved this using the embedded content processor (like you), excluding [\d-]+ for "phone numbers", I unchecked "extract hyperlinks, which worked fine, and I formatted the names in Word to be "Subtle emphasis" and then excluded all character styles containing "subtle" from translation.

    BTW I think there is a misunderstanding as to what a hyperlink is: The link TEXT is still extracted, but not the link itself. If you extract the link it looks like this:

    Screenshot of Trados Studio interface showing a 'naked hyperlink' excluded from translation and marked with 'ADR' indicating address recognition.

    There are a million ways to achieve what you want to achieve.

    Daniel

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 8:12 PM (GMT 1) on 4 Apr 2024]
Reply
  •   

    I tried to exclude text from translation, and it worked quite well for me:

    Screenshot of a Word document with text and a Word Count dialog box showing statistics including 1 page, 81 words, and 432 characters with spaces.

    Using the Word filetype you mention, I arrive here:

    Screenshot of Trados Studio interface highlighting text segments with numerals, hyperlinks, and names excluded from translation.

    Screenshot of Trados Studio file details showing 6 segments, 69 words, 307 characters, and 11 tags with a character-to-word ratio of 4.45.

    Whatever is and is not counted... Trados counts far less, which makes me believe it does NOT count content in tags (would not make much sense, think of formats like IDML...)

    However, word and character count is surprisingly unstandardized: https://multifarious.filkin.com/2022/07/30/character-counts/

    I achieved this using the embedded content processor (like you), excluding [\d-]+ for "phone numbers", I unchecked "extract hyperlinks, which worked fine, and I formatted the names in Word to be "Subtle emphasis" and then excluded all character styles containing "subtle" from translation.

    BTW I think there is a misunderstanding as to what a hyperlink is: The link TEXT is still extracted, but not the link itself. If you extract the link it looks like this:

    Screenshot of Trados Studio interface showing a 'naked hyperlink' excluded from translation and marked with 'ADR' indicating address recognition.

    There are a million ways to achieve what you want to achieve.

    Daniel

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 8:12 PM (GMT 1) on 4 Apr 2024]
Children