Define 'word' boundaries for a better word count in Trados

Hi,

Today I got a small file from a client who uses MemoQ. The word counts did not match: Trados reported 140 words, MemoQ only 123.

It turned out that the placeholders were badly defined, so variables in square brackets were not converted, which accounted for 5 words. That still leaves a difference of 12 words (about 10%) between the two tools.

When I took a closer look, I found that the text references about 6 EU standards, and Trados counted EACH number and string between and surrounding the '/' as 1 word.

Is there a way to define these via regex so that each one is counted as 1 word? The same would apply to quite long standard names (e.g. DIN 2137-1:2018-11 or even longer ones), badly formatted URLs and some other strings.
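To illustrate the kind of rule I have in mind, here is a minimal Python sketch. It is not how Trados counts internally: the separator rule (whitespace and '/'), the two regex patterns and the sample sentence are all my own assumptions, chosen only to show how a "protected" pattern would change the count.

```python
import re

# Assumed counting rule for illustration only: whitespace and '/' split words.
SEPARATORS = re.compile(r"[\s/]+")

# Hypothetical patterns for strings that should count as a single word.
PROTECTED = [
    re.compile(r"\bDIN \d[\d:-]*"),       # e.g. "DIN 2137-1:2018-11"
    re.compile(r"\b\d{4}/\d+/[A-Z]+\b"),  # e.g. a reference written like "2006/42/EC"
]


def naive_count(text):
    """Count every non-empty chunk between separators as one word."""
    return len([token for token in SEPARATORS.split(text) if token])


def protected_count(text):
    """Collapse each protected match into one placeholder word, then count."""
    for pattern in PROTECTED:
        text = pattern.sub("X", text)
    return naive_count(text)


sample = "See DIN 2137-1:2018-11 and 2006/42/EC for details."
print(naive_count(sample))      # 9
print(protected_count(sample))  # 6
```

With only two references in one short sentence the count already drops from 9 to 6, which is the same effect I see on a larger scale in my file.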

Just to make sure: no, converting them to tags or untranslatable text is not an option, as they need to be adapted (e.g. by adding word joiners and no-break spaces to avoid wrapping).
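As a rough sketch of the adaptation I mean, the Python snippet below replaces ordinary spaces with no-break spaces (U+00A0) and adds a word joiner (U+2060) after each hyphen, slash and colon. The function name and the choice of characters are assumptions for illustration; where the joiners actually need to go depends on the layout engine of the target format.

```python
import re

NBSP = "\u00a0"         # NO-BREAK SPACE
WORD_JOINER = "\u2060"  # WORD JOINER


def make_unbreakable(designation):
    """Return the designation with line-break opportunities suppressed."""
    # Ordinary spaces become no-break spaces.
    text = designation.replace(" ", NBSP)
    # A word joiner follows every '-', '/' and ':' (assumed break points).
    return re.sub(r"([-/:])", r"\1" + WORD_JOINER, text)


adapted = make_unbreakable("DIN 2137-1:2018-11")
print(ascii(adapted))
```

The visible text stays the same, but the string still has to be editable in the target segment, which is why a locked tag does not work for me.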

I know you can adjust some word count settings via the TM settings (language resources, e.g. 'count as word if words contain'), but I can't see any option to add regex or similar to define rules like the ones above. Or am I missing something?

Best regards,

Pascal



clarification of additional settings
[edited by: Pascal Zotto at 2:14 PM (GMT 0) on 20 Jan 2025]

    What I meant (see above), as I understand it, is this: you first modify the DIN standard notation in the source as needed, using CleanUp Task. Then you "protect" the numerical part of the standard (e.g. "535/2137-1:2018-11") as a tag. As a result, the standard is counted as one word and is written the way you need it. I agree with Paul: for special use cases like yours, it would be desirable to be able to define word boundaries freely, but since that is not an option at this point, this looks like a viable workaround.

    Screenshot of Trados Studio interface showing a segment with the text 'DIN 5352137-1:2018-11' highlighted in purple and a warning message indicating 'DIN' counts as one word, tag as none.
