define 'word' boundaries for better word count in Trados

Hi,

Today I⁠ got a small file from a client using MemoQ. When it comes to the word count, it showed that Trados had 140 words instead of the 123 from MemoQ.

It turned out, that placeholders were defined badly thus variables between square brackets did not get converted and this was a difference of 5 words. So there’s still a difference of 12 words (10%) between the two.

When I had a closer look it showed that there were about 6 EU standards referenced in the text and Trados counted EACH number and string between and sourrinding the '/' as 1 word.

Is there a way to define these via regex as 1 word so they get counted as 1 word? The same would go for quite long Standard names (e.g. DIN 2137-1:2018-11 or even longer ones) or badly formatted URLs and some other strings.

Just to make sure: No, it’s not an option to convert them to tags or untranslatable text as they need to be adapted (e.g. adding word joiners and no-break spaces to avoid wrapping)

I  know you can adapt some missing word count settings via TM settings (language resources e.g. ’count as word if words contain’) but I can’t see any option to add regex or similar to define rules like above or am I⁠ missing something?

Best regards,

Pascal



clarification of additional settings
[edited by: Pascal Zotto at 2:14 PM (GMT 0) on 20 Jan 2025]
emoji
Parents Reply Children
  •  

    because it’s about counting source words, not about translation.

    Fair enough.  In that case I can confirm there is no solution other than to consider adopting a different tool for more of a standardised approach for wordcounts... Practicount for example.  I don't know how customisable that is, but at least you will have the same wordcount for all the tools you work with.  And if that doesn't help because  you simply want to be able to mirror the wordcount from a tool that you were provided with then I think you'd need to build your own plugin for that, or you can try submitting an idea for this in the ideas site.

    I’m not sure about it coming out soon from Tommi as the thing I heard from him what that he does not work on it anymore as he’s busy with OPUS CAT.

    I spoke with him yesterday over email and he confirmed this morning ;-)

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
  •   

    Thanks for the tip. I’ll have a look at Practicount but of course it would be best for me to have everything in one app (Trados). So maybe I’ll go for a new plugin.

    Great to hear Tommi is getting back with TI+. My version became way too powerful to offer it to the broad public as it ended up in a grammar based MT with way better results than existing MT and AI combinations. At least for the language combinations which I started programming the grammar and spelling rules for.

    Edit: Practicount does not help at all as it only counts words/chars/lines/pages but does not offer at least repetitions.

    emoji