Token interpretation (OpenAI)

Dear Community!

I am currently testing GPT-3.5 Turbo (OpenAI), and many questions are coming up...

How are tokens calculated? I need to assess approximate costs, but something is definitely wrong:

Example: 

I performed a pretranslation of 2,549 words; 36,829 tokens were counted, corresponding to (×4) roughly 147,316 characters... while the entire text had only 14,230 characters. How come?

OK, the numbers are roughly doubled, since it's not only the output text that is counted but the input as well. But then it should be around 4,000 tokens (~30,000 characters), not 37,000 tokens (~147,000 characters)...

I am completely lost... and then, in the breakdown, the OpenAI account shows five times more input than output...

For this test I activated neither the terminology-aware nor the multiple-translation option. I guess these could lead to a considerable increase in input tokens... would they?

If someone understands this issue, please clarify.

It is obvious that an exact token calculation and estimation is not possible. But at least the numbers should be more or less clear...

Thank you in advance!



  •  

    Here's the answer from asking the very tool you're using :-)  Perhaps that helps to explain somewhat?


    Tokens and cost estimation in GPT models

    This confusion is quite common — token counts rarely align neatly with word or character counts. Here’s what’s happening:

    1. What tokens actually are
      Tokens are not characters or words. They are chunks of text that represent how the model processes language. On average:

      • 1 token ≈ 4 characters in English text, or

      • 1 token ≈ ¾ of a word.

      However, this varies depending on the language, punctuation, spacing, and formatting. For example, “don’t” counts as two tokens (don + ’t), and numbers, markup, or special characters can multiply token counts quickly.

    2. Why your count seems inflated
      If your text had ~2,500 words or ~14,000 characters, 36,000–37,000 tokens does sound high — but likely includes prompt and system text automatically sent with your request.
      Many interfaces (such as translation tools or connectors) send extra metadata or instructions under the hood, often tripling or quadrupling the visible text length. That means what you see as “input” is only part of what the model actually receives.

    3. Input vs. output
      OpenAI charges separately for input (your prompt) and output (the model’s response).
      If you see 5× more input than output, it usually means your application sends a large prompt (context, settings, memory, translation pairs, etc.) each time. You can check this in your developer console or logs if available.

    4. Terminology and multi-translation features
      Yes, those would increase input tokens further, since they expand the prompt with additional instructions or examples.

    5. Estimating cost
      For GPT-3.5-Turbo, you can approximate:

      • 1,000 tokens ≈ 750 words (roughly).

      • To estimate cost, divide total tokens by 1,000 and multiply by the model’s rate.

Example: 37,000 tokens ÷ 1,000 × $0.0005 (input rate per 1K tokens) ≈ $0.0185 for that request.

    6. Practical takeaway
      You can’t directly map tokens to visible characters because the model processes much more than what’s displayed. The only reliable measure is the token count shown in your OpenAI usage dashboard.
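The rules of thumb and the billing model above can be sketched in a few lines. This is a heuristic illustration only: the ratios are the rough averages quoted above (the real tokenizer, e.g. OpenAI's tiktoken, gives exact counts), and the rates are example values, not current prices.

```python
# Rough sketch combining the rules of thumb above with the billing model.
# All ratios and rates here are illustrative assumptions — check the
# tokenizer (e.g. tiktoken) and the current pricing page for real numbers.

def estimate_tokens(text: str) -> int:
    """Heuristic only: the larger of chars/4 and words * 4/3."""
    return round(max(len(text) / 4, len(text.split()) * 4 / 3))

def estimate_cost(input_tokens: int, output_tokens: int,
                  in_rate: float = 0.0005, out_rate: float = 0.0015) -> float:
    """Input and output are billed separately, per 1,000 tokens.
    The default rates are placeholders, not current prices."""
    return (input_tokens / 1000) * in_rate + (output_tokens / 1000) * out_rate

text = "Tokens are chunks of text, not characters or words."
print(estimate_tokens(text))                      # heuristic token count
print(round(estimate_cost(30000, 7000), 4))       # mostly-input request
```

Note how a request with far more input than output is still dominated by the input cost even though output tokens are usually priced higher per 1K.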


    Paul Filkin | RWS

    Design your own training!
    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • Dear Paul!

    Thank you, very useful for general understanding of the issue. 

    If your text had ~2,500 words or ~14,000 characters, 36,000–37,000 tokens does sound high — but likely includes prompt and system text automatically sent with your request.

As for this, I have not modified the prompt; in that case it was the general "default translation" one. So yes, it does have some descriptive text, but not that long (or maybe it does not depend on the text length directly...).

    Many interfaces (such as translation tools or connectors) send extra metadata or instructions under the hood,

    Under translation tools you mean Studio itself, right? But then:

    If you see 5× more input than output, it usually means your application sends a large prompt (context, settings, memory, translation pairs, etc.) each time

    Settings??? You mean the general Studio settings? 

    Memory??? You refer to the translation memory?? It is not supposed to "participate" in the process.

    As for point 4: if I indicate to apply the terminology database, will the plugin process and count ALL the terms available in it?

    The only reliable measure is the token count shown in your OpenAI usage dashboard.

Exactly, but that can be consulted only a posteriori :-).

    Thank you for your explanations, always very useful, Paul!

    Best wishes!

  •  

I guess the best way to see what is being used would be to use a tool like Fiddler. If you do that you'll be able to see exactly what gets sent with every call, and also what gets sent back. Then you can spend time analysing your data to understand it better.
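The idea behind a tool like Fiddler can be shown with a stripped-down sketch: a local HTTP listener that records whatever body a client sends to it. A real proxy also forwards the request onward to the actual API; this minimal version (standard library only, all names my own) only captures, which is enough to inspect what your application sends.

```python
# Minimal sketch of the capture side of a debugging proxy like Fiddler.
# Assumption: your client can be pointed at a local base URL; a real
# proxy would also forward the request to the upstream API.
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.request import Request, urlopen

captured = []  # request bodies seen by the listener

class CaptureHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        captured.append(self.rfile.read(length))   # record the raw body
        self.send_response(200)
        self.end_headers()
        self.wfile.write(b"{}")                    # dummy reply

    def log_message(self, *args):                  # silence default logging
        pass

server = HTTPServer(("localhost", 0), CaptureHandler)  # port 0 = any free port
threading.Thread(target=server.serve_forever, daemon=True).start()

# Simulate a client call (your app would use this URL as its API base):
url = f"http://localhost:{server.server_address[1]}"
payload = json.dumps({"messages": [{"role": "user", "content": "Hello"}]}).encode()
urlopen(Request(url, data=payload, headers={"Content-Type": "application/json"}))

print(captured[0].decode())  # the exact body the "API" received
server.shutdown()
```

Seeing the raw body is exactly what reveals any wrapper prompt, settings, or examples a tool adds around your visible text.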

    I don't know if it's free anymore, but this should be helpful:

    05 What is Fiddler and how to use it

    The article is in the dev community so you might need to request access first if you can't see it:

     Trados Studio Developers 

    Paul Filkin | RWS

  • Depending on the plugin settings, segments may be sent multiple times; however, I assume that option is not enabled. I suspect the discrepancy is mainly caused by the wrapper prompt the plugin includes with each segment, which is normal for API calls (in contrast to chats).
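To illustrate the wrapper-prompt effect just described: if the same instruction prompt is re-sent with every segment, it is billed once per segment, so the billed input can dwarf the visible text. The numbers below are made up purely for illustration; the real wrapper size depends on the plugin and its settings.

```python
# Back-of-the-envelope sketch of how a per-segment wrapper prompt
# multiplies billed input tokens. WRAPPER_TOKENS is a made-up figure.
WRAPPER_TOKENS = 250  # hypothetical instruction prompt sent with each segment

def total_input_tokens(segment_tokens: list[int],
                       wrapper: int = WRAPPER_TOKENS) -> int:
    """Each API call = wrapper prompt + one segment."""
    return sum(wrapper + s for s in segment_tokens)

# 100 segments of ~35 tokens each (roughly 2,500 words split into segments):
segments = [35] * 100
visible = sum(segments)                # tokens in the text you can see
billed = total_input_tokens(segments)  # tokens actually sent and billed
print(visible, billed, round(billed / visible, 1))
```

Under these assumed numbers the billed input is several times the visible text, which is the same mechanism that can make the input/output ratio in the usage breakdown look so lopsided.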

  •  

You inspired me to look closer at this question of being able to see the tokens... you're not the only one asking. Perhaps this is worth a look: https://multifarious.filkin.com/2025/11/11/trados-ai-monitor/

    Paul Filkin | RWS

Many thanks, I will check it out more deeply in the coming days (I am a bit short on time right now) and give you more feedback then, but I love your approach as described in your blog!

Yeah, we can now achieve things with coding LLMs and coding agents that seem unbelievable. And in a fraction of the time.

Because as an avid LLM user I felt too restricted in Trados, I built a whole personal local Laravel/Statamic/PHP app that lets me export the bilingual files, extract the XML, put it in a database segment by segment with some metadata, wrap all that in my own prompt engineering (far exceeding any options possible within the AI plugin itself), send it in chunks to whatever LLM or provider I like, put it back into the XML, and re-import it into Trados.

It is not a hundred percent production-ready, but it works for me. :-) It gives me better results, since I can export termbases to my app and incorporate them, as well as any other instructions and project information (I usually translate IT books and need the model to know more about the general subject).

It would of course be better if I could work on the Trados files themselves, but that seemed too difficult.

(BTW, my own book about prompting and a lot of tricks around it is due on December 11 here in Germany, my first IT book of my own since the mid-90s, published by O'Reilly Germany. No prompt collection, of course, but teaching helpful knowledge about LLMs, prompting, and some tools and peripheral things.)

    May I ask which AI coding tools you used?

  •  After installing the monitor, I looked for the "AI" option to add the proxy server but couldn't find it in Trados Studio 2024 Freelance via File -> Options. Is it supposed to work with the Freelance edition?

  •   

    It should work with any version. But you’re looking for a non-existent option. What makes you think there is one?

    On the coding tool… mostly Claude but I dip into others depending on the context of the problem I’m trying to solve.

    Paul Filkin | RWS

  •  Your app makes me think there is one:

    A dialog box titled 'Trados AI Monitor Started' with a message: 'Monitoring started! Configure Trados Studio to use proxy: File -> Options -> AI -> Use proxy server. Host: localhost. Port: 8888. Just use Trados Studio normally - all OpenAI calls will be captured.' An OK button is at the bottom right.

    I'm not familiar with proxy servers, I must admit ...

  •   

Right! That's something I forgot to remove! It's only text: I stopped using that approach and forgot to take the original message out.

    The proxy server starts once you start monitoring.

    Paul Filkin | RWS

  •   I really like it! I personally never thought much about the token count because I knew enough about the way such prompts are engineered, but I find it very interesting to be able to see the complete messages to better understand what exactly happens.

    I believe that your tool will become even more important the more direct API calls are made from Trados Studio itself or whatever plugin is using LLMs in the future.

    Another curious coder question (if I may): Which kind of licensing and registering system did you use for this app?

  • BTW, one small detail ... I would change the following German translations:

    Überwachung start -> Überwachung starten (even better: Überwachung beginnen)

    Überwachung stopt -> Überwachung stoppen (even better: Überwachung beenden)

  •  

    Another curious coder question (if I may): Which kind of licensing and registering system did you use for this app?

Good question... I actually built my own. In the spirit of being able to do so much with AI, I extended that principle into my WordPress installation and built several plugins that let me do many things I wanted without having to pay for them :-)

    So the licensing is there for two reasons:

1. I was interested to see whether I could create a reasonably secure licence, without the need for server checks, that recognised the user in the About box, as you will have seen.
2. The reason for the recognition is that it's how I wanted to control support. I also have a support platform (just a simple one) on my WordPress site, and I'll support questions from licensed users of my apps.

    In reality, it's a bit of fun, I learned a lot... and in practice I think it works quite well.

    Überwachung start -> Überwachung starten (even better: Überwachung beginnen)

    Überwachung stopt -> Überwachung stoppen (even better: Überwachung beenden)

    Thank you for that... will be in the next release :-)

    Paul Filkin | RWS

  •  

Thanks for sharing! I think that licensing system is quite cool. :-)

    I build most of my own websites (and client websites together with some colleagues) nowadays with the CMS Statamic in Laravel, as I’m a fan of that framework. I’m moving away from WordPress (felt mostly too limited there and the bunch of plugins overwhelming) and also enjoy the increased power I get by leveraging AI and AI agents (mostly using Windsurf in PhpStorm and as a standalone editor, mainly with Claude 4.5).

Do you think we will see more and increased usage of LLMs in Trados Studio and/or plugins soon? And more degrees of freedom regarding model choice and prompt engineering?

  •  

    I build most of my own websites (and client websites together with some colleagues) nowadays with the CMS Statamic in Laravel

Looks pretty interesting. I've had a WordPress platform for my blog for over a decade now and transitioned to a hosted version where I have more control, so I'll stick with that for my own needs for now. The number of plugins is overwhelming and you rarely get exactly what you need... or you get far too much and it's bloat and advertising. So I started using Code Snippets to bend them, and eventually realised it was simpler to build my own complete plugins; I've slowly but surely been replacing the ones I had with my own, which do only what I need.

    Do you think we will see more and increased usage of LLMs in Trados Studio and/or plugins soon? And more grades of freedom re. model choice and prompt engineering?

    I think it’s important to separate what Trados Studio itself is responsible for from the level of freedom you’re looking for.  Studio is used by very large enterprises as well as individual freelancers, so anything built into the core product has to meet strict requirements around governance, consistency, data protection and long-term support.  That naturally limits how much unconstrained model choice or prompt experimentation we can expose directly in the out-of-the-box features.

    However, this doesn’t mean there’s no flexibility.  Our aim is to provide a solid, reliable platform while enabling far more freedom through the ecosystem.  Third-party developers can already build plugins that offer broader model selection, custom prompts and specialised workflows, and several of these are available today on the AppStore.  That’s where we see the most room for rapid innovation and experimentation.

    At the same time, we are expanding our own AI capabilities in a more controlled, enterprise-grade way.  Features such as batch Generative Translation, Smart Review, Generative Subtitling, Content Analysis etc. already sit on top of a huge selection of modern LLMs with the right safeguards, and this will continue to evolve.

    So yes - I think LLM usage will increase across Studio and the wider ecosystem.  The difference is simply where freedom sits: core Trados features must remain stable and governed, while plugins and integrations can deliver the flexibility and customisation many users want.

    But  can probably provide a better insight than me.

    Paul Filkin | RWS
