"Number is missing in target segment or is not properly localized"

Hi, I am a free lance translator for Czech language, and my very important question about Studio 2019 is: How can I manage the message: "Number is missing in target segment or is not properly localized". In fact the Studio 2019 (as all previous versions of Trados/Studio) doesn't know the rules to respect using the numbers in the documents in Czech. So, the above mentioned message is very null and void/disturbing/ineffective.

As translator, am I able teach the Studio to manage the numbers as needed?

If not, should it be possibile for the SDL make known to everybody this very fatal failure?

Many thanks in advance for all useful reactions

Miroslava

Parents
  • Hello  ,

    Studio should recognise the correct number formatting based on the languages of your project.

    Are the numbers underlined (blue)? in the source prior to translation?- e.g.

    Screenshot of Trados Studio showing a number '50,498,371' underlined in blue, indicating automatic number recognition.Blank screenshot with no visible content.

    For automatic number recognition to work, you must have a valid TM attached to your project.

    Then check your settings in Verification/QA checker-

    Screenshot of Trados Studio Project Settings with 'Numbers' selected, showing an error in 'Check that correct number format is applied in the target segment'.

    I also note that you seem to imply you have had issues with number recognition in previous versions as well, so I would check your own PC settings as well as these are picked up from the 'Region and Language' settings in Windows as follows-

    Screenshot of Windows 'Region and Language' settings with multiple green arrows pointing to format settings, indicating areas to check for number recognition issues.

     

    Does this help?

     

     

    Lydia Simplicio | RWS Group

    _______
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 3:35 PM (GMT 0) on 28 Feb 2024]
  • Hello Steven,

    You probably know, but this was NOT an answer at all to a very legitimate question - why does the QA verification still not work for numbers in several languages (not even in EN). Pointing to Windows settings is ridiculous since a lot of us use Studio to translate between various combinations - and numbers are mishandled in each and every of them.

    Why not at least point to regular expressions, which are a working way getting around those issues?

    I too would like to see more examples of regular expressions specific for handling numbers in Studio.
    Yes, we know "www.regular-expressions.info", but that is not very Studio-specific - I have no time to learn the whole system just to create a few rules in Studio - which should be there in the first place.

    Best regards,
    Tomi
  • Hi and

    I think it's quite understandable that started into this by pointing out the basics. The QA number check and in a way also the Number Verifier App are basic and I use them when I translate documents with simple number formats. The Number Verifier actually is quite powerful, but it also has its limits. (By the way, Number verifier works well for me with German and English including product codes like D1234.)

    You are correct, Tomi, in pointing out that Regex is a way to deal with more complex formats, but I am a bit surprised by what almost sounds like an accusational tone in your post. There is a whole forum about using regex and as far as I can see most questions there are answered in a very timely manner: https://community.sdl.com/product-groups/translationproductivity/f/171

    I think that is quite a service, since you are actually requesting a service which is akin to custom software development, and you would normally have to pay for that...

    So if you have a document with a specific number format, why not post your regex question there? Then store it on your computer as you might be able to use it again in future. Over time you will get a collection of regular expressions that might cover all your normal requirements in all languages you deal with.

    There are very complex formats, like Bible quotations in different countries, and even one and the same country might have more then one standard - your customer might request a specific one. How could an off the shelf solution cover all that?

    So I hope you feel very free to post your regex questions in the above mentioned forum! (They just need to be specific with good examples.)

    All the best

    Daniel

  • Hello Daniel,

    Thank you for pointing to those forums, I have not yet seen this one!

    Yes, my tone was a bit accusational, since I was not talking about anything complex - just usual checks for "space between number and unit" and formatting of number ranges (1 - 2 or 1–2) and such, I am not entrusting Studio with anything too complex. I have been using Studio for 16 years, buying the new versions all the time only to see nothing much is being done with very elementary bugs (and this is one of them), that is where my bitter tone comes from.

    Also, I am one of the few translators that actually use regex expression and who is willing to learn. But in a role of a reviewer I get to waste huge amounts of time to correct these most elementary mistakes - the usual translator's process is this: just run QA, Studio marks every number as an error, ignore it all, leave it to the proofreader. Not too mention how many translators cannot handle the number formatting in repetitions - which should also be set by Studio for specific language combinations.

    Best regards,
    Tomi
  • Hello

    Unknown said:
    Hello Daniel,
    I have been using Studio for 16 years, buying the new versions all the time only to see nothing much is being done with very elementary bugs (and this is one of them), that is where my bitter tone comes from.
     

    I have to disagree. What we are talking about is not a bug, it is a feature you'd like to see implemented, but I think you underestimate the complexity of the feature you are wishing for. On the other hand, using regex will not only allow you to custom-tailor and modify what exactly you want to be alerted to, it even allows you to automate part of your translation using (free) Apps like Regex Autosuggest - and therefore minimizing the chance of mistyped numbers to begin with.

    Maybe it should be the localization engineer's job to provide such expressions, but apart from an adnmittedly steep learning curve at the beginning, regex is not as cryptic as it seems at first. And it's such a powerful tool. After 16 years you could be teaching us all! (Well, me at least.)

    Having said that, this thread was started by a user wishing for different decimal seperators to be accepted/recognized, and this is actually functionality that is included in Number Verifier.

    Glad to see you took your question to the Regex forum!

    Daniel

  • Hello Daniel,

    Yes, I was very pleasantly surprised how quickly guys were responding to help. A mention of Number Verifier is also very helpful - perhaps that should replace the old number QA function which is currently misleading only for several languages.

    I have set to learn it, so I am happy enough with this solutions now!

    Have a nice day,
    Tomi
  • May I add my voice in frustration at this aspect of using Studio?

    What I would like in Studio is just a simple means of checking that any number in the source text is replicated in the target text, without any concern for localisation and with a warning if any number is added.

    Incorrectly formatted numbers are easily picked up during proofreading but missing or added numbers can easily slip through and are a serious error.

    At present I rely on XBench for number checking.

    Neil
  • A case in point: here is a serious error that XBench has just picked up that was drowned amongst the hundreds of false positives thrown up by Trados Studio.

    As far as I can see, under some circumstances, Studio effectively has no number verification unless you want to manually check all the false positives (my purpose is not to complain, but to provide ideas for improvement).

    This is what XBench found, i.e. a serious error that I would have delivered had I relied on Studio's checking.

    Screenshot showing a Numeric Mismatch error in XBench with highlighted numbers: '222,257,002' in source and '222,250,002' in translation.

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 3:35 PM (GMT 0) on 28 Feb 2024]
  • Hello

    I don't think you can just make a statement like this without some sort of justification. I fully agree the QA for numbers in Studio is lacking, mainly because the number recognition is too prescriptive, but I don't see why this woulkd not be picked up. It would be more interesting to know why you were unable to do this, so what settings were you using?

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

  • Hi Paul,

    The problem was caused because the source document was badly formatted, containing a mixture of numbers correctly formatted in French with the non-breaking space and other sections where they were formatted with the period as the thousands separator. The number checking in Studio's standard QA checker was turned off and the following images show the settings in the number verifier plugin. Despite the fact that both the space and period are shown as valid thousands separators, I got hundreds of errors saying "numbers modified/unlocalised". This means that relatively unimportant "unlocalised" errors are mixed up with serious "modified" errors.

    My justification for my statement is that if error checking produces so many false positives that they cannot be examined manually, then there is, in effect, no error checking. It is a fact of life that translators have to deal with badly formatted documents and must do so without introducing errors of substance.

    XBench located the error immediately and produced no false positives.

    I hope this is of use and I am at your disposal if you have further questions.

    Thank you for your time.

    Neil

    Trados Studio project settings window showing number verifier plugin with space and period set as valid thousands separators.

    Trados Studio project settings window with number verifier plugin options, including exclude leading zeros and exclude full stop.

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 3:35 PM (GMT 0) on 28 Feb 2024]
  • Hi

    One simple way to detect such wrong or forgotten numbers and get a distince error message would be this regex:

    Trados Studio screenshot showing a Regular Expression search setup with a description 'Basic Number Check', a warning level, and a condition for combined search expressions.

    This will interpret 123.456,12€ as three individual numbers, but it will flag up a modified number in all but one cases I can think of. (That case would be that no thousands separators are used in the source, but some kind of separator is used in the target.) This is not as comprehensive as XBench, but it would catch most mistakes I think. Again, using Regex Autosuggest might greatly reduce the amount of mistakes introduced into the target text.

    The formatting check might be done as a separate step, again with the advantage of getting a distinct error message:

    (((\d{1,3},)+\d+)(\.\d+))|((\d{1,3},)+\d{3}[ $€]?) ...or something similar would flag up numbers with a comma as thousands separator. (Just check the target.)

    The position of a currency symbol or measurement would be a third regex, maybe (\d [€])|([€]\d) if you want to be alerted if the € symbol is anywhere but right after the last digit.

    I am not saying this rivals Xbench - that is a whole product designed to do nothing but QA, but I think there are simple ways to use Studio's QA checks to safeguard against most number-related QA problems.

    Daniel

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 3:35 PM (GMT 0) on 28 Feb 2024]
Reply
  • Hi

    One simple way to detect such wrong or forgotten numbers and get a distince error message would be this regex:

    Trados Studio screenshot showing a Regular Expression search setup with a description 'Basic Number Check', a warning level, and a condition for combined search expressions.

    This will interpret 123.456,12€ as three individual numbers, but it will flag up a modified number in all but one cases I can think of. (That case would be that no thousands separators are used in the source, but some kind of separator is used in the target.) This is not as comprehensive as XBench, but it would catch most mistakes I think. Again, using Regex Autosuggest might greatly reduce the amount of mistakes introduced into the target text.

    The formatting check might be done as a separate step, again with the advantage of getting a distinct error message:

    (((\d{1,3},)+\d+)(\.\d+))|((\d{1,3},)+\d{3}[ $€]?) ...or something similar would flag up numbers with a comma as thousands separator. (Just check the target.)

    The position of a currency symbol or measurement would be a third regex, maybe (\d [€])|([€]\d) if you want to be alerted if the € symbol is anywhere but right after the last digit.

    I am not saying this rivals Xbench - that is a whole product designed to do nothing but QA, but I think there are simple ways to use Studio's QA checks to safeguard against most number-related QA problems.

    Daniel

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 3:35 PM (GMT 0) on 28 Feb 2024]
Children