"Number is missing in target segment or is not properly localized"

Hi, I am a free lance translator for Czech language, and my very important question about Studio 2019 is: How can I manage the message: "Number is missing in target segment or is not properly localized". In fact the Studio 2019 (as all previous versions of Trados/Studio) doesn't know the rules to respect using the numbers in the documents in Czech. So, the above mentioned message is very null and void/disturbing/ineffective.

As translator, am I able teach the Studio to manage the numbers as needed?

If not, should it be possibile for the SDL make known to everybody this very fatal failure?

Many thanks in advance for all useful reactions

Miroslava

Parents
  • Hello  ,

    Studio should recognise the correct number formatting based on the languages of your project.

    Are the numbers underlined (blue)? in the source prior to translation?- e.g.

    Screenshot of Trados Studio showing a number '50,498,371' underlined in blue, indicating automatic number recognition.Blank screenshot with no visible content.

    For automatic number recognition to work, you must have a valid TM attached to your project.

    Then check your settings in Verification/QA checker-

    Screenshot of Trados Studio Project Settings with 'Numbers' selected, showing an error in 'Check that correct number format is applied in the target segment'.

    I also note that you seem to imply you have had issues with number recognition in previous versions as well, so I would check your own PC settings as well as these are picked up from the 'Region and Language' settings in Windows as follows-

    Screenshot of Windows 'Region and Language' settings with multiple green arrows pointing to format settings, indicating areas to check for number recognition issues.

     

    Does this help?

     

     

    Lydia Simplicio | RWS Group

    _______
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 3:35 PM (GMT 0) on 28 Feb 2024]
  • Hello Steven,

    You probably know, but this was NOT an answer at all to a very legitimate question - why does the QA verification still not work for numbers in several languages (not even in EN). Pointing to Windows settings is ridiculous since a lot of us use Studio to translate between various combinations - and numbers are mishandled in each and every of them.

    Why not at least point to regular expressions, which are a working way getting around those issues?

    I too would like to see more examples of regular expressions specific for handling numbers in Studio.
    Yes, we know "www.regular-expressions.info", but that is not very Studio-specific - I have no time to learn the whole system just to create a few rules in Studio - which should be there in the first place.

    Best regards,
    Tomi
  • Hi Paul,

    The problem was caused because the source document was badly formatted, containing a mixture of numbers correctly formatted in French with the non-breaking space and other sections where they were formatted with the period as the thousands separator. The number checking in Studio's standard QA checker was turned off and the following images show the settings in the number verifier plugin. Despite the fact that both the space and period are shown as valid thousands separators, I got hundreds of errors saying "numbers modified/unlocalised". This means that relatively unimportant "unlocalised" errors are mixed up with serious "modified" errors.

    My justification for my statement is that if error checking produces so many false positives that they cannot be examined manually, then there is, in effect, no error checking. It is a fact of life that translators have to deal with badly formatted documents and must do so without introducing errors of substance.

    XBench located the error immediately and produced no false positives.

    I hope this is of use and I am at your disposal if you have further questions.

    Thank you for your time.

    Neil

    Trados Studio project settings window showing number verifier plugin with space and period set as valid thousands separators.

    Trados Studio project settings window with number verifier plugin options, including exclude leading zeros and exclude full stop.

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 3:35 PM (GMT 0) on 28 Feb 2024]
  • Hi

    One simple way to detect such wrong or forgotten numbers and get a distince error message would be this regex:

    Trados Studio screenshot showing a Regular Expression search setup with a description 'Basic Number Check', a warning level, and a condition for combined search expressions.

    This will interpret 123.456,12€ as three individual numbers, but it will flag up a modified number in all but one cases I can think of. (That case would be that no thousands separators are used in the source, but some kind of separator is used in the target.) This is not as comprehensive as XBench, but it would catch most mistakes I think. Again, using Regex Autosuggest might greatly reduce the amount of mistakes introduced into the target text.

    The formatting check might be done as a separate step, again with the advantage of getting a distinct error message:

    (((\d{1,3},)+\d+)(\.\d+))|((\d{1,3},)+\d{3}[ $€]?) ...or something similar would flag up numbers with a comma as thousands separator. (Just check the target.)

    The position of a currency symbol or measurement would be a third regex, maybe (\d [€])|([€]\d) if you want to be alerted if the € symbol is anywhere but right after the last digit.

    I am not saying this rivals Xbench - that is a whole product designed to do nothing but QA, but I think there are simple ways to use Studio's QA checks to safeguard against most number-related QA problems.

    Daniel

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 3:35 PM (GMT 0) on 28 Feb 2024]
  • Daniel, thank you for your reply and your time.

    However, in all of the CAT tools I have used (WordFast, Déjà Vu (DVX) and MemoQ) I don't recall ever having to craft and test regular expressions in order to do a basic QA check on numbers.

    The problem lies in the attempt to detect non-localised numbers. If there were an option to turn this off and only detect numbers added or taken away, then this would completely resolve the problem.

    Until then, I see XBench or some equivalent solution as being an absolute necessity for anyone who does financial or technical translations.

    Neil
  • I have to agree with Neil about the irritation of false positives and how it makes you annoyed with Studio.

    I have had problems with false positives from date recognition in Austrian for years.

    I reported it years ago and, since other people commented about other languages with date recognition problems in the same thread, I naively thought it would be corrected "soon".

    For example "1. Januar 2018" is NOT recognised as a date in Austrian, but its translation into English as "1 January 2018" IS recognised as a date.

    So I get 3 errors, two errors because the numbers "1." and "2018" are missing in the target segment ("1. Januar 2018" is recognised as 2 numbers separated by a word in Austrian) and then I get a third error because Studio thinks there is an extra date in the English target segment.

    I have been thinking about converting the Austrian TMs and termbases to German (which does not have this problem) to get rid of the annoyance, but that means convincing the clients and then actually spending the time to do the conversion, so I will probably let it slide another few years before I finally lose my patience. If I had known this in advance I definitely would not have used Austrian.

    Yes, it is technically possible to use regex when I happen to get another Austrian job that happens to have a lot of dates in it, but I don't have the patience for using a separate workflow in these cases. I just grit my teeth and spend the few additional minutes needed in that occasional job to check the list of "missing" numbers.

    It does, however, leave a bad taste in my mouth every time. (Customer satisfaction?)

    Best regards,
    Bruce Campbell
    ASAP Language Services

  • Unknown said:

    Until then, I see XBench or some equivalent solution as being an absolute necessity for anyone who does financial or technical translations.

    XBench: I would definitely do the same if I was in your position.

    My point was not to say that everything is perfect with the Studio number check - it's not. I just meant to point out that there are simple ways (\d+ -> $1) to work around certain problems in most cases. Regex is far more limited than a programmatic solution, that's obvious. And, yes, checks that produce lots of false positives are useless - I turn them off.

    Daniel

  • I'm not sure you can turn off those checks that produce false positives, unless I am mistaken.

    Therein lies the problem...
  • Hi
    like I replied to Neil, regex is far more limited than a programmatic solution and yes, it's a bit surprising how the date checker works and does not work. I'd just turn it off (in the QA Checker, not date recognition in the TM), in which case you get no errors and both numbers checked. No, it's not perfect, and I would like to see improvement here, too. But I find it's good enough to work with for now.
    I'd like to simply see a few options to the built in number/date/measurement check, e.g. have the user enter the source/target date formats that he wants to allow in the common DD-MMMM-YYYY style, select currency symbols that are allowed to be attached to a number like $100.00 or 134,56€ without making Studio treat the whole thing as alphanumeric. Etc.
    Daniel
  • You never gave examples of the false positives, so I wouldn't know how to avoid them.
  • Hi   

    I can see that I'm not the only one who likes the simple function of checking numbers without taking into account number format.

    Checking numbers while taking into account their correct formatting is not really a trivial task. So such an approach in adverse circumstances will always involve a number of false positive results. Studio is a great tool that allows customization with individual needs in mind. Here you can download the Simple Numcheck plugin:

    http://www.posteditacat.xyz/download/simple-numcheck-2019/

    http://www.posteditacat.xyz/download/simple-numcheck-2017/

    It checks numbers in Xbench29 style (only digits, without formatting). Plugins are not signed and you don’t find them in appstore. By default, plugin runs in every project and you don't need to initialize it each time. I've been using the plugin for few years. Tested on thousands of projects. No client made a complaint.

    Simple Numcheck checks numbers (digits) and the order of numbers in the whole segment without any formatting. Reliability is 100%, virtually without false positives. In addition, it also checks alphanumeric expressions, but there are many false positives here. You can disable this option.

    Screenshot of Trados Studio showing a list of errors related to number formatting in a translation project, with multiple entries highlighted in red.

    Detailed error message window from Trados Studio's Simple Numcheck plugin, highlighting discrepancies in number formatting between source and target text segments.

    Trados Studio plugin settings window for Simple Numcheck, showing options to ignore certain number and alphanumeric differences.

    Plugin settings window displaying advanced options for number checking patterns and exclusion rules in Trados Studio's Simple Numcheck plugin.

    Trados Studio Simple Numcheck plugin settings window with a list of alphanumeric check options and measurement units for customization.

    DArek

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 3:35 PM (GMT 0) on 28 Feb 2024]
  • Hello
    this looks very interesting... May I ask for the reason why this plugin is not signed and available on the Appstore? It's free for you as the author, is it not?
    Daniel
Reply Children
No Data