Issue with autosubstitution of alphanumeric characters

Hi all. I have very large files containing segments that look like this : EPMBRMECHP Rbld.VP60HV57 Valve. I want the alphanumeric characters to be autosubstituted but this is not happening. I have a feeling it's because there is no space between the period and the alphanumeric code. I tried entering modifying the source and adding a space and it worked. Is there another way around this? Otherwise, I need edit to edit all my source documents. Thanks

emoji
Parents
  • Hi

    I did a little test based on the information you provided. This is my "source file":

    This is a EPMBRMECHP Rbld.VP60HV57 Valve.
    This is a BDEFGFHSIP Rddd.VP80KV57 Valve.
    This is a BDEFGFHZAP Abcd.VP80KX99 Valve.

    These are my TM settings:

    This is how it looks in my editor:

    The actual alphanumeric string "Rbld.VP60HV57" is recognized as such and replaced correctly. The all-upper-case string "EPMBRMECHP" is not recognized as acronym or any other placeable. It seems to be too long, there must be a max length for acronyms? Maybe can tell us more? Can this limit be set somewhere or is it hard-coded?

    I was curious and tried the same with shortened upper-case-only strings:

    Voilá! Works perfectly!

    So this should work out-of-the box - almost. If your sentences are simple like mine, the MT does it all perfectly, so nothing to worry about. If you really need to auto-substitute the all-upper-case strings, there is a plug-in called TermInjector, which is just ideal for that, but you need to know some basic Regex and it's not the most intuitive tool. (Just to say it now, it uses POSIX regex.)

    Hope that helps.

    Daniel

    emoji
Reply
  • Hi

    I did a little test based on the information you provided. This is my "source file":

    This is a EPMBRMECHP Rbld.VP60HV57 Valve.
    This is a BDEFGFHSIP Rddd.VP80KV57 Valve.
    This is a BDEFGFHZAP Abcd.VP80KX99 Valve.

    These are my TM settings:

    This is how it looks in my editor:

    The actual alphanumeric string "Rbld.VP60HV57" is recognized as such and replaced correctly. The all-upper-case string "EPMBRMECHP" is not recognized as acronym or any other placeable. It seems to be too long, there must be a max length for acronyms? Maybe can tell us more? Can this limit be set somewhere or is it hard-coded?

    I was curious and tried the same with shortened upper-case-only strings:

    Voilá! Works perfectly!

    So this should work out-of-the box - almost. If your sentences are simple like mine, the MT does it all perfectly, so nothing to worry about. If you really need to auto-substitute the all-upper-case strings, there is a plug-in called TermInjector, which is just ideal for that, but you need to know some basic Regex and it's not the most intuitive tool. (Just to say it now, it uses POSIX regex.)

    Hope that helps.

    Daniel

    emoji
Children
No Data