RegEx for Language Recognition

Hello guys ...
I am just starting to use a bit of regex to find strings with digits (^\d), URLs (^\www), telephone numbers etc. BUT ...

I basically translate from Russian into Italian, and quite often I find segments in Russian source files that are NOT in Russian. It happens, for instance, with brands, company names etc ... let's say ... I have in Russian files a long list of brands like "Ferrari", "Lamborghini", "Alfa Romeo" ETC ...

My question is: is there a way to build a RegEx that allows me to find segments written NOT in Russian ???

If I have, for example, this list ... in the Cyrillic and Latin alphabets:

КОМПАНИЯ 1
FERRARI
КОМПАНИЯ 2
LAMBORGHINI
ALENIA SPAZIO
КОМПАНИЯ 3

Is there any way to EXTRACT from that source document ALL the segments like "Ferrari", "Lamborghini" and "Alenia spazio" in order to simply COPY them from source to target and only AFTER THAT start the translation of the REAL russian source ???
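
As a rough illustration of the idea outside Trados, here is a minimal Python sketch that pulls the Latin-only segments out of the list above. It assumes that "Russian" can be approximated by the basic Cyrillic Unicode block (U+0400 to U+04FF); the function name `non_russian_segments` is made up for this example, and the regex syntax accepted by a CAT tool's filter may differ.

```python
import re

# Any character from the basic Cyrillic Unicode block (U+0400 to U+04FF).
# Assumption: this block is a good enough stand-in for "written in Russian".
CYRILLIC = re.compile(r"[\u0400-\u04FF]")
# At least one basic Latin letter, so empty or digits-only lines are skipped.
LATIN = re.compile(r"[A-Za-z]")


def non_russian_segments(lines):
    """Return the segments that contain Latin letters and no Cyrillic ones."""
    return [
        line.strip()
        for line in lines
        if LATIN.search(line) and not CYRILLIC.search(line)
    ]


source = [
    "КОМПАНИЯ 1",
    "FERRARI",
    "КОМПАНИЯ 2",
    "LAMBORGHINI",
    "ALENIA SPAZIO",
    "КОМПАНИЯ 3",
]

print(non_russian_segments(source))
# ['FERRARI', 'LAMBORGHINI', 'ALENIA SPAZIO']
```

Segments that mix both alphabets are deliberately left out here, since they probably still need translating.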

Sorry if it sounds strange!

MANY THANKS!

Pietro

Parents Reply
  • ^(?=[A-Za-z]+)

    Well ... This works MUCH better ...
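
To see what this pattern does and does not catch, here is a small Python sketch comparing it with a stricter variant. The stricter pattern `^[^\u0400-\u04FF]+$` is only an assumption for illustration (it is not from this thread), and the `classify` helper is hypothetical.

```python
import re

# Pattern from the reply: the segment starts with a Latin letter.
STARTS_LATIN = re.compile(r"^(?=[A-Za-z]+)")
# Hypothetical stricter variant: no Cyrillic character anywhere in the segment.
NO_CYRILLIC = re.compile(r"^[^\u0400-\u04FF]+$")


def classify(segment):
    """Return (matches reply pattern, matches stricter variant)."""
    return (
        bool(STARTS_LATIN.search(segment)),
        bool(NO_CYRILLIC.search(segment)),
    )


for segment in ["FERRARI", "КОМПАНИЯ 1", "Ferrari КОМПАНИЯ", "3M"]:
    print(segment, classify(segment))
# FERRARI (True, True)
# КОМПАНИЯ 1 (False, False)
# Ferrari КОМПАНИЯ (True, False)
# 3M (False, True)
```

So the pattern from the reply also catches mixed segments that merely start with a Latin letter, and misses names that start with a digit. The stricter variant avoids both, but it also matches segments made only of digits or punctuation.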

    Screenshot of Trados Studio document with a list of links and Italian car brand names like Lamborghini and Ferrari.

    I'll keep trying.
    Thank You

    Post Scriptum ...
    By the way ... I really need to get out of a Virtual Machine !!!
    It works terribly!

    May I ask you 3 things ???

    1.
    Is Trados a 32-bit program ONLY ??? (wondering why I have a 32-bit version installed)
    Icon indicating SDL Trados Studio is a 32-bit program. (This comes from the Activity Manager, since it often crashes)

    2.
    Do you know how Studio 2019 works on ARM processors ??? If it is not optimized for them, then I will not buy a computer with that processor ...

    3.
    Just to give an example ...
    Error message in Trados Studio showing unknown exceptions during layout generation for multiple segments.


    SORRY!
    And ... THANK YOU AGAIN!
    Pietro



    Generated Image Alt-Text
    [edited by: Trados AI at 4:22 AM (GMT 0) on 5 Mar 2024]