check segments after colon start with lower case letter (regex)

Hi all,

I have several large documents and as the title says I need to make sure the segment after a colon (:) starts with a lower case letter.

Like this:

[segment 1] Note:

[segment 2] tomatoes are green.

Is there a way to just select the segments that follow the colon? Regex?

Different segmentation is not an option (because of the way the TM is structured).

Thank you very much in advance.

Willem

emoji
Parents
  •  

    I assume the heart of the issue is the format is incorrect in the source file?
    What is the native source file?

    So what you have in source is: 
    Note: Tomatoes are green.

    But what you need is: 
    Note: tomatoes are green.

    I wonder if its easier to address this in native source document before bringing into Studio.
     does something come to mind for this? 

    If that is not, then yes I agree with  we could think about creating an app that does a verification based on context or check the Xliff itself. 

    Lyds

    Oana Nagy | RWS Group

    _____________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
  •  

     does something come to mind for this? 

    I think your example is not quite the same as the one mentioned.  I think it's more like this:

    So what you have in source is: 

    #1        Note:
    #2        Tomatoes are green.


    But what you need is: 

    #1        Note:
    #2        tomatoes are green.

    Segmentation cannot be changed for this users issue so he's looking for a solution to be able to find these segments by searching with a multi-segment condition. So find this:

    \w+:

    But only when the following segment is equal to this:

    ^\p{Lu}

    I wonder if its easier to address this in native source document before bringing into Studio.

    It may already be correct in the source document.  The problem is the translated target.. probably because of autocorrect or just a translators natural inclination to  start a sentence with a capital letter.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
Reply
  •  

     does something come to mind for this? 

    I think your example is not quite the same as the one mentioned.  I think it's more like this:

    So what you have in source is: 

    #1        Note:
    #2        Tomatoes are green.


    But what you need is: 

    #1        Note:
    #2        tomatoes are green.

    Segmentation cannot be changed for this users issue so he's looking for a solution to be able to find these segments by searching with a multi-segment condition. So find this:

    \w+:

    But only when the following segment is equal to this:

    ^\p{Lu}

    I wonder if its easier to address this in native source document before bringing into Studio.

    It may already be correct in the source document.  The problem is the translated target.. probably because of autocorrect or just a translators natural inclination to  start a sentence with a capital letter.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
Children
No Data