What regex can search and replace a bold first capital letter with normal formatting?

Hello

 

How can I solve this issue so that the text is suitable for machine translation?
Screenshot of a text editing interface with a query on how to convert the first letter in each word from bold to non-bold format. The text 'Heparin-Induced Thrombocytopenic Thrombosis' is shown in both formats.

 

 

Heparin-Induced Thrombocytopenic Thrombosis (first letter of each word in bold)

to this format:

Heparin-Induced Thrombocytopenic Thrombosis (no bold)


Generated Image Alt-Text
[edited by: Trados AI at 4:04 AM (GMT 0) on 5 Mar 2024]
  • Hi,

    It's not possible to search for formatting and change it with regex. If you press Ctrl+Shift+H while the file is open in Studio, you will see all of the formatting tags. At that point you can use the CleanupTasks application to remove all the tags that control this formatting, as explained in this article:

    multifarious.filkin.com/.../

    You will, however, lose all the formatting in the document, so it will no longer look the same when you save the target. If this is acceptable, I suggest you clean up the text in the source file before you open it for translation. This will save you a lot of hassle afterwards.

    Paul Filkin | RWS Group


  • It is possible to do this on the plain SDLXLIFF file in Notepad++ (or SDLXLIFF Toolkit). I have done it before for spaces, which were somehow formatted in a customer's PPT where I was not able to remove the formatting in PowerPoint itself.
    When you open the SDLXLIFF in Notepad++, you can see that these letters are surrounded by tags.
    So search for <.+?>(\w)</.+?> and replace with $1 (in Toolkit) or \1 (in Notepad++). This removes the formatting of single characters. Of course, you should create a backup copy of the SDLXLIFF and test whether it works as desired, but it should do the trick.
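
As a rough illustration of the search and replace described above, here is a minimal Python sketch. The sample segments are hypothetical and simplified (the `<g>` and `<mrk>` tags and their ids are assumptions; real SDLXLIFF content will look different), and `strip_single_char_tags` is only an illustrative helper name. It also shows something worth checking when you test on the backup copy: because `.+?` may run across tag boundaries, the quoted pattern can swallow an enclosing opening tag that sits directly in front of a formatted letter. The stricter pattern `<[^<>]+>(\w)</[^<>]+>` is a suggested variant, not something from this thread.

```python
import re

# Pattern quoted in the reply above; the lazy ".+?" can still cross tag boundaries.
ORIGINAL = re.compile(r"<.+?>(\w)</.+?>")
# Stricter variant (an assumption, not from the thread): a tag may not contain "<" or ">".
STRICT = re.compile(r"<[^<>]+>(\w)</[^<>]+>")


def strip_single_char_tags(text, pattern=STRICT):
    """Replace each tag pair that wraps exactly one word character with that character."""
    return pattern.sub(r"\1", text)


# Hypothetical, simplified segment; real SDLXLIFF markup will differ.
sample = ('<g id="5">H</g>eparin-<g id="6">I</g>nduced '
          '<g id="7">T</g>hrombocytopenic <g id="8">T</g>hrombosis')
print(strip_single_char_tags(sample, ORIGINAL))
print(strip_single_char_tags(sample))

# A formatted letter directly after another opening tag (also hypothetical).
nested = '<mrk mid="1"><g id="5">H</g>eparin</mrk>'
print(strip_single_char_tags(nested, ORIGINAL))  # the opening <mrk> tag is lost here
print(strip_single_char_tags(nested))            # the <mrk> pair stays intact
```

Both patterns remove any tag pair around a single word character, not only bold, so comparing the file before and after the replacement is still a good idea.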

