Creating a TM from a Single Bilingual File

Hi all! I'm new here. So stupid question time!

I need to translate a massive file. But I'm in luck! It's version 2.0, and I have the following: a txt file for the 1.5 ST, a txt file for the 1.5 TT, and a bilingual HTML file that has the entire thing with the English on top and Japanese right underneath.

I tried to align the TXT files, but it's... just too big. It's around 20,000 segments, things go off in the middle, several parts aren't parsing right... it would take me two weeks to align it manually.

But wait, I already have an "aligned" version! The bilingual HTML file! It's already separated line by line. Is there any way to leverage this into a TM? I found a question like this in the archives, but it was around five years old and the answer seemed to be a big "nope". I've got my fingers crossed that there's a workaround here... otherwise I will just end up having to CTRL+F every single line of this huge thing from scratch.

  • Please note that you are not missing anything, and it's not surprising that you are confused by the "instructions" from Kelly.  He doesn't do instructions; he just likes to show off how he was able to do something without explaining any of it.

    He essentially did this:

    1. extracted the source and target text for each language using regular expressions
    2. put this extracted text into a delimited file and converted that into a TMX file (or some process like this)
    3. from there you can either import the TMX into a new Trados translation memory, or convert it directly (the Glossary Converter on the AppStore can do this for you... but importing is easy enough with out-of-the-box features)
    4. he also shows a UI for Studio full of all his little tools, as he does know how to develop a little, so it won't look familiar to you, and he hasn't explained any of that
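
    To make step 2 a little less abstract, here is a minimal Python sketch of turning already-extracted sentence pairs into a TMX file.  It is not necessarily what Kelly did.  The function name pairs_to_tmx, the language codes en-US and ja-JP, and the header values are all my own assumptions for illustration, so adjust them to match your project.

```python
import xml.etree.ElementTree as ET

# ElementTree writes this namespaced attribute out as xml:lang.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def pairs_to_tmx(pairs, src_lang="en-US", tgt_lang="ja-JP"):
    """Build a minimal TMX 1.4 document from (source, target) text pairs.

    The language codes and header values are illustrative assumptions.
    """
    tmx = ET.Element("tmx", version="1.4")
    ET.SubElement(tmx, "header", {
        "creationtool": "pairs_to_tmx",   # hypothetical tool name
        "creationtoolversion": "0.1",
        "segtype": "sentence",
        "o-tmf": "plaintext",
        "adminlang": "en-US",
        "srclang": src_lang,
        "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")
    for source, target in pairs:
        # One translation unit per pair, with one variant per language.
        tu = ET.SubElement(body, "tu")
        for lang, text in ((src_lang, source), (tgt_lang, target)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = text
    return ET.tostring(tmx, encoding="unicode")


tmx_text = pairs_to_tmx([("Hello", "こんにちは")])
```

    You would then save tmx_text to a .tmx file (as UTF-8) and import it into a new translation memory as described in step 3.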

    So your task would be to learn the following:

    • how to use regular expressions to identify the translatable text from your file
    • how to use that expression in a tool (like Notepad++) so it's copied to a new file in a structured way
      • I said "new file" because it doesn't have to be a TMX
      • You could simply separate the results of your regular expressions by a tab and then use the Glossary Converter to convert the tab delimited file directly to an SDLTM (translation memory)
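
    If you would rather script this than do it in Notepad++, the two bullets above could be sketched in Python as follows.  Big caveat: I have not seen your HTML, so the pattern assumes a made-up layout where each English line is a <p class="en"> element immediately followed by a <p class="ja"> element.  That markup, and the helper names clean, extract_pairs and to_tab_delimited, are purely hypothetical; you will need to adapt the expression to whatever your file really contains.

```python
import html
import re

# ASSUMPTION: each English paragraph is directly followed by its Japanese
# counterpart, marked up as <p class="en">...</p><p class="ja">...</p>.
PAIR_RE = re.compile(
    r'<p class="en">(.*?)</p>\s*<p class="ja">(.*?)</p>',
    re.DOTALL,
)
TAG_RE = re.compile(r"<[^>]+>")


def clean(fragment):
    """Strip inline tags, decode entities, and collapse whitespace."""
    text = html.unescape(TAG_RE.sub("", fragment))
    return " ".join(text.split())


def extract_pairs(html_text):
    """Return a list of (english, japanese) tuples found in the HTML."""
    return [(clean(en), clean(ja)) for en, ja in PAIR_RE.findall(html_text)]


def to_tab_delimited(pairs):
    """Join each pair with a tab, one pair per line."""
    return "\n".join(f"{en}\t{ja}" for en, ja in pairs)


sample = """<p class="en">Press the <b>Start</b> button.</p>
<p class="ja">スタートボタンを押します。</p>
<p class="en">Save &amp; exit.</p>
<p class="ja">保存して終了します。</p>"""

pairs = extract_pairs(sample)
tab_text = to_tab_delimited(pairs)
```

    Writing tab_text to a UTF-8 text file gives you the tab delimited file mentioned above.  Spot-check a few hundred lines before converting, because a single unpaired paragraph in the real file would make the pattern skip or mismatch segments.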

    Kelly has essentially forgotten that everyone has to start somewhere.  I can still remember the sort of questions we used to get from him... and if other users back then had been like Kelly is today, he would have learned nothing!  This has nothing to do with a free lunch, as he hasn't even given you a biscuit!  I think it's fine to offer to do work for someone at a price, as everyone's time is valuable.  So, Kelly, if you do have technical skills that go above those of the average user, then contact people off-forum through the chat and agree on a price for the work.  But don't keep posting like this.  It's rude, grandiose and frankly unhelpful.

    We do have a forum here for regular expressions:

    https://community.rws.com/product-groups/trados-portfolio/trados-studio/f/regex_and_xpath

    You can feel safe having a go, sharing examples of what you are trying to achieve and getting some help along the way.  The example reference from Daniel is a good one, and there are also some good tools out there that explain regular expressions as you use them... some free and some at a cost.  So use the forum and ask as you go.  Once you get your head around them you'll never go back... they are very useful!

    Paul Filkin | RWS Group

