Extract TM source files list

Question

Hi everyone, 
 
 I'd like to update some (massive) TMs, but I don't know which source files have already been aligned and added, so that's the first item on my to-do list before adding any new alignment result. 
 So far the only way I've found is to simply open the TM in Trados, look at the third column and copy/paste the source file name somewhere. 
 Is there any automated way to get the list of all the source files in my TM? I've thought about SQL-ing this, but I don't know enough about it. 
 
 Any help would be greatly appreciated! Thanks!

Paul · Answer

Liza Thetiot 
 You cannot do this easily in Trados Studio, but with the help of ChatGPT you can do this in SQLite even more easily! 
 The first thing to do is look where the information is held by inspecting the Translation Memory in a tool like DB Browser for SQLite as this is free. When you do this you'll see this sort of thing: 
 
 Look through the tables in the "Browse Data" and you'll find this: 
 
 This is the "string_attributes" table and you can see that the "value" column holds all the information related to custom fields and for alignment TMs this includes the source and target filenames. Trados Studio won't hold the full file name, just the name without the extension. It also puts source and target together into both the source and target fields (don't ask me why!). So what I want is a list of the unique values from this column, less the .sdlalign part as it's not needed. I can get that with a regex like this: 
 .+(?=\.sdlalign) 
 So, armed with all of this information I can ask ChatGPT something like this: 
 Create a SQLite instruction for use in DB Browser for SQLite. The instruction should create a list of the contents of the value column in the string_attributes table. The content returned in the value column should be evaluated using a SQLite equivalent to this regular expression: .+(?=\.sdlalign) The list of contents returned should only be unique values. 
 ChatGPT obliges with this excellent information: 
 
 So I enter this: 
 SELECT DISTINCT substr(value, 0, instr(value, '.sdlalign')) FROM string_attributes WHERE value LIKE '%.sdlalign'; 
 Into the "Execute SQL" tab: 
 
 Then "Run" the code. This returns the following: 
 en-greatminds_es-greatminds en-latinamerica_es-latinamerica en-UNICEF_es-UNICEF 
 I actually ran three alignments into a TM to test this. So now I have a list of the files I aligned. I think this is another excellent example of just how smart ChatGPT is. If I didn't know how to write the regular expression I could have probably skipped that part and just explained in my question what I wanted.

Trados Studio > 8. AI enabled

Extract TM source files list