How to prepare a txt subtitle file for translation in Studio?

Hello everybody,

I've been sent a couple of txt files that seem to have been exported from subtitling software. The text looks like this:

{\rtf1\ansi\ansicpg1252\cocoartf1561\cocoasubrtf200
{\fonttbl\f0\fswiss\fcharset0 Helvetica;}
{\colortbl;\red255\green255\blue255;}
{\*\expandedcolortbl;;}
\paperw11900\paperh16840\margl1440\margr1440\vieww26700\viewh18280\viewkind0
\pard\tx566\tx1133\tx1700\tx2267\tx2834\tx3401\tx3968\tx4535\tx5102\tx5669\tx6236\tx6803\pardirnatural\partightenfactor0

\f0\fs24 \cf0 1\
00:00:10,000 --> 00:00:20,000\
\pard\pardeftab708\ri-386\partightenfactor0
\cf0 Dear trader\
\pard\tx566\tx1133\tx1700\tx2267\tx2834\tx3401\tx3968\tx4535\tx5102\tx5669\tx6236\tx6803\pardirnatural\partightenfactor0
\cf0 \
2\
00:00:20,000 --> 00:00:30,000\
\pard\pardeftab708\ri-386\partightenfactor0
\cf0 Are you fully aware of the information you are required by law \
\pard\tx566\tx1133\tx1700\tx2267\tx2834\tx3401\tx3968\tx4535\tx5102\tx5669\tx6236\tx6803\pardirnatural\partightenfactor0
\cf0 \

Obviously, the only things to translate here are the sentences "Dear trader" and "Are you fully aware of the information you are required by law". Can anyone suggest how to prepare these files in Studio so that only the text is extracted for translation?

I tried adding "\cf0 " as an opening pattern and "\" as a closing pattern in the document structure settings for txt files, but the backslash doesn't seem to be accepted as the end of a pattern. I also changed every "\" to "$" and re-entered the patterns with the dollar sign, again without success.

As a last resort, I might try to hide the unnecessary content with Transtools, but I would have to do it manually and I am afraid of accidentally skipping sentences.

TIA

  • OMG, which software created this? This is obviously RTF, not plain text!
    Change the file extension to .rtf, open the file in Word, and save it as a real plain-text file (the actual subtitle format is SRT, see https://en.wikipedia.org/wiki/SubRip).
    BTW, translating subtitles in CAT tools is basically pointless because a) by the nature of subtitles, it's pretty unlikely that you'll find any reusable text, and b) again by the nature of subtitles, sentences are split into multiple parts (the parts are displayed one by one at different times), i.e. the sentences are nonsensically broken down into separate segments.
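
If Word is not at hand, the same RTF-to-plain-text step can be roughly approximated with a short script. The following is only a minimal sketch, not a full RTF parser: it assumes the files look like the excerpt in the question (Cocoa-style RTF, where a backslash at the end of a line marks a paragraph break), it does not handle `\par`, `\uN` escapes or escaped braces, and the function name `rtf_to_plain` and the abridged `SAMPLE` (including its closing brace, which the excerpt does not show) are made up for illustration.

```python
import re

# Abridged from the excerpt in the question; the final "}" is an assumption,
# since the posted excerpt stops before the end of the file.
SAMPLE = r"""{\rtf1\ansi\ansicpg1252\cocoartf1561\cocoasubrtf200
{\fonttbl\f0\fswiss\fcharset0 Helvetica;}
{\colortbl;\red255\green255\blue255;}
\pard\tx566\tx1133\pardirnatural\partightenfactor0

\f0\fs24 \cf0 1\
00:00:10,000 --> 00:00:20,000\
\pard\pardeftab708\ri-386\partightenfactor0
\cf0 Dear trader\
\pard\tx566\tx1133\pardirnatural\partightenfactor0
\cf0 \
2\
00:00:20,000 --> 00:00:30,000\
\pard\pardeftab708\ri-386\partightenfactor0
\cf0 Are you fully aware of the information you are required by law \
}
"""


def rtf_to_plain(rtf: str) -> str:
    """Strip RTF markup from a simple Cocoa-style RTF string (hypothetical helper)."""
    # Drop innermost {...} groups (font table, colour table). A single pass
    # leaves the outer document group; its braces are removed further down.
    body = re.sub(r"\{[^{}]*\}", "", rtf)
    # Decode \'xx hex escapes, assuming the code page is 1252 (\ansicpg1252).
    body = re.sub(
        r"\\'([0-9a-fA-F]{2})",
        lambda m: bytes.fromhex(m.group(1)).decode("cp1252", errors="replace"),
        body,
    )
    lines = []
    # A backslash directly followed by a line break ends a paragraph.
    for para in re.split(r"\\\r?\n", body):
        # Remove control words such as \pard, \ri-386 or \cf0, together with
        # the single space that may terminate them.
        para = re.sub(r"\\[A-Za-z]+-?\d* ?", "", para)
        # Raw line breaks and the remaining braces carry no text.
        para = re.sub(r"[{}\r\n]", "", para)
        lines.append(para.rstrip())
    return "\n".join(lines).strip() + "\n"


result = rtf_to_plain(SAMPLE)
print(result)
```

The output keeps the SRT layout (cue number, timing line, text, blank line), so it can be saved with an .srt extension and checked against the original before importing it anywhere. For real files, read them with the encoding they were saved in and spot-check the result, since anything outside the simple markup shown above will not be converted correctly.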
