Any advice on how to separate, in a Word file, source paragraphs that are each followed by their translation?

Hello,

I wonder if anyone may have a suggestion to solve this problem I’m facing:

I have received a large Word file in which each paragraph in the source language is followed by its translation, then the next source paragraph, then its translation, and so on. That's the only document available.

Out of this file we need to create a translation memory with the translations provided within.

Using Alignment with the file as it is would be beyond messy, I think.

Besides brute force, is there by any chance some way of separating/extracting the source language from the translations?

Thank you in advance for any advice/suggestion you may have.

Gilberto

  • Too easy, eh? Well, let's hope so.
    This is a sample from one of the Word files:

    Important Notes: This document has information about the drugs covered by this plan. For more up-to-date information or if you have any questions, please call <Column X>

     

    Notas Importantes: Este documento tiene información sobre los medicamentos que cubre este plan. Para obtener información más actualizada o si tiene alguna pregunta, llame a Servicio al Cliente <Column AL> al:]

     

    Toll-free <Column J>, TTY <Column M>

     

    Llamada gratuita: <Column J>, TTY <Column M>

     

    If you are a member of a group sponsored plan (your coverage is provided through a former employer, union group or trust), please call the Customer Service number on the back of your member ID card.]

     

    Si usted es miembro de un plan patrocinado por un grupo (recibe su cobertura a través de un fideicomiso, sindicato o empleador anterior), llame al número de Servicio al Cliente que se encuentra en la parte de atrás de su tarjeta de ID de miembro.]

     

    What is a drug list?

    A drug list, or formulary, is a list of prescription drugs covered by your planPage3_WhatIsADrugList_UofCA_ENG33.0 [If (Column F = UofCA), then print, UC Medicare Choice]. Your plan and a team of health care providers work together in selecting drugs that are needed for well-rounded care and treatment.

     

    Your plan will generally cover the drugs listed in our drug list as long as:

    • The drug is used for a medically accepted indication,
    • The prescription is filled at a network pharmacy and
    • Other plan rules are followed.

     

    For more information about your drug coverage, please review your Evidence of Coverage.]]

     

    ¿Qué es una Lista de Medicamentos?

    Una Lista de Medicamentos, o Formulario, es una lista de los medicamentos con receta que cubre su plan. Su plan y un equipo de proveedores de cuidado de la salud colaboran en la selección de los medicamentos que se necesitan para ofrecer cuidado y tratamiento integrales.

     

    Su plan generalmente cubrirá los medicamentos incluidos en la Lista de Medicamentos, siempre y cuando:

    • El medicamento se use para una indicación médicamente aceptada,
    • La receta se surta en una farmacia de la red y
    • Se sigan otras reglas del plan.

     

    Para obtener más información sobre su cobertura de medicamentos, consulte su Evidencia de Cobertura.]

  • Just my opinion, but I think you're overthinking this. By the time you've been through hiding text in the source files, you could have aligned them quite easily. Based on your example I don't think it's so messy:

    The table realigns itself as you work through it, and because of the way your document is formatted it's pretty simple to see where you are... even if you don't have a clue what it means (like me!).

    I think I can see where Kelly was coming from, but as your file is not really formatted paragraph by paragraph, as you initially said, I think an automated solution would not be so easy. It requires too much checking, because sometimes it's several paragraphs before the text changes from one language to the other. If it were consistent, then I guess a macro could be used to pick out the paragraph markers and separate the texts. Probably... although I hate to try and second-guess Kelly... he was thinking along these lines.
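
For anyone who wants to experiment with the automated route anyway, here is a minimal sketch in Python rather than a Word macro. It is only an illustration under loud assumptions: the paragraphs have already been pulled out of the Word file as plain strings (for example by saving as text, or with a library such as python-docx), the language pair is English to Spanish as in the sample, and the two word lists are small hand-picked hints, not a real language detector. All function names are hypothetical. Paragraphs it cannot decide on are marked `??` and set aside for manual review, which is exactly the checking burden described above.

```python
import re
from itertools import groupby

# Assumption: tiny hand-picked hint lists for this sample, not a real detector.
SPANISH_HINTS = {"el", "la", "los", "las", "de", "del", "que", "su", "una",
                 "un", "para", "se", "en", "y", "o", "es", "si"}
ENGLISH_HINTS = {"the", "of", "is", "are", "this", "your", "for", "and",
                 "or", "if", "as", "by", "in", "what"}


def guess_language(paragraph):
    """Return 'en', 'es', or '??' when the hint lists cannot decide."""
    text = paragraph.lower()
    words = re.findall(r"[a-záéíóúñü]+", text)
    # Spanish score: hint words plus Spanish-only characters.
    es = sum(w in SPANISH_HINTS for w in words)
    es += len(re.findall(r"[¿¡áéíóúñ]", text))
    en = sum(w in ENGLISH_HINTS for w in words)
    if es == en:
        return "??"
    return "es" if es > en else "en"


def split_runs(paragraphs):
    """Group consecutive non-empty paragraphs sharing a guessed language."""
    kept = [p.strip() for p in paragraphs if p.strip()]
    return [(lang, list(group))
            for lang, group in groupby(kept, key=guess_language)]


def pair_runs(runs):
    """Pair each 'en' run with the 'es' run directly after it.

    Anything that does not fit that pattern goes to the review list.
    """
    pairs, review = [], []
    i = 0
    while i < len(runs):
        lang, paras = runs[i]
        if lang == "en" and i + 1 < len(runs) and runs[i + 1][0] == "es":
            pairs.append((paras, runs[i + 1][1]))
            i += 2
        else:
            review.append((lang, paras))
            i += 1
    return pairs, review


if __name__ == "__main__":
    sample = [
        "What is a drug list?",
        "Your plan will generally cover the drugs listed in our drug list as long as:",
        "¿Qué es una Lista de Medicamentos?",
        "Toll-free <Column J>, TTY <Column M>",
    ]
    pairs, review = pair_runs(split_runs(sample))
    print("paired:", pairs)
    print("needs review:", review)
```

Short lines made up mostly of placeholders, like the toll-free line in the sample, give the heuristic nothing to work with, so they end up in the review list. The paired runs would still have to be written out as two separate files, or a two-column table, before being fed to the alignment tool.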

    Paul Filkin | RWS Group
