Use parser to filter out html type tags

Hello

I received several word documents to translate, which include HTML tags. Can I use the parser to remove these tags?

</span>

<span class="prevnum_text_result_bold">

</p> \n<p class="prevnum_texte_result"><span class="prevnum_text_result_bold">

if so I'd be grateful for a view clues, as its all a bit advanced for me. Thank you in advance

emoji
  •   

    You can remove a tags parser rule like this:

    Trados Studio parser options window showing a list of HTML tags and their corresponding rules. The 'span' tag is highlighted with options to add, edit, or remove the rule.

    But be careful with it... you may lose more than you bargained for.  Perhaps explain why you want this... screenshots might be helpful, or use the Insert -> Code feature in the community when replying so anything you write doesn't get messed up.  The explanation of what to remove based what we're seeing doesn't make a lot of sense.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 10:14 AM (GMT 0) on 29 Feb 2024]
  • Hello Paul

      

    Thanks for your response.

    Here is a screenshot of the source.

    Screenshot of Trados Studio showing text with tags in yellow and purple, indicating a request to hide or minimize the yellow tags.

    Ideally, I'd like the tags in yellow either to be hidden or to look like the usual purple tags that you can hide (or display minimally)

    Image of a text document with multiple 'link to BE' icons, suggesting a need to manage hyperlink visibility or formatting in Trados Studio.

    I think the idea is to define a filter with start and end point of < and > respectively, and either exclude them or hide them

    but I don't have the brain power to recall how to do that ....

    Also, the documents were supplied in word, so I'll have to convert them back into html text first ...

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 10:14 AM (GMT 0) on 29 Feb 2024]
  •   

    ok - this adds exactly the context we needed for your question.  My previous answer was completely inappropriate for what you really need!  I cannot emphasise enough how important it is that the original questions are well written as we are looking at introducing AI to help with the provision of answers quickly... almost impossible for this to be effective when the questions don't provide enough context.  When they do we actually see pretty useful results and improving as we train it more.

    What you really need to know is how to properly prepare your file for translation so that the html content is correctly tagged.  For this you should tell us what your source filetype is and preferably provide a small sample.  Creating samples doesn't have to be hard.  Just take two or three paragraphs that contain the content you have a problem with, edit the text if you need to so you don't share content you're under an NDA for and attach the sample file to your post.

    Sending in huge files rarely helps... just irritates a little as we end up having to do the work you should have done to try and help you!

    Sorry for the minor rant... this is just a topic on my mind a lot recently since we are attempting to improve the value of the community for our users and this really means everyone doing their part to make this successful.  We all benefit from it in the end!

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
  • Dear  

    I edited the post to include the information above:

    File type is word, but needs to be converted back to html.
    Samples included above.
    Anything else I can provide?

    emoji
  •  

    I don't see the edits.  Maybe create a new reply?

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
  • Hi  

    This is a sample of the source.

    I'd like to hide the yellow tags. Everything to hide appears to be between < > tags

    Screenshot of Trados Studio showing text with yellow highlighted tags between angle brackets.

    or that they appear only as tag text that I could hide or show

    Trados Studio interface with options for 'No Tag Text', 'Partial Tag Text', and 'Full Tag Text'.

    The source document is supplied as word, but I could convert it back (copy and paste) into a text document with extension html.

    Anything else I can provide?

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 10:14 AM (GMT 0) on 29 Feb 2024]
  •  

    ok - so you want me to retype this content and then show you how to do it?  I'd also note that the snippet you show is probably indequate because it doesn't show well formed html.  So I'm going to assume there is some.  This is important because I presume you want to be able to handle the html correctly and not have to create hacked rules that make your life more difficult.  So here's a mocked up sample of what I think you're trying to achieve.

    And here's a short video showing you how to do it.  I just recorded it as I did it in-between meetings so apologies for the quality and my impatience... but hopefully it gives you what you need.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
  • A marvel of modern science.

    Thank you VERY much for taking the time out in your busy schedule to walk me through this.

    Screenshot of a text comparison in Trados Studio showing differences in paragraph numbers 111 to 115.

    I successfully created the rules for paragraph and span tags

    Screenshot of Trados Studio settings with 'Embedded content' processing enabled and rules for paragraph and span tags defined.

    As for screen recording, try screenrec, nice and lightweight, seems to work well :-)

    emoji


    Generated Image Alt-Text
    [edited by: Trados AI at 10:15 AM (GMT 0) on 29 Feb 2024]
  •  

    Glad you're sorted!

    I use https://www.faststone.org/FSCaptureDetail.htm and apart from the problems I seem to be seeing recently I have never found a tool better for the features it offers!  I also use Camtasia and this is good for longer more detailed stuff in Trados especially.  But Faststone is probably the best tool on my computer and I've had more than my moneys worth in the 15+ years I've had it.  I don't know about you but I get attached to my software ;-)

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
  • One further thought  . You said in your video that ... "it helps Studio segment the file".

    What does this mean? Can I direct Studio to segment at specific places, like after the paragraph mark </p> ??

    P.S. I use Faststone for image viewing, dimension changes and the like. Didn't know it did screenrec, will look.

    emoji