Html Codes within Excel File

Hello All

I want Trados Studio to split excel cell contents into segments based on embedded HTML codes

e.g:

<B>Product Features</B><BR>Throws are acrylic knitted<BR>Product size : 130x170 cm <BR>Product colour is beige. <BR><BR><B>Washing Recommendations</B><BR>Washable at 30 degrees.<BR>Do not bleach.<BR>Do not iron.

----------------------

Product Features
Throws are acrylic knitted
Product size : 130x170 cm
Product colour is beige. 
Washing Recommendations
Washable at 30 degrees.
Do not bleach.
Do not iron.

--------

this is a sample

sample-br.xlsx

thanks 

Parents Reply
  • I find this regex-based approach rather amateurish, cumbersome and most importantly failing big time with just a little bit more complex HTML code, not mentioning anything more complicated (containing entities, comments, inline scripts, etc.)

    Therefore I use a simple script which exports the HTML content to very simple XML structure like this

    <string cell="A1">blablabla, some complicated HTML code</string>

    This is then easily processed using the XML with HTML embedded content, with all comfort of embedded content parser.

    And then it's again very easily injected back into the appropriate places into the original Excel sheet using another  rather dumb script which only reads the cell location from the XML element attribute and puts the string in the cell.

    Not a method for average Joe Translator, I know...

Children