Cannot import an html file into my project, only right click > translate with Studio

Hi!
I have been struggling for months with a csv file (cobby export from Magento shop content).

3 columns are relevant for me: 1 with product name (regular text), and 2 with html content.

So far I only managed to do the following:
* save each column separately, i.e. product names in a dedicated excel file for regular adding to my Studio project and each of the 2 html columns into 2 separated Notepad ++ documents for saving as html.

* Those 2 html files I can only translate via the context menu in MS Explorer (Translate in SDL Trados Studio), I cannot add them to my regular project, the file format won't be detected and the files will automatically be set as reference files (not possible to change to translatable files).

Can anybody think of a better way to do this, ideally in one shot or at least with all 3 files in one project, so that I can get a usable analyse?

I attached a simplified sample of the file
Thanks a lot in advance

Regards

sample.xlsxChloé

  •  

     Did you create proper HTML files in Notepad or just pasted the content and changed the file name extension to *.html?
    Anyway, this way you will get huge problems in getting the content back to Excel.
    Much better way would be saving the whole CSV als Excel 2016 (XLSX) and then use the embedded content processing in the corresponding file type in Studio 2017.
    In your project settings, where you have the csv, go to File types, then to Excel 2007-2016, click on it, select Embedded content and activate it. Then remove the CSV and add the whole Excel. Your HTML codes in Excel will now be tags and you will be able to have a whole file in the end in a format easily storable as CSV. Of course provided your language is supported by the CSV - as CSV files are NOT Unicode...

    _________________________________________________________

    When asking for help here, please be as accurate as possible. Please always remember to give the exact version of product used and all possible error messages received. The better you describe your problem, the better help you will get.

    Want to learn more about Trados Studio? Visit the Community Hub. Have a good idea to make Trados Studio better? Publish it here.

  • Hi,

    Thanks for your reply,
    my language is German into French, I think I shouldn't have any problem because of that.

    I just pasted the content of the "html" columns, i.e. text with html tags and where accentuated letters are written for example as "&auml;" for ä, into the notepad++ file, for example "<p><span style="font-size: 12px;">Unsere <strong>Gasger&auml;te </strong>der <strong>hochwertigen Dexion Serie 98</strong>"
    I must say I don't have any problem getting it back into Excel (because I only paste one excel column in each notepad++ file, else I have problems...), except sometimes I have do add the line breaks again in notepad++ to get the same number of lines again, but when I do that then I can just copy and paste it back into the excel column.

    I am aware that this is not at all a handy solution but it is the only one I found to get the content segmented right in Studio AND all the html tags displayed properly.

    I already tried working with the excel file as xlsx directly, I tried lots of stuff to get it properly segmented and tagged but unsuccessfully, at least one of these requirements was not met every time.

    I will try again tonight with your suggestion with the xlsx file, hopefully with your tricks I will get it right this time.
    I am very surprised not to find more people online asking about cobby exports of magento, I thought those might be popular tools? Do you know them?

    Thanks a lot again,
    Kind regards

    Chloé
  • Hi,
    I just tested. Now I remember why I preferred to find another way: When I import my excel file like this, I have the following problems which I don't know how to solve:

    1. I don't have one "sentence" in one segment but a whole paragraph because for example, "<br> or <p> tags are protected but not defined as a segment separator (and I don't know I to tell Trados to start a new segment after a <br> or <p> tag). This is also the case for other tags that separate "sentences" (ul, li I think). I use quotation marks because these are not sentences in the grammatical sense of the term, there are no full stops or anything at the end, these are lines of a bullet list.

    2. Accentuated letters which are displayed as code (schalld&auml;mmend for schalldämmend) are not converted into the right accentuated letters for translation and back as code after translation (which I can get done when translating the text prepared in notepad ++ and saved as html)

    3. Some of the [cLineBreak] tags (I don't know what they are and didn't find a way to display them as protected tags in Studio) are included at the end of segments instead of being isolated in one segment like I have it with the html files (messes with my pretranslation and with the analyse of fuzzy matches)

    If you have thoughts about these issues so that I could solve them and keep just processing an Excel 2016 file, it would be the best way indeed. But these 3 issues mess so much with the analyse and the use of perfect matches that I cannot have them and still save time when I do all the copy and pasting into Notepad++, etc.

    I am very curious to hear from you Jerzy and other users and to read your input on this.

    Thanks and best wishes
    Chloé
  • Hi,
    I also wanted to address separately the problem of translating html files in Studio, can someone explain why I can't just add a html file into a project and then Batch Tasks > Prepare ? Is it because it is not a proper html document with proper html structure?
    Thanks,
    Chloé