Out of memory exception when importing large WordPress XML export in Trados

Hi,

I received an XML export of a website from a potential client. I need to analyze it in Trados Studio to quantify the word count. It's quite a large file (176 MB). When I try to import it into Trados, I get an "Out of memory exception" error.

I tried creating a custom XML file type but I'm not very familiar with XML file structure and can't figure it out. I just see a list of parser rules but I don't know where to start.

I've read several threads here on the forum with similar issues but they haven't helped me.

Is there a simple way to handle the XML file? If not, could you suggest either a way to split it into multiple parts, or a procedure for creating a file type that extracts only the textual content for localization (website text and SEO meta tags)?

Thanks in advance

Valeria

Here is a screenshot of the custom file type generated from the WordPress XML export file.

[Screenshot: Trados Studio options window showing the Parser settings for a custom XML file type. The table lists rules like category and wp:postmeta with translation settings.]

[edited by: RWS Community AI at 11:56 AM (GMT 1) on 25 Jun 2025]
  • Probability score: 0.6  

    Question improvement suggestion: Provide specific details about the steps you've already taken to create the custom XML file type and any error messages encountered during that process. This will help in diagnosing the issue more effectively.  

    In the meantime, based on the information you have provided, here is a suggested answer to your question:  

    Hello Valeria,

    Step 1: **Check the XML Structure**  

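    Before creating a custom XML file type, make sure the XML file is well-formed. With a 176 MB file, use a validator that reads the file as a stream rather than loading it into memory all at once, and note the line number of any syntax error it reports.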
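
As a rough illustration of that check, here is a minimal Python sketch that streams the file through the standard library's expat parser, so memory use should stay low whatever the file size. The function name `check_well_formed` is made up for this example, and the sketch only tests well-formedness; it does not validate against any schema.

```python
import io
import xml.parsers.expat


def check_well_formed(binary_file):
    """Return (True, "") if the stream is well-formed XML, else (False, message)."""
    parser = xml.parsers.expat.ParserCreate()
    try:
        # ParseFile reads the stream in chunks, so the whole file is never held in memory.
        parser.ParseFile(binary_file)
    except xml.parsers.expat.ExpatError as err:
        reason = xml.parsers.expat.ErrorString(err.code)
        return False, f"line {err.lineno}, column {err.offset}: {reason}"
    return True, ""


# Small in-memory demonstration; the second snippet has a missing closing tag.
print(check_well_formed(io.BytesIO(b"<rss><channel><item>text</item></channel></rss>")))
print(check_well_formed(io.BytesIO(b"<rss><channel></rss>")))
```

For the real export you would pass a file opened in binary mode, for example `open("export.xml", "rb")`, where `export.xml` is a placeholder for your own file name.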

    Step 2: **Create a Custom XML File Type**  

    Since you are encountering an "Out of memory exception," it may be beneficial to create a custom XML file type that filters out non-textual content. Here’s how you can start:

    - Open Trados Studio and go to File > Options > File Types.

    - Create a new XML file type based on the existing XML V2 file type.

    - In the parser rules, focus on defining rules that only extract the textual content you need for localization, such as website text and SEO meta tags.
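
If it is unclear which elements deserve a parser rule, it may help to list which element names in the export actually carry text. The sketch below is one possible way to do that in Python, assuming a standard-library-only setup; `text_inventory` is a hypothetical helper name. It counts whitespace-separated words per element, which is only a rough indicator and will not match the Trados analysis (HTML markup inside post bodies is counted as words, for instance). Namespaced names are reported in the `{namespace-uri}name` form, so an element written as `wp:postmeta` in the file shows up under the URI bound to the `wp` prefix.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter


def text_inventory(source):
    """Stream an XML file and count words of direct text per element name."""
    words = Counter()
    stack = []  # elements that are currently open, outermost first
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start":
            stack.append(elem)
            continue
        stack.pop()
        count = len(elem.text.split()) if elem.text else 0
        if count:
            words[elem.tag] += count
        if stack:
            # Detach the finished element from its parent so memory stays flat.
            stack[-1].remove(elem)
    return words


# Small in-memory demonstration with an invented namespace URI.
demo = io.BytesIO(
    b'<rss xmlns:content="urn:example:content"><channel>'
    b"<item><title>Hello big world</title>"
    b"<content:encoded>One two</content:encoded></item>"
    b"</channel></rss>"
)
for tag, count in text_inventory(demo).most_common():
    print(count, tag)
```

Elements with high counts are the likely candidates for translatable rules; elements that only hold IDs, dates, or URLs can be marked as not translatable.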

    Step 3: **Limit the Size of the XML File**  

    If the XML file is too large, consider splitting it into smaller parts. You can do this manually or with a script. Each part must remain well-formed, which means it has to repeat the enclosing root element and contain only complete elements.
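
A script for this could look roughly like the following Python sketch. It assumes the usual WordPress export layout of `rss` > `channel` > `item`, and the names `split_items` and `build_part` are invented for the example. Each part repeats the channel-level elements seen before the items, followed by a fixed number of items. Two caveats: CDATA sections come out as escaped text (equivalent for an XML parser, but not byte-identical), and `ET.register_namespace` changes a process-wide setting so that prefixes such as `wp:` are preserved in the output. The `xml_declaration` argument needs Python 3.8 or later.

```python
import io
import xml.etree.ElementTree as ET


def build_part(root, channel, header, items):
    """Wrap a batch of items in copies of the original root and channel elements."""
    new_root = ET.Element(root.tag, dict(root.attrib))
    new_channel = ET.SubElement(new_root, channel.tag, dict(channel.attrib))
    new_channel.extend(header)
    new_channel.extend(items)
    return ET.tostring(new_root, encoding="utf-8", xml_declaration=True)


def split_items(source, items_per_part):
    """Yield well-formed XML documents (bytes), each holding up to items_per_part items."""
    stack = []   # elements that are currently open, outermost first
    header = []  # channel-level elements other than <item> (site title, authors, ...)
    batch = []   # items collected for the part being built
    root = channel = None
    for event, payload in ET.iterparse(source, events=("start-ns", "start", "end")):
        if event == "start-ns":
            prefix, uri = payload
            ET.register_namespace(prefix, uri)  # keep the original prefixes on output
            continue
        if event == "start":
            stack.append(payload)
            continue
        stack.pop()
        if len(stack) != 2:
            continue  # only direct children of the channel are handled here
        root, channel = stack
        channel.remove(payload)  # detach so the source tree does not grow
        if payload.tag == "item":
            batch.append(payload)
            if len(batch) == items_per_part:
                yield build_part(root, channel, header, batch)
                batch = []
        else:
            header.append(payload)
    if batch:
        yield build_part(root, channel, header, batch)


# Small in-memory demonstration: two items, one item per part.
demo = io.BytesIO(
    b'<rss version="2.0"><channel><title>Site</title>'
    b"<item><title>A</title></item><item><title>B</title></item>"
    b"</channel></rss>"
)
for number, part in enumerate(split_items(demo, 1), start=1):
    print(number, len(part), "bytes")
```

For the real export you would open the file in binary mode and write each yielded part to its own file, for example `part_001.xml`, `part_002.xml`, and so on, choosing `items_per_part` so that each part is small enough for Trados to import.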

    Step 4: **Adjust Parser Rules**  

    Make sure that the parser rules do not include images or other non-translatable elements. You can set these elements as non-translatable in your custom file type settings.

    Step 5: **Test the Import**  

    Once you have created the custom file type, try importing the smaller XML files into Trados Studio again. Monitor the memory usage during the import process to see if the issue persists.

    If you continue to experience issues, consider reaching out to the RWS Support team for further assistance, especially if you are unfamiliar with XML file structures.

    RWS Community AI
