Skip to content

Diacritics in UTF-8 Glossary File Not Working #44

Open
@DigitalProf

Description

@DigitalProf

I have attempted several times to upload a glossary such that diacritics in French will be handled correctly. I am sure is in UTF-8 and have verified in various ways.

A related issue is that Excel by default includes a Byte Order Mark (BOM), which I have seen is included in the first item in the glossary. Using Notepad++ I have converted the encoding of the file to exclude the BOM, but that does not correct the problem. Here is the link to the glossary file that I uploaded to DeepL using Python.

I am also attaching below:

  1. A screen grab of the "Save as" screen from Excel
  2. The results of a dump of the most recent glossary that I uploaded as well as a screen grab of the problems int the translated document
  3. The actual CSV
    SaveTestGlossaryV2
    GlossaryContents(V2)
    Sample Page of Character Encoding Problems

Metadata

Metadata

Assignees

No one assigned

    Labels

    api changeRequires changes to the DeepL API

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions