Skip to content
Snippets Groups Projects

Draft: Support UTF-8

Open Jakob Moser requested to merge feat/support-utf8 into master

To quote from https://gitlab.cl.uni-heidelberg.de/Tutoren/syntax/flexi/-/issues/31

That problem is known. The problem is that XLE does, by default, not use UTF-8 (neither when reading the LFG file nor when reading the sentence). This means it interprets the "ü" as something different both in the grammar and in the sentence (namely the "ü"), but at least it parses.

You can add a line to the grammar to tell XLE it should interpret it as UTF-8, but then you'd also have to tell XLE to interpret the input as UTF-8. That can probably be done, I'm working on it in xle-docker, it just isn't as easy as I thought, because the encoding does not apply to the xlerc file in which I save the sentence to be parsed.

See also: https://ling.sprachwiss.uni-konstanz.de/pages/xle/doc/xle.html#SEC23

Maybe helpful command: set-character-encoding * utf-8

Edited by Jakob Moser

Merge request reports

Ready to merge by members who can write to the target branch.
Loading

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
Please register or sign in to reply
Loading