edit/crates/unicode-gen
Leonard Hecker 5f284a1df9
Move arena & helpers into their own crate (#694)
This will allow us to use the `Arena` in `build.rs`.
This changeset also contains a version bump of all dependencies.
2025-12-01 21:51:55 +01:00

README.md

Grapheme Table Generator

This tool processes Unicode Character Database (UCD) XML files to generate efficient, multi-stage trie lookup tables for properties relevant to terminal applications:

  • Grapheme cluster breaking rules
  • Line breaking rules (optional)
  • Character width properties
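
As a rough illustration of what a multi-stage trie lookup looks like, here is a minimal two-stage sketch. It is not the code this tool emits: the block size, the `TwoStageTrie` type, and the toy property values are all assumptions made for the example. The idea is that the code point space is split into fixed-size blocks, identical blocks are stored only once, and a first-stage table maps each block number to the shared copy.

```rust
// Hypothetical two-stage trie; names and sizes are illustrative only
// and do not mirror the generator's actual output.
const BLOCK_BITS: u32 = 4;
const BLOCK_SIZE: usize = 1 << BLOCK_BITS;

struct TwoStageTrie {
    stage1: Vec<u16>, // block number -> index of a deduplicated block
    stage2: Vec<u8>,  // concatenated unique blocks of property values
}

impl TwoStageTrie {
    /// Builds the trie from a flat array holding one value per code point.
    fn build(values: &[u8]) -> Self {
        let mut stage1 = Vec::new();
        let mut stage2: Vec<u8> = Vec::new();

        for chunk in values.chunks(BLOCK_SIZE) {
            // Pad the last block with zeros if the input is not block-aligned.
            let mut block = [0u8; BLOCK_SIZE];
            block[..chunk.len()].copy_from_slice(chunk);

            // Reuse an identical block if one was already stored.
            let existing = stage2.chunks(BLOCK_SIZE).position(|b| b == &block[..]);
            let index = match existing {
                Some(i) => i,
                None => {
                    stage2.extend_from_slice(&block);
                    stage2.len() / BLOCK_SIZE - 1
                }
            };
            stage1.push(index as u16);
        }

        Self { stage1, stage2 }
    }

    /// Looks up the value for `cp`; code points beyond the table yield 0.
    fn get(&self, cp: u32) -> u8 {
        match self.stage1.get((cp >> BLOCK_BITS) as usize) {
            Some(&block) => {
                let offset = cp as usize & (BLOCK_SIZE - 1);
                self.stage2[block as usize * BLOCK_SIZE + offset]
            }
            None => 0,
        }
    }
}

fn main() {
    // Toy data: 64 "code points" with width 1, except 32..48 with width 2.
    let mut values = vec![1u8; 64];
    for v in &mut values[32..48] {
        *v = 2;
    }

    let trie = TwoStageTrie::build(&values);

    // Four input blocks collapse into two unique ones.
    assert_eq!(trie.stage2.len(), 2 * BLOCK_SIZE);
    assert_eq!(trie.get(5), 1);
    assert_eq!(trie.get(40), 2);
    assert_eq!(trie.get(1000), 0);
}
```

Because most Unicode properties are constant over long runs of code points, deduplicating blocks this way tends to shrink the tables considerably while keeping each lookup to a couple of array reads.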

Usage

  • Download ucd.nounihan.grouped.zip and extract ucd.nounihan.grouped.xml from it
  • Run some equivalent of:
    grapheme-table-gen --lang=rust --extended --no-ambiguous --line-breaks path/to/ucd.nounihan.grouped.xml
    
  • Place the result in src/unicode/tables.rs