2
lorentz
1y

Yet another unusual take for the Orchid STL: Unicode codepoints aren't a part of the string library.

For the purposes of a high level language, the unit of text is a grapheme. Strings can be converted between Unicode and binary blobs. In a binary, indices address bytes. In text, indices address graphemes. For example, searching a string for a substring that consists of a single letter implies the added constraint that the letter must not have accents or other modifiers.

For storage and transfer optimization it's possible to discover the byte length of a string without converting it to binary

Comments
  • 0
    My inspiration here is how although Devrant appears to be unicode-safe, using fancy text can often result in your post getting cut off between a code point and an associated modifier.
  • 0
    Biggest drawback:

    Orchid's char primitive is implemented as a Rust String.
Add Comment