feature request: extend unicode support with full case folding #1260

@ReinierMaas

XRef: #428, https://docs.rs/regex/1.11.1/regex/index.html#unicode

According to the documentation:

Case insensitive searching is Unicode-aware and uses simple case folding.

I would like to request full case folding support.

Why do I need full case folding?

We currently use libicu, which supports full case folding. We tested regex on our datasets, and full case folding turned out to be a requirement for migrating from libicu to regex.

Do you need help (to implement this feature)?

I understand if you don't want to enable this by default. Putting it behind a feature flag would be a workable solution from our point of view. We can work on the feature if the regex team is open to having it.

We have not made any progress towards implementing this feature yet, i.e. there is no patch lying around.

Example

Regex (?i)sss with full case folding enabled would match:

  • sss
  • ßs
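
To illustrate the semantics being requested, here is a minimal sketch that uses only the Rust standard library. The `full_fold` helper is hypothetical and is not part of the regex crate. It hard-codes just the two sharp-s entries from Unicode's CaseFolding.txt that have a full ("F") mapping to "ss", and falls back to `char::to_lowercase` for everything else, which only approximates real case folding. It compares whole folded strings and does not show how a regex engine would implement this.

```rust
/// Hypothetical helper: approximates Unicode full case folding.
/// Only the sharp-s mappings are handled explicitly; a real implementation
/// would be driven by the complete CaseFolding.txt table.
fn full_fold(input: &str) -> String {
    let mut out = String::with_capacity(input.len());
    for c in input.chars() {
        match c {
            // Full ("F") mappings expand one character into several.
            'ß' | 'ẞ' => out.push_str("ss"),
            // Assumption: lowercasing stands in for the remaining mappings.
            _ => out.extend(c.to_lowercase()),
        }
    }
    out
}

fn main() {
    let pattern = full_fold("sss");
    for candidate in ["sss", "SSS", "ßs", "sß"] {
        let folded = full_fold(candidate);
        println!("{:?} folds to {:?}, equal to pattern: {}", candidate, folded, folded == pattern);
    }
}
```

Under this sketch all four candidates fold to the same string as the pattern. Simple case folding only maps one character to one character, so it cannot equate ß with ss.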

Feel free to ask for additional information if anything is missing from this request.
