ruby-character-set 1.4.0 Ruby library to manipulate Unicode

CharacterSet is a C-extended Ruby library to work with sets of Unicode code points. It can read and write these sets in various formats and implements the stdlib Set interface for them. It also offers an alternate paradigm of String processing which grants much better performance than Regexp and String methods from the stdlib where applicable. Many parts can be used independently, e.g.:

  • CharacterSet::Character

  • CharacterSet::Parser

  • CharacterSet::Writer

  • RangeCompressor