Remove min_code_units and max_code_units data members from encodings #3

tahonermann · 2016-01-22T04:02:40Z

The min_code_units and max_code_units data members of encoding types were originally added to facilitate code unit storage allocation given a sequence of code points. For encodings like UTF16 with a BOM, these data members don't suffice due to the storage overhead required for a BOM. Additionally, encodings that support non-code-point encoding state transitions, overhead is potentially unbounded and not reflected in these values.

At present, these values are only used to determine if an encoding can potentially support random access to code points (when min_code_units == max_code_units and the encoding is stateless). Removal of these data members will require some other means to determine if an encoding is a stateless fixed-length encoding.

tahonermann added the enhancement label Jan 22, 2016

tahonermann self-assigned this Jan 22, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove min_code_units and max_code_units data members from encodings #3

Remove min_code_units and max_code_units data members from encodings #3

tahonermann commented Jan 22, 2016

Remove min_code_units and max_code_units data members from encodings #3

Remove min_code_units and max_code_units data members from encodings #3

Comments

tahonermann commented Jan 22, 2016