Add a utfbom
encoding that handles UTF-8, UTF-16BE, UTF-16LE, UTF-32BE, UTF-32LE
#33
Labels
utfbom
encoding that handles UTF-8, UTF-16BE, UTF-16LE, UTF-32BE, UTF-32LE
#33
text_view
currently definesutf8bom
,utf16bom
, andutf32bom
encodings that detect a BOM and dispatch to the appropriate non-BOM encoding to consume remaining input. However, autfbom
encoding would be useful to consume UTF-8, UTF-16, and UTF-32 formatted files that contain a BOM.There is a question of what to do if the input lacks a BOM. Options are to fail or fallback to an assumed encoding. A policy class could be used to allow programmer control; e.g., fail, fallback to UTF-8, etc...
The text was updated successfully, but these errors were encountered: