Talk:Mork (file format)
Is this accurate? "...storing Unicode text takes three or six bytes per character." Assuming that Mork uses UTF-16 to encode Unicode, it would take either 2 or 4 bytes per character. Perhaps the format uses some funky encoding rather than UTF-16? —Preceding unsigned comment added by Remline (talk • contribs) 15:15, 21 March 2009 (UTC)
From the comments here:
...writes out Unicode strings without using UTF-8: writes out the unpacked wchar_t characters!...Worse, it hex-encodes each wchar_t with a 3-byte encoding, meaning the file size will be 3x or 6x (depending on whether wchar_t is 2 bytes or 4 bytes.)...
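To make the arithmetic concrete, here is a rough Python sketch of one possible reading of that comment. It assumes that every byte of each wide character is written as a `$` followed by two hex digits, so three ASCII bytes per source byte. The helper names `hex_escape` and `escaped_size` are made up for illustration and are not taken from any Mork code, and the sketch only handles characters in the Basic Multilingual Plane.

```python
import struct

def hex_escape(raw: bytes) -> str:
    """Hypothetical escape: each byte becomes '$' plus two hex digits."""
    return "".join(f"${b:02X}" for b in raw)

def escaped_size(text: str, wchar_bytes: int) -> int:
    """Bytes needed after escaping each wide character of the given width.

    Assumes wchar_t is 2 or 4 bytes and every character fits in one unit
    (BMP only when wchar_bytes == 2).
    """
    fmt = {2: "<H", 4: "<I"}[wchar_bytes]
    raw = b"".join(struct.pack(fmt, ord(ch)) for ch in text)
    return len(hex_escape(raw))

sample = "\u00e9"  # a single BMP character (e with acute accent)

utf16_size = len(sample.encode("utf-16-le"))  # plain UTF-16, no escaping
size_w2 = escaped_size(sample, 2)             # escaped, 2-byte wchar_t
size_w4 = escaped_size(sample, 4)             # escaped, 4-byte wchar_t

print(utf16_size, size_w2, size_w4)
```

Under these assumptions one BMP character costs 2 bytes in plain UTF-16 but 6 or 12 bytes once escaped, which is 3x or 6x the UTF-16 size. That would fit the "3x or 6x" figure in the quote, though the per-character byte counts in the article may have been derived differently.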