Talk:Mork (file format)

From Wikipedia, the free encyclopedia
Jump to: navigation, search
WikiProject Computing (Rated Stub-class)
WikiProject icon This article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
Stub-Class article Stub  This article has been rated as Stub-Class on the project's quality scale.
 ???  This article has not yet received a rating on the project's importance scale.
 
Note icon
This article has been automatically rated by a bot or other tool as Stub-Class because it uses a stub template. Please ensure the assessment is correct before removing the |auto= parameter.

Unicode[edit]

Is this accurate? "...storing Unicode text takes three or six bytes per character." Assuming that Mork uses UTF-16 to encode Unicode, it would take either 2 or 4 bytes per character. Perhaps the format uses some funky encoding rather than UTF-16? —Preceding unsigned comment added by Remline (talkcontribs) 15:15, 21 March 2009 (UTC)

From the comments here:

...writes out Unicode strings without using UTF-8: writes out the unpacked wchar_t characters!...Worse, it hex-encodes each wchar_t with a 3-byte encoding, meaning the file size will be 3x or 6x (depending on whether whchar_t is 2 bytes or 4 bytes.)...