Big5 (OMFFBIG5)

This library contains one OmniMark external source function and one OmniMark external output function implementation, as follows:

  • reader is an external source function that reads a value string source, its argument, and returns the text of that file converted from a Big5 encoding to a UTF-8 encoding. That is, the provided source is in Big5, but the program sees UTF-8.
  • writer is an external output function that accepts UTF-8 encoded data and writes that data to a value string sink, its first argument, converted from a UTF-8 encoding to a Big5 encoding. That is, the program writes UTF-8, but the provided output receives Big5.

The Big Five data format is interpreted/produced according to the Taiwanese Industry Standard.

The only kinds of errors that can occur are in conversion: finding a character that doesn't have a conversion in the other character set. In this case, the converted value use is DEL (0x7F) in the Big5 encoding, and NOT-A-CHARACTER (0xFFFD) in the Unicode (UTF-8) encoding.

Usage Note

To use omffbig5, you must import it into your program using a statement like this:

  import "omffbig5.xmd" prefixed by big5.

(Please see the import topic for more on importing.)