swirl
Guide to OmniMark 8   OmniMark home
docs home 
IndexConceptsTasksSyntaxLibrariesLegacy LibrariesErrors
 
Functions       Platforms  
 

UTF-16 (OMFFUTF16)

 
 

This library contains one OmniMark external source function and one OmniMark external output function implementation, as follows:

Any malformed input data is read as a Unicode NOT-A-CHARACTER character (0xFFFD). The only malformed case recognized is if only half of a surrogate pair is found.

Read-in UTF-16 data is assumed by default to be big-endian, but leading and embedded Byte Order Marks (BOM) in the data are recognized and acted upon. A leading BOM is removed from the input, but embedded ones are left in.

writer has two further switch-valued arguments, placed ahead of the output argument. true is used as a default value in both cases. The two arguments are:

Any malformed output data is written as a Unicode NOT-A-CHARACTER character (0xFFFD). The only malformed cases recognized are characters too large to be encodable as UTF-16 (i.e. larger than 0xFFFF), and characters whose UTF-16 encodings would be the value of half of a surrogate pair.

A good place to find information on the details of UTF-16 encoding is: <http://www.unicode.org/unicode/faq/utf_bom.html>

Usage Note

To use omffutf16, you must import it into your program using a statement like this:

  import "omffutf16.xmd" prefixed by utf16.

(Please see the import topic for more on importing.)

Functions
   utf16.reader
   utf16.writer
 
Platforms
   HP/UX
   HP-UX Itanium 2
   IBM AIX 5.3
   Linux (Intel)
   Linux Red Hat Enterprise 5
   MS Windows 98/ME
   MS Windows NT/2000/XP
   MS Windows Vista
   Sun Solaris 8
 
 

Top [ INDEX ] [ CONCEPTS ] [ TASKS ] [ SYNTAX ] [ LIBRARIES ] [ LEGACY LIBRARIES ] [ ERRORS ]

OmniMark 8.2.0 Documentation Generated: May 6, 2008 at 10:12:26 am
If you have any comments about this section of the documentation, please use this form.

Copyright © Stilo International plc, 1988-2008.