Guide to OmniMark 8
constant  

utf8.byte-order-mark

 
 

Library: UTF-8 (OMUTF8)
Import: omutf8.xmd
Return type: String

Returns: UTF-8 byte-order mark


Declaration

constant string byte-order-mark
    


Purpose

Some UTF-8 data streams begin with a byte-order mark (also called a signature). Because this mark can interfere with later processing, it is sometimes better to remove it from the input. For example, a UTF-8 byte-order mark can interfere with XML processing, so it can be dropped from the input before the input is passed to the XML parser:


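  import "omutf8.xmd" prefixed by utf8.

  process
     ; Drop the byte-order mark if present ("?" makes it optional),
     ; then parse the remaining input as XML.
     do xml-parse scan file "input.xml" drop utf8.byte-order-mark?
        suppress
     done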
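
For readers who want to see what is being dropped: the UTF-8 byte-order mark is the three-byte sequence EF BB BF, which is the UTF-8 encoding of the code point U+FEFF. The following is a minimal Python sketch of the same "drop the mark if it is present" idea. It is an illustration only, not OmniMark code and not part of the OMUTF8 library; the function name `drop_utf8_bom` is hypothetical.

```python
import codecs

# The UTF-8 byte-order mark: the UTF-8 encoding of U+FEFF (bytes EF BB BF).
UTF8_BOM = codecs.BOM_UTF8


def drop_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 byte-order mark, if one is present.

    Hypothetical helper for illustration; input that does not start with
    the mark is returned unchanged.
    """
    if data.startswith(UTF8_BOM):
        return data[len(UTF8_BOM):]
    return data
```

As with the optional pattern in the OmniMark example, input that has no byte-order mark passes through unchanged, so the step is safe to apply to any UTF-8 input.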
          

Related Concepts
   Pattern matching
   Character set encoding

Other Library Functions
   utf8.byte-order-mark
   utf8.char
   utf8.code-point
   utf8.encoding
   utf8.multi-byte-char
   utf8.omutf8-version
   utf8.single-byte-char
 
 


OmniMark 8.2.0 Documentation Generated: March 13, 2008 at 3:27:39 pm

Copyright © Stilo International plc, 1988-2008.