utf8trans
utf8trans - Transliterate UTF-8 characters according to a table
utf8trans {charmap} [file]...
utf8trans transliterates characters in the specified files (or standard input, if no files are specified) and writes the result to standard output. All input and output is in the UTF-8 encoding.
This program is usually used to render characters in Unicode text files as markup escapes or ASCII transliterations. (It is not intended for general charset conversions.)
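As a rough illustration of this kind of transliteration (not utf8trans's actual implementation), the following Python sketch replaces each character found in a table with a replacement string and passes all other characters through unchanged. The table entries and the function name `transliterate` are assumptions made for the example.

```python
# Hypothetical table: Unicode code point -> replacement string.
# These entries are examples only, not a table shipped with utf8trans.
TABLE = {
    0x00A9: "(C)",       # COPYRIGHT SIGN -> ASCII transliteration
    0x2014: "--",        # EM DASH -> ASCII transliteration
    0x00E9: "&eacute;",  # LATIN SMALL LETTER E WITH ACUTE -> markup escape
}


def transliterate(text: str, table: dict) -> str:
    """Replace every character whose code point is a key in `table`."""
    return "".join(table.get(ord(ch), ch) for ch in text)


if __name__ == "__main__":
    # Characters without a table entry are copied unchanged.
    print(transliterate("caf\u00e9 \u2014 \u00a9 2024", TABLE))
```

A replacement may be longer than the character it replaces, which is what makes markup escapes possible.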
The translation is done according to the rules in the "character map", which is read from the file charmap. It has the following format:
The above format is intentionally restrictive, to keep utf8trans simple. If an XML-based format is desired, the xmlcharmap2utf8trans script that comes with the docbook2X distribution converts character maps in XSLT 2.0 format to the utf8trans format.
utf8trans does not work with binary files, because malformed UTF-8 sequences in the input are replaced with U+FFFD characters. This limitation may be removed in the future. Null characters in the input, however, are handled correctly.
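The effect on binary input can be reproduced with any lenient UTF-8 decoder. The Python sketch below uses the standard `errors="replace"` decoding mode as a stand-in for what utf8trans does. It is an analogy only: the helper name `decode_lenient` is an assumption, and the number of U+FFFD characters utf8trans emits for a given malformed sequence may differ.

```python
def decode_lenient(data: bytes) -> str:
    """Decode UTF-8, substituting U+FFFD for malformed sequences."""
    return data.decode("utf-8", errors="replace")


if __name__ == "__main__":
    sample = b"ok\x00\xffend"  # contains a null byte and an invalid byte 0xFF
    text = decode_lenient(sample)

    # The null character survives; the invalid byte becomes U+FFFD.
    print([hex(ord(ch)) for ch in text])

    # Re-encoding does not restore the original bytes, which is why
    # binary data cannot pass through such a filter unharmed.
    print(text.encode("utf-8") == sample)
```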