convmv is meant to help convert a single filename, a directory tree and the
contained files or a whole filesystem into a different encoding. It just
converts the filenames, not the content of the files. A special feature of
convmv is that it also takes care of symlinks, also converts the symlink target
pointer in case the symlink target is being converted, too.
All this comes in very handy when one wants to switch over from old 8-bit
locales to UTF-8 locales. It is also possible to convert directories to UTF-8
which are already partly UTF-8 encoded. convmv is able to detect if certain
files are UTF-8 encoded and will skip them by default. To turn this smartness
off use the --nosmart switch.
dumpasn1 is an ASN.1 parser which dumps the contents of an ASN.1-encoded
file, as well as interpreting the OIDs contained in the file into
human-readable format. Dumpasn1 is intended for examining the contents
of PKI certificates and comes with a full list of security-related OIDs,
but can be easily extended to parse other OIDs as well.
HTML to XHTML converter written in Perl
A JSON pretty-printing library compatible with aeson as well as a
command-line tool to improve readabilty of streams of JSON data. The
/library/ provides the function "encodePretty". It is a drop-in
replacement for aeson's "encode" function, producing JSON-ByteStrings
for human readers. The /command-line tool/ reads JSON from stdin and
writes prettified JSON to stdout. It also offers a complementary
"compact"-mode, essentially the opposite of pretty-printing.
JSON (JavaScript Object Notation) is a lightweight data-interchange format.
It is easy for humans to read and write. It is easy for machines to parse
and generate. It is based on a subset of the JavaScript Programming
Language, Standard ECMA-262 3rd Edition - December 1999. This library
provides a parser and pretty printer for converting between Haskell values
and JSON.
Reasonably fast data encoding library.
Tools for the conversion to and from UTF-8 Unicode encoding. Note that
RFC-2277 mandates that all "protocols" MUST handle UTF-8 properly.
- utrans converts text files created using any 8-bit character
map into UTF-8;
- uhtrans converts UTF-8 files into 7-bit ASCII with anything
else formatted as an HTML-style tags, e.g. Ӓ (decimal);
- hutrans converts 7-bit ASCII files with HTML-style tags, to UTF-8,
thus complementing the functionality of hutrans;
- ptrans converts UTF-8 files into 8-bit text using any
8-bit character map, thus complementing utrans.
Additionally, tuc is installed if not found. Tuc converts text files
between the DOS/Windows and the Unix formats.
This port depends on ports/converters/libutf-8.
Further details: RFC 2277, and RFC 2279.
ical2html takes an iCalendar file and outputs an HTML file showing
one or more months in the form of tables.
A library of C routines for the conversion of Unicode to UTF-8 and back.
The library can also be used to convert the 31-bit UCS-4 mappings to UTF-8 and
back.
This library provides an iconv() implementation, for use on systems which
don't have one, or whose implementation cannot convert from/to Unicode.
It can convert from any of these encodings to any other, through Unicode
conversion. It has also some limited support for transliteration, i.e.
when a character cannot be represented in the target character set, it can
be approximated through one or several similarly looking characters.
libiconv is for you if your application needs to support multiple character
encodings, but that support lacks from your system.
See either README or website for the list of supported encodings.