Perl extension for serialize/deserialize AMF0/AMF3 data
These are experimental modules to handle various Unicode issues. They
were made before perl included native UTF8 support.
The current set of modules are:
Unicode::String - represent strings of Unicode chars
Unicode::CharName - look up character names
Unicode::Map8 - mapping tables towards 8-bit char sets
(the Unicode::Map8 module is distributed separately)
Igbinary is a drop in replacement for the standard PHP serializer.
Instead of time and space consuming textual representation, igbinary
stores PHP data structures in a compact binary form. Savings are
significant when using memcached or similar memory based storages for
serialized data.
This module provides some functions which emulate
the corresponding CORE functions
to manipulate multiple-byte character strings.
This module uses the OOP syntax to tell methods
in which charset the specified strings should be handled.
It often happens that you have non-Roman text data in Unicode, but you
can't display it -- usually because you're trying to show it to a user
via an application that doesn't support Unicode, or because the fonts
you need aren't accessible. You could represent the Unicode characters
as "???????" or "\15BA\15A0\1610...", but that's nearly useless to the
user who actually wants to read what the text says.
What Text::Unidecode provides is a function, unidecode(...) that takes
Unicode data and tries to represent it in US-ASCII characters.
Perl interface to the libfribidi library that implements the Unicode bidi
algorithm. The bidi algorithm is a specification for displaying text that
consists of both left-to-right and right-to-left written languages.
The Text::Iconv module provides a Perl interface to the iconv() function
as defined by the Single UNIX Specification. The convert() method
converts the encoding of characters in the input string from the
fromcode codeset to the tocode codeset, and returns the result.
IMAP mailbox names are encoded in a modified UTF7 when names contains
international characters outside of the printable ASCII range. The
modified UTF-7 encoding is defined in RFC2060 (section 5.1.3).
This module converts strings from and to 2-byte Unicode UCS2 format.
All mappings happen via 2 byte UTF16 encodings, not via 1 byte UTF8
encoding. To convert between UTF8 and UTF16 use Unicode::String.
For historical reasons this module coexists with Unicode::Map8.
Please use Unicode::Map8 unless you need to care for >1 byte character
sets, e.g. chinese GB2312. Anyway, if you stick to the basic
functionality (see documentation) you can use both modules equivalently.
Practically this module will disappear from earth sooner or later as
Unicode mapping support needs somehow to get into perl's core. If you
like to work on this field please don't hesitate contacting Gisle Aas
and check out the mailing list perl-unicode!
Unicode::Map8
-------------
The Unicode::Map8 class implement efficient mapping tables between
8-bit character sets and 16 bit character sets like Unicode. About
170 different mapping tables between various known character sets and
Unicode is distributed with this package. The source of these tables
is the vendor mapping tables provided by Unicode, Inc. and the code
tables in RFC 1345. New maps can easily be installed.
By coincidence Martin Schwartz created a similar module at the same
time I did. His module is called Unicode::Map and should be available
on CPAN too. Both modules now support a unified interface. Martin's
module will be depreciated in the future.
Since UTF8 support is coming to Perl soon, there might be good reasons
to move this module in the direction of mapping to/from UTF8. I will
probably do so, once the Unicode support in the Perl core settle.
COPYRIGHT 1998-1999 Gisle Aas. All rights reserved.