Marpa::HTML does "high-level" parsing of HTML. It allows handlers to be
specified for elements, terminals and other components in the hierarchical
structure of an HTML document. Marpa::HTML is an extremely liberal HTML parser.
Marpa::HTML does not reject any documents, no matter how poorly they fit the HTML
standards.
MathML::Entities is a content conversion filter for named XHTML+MathML
entities. There are over two thousand named entities in the XHTML+MathML
DTD. All the entities defined in the XHTML+MathML DTD, except the five
"safe" ones (&lt;, &gt;, &amp;, &quot;, &apos;), will be converted to the
equivalent numeric character references or to UTF-8 characters. Named
entities which are not in the XHTML+MathML DTD are escaped. This makes the
resulting XHTML (or XHTML+MathML) safe for consumption by non-validating
XML parsers.
Unlike HTML::Entities, the mapping between MathML named entities and
codepoints is many-to-one. Therefore, there's no particular sense in
having an inverse function, which takes codepoints to named entities.
Based on: HTML::Entities by Koichi Taniguchi <taniguchi@livedoor.jp>
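The conversion described above might be used roughly as follows. This is a
minimal sketch that assumes the module exports name2numbered() and
name2utf8() by default; the sample markup is invented for illustration.

```perl
use strict;
use warnings;
use MathML::Entities;   # assumed to export name2numbered() and name2utf8()

# Hypothetical input mixing two MathML entities with one "safe" entity.
my $xhtml = '<p>&alpha; &lt; &beta;</p>';

# Named entities become numeric character references; &lt; is left alone.
my $numbered = name2numbered($xhtml);

# Alternatively, named entities become UTF-8 characters.
my $utf8 = name2utf8($xhtml);

print "$numbered\n";
```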
Net::Groonga::HTTP is a client library for the Groonga HTTP server.
Groonga is a fast full-text search engine. See
http://groonga.org/.
Small module for inflecting pronouns for a bunch of different
genders.
Seamus Venasse <svenasse@polaris.ca>
Determine the infinitive form of a conjugated word. Also,
determine the suffix used to identify which rule to apply to
transform the conjugated word into the infinitive form.
Seamus Venasse <svenasse@polaris.ca>
This module provides an easy-to-use interface for encoding and decoding
Internationalized Domain Names (IDNs).
IDNs use characters drawn from a large repertoire (Unicode), but IDNA
allows the non-ASCII characters to be represented using only the ASCII
characters already allowed in so-called host names today
(letter-digit-hyphen, "/[A-Z0-9-]/i").
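A short round trip may make the encoding concrete. The text above does not
name the module, so this sketch assumes it is Net::IDN::Encode and uses that
module's domain_to_ascii() and domain_to_unicode() functions; the domain
name is a made-up example.

```perl
use strict;
use warnings;
use utf8;   # this source file contains a non-ASCII string literal
use Net::IDN::Encode qw(domain_to_ascii domain_to_unicode);   # assumed module

my $unicode = 'müller.example';   # hypothetical domain name

# Encode each non-ASCII label into its ASCII ("xn--") form.
my $ascii = domain_to_ascii($unicode);

# Decode the ASCII form back into Unicode characters.
my $round_trip = domain_to_unicode($ascii);

binmode STDOUT, ':encoding(UTF-8)';
print "$ascii\n$round_trip\n";
```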
These modules provide a basis for parsing Snort configuration files and
rules, so that tools which manipulate rulesets can be built with less effort.
An example tool, snortconfig, is included.
The exportable subroutines of Lingua::EN::Inflect provide plural
inflections and "a"/"an" selection for English words.
Plural forms of all nouns, most verbs, and some adjectives are
provided. Where appropriate, "classical" variants (for example:
"brother" -> "brethren", "dogma" -> "dogmata", etc.) are also
provided.
Seamus Venasse <svenasse@polaris.ca>
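As an illustration of the Lingua::EN::Inflect description above, the sketch
below uses the exportable PL(), A() and classical() subroutines. The word
list is arbitrary, and calling classical() with no arguments is taken to
enable all classical variants.

```perl
use strict;
use warnings;
use Lingua::EN::Inflect qw(PL A classical);

my @nouns = qw(brother dogma box);   # arbitrary sample words

# Plurals in the default (modern) mode.
my @modern = map { PL($_) } @nouns;

# Switch on all classical variants, then inflect the same words again.
classical();
my @classic = map { PL($_) } @nouns;

print "modern:    @modern\n";
print "classical: @classic\n";

# A() prepends the appropriate indefinite article, "a" or "an".
my $with_article = A('owl');
print "$with_article\n";
```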
YASA is a simple suffix array implementation for counting the
frequency of a given text or string.
This module provides a Perl interface to YASA via REST.
This is a module for finding IP addresses in plain text.
NetAddr::IP::Find exports one function, find_ipaddrs(). It
works much like URI::Find's find_uris() or
Email::Find's find_emails().
$num_ipaddrs_found = find_ipaddrs($text, \&callback);
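A fuller sketch of the call above follows. It assumes that the callback
receives a NetAddr::IP object and the original matched text, and that
whatever the callback returns replaces the match in $text, in the manner of
URI::Find. The sample text is invented.

```perl
use strict;
use warnings;
use NetAddr::IP::Find;   # exports find_ipaddrs()

my $text = 'Hosts 192.168.0.1 and 10.0.0.5 answered the ping.';

my @seen;
my $num_ipaddrs_found = find_ipaddrs($text, sub {
    # Assumed arguments: a NetAddr::IP object and the text as matched.
    my ($ipaddr, $orig) = @_;
    push @seen, $orig;
    return "[$orig]";    # assumed to replace the address in $text
});

print "found $num_ipaddrs_found address(es): @seen\n";
print "$text\n";
```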