The stem function takes a scalar as a parameter and stems the word according to
Martin Porters Danish stemming algorithm.
Perl::Critic::Bangs is a collection of Perl::Critic policies
that will help make your code better.
This is a text analyzer for analyzing CJK texts. Plucene does not support CJK
texts natively. This module encodes terms in MIME::Base64 format to get around
this problem. Texts are assumbed to be in UTF-8 encoding.
An analyzer plugin for p5-Plucene search engine, which filters
StandardTokenizer with SnowballAnalyzer.
The whole idea of this module is to take advantage of all the syntax
colouring modules that exist (such as Perl::Tidy) to produce colourful
code examples in a POD document (after conversion to HTML).
Regexp::Common::Email::Address provides a regex to match email addresses
as defined by RFC 2822.
Patterns for CIDR blocks.
Instead of a dry technical overview, I am going to explain the structure of this
module based on its history. I consult at a company that generates customer
leads primarily by having websites that attract people (e.g. lowering loan
values, selling cars, buying real estate, etc.). For some reason we get more
than our fair share of profane leads. For this reason I was told to write a
profanity checker.
For the data that I was dealing with, the profanity was most often in the email
address or in the first or last name, so I naively started filtering profanity
with a set of regexps for that sort of data. Note that both names and email
addresses are unlike what you are reading now: they are not whitespace-separated
text, but are instead labels.
Therefore full support for profanity checking should work in 2 entirely
different contexts: labels (email, names) and text (what you are reading).
Because open-source is driven by demand and I have no need for detecting
profanity in text, only label is implemented at the moment. And you know the
next sentence: "patches welcome" :)
Regexp::Log::Common uses Regexp::Log as a base class, to generate regular
expressions for performing the usual data munging tasks on log files that
cannot be simply split().
SGML::Parser::OpenSP is an interface to the OpenSP parser. It allows for
Perl scripts to parse an SGML document (including HTML) using the OpenSP
parser.