HTML::Entities::Interpolate is a pure Perl module to call
HTML::Entities::encode_entities, via a hash, within a string.
HTML::EscapeEvil is a Perl module that will escape HTML tags considered
evil (useful for user inut filtering). It is somewhat similar to
HTML::TagFilter, but escapes tags instead of removing them.
Apache::ParseLog provides an easy way to parse the Apache log
files, using an object-oriented constructs. The data obtained
using this module are generic enough that it is flexible to
use the data for your own applications, such as CGI, simple
text-only report generater, feeding RDBMS, data for Perl/Tk-based
GUI application, etc.
This is a collection of modules that formats text from HTML syntax
trees. Bug reports and discussions about these modules can be sent
to the <libwww@perl.org> mailing list.
The modules present in this package are:
HTML::Formatter - Base class for various formatters. Formatters
traverse a syntax tree and produce some textual output.
None of the current formatters handle tables or forms yet.
HTML::FormatText - Formatter that converts a syntax tree to plain
readable text.
HTML::FormatPS - Formatter that outputs PostScript code.
HTML-FormatExternal lets you turn HTML into plain text using one of
the browsing/formatting programs,
elinks http://elinks.cz/
html2text http://www.mbayer.de/html2text/
links http://links.twibright.com/
lynx http://lynx.isc.org/
netrik http://netrik.sourceforge.net/
vilistextum http://bhaak.dyndns.org/vilistextum/
w3m http://sourceforge.net/projects/w3m
zen http://www.nocrew.org/software/zen/
This module is an implementation of the CSS parts of Yahoo! YUIcompressor
in Perl. It was needed to produce minified css on the fly using Perl
based backend systems.
Converts HTML to Text with tables in tact
CSS::Minifier removes unnecessary whitespace from CSS. The primary requirement
developing this module is to not break working stylesheets: if working CSS is in
input then working CSS is output. The Mac/Internet Explorer comment hack will be
minimized but not stripped and so will continue to function.
This module understands space, horizontal tab, new line, carriage return, and
form feed characters to be whitespace. Any other characters that may be
considered whitespace are not minimized. These other characters include
paragraph separator and vertical tab.
For static CSS files, it is recommended that you minify during the build stage
of web deployment. If you minify on-the-fly then it might be a good idea to
cache the minified file. Minifying static files on-the-fly repeatedly is
wasteful.
HTML::FormatText::WithLinks takes HTML and turns it into plain text
but prints all the links in the HTML as footnotes. By default, it
attempts to mimic the format of the lynx text based web browser's
--dump option.
HTML::HTML5::Parse is substantially the same as the non-CPAN module
Whatpm::HTML. Changes include:
* Provides an XML::LibXML-like DOM interface. If you usually use
XML::LibXML's DOM parser, this should be a drop-in solution for tag
soup HTML.
* Constructs an XML::LibXML::Document as the result of parsing.
* Via bundling and modifications, removed external dependencies
on non-CPAN packages.