Parsec is designed from scratch as an industrial-strength parser
library. It is simple, safe, well documented (on the package homepage),
has extensive libraries and good error messages, and is also fast. It
is defined as a monad transformer that can be stacked on arbitrary
monads, and it is also parametric in the input stream type.
TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making it
ideal for screen-scraping.
libodfgen is a library for generating documents in Open Document Format
(ODF). It provides generator implementations for the document interfaces of
the following libraries:
* libwpd (::WPXDocumentInterface): text documents
* libwpg (libwpg::WPGPaintInterface): vector drawings
* libetonyek (libetonyek::KEYPresentationInterface): presentations
As these APIs are used by multiple libraries, libodfgen can be used to
generate ODF from many sources.
Liblinebreak is an implementation of the line and word breaking algorithm
as described in Unicode 5.1.0 Standard Annex 14, Revision 22. It breaks
lines that contain Unicode characters. It is designed to be used in a
generic text renderer. FBReader is one real-world example of a program that
uses it.
Kibana is an open source (Apache-licensed), browser-based analytics and search
interface to Logstash and other timestamped data sets stored in ElasticSearch.
With Logstash and ElasticSearch in place, Kibana is a snap to set up and start
using (seriously).
Kibana strives to be easy to get started with, while also being flexible and
powerful.
Loook is a simple Python tool that searches for text strings in
LibreOffice and OpenOffice.org files.
AND, OR and phrase searches are supported. It doesn't create an index,
but searching should be fast enough unless you have a very large number of
files.
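
The index-free search described above can be sketched in Python as follows.
This is not Loook's actual code. It is a minimal illustration that relies on
the fact that ODF files are ZIP archives whose document body is stored in
content.xml. The function names (odf_text, matches, search) and the mode
strings are hypothetical and chosen only for this example.

```python
import zipfile
from xml.etree import ElementTree


def odf_text(path):
    """Return the plain text of an ODF file (content.xml with markup removed)."""
    with zipfile.ZipFile(path) as archive:
        root = ElementTree.fromstring(archive.read("content.xml"))
    # itertext() yields every text node. Joining with spaces and then
    # collapsing whitespace is a simplification: a word split across two
    # formatting spans would be separated by a space.
    return " ".join(" ".join(root.itertext()).split())


def matches(text, terms, mode="and"):
    """Case-insensitive substring search; mode is 'and', 'or' or 'phrase'."""
    haystack = text.lower()
    needles = [term.lower() for term in terms]
    if mode == "phrase":
        return " ".join(needles) in haystack
    if mode == "or":
        return any(needle in haystack for needle in needles)
    return all(needle in haystack for needle in needles)


def search(paths, terms, mode="and"):
    """Scan each file in turn (no index) and return the paths that match."""
    hits = []
    for path in paths:
        try:
            if matches(odf_text(path), terms, mode):
                hits.append(path)
        except (zipfile.BadZipFile, KeyError, ElementTree.ParseError):
            continue  # not a readable ODF file; skip it
    return hits
```

Because every file is opened and scanned on each query, the running time
grows linearly with the number of files, which is why a search like this is
only comfortable for moderately sized collections.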
This is the BSD-licensed version of the well-known diff utility, obtained from
OpenBSD. It lacks some features of GNU diff. Replacing GNU diff with this
version is a proposed project idea, but this version needs to be completed first.
Patches are highly appreciated.
This is the BSD-licensed version of the well-known grep utility, obtained from
OpenBSD. It lacks some features of GNU grep. Replacing GNU grep with this
version is a proposed project idea, but this version needs to be completed first.
Patches are highly appreciated.
This package builds a docset from sources in different formats. The
generated documents can all be nicely interlinked and have the same
look and feel.
Currently it knows how to handle the following input formats:
* POD
* HTML
and knows how to generate:
* HTML
* PS
* PDF
HTML::HTML5::Parser is substantially the same as the non-CPAN module
Whatpm::HTML. Changes include:
* Provides an XML::LibXML-like DOM interface. If you usually use
XML::LibXML's DOM parser, this should be a drop-in solution for tag
soup HTML.
* Constructs an XML::LibXML::Document as the result of parsing.
* Via bundling and modifications, removed external dependencies
on non-CPAN packages.