YamCha is a generic, customizable, and open source text chunker
oriented toward a lot of NLP tasks, such as POS tagging,
Named Entity Recognition, base NP chunking, and Text Chunking.
YamCha is using a state-of-the-art machine learning algorithm
called Support Vector Machines (SVMs), first introduced by
Vapnik in 1995.
kcgi is a minimal CGI library for web applications in ISC licensed ISO C. It was
designed to be secure and auditable. See a Comparison of CGI Libraries in C for
alternatives. To start, download kcgi.tgz and run make install into your PREFIX
of choice. The kcgi(3) manpage documents usage. kcgi is a BSD.lv project
cgiparse is a C library that is used to build cgi's that can parse
web forms. It provides a few utility functions that allow your
cgi's to be easily debuged.
To parse a cgi form it is as easy as calling cgigetvalue("name").
dhttpd is a secure and efficient personal HTTP server. It does not need a
permanent IP address and can be run from a user account on high ports. Among
its features, dhttpd supports caching, customized error messages, low profile
memory usage, little hard disk space, highly portable code, and easy setup.
dhttpd is the perfect solution for the minimalist in all of us.
HAVP (HTTP Antivirus Proxy) is a proxy with a ClamAV anti-virus scanner. The
main aims are continuous, non-blocking downloads and smooth scanning of
dynamic and password protected HTTP traffic. Havp antivirus proxy has a parent
and transparent proxy mode. It can be used with squid or standalone.
Formication is a form processor which can send custom-formatted email,
save form data to custom-formatted files, and reply to the user with
a custom-formatted response. Formication is also able to require that
certain fields be filled in. All configuration is done with simple
templates, all of the templates for a given form being within the same
file.
jToolkit is a Python web application framework built on modpython and
Apache. There is also a simple command line webserver for running
applications from.
It is aimed at dynamically generated pages rather than mostly-static
pages (for which there are templating solutions). Pages can be produced
using a variety of widgets. It handles sessions and database
connections (and multi-database portability).
URLObject is a utility class for manipulating URLs. The latest incarnation of
this library builds upon the ideas of its predecessor, but aims for a clearer
API, focusing on proper method names over operator overrides. It's also being
developed from the ground up in a test-driven manner, and has full Sphinx
documentation.
LiveJournal is an open source content management system, written
mainly in Perl and utilizing MySQL as a database backend. By itself,
it serves as a powerful content updating system. In other uses, it
is the framework application behind many successful online communities,
including LiveJournal.com and DeadJournal.com.
ljdeps is a meta-port which installs all of the perl modules needed
by LiveJournal.
Java HTML/XML Compressor is a very small, fast and easy to use library
that minifies given HTML or XML source by removing extra whitespaces,
comments and other unneeded characters without breaking the content
structure. As a result pages become smaller in size and load faster.
A command-line version of the compressor is also included.