Larbin is a powerful web crawler (also called [web] robot, spider...). It
is intended to fetch a large number of web pages to fill the database of a
search engine. With a network fast enough, Larbin is able to fetch more than
100 million pages on a standard PC.
Larbin was initially developed for the XYLEME project in the VERSO team at
INRIA. The goal of Larbin was to go and fetch XML pages on the web to fill
the database of an XML-oriented search engine.
The following can be done with Larbin:
o A crawler for a search engine
o A crawler for a specialized search engine (xml, images, mp3...)
o Statistics on the web (about servers or page contents)
Larbin was created by: Sebastien Ailleret
Compress::LeadingBlankSpaces - Perl class to compress leading blank
spaces in (HTML, JavaScript, etc.) web content.
This class provides the simplest form of web content compression.
Outgoing web content (HTML, JavaScript, etc.) usually contains a lot
of leading blank spaces, because the source is structured (indented)
during development. The client browser usually ignores leading blank
spaces, yet they can account for 10 to 20 percent of the length of a
regular web page. Removing them reduces this part of web traffic on
busy servers with no visible impact on the transferred web content,
especially for old browsers incapable of understanding modern content
compression.
The main functionality of this class is concentrated within the
"squeeze_string" member function, which is intended to be used inside
the data transfer loop on the server side. The rest of the class
handles possible exceptions, such as pre-formatted data within HTML.
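
The idea can be illustrated with a minimal sketch in Python. This is not
the Perl module's API: the function name squeeze_leading_blanks and the
handling of <pre> blocks are assumptions made for illustration only, and
the sketch assumes at most one <pre> or </pre> tag per line.

```python
import re

# Hypothetical helper, not part of Compress::LeadingBlankSpaces.
# Assumes at most one <pre> or </pre> tag per line.
_PRE_OPEN = re.compile(r"<pre\b", re.IGNORECASE)
_PRE_CLOSE = re.compile(r"</pre\s*>", re.IGNORECASE)


def squeeze_leading_blanks(text):
    """Strip leading spaces and tabs from each line, except inside <pre>."""
    out = []
    in_pre = False
    for line in text.splitlines(keepends=True):
        if in_pre:
            # Pre-formatted data must be passed through unchanged.
            out.append(line)
            if _PRE_CLOSE.search(line):
                in_pre = False
            continue
        out.append(line.lstrip(" \t"))
        if _PRE_OPEN.search(line) and not _PRE_CLOSE.search(line):
            in_pre = True
    return "".join(out)


page = (
    "<html>\n"
    "  <body>\n"
    "    <p>Hi</p>\n"
    "    <pre>\n"
    "      keep\n"
    "    </pre>\n"
    "  </body>\n"
    "</html>\n"
)
squeezed = squeeze_leading_blanks(page)
```

In this sketch the indentation of the line inside <pre> is preserved,
while all other lines lose their leading blanks.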
Tornado is an open source version of the scalable, non-blocking web server and
tools that power FriendFeed. The FriendFeed application is written using a web
framework that looks a bit like web.py or Google's webapp, but with additional
tools and optimizations to take advantage of the underlying non-blocking
infrastructure.
The framework is distinct from most mainstream web server frameworks (and
certainly most Python frameworks) because it is non-blocking and reasonably
fast. Because it is non-blocking and uses epoll or kqueue, it can handle
thousands of simultaneous standing connections, which means it is ideal for
real-time web services. We built the web server specifically to handle
FriendFeed's real-time features - every active user of FriendFeed maintains an
open connection to the FriendFeed servers.
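
The readiness-based, non-blocking style described above can be sketched
with the Python standard library alone. This is not Tornado's API: the
function serve_echo is a hypothetical illustration, and socket pairs stand
in for real client connections. selectors.DefaultSelector picks epoll or
kqueue where the platform provides them.

```python
import selectors
import socket


def serve_echo(messages):
    """Echo each message back over its own connection, using one thread."""
    sel = selectors.DefaultSelector()  # epoll on Linux, kqueue on BSD/macOS
    clients = []
    for msg in messages:
        # A socket pair stands in for an accepted client connection.
        server_side, client_side = socket.socketpair()
        server_side.setblocking(False)
        sel.register(server_side, selectors.EVENT_READ)
        client_side.sendall(msg)
        clients.append(client_side)

    pending = len(clients)
    while pending:
        # Only connections that are ready to be read are returned here,
        # so a single loop can service many open connections.
        for key, _events in sel.select(timeout=1.0):
            conn = key.fileobj
            data = conn.recv(4096)
            conn.sendall(data)
            sel.unregister(conn)
            conn.close()
            pending -= 1

    replies = [c.recv(4096) for c in clients]
    for c in clients:
        c.close()
    sel.close()
    return replies


replies = serve_echo([b"a", b"bb", b"ccc"])
```

The loop never blocks on any single connection; it only touches sockets
the selector reports as ready, which is the property that lets one process
keep many standing connections open.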
sitecopy is for copying locally stored websites to remote web servers.
The program will upload files to the server which have changed locally,
and delete files from the server which have been removed locally, to keep
the remote site synchronized with the local site, with a single command.
The aim is to remove the hassle of uploading and deleting individual files
using an FTP client. sitecopy will also optionally try to spot files you move
locally, and move them remotely.
sitecopy is designed to not care about what is actually on the remote
server - it simply keeps a record of what it THINKS is on the remote
server, and works from that. WebDAV can be used to maintain remote sites
as well as FTP.
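
The record-based approach can be sketched as follows. This is an
illustration only, not sitecopy's actual code or file format: the function
plan_sync and the use of a path-to-fingerprint mapping are assumptions,
and move detection is left out.

```python
def plan_sync(recorded, local):
    """Compare the stored record of the remote site with the local site.

    Both arguments map a relative path to a content fingerprint
    (hypothetically a checksum or modification time).
    Returns (uploads, deletes) as sorted lists of paths.
    """
    # Upload anything new or changed relative to the record.
    uploads = sorted(p for p, sig in local.items() if recorded.get(p) != sig)
    # Delete anything the record lists that no longer exists locally.
    deletes = sorted(p for p in recorded if p not in local)
    return uploads, deletes


recorded = {"index.html": "a1", "old.html": "b2", "css/site.css": "c3"}
local = {"index.html": "a9", "css/site.css": "c3", "new.html": "d4"}
uploads, deletes = plan_sync(recorded, local)
```

Note that the remote server is never queried in this sketch; after a
successful run the record would simply be replaced with a copy of the
local state.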
FREEZE / MELT COMPRESSION PROGRAM
This version is tested under SunOS 4.1.2, Xenix 2.3.2, MS-DOS.
The format of frozen (2.X) files is incompatible with that of frozen (1.0)
files, but if this package is compiled with the -DCOMPAT switch, you will be
able to unpack frozen (1.0) files, if you have them.
This is a reference C implementation of the LZFSE compressor introduced in the
Compression library with OS X 10.11 and iOS 9.
LZFSE is a Lempel-Ziv style data compression algorithm using Finite State
Entropy coding. It targets compression ratios similar to those of deflate
(as implemented in zlib) at higher compression and decompression speeds.
Microsoft "compress.exe/expand.exe" compatible (de)compressor
This package contains two programs:
msexpand, which decompresses files compressed by the Microsoft compress.exe
utility (e.g. Win 3.x installation files)
mscompress, which compresses files using the LZ77 compression algorithm. Output
files can be decompressed using Microsoft expand.exe or msexpand(1).
The asmixer utility is a mixer control for the X Window System. It
is designed to work with the AfterStep Window Manager, but it will
work with Window Maker or almost any other window manager. It has
controls for three configurable channels, which are master, pcm, and cd
volume by default.
The Network Audio System is designed for playing,
recording, and manipulating audio data over a network.
Like the X Window System, it uses the client/server model to
separate applications from the specific drivers that control
audio input and output devices.
OptimFROG is a lossless audio compression program. Its main goal is to
reduce the size of audio files as much as possible, while permitting
bit-identical restoration for all input. It is similar to ZIP compression,
but it is highly specialized to compress audio data.
OptimFROG obtains asymptotically the best lossless audio compression
ratios. It has Windows, Linux, and Mac versions, fully featured input
plug-ins for the Windows Media Player, foobar2000, Winamp2/3/5, dBpowerAMP,
XMPlay, QCD, and XMMS audio players (with bitstream error resilience,
ID3v1.1 and APEv2 read tagging support, ID3v2 compatible), optimal support
for all integer PCM wave formats up to 32 bits and an extensible streamable
(error tolerant) compressed format. It is also fast: the default mode
encodes CD quality audio data at 12.4x real-time and decodes at 17.4x
real-time on an AMD Athlon XP 1800+ (the fastest mode encodes at 28.1x
real-time and decodes at 24.7x real-time). Self-extracting (sfx) archives can also be
created with a small overhead of just 54 KB.