KsCD is a small, fast, CDDB enabled audio CD player which supports
multiple platforms.
Pocket Sphinx is a frontend to the sphinxbase large-vocabulary,
speaker-independent, continuous speech recognition engine.
This is an early release of a research system. The APIs and function
names are likely to change, and several tools still need to be made
available to make this all complete.
Once the system is built, try running the Perl script sphinx2-demo.
The pocketsphinx-test script should run pocketsphinx-batch over an example
utterance of "go forward ten meters."
PyKaraoke is a free karaoke player for Linux and Windows.
You can use this program to play your collection of CDG,
MIDI and MPEG karaoke songs. No songs are provided,
you must obtain these from elsewhere.
Librem is a portable and generic library for real-time audio and video
processing.
Features
- Audio buffering, mixing, codecs and resampling
- Video mixing, rescaling and pixel format conversion
Rhythmbox is an integrated music management application, originally inspired by
Apple's iTunes. It is free software, designed to work well under the GNOME
Desktop, and based on the powerful GStreamer media framework.
RipIT is used to create MPEG-1 Layer 3 (mp3) using Lame, or uses Flac (flac),
Ogg Vorbis (ogg) or Faac (m4a) to convert audio files (wav) extracted from an
audio CD. It is a console based front-end (no GUI here), written in Perl, for
various programs.
The program will do the following without user intervention:
* getting the audio CD Album/Artist/Tracks information from CDDB
* ripping the audio CD Tracks
* encoding to Flac, mp3 or Ogg
* id3 tags encoded songs
* creating an playlist (m3u) file
* optionally generating a toc (cue) sheet for nice DAO burning
* optionally preparing and send a CDDB submission and save it locally
* optionally extracting hidden songs and split ghost songs
* optionally creating md5sum files for all tracks
* running several encoder processes at the same time and same run
Shorten reduces the size of waveform files (such as audio) using
Huffman coding of prediction residuals and optional additional
quantisation. In lossless mode the amount of compression obtained
depends on the nature of the waveform. Those composing of low
frequencies and low amplitudes give the best compression, which may
be 2:1 or better. Lossy compression operates by specifying a minimum
acceptable segmental signal to noise ratio or a maximum bit rate.
Lossy compression operates by zeroing the lower order bits of the
waveform, so retaining waveform shape.
Permission is granted to use this software for decoding and
non-commercial encoding (e.g. private or research use).
If you intend to use shorten, be sure to check the full license,
which can be displayed by "shorten -l".
This is a "Standard MIDI File" module, which handles encoding and
decoding SMF. This includes a simple MML as an example of
application.
SDL_sound is a library that handles the decoding of several popular
sound file formats, such as raw, wav, mp3, flac, ogg, voc, shn,
aiff, au, and some others.
It is meant to make the programmer's sound playback tasks simpler.
The programmer gives SDL_sound a filename, or feeds it data directly
from one of many sources, and then reads the decoded waveform data
back at her leisure.
If resource constraints are a concern, SDL_sound can process sound
data in programmer-specified blocks. Alternately, SDL_sound can
decode a whole sound file and hand back a single pointer to the
whole waveform.
SDL_sound can also handle sample rate, audio format, and channel
conversion on-the-fly and behind-the-scenes, if the programmer
desires.
shntool is a multi-purpose WAVE data processing and reporting utility.
File formats are abstracted from its core, so it can process any file
that contains WAVE data, compressed or not -- provided there exists a
format module to handle that particular file type.
Apart from .wav shntool supports many formats through helper programs.
Please see the manual page for more information.