The IRST Language Modeling Toolkit features algorithms and data structures
suitable to estimate, store, and access very large LMs.
ISO 12083 is the successor to the Association of American
Publishers (AAP) Electronic Publishing Special Interest Group
(EPSIG) standard. Four DTDs are included in this package--one
for articles, one for books, one for serials, and one for
mathematics which may be used in any of the other three.
July 10, 1997
jfieber@FreeBSD.org
Nineteen ISO 8879:1986 character entity sets used by many DTDs:
Added Latin 1
Added Latin 2
Added Math Symbols: Arrow Relations
Added Math Symbols: Binary Operators
Added Math Symbols: Delimiters
Added Math Symbols: Negated Relations
Added Math Symbols: Ordinary
Added Math Symbols: Relations, Box and Line Drawing
Alternative Greek Symbols
Diacritical Marks
General Technical
Greek Letters
Greek Symbols
Monotoniko Greek
Non-Russian Cyrillic
Numeric and Special Graphic
Publishing
Russian Cyrillic
John Fieber
jfieber@FreeBSD.org
ITS Tool allows you to translate your XML documents with PO files, using rules
from the W3C Internationalization Tag Set (ITS) to determine what to translate
and how to separate it into PO file messages.
PO files are the standard translation format for GNU and other Unix-like
systems. They present translatable information as discrete messages, allowing
each message to be translated independently. In contrast to whole-page
translation, translating with a message-based format like PO means you can
easily track changes to the source document down to the paragraph. When new
strings are added or existing strings are modified, you only need to update
the corresponding messages.
ITS Tool is designed to make XML documents translatable through PO files by
applying standard ITS rules, as well as extension rules specific to ITS Tool.
ITS also provides an industry standard way for authors to override translation
information in their documents, such as whether a particular element should be
translated.
An object-oriented SGML/XML parser toolkit and DSSSL engine.
Features summary:
* Includes nsgmls
* Provides access to all information about SGML document
* Supports almost all optional SGML features
* Sophisticated entity manager
* Supports multi-byte character sets
* Object-oriented
* Written in C++ from scratch
* Fast
* Portable
* Production quality
* Free
Note: This port is a superset of the sp port. If you have sp
installed, it is recommended that you remove it before installing
jade.
John Fieber
jfieber@FreeBSD.org
The Digester package lets you configure an XML -> Java object mapping module,
which triggers certain actions called rules whenever a particular pattern of
nested XML elements is recognized. A rich set of predefined rules is available
for your use, or you can also create your own. Advanced features of Digester
include:
- Ability to plug in your own pattern matching engine, if the standard one is
not sufficient for your requirements.
- Optional namespace-aware processing, so that you can define rules that are
relevant only to a particular XML namespace.
- Encapsulation of Rules into RuleSets that can be easily and conveniently
reused in more than one application that requires the same type of
processing
JaLingo is a free OS independent dictionary application.
It understands 4 dictionary formats:
* DSL
* Mova
* PtkDict/phpMyLingvo MySQL dumps
* Sdictionary (249)
Jarnal is an open-source application for notetaking, sketching, keeping a
journal, making a presentation, annotating a document - including pdf - or
collaborating using a stylus, mouse or keyboard. It is similar to Microsoft
Windows Journal and to the earlier Mimeo whiteboarding and Palm notepad
applications.
Aspell English dictionaries.
Java2html is a syntax highlighter for Java and C++ source code that
produces a highlighted html file as output.
Java2html offers the following features:
- support for Java and C++
- fast (single pass conversion using flex)
- doesn't change formatting - only adds <FONT COLOR=#XXXX> tags
and properly escapes non-ascii characters
- easy integration with webservers - browse your sources colourized
- gzips http output for browsers to save bandwidth (only in CGI mode)
- documentation and manpage included