python-creole is an markup converter in pure Python:
* creole markup -> html
* html -> creole markup
* reStructuredText -> html
* html -> reStructuredText markup
* html -> textile markup
csvkit is a suite of command-line tools for converting to and working
with CSV, the king of tabular file formats.
Read DBF files with Python.
DBF is a file format used by databases such as dBase, Visual FoxPro
and FoxBase+. This library reads DBF files and returns the data as
native Python data types for further processing. It is primarily
intended for batch jobs and one-off scripts.
This module supplies features similar as wcwidth(3) and wcswidth(3) in C
language.
Characters have its own width on terminal depending on locale. For example,
ASCII characters occupy one column per character, east Asian fullwidth
characters (like Hiragana or Han Ideograph) occupy two columns per
character, and combining characters (apperaring in ISO-8859-11 Thai,
Unicode, and so on) occupy zero columns per character. mbwidth() gives the
width of the first character of the given string and mbswidth() gives the
width of the whole given string.
The names of mbwidth and mbswidth came from "multibyte" versions of wcwidth
and wcswidth which are "wide character" versions.
mblen(string) returns number of bytes of the first character of the string.
Please note that a character may consist of multiple bytes in multibyte
encodings such as UTF-8, EUC-JP, EUC-KR, GB2312, or Big5.
mbwidth(string) returns the width of the first character of the string.
mbswidth(string) returns the width of the whole string.
Parameters are to be given in locale encodings, not always in UTF-8.
Utilities for the documentation of Python modules.
PDFMiner is a tool for extracting information from PDF documents. Unlike other
PDF-related tools, it focuses entirely on getting and analyzing text data.
PDFMiner allows to obtain the exact location of texts in a page, as well as
other information such as fonts or lines. It includes a PDF converter that can
transform PDF files into other text formats (such as HTML).
It has an extensible PDF parser that can be used for other purposes instead
of text analysis.
Paraphrasing the website:
Python-DSV is a Python module for importing and exporting DSV (delimiter
separated values) files. DSV is a generalization of CSV (comma separated
values). CSV is a common file format used by many programs to import and
export data.
Features:
- Pure Python
- Optional wxPython GUI
- Optional heuristics for determining file format
- Handles embedded quotes, delimiters and newlines
- Customizable error handling
- Simple to use
- Portable
Have indices in Elasticsearch? This is the tool for you!
Like a museum curator manages the exhibits and collections on display,
Elasticsearch Curator helps you curate, or manage your indices.
Elasticsearch DSL is a high-level library whose aim is to help with writing
and running queries against Elasticsearch. It is built on top of the official
low-level client (elasticsearch-py).
It provides a more convenient and idiomatic way to write and manipulate
queries. It stays close to the Elasticsearch JSON DSL, mirroring its terminology
and structure. It exposes the whole range of the DSL from Python either directly
using defined classes or a queryset-like expressions.
It also provides an optional wrapper for working with documents as Python
objects: defining mappings, retrieving and saving documents, wrapping the
document data in user-defined classes.
To use the other Elasticsearch APIs (eg. cluster health) just use the underlying
client.
Organization of data in table form is a time-honored and useful method
of data representation. While columns of data are trivially generated
by computer through formatted output, even simple tasks like keeping
titles aligned with the data columns are not trivial, and the one-shot
solutions one comes up with tend to be particularly hard to maintain.
Text::Table allows you to create and maintain tables that adapt to
alignment requirements as you use them.