Python and XML
A variety of XML processing solutions are available for Python. This page attempts, at the very least, to list the most actively developed or most easily available.
Packages in the standard library
The standard library has a number of tools available, which fall into mainly three categories:
- a pythonesque, simple-to-use and very fast XML tree library:
ElementTree - the xml.etree package (new in Python 2.5 but available for older versions)
- an event-driven XML parser compatible with the W3C SAX standard:
Sax - the xml.sax package
- XML tree libraries that adhere to the W3C DOM standard:
The DOM and SAX packages have the advantage of being compatible to W3C standard APIs, so users who are already familiar with these APIs can use them without learning too many new things. Everyone else should start with the more pythonic ElementTree library, which is very well integrated into the Python language, and therefore very easy to learn and use.
A long list of special purpose and general purpose Python XML packages is available from PyPI. The following is a choice of major tools that support a broader set of XML features.
Amara 2.x - Amara provides tools you can trust to conform with XML standards without losing the familiar Python feel
lxml.objectify - a Python object API for XML based on lxml
W3C DOM-like libraries
4Suite - a framework for XML (and RDF) processing
libxml2dom - PyXML-style API for the libxml2 Python bindings
PyXML - external add-on to Python's original XML support - WARNING: this is no longer maintained! Do not use in new code!
qtxmldom - PyXML-style API for the qtxml Python bindings
py-dom-xpath - pure Python XPath implementation for use with DOM libraries
lxml - lxml has standards compliant XPath 1.0 support based on libxml2. It also supports the EXSLT extensions (including Python regular expressions) and allows calling Python functions from within XPath expressions.
If not mentioned otherwise, this means XSLT 1.0, not XSLT 2.0.
Some tools linked from the XQuery homepage provide Python bindings for their XSLT2 and XPath2 implementations
jabber.py - a Python module for the jabber instant messaging protocol
xmpppy - a Python library that is targeted to provide easy scripting with Jabber
Object Serialization in XML
pyxser - a Python extension to serialize/deserialize Python objects into XML
Books and Articles
The above lists should be arranged in ascending alphabetical order - please respect this when adding new entries. When specifying release dates please use the format YYYY-MM-DD.