This is a static archive of the Python wiki, which was retired in February 2026 due to lack of usage and the resources necessary to serve it — predominately to bots, crawlers, and LLM companies.
Pages are preserved as they were at the time of archival. For current information, please visit python.org.
If a change to this archive is absolutely needed, requests can be made via the infrastructure@python.org mailing list.

Processing And Analyzing Extremely Large Amounts Of Data In Python

Presentation Notes

My talk will describe PyTables, a Python package that enables the end user to manipulate easily scientific data tables and Numeric and numarray Python objects in a persistent, hierarchical structure. The foundation of the underlying hierarchical data in permament storage is the excellent HDF5 library.

I will be walking through the basic features of the PyTables, and demonstrating the use of the package in real-life scenarios. In addition, I will present some benchmark where PyTables will show to be competitive when compared with other persistent databases in Python.

This presentation is currently scheduled for 10am on friday March 28th.


I would like to target my presentation as best I can to those people attending.

So please add questions/suggestions below; for example:



2026-02-14 16:12