Beginning to intermediate programmers. A basic working knowledge of Python is assumed.
This tutorial will introduce beginning to intermediate programmers to the many useful Python tools & techniques for text and data processing. Topics will include regular expressions, filtering data with generators, and parsing.
- Common data sources needing processing:
- log files
- tabular data
- Tools & techniques:
- lists & dictionaries
- s.join(list) instead of accumulating
- for line in file
- filters, large data sources: generators
- Regular expressions:
- pattern matching
- regular expressions
- "real" parsers (including XML)
- state machines