Differences between revisions 15 and 20 (spanning 5 versions)
Revision 15 as of 2007-06-26 16:05:56
Size: 1058
Editor: 196-11-241-97
Comment: Account for new members
Revision 20 as of 2008-10-12 09:46:22
Size: 1841
Editor: 75-164-152-250
Comment: Fixed some no longer valid url's
Deletions are marked like this. Additions are marked like this.
Line 6: Line 6:
 * [http://code.google.com/p/html5lib html5lib] A HTML5-compliant library for parsing arbitarily-broken HTML to a range of tree formats including minidom, elementtree (including lxml) and BeautifulSoup
Line 8: Line 9:
 * [http://python.org/doc/current/lib/module-urllib.html urllib], [http://python.org/doc/current/lib/module-urllib2.html urllib2], and [http://python.org/doc/current/lib/module-httplib.html httplib] in the standard library.  * [http://docs.python.org/library/urllib.html urllib], [http://docs.python.org/library/urllib2.html urllib2], and [http://docs.python.org/library/httplib.html httplib] in the standard library.
Line 10: Line 11:
 * [http://www.pyth
----
CategoryUnicode
 * [http://www.python.org/pypi?:action=display&name=mechanoid&version=0.4.1 mechanoid] a mechanize fork.
 * [http://www.python.org/pypi/libxml2dom libxml2dom] can parse HTML by employing libxml2's liberal HTML parser.

== Resources ==

 * [http://aspn.activestate.com/ASPN/Cookbook/Python/Recipe/52199 Grab a document from the web] - from the Python Cookbook
 * [http://wwwsearch.sourceforge.net/bits/clientx.html Python web-client programming general FAQs].
 * [http://docs.python.org/library/urllib.html urllib -- Open arbitrary resources by URL]
 * [http://docs.python.org/library/urllib2.html urllib2 -- extensible library for opening URLs]

Client-Side Web Programming

Libraries

Resources

WebClientProgramming (last edited 2014-04-17 01:10:23 by DaleAthanasias)

Unable to edit the page? See the FrontPage for instructions.