Differences between revisions 2 and 26 (spanning 24 versions)
Revision 2 as of 2004-07-29 14:22:37
Size: 817
Editor: LionKimbro
Comment: trying to get a feel for what you are looking for
Revision 26 as of 2014-04-17 01:10:23
Size: 1967
Comment: not sure if this is true for python 3.x?
Line 1: Line 1:
There is probably a huge amount of good stuff available from the people who are working actively with XmlRpc, Biztalk and other approaches to web services. More too from XML writers such as [http://uche.ogbuji.net/uche.ogbuji.net/ Uche Ogbuji], who has put much good stuff on IBM's developerWorks site, among other things.
= Client-Side Web Programming =
Line 3: Line 3:
Sadly nobody has categorised or classified it in the Wiki, so at the moment we have to scratch around.
== Libraries ==
Line 5: Line 5:
----
 * [[http://utidylib.berlios.de/|µTidylib]] and [[http://www.egenix.com/files/python/mxTidy.html|mxTidy]] -- Python interfaces to the [[http://tidy.sourceforge.net/|HTML Tidy]] library for cleaning up HTML documents.
 * [[http://code.google.com/p/html5lib|html5lib]] -- an HTML5-compliant library for parsing arbitrarily broken HTML into a range of tree formats, including minidom, ElementTree (including lxml), and BeautifulSoup.
 * [[http://www.crummy.com/software/BeautifulSoup/|BeautifulSoup]] -- a permissive HTML parser.
 * Don't use [[https://docs.python.org/2/library/htmlparser.html|HTMLParser (Python 2.x)]] or [[https://docs.python.org/3.5/library/html.parser.html|html.parser (Python 3.x)]] on HTML that might be invalid! That way lies pain. Either clean it up (using tidy), or use a different parser.
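 * [[http://docs.python.org/library/urllib.html|urllib]], [[http://docs.python.org/library/urllib2.html|urllib2]], and [[http://docs.python.org/library/httplib.html|httplib]] in the Python 2 standard library. In Python 3 these were reorganised into urllib.request, urllib.parse, urllib.error, and http.client.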
 * [[http://wwwsearch.sourceforge.net/old/ClientCookie/|ClientCookie]], [[http://wwwsearch.sourceforge.net/ClientForm/|ClientForm]], and [[http://wwwsearch.sourceforge.net/mechanize/|Mechanize]] are higher-level libraries for writing a web client.
 * [[http://pypi.python.org/pypi?:action=display&name=mechanoid|mechanoid]] -- a fork of mechanize.
 * [[http://www.python.org/pypi/libxml2dom|libxml2dom]] can parse HTML by employing libxml2's liberal HTML parser.
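
As a rough illustration of the standard-library entry in the list above, here is a minimal sketch of fetching a document. It assumes Python 3, where urllib2's functionality lives in urllib.request; the helper name fetch_text, its parameters, and the example URL are made up for this page and are not part of any library.

```python
from urllib.request import urlopen


def fetch_text(url, timeout=10.0, default_charset="utf-8"):
    """Fetch `url` and return its body as text.

    Hypothetical helper for illustration only. The charset is taken from
    the Content-Type header when the server sends one; otherwise
    `default_charset` is assumed.
    """
    with urlopen(url, timeout=timeout) as response:
        raw = response.read()
        charset = response.headers.get_content_charset() or default_charset
    # Undecodable bytes are replaced rather than raising an exception.
    return raw.decode(charset, errors="replace")


if __name__ == "__main__":
    # Placeholder URL; any reachable http(s) URL would do.
    print(fetch_text("http://www.python.org/")[:200])
```

This only retrieves the text. For anything beyond trivial pages, the result would normally be handed to one of the tolerant parsers listed above rather than processed by hand.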
Line 7: Line 14:
What do you mean by web client programming? Are you talking about making a Wiki:WebBrowser? Or writing Wiki:WebServices? WebProgramming? DistributedProgramming?
== Resources ==
Line 9: Line 16:
I highly recommend the FindPage.

We have pages on XmlRpc, BaseHttpServer, DocXmlRpcServer, ...

...just trying to get a feel for what you are looking for.

-- LionKimbro [[DateTime(2004-07-29T14:22:37Z)]]
 * [[http://aspn.activestate.com/ASPN/Cookbook/Python/Recipe/52199|Grab a document from the web]] - from the Python Cookbook
 * [[http://wwwsearch.sourceforge.net/old/bits/GeneralFAQ.html|Python web-client programming general FAQs]].
 * [[http://docs.python.org/library/urllib.html|urllib -- Open arbitrary resources by URL]]
 * [[http://docs.python.org/library/urllib2.html|urllib2 -- extensible library for opening URLs]]
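
The urllib2 documentation linked above describes the library as extensible: requests go through an "opener" assembled from handler objects. A minimal sketch of that idea follows, assuming Python 3 (where urllib2 became urllib.request and cookielib became http.cookiejar). The function name make_client and the User-Agent string are invented for this example.

```python
from http.cookiejar import CookieJar
from urllib.request import HTTPCookieProcessor, build_opener


def make_client(user_agent="example-client/0.1"):
    """Build an opener that keeps cookies between requests.

    Hypothetical helper for illustration only. Returns the opener and
    the cookie jar it fills in, so the caller can inspect the cookies.
    """
    jar = CookieJar()
    # HTTPCookieProcessor stores cookies from responses in `jar` and
    # sends them back on later requests made through this opener.
    opener = build_opener(HTTPCookieProcessor(jar))
    # Replace the default "Python-urllib/x.y" User-Agent header.
    opener.addheaders = [("User-Agent", user_agent)]
    return opener, jar


if __name__ == "__main__":
    opener, jar = make_client()
    # Placeholder URL; cookies set by the server end up in `jar`.
    with opener.open("http://www.python.org/", timeout=10) as response:
        print(response.status, len(jar), "cookie(s) stored")
```

Further handlers (proxies, authentication, custom redirects) can be passed to build_opener in the same way.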


WebClientProgramming (last edited 2014-04-17 01:10:23 by DaleAthanasias)
