Differences between revisions 8 and 9
Revision 8 as of 2008-03-16 04:17:50
Size: 2491
Editor: 68-247-130-212
Comment:
Revision 9 as of 2008-03-18 22:30:22
Size: 2876
Editor: ip-64-32-254-138
Comment:
Deletions are marked like this. Additions are marked like this.
Line 36: Line 36:
 *If you need to go parallel for Netlfix, [http://www.datawrangling.com/pycon-2008-elasticwulf-slides.html ElasticWulf] public Amazon EC2 images come with mpi4py, IPython1, pyflix, numpy, scipy, weave, pyrex, etc. already installed and configured. The [http://code.google.com/p/elasticwulf/ python code] for launching your own beowulf on EC2 using the images is on google code.
Line 38: Line 39:

 

Discuss approaches to the Netflix prize using Python, getting started with [http://pyflix.python-hosting.com/ PyFlix] for new people, algorithm + code performance, etc

Some Netflix code in Python will be shown/run (KNN, NMF, ARTmap, SVD, etc).

I will be posting the code later this month on my blog: [http://www.datawrangling.com Data Wrangling]

Some links for those just getting started:

Some approaches:

More here:

Performance pointers:

Parallel Programming is useful for lots of ML algorithms. [http://www.dehora.net/journal/2005/02/two_classic_hardbacks.html How to Write Parallel Programs] is a good book. [http://www.amazon.com/How-Write-Parallel-Programs-Course/dp/026203171X/ Amazon] Consider jython, since ML is often CPU-bound, and jython has no GIL.

NetflixPrizeBOF (last edited 2008-11-15 13:59:37 by localhost)

Unable to edit the page? See the FrontPage for instructions.