Differences between revisions 1 and 6 (spanning 5 versions)
Revision 1 as of 2008-03-15 15:05:14
Size: 1502
Editor: 63-250-241-10
Comment:
Revision 6 as of 2008-03-16 01:37:10
Size: 2453
Editor: 70-5-49-37
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
Discuss approaches to the Netflix prize, getting started with PyFlix for new people, algorithm + code performance, etc Discuss approaches to the Netflix prize using Python, getting started with PyFlix for new people, algorithm + code performance, etc
Line 3: Line 3:
Some links to get started: Some Netflix code in Python will be shown/run (KNN, NMF, ARTmap, SVD, etc).
I will be posting the code later this month on my blog: [http://www.datawrangling.com Data Wrangling]


Some links for those just getting started:
 *[http://www.netflixprize.com/teams Register a Team] in order to [http://www.netflixprize.com/download download the Netflix data]
Line 5: Line 10:
 *[http://www.grouplens.org/node/73 Movielens dataset] - smaller dataset to debug your code with...

Some approaches:
Line 21: Line 29:

Performance pointers:

 *http://www.scipy.org/PerformancePython
 *http://wiki.python.org/moin/PythonSpeed/PerformanceTips
 *http://www.scipy.org/Weave

Parallel Programming is useful for lots of ML algorithms. [http://www.dehora.net/journal/2005/02/two_classic_hardbacks.html How to Write Parallel Programs] is a good book. [http://www.amazon.com/How-Write-Parallel-Programs-Course/dp/026203171X/ Amazon] Consider jython, since ML is often CPU-bound, and jython has no GIL.

Discuss approaches to the Netflix prize using Python, getting started with PyFlix for new people, algorithm + code performance, etc

Some Netflix code in Python will be shown/run (KNN, NMF, ARTmap, SVD, etc). I will be posting the code later this month on my blog: [http://www.datawrangling.com Data Wrangling]

Some links for those just getting started:

Some approaches:

More here:

Performance pointers:

Parallel Programming is useful for lots of ML algorithms. [http://www.dehora.net/journal/2005/02/two_classic_hardbacks.html How to Write Parallel Programs] is a good book. [http://www.amazon.com/How-Write-Parallel-Programs-Course/dp/026203171X/ Amazon] Consider jython, since ML is often CPU-bound, and jython has no GIL.

NetflixPrizeBOF (last edited 2008-11-15 13:59:37 by localhost)

Unable to edit the page? See the FrontPage for instructions.