Forum for discussion about the Netflix Prize and dataset.
You are not logged in.
I want to do a class project for my data mining class. I know the first competition is over but I would like to study the first data sets. I've found some files from other sources but I do not know if they are complete or legitimate. After doing some research, I've guessed that I need probe.txt, training_set and qualifying.txt. Also, is Pyflix the best framework to work with, because I am more comfortable using c++ or Java.
Offline
I think some power made it go bye bye - possibly for legal reasons:
http://archive.ics.uci.edu/ml/datasets/Netflix+Prize
You should check with the archivers there.
Offline
I asked at the repository but others may want to as well:
http://archive.ics.uci.edu/ml/contact.html
Offline
Interesting and somewhat sad feedback from the UCI machine learning repository as to where the netflix prize dataset went to:
"The donor has requested that the data be no longer made available (for
reasons known to them). As the UCI ML Repository is entirely
donor-driven, we strive to adhere to the wishes of the data donors."
I guess the legal case[s] are causing an abundance of caution if not a rethink of originally releasing the movie data.
Offline