Netflix Prize: Forum

Forum for discussion about the Netflix Prize and dataset.

You are not logged in.

Announcement

Congratulations to team "BellKor's Pragmatic Chaos" for being awarded the $1M Grand Prize on September 21, 2009. Stay tuned for details of the next contest, Netflix Prize 2.

#1 2006-10-11 10:45:25

BringingHomeTheKevinBacon
Member
Registered: 2006-10-11
Posts: 1

NULL values in movie_titles.txt

Hi,

Has anyone else noticed that movie_titles.txt contains some NULL values in place of years.

This is all of them:

4388,NULL,Ancient Civilizations: Rome and Pompeii
4794,NULL,Ancient Civilizations: Land of the Pharaohs
7241,NULL,Ancient Civilizations: Athens and Greece
10782,NULL,Roti Kapada Aur Makaan
15918,NULL,Hote Hote Pyaar Ho Gaya
16678,NULL,Jimmy Hollywood
17667,NULL,Eros Dance Dhamaka

What are others doing about them, leaving them as NULL, inserting the correct years, inserting a generic year (EG. 2006)?

Note that the Netflix web site also does not list the years of these titles.

Regards.

Offline

 

#2 2006-10-11 10:49:09

mdawg
Member
From: Kansas City, KS
Registered: 2006-10-03
Posts: 81

Re: NULL values in movie_titles.txt

Find the release dates on Amazon or approximate.  Or don't use the release years or movie titles at all.

Offline

 

#3 2006-10-11 10:53:05

Barry
Member
Registered: 2006-10-03
Posts: 167
Website

Re: NULL values in movie_titles.txt

This is one of the things about the contest that make it seem like NetFlix is looking for skilled, careful programmers to possibly hire as much as a better algorithm.

Offline

 

#4 2006-10-11 11:38:28

benjismith
Member
From: Salt Lake City, UT
Registered: 2006-10-02
Posts: 47
Website

Re: NULL values in movie_titles.txt

Actually, you're mistaken.

Those movies were actually released to DVD in the year 0.

Jesus was a huge fan of "Jimmy Hollywood".

Offline

 

#5 2006-10-11 12:00:33

TheMoose
Member
Registered: 2006-10-03
Posts: 24

Re: NULL values in movie_titles.txt

I put them as 1800 just for the integrity of all my data in my database.  But anything with a year of 1800 is just disregarded (the year itself, not the actual movie).

Offline

 

#6 2006-10-11 16:38:24

okstumbler
Member
Registered: 2006-10-11
Posts: 2

Re: NULL values in movie_titles.txt

benjismith wrote:

Actually, you're mistaken.

Those movies were actually released to DVD in the year 0.

Jesus was a huge fan of "Jimmy Hollywood".

NULL and 0 are different things.

Offline

 

#7 2006-10-11 17:07:33

willakawill
Member
From: Chicago
Registered: 2006-10-04
Posts: 117
Website

Re: NULL values in movie_titles.txt

They don't have a NULL value BTW.
They have the text string "NULL"

Offline

 

#8 2006-10-11 19:33:29

benjismith
Member
From: Salt Lake City, UT
Registered: 2006-10-02
Posts: 47
Website

Re: NULL values in movie_titles.txt

okstumbler wrote:

benjismith wrote:

Actually, you're mistaken.

Those movies were actually released to DVD in the year 0.

Jesus was a huge fan of "Jimmy Hollywood".

NULL and 0 are different things.

Perhaps. Depends on your environment.

Certainly in SQL, null means "beats the s*** outta me". Or, perhaps, "I've put invalid data into the record, and I just don't care". Or, in many cases  it means "I really should have created a new table, and used third normal form to link optional records from table B to the records in table A. But I didn't feel like it."

But, in many areas of computer science, null and zero are semantically identical. For example, trying to dereference a "null pointer" means that your pointer is set to zero (which is a privileged memory location owned by the OS) and you tried to access it from non-kernel-level code.

Offline

 

#9 2006-10-11 19:57:04

okstumbler
Member
Registered: 2006-10-11
Posts: 2

Re: NULL values in movie_titles.txt

NULL and the number 0 have nothing in common conceptually. If someone decides to represent NULL with the number 0, because 0 has no other useful meaning in the context (as in 'NULL pointer'), does not imply that 0 and NULL are in any way related conceptually.

And I think we can all agree that in this context (Netflix data file) NULL means "beats the s*** outta me".

To be honest I didn't quite feel like arguing over NULL. I simply wanted to point out that they are not equivalent in every case, and thus are not equivalent at all.

From your last post it sounds like you have a good understanding of NULL, and I wasn't trying to correct you. But you should know that equating NULL and 0 is a common mistake made by inexperienced computer scientists and programmers. My intention was to break this habbit in a friendly way.

Offline

 

#10 2006-10-11 21:59:37

benjismith
Member
From: Salt Lake City, UT
Registered: 2006-10-02
Posts: 47
Website

Re: NULL values in movie_titles.txt

okstumbler wrote:

I wasn't trying to correct you. But you should know that equating NULL and 0 is a common mistake made by inexperienced computer scientists and programmers. My intention was to break this habbit in a friendly way.

Gotcha. I appreciate it.

I think this is a nice environment for new coders to get their feet wet with statistical machine learning algorithms, especially since there are lots of experienced developers hanging out here, willing to lend a helping hand.

My only intention with the original null/zero comment was to make a hilarious joke about movies being released in the year Zero, Anno Domine. smile

Offline

 

#11 2006-10-12 01:22:22

Cranzy
Member
From: Pretoria
Registered: 2006-10-10
Posts: 7
Website

Re: NULL values in movie_titles.txt

Yeah.. I inserted Nulls into my table and my algo actually uses dates and when it encounters the nulls it just ignores the dates for those rows and i can't reveal more smile . . . . hehehehe.

Offline

 

#12 2006-10-28 19:08:56

Erythrina
Member
Registered: 2006-10-26
Posts: 10

Re: NULL values in movie_titles.txt

In case anybody cares, here are the DVD release dates I found for the NULL date movies, from various sources (Amazon and some Indian movie site):

4388, 2002
4794, 2002
7241, 2002
10782, 2005
15918, 2005
16678, 2004
17667, 1999

Offline

 

#13 2006-11-03 08:52:18

marbol
Member
Registered: 2006-10-03
Posts: 29

Re: NULL values in movie_titles.txt

benjismith wrote:

Certainly in SQL, null means "beats the s*** outta me". Or, perhaps, "I've put invalid data into the record, and I just don't care". Or, in many cases  it means "I really should have created a new table, and used third normal form to link optional records from table B to the records in table A. But I didn't feel like it."

Nope! It means "I've fallen and I can't get up!"

Mwuhahahhahaha

Offline

 

#14 2006-11-03 08:54:17

marbol
Member
Registered: 2006-10-03
Posts: 29

Re: NULL values in movie_titles.txt

willakawill wrote:

They don't have a NULL value BTW.
They have the text string "NULL"

But in the Netflix database, I'll bet they have a NULL value. It's just that conversion into text files that causes the translation to what you see here.

Offline

 

Board footer

Powered by PunBB
© Copyright 2002–2005 Rickard Andersson