I expected that the 117 movies not rated by someone or something that seems to rate every movie would have few raters and an earliest rating date close to the cutoff date for the data. That would be consistent with a rating program of some sort that scores the entire database periodically. This did not prove to be the case. The list of movies customer 305344 failed to rate includes Dr. Shivago, Citizen Kane and A Charlie Brown Christmas.
Unlike most of the recent questions, this one cannot be looked up in the rater signature or the movie signature because this information has been summarized away. Instead I used a query on the original training data that has all the rating transactions. Later, I looked up the earliest rating date for each movie not rated by the alpha movie geek to test my hypothesis that they would be movies only recently made available for rating.
select t.movid from
(select r.movid as movid, sum(custid=305344) as geek
from netflix.train r
group by movid) t
where t.geek = 0
The most rated movies not rated by the alpha rater geek
|Million Dollar Baby||102,861||2004-11-16|
|The Hunt for Red October||83,249||1999-12-17|
|The Grapes of Wrath||16,392||2001-03-18|
|A Charlie Brown Christmas||7,546||2000-08-03|
|The Tailor of Panama||7,421||2001-03-28|
|The Best Years of Our Lives||7,031||2000-01-06|