Hi @kenjackson, this is Xavier here. How do you propose to validate this on the Netflix dataset? It is clear that you cannot use RMSE to compare to other existing approaches, right? The way to go would be to propose a different success measure (i.e. ranking based) and measure how different algorithm perform. And then validate this on users to prove that optimizing RMSE is not as useful.
If you give me a few months, I might get there. But this is the reason I wrote a blog post and not a paper ;-)
If you give me a few months, I might get there. But this is the reason I wrote a blog post and not a paper ;-)