CSCE 470 Lecture 25
Jump to navigation
Jump to search
« previous | Wednesday, October 23, 2013 | next »
Privacy
Root-Mean-Square Error (RMSE)
(See wikipedia:Root-mean-square deviation→)
Used for evaluating the effectiveness of a model/estimator (or in our case, a recommender based on ratings)
Given matrix of data:
The starred items are not known to the algorithm. Suppose the algorithm gives the following scores:
The RMSE is
The goal is to minimize this value. If the value is 1, then our recommender was off by one for every single value.
Introspection
Does rating really equal enjoyment?
Well,
Recommender Systems
Collaborative Filtering
- Take everybody, find average rating
- Find KNN (see below) and take their rating
To find similarity (as in KNN), we need:
- representation (usually a vector)
- ratings
- Watched-or-not (binary)
- distance from average
- a way to compare
- manhattan
- Euclidean
- Cosine
- Jaccard
Once we've found several similar users, how should we give score based on values?
- average
- max (optimistic)
- min (don't suck)
- weighted average
How could Clustering Help us?
Find groups of similar people that have similar tastes