Content-Based Filtering

  • Content-Based

Best for: Recommending similar items Aliases: Content-Based

How it works

$$\hat{r}_{ui}=\cos(h_u,x_i),\quad h_u=\frac{1}{|I_u|}\sum_{i\in I_u}x_i$$

Builds a user profile $h_u$ by aggregating the feature vectors $x_i$ of items the user has liked, then scores new items by similarity to that profile, e.g. $\hat{r}_{ui}=\cos(h_u,x_i)$ or a learned scorer $f_\theta(x_i)^\top h_u$. Because it relies on item features (text, tags, attributes) rather than other users’ ratings, it handles item cold-start well. The trade-off is over-specialisation: it rarely surfaces items outside the feature pattern the user already engaged with.

When to use

Recommendation from item features when interaction data is thin; for cold-start of long-tail items.

Watch out

Over-specializes to items similar to what the user already likes; quality is bounded by feature engineering.

Common fields

Retail · jobs · real estate · articles