Summary of "[ИАД, весна 2025] Рекомендательные системы, 1"

Overview

This document summarizes Lesson 1 of the course “Recommender Systems”. The lecture covers historical context and motivation, formal problem statements, common recommendation approaches, data/feedback types, evaluation metrics, validation and splitting practices, common pitfalls in evaluation, and course logistics. The primary course focus is Top-K recommendation (typical K ≈ 10).


Historical context and motivation


Types of recommender approaches


Data and feedback types


Formal problem statement

Given finite sets U (users) and I (items) and a sparse user×item interaction matrix R:


Evaluation metrics


Practical validation and data splitting


Types of experiments


Methodology — actionable instructions

  1. Problem setup
    • Define user set U and item set I and build the interaction matrix R.
    • Decide the task: rating prediction or Top-K recommendation. Choose K to match the UI/business (commonly 10).
  2. Prepare data
    • Determine whether feedback is explicit or implicit.
    • For implicit signals, set binarization thresholds (e.g., listened_duration ≥ 50% → positive).
    • Consider domain characteristics and whether to filter previously consumed items.
  3. Choose approach
    • If rich item metadata and sparse collaborative signals → content-based or hybrid.
    • If rich cross-user interaction data → collaborative filtering (user/item-based, matrix factorization, etc.).
    • For production/advanced modeling → consider hybrid and modern deep-learning methods (Transformers mentioned for later).
  4. Split data for validation
    • Use per-user holdouts (leave-last-one) for per-user evaluation or time-based splits to avoid temporal leakage.
    • Decide treatment of cold users to match the intended deployment scenario.
  5. Evaluate
    • For ranking sensitivity, use NDCG@K: compute DCG per user, compute IDCG per user, normalize, then average.
    • For ordering-sensitive precision, compute MAP@K:
      • For each user: at each rank m ≤ K, compute precision@m if the item is relevant; average these → AP@K for the user.
      • Average AP@K across users → MAP@K.
    • Report additional measures where relevant: novelty, serendipity, coverage, and business metrics (CTR, engagement, revenue).
  6. Reproducibility and rigor
    • Document exact metric definitions (denominators, clipping, smoothing).
    • Ensure evaluation code matches the protocol; be aware of multiple valid variants for ranking evaluations.
    • When comparing with external results, verify matching metric and split logic.
  7. Production considerations
    • Filter already viewed/purchased items when appropriate.
    • Choose K to match UI and business needs.
    • Account for domain-specific behaviors (repeat purchases, session-based vs long-term).

Common pitfalls and nuances


References and materials


Course logistics


Speakers and sources

Note: The course will cover more advanced models, evaluation protocols, and online policy evaluation in later lectures.

Category ?

Educational


Share this summary


Is the summary off?

If you think the summary is inaccurate, you can reprocess it with the latest model.

Video