Skip to content

Latest commit

 

History

History
41 lines (34 loc) · 1.57 KB

LearningTheory.md

File metadata and controls

41 lines (34 loc) · 1.57 KB

(Statistical) Learning Theory

Contents

  • Introduction.
  • Sequential decision making.
  • Expert advice and multi-armed bandits.
  • Online convex optimization.
  • Contextual bandits.

Introduction

  • Main contributions:
    • Mathematical model of learning and conditions characterizing what can be learned.
    • Choice of learning bias, control of overfitting.
    • SVM, Boosting
  • Components:
    • Feature:
    • Label: regression, classification.
    • Loss function, Square, Zero-one, Hlinge.
    • Model: $$f: \mathcal{X} \rightarrow \mathcal{Y}$$.
    • Train set $$(x_1, y_1) ,\cdot, (x_m, y_m)$$.
    • Learning algorithm: mapping training sets to models for a given loss.
  • Others:
    • Sample $$(x, y)$$ is drawn from unknown distribution $$\mathcal{D}$$.
    • Train set is a random sample from $$\mathcal{D}$$.
    • Bayes optimal predictor $$f^* = argmin_{\hat{y} \in \mathcal{Y}} \mathbb{E}[l(Y, \hat{y})|X=x]$$.
  • Bias-variance decomposition.

Sequential decision making

  • Problem:
    • d actions.
    • unknown deterministic assignment of losses to actions ##\mathbf{l}_t \in [0, 1]^d## for each time step t.
    • target: strategy for picking action, in order to minimize the Regret.
  • Types, according to knowing what feedback information:
    • Expert: knowing all losses to actions.
    • Bandits: knowing only one loss for its associated action.
    • Mediate: not all, not only one, knowing neighborhood feedbacks.With computational graph.