36-465/665, Spring 2021
11 February 2021
\[ \newcommand{\Prob}[1]{\mathbb{P}\left( #1 \right)} \newcommand{\Expect}[1]{\mathbb{E}\left[ #1 \right]} \newcommand{\Var}[1]{\mathrm{Var}\left[ #1 \right]} \newcommand{\Cov}[1]{\mathrm{Cov}\left[ #1 \right]} \newcommand{\Risk}{r} \newcommand{\EmpRisk}{\hat{r}} \newcommand{\Loss}{\ell} \newcommand{\OptimalStrategy}{\sigma} \DeclareMathOperator*{\argmin}{argmin} \newcommand{\ModelClass}{S} \newcommand{\OptimalModel}{s^*} \DeclareMathOperator{\tr}{tr} \]
The difference between the two curves is the risk deviation \(\gamma(\theta)\):
\(\gamma(\theta)\) is really a random function (a stochastic process), and this is one draw from its distribution (one realization of the process)
\[ \Risk(\hat{\theta}) \approx \EmpRisk(\hat{\theta}) + n^{-1}\tr{\mathbf{j}\mathbf{k}^{-1}} \]
\[ \Risk(\hat{\theta}) \approx \EmpRisk(\hat{\theta}) + n^{-1}\tr{\mathbf{j}\mathbf{k}^{-1}} \]