For any model with an intercept (even if it doesn't fit the data well), sum(y - X*betahat) = 0 (use HX = X, so H1 = 1, and sum(x) = x^T 1).
Develop cor(y, yhat)^2 = R^2.
The weird G&H question (old hw03).
Something on IMRAD.
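A quick numerical check of the first two items may help when developing them. The sketch below is only an illustration under assumptions: the simulated data, the seed, and all variable names are made up here, and the model is deliberately misspecified (a straight line fit to a sine curve) to show that the residual sum is zero even when the fit is poor. It also checks the standard least-squares identity that, with an intercept, the squared correlation between y and yhat equals R^2.

```python
import numpy as np

# Hypothetical simulated data: the true relationship is nonlinear,
# so the straight-line fit below is intentionally a poor one.
rng = np.random.default_rng(0)
n = 50
x = rng.normal(size=n)
y = np.sin(3 * x) + rng.normal(scale=0.5, size=n)

# Design matrix with an intercept column of ones.
X = np.column_stack([np.ones(n), x])

# Hat matrix H = X (X^T X)^{-1} X^T, so that yhat = H y.
H = X @ np.linalg.solve(X.T @ X, X.T)
hx_equals_x = np.allclose(H @ X, X)  # HX = X, hence H1 = 1

y_hat = H @ y
resid = y - y_hat

# Sum of residuals: 1^T (I - H) y, which is zero because H1 = 1 and H is symmetric.
resid_sum = resid.sum()

# R^2 from its definition, and the squared correlation between y and yhat.
r2 = 1 - np.sum(resid**2) / np.sum((y - y.mean()) ** 2)
cor2 = np.corrcoef(y, y_hat)[0, 1] ** 2

print(hx_equals_x, resid_sum, r2, cor2)
```

Dropping the column of ones from X is a useful contrast: the residual sum is then generally nonzero, and the R^2 identity no longer holds in this form.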