Wed Mar 24 SIMON INITIATIVE RESEARCH PROJECT MSP Team: Jenny Mao, Jinghao Huang, Rebecca Pei jinghao - intro use pattern changes for instructors and students due to covid jenny data (mainly usage data for her) 4 data files 29 tables each (2019-2020 semesters) studnet demographics tutor use and exam scores instructor activity PPS -- Retention/Mobility MSP Team: Gloria Guo, Jenny Luo, Yuhang Ying slide 5 spr 2020 bump fall 2020 drops (sl 8 and 9) jinghao chekcpoint scores data sl 10 are these student curves? shihua student perforance data (logins..) spring logins higher than fall still in data collection, cleaning, and variable selection ---------------------------------------- NBA Research Project (Basketball) MSP Team: Andrew Liu, Reed Peterson, Willis Lu andrew - intro & data overview - bit unpracticed.. adjust +/- is this a better predictor can it account for team strength, coaching, ... priors contract data, team ratings...., webscraping willis - more on the data - 538 data freely available, also some methods illustration slide 8 "elo rating"? (like chess...) reed - the shift data 834.com (?) willis linear regression / bayesian regression (pyMC3 package) sl 14 - what did you use for priors for the skeletpn how does the contract info get incorporated into priors compare with simpler methids compare with offense/defense ratings (or predict with +/-) --------------------------------------------------------- PPS -- Retention/Mobility MSP Team: Gloria Guo, Jenny Luo, Yuhang Ying Huiyi (Gloria) intro - how does promise influence post secondary? any policy implications? what affects promise elegibility and use? Yuhang 11 data sets! lots fo data processing data cleaning joining etc Huiyi more on data Jenny even more data! yuhang - methods prelim eda jenny - describe further eda huiyi - scatter plots blue red scatter plot on "results" slide - why are there blues outside the square? - they wil clarify... jenny - barplots on demographic influences of promise her first slide right graph - proportions might be better than counts 2nd slde - seriation on left? right plot where is the data from ? just votech? jenny next steps what is "the model" in orange circles?