Instructor: Dr. Yuxiao Hung | https://www.linkedin.com/in/yuxiao-huang-a00b407a/
This course is an introduction of Bayesian data analysis. Topics include parameter estimation (using formal analysis, grid approximation, and Markov chain Monte Carlo), hierarchical models, generalized linear models, JAGS, and Stan. Although lectures will include some theory, the emphasis of the course will be on programming these models in R, JAGS, and Stan, and applying the models to solve real-world problems. This computational aspect will differentiate this course significantly from the Statistics department’s course in Bayesian Methods (STAT 6223).
As a result of completing this course, students will be able to
• Use R, JAGS, and Stan to conduct Bayesian data analysis (parameter estimation, prediction,and model comparison).
• Evaluate Bayesian methods against other approaches on real-world data.
• Write technical report and present the results.
• Work both individually and as a team.
Insurance Forecast by using Linear Regression and MCMC for Final Project for DATS 6450: Bayesian Statistics by Brent Skoumal (https://github.com/b-skoumal) & Prince Birring (https://github.com/princebirring)
Acceptable performance for this standard is based on how well you've applied specific learning goals within your deliverable. To review the full list of data science standards, see the course syllabus.
- Well-articulated problem statement with "specific aim" and hypothesis, based on your lightning talk. 2 .An outline of any potential methods and models.
- Detailed explanation of extant data available.
- Describe any outstanding questions, assumptions, risks, caveats.
- Define your goals and criteria, explain what success looks like.
- Demonstrate domain knowledge, including features or benchmarks from similar projects.
Acceptable performance for this standard is based on how well you've applied specific learning goals within your deliverable. To review the full list of data science standards, see the course syllabus.
- A well organized iPython notebook with code and output.
- At least one visual for each independent variable (and any relationships) using a python visualization tool.
- Provide insight about data set and its impact on your hypothesis.
Acceptable performance for this standard is based on how well you've applied specific learning goals within your deliverable. To review the full list of data science standards, see the course syllabus.
- Create iPython Notebook with code, visualizations, and markdown.
- Summarize your exploratory data analysis.
- Frame source code so it enhances your explanations.
- Explain your choice of validation and prediction metrics.
- Include a separate python module with helper functions.
- Visualize relationships between Y and two strongest variables.
- Identify areas where new data could help improve the model.
- Include project TOC, background, problem, and hypothesis.
- Describe datasets and analysis with summary and charts.
- Demonstrate your model with visualizations.
- Review the conclusions from your findings.
- Create a list of recommendations and next steps based on your work.
- Frame your materials for a non-technical audience.
- Include an appendix with full technical details.
Link: https://prezi.com/icohm5c8pwce/insurance-forecast-by-using-linear-regression-and-mcmc/