Final Project

Each student will be responsible for completing a final project. The goal of this final project is to explore some aspect of your thesis research in a fully reproducible environment and with publication-worthy quality. Ideally, your final project will leverage your thesis data and address one or more of your thesis research objectives by applying an advanced analytic technique and interpreting the results.

Upon submission you will supply the final PDF report. You will also submit the .Rmd file that produced the PDF report, your data, and any other files your .Rmd file leverages (images, .bib file, etc.). Your submitted files should be named with year, course number, lastname, first & middle initial, and then “finalproject.” For example my file name would be:

  • OPER 682 section: 2017_OPER682_boehmkebc_finalproject.Rmd
  • LOGM 682 section: 2017_LOGM682_boehmkebc_finalproject.Rmd
Section Standard Possible Points
Introduction 1.1 Provide an introduction and some background regarding your overarching thesis problem.
1.2 Outline the specific sub-problem you plan to address in this class.
1.3 Discuss your current proposed approach/analytic technique you think will address (fully or partially) this problem.
1.4 Explain how your analysis will help the decision-maker.
5
Packages Required 2.1 All packages used are loaded upfront so the reader knows which are required to replicate the analysis.
2.2 Messages and warnings resulting from loading the package are suppressed.
2.3 Explanation is provided regarding the purpose of each package (there are over 10,000 packages, don't assume that I know why you loaded each package).
5
Data Preparation 3.1 Original source where the data was obtained is cited and, if possible, hyperlinked.
3.2 Source data is thoroughly explained (i.e. what was the original purpose of the data, when was it collected, how many variables did the original have, explain any peculiarities of the source data such as how missing values are recorded, or how data was imputed, etc.).
3.3 Data importing and cleaning steps are explained in the text (tell me why you are doing the data cleaning activities that you perform) and follow a logical process.
3.4 Once your data is clean, show what the final data set looks like. However, do not print off a data frame with 200+ rows; show me the data in the most condensed form possible.
3.5 Provide summary information about the variables of concern in your cleaned data set. Do not just print off a bunch of code chunks with str(), summary(), etc. Rather, provide me with a consolidated explanation, either with a table that provides summary info for each variable or a nicely written summary paragraph with inline code.
10
Exploratory Data Analysis 4.1 Uncover new information in the data that is not self-evident (i.e. do not just plot the data as it is; rather, slice and dice the data in different ways, create new variables, or join separate data frames to create new summary information).
4.2 Provide findings in the form of plots and tables. Show me you can display findings in different ways.
4.3 Graph(s) are carefully tuned for desired purpose. One graph illustrates one primary point and is appropriately formatted (plot and axis titles, legend if necessary, scales are appropriate, appropriate geoms used, etc.).
4.4 Table(s) carefully constructed to make it easy to perform important comparisons. Careful styling highlights important features. Size of table is appropriate.
4.5 Insights obtained from the analysis are thoroughly, yet succinctly, explained. Easy to see and understand the interesting findinsg that you uncovered.
10
Modeling Results 5.1 Properly introduce your modeling technique and its relevance to your problem.
5.2 Provide mathematical formulae to discribe how the analytic technique(s) compute results.
5.3 Implement the analytic technique efficiently.
5.4 Provide adequate interpretation of the results from your analysis. Try to capture a balance in your explanation so that both an analyst and a senior leader can understand what the results mean.
5.5 Illustrate your findings in the form of plots and tables and fine tune these outputs to best describe your results.
5.6 Provide adequate justification and validation of your final approach and results.
10
Summary 6.1 Summarize the problem statement you addressed.
6.2 Summarize how you addressed this problem statement (the data used and the methodology employed).
6.3 Summarize the interesting insights that your analysis provided.
6.4 Summarize the policy or decision-making implications as a result of your analysis.
6.5 Discuss the limitations of your analysis and how you, or someone else, could improve upon it.
5
Formatting & Other Requirements 7.1 Proper coding style is followed and code is well commented (see section regarding syle).
7.2 Coding is systematic - complicated problem broken down into sub-problems that are individually much simpler. Code is efficient, correct, and minimal. Code uses appropriate data structure (list, data frame, vector/matrix/array). Code checks for common errors.
7.3 Achievement, mastery, cleverness, creativity: Tools and techniques from the course are applied very competently and, perhaps,somewhat creatively. Perhaps student has gone beyond what was expected and required, e.g., extraordinary effort, additional tools not addressed by this course, unusually sophisticated application of tools from course.
7.4 .Rmd fully executes without any errors and PDF produced matches the PDF report submitted by student.
15

Total possible points: 60