Capstone Course Project

Overview

Undergraduate

This seminar course is a senior level course designed to allow you to review, analyze, and integrate the work you have completed toward a degree in analytics. You will complete an academic project and present a paper that demonstrates mastery of your program of study in a meaningful culmination of your learning.

You should begin work on your Course Project in Week 3; there is no deliverable in Week 3 but it's a good idea to start now to stay on track. Your course project will consist of the following components, along with their recommended or required deadlines:

Graduate

The purpose of this capstone course is to provide an opportunity for students to integrate the knowledge gained completing various courses in Analytics. Students are expected to complete an approved project that demonstrates mastery of course concepts and their applications. Students should complete all required coursework prior to this course.

Your course project will consist of the following components, along with their recommended or required deadlines:

Submission Format

Undergraduate

Your paper should be written in a case study format and contain the following sections, at a minimum:

The paper should be formatted following APA guidelines and be between 6 - 10 pages in length, not including the bibliography or datasets used. Any data and calculations should be included separately in an Excel file.

Graduate

For graduate level work, the paper should include the following sections:

Reading & Resources

Here are some additional helpful resources that might prove useful, as well:

Tips

Depending on the kind of problem you're analyzing, you'll likely want to do something akin to the following.

Perform exploratory data analysis using descriptive statistics and visual displays to identify factors that influence the goal. Make sure your analysis provides answers to the questions listed below:

  1. Obtain the summary statistics for the variables and comment on them. Be sure to calculate the mean, median and mode for the relevant factors.

  2. Are there outliers in the data for the variables?

  3. What is the best measure of central tendency that you will use to describe the relevant factors?

  4. Create distributions for the relevant variables. Comment on the distributions of data for the variables? Are they symmetrical? Skewed?

  5. Obtain the correlations between variables and comment on them. Present the data for the relevant variables using appropriate charts. Justify your choice for each variable.

  6. Identify any relationships you find between the variables.

  7. Use simple or multiple linear regression technique, or other analytical techniques, to identify the cause and effect relationship between those variables. State the regression equation and interpret the regression coefficients. Comment on the goodness of fit of the regression equation for the data.

  8. Use it to predict future values of the relevant variables.

If you have a time series, then you might want to instead:

  1. Obtain the summary statistics for both variables and comment on them.

  2. Create time-series plot of the relevant variables by the appropriate time period increment.

  3. Comment on the trend(s) you see on the chart.

  4. Create other diagrams like scatter diagrams for the response variables and the predictor variables? Ae the variables related to each other?

  5. Determine the correlation between the variables and comment on the nature and strength of relationship between them.

  6. Use techniques like multiple or simple linear regression technique to identify the cause and effect relationship between the relevant variables.

  7. Comment on the goodness of fit of the regression equation for the data. Do you think your prediction(s) are accurate?

Machine Learning Guides

Please do feel free to see the Machine Learning chapter of my book, which is freely available on ResearchGate here, and has an overview of the machine learning process and summary of some fundamental machine learning approaches.

There are many machine learning techniques you can employ but some might be more complex or difficult to implement than others. Here is a summary of some of the more popular and effective approaches.

Machine Learning Algorithms

Paper Formatting Tips

In your final paper, please do include a separate Table of Contents, List of Figures, and List of Tables. You can see how to make them here:

There is some flexibility in the styling, of course, but, in the end, you should have a Table of Contents, a List of Tables, and a List of Figures, somewhat like the following:

https://libroediting.com/2012/12/27/table-of-figures-and-table-of-tables/

In addition, you can refer to these guides on APA in general, as well: