Lecture Workbook Part 1 - Understanding the Problem and the Data

School

Kennesaw State University

Course

NOT SURE

Subject

Statistics

Date

Feb 20, 2024

Type

xlsx

Uploaded by MajorDeer2792

The Problem: The Student Success Center and the Advising Department at Data University want to better understand the factors that determine a student’s GPA upon graduation. They do a great job of helping a student decide exactly what mix of classes to take each semester to satisfy their degree requirements, but they would like to be able to advise students on what lifestyle habits lead to student success. Of course, they tell students generalized statements such as “study hard”, “go to class”, “balance your school and social life”, etc. However, it would be helpful to know what specific habits have the greatest impact on a student’s GPA and to be able to provide students with more targeted recommendations. It would also be helpful to be able to predict the expected GPA of a particular student based on their current habits.

Step 1: The first step in any data science or analytics problem is to understand the problem. This step often requires asking lots of questions and really getting to the root of the problem. We are limited to the information above; however, it does provide us with a relatively good understanding of the problem.

1. What are the pain points?
   What will they do once they have the specific habits?
   Who is the client?
   What do they need to accomplish their goal?
   What do they need?
   What do they want to do?
   What relationships do the variables have to the GPA?
   What relationships do they have with each other?
   To predict the expected GPA of a particular student based on their current habits.

2. What sort of outcome are you or the client looking for?
   What habits contribute towards GPA
   Wants to provide better advice

What defines success and what sort of analysis would add value?
   Improvement in overall GPA
   Average GPA improved by 0.2
   To be completed within 12 weeks

Step 2: Next, we need to use the information provided to formulate a problem statement that can be solved with data.
The key components of a good problem statement are that it is clear, concise, and measurable.

1. Clear, meaning that it is easily understood and not ambiguous.
2. Concise, meaning that it is no longer than it absolutely needs to be; it is straight to the point (*this problem statement should only be 1-2 sentences).
3. Measurable, meaning that it can be measured and is actionable.

Sample Problem Statement: “We will perform an explanatory analysis to determine the most impactful habits that lead to student success and build a model to predict a student’s expected GPA, at graduation, based on student habits, at the time of advisement.”

+ Timeframe
+ Improve average GPA by 0.2
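
To make the two halves of the sample problem statement concrete, here is a minimal Python sketch of what "explanatory analysis" and "predict a student’s expected GPA" could look like in practice. Everything in it is an assumption made for illustration: the habit names (`study_hours`, `classes_skipped`, `sleep_hours`), the five student records, and the choice of Pearson correlation plus a one-variable least-squares line are not taken from the workbook or its data. A real analysis would more likely use a multiple regression on the actual dataset, and correlation alone does not show that a habit causes a higher GPA.

```python
from math import sqrt

# Hypothetical records, invented purely for illustration (NOT the workbook's data).
# Columns: weekly study hours, classes skipped per month, nightly sleep hours, GPA.
students = [
    {"study_hours": 4,  "classes_skipped": 6, "sleep_hours": 6.0, "gpa": 2.5},
    {"study_hours": 8,  "classes_skipped": 5, "sleep_hours": 7.0, "gpa": 2.7},
    {"study_hours": 12, "classes_skipped": 2, "sleep_hours": 6.5, "gpa": 3.3},
    {"study_hours": 16, "classes_skipped": 3, "sleep_hours": 8.0, "gpa": 3.5},
    {"study_hours": 20, "classes_skipped": 0, "sleep_hours": 7.0, "gpa": 4.0},
]
habits = ["study_hours", "classes_skipped", "sleep_hours"]  # assumed habit names


def mean(values):
    return sum(values) / len(values)


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mx) ** 2 for x in xs))
    spread_y = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def fit_line(xs, ys):
    """Ordinary least squares for one predictor: returns (slope, intercept)."""
    mx, my = mean(xs), mean(ys)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum(
        (x - mx) ** 2 for x in xs
    )
    return slope, my - slope * mx


gpas = [s["gpa"] for s in students]

# Explanatory part: rank habits by the strength (absolute value) of their
# correlation with GPA, strongest first.
correlations = {h: pearson([s[h] for s in students], gpas) for h in habits}
ranked = sorted(habits, key=lambda h: abs(correlations[h]), reverse=True)

# Predictive part: fit a line on the single strongest habit.
top_habit = ranked[0]
slope, intercept = fit_line([s[top_habit] for s in students], gpas)


def predict_gpa(habit_value):
    """Expected GPA for a given value of the strongest habit."""
    return intercept + slope * habit_value


for habit in ranked:
    print(f"{habit}: r = {correlations[habit]:+.3f}")
print(f"Predicted GPA at {top_habit} = 10: {predict_gpa(10):.2f}")
```

The ranking corresponds to the "most impactful habits" in the problem statement, and `predict_gpa` stands in for the model used at the time of advisement. The "+ Improve average GPA by 0.2" target could then be measured by comparing the mean GPA before and after the advising change.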
