problem24 v2

pdf

School

University of Michigan *

*We aren’t endorsed by this school

Course

215

Subject

Computer Science

Date

Nov 24, 2024

Type

pdf

Pages

Uploaded by TareqA5

Final Exam, Spring 2022: March (through May) Madness Version 1.0 All of the header information is important. Please read it.. Topics, number of exercises: This problem builds on your knowledge of Pandas, SQL, and numeric computation. It has 9 exercises, numbered 0 to 8. There are 19 available points. However, to earn 100% the threshold is 12 points. (Therefore, once you hit 12 points, you can stop. There is no extra credit for exceeding this threshold.) Exercise ordering: Each exercise builds logically on previous exercises, but you may solve them in any order. That is, if you can't solve an exercise, you can still move on and try the next one. Use this to your advantage, as the exercises are not necessarily ordered in terms of difficulty. Higher point values generally indicate more difficult exercises. Demo cells: Code cells starting with the comment ### define demo inputs load results from prior exercises applied to the entire data set and use those to build demo inputs. These must be run for subsequent demos to work properly, but they do not affect the test cells. The data loaded in these cells may be rather large (at least in terms of human readability). You are free to print or otherwise use Python to explore them, but we did not print them in the starter code. Debugging you code: Right before each exercise test cell, there is a block of text explaining the variables available to you for debugging. You may use these to test your code and can print/display them as needed (careful when printing large objects, you may want to print the head or chunks of rows at a time). Exercise point breakdown: Exercise 0: 1 point Exercise 1: 3 point Exercise 2: 1 point Exercise 3: 3 point Exercise 4: 2 point Exercise 5: 3 point Exercise 6: 3 point Exercise 7: 1 point Exercise 8: 2 point Final reminders: Submit after every exercise Review the generated grade report after you submit to see what errors were returned Stay calm, skip problems as needed, and take short breaks at your leisure Basketball basics In this notebook we want to predict which team will win a basketball game based on their past performance in a given season. You do not have to know anything about basketball aside from the background below to complete this notebook. Games. A basketball game is played between two teams. In most games, there is a "home" team and an "away" team. However sometimes games are played at a neutral site, in which case neither team is "home" nor "away." Scoring and winning. In a game, the team that scores more points wins. There are no ties. Possessions. A possession is an event where one team continuously controls the ball. In [1]: import sqlite3 import numpy as np import pandas as pd import re conn = sqlite3.connect('resource/asnlib/publicdata/basketball_db.sqlite') Exercise 0 - (1 Points):

We have a big collection of real data from several seasons of men's NCAA basketball. We are most interested in one table of this database, named MRegularSeasonDetailedResults . Complete the function get_cols(conn) to return a list of the columns in the MRegularSeasonDetailedResults table in the db connection conn . The order of the columns in the list should be the same as the order they appear in the database table. In [2]: ### Define get_cols def get_cols(conn): ### ### YOUR CODE HERE ### return pd.read_sql('''select * from MRegularSeasonDetailedResults limit 1''', conn)\ .columns\ .tolist() The demo cell below should display the following output: ['index', 'Season', 'DayNum', 'WTeamID', 'WScore', 'LTeamID', 'LScore', 'WLoc', 'NumOT', 'WFGM', 'WFGA', ' WFGM3', 'WFGA3', 'WFTM', 'WFTA', 'WOR', 'WDR', 'WAst', 'WTO', 'WStl', 'WBlk', 'WPF', 'LFGM', 'LFGA', 'LFGM 3', 'LFGA3', 'LFTM', 'LFTA', 'LOR', 'LDR', 'LAst', 'LTO', 'LStl', 'LBlk', 'LPF'] In [3]: ### call demo funtion print(get_cols(conn)) The cell below will test your solution for Exercise 0. The testing variables will be available for debugging under the following names in a dictionary format. input_vars - Input variables for your solution. original_input_vars - Copy of input variables from prior to running your solution. These should be the same as input_vars - otherwise the inputs were modified by your solution. returned_output_vars - Outputs returned by your solution. true_output_vars - The expected output. This should "match" returned_output_vars based on the question requirements - otherwise, your solution is not returning the correct output. In [4]: ### test_cell_ex0 from tester_fw.testers import Tester_ex0 tester = Tester_ex0() for _ in range(20): try : tester.run_test(get_cols) (input_vars, original_input_vars, returned_output_vars, true_output_vars) = tester.get_test_vars() except : (input_vars, original_input_vars, returned_output_vars, true_output_vars) = tester.get_test_vars() raise ### ### AUTOGRADER TEST - DO NOT REMOVE ### print('Passed! Please submit.') ['index', 'Season', 'DayNum', 'WTeamID', 'WScore', 'LTeamID', 'LScore', 'WLoc', 'NumOT', 'WFGM', 'WF GA', 'WFGM3', 'WFGA3', 'WFTM', 'WFTA', 'WOR', 'WDR', 'WAst', 'WTO', 'WStl', 'WBlk', 'WPF', 'LFGM', ' LFGA', 'LFGM3', 'LFGA3', 'LFTM', 'LFTA', 'LOR', 'LDR', 'LAst', 'LTO', 'LStl', 'LBlk', 'LPF'] initializing tester_fw.tester_6040 Passed! Please submit.

Abriged data dictionary In the MRegularSeasonDetailedResults table, each record is one game. All columns prefixed by 'W' refer to the winning team, and columns prefixed by 'L' refer to the losing team. We are primarily interested in the following columns: Column Type Has Prefix Description Season Integer No Identifies the calendar year when a season ends. DayNum Integer No Identifies the day in a season when a game occurred. TeamID Integer W or L Identifies the teams participating in a game. Score Integer W or L Number of points a team scored in a game. Loc Char W Only Identifies whether the winning team was home ('H'), away ('A'), or the game was played at a neutral site ('N'). FGA Integer W or L Number of field goal attempts occurring when a team had in a game FTA Integer W or L Number of free throw attempts a team had in a game. TO Integer W or L Number of turnovers a team had in a game. OR Integer W or L Number of offensive rebounds a team had in a game. For example, the column with the winning team's ID will be WTeamID, and the column with the losing team's free-throw attempts will be LFTA. Exercise 1 - (3 Points):

Your preview ends here