project1_1

School: Norco College
Course: 70A
Subject: Statistics
Date: Jan 9, 2024


project1

October 17, 2023

[1]: # Initialize Otter
     import otter
     grader = otter.Notebook("project1.ipynb")

1 Project 1: World Progress

In this project, you’ll explore data from Gapminder.org, a website dedicated to providing a fact-based view of the world and how it has changed. That site includes several data visualizations and presentations, but also publishes the raw data that we will use in this project to recreate and extend some of their most famous visualizations.

The Gapminder website collects data from many sources and compiles them into tables that describe many countries around the world. All of the data they aggregate are published in the Systema Globalis. Their goal is “to compile all public statistics; Social, Economic and Environmental; into a comparable total dataset.” All data sets in this project are copied directly from the Systema Globalis without any changes.

This project is dedicated to Hans Rosling (1948-2017), who championed the use of data to understand and prioritize global development challenges.

1.0.1 Logistics

Rules. Don’t share your code with anybody but your partner. You are welcome to discuss questions with other students, but don’t share the answers. The experience of solving the problems in this project will prepare you for exams (and life). If someone asks you for the answer, resist! Instead, you can demonstrate how you would solve a similar problem.

Support. You are not alone! Come to office hours and talk to your classmates. If you want to ask about the details of your solution to a problem, come see me. If you’re ever feeling overwhelmed or don’t know how to make progress, email for help.

Tests. The tests that are given are not comprehensive and passing the tests for a question does not mean that you answered the question correctly. Tests usually only check that your table has the correct column labels.
However, more tests will be applied to verify the correctness of your submission in order to assign your final score, so be careful and check your work! You might want to create your own checks along the way to see if your answers make sense. Additionally, before you submit, make sure that none of your cells take a very long time to run (several minutes).

Free Response Questions: Make sure that you put the answers to the written questions in the indicated cell we provide. Every free response question should include an explanation that adequately answers the question.

Tabular Thinking Guide: Feel free to reference the Tabular Thinking Guide for extra guidance.

Advice. Develop your answers incrementally. To perform a complicated table manipulation, break it up into steps, perform each step on a different line, give a new name to each result, and check that each intermediate result is what you expect. You can add any additional names or functions you want to the provided cells. Make sure that you are using distinct and meaningful variable names throughout the notebook. Along that line, DO NOT reuse the variable names that we use when we grade your answers. For example, in Question 1 of the Global Poverty section we ask you to assign an answer to latest. Do not reassign the variable name latest to anything else in your notebook, otherwise there is the chance that our tests grade against what latest was reassigned to.

You never have to use just one line in this project or any others. Use intermediate variables and multiple lines as much as you would like!

To get started, load datascience, numpy, plots, and otter.

[2]: # Run this cell to set up the notebook, but please don't change it.
     # These lines import the NumPy and Datascience modules.
     from datascience import *
     import numpy as np
     # These lines do some fancy plotting magic.
     %matplotlib inline
     import matplotlib.pyplot as plots
     plots.style.use('fivethirtyeight')
     from ipywidgets import interact, interactive, fixed, interact_manual
     import ipywidgets as widgets
     import d8error

1.1 1. Global Population Growth

The global population of humans reached 1 billion around 1800, 3 billion around 1960, and 7 billion around 2011. The potential impact of exponential population growth has concerned scientists, economists, and politicians alike.

The UN Population Division estimates that the world population will likely continue to grow throughout the 21st century, but at a slower rate, perhaps reaching 11 billion by 2100.
However, the UN does not rule out scenarios of more extreme growth.

In this part of the project, we will examine some of the factors that influence population growth and how they have been changing over the years and around the world. There are two main sub-parts of this analysis.

• First, we will examine the data for one country, Bangladesh. We will see how factors such as life expectancy, fertility rate, and child mortality have changed over time in Bangladesh, and how they are related to the rate of population growth.
• Next, we will examine whether the changes we have observed for Bangladesh are particular to that country or whether they reflect general patterns observable in other countries too. We will study aspects of world population growth and see how they have been changing.

The first table we will consider contains the total population of each country over time. Run the cell below.

[3]: population = Table.read_table('population.csv').where("time", are.below(2021))
     population.show(3)

<IPython.core.display.HTML object>

Note: The population csv file can also be found here. The data for this project was downloaded in February 2017.

1.1.1 Bangladesh

The nation of Bangladesh was established as a parliamentary democracy after the Bangladesh Liberation War ended in 1971. The war-ravaged fledgling nation was almost immediately faced with floods and famine. In this section of the project, we will examine aspects of the development of Bangladesh since that time.

In the population table, the geo column contains three-letter codes established by the International Organization for Standardization (ISO) in the Alpha-3 standard. We will begin by taking a close look at Bangladesh. Use the Alpha-3 link to find the 3-letter code for Bangladesh.

Question 1. Create a table called b_pop that has two columns labeled time and population_total. The first column should contain the years from 1970 through 2020 (including both 1970 and 2020) and the second should contain the population of Bangladesh in each of those years.

[4]: # are.between keeps 1970 <= time < 2021, i.e. 1970 through 2020.
     b_pop = (population.where('geo', are.equal_to('bgd')).drop('geo')
              .where('time', are.between(1970, 2021)))
     b_pop

[4]: time | population_total
     1970 | 64232486
     1971 | 65531635
     1972 | 66625706
     1973 | 67637541
     1974 | 68742222
     1975 | 70066310
     1976 | 71652386
     1977 | 73463593
     1978 | 75450033
     1979 | 77529040
     … (41 rows omitted)
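Following the earlier suggestion to create your own checks, here is a minimal sketch of one for Question 1. It assumes that are.between(1970, 2021) keeps the lower bound and drops the upper one, which plain Python's range mirrors, so the sketch runs without the datascience package or population.csv. The helper half_open_years and the names shown_rows and omitted_rows are hypothetical; the numbers are copied from the b_pop output above.

```python
# Hypothetical self-check for Question 1; plain Python stands in for the
# datascience Table so that the sketch is self-contained.

# The ten (time, population_total) rows displayed in the b_pop output.
shown_rows = [
    (1970, 64232486), (1971, 65531635), (1972, 66625706), (1973, 67637541),
    (1974, 68742222), (1975, 70066310), (1976, 71652386), (1977, 73463593),
    (1978, 75450033), (1979, 77529040),
]
omitted_rows = 41  # taken from the "(41 rows omitted)" line of the output


def half_open_years(start, stop):
    """Return the years y with start <= y < stop (hypothetical helper)."""
    return list(range(start, stop))


expected_years = half_open_years(1970, 2021)
shown_years = [year for year, _ in shown_rows]
shown_pops = [pop for _, pop in shown_rows]

# Both endpoints requested by Question 1 are covered.
assert expected_years[0] == 1970 and expected_years[-1] == 2020
# One row per year: displayed rows plus omitted rows should match.
assert len(shown_rows) + omitted_rows == len(expected_years)
# The displayed rows are the first expected years, in order.
assert shown_years == expected_years[:len(shown_rows)]
# Every displayed population is a positive count.
assert all(pop > 0 for pop in shown_pops)

print(len(expected_years), "rows expected")
```

A check like this only confirms the shape of the answer (year range and row count); it does not confirm that the populations themselves belong to Bangladesh.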
