lab09-AgeUniverse

School

Temple University

Course

1013

Subject

Astronomy

Date

Dec 6, 2023

Type

html

Uploaded by samzahroun

The Age of the Universe ¶

Welcome to Lab 9!

Elements of Data Science adapted from Berkeley Data8

Sometimes, the primary purpose of regression analysis is to learn something about the slope or intercept of the best-fitting line. When we use a sample of data to estimate the slope or intercept, our estimate is subject to random error, just as in the simpler case of the mean of a random sample.

In this lab, we'll use regression to get an accurate estimate for the age of the universe, using pictures of exploding stars. Our estimate will come from a sample of all exploding stars. We'll compute a confidence interval to quantify the error caused by sampling.

In [1]:
name = "Sam Z."

In [2]:
## import statements
# These lines load the tests.
from gofer.ok import check
import numpy as np
from datascience import *
import pandas as pd
import matplotlib
from matplotlib import patches
%matplotlib inline
import matplotlib.pyplot as plt
import seaborn as sns
import warnings
warnings.simplefilter('ignore', FutureWarning)
plt.style.use('fivethirtyeight')
from ipywidgets import interact, interactive, fixed
import ipywidgets as widgets
import os
user = os.getenv('JUPYTERHUB_USER')

The Actual Big Bang Theory ¶

In the early 20th century, the most popular cosmological theory suggested that the universe had always existed at a fixed size. Today, the Big Bang theory prevails: Our universe started out very small and is still expanding.

A consequence of this is Hubble's Law, which says that the expansion of the universe creates the appearance that every celestial object that's reasonably far away from Earth (for example, another galaxy) is moving away from us at a constant speed. If we extrapolate that motion backwards to the time when everything in the universe was in the same place, that time is (roughly) the beginning of the universe!
Scientists have used this fact, along with measurements of the current location and movement speed of other celestial objects, to estimate when the universe started.

The cell below simulates a universe in which our sun is the center and every other star is moving away from us. Each star starts at the same place as the sun, then moves away from it over time. Different stars have different directions and speeds; the arrows indicate the direction and speed of travel.

Run the cell, then move the slider to see how things change over time.

Question 1 ¶

When did the universe start, in this example?

In [3]:

# Just run this cell. (The simulation is actually not
# that complicated; it just takes a lot of code to draw
# everything. So you don't need to read this unless you
# have time and are curious about more advanced plotting.)

num_locations = 15
example_velocities = Table().with_columns(
    "x", np.random.normal(size=num_locations),
    "y", np.random.normal(size=num_locations))
start_of_time = -2

def scatter_after_time(t, start_of_time, end_of_time, velocities, center_name, other_point_name, make_title):
    max_location = 1.1*(end_of_time-start_of_time)*max(max(abs(velocities.column("x"))), max(abs(velocities.column("y"))))
    new_locations = velocities.with_columns(
        "x", (t-start_of_time)*velocities.column("x"),
        "y", (t-start_of_time)*velocities.column("y"))
    plt.scatter(make_array(0), make_array(0), label=center_name, s=100, c="yellow")
    plt.scatter(new_locations.column("x"), new_locations.column("y"), label=other_point_name)
    for i in np.arange(new_locations.num_rows):
        plt.arrow(
            new_locations.column("x").item(i),
            new_locations.column("y").item(i),
            velocities.column("x").item(i),
            velocities.column("y").item(i),
            fc='black',
            ec='black',
            head_width=0.025*max_location,
            lw=.15)
    plt.xlim(-max_location, max_location)
    plt.ylim(-max_location, max_location)
    plt.gca().set_aspect('equal', adjustable='box')
    plt.gca().set_position(make_array(0, 0, 1, 1))
    plt.legend(bbox_to_anchor=(1.6, .7))
    plt.title(make_title(t))
    plt.show()

interact(
    scatter_after_time,
    t=widgets.FloatSlider(min=start_of_time, max=5, step=.05, value=0, msg_throttle=1),
    start_of_time=fixed(start_of_time),
    end_of_time=fixed(5),
    velocities=fixed(example_velocities),
    center_name=fixed("our sun"),
    other_point_name=fixed("other star"),
    make_title=fixed(lambda t: "The world {:01g} year{} in the {}".format(abs(t), "" if abs(t) == 1 else "s", "past" if t < 0 else "future")));

interactive(children=(FloatSlider(value=0.0, description='t', max=5.0, min=-2.0, step=0.05), Output()), _dom_c…
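
The rule behind the simulation can be read off `scatter_after_time`: each star's plotted position is `(t - start_of_time)` times its velocity. The following is a minimal NumPy sketch of that rule only, without the plotting or the `datascience` Table. The helper name `positions_at` and the fixed random seed are assumptions made for illustration and are not part of the lab.

```python
import numpy as np

# Illustrative sketch of the position rule used by scatter_after_time.
# `positions_at` is a hypothetical helper, not part of the lab's code.
def positions_at(t, start_of_time, velocities):
    """Return each star's (x, y) position at time t.

    velocities has shape (n, 2). Every star sits at the origin when
    t == start_of_time and then moves in a straight line.
    """
    return (t - start_of_time) * velocities

rng = np.random.default_rng(0)          # fixed seed, an arbitrary choice
velocities = rng.normal(size=(15, 2))   # 15 stars, as in the cell above
start_of_time = -2

at_start = positions_at(start_of_time, start_of_time, velocities)
at_end = positions_at(5, start_of_time, velocities)

# Distance from the sun and speed for each star at t = 5.
distances = np.linalg.norm(at_end, axis=1)
speeds = np.linalg.norm(velocities, axis=1)

# All stars coincide with the sun at t = start_of_time.
print(np.allclose(at_start, 0))
# At t = 5, 7 time units have elapsed, so distance = 7 * speed.
print(np.allclose(distances, 7 * speeds))
```

Because every star has been travelling for the same amount of time, its distance from the sun is that elapsed time multiplied by its speed. Dividing any star's distance by its speed therefore recovers the elapsed time.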

According to this example, the universe started 2 years ago.

Question 2 ¶

After 5 years (with the slider all the way to the right), stars with longer arrows are further away from the Sun. Why?

A longer arrow means a higher speed. Every star left the sun's position at the same moment, so after the same amount of elapsed time a faster star has covered more distance and ends up further from the sun.

Analogy: driving ¶

Here's an analogy to illustrate how scientists use information about stars to estimate the age of the universe.

Suppose that at some point in the past, our friend Mei started driving in a car going at a steady speed of 60 miles per hour straight east. We're still standing where she started. We want to know how long she's been driving, but we forgot to record the time when she left. If we find out that she's 120 miles away, and she's been going 60 miles per hour the whole time, we can infer that she left 2 hours ago.

One way we can compute that number is by fitting a line to a scatter plot of our locations and speeds. It turns out that the slope of that line is the amount of time that has passed. Run the next cell to see a picture:

In [4]:
# Run this cell to see a picture of Mei's locations over time.
mei_velocity = Table().with_columns("x", make_array(60), "y", make_array(0))
interact(
    scatter_after_time,
    t=widgets.FloatSlider(min=-2, max=1, step=.05, value=0, msg_throttle=1),
    start_of_time=fixed(-2),
    end_of_time=fixed(1),
    velocities=fixed(mei_velocity),
    center_name=fixed("Us"),
    other_point_name=fixed("Mei"),
    make_title=fixed(lambda t: "Mei's position {:01g} hour{} in the {}".format(abs(t), "" if abs(t) == 1 else "s", "past" if t < 0 else "future")));

interactive(children=(FloatSlider(value=0.0, description='t', max=1.0, min=-2.0, step=0.05), Output()), _dom_c…

The slope of the line is 2 hours. (The units are vertical-axis units divided by horizontal-axis units, which are $\frac{\texttt{miles}}{\texttt{miles} / \texttt{hour}}$, or hours.)
So that's our answer.

Imagine that you don't know Mei's exact distance or speed, only rough estimates. Then if you drew this line, you'd get a slightly inaccurate estimate of the time since she left. But if you measured the distance and speed of hundreds of people who left you at the same time going different speeds, and drew a line through them, the slope of that line would be a pretty good estimate of the time since they left, even if the individual measurements weren't exactly right.

The drivers.csv dataset contains the speeds and distances-from-start of 100 drivers. They all left the same starting location at the same time, each driving at a fixed speed on a straight line away from the start. The measurements aren't exact, so they don't fit exactly on a line. We've created a scatter plot and drawn a line through the data.

In [5]:
# Just run this cell to plot the data.
small_driving_example = Table().with_columns(
    "Name", make_array("Us", "Mei"),
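
The driving analogy above says that the slope of a line through (speed, distance) points estimates the elapsed time, and that many rough measurements give a better estimate than one. That idea can be sketched with synthetic data. The numbers below are made up for illustration and are not the contents of drivers.csv. The 2-hour travel time, the speed range, the noise levels, the seed, and names such as `true_hours` are all assumptions. The sketch fits a least-squares line with `np.polyfit`, which is one reasonable choice and not necessarily the method the lab goes on to use.

```python
import numpy as np

# Exact case from the analogy, assuming "Us" is at speed 0 and distance 0
# and Mei is at speed 60 mph and distance 120 miles.
mei_slope, mei_intercept = np.polyfit([0, 60], [0, 120], 1)
print(f"Mei: slope = {mei_slope:.2f} hours")

# Noisy case with synthetic drivers (not the real drivers.csv data).
rng = np.random.default_rng(42)   # arbitrary seed so the run is repeatable
true_hours = 2.0                  # assumed elapsed time since everyone left
num_drivers = 100

# Each synthetic driver has a true constant speed (mph) and a true distance.
true_speeds = rng.uniform(20, 80, size=num_drivers)
true_distances = true_hours * true_speeds

# Measurements are only roughly right: add noise to both quantities.
measured_speeds = true_speeds + rng.normal(0, 2, size=num_drivers)
measured_distances = true_distances + rng.normal(0, 5, size=num_drivers)

# Least-squares line: distance is approximately slope * speed + intercept.
slope, intercept = np.polyfit(measured_speeds, measured_distances, 1)

# One driver's ratio uses a single noisy pair; the fitted slope pools
# all drivers and should land close to true_hours.
single_estimate = measured_distances[0] / measured_speeds[0]

print(f"one driver's distance/speed: {single_estimate:.2f} hours")
print(f"fitted slope over all drivers: {slope:.2f} hours")
```

The slope has units of miles divided by miles per hour, so it is read in hours, matching the units argument given for Mei's line.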
