midterm_V1_Winter2020_solution

pdf

School

University of California, Los Angeles *

*We aren’t endorsed by this school

Course

144

Subject

Economics

Date

Feb 20, 2024

Type

pdf

Pages

17

Uploaded by AgentCrocodilePerson723

Report
Dr. Randall R. Rojas Economics 144 Department of Economics Economic Forecasting UCLA Winter, 2020 Midterm Exam February 13, 2020 For full credit on a problem, you need to show all your work and the formula(s) used. First Name Last Name UCLA ID # Please do not start the exam until instructed to do so.
1.(15%) For this problem the data consist of the average weekly cardiovascular mortality in Los Ange- les County from 1970-1979 as shown in the left-figure below. We then fit a model with trend and seasonality components to the data (left-figure, the dashed line), and plot the respec- tive residuals (right-figure), and the ACF and PACF of the residuals (two bottom-right plots). Time Weekly Mortality 1970 1972 1974 1976 1978 1980 70 80 90 100 110 120 130 Residuals 1970 1972 1974 1976 1978 1980 -10 0 10 20 5 10 15 20 25 -0.1 0.1 0.2 0.3 0.4 Lag ACF 5 10 15 20 25 -0.1 0.1 0.2 0.3 0.4 Lag PACF (a) Based on the plots above, how would you improve the model fit? I would consider either an ARMA or ARIMA model for characterizing the serial corre- lations (cycles) observed in the ACF and PACF plots of the residuals. (b) Does the original data series seem to be covariance stationary? If so, how can you tell? If not, why not and what would you recommend to make it covariance stationary. The downward trend in the series suggests that it might not be covariance station- ary. This is also observed in the ACF and PACF plots. Therefore, I would try taking the first difference of the original series.
(c) Does the residuals plot show any serial correlation? Explain your answer in detail. Yes, the spikes in both the ACF and PACF plots seem to suggest strong serial cor- relation. (d) Suppose you use auto.arima to fit a model to the original data series, and you get the following output: ARIMA(2,1,1)(2,0,2)[52] Coefficients: ar1 ar2 ma1 sar1 sar2 sma1 sma2 0.3285 0.2939 -0.9822 0.2424 0.4687 -0.2648 -0.2758 s.e. 0.0480 0.0422 0.0119 0.0632 0.0628 0.0753 0.0882 Provide a detailed interpretation of the ARIMA model suggested by R. Make sure to comment on any trend, seasonality, and/or cycles if appropriate. Model: ARIMA (2,1,1) + Seasonal-ARIMA(2,0,2) Since it has an I(1) component, this confirms that the data are not covariance stationary. The fact that auto.arima estimated an additional seasonal component, suggest that not all the seasonality had been correctly adjusted for in the original model. The trend is taken care of by the I(1) component of ARIMA. The seasonality is taken care of by the S-ARMA(2,0,2) component. The cycles are taken care of by the ARMA(2,1) component.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
2.(15%) (a) Describe a difference between a forecast for an MA process vs. a forecast for an AR process. One main difference is the quick convergence to the unconditional mean of the MA process over the AR one because of its short-term memory. (b) Describe the advantage of combining forecasts (over e.g., a single forecast) and when would you consider combining them. Assuming the forecasts are from competing models, the advantage comes from the fact that the different models are able to each pick up different dynamics of the series, and by combining them into one, we can therefore produce a more optimal forecast. (c) Describe the differences between a Recursive Scheme and a Rolling Window Scheme. How do the interpretation of the results differ from one another?
3.(15%) The plot below shows the number of births per month in New York city, from January 1946 to December 1959. Number of Births per Month in New York City Time # of Births/month 1946 1948 1950 1952 1954 1956 1958 1960 20 22 24 26 28 30 (a) If you had to choose between an AR(p) and an MA(q) model for this data set, which one would you choose and why? Based on the high persistence and lack of mean reversion, I would suggest an AR(p) model.
(b) Below we show the stl plot of the data. In terms of trend, seasonality, and cycles, briefly discuss which components you would include and why. 20 22 24 26 28 30 data -2.0 -1.0 0.0 1.0 seasonal 22 24 26 28 trend -1.5 -0.5 0.5 1.5 1946 1948 1950 1952 1954 1956 1958 1960 remainder time Based on the figure, it appears that all 3 components are needed. The trend and sea- sonality can be easily observed in the filtered subplots, and the strong cyclical pattern in the remainder suggests cycles are present as well.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
(c) Below is the summary from a fit to the data. Discuss what type of model fit was used, its statistical significance, and the behavior of the seasonal factors. Based on your answer, what else would you consider including in your model to improve the current fit? Estimate Std. Error t value Pr( > | t | ) (Intercept) 21.4654 0.3237 66.32 0.0000 trend 0.0369 0.0017 21.11 0.0000 season2 -1.5299 0.4140 -3.70 0.0003 season3 1.4426 0.4140 3.48 0.0006 season4 -0.2608 0.4141 -0.63 0.5298 season5 0.7896 0.4141 1.91 0.0584 season6 0.3075 0.4141 0.74 0.4588 season7 1.8733 0.4142 4.52 0.0000 season8 1.6502 0.4142 3.98 0.0001 season9 1.1285 0.4143 2.72 0.0072 season10 1.1573 0.4143 2.79 0.0059 season11 -0.7689 0.4144 -1.86 0.0654 season12 -0.0552 0.4145 -0.13 0.8942 The summary output is consistent with the R function tslm where you fit a season and linear trend model. In this case the model would be something like lm ( y trend + season ). More specifically, it consists of a trend component β 0 + β 1 TIME and seasonal (monthly) dummies s i =1 γ i D it .
(d) Suppose we try a different fit to the data and obtain the respective tsdisplay plot below for the residuals. What model would you propose for the residuals. Residuals 1946 1948 1950 1952 1954 1956 1958 1960 -2 0 2 4 0 5 10 15 20 25 30 35 -0.2 0.0 0.2 0.4 0.6 Lag ACF 0 5 10 15 20 25 30 35 -0.2 0.0 0.2 0.4 0.6 Lag PACF I would suggest an AR(1) given the strong spike at k = 1 in the PACF and decay in the ACF.
4.(15%) Given the time-series below: (a) Rewrite the following expression without using the lag operator and identify the process (e.g, AR(p), MA(q), or ARMA(p, q)): L - 2 + 1 2 L - 1 y t = ( L - 1) ε t ( L 2 - 5 L + 3) y t = (2 L - 1)( L - 1) ε t y t = 5 3 y t - 1 - 2 3 y t - 2 + 2 3 ε t - 2 - ε t - 1 + 1 3 ε t ARMA (2 , 2) (b) Rewrite the following expression in lag operator form and identify the process (e.g, AR(p), MA(q), or ARMA(p, q)): 3 ε t - 2 - y t - 2 = 2 ε t - 1 - 3 ε t - 3 - 1 + ε t - y t (1 - L 2 ) y t = (1 + 2 L - 3 L 2 - 3 L 3 ) ε t - 1 ARMA (2 , 3)
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
5.(15%) The figure below shows monthly observations of mortality rates ( mort ), pollution ( part ), and temperature ( tempr ). We would like to develop a VAR model to explain mortality rates, and therefore, perform a Granger Causality Test (see table below figure) to determine if temper- ature and/or pollution Granger cause mortality rates. 70 80 90 100 120 cmort 50 60 70 80 90 100 tempr 20 40 60 80 100 1970 1972 1974 1976 1978 1980 part Time Mortality Rates (Monthly Observations) > grangertest(cmort~tempr, 2) Granger causality test Model 1: cmort ~ Lags(cmort, 1:1) + Lags(tempr, 1:1) Model 2: cmort ~ Lags(cmort, 1:1) Res.Df Df F Pr(>F) 1 504 2 505 -1 75.591 < 2.2e-16 *** --- Signif. codes: 0 ?***? 0.001 ?**? 0.01 ?*? 0.05 ?.? 0.1 ? ? 1 > grangertest(cmort~part, 2) Granger causality test Model 1: cmort ~ Lags(cmort, 1:1) + Lags(part, 1:1) Model 2: cmort ~ Lags(cmort, 1:1) Res.Df Df F Pr(>F) 1 504 2 505 -1 2.0487 0.153
(a) Based on the Granger test results, what would you conclude in terms of Granger causal- ity for mortality rates? The granger test suggests that only temperature Granger causes mortality rates at the e.g., α = 5% level of significance. (b) Below is the tsdisplay plot for the residuals from a VAR model fit to Mortality Rates. Interpret the results. Residuals from a VAR Model Fit 0 100 200 300 400 500 -10 0 10 20 0 5 10 15 20 25 -0.10 0.00 0.05 0.10 Lag ACF 0 5 10 15 20 25 -0.10 0.00 0.05 0.10 Lag PACF From the ACF and PACF plots it looks like the model fit is a good one given the lack of statistically significant spikes in both functions. Also, the fact that the residuals do not have any obvious pattern, confirms our intuition.
(c) Below is the impulse response function from temperature. Interpret the results. cmort 0 2 4 6 tempr 0 2 4 6 0 3 6 9 12 16 20 24 28 32 36 Orthogonal Impulse Response from tempr 95 % Bootstrap CI, 100 runs The effect of a positive shock in temperature on mortality rates for the most part seems to be positive except for the first 1-2 months. This suggests that e.g., an increase in temperature may lead to a decrease in mortality rates. In this cases of temperature on temperature, the effect is mostly an opposite one, lasting a little over a year.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
Questions 6-10 are multiple choice. Please select one answer per question only. 6.(5%) In class we discussed the example: “Which one came first, the chicken or the egg?” Below are the Granger test results. Based on the test which choice is correct? (a) The chicken came first (b) The egg came first (c) The test suggests that both Granger cause each other (d) The test suggests that there is no granger causality between the two (e) None of the above
7.(5%) Which model more closely corresponds to the figure below? Series 0 200 400 600 800 1000 -4 -2 0 2 4 0 5 10 15 20 25 30 -0.2 0.0 0.2 0.4 0.6 Lag ACF 0 5 10 15 20 25 30 -0.2 0.0 0.2 0.4 0.6 Lag PACF (a) y t = φ 1 y t - 1 + φ 2 y t - 2 + φ 3 y t - 3 + ε t (b) y t = θ 3 ε t - 3 + ε t (c) y t = φ 3 y t - 3 + ε t (d) y t = θ 1 ε t - 1 + θ 2 ε t - 2 + θ 3 ε t - 3 + ε t (e) None of the above
8.(5%) The data below are monthly observations. Which model more closely corresponds to the figure below? Data 20 40 60 80 -4 -2 0 2 4 6 0 5 10 15 20 25 30 35 -0.2 0.0 0.2 0.4 0.6 Lag ACF 0 5 10 15 20 25 30 35 -0.2 0.0 0.2 0.4 0.6 Lag PACF (a) S-MA(1) (b) S-AR(1) (c) S-MA(2) (d) S-AR(2) (e) S-ARMA(1,1)
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
9.(5%) The data below are monthly observations. Which model more closely corresponds to the figure below? Data 20 40 60 80 -6 -2 0 2 4 6 0 5 10 15 20 25 30 35 0.0 0.2 0.4 0.6 0.8 Lag ACF 0 5 10 15 20 25 30 35 0.0 0.2 0.4 0.6 0.8 Lag PACF (a) S-AR(1) (b) S-AR(4) (c) S-MA(4) (d) S-MA(1) (e) None of the above
10.(5%) Assume you are trying to choose a best fit model for a particular dataset. If all the model choices listed below show identical performance in terms of the fit and errors, which one would be the best choice? (a) AR(7) (b) ARMA(1,2) (c) MA(10) (d) S-ARMA(1,5) (e) ARMA(1,1) + S-ARMA(2,2)