The squared distance from any sample point to the origin has a x² distribution with mean d. Consider a prediction point x₁ drawn from this distribution, and let a = Xo/|xo| be an associated unit vector. Let zi aTx; be the projection of each of the training points on this - direction. (a). Show that the z; are distributed N(0, 1) with expected squared distance from the origin 1, while the target point has expected squared distance d from the origin. (b). For d = 10 show that the expected distance of a test point from the centre of the training data is 3.1 standard deviations, while all the training points have expected distance 0.80 along direction a. So most prediction points see themselves as lying on the edge of the training set. Note: for this question you need to use a result for the expected value of a squared root of a chi-squared distribution. Either find such a result, or obtain your answer by simulation.

The squared distance from any sample point to the origin has a x² distribution with mean d. Consider a prediction point x₁ drawn from this distribution, and let a = Xo/|xo| be an associated unit vector. Let zi aTx; be the projection of each of the training points on this - direction. (a). Show that the z; are distributed N(0, 1) with expected squared distance from the origin 1, while the target point has expected squared distance d from the origin. (b). For d = 10 show that the expected distance of a test point from the centre of the training data is 3.1 standard deviations, while all the training points have expected distance 0.80 along direction a. So most prediction points see themselves as lying on the edge of the training set. Note: for this question you need to use a result for the expected value of a squared root of a chi-squared distribution. Either find such a result, or obtain your answer by simulation.

MATLAB: An Introduction with Applications

6th Edition

ISBN:9781119256830

Author:Amos Gilat

Publisher:Amos Gilat

Chapter1: Starting With Matlab

Section: Chapter Questions

Problem 1P

See similar textbooks

Related questions

Q: The following table contains output from a lasso fit to a linear model with d = 5 variables and n =…

A: In the realm of statistical modeling, particularly in linear regression, the Lasso (Least Absolute…

Q: We fit a ridge regression model to some data for λ=10 and λ=100 and we obtain the coefficients…

A: We fit a ridge regression model to some data for λ=10 and λ=100 and we obtain the coefficients…

Q: You have the following data for Johansen's Imax rank test for cointegration between 4 international…

A: Outcome is given for cointegration of 4 international equity market indices.

Q: An experiment was conducted to assess the effect of baking temperature on the density of bread. A…

A: We can identify the treatments in the experiment by examining the characteristics that are…

Q: 1. Use the rnorm() function to generate a random sample of size N = 50 from the normal distribution…

A: Given that a random sample of size N=50 from the normal distribution Nμ, σ2, with μ=23 and σ=3.

Q: A point is randomly selected with uniform probability in the X-Y. Plane within the rectangle with…

A: Given that, A point is randomly selected with uniform probability in the XY-Plane within the…

Q: A single observation of a random variable having a Beta Distribution with a =2 and B unknown is used…

A: Answer is given below:

Q: Next, suppose that I am interested in the number of mutations at 10 locations for 100 patients. I…

A: Given: The variable of interest is the number of mutations at one fixed location for 100 patients

Q: a. Generate and plot a data set of N = 1,000 two-dimensional vectors that stem from three…

A: The objective is to generate and plot a set of two-dimensional vectors that stem from three…

Q: 2. An economist used the least squares procedure to fit a regression model of the form y₁ =B₁ +…

A: “Since you have posted multiple questions with multiple sub-parts, we will provide the solution only…

Q: and arsenide is independent of producing a high percentage of workablewafers, which are the main…

A: It is an important part of statistics. It is widely used.

Q: The following is a design matrix X, response vector Y and (X'X)¯' for a multiple linear regression 1…

A: The matrix given are The model is

Q: The following table contains output from a lasso fit to a linear model with d = 6 variables and n =…

A: ### Proportion of ShrinkageIn the context of Lasso regression, the proportion of shrinkage shows how…

Q: If X and Y are independent random variables with means and variances (u,0²) and (µ¸, σ²),…

Q: The demands X and Y for two popular menu items are modeled as discrete RVs with the following joint…

A: Solution: The joint probability mass function of X and Y is X|Y 0 1 Total 0 0.25 0.25 0.5 1…

Q: Find the mean of ||y||2, where || - || is the L₂ norm, if y = A.x+n where the random vectors x, n…

A: Sol:- We know that: ||y||^2 = (A.x + n).(A.x + n) Expanding the above equation, we get: ||y||^2 =…

Q: Given a collection of data points {(xi, y₁)}₁ find the best least squares approxima- tion of the…

A: From the given data points are , we need to fit the curve by the method of least squares. Let…

Q: Suppose we have three independent random variables X, Y and Z where... Var(X + 2Y) = 13, Var(2Y +…

Question

The problem with KNN is that in high dimensions, most points tend to lie on the boundary of the data space. Consider explanatory variables drawn from a spherical multinormal distribution x ~ N(0, I), where x is a random d-vector, and I is a d x d identity matrix.

The squared distance from any sample point to the origin has a x² distribution with mean
d. Consider a prediction point xo drawn from this distribution, and let a = Xo/||xo|| be an
associated unit vector. Let z; = aTx; be the projection of each of the training points on this
direction.
(a). Show that the z; are distributed N(0, 1) with expected squared distance from the origin
1, while the target point has expected squared distance d from the origin.
(b). For d = 10 show that the expected distance of a test point from the centre of the
training data is 3.1 standard deviations, while all the training points have expected
distance 0.80 along direction a. So most prediction points see themselves as lying on
the edge of the training set. Note: for this question you need to use a result for the
expected value of a squared root of a chi-squared distribution. Either find such a result,
or obtain your answer by simulation.

Quantities that have magnitude and direction but not position. Some examples of vectors are velocity, displacement, acceleration, and force. They are sometimes called Euclidean or spatial vectors.

Expert Solution

This question has been solved!

Explore an expertly crafted, step-by-step solution for a thorough understanding of key concepts.

SEE SOLUTION Check out a sample Q&A here

Step 1: Write the give information

Step 2: Calculations

Step 3: Calculations

Solution

Step by step

Solved in 4 steps with 54 images

SEE SOLUTION Check out a sample Q&A here

Similar questions

what does the equation, d/dt Π = MΠ calculate for where Π is population vector describing the overall state probability distributions and M is a 4x4 transition rate matrix?
5. Suppose Y₁, Y2,..., Y are independent random variables whose density is for y0 for some > 0. (a) Show that the MLE for is f(y) = 30y²e-04³ n ÔMLE = ΣΤ (b) Use the R code to estimate the bias and the variance when n = 20 and 0 = 2.
Suppose we have collected a random sample from our population, denoted by (xi , yi), i = 1, . . . , n. We now fit a least squares line: yˆi = βˆ 0 + βˆ 1xi (i = 1, . . . , n). What additional assumption do we need in order to carry out statistical inference on our least square estimators βˆ 0 and βˆ 1? c. Using the results we’ve derived in class, prove that the sum of residuals is zero (Pn i=1 ei = 0)
For least-squares to work well, we need: ) the relationship between x and y to be non-linear. residuals to be Uniformly distributed. ) the residuals to have a mean of zero. the residuals to be correlated with the explanatory variable.
Exercise 4 Show that the Central Limit Theorem, as stated in the form of Theorem 4.1, implies that if X₁, X2,, X12 are independent observations from a U (0, 1) distribution, and S = X₁ + X₂++X12, then S-6 is approximately standard normal. (Work through the details and you will see how the constants 6 and 12 arise!) How well does this method work?
Urgent help is neede please
Consider the least squares problem Ax=b, where 1 0 A = - (2) ¹ - () and b 1 (a) Write down the corresponding normal equations. (b) Determine the set of least squares solutions to the problem. (c) Let H = col(A) be the column space of A. Find the best approximation of b in H.
For a GLM with canonical link function, explain how the likelihood equations imply that the residual vector e = (y – @) is orthogonal with C(X).
A recent survey of 1050 U.S. adults selected at random showed that 647 consider the occupation of firefighter to have very great prestige. Estimate the probability (to the nearest hundredth) that a U.S. adult selected at random thinks the occupation of firefighter has very great prestige.