Question

Solve parts A, B, C, and D.

Inner loop of inner4. data_t = double, OP = *
udata in %rbp, vdata in %rax, sum in %xmm0
i in %rcx, limit in %rbx
.L15:                                    loop:
  vmovsd  0(%rbp,%rcx,8), %xmm1            Get udata[i]
  vmulsd  (%rax,%rcx,8), %xmm1, %xmm1      Multiply by vdata[i]
  vaddsd  %xmm1, %xmm0, %xmm0              Add to sum
  addq    $1, %rcx                         Increment i
  cmpq    %rbx, %rcx                       Compare i:limit
  jne     .L15                             If !=, goto loop
Assume that the functional units have the characteristics listed in Figure 5.12.
A. Diagram how this instruction sequence would be decoded into operations
and show how the data dependencies between them would create a critical
path of operations, in the style of Figures 5.13 and 5.14.
B. For data type double, what lower bound on the CPE is determined by the
critical path?
C. Assuming similar instruction sequences for the integer code as well, what
lower bound on the CPE is determined by the critical path for integer data?
D. Explain how the floating-point versions can have CPEs of 3.00, even though
the multiplication operation requires 5 clock cycles.
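
One way to reason about parts B through D is to model only the data dependencies of the loop and ignore resource limits. The sketch below is an illustrative model, not the book's solution: it assumes unlimited, fully pipelined functional units, and the function name steady_state_cpe and its parameters are made up for this example. Each iteration loads and multiplies using only i, so those operations can start as soon as i is available. The two values carried from one iteration to the next are sum (through the add) and i (through addq, taken here to have latency 1).

```c
/*
 * Dataflow-only model of the inner4 loop (an assumption-laden sketch).
 *   load_lat: cycles for a load to deliver its value
 *   mul_lat : cycles for the multiply
 *   add_lat : cycles for the add that updates sum
 *   n       : number of simulated iterations, n >= 2
 * Returns the cycles per element over the second half of the run, so
 * the one-time startup delay of the first load and multiply is excluded.
 */
double steady_state_cpe(int load_lat, int mul_lat, int add_lat, long n)
{
    long i_ready = 0;        /* cycle at which i is available */
    long sum_ready = 0;      /* cycle at which the running sum is available */
    long sum_ready_mid = 0;  /* sum_ready after the first n/2 iterations */
    long mid = n / 2;

    for (long k = 0; k < n; k++) {
        /* udata[i] * vdata[i] depends only on i, not on sum. */
        long prod_ready = i_ready + load_lat + mul_lat;

        /* The add needs both the product and the previous sum. */
        long add_start = prod_ready > sum_ready ? prod_ready : sum_ready;
        sum_ready = add_start + add_lat;

        if (k == mid - 1)
            sum_ready_mid = sum_ready;

        /* addq $1, %rcx: assumed latency of one cycle. */
        i_ready += 1;
    }
    return (double)(sum_ready - sum_ready_mid) / (double)(n - mid);
}
```

With assumed latencies of 4 cycles for a load, 5 for floating-point multiply, and 3 for floating-point add, steady_state_cpe(4, 5, 3, 1000) evaluates to 3.0. With 3 for integer multiply and 1 for integer add, steady_state_cpe(4, 3, 1, 1000) evaluates to 1.0. These latency values are recalled from Figure 5.12 and its surrounding text, which this page does not reproduce, so check them against the figure. In this model the multiply latency never enters the steady-state result: the multiply is not part of a chain that feeds the next iteration, so successive multiplies overlap in the pipeline and only the add that updates sum limits the rate.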

Suppose we wish to write a procedure that computes the inner product of two
vectors u and v. An abstract version of the function has a CPE of 14–18 with
x86-64 for different types of integer and floating-point data. By doing the
same sort of transformations we did to transform the abstract program combine1
into the more efficient combine4, we get the following code:
/* Inner product. Accumulate in temporary */
void inner4(vec_ptr u, vec_ptr v, data_t *dest)
{
    long i;
    long length = vec_length(u);
    data_t *udata = get_vec_start(u);
    data_t *vdata = get_vec_start(v);
    data_t sum = (data_t) 0;

    for (i = 0; i < length; i++) {
        sum = sum + udata[i] * vdata[i];
    }
    *dest = sum;
}
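
To try inner4 outside the book's framework, a self-contained version can be sketched as follows. The question does not show the vector type, so the vec_rec layout and the helpers new_vec, free_vec, and demo_inner_product are assumptions made for illustration. Only inner4 itself follows the code in the question.

```c
#include <stdlib.h>

typedef double data_t;  /* the question uses data_t = double */

/* Hypothetical minimal vector type; the real definition is not shown. */
typedef struct {
    long len;
    data_t *data;
} vec_rec, *vec_ptr;

/* Allocate a zero-filled vector of len elements (illustrative helper). */
vec_ptr new_vec(long len)
{
    vec_ptr v = malloc(sizeof(vec_rec));
    if (!v)
        return NULL;
    v->len = len;
    v->data = calloc(len > 0 ? (size_t)len : 1, sizeof(data_t));
    if (!v->data) {
        free(v);
        return NULL;
    }
    return v;
}

void free_vec(vec_ptr v)
{
    if (v) {
        free(v->data);
        free(v);
    }
}

data_t *get_vec_start(vec_ptr v) { return v->data; }
long vec_length(vec_ptr v) { return v->len; }

/* Inner product. Accumulate in temporary */
void inner4(vec_ptr u, vec_ptr v, data_t *dest)
{
    long i;
    long length = vec_length(u);
    data_t *udata = get_vec_start(u);
    data_t *vdata = get_vec_start(v);
    data_t sum = (data_t) 0;

    for (i = 0; i < length; i++) {
        sum = sum + udata[i] * vdata[i];
    }
    *dest = sum;
}

/* Builds u = (1, 2, 3) and v = (4, 5, 6) and returns their inner product. */
data_t demo_inner_product(void)
{
    data_t result = 0;
    vec_ptr u = new_vec(3);
    vec_ptr v = new_vec(3);
    if (u && v) {
        for (long i = 0; i < 3; i++) {
            get_vec_start(u)[i] = (data_t)(i + 1);
            get_vec_start(v)[i] = (data_t)(i + 4);
        }
        inner4(u, v, &result);
    }
    free_vec(u);
    free_vec(v);
    return result;
}
```

Calling demo_inner_product() returns 32.0, since 1*4 + 2*5 + 3*6 = 32. The helpers are not declared static so that the file compiles cleanly whether or not every helper is called.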
Our measurements show that this function has CPEs of 1.50 for integer data
and 3.00 for floating-point data. For data type double, the x86-64 assembly code
for the inner loop is the listing given above with parts A through D.
