

Calculate the number of operations performed by this algorithm:

#include <string.h>
#include <stdlib.h>
#include <stdio.h>
#include <omp.h>

const char* dgemv_desc = "OpenMP dgemv.";

/*
 * This routine performs a dgemv operation
 * Y :=  A * X + Y
 * where A is an n-by-n matrix stored in row-major format, and X and Y are n-by-1 vectors.
 * On exit, A and X maintain their input values.
 */

void my_dgemv(int n, double* A, double* x, double* y) {

   #pragma omp parallel
   {
      int nthreads = omp_get_num_threads();
      int thread_id = omp_get_thread_num();
      printf("Hello world: thread %d of %d checking in. \n", thread_id, nthreads);
   }

   // insert your dgemv code here. you may need to create additional parallel regions,
   // and you may want to comment out the above parallel code block that prints out
   // nthreads and thread_id so as to not taint your timings

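   // Basic matrix-vector multiply, parallelized over rows: each thread owns
   // distinct y[i] entries, so no two threads write the same element.
   #pragma omp parallel for
   for (int i = 0; i < n; i++) {
      // Accumulate the row's dot product in a local variable. A nested
      // "parallel for" on this inner loop would race on y[i] when nested
      // parallelism is enabled, and only adds overhead when it is not.
      double sum = y[i];
      for (int j = 0; j < n; j++) {
         sum += A[i * n + j] * x[j];   // one multiply and one add per (i, j)
      }
      y[i] = sum;
   }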

}
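
One way to check the count is to instrument a serial copy of the loop nest. The inner statement runs once for every (i, j) pair, which is n * n times, and each run does one floating-point multiply and one floating-point add. That gives n^2 multiplies plus n^2 adds, or 2n^2 floating-point operations in total (index arithmetic such as i * n + j is not counted here). Splitting the outer loop across p threads does not change this total; each thread simply performs roughly 2n^2 / p of it. The sketch below is an illustration under those assumptions, not part of the original assignment: the names op_count and dgemv_counted are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper type (not in the original post): operation tallies. */
typedef struct {
    long long multiplies;
    long long additions;
} op_count;

/*
 * Serial, instrumented copy of the loop nest in my_dgemv.
 * Computes y := A*x + y and tallies the floating-point operations
 * performed on the matrix and vector data.
 */
op_count dgemv_counted(int n, const double* A, const double* x, double* y) {
    assert(n >= 0);
    op_count ops = {0, 0};
    for (int i = 0; i < n; i++) {
        for (int j = 0; j < n; j++) {
            y[i] += A[(size_t)i * n + j] * x[j];
            ops.multiplies++;  /* A[i*n+j] * x[j] */
            ops.additions++;   /* y[i] += ...     */
        }
    }
    return ops;
}
```

Calling dgemv_counted with n = 3 should report 9 multiplies and 9 additions, matching 2 * 3^2 = 18 operations.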

