(d) Supposing that numbers are truncated (rounded down), what is the maximum absolute rounding error in bfloat16?

Question

Please answer part D only

4.
The bfloat16 ("brain floating point") format is a 16-bit format used in
Google's machine learning and AI software. It is a binary floating-point format
that is very similar to the single-precision IEEE-754 format: 1 bit is allocated for
the sign and 8 bits for the exponent with a bias of 127, but only 7 bits are allocated for
the fraction (the exponent is always chosen so that the first digit of the mantissa is
1, and then only the fraction is stored in memory).
(a) What is the approximate decimal precision of a brain floating point?
(b) If the bits are stored in the order: sign, exponent, fraction, and 0 corresponds to
a positive sign, then calculate the decimal representation of the number stored
as
1 00000110 0100010
(c) Given that the largest exponent actually used for numbers is 11111110, what
is the largest number that can be expressed as a bfloat16?
(d) Supposing that numbers are truncated (rounded down), what is the maximum
absolute rounding error in bfloat16?
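
The format described in the question can be explored with a short Python sketch. It is only an illustration under stated assumptions, not the expert solution: the names `decode_bfloat16` and `spacing` are hypothetical helpers, only normalised numbers (stored exponent 1 to 254) are handled, and the bit pattern in (b) is read as `1 00000110 0100010`, i.e. the stray 7-bit group `0100010` is taken to be the fraction field. For (d), truncation discards everything below the last stored fraction bit, so the error is always strictly smaller than the gap between adjacent representable values; the sketch reports that gap at the largest exponent as the bound.

```python
import math

BIAS = 127          # exponent bias given in the question
FRACTION_BITS = 7   # stored fraction bits in bfloat16


def decode_bfloat16(bits: str) -> float:
    """Decode a normalised bfloat16 bit string laid out as sign, exponent, fraction."""
    bits = bits.replace(" ", "")
    if len(bits) != 16 or set(bits) - {"0", "1"}:
        raise ValueError("expected 16 binary digits")
    sign = -1.0 if bits[0] == "1" else 1.0
    exponent = int(bits[1:9], 2)
    fraction = int(bits[9:], 2)
    if not 1 <= exponent <= 254:
        # Assumption: zeros, subnormals, infinities and NaNs are out of scope here.
        raise ValueError("only normalised numbers are handled in this sketch")
    # Implicit leading 1, followed by the 7 stored fraction bits.
    mantissa = 1.0 + fraction / 2**FRACTION_BITS
    return sign * math.ldexp(mantissa, exponent - BIAS)


def spacing(exponent_field: int) -> float:
    """Gap between adjacent bfloat16 values that share the given stored exponent."""
    return math.ldexp(1.0, exponent_field - BIAS - FRACTION_BITS)


# (a) 8 significant bits (1 implicit + 7 stored) expressed in decimal digits.
decimal_digits = (FRACTION_BITS + 1) * math.log10(2)

# (b) Bit pattern as assumed in the lead-in above.
part_b = decode_bfloat16("1 00000110 0100010")

# (c) Largest exponent field 11111110 with all fraction bits set.
largest = decode_bfloat16("0 11111110 1111111")

# (d) Truncation error stays below one gap; the gap is widest at the top exponent.
error_bound = spacing(0b11111110)

print(decimal_digits, part_b, largest, error_bound)
```

Under these assumptions the pattern in (b) decodes to -1.265625 × 2^-121, the largest value in (c) is (2 - 2^-7) × 2^127, and the bound in (d) is 2^(127-7) = 2^120. Truncation never quite reaches that bound, so it is a least upper bound rather than an error that actually occurs.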