Please answer with detail **Problem 3 and 4 request you to resolve various problems using Hadoop.**To get full credit, please explicitly show all steps of converting data from raw data to a final output following the template:For SPLITTING: please split to 3 parts (Hint: Text file has three lines).For MAPPING and REDUCING: please explicitly show which data is key, which data is value.### Diagram ExplanationThe diagram illustrates the process of data transformation using Hadoop, which involves several stages:1. **Splitting:** - The raw data is divided into three parts. These sections represent different segments of the data to be processed.2. **Mapping:** - Each segment from the splitting step is processed individually. - The data is mapped into key-value pairs, where each key is associated with a corresponding value.3. **Shuffling:** - The key-value pairs are reorganized based on the key. This step groups all values associated with similar keys together to ensure efficient data processing.4. **Reducing:** - The shuffled data undergoes reduction, where operations are performed on the values to produce a condensed output. - Again, data is maintained in key-value pairs format.5. **Final Results:** - The reduced data is compiled into a final result set, representing the processed output.### ExampleSuppose we have the document **BigData.txt** below:```W M UM U A WM W C A```**Expected Result:**- W, 3- M, 3- U, 2- A, 2- C, 1This output implies that the letter 'W' appears 3 times, 'M' appears 3 times, 'U' appears 2 times, 'A' appears 2 times, and 'C' appears 1 time after processing through Hadoop. **Problem 4: Indicating the pairs in each phase of data processing in Hadoop**Please write each step in bullet points or by drawing diagrams to get the top 2 most frequent keywords in BigData.txt using Hadoop.

The solution is given in the next step

Answered: Problem 3 and 4 request you to resolve…

Database System Concepts

7th Edition

ISBN:9780078022159

Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan

Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan

Chapter1: Introduction

Section: Chapter Questions

Problem 1PE

See similar textbooks

Similar questions

By studying an item's bounding box in the Designer, you may get insight into the attributes of the object in a number of different ways.
Implementation difficulties for Dates and Times data types?

Recommended textbooks for you

Database System Concepts

Computer Science

ISBN:

9780078022159

Author:

Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan

Publisher:

McGraw-Hill Education

Starting Out with Python (4th Edition)

Computer Science

ISBN:

9780134444321

Author:

Tony Gaddis

Publisher:

PEARSON

Digital Fundamentals (11th Edition)

Computer Science

ISBN:

9780132737968

Author:

Thomas L. Floyd

Publisher:

PEARSON

Database System Concepts

Computer Science

ISBN:

9780078022159

Author:

Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan

Publisher:

McGraw-Hill Education

Starting Out with Python (4th Edition)

Computer Science

ISBN:

9780134444321

Author:

Tony Gaddis

Publisher:

PEARSON

Digital Fundamentals (11th Edition)

Computer Science

ISBN:

9780132737968

Author:

Thomas L. Floyd

Publisher:

PEARSON

C How to Program (8th Edition)

Computer Science

ISBN:

9780133976892

Author:

Paul J. Deitel, Harvey Deitel

Publisher:

PEARSON

Database Systems: Design, Implementation, & Manag…

Computer Science

ISBN:

9781337627900

Author:

Carlos Coronel, Steven Morris

Publisher:

Cengage Learning

Programmable Logic Controllers

Computer Science

ISBN:

9780073373843

Author:

Frank D. Petruzella

Publisher:

McGraw-Hill Education

SEE MORE TEXTBOOKS