1) Identify which type of database/data processing system you would choose (Key-Value store, Column-oriented store, Document-oriented store, Graph database, Relational database, Streaming engine) in each scenario below. a) Highly structured multi-table data that requires enforcing data constraints. b) Stock market data ticker with decisions that must be made in real time. c) LinkedIn type data with interconnected nodes where much of the information resides in the links between nodes. d) An image storage system that allows looking up images by file name. e) A collection of JSON objects (e.g., tweets). f) Data that is stored in large sparse tables that are continuously growing (new rows/columns).

Question
1) Identify which type of database/data processing system you would choose (Key-Value store,
Column-oriented store, Document-oriented store, Graph database, Relational database,
Streaming engine) in each scenario below.
a) Highly structured multi-table data that requires enforcing data constraints.
b) Stock market data ticker with decisions that must be made in real time.
c) LinkedIn type data with interconnected nodes where much of the information resides in
the links between nodes.
d) An image storage system that allows looking up images by file name.
e) A collection of JSON objects (e.g., tweets).
f) Data that is stored in large sparse tables that are continuously growing (new
rows/columns).
2)
a) Consider the following graph:
[Figure: graph not recovered from the source image; the only surviving node label is B]
Compute the PageRank for the nodes in this graph. If you are multiplying matrices manually,
you may stop computing after 6 steps. If you use a tool (e.g., MATLAB, Python) for matrix
multiplication, you should iterate until your answer converges.
b) Now consider a graph with dead-end nodes Q and Z:
[Figure: graph not recovered from the source image; the surviving node labels are X and Z]
What is the PageRank of Q?
What is the PageRank of Z?
c) Exercise 5.1.6 from Mining of Massive Datasets
...
Figure 5.9: A chain of dead ends
Exercise 5.1.6: Suppose we recursively eliminate dead ends from the graph,
solve the remaining graph, and estimate the PageRank for the dead-end pages
as described in Section 5.1.4. Suppose the graph is a chain of dead ends, headed
by a node with a self-loop, as suggested in Fig. 5.9. What would be the
PageRank assigned to each of the nodes?
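The PageRank parts of question 2 can be explored with a short script. The sketch below is only an illustration in plain Python: the figures for parts a) and b) did not survive, so `example_links` is a hypothetical four-node graph rather than the graph from the assignment, and the names `pagerank`, `example_links`, and `chain_estimates` are made up here. `pagerank` runs power iteration, with an optional taxation parameter `beta` (1.0 means no taxation). `chain_estimates` applies the dead-end estimation rule for a chain, assuming the head node has exactly two out-links (its self-loop and the link into the chain).

```python
def pagerank(links, beta=1.0, tol=1e-12, max_iter=1000):
    """Power iteration: v' = beta * M v + (1 - beta) / n.

    `links` maps each node to the list of nodes it links to.
    A node with an empty list is a dead end, so its rank leaks out.
    """
    nodes = sorted(links)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(max_iter):
        new_rank = {node: (1.0 - beta) / n for node in nodes}
        for src, targets in links.items():
            for dst in targets:
                # Each node splits its rank evenly among its out-links.
                new_rank[dst] += beta * rank[src] / len(targets)
        delta = sum(abs(new_rank[node] - rank[node]) for node in nodes)
        rank = new_rank
        if delta < tol:
            break
    return rank


# Hypothetical graph, NOT the one from the lost figure.
example_links = {
    "A": ["B", "C", "D"],
    "B": ["A", "D"],
    "C": ["A"],
    "D": ["B", "C"],
}


def chain_estimates(chain_length):
    """Estimated ranks of the dead-end chain nodes behind a self-loop head.

    Assumption: after recursive dead-end removal only the head remains,
    so its rank is 1, and it has two out-links in the full graph.
    Each restored node gets its predecessor's rank divided by the
    predecessor's out-degree.
    """
    estimates = []
    prev_rank, prev_out_degree = 1.0, 2
    for _ in range(chain_length):
        current = prev_rank / prev_out_degree
        estimates.append(current)
        # Every interior chain node has exactly one out-link.
        prev_rank, prev_out_degree = current, 1
    return estimates
```

Calling `pagerank(example_links)` on this hypothetical graph converges to about 3/9 for A and 2/9 for each of B, C, and D. To try the actual assignment graphs, replace `example_links` with the edges read off the original figures.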
3) Given the input data [(1pm, $8), (2pm, $16), (3pm, $17), (4pm, $26), (5pm, $12), (6pm,
$22), (7pm, $23), (8pm, $22), (9pm, $24), (10pm, $28), (11pm, $26), (12am, $30)].
a) What will the Hive query "compute average price" return? (yes, this question is as
obvious as it seems, it is asked for comparison with part-b and part-c)
b) What will a Storm streaming query "compute average price per each 3 hour window"
return? (tumbling, i.e., non-overlapping window of tuples). For example, the first window
would be 1pm-4pm. The second window would be 4pm-7pm. If you are wondering about
overlap, I recommend defaulting to [1pm-4pm), [4pm-7pm) (i.e., including the start but not
the end of the time range).
c) What will a Storm query "compute average price per each 3 hour window" return?
(sliding, i.e. overlapping window of tuples, moving the window forward 2 hours each
time). First window is 1pm-4pm, second window is 3pm-6pm and so on.
NOTE: when Storm does not have a full window, you cannot output anything until the
window fills with data.
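The three parts of question 3 differ only in how the tuples are grouped before averaging. The sketch below models that grouping in plain Python; it does not use Hive or Storm, and the names `prices`, `batch_average`, and `windowed_averages` are invented for illustration. It assumes one tuple per hour, so a 3-hour half-open window holds exactly 3 tuples, and it emits nothing for a trailing window that is not full, following the NOTE above.

```python
# Input data from the question: one (time, price) tuple per hour.
readings = [
    ("1pm", 8), ("2pm", 16), ("3pm", 17), ("4pm", 26),
    ("5pm", 12), ("6pm", 22), ("7pm", 23), ("8pm", 22),
    ("9pm", 24), ("10pm", 28), ("11pm", 26), ("12am", 30),
]
prices = [price for _, price in readings]


def batch_average(values):
    """Part a): a single average over the whole data set."""
    return sum(values) / len(values)


def windowed_averages(values, size, slide):
    """Averages over windows of `size` tuples, advancing `slide` tuples.

    slide == size gives tumbling (non-overlapping) windows;
    slide < size gives sliding (overlapping) windows.
    Incomplete trailing windows are skipped.
    """
    averages = []
    for start in range(0, len(values) - size + 1, slide):
        window = values[start:start + size]
        averages.append(sum(window) / size)
    return averages


overall = batch_average(prices)                        # part a)
tumbling = windowed_averages(prices, size=3, slide=3)  # part b)
sliding = windowed_averages(prices, size=3, slide=2)   # part c)
```

Under these assumptions, `overall` is 254/12 (about 21.17), `tumbling` is about [13.67, 20.0, 23.0, 28.0], and `sliding` is about [13.67, 18.33, 19.0, 23.0, 26.0]; the window starting at 11pm has only two tuples, so it produces no output. A different reading of the window boundaries (for example, including the end of the range) would change these numbers.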