Question 3: Data for this question can be found in the "online-retail" sheet of assignment-data.xlsx. Since the data set is large, load the first 200 rows and keep them in a data frame called "dataset". This "dataset" is used for all tasks in this question.

Assume that you are a data scientist at Amazon. Since the company is celebrating its Silver Jubilee this year, it has decided to reward its customers. Your manager has handed over the last 2 years of retail data and asked you to do certain tasks. The tasks are as follows:

Task1-1: When you started working with the data, you realized that it needs cleaning to produce better results. Do essential data cleaning. The final output should be as follows:
before cleaning: any negatives?: True
after cleaning: any negatives?: False

Python code that can check the data if there are negative values and if none 

In [39]: Data1.head(200)
Out[39]:
     InvoiceNo  StockCode  Description                          Quantity  InvoiceDate       UnitPrice  CustomerID  Country
0    536365     85123A     WHITE HANGING HEART T-LIGHT HOLDER   6         01/12/2010 8:26   2.55       17850.0     United Kingdom
1    536365     71053      WHITE METAL LANTERN                  6         01/12/2010 8:26   3.39       17850.0     United Kingdom
2    536365     84406B     CREAM CUPID HEARTS COAT HANGER                 01/12/2010 8:26   2.75       17850.0     United Kingdom
3    536365     84029G     KNITTED UNION FLAG HOT WATER BOTTLE  6         01/12/2010 8:26   3.39       17850.0     United Kingdom
4    536365     84029E     RED WOOLLY HOTTIE WHITE HEART.       6         01/12/2010 8:26   3.39       17850.0     United Kingdom
...
195  536388     22469      HEART OF WICKER SMALL                12        01/12/2010 9:59   1.65       16250.0     United Kingdom
196  536388     22242      5 HOOK HANGER MAGIC TOADSTOOL        12        01/12/2010 9:59   1.65       16250.0     United Kingdom
197  536389     22941      CHRISTMAS LIGHTS 10 REINDEER         6         01/12/2010 10:03  8.50       12431.0     Australia
198  536389     21622      VINTAGE UNION JACK CUSHION COVER     8         01/12/2010 10:03  4.95       12431.0     Australia
199  536389     21791      VINTAGE HEADS AND TAILS CARD GAME    12        01/12/2010 10:03  1.25       12431.0     Australia
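One possible way to approach the task is sketched below, assuming pandas is available. The helper names (load_dataset, any_negatives, drop_negative_rows) and the default file path are hypothetical and not part of the assignment. "Essential cleaning" is read here as dropping every row that has a negative value in a numeric column, which is only one reasonable interpretation. The small in-memory frame stands in for the real sheet, and its second row is made up purely to trigger the check.

```python
import pandas as pd


def load_dataset(path="assignment-data.xlsx"):
    """Load the first 200 rows of the "online-retail" sheet (path is an assumption)."""
    # nrows=200 limits the read to the first 200 data rows after the header.
    return pd.read_excel(path, sheet_name="online-retail", nrows=200)


def any_negatives(df):
    """Return True if any numeric column holds a value below zero."""
    numeric = df.select_dtypes(include="number")
    return bool((numeric < 0).any().any())


def drop_negative_rows(df):
    """Keep only rows whose numeric values are all non-negative (NaN rows are kept)."""
    numeric = df.select_dtypes(include="number")
    has_negative = (numeric < 0).any(axis=1)
    return df.loc[~has_negative].reset_index(drop=True)


# Tiny stand-in for `dataset`; the second row is invented for illustration only.
dataset = pd.DataFrame({
    "InvoiceNo": ["536365", "C000000"],
    "Quantity": [6, -1],
    "UnitPrice": [2.55, 3.39],
})

print("before cleaning: any negatives?:", any_negatives(dataset))
dataset = drop_negative_rows(dataset)
print("after cleaning: any negatives?:", any_negatives(dataset))
```

With the real file, `dataset = load_dataset()` would replace the stand-in frame. Whether rows with negative values should be dropped, corrected, or handled separately depends on what the assignment counts as essential cleaning.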