n python: Question: Binning or discretization can reduce the risk of de-anonymization in a dataset. If the 'act_moving_seconds' is replaced with the nearest integer-valued minute, what is the modified number of unique identifiers present in this column? Steps: replace 'act_moving_seconds' with the nearest integer-valued minute. find the new/modified number of unique identifiers for this column ( same steps as Q.4 )

Computer Networking: A Top-Down Approach (7th Edition)
7th Edition
ISBN:9780133594140
Author:James Kurose, Keith Ross
Publisher:James Kurose, Keith Ross
Chapter1: Computer Networks And The Internet
Section: Chapter Questions
Problem R1RQ: What is the difference between a host and an end system? List several different types of end...
icon
Related questions
Question

In python:

Question:

Binning or discretization can reduce the risk of de-anonymization in a dataset.

If the 'act_moving_seconds' is replaced with the nearest integer-valued minute,
what is the modified number of unique identifiers present in this column?

Steps:

  • replace 'act_moving_seconds' with the nearest integer-valued minute.
  • find the new/modified number of unique identifiers for this column ( same steps as Q.4 )


To replace the values in a column, take a look at this following example (code example is in screen shot)

 

CSV file code: 

url = "https://raw.githubusercontent.com/nat-data/data/main/jeddah_strava_segments.csv"
df = pd.read_csv(url)
O ## Example for replacing values in a column
## In this example, we are replacing the user_ID with 42 x user_ID
## We don't want to disturb the original dataframe.
df_example = df.copy()
print(df_example.filter(['user_id']).head(5))
print("\n \n Example: replacing the user_ID with 10 x user_ID \n \n")
## Pay attention to this :
df_example['user_id'] = df_example.apply (lambda row: row['user_id']*42 , axis=1)
print(df_example.filter(['user_id']).head (5))
Transcribed Image Text:O ## Example for replacing values in a column ## In this example, we are replacing the user_ID with 42 x user_ID ## We don't want to disturb the original dataframe. df_example = df.copy() print(df_example.filter(['user_id']).head(5)) print("\n \n Example: replacing the user_ID with 10 x user_ID \n \n") ## Pay attention to this : df_example['user_id'] = df_example.apply (lambda row: row['user_id']*42 , axis=1) print(df_example.filter(['user_id']).head (5))
Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 2 steps with 1 images

Blurred answer
Similar questions
Recommended textbooks for you
Computer Networking: A Top-Down Approach (7th Edi…
Computer Networking: A Top-Down Approach (7th Edi…
Computer Engineering
ISBN:
9780133594140
Author:
James Kurose, Keith Ross
Publisher:
PEARSON
Computer Organization and Design MIPS Edition, Fi…
Computer Organization and Design MIPS Edition, Fi…
Computer Engineering
ISBN:
9780124077263
Author:
David A. Patterson, John L. Hennessy
Publisher:
Elsevier Science
Network+ Guide to Networks (MindTap Course List)
Network+ Guide to Networks (MindTap Course List)
Computer Engineering
ISBN:
9781337569330
Author:
Jill West, Tamara Dean, Jean Andrews
Publisher:
Cengage Learning
Concepts of Database Management
Concepts of Database Management
Computer Engineering
ISBN:
9781337093422
Author:
Joy L. Starks, Philip J. Pratt, Mary Z. Last
Publisher:
Cengage Learning
Prelude to Programming
Prelude to Programming
Computer Engineering
ISBN:
9780133750423
Author:
VENIT, Stewart
Publisher:
Pearson Education
Sc Business Data Communications and Networking, T…
Sc Business Data Communications and Networking, T…
Computer Engineering
ISBN:
9781119368830
Author:
FITZGERALD
Publisher:
WILEY