### THE DATA mun_dict = { '@CityofCTAlerts' : 'Cape Town', '@CityPowerJhb' : 'Johannesburg', '@eThekwiniM' : 'eThekwini' , '@EMMInfo' : 'Ekurhuleni', '@centlecutility' : 'Mangaung', '@NMBmunicipality' : 'Nelson Mandela Bay', '@CityTshwane' : 'Tshwane' } twitter_url = 'https://raw.githubusercontent.com/Explore-AI/Public-Data/master/Data/twitter_nov_2019.csv' twitter_df = pd.read_csv(twitter_url) twitter_df.head() ### QUESTION Municipality & Hashtag Detector Write a function which takes in a pandas dataframe and returns a modified dataframe that includes two new columns that contain information about the municipality and hashtag of the tweet. Function Specifications: Function should take a pandas dataframe as input. Extract the municipality from a tweet using the mun_dict dictonary given at the start of the notebook and insert the result into a new column named 'municipality' in the same dataframe. Use the entry np.nan when a municipality is not found. Extract a list of hashtags from a tweet into a new column named 'hashtags' in the same dataframe. Use the entry np.nan when no hashtags are found. Hint: you will need to mun_dict variable defined at the top of this notebook.
### THE DATA
mun_dict = {
'@CityofCTAlerts' : 'Cape Town',
'@CityPowerJhb' : 'Johannesburg',
'@eThekwiniM' : 'eThekwini' ,
'@EMMInfo' : 'Ekurhuleni',
'@centlecutility' : 'Mangaung',
'@NMBmunicipality' : 'Nelson Mandela Bay',
'@CityTshwane' : 'Tshwane'
}
twitter_url = 'https://raw.githubusercontent.com/Explore-
twitter_df = pd.read_csv(twitter_url)
twitter_df.head()
### QUESTION
Municipality & Hashtag Detector
Write a function which takes in a pandas dataframe and returns a modified dataframe that includes two new columns that contain information about the municipality and hashtag of the tweet.
Function Specifications:
- Function should take a pandas dataframe as input.
- Extract the municipality from a tweet using the mun_dict dictonary given at the start of the notebook and insert the result into a new column named 'municipality' in the same dataframe.
- Use the entry np.nan when a municipality is not found.
- Extract a list of hashtags from a tweet into a new column named 'hashtags' in the same dataframe.
- Use the entry np.nan when no hashtags are found.
Hint: you will need to mun_dict variable defined at the top of this notebook.
Step by step
Solved in 3 steps with 2 images