# imports (the NLTK corpora 'stopwords' and 'wordnet' must already be downloaded)
import nltk
from nltk.stem import SnowballStemmer, WordNetLemmatizer
from nltk.tokenize import TreebankWordTokenizer
from nltk.corpus import stopwords

# define stemmer
stemmer = SnowballStemmer('english')

# tokenise data ('data' is the raw text loaded in an earlier cell)
tokeniser = TreebankWordTokenizer()
tokens = tokeniser.tokenize(data)

# define lemmatiser
lemmatizer = WordNetLemmatizer()
# bag of words
def bag_of_words_count(words, word_dict=None):
    """Take a list of words and return a dictionary with each word as a
    key and the number of times that word appeared as the value."""
    if word_dict is None:  # avoid the mutable-default-argument pitfall
        word_dict = {}
    for word in words:
        if word in word_dict:
            word_dict[word] += 1
        else:
            word_dict[word] = 1
    return word_dict
# remove stopwords
tokens_less_stopwords = [word for word in tokens if word not in stopwords.words('english')]
# create bag of words
bag_of_words = bag_of_words_count(tokens_less_stopwords)
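As a sanity check, the hand-rolled counter above behaves like `collections.Counter`. The toy token list below is made up for illustration; the function body mirrors the definition above (with a safe default argument):

```python
from collections import Counter

def bag_of_words_count(words, word_dict=None):
    # mirrors the definition above, with a None default instead of a shared dict
    if word_dict is None:
        word_dict = {}
    for word in words:
        word_dict[word] = word_dict.get(word, 0) + 1
    return word_dict

toy_tokens = ['natural', 'language', 'processing', 'language']
assert bag_of_words_count(toy_tokens) == dict(Counter(toy_tokens))
```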
Use the stemmer and lemmatizer functions (defined in the cells above) from the relevant library to find the stem and lemma of the nth word in the token list.
Function Specifications:
- Should take a list as input and return a dict type as output.
- The dictionary should have the keys 'original', 'stem' and 'lemma', with the corresponding values being the nth word transformed in that way.
How many stopwords are in the text in total?
Hint: you can use the NLTK stopwords list.
Function Specifications:
- Should take a list as input.
- The number of stopwords should be returned as an int.