Finally, we will do some simple analysis – ranking of words. Write a function word_ranking(corpus, n, sort_index) which takes input parameters corpus (type list), n (type int), and sort_index (type int) and returns a list containing tuples of words and their frequencies in the top n of words. Here, corpus is a list of sentences. n is the number of ranked words to return. The resulting list should then be sorted based on the sort_index; sort_index = 0 should sort the list of tuples in ascending alphabetical order, while sort_index = 1 should sort the list of tuples by the frequencies in descending order. All other numeric values for sort_index should return an empty list (don't need to consider non-integer inputs). For example, we have: corpus = ['hi hi hi hi', 'hello hello say bye', 'bye bye'] n = 3 sort_index = 1 The function should return: [('hi', 4), ('bye', 3), ('hello', 2)] While sort_index = 0 should return: [('bye', 3), ('hello', 2), ('hi', 4)]
Finally, we will do some simple analysis – ranking of words. Write a function word_ranking(corpus, n, sort_index) which takes input parameters corpus (type list), n (type int), and sort_index (type int) and returns a list containing tuples of words and their frequencies in the top n of words. Here, corpus is a list of sentences. n is the number of ranked words to return. The resulting list should then be sorted based on the sort_index; sort_index = 0 should sort the list of tuples in ascending alphabetical order, while sort_index = 1 should sort the list of tuples by the frequencies in descending order. All other numeric values for sort_index should return an empty list (don't need to consider non-integer inputs). For example, we have:
corpus = ['hi hi hi hi', 'hello hello say bye', 'bye bye']
n = 3
sort_index = 1
The function should return:
[('hi', 4), ('bye', 3), ('hello', 2)]
While sort_index = 0 should return:
You can assume n would be selected such that exactly n ranked words will be matched.
Step by step
Solved in 2 steps with 3 images