Answered: Please experiment with the ALS…

Computer Networking: A Top-Down Approach (7th Edition)

7th Edition

ISBN:9780133594140

Author:James Kurose, Keith Ross

Publisher:James Kurose, Keith Ross

Chapter1: Computer Networks And The Internet

Section: Chapter Questions

Problem R1RQ: What is the difference between a host and an end system? List several different types of end...

See similar textbooks

Related questions

Question

PYTHON / DATABRICKS

Please experiment with the ALS algorithm, you can use the following notebook as starting template: https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/42740061589303/2161112564136160/8105921225255291/latest.html

1. Try to train and run the model on the 20 Million movie ratings dataset instead of the 1 Million one. Files:

/databricks-datasets/cs110x/ml-20m/data-001/movies.csv

/databricks-datasets/cs110x/ml-20m/data-001/ratings.csv

2. Test with various values of: ranks, regularization parameter, number of iterations.

Compare the models and find the best model based on the error value (i.e RMSE).

The documenation of the algorithm can be found at: https://spark.apache.org/docs/3.3.1/ml-collaborative-filtering.html

https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.ml.recommendation.ALS.html#pyspark.ml.recommendation.ALS

Prepare a table with at least 10 of your own ratings for the movies that you select and run the model with this data as input and show the top 20 movies that the model recommends for you to watch.

Expert Solution

Trending now

This is a popular solution!

Step by step

Solved in 2 steps

SEE SOLUTION Check out a sample Q&A here

Knowledge Booster

Learn more about

Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-engineering and related others by exploring similar questions and additional content below.

Similar questions