F24_Proj4

The document outlines the requirements for a Movie Recommender System project due in Fall 2024, utilizing the MovieLens dataset with approximately 1 million ratings for 3,706 movies. Students must submit an HTML file containing code for a recommendation system based on popularity and item-based collaborative filtering (IBCF), along with a web link to their application. The project emphasizes the importance of defining popularity and implementing the IBCF algorithm without using existing recommender packages.

Uploaded by

wedxwe

Project 4: Movie Recommender System

Fall 2024

Contents
MovieLens Dataset
Submission Requirements
HTML File (4 points)
The App (3.5 points)
Resources

MovieLens Dataset

The dataset comprises approximately 1 million anonymous ratings for 3,706 movies, provided by 6,040
MovieLens users who joined the platform in 2000.
Some insights from our exploratory data analysis are available in [Rcode__W13__Movie__EDA.html] and
[Python__W13__Movie__RS.html].
You can download a copy of the 6040-by-3706 rating matrix in CSV format, complete with column names
(“m” + MovieID) and row names (“u” + UserID), from Coursera/Canvas.
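
As a rough illustration of the layout described above, the following Python sketch reads a rating matrix with pandas. The three-user excerpt and its exact CSV layout (an unnamed first column holding the row names, empty cells for missing ratings) are assumptions made for the example; the real file from Coursera/Canvas may differ in detail.

```python
import io
import pandas as pd

# Hypothetical 3-user, 4-movie excerpt in the layout described above:
# the header holds "m" + MovieID, the first column holds "u" + UserID,
# and empty cells are missing ratings.
toy_csv = io.StringIO(
    ",m1,m10,m100,m260\n"
    "u1,5,,,4\n"
    "u2,,3,,5\n"
    "u3,4,,2,\n"
)

# With the real file, pass its path instead of `toy_csv`.
R = pd.read_csv(toy_csv, index_col=0)

print(R.shape)                # (3, 4) here; 6040-by-3706 for the full matrix
print(R.notna().sum().sum())  # number of observed (non-NA) ratings
```

Missing ratings are read as NaN, which is the Python counterpart of the NA entries referred to throughout this document.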

Submission Requirements

To complete this assignment, please provide the following:

1. An R Markdown or Python Jupyter Notebook saved in HTML format, or a link to such a file. This
file should contain all the code necessary to reproduce the reported results. There is no page limit for
this part.

2. A web link to the movie recommendation application built by your team. You may share the source
code link or submit the code as a zip file on Coursera/Canvas.

Note that you may not use any recommender packages from R or Python. You are free to use other
packages as needed.

HTML File (4 points)

The HTML file should contain two key components:

System I: Recommendation Based on Popularity

Recommend the top ten most popular movies. Please clearly define what you mean by “most popular.” For
example, are you considering movies with a high number of reviews as popular, or are you using additional
criteria, such as counting only reviews above a specific threshold, or focusing on movies whose average or
median rating exceeds a certain threshold (i.e., excluding movies with a significant number of low ratings
from being classified as popular)?

There is no single correct answer. We are primarily interested in ensuring that your implementation aligns
consistently with your definition.

Provide the code for implementing your recommendation scheme and display the top ten movies,
including their MovieID (or “m” + MovieID), title, and poster images.
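
One possible definition, sketched in Python below, ranks movies by their number of ratings while keeping only those whose average rating reaches a threshold. The function name `popularity_ranking`, the threshold `min_mean=4.0`, and the toy matrix are all hypothetical choices for illustration; any definition is acceptable as long as the implementation matches it. Looking up titles and poster images is not covered here.

```python
import numpy as np
import pandas as pd

def popularity_ranking(R, min_mean=4.0):
    """Rank movies by number of ratings, keeping only movies whose
    average rating is at least `min_mean` (a hypothetical threshold)."""
    n_ratings = R.notna().sum(axis=0)   # ratings per movie
    mean_rating = R.mean(axis=0)        # NaN entries are skipped by default
    eligible = mean_rating >= min_mean  # all-NaN columns compare as False
    ranked = n_ratings[eligible].sort_values(ascending=False, kind="stable")
    return ranked.index.tolist()

# Toy rating matrix: rows are users, columns are movies, NaN = not rated.
R = pd.DataFrame(
    {
        "m1": [5, 4, 5, np.nan],            # 3 ratings, mean about 4.67
        "m2": [2, 1, 3, 2],                 # 4 ratings, mean 2.0 (excluded)
        "m3": [5, np.nan, 4, np.nan],       # 2 ratings, mean 4.5
        "m4": [np.nan, np.nan, 5, np.nan],  # 1 rating, mean 5.0
    },
    index=["u1", "u2", "u3", "u4"],
)

top = popularity_ranking(R)[:10]
print(top)  # ['m1', 'm3', 'm4']
```

Under this definition m2 is the most-rated movie but is excluded because its average rating falls below the threshold.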
System II: Recommendation Based on IBCF

For this system, follow these steps. Let R denote the 6040-by-3706 rating matrix.

1. Normalize the rating matrix by centering each row. This means subtracting row means from each row
of the rating matrix R. Row means should be computed based on non-NA entries. For instance, the
mean of a vector like (2, 4, NA, NA) should be 3.

2. Compute the (transformed) cosine similarity among the 3,706 movies. For movies i and j, let I_ij
denote the set of users who rated both movies i and j. We ignore similarities computed from fewer
than three user ratings. Thus, when the cardinality of I_ij is greater than two, define the similarity
between movie i and movie j as

$$S_{ij} = \frac{1}{2} + \frac{1}{2} \, \frac{\sum_{l \in I_{ij}} R_{li} R_{lj}}{\sqrt{\sum_{l \in I_{ij}} R_{li}^2} \, \sqrt{\sum_{l \in I_{ij}} R_{lj}^2}},$$

where R_li is the centered rating of movie i by user l from Step 1. The transformation (1 + cos)/2
ensures that similarity measures are between 0 and 1. NA values may occur when:

1) the set I_ij has a cardinality less than or equal to two (i.e., this pair of movies has been rated by
only zero, one, or two users);
2) one of the denominators is zero.

3. Let S denote the 3706-by-3706 similarity matrix computed in the previous step. For each row, sort the
non-NA similarity measures and keep the top 30, setting the rest to NA. This new similarity matrix,
still denoted as S, is no longer symmetric. Save this matrix. Note that some rows of S may
contain fewer than 30 non-NA values.

Display the pairwise similarity values from the S matrix (obtained in Step 3) for the following
specified movies: “m1”, “m10”, “m100”, “m1510”, “m260”, “m3212”. Please round the results to
7 decimal places.
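
Steps 1 to 3 can be sketched in Python with NumPy as follows. This is a minimal illustration on a toy matrix, not a reference solution: the function name `ibcf_similarity`, its parameters, and the choice to set the diagonal of S to NA (so a movie is never its own neighbor) are assumptions made for the example.

```python
import numpy as np

def ibcf_similarity(R, min_common=3, top_k=30):
    """Sketch of Steps 1-3 for a users-by-movies array R (NaN = not rated)."""
    # Step 1: center each row using the mean of its non-NaN entries.
    centered = R - np.nanmean(R, axis=1, keepdims=True)

    # Step 2: transformed cosine similarity over co-rating users.
    observed = ~np.isnan(centered)
    M = observed.astype(float)            # 1 where a rating exists
    X = np.where(observed, centered, 0.0)
    num = X.T @ X                         # sum over I_ij of R_li * R_lj
    sq_i = (X ** 2).T @ M                 # [i, j]: sum over I_ij of R_li^2
    sq_j = sq_i.T                         # [i, j]: sum over I_ij of R_lj^2
    n_common = M.T @ M                    # [i, j]: size of I_ij
    denom = np.sqrt(sq_i) * np.sqrt(sq_j)
    with np.errstate(divide="ignore", invalid="ignore"):
        S = 0.5 + 0.5 * num / denom
    S[(n_common < min_common) | (denom == 0)] = np.nan
    np.fill_diagonal(S, np.nan)           # assumption: no self-similarity

    # Step 3: keep the top_k largest non-NaN values in each row.
    for i in range(S.shape[0]):
        valid = np.flatnonzero(~np.isnan(S[i]))
        if valid.size > top_k:
            order = valid[np.argsort(S[i, valid], kind="stable")[::-1]]
            S[i, order[top_k:]] = np.nan
    return S

# Toy matrix: 4 users, 5 movies; every row mean over rated entries is 3.
R = np.array([
    [4, 2, 4, 2, 3],
    [2, 4, 2, 4, 3],
    [5, 1, 5, 1, np.nan],
    [1, 5, np.nan, np.nan, np.nan],
], dtype=float)

S = ibcf_similarity(R)
print(np.round(S, 7))
S_top1 = ibcf_similarity(R, top_k=1)      # keep a single neighbor per row
```

In this toy example movies 1 and 3 are rated identically by their three common users, so their similarity is 1 up to floating-point error, while movies 1 and 2 are rated in opposite directions and get similarity 0. The fifth movie shares only two raters with any other movie, so its entire row is NA.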
4. Create a function named myIBCF:
• Input: newuser, a 3706-by-1 vector (denoted as w) containing ratings for the 3,706 movies from a new
user. Many entries in this vector will be NA. Non-NA values should be integers 1, 2, 3, 4, or 5, as
ratings are based on a 5-star scale (whole star ratings only). The order of the movies in this vector
should match the rating matrix R. (Should we center w? For IBCF, centering the new user ratings is
not necessary.)

• Inside the function: Upon receiving this input, your function should load the similarity matrix and
use it to compute predictions for movies that have not been rated by this new user yet. Use the following
formula to compute the prediction for movie i:

$$\hat{w}_i = \frac{1}{\sum_{l \in S(i) \cap \{l : w_l \neq \mathrm{NA}\}} S_{il}} \sum_{l \in S(i) \cap \{l : w_l \neq \mathrm{NA}\}} S_{il} \, w_l,$$

where S(i) = {l : S_il ≠ NA}. The formula above is identical to the one on page 10 of
[lec__W13__RecommenderSystem.pdf], where the rating for the j-th movie for this new user is denoted
as w_j here but as r_aj in the lecture notes. Note that NA values may arise when the
denominator equals zero.

• Output: Based on your predictions, recommend the top ten movies to this new user, using the column
names of the rating matrix R (i.e., “m” + MovieID).

If fewer than 10 predictions are non-NA, select the remaining movies based on the popularity defined in
System I, prioritizing the most popular ones and excluding those already rated by the user. Save the
ranking of all movies (based on popularity) as a separate file to avoid recomputing the ranking each
time.
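
The prediction and fallback logic described above can be sketched in Python as follows. To keep the example self-contained, this version takes the similarity matrix, the column names, and the popularity ranking as arguments instead of loading them from saved files; the extra parameters (`S`, `movie_ids`, `popularity_order`, `n_rec`) and the toy data are assumptions made for illustration.

```python
import numpy as np

def myIBCF(newuser, S, movie_ids, popularity_order, n_rec=10):
    """newuser: 1-D array of ratings (NaN = unrated).
    S: truncated similarity matrix (NaN = not a neighbor).
    movie_ids: column names of R, in the same order as newuser.
    popularity_order: column indices, most popular first."""
    w = np.asarray(newuser, dtype=float)
    rated = ~np.isnan(w)

    # Similarities restricted to neighbors that the user has rated.
    W = np.where(np.isnan(S), 0.0, S)[:, rated]
    num = W @ w[rated]
    den = W.sum(axis=1)
    with np.errstate(divide="ignore", invalid="ignore"):
        pred = num / den
    pred[den == 0] = np.nan   # no rated neighbor: prediction is NA
    pred[rated] = np.nan      # only predict movies not yet rated

    # Highest non-NaN predictions first.
    candidates = np.flatnonzero(~np.isnan(pred))
    best = candidates[np.argsort(-pred[candidates], kind="stable")][:n_rec]
    recs = [movie_ids[i] for i in best]

    # Fill any remaining slots by popularity, skipping rated movies.
    for j in popularity_order:
        if len(recs) >= n_rec:
            break
        if not rated[j] and movie_ids[j] not in recs:
            recs.append(movie_ids[j])
    return recs

# Toy example with 4 movies; the user has rated m1 and m2.
movie_ids = ["m1", "m2", "m3", "m4"]
S = np.array([
    [np.nan, 0.9, np.nan, np.nan],
    [0.9, np.nan, np.nan, np.nan],
    [0.8, 0.2, np.nan, np.nan],        # m3: neighbors m1 and m2 (both rated)
    [np.nan, np.nan, 0.5, np.nan],     # m4: only neighbor m3 (unrated)
])
newuser = np.array([5, 3, np.nan, np.nan])
popularity_order = [0, 1, 3, 2]        # hypothetical ranking from System I

recs = myIBCF(newuser, S, movie_ids, popularity_order, n_rec=2)
print(recs)  # ['m3', 'm4']
```

Here m3 receives a prediction of (0.8 × 5 + 0.2 × 3) / (0.8 + 0.2) = 4.6, while m4 has no rated neighbor, so its prediction is NA and it enters the list only through the popularity fallback.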
Test your function

For your function myIBCF, print the top 10 recommendations for the following two users:
• User “u1181” from the rating matrix R
• A hypothetical user who rates movie “m1613” with 5 and movie “m1755” with 4.
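
As a small illustration, the input vector for the hypothetical user could be built in Python as shown below. The short `movie_ids` list is a stand-in for the 3,706 column names of R and is an assumption made for the example.

```python
import numpy as np
import pandas as pd

# Hypothetical subset of column names; the real R has 3,706 of them.
movie_ids = ["m1", "m1613", "m1755", "m260"]

# Start with every movie unrated (NaN), then fill in the two ratings.
newuser = pd.Series(np.nan, index=movie_ids)
newuser["m1613"] = 5
newuser["m1755"] = 4

w = newuser.to_numpy()   # vector in the same movie order as the columns of R
print(w)
```

For an existing user such as “u1181”, the corresponding row of the rating matrix can be used as the input vector directly.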

The App (3.5 points)

Build an application for System II. Here are the requirements:

• Present users with a set of sample movies and ask them to rate them.
• Use the ratings provided by the user as input for your myIBCF function.
• Display 10 movie recommendations returned by your myIBCF function.
To save space and/or memory, you can choose to display up to 100 movies. This means you can modify
your myIBCF function slightly so that it only requires 100 columns of the S matrix. You can decide which
100 movies to display. Mention these additional implementation details at the end of your HTML file if
applicable, but there’s no need to include the modified version of the myIBCF function in the HTML file.
We developed an imitation app inspired by a [book recommendation system]. While it appears to gather user
data, the 10 recommendations it provides are in fact fixed: it always shows the same 10 movies.
https://fengliang.shinyapps.io/MovieRecommend/

Resources

You are welcome to use any existing code, provided you cite the source. For example, you can check how two
packages implement IBCF:
• R code for package recommenderlab [https://github.com/mhahsler/recommenderlab/tree/master/R]
• Python code for package Surprise [https://github.com/NicolasHug/Surprise]

• The GitHub repository for the Book Recommender System mentioned above:
[https://github.com/pspachtholz/BookRecommender]

For the App, you can use Shiny if using R. Python users can consider frameworks such as [Shiny], [Dash],
[Flask], or [Streamlit].

