Implementation of Content-Based Cosine Similarity Algorithm with TF-IDF and SBERT for Movie Recommendation
Abstract
The number of films continues to increase on streaming platforms often makes users confused in deciding which film to watch. To overcome this research develops content-based movie recommendation system. Representation of the film information obtained through the application of TF-IDF and SBERT to genre and synopsis data. Cosine similarity is used to calculate the closeness between representations. The performance system is then evaluated through the Precision@K, MAP@K, and Recall@K metrics. From the test results, hybrid approach shows better performance more stable than single method. With a MAP value reaching 0.95 Recall 0.95 dan Precission 0.71 . In the future, the development system will still possible by utilizing other types of data, including user interaction data.
Downloads
References
V. Sandrya, W. Wasino, and D. Arisandi, “Sistem Rekomendasi Film Menggunakan Metode Multiple Attribute Utility Theory,” Computatio : Journal of Computer Science and Information Systems, vol. 6, no. 1, p. 19, Jun. 2022, doi: 10.24912/computatio.v6i1.17081.
A. D. Saputro and F. Amin, “Sistem Rekomendasi Content-Based Filtering Skincare Pria Di E-Commerce Shopee,” INTECOMS: Journal of Information Technology and Computer Science, vol. 7, no. 1, pp. 106–113, Jan. 2024, doi: 10.31539/intecoms.v7i1.8036.
N. Azizah and A. F. Rozi, “Sistem Rekomendasi Produk Somethinc Menggunakan Metode Content-based Filtering,” Jurnal Teknologi Dan Sistem Informasi Bisnis, vol. 6, no. 3, pp. 461–468, Jul. 2024, doi: 10.47233/jteksis.v6i3.1411.
Hanafi et al., “Improvement of E-commerce Recommender System Using Hybridization of Bert, Matrix Factorization and Attention Mechanism,” International Journal of Intelligent Engineering and Systems, vol. 17, no. 5, pp. 725–740, 2024, doi: 10.22266/ijies2024.1031.55.
H. Hartatik and A. Syafrianto, “Penerapan Model Sentence-bert Untuk Sistem Rekomendasi Buku Berbasis Konten Di Perpustakaan Digital,” Jurnal Dialektika Informatika (Detika), vol. 6, no. 1, pp. 12–19, Nov. 2025, doi: 10.24176/detika.v6i1.15916.
M. A. Hafizh Fathuddin 1, E. Prakarsa Mandyartha 2, and A. Lina Nurlaili 3, “Penerapan Sentence-Bert dan Cosine Similarity untuk Pencarian Semantik Dokumen Skripsi dalam Format PDF,” Ranah Research : Journal of Multidisciplinary Research and Development, vol. 8, no. 1, Oct. 2025, doi: 10.38035/rrj.v8i1.
M. Abdul, H. Fathuddin, E. Prakarsa Mandyartha, and A. L. Nurlaili, “Penerapan Sentence-Bert dan Cosine Similarity untuk Pencarian Semantik Dokumen Skripsi dalam Format PDF,” R2J, vol. 8, no. 1, 2025, doi: 10.38035/rrj.v8i1.
A. A. P. Yudha, Munir, and Ani Anisyah, “Perancangan Sistem Rekomendasi Akomodasi pada Event Konser dengan Metode Hybrid Filtering,” Jurnal Komputer Teknologi Informasi Sistem Informasi (JUKTISI), vol. 4, no. 2, pp. 631–641, Jul. 2025, doi: 10.62712/juktisi.v4i2.493.
A. Rizky Mangunsong, V. Sihombing, and I. Rasyid Munthe, “Pengembangan Sistem Rekomendasi Produk Berdasarkan Pola Pembelian dengan Pendekatan Algoritma Apriori,” Jurnal Ilmu Komputer dan Sistem Informasi (JIKOMSI), vol. 7, no. 1, pp. 82–86, Jan. 2024, doi: 10.55338/jikomsi.v7i1.2718.
T. Safitri, Y. Umaidah, and I. Maulana, “Analisis Sentimen Pengguna Twitter Terhadap Grup Musik BTS Menggunakan Algoritma Support Vector Machine,” Journal of Applied Informatics and Computing, vol. 7, no. 1, pp. 28–35, Jul. 2023, doi: 10.30871/jaic.v7i1.5039.
A. Rachmaniar1, S. Widayati2, and K. Rokoyah3, “Sistem Rekomendasi Produk E-commerce Menggunakan Collaborative Filtering Dan Content-based Filtering,” Journal of Information System, Informatics and Computing, vol. 9, no. 1, pp. 1–15, Jun. 2025, doi: 10.52362/jisicom.v9i1.1904.
R. M. Holis, P. E. P. Utomo, and B. F. Hutabarat, “Semantic FAQ Chatbot Using SBERT (Sentence-BERT) and Cosine Similarity for Academic Services,” Brilliance: Research of Artificial Intelligence, vol. 5, no. 2, pp. 915–922, Oct. 2025, doi: 10.47709/brilliance.v5i2.7027.
M. Y. Ridho and E. Yulianti, “From Text to Truth: Leveraging IndoBERT and Machine Learning Models for Hoax Detection in Indonesian News,” Jurnal Ilmiah Teknik Elektro Komputer dan Informatika, vol. 10, no. 3, pp. 544–555, Sep. 2024, doi: 10.26555/jiteki.v10i3.29450.
A. H. J. P. Juni Permana and Agung Toto Wibowo, “Movie Recommendation System Based on Synopsis Using Content-Based Filtering with TF-IDF and Cosine Similarity,” International Journal on Information and Communication Technology (IJoICT), vol. 9, no. 2, pp. 1–14, Dec. 2023, doi: 10.21108/ijoict.v9i2.747.
A. Febrian and E. D. Permana, “Sistem Rekomendasi Film Menggunakan Metode Content Based Filtering Dengan Algoritma Tf-idf,” Al-Aqlu: Jurnal Matematika, Teknik dan Sains, vol. 4, no. 1, pp. 19–25, Jan. 2026, doi: 10.59896/aqlu.v4i1.494.
A. Serlina, A. Rahim, and Arbansyah, “Comparative Analysis of Naïve Bayes Algorithm Performance in English and Indonesian Text Sentiment Classification on Duolingo Application in Playstore,” Teknika, vol. 14, no. 1, pp. 165–171, Mar. 2025, doi: 10.34148/teknika.v14i1.1207.
K. Peyton and S. Unnikrishnan, “A comparison of chatbot platforms with the state-of-the-art sentence BERT for answering online student FAQs,” Results in Engineering, vol. 17, p. 100856, Mar. 2023, doi: 10.1016/j.rineng.2022.100856.
P. Aprilio, M. Felix, P. S. Nugraha, and H. Fahmi, “Hybrid Feature Combination of TF-IDF and BERT for Enhanced Information Retrieval Accuracy,” JISA(Jurnal Informatika dan Sains), vol. 8, no. 1, pp. 8–15, Jun. 2025, doi: 10.31326/jisa.v8i1.2179.
A. Maitaigahasse, J. L. K. Ebongue Fendji, and M. Atemkeng, “Offline Content-based Recommendation System for Wikimedia Commons Contents,” Procedia Computer Science, vol. 257, pp. 485–494, 2025, doi: 10.1016/j.procs.2025.03.063.
D. Velamentosa and E. Zuliarso, “Sistem Rekomendasi Film Menggunakan Metode Content-based Filtering,” JATI (Jurnal Mahasiswa Teknik Informatika), vol. 9, no. 2, pp. 2918–2922, Mar. 2025, doi: 10.36040/jati.v9i2.13251.
D. R. P. Noordi, H. Hasanah, and S. Sumarlinda, “Marvel Movie Recommendation System Using Hybrid Item-Based and Content-Based Filtering Methods,” TIERS Information Technology Journal, vol. 5, no. 1, pp. 13–19, Jun. 2024, doi: 10.38043/tiers.v5i1.5209.
L. Palupi, E. Ihsanto, and F. Nugroho, “Analisis Validasi dan Evaluasi Model Deteksi Objek Varian Jahe Menggunakan Algoritma Yolov5,” Journal of Information System Research (JOSH), vol. 5, no. 1, pp. 234–241, Oct. 2023, doi: 10.47065/josh.v5i1.4380.
. Amdahl J. R. Sari, K. Sadik, A. M. Soleh, and C. Suhaeni, “Evaluasi Model Klasifikasi Dalam Deteksi Penipuan Transaksi: Studi Kasus Pada Data Tidak Seimbang,” Jurnal Gaussian, vol. 14, no. 2, pp. 565–576, Dec. 2025, doi: 10.14710/j.gauss.14.2.565-576.
D. Çelik Ertuğrul and S. Bitirim, “Job recommender systems: a systematic literature review, applications, open issues, and challenges,” Journal of Big Data, vol. 12, no. 1, Jun. 2025, doi: 10.1186/s40537-025-01173-y.
J. M. Azri Saputra, L. M. Huizen, and D. B. Arianto, “Sistem Rekomendasi Film pada Platform Streaming Menggunakan Metode Content-Based Filtering,” Jurnal Transformatika, vol. 22, no. 1, pp. 10–21, Jul. 2024, doi: 10.26623/transformatika.v22i1.7041.





