Shilling Attacks on Recommender Systems

Description

In this episode of Data Skeptic's Recommender Systems series, Kyle sits down with Aditya Chichani, a senior machine learning engineer at Walmart, to explore the darker side of recommendation algorithms. The conversation centers on shilling attacks—a form of manipulation where malicious actors create multiple fake profiles to game recommender systems, either to promote specific items or sabotage competitors. Aditya, who researched these attacks during his undergraduate studies at SPIT before completing his master's in computer science with a data science specialization at UC Berkeley, explains how these vulnerabilities emerge particularly in collaborative filtering systems. From promoting a friend's ska band on Spotify to inflating product ratings on e-commerce platforms, shilling attacks represent a significant threat in an industry where approximately 4% of reviews are fake, translating to $800 billion in annual sales in the US alone. The discussion delves deep into collaborative filtering, explaining both user-user and item-item approaches that create similarity matrices to predict user preferences. However, these systems face various shilling attacks of increasing sophistication: random attacks use minimal information with average ratings, while segmented attacks strategically target popular items (like Taylor Swift albums) to build credibility before promoting target items. Bandwagon attacks focus on highly popular items to connect with genuine users, and average attacks leverage item rating knowledge to appear authentic. User-user collaborative filtering proves particularly vulnerable, requiring as few as 500 fake profiles to impact recommendations, while item-item filtering demands significantly more resources. Aditya addresses detection through machine learning techniques that analyze behavioral patterns using methods like PCA to identify profiles with unusually high correlation and suspicious rating consistency. However, this remains an evolving challenge as attackers adapt strategies, now using large language models to generate more authentic-seeming fake reviews. His research with the MovieLens dataset tested detection algorithms against synthetic attacks, highlighting how these concerns extend to modern e-commerce systems. While companies rarely share attack and detection data publicly to avoid giving attackers advantages, academic research continues advancing both offensive and defensive strategies in recommender systems security.

Audio

Featured in this Episode

No persons identified in this episode.

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes

🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Other recent transcribed episodes

Transcribed and ready to explore now

NPR News: 12-08-2025 2AM EST

08 Dec 2025

NPR News Now

NPR News: 12-07-2025 11PM EST

08 Dec 2025

NPR News Now

NPR News: 12-07-2025 10PM EST

08 Dec 2025

NPR News Now

Meidas Health: AAP President Strongly Pushes Back on Hepatitis B Vaccine Changes

08 Dec 2025

The MeidasTouch Podcast

Democrat Bobby Cole Discusses Race for Texas Governor

07 Dec 2025

The MeidasTouch Podcast

Fox News Crashes Out on Air Over Trump’s Rapid Fall

07 Dec 2025

The MeidasTouch Podcast

Comments

There are no comments yet.

Please log in to write the first comment.

Data Skeptic

This episode hasn't been transcribed yet

Other recent transcribed episodes

NPR News: 12-08-2025 2AM EST

NPR News: 12-07-2025 11PM EST

NPR News: 12-07-2025 10PM EST

Meidas Health: AAP President Strongly Pushes Back on Hepatitis B Vaccine Changes

Democrat Bobby Cole Discusses Race for Texas Governor

Fox News Crashes Out on Air Over Trump’s Rapid Fall

Login Required

Share this moment