Description
Geosocial Networks such as Yelp extend review-centered sites such as Amazon or TripAdvisor with social and spatial dimensions. Users are encouraged to review visited venues.With tens of millions of reviews and tens of millions of monthly unique visitors, review based geosocial networks are playing an increasingly influential part in our lives. Their popularity and impact makes malicious behavior, in the form of fake reviews, an essential threat to their credibility and thus to the fabric of our society. |
![]() |
|
While the impact of the occasional malicious review is likely to be minimal among large number of reviews, the focus of this project is on review campaigns, concerted efforts to bias public opinion campaigns: entities that hire groups of people to write fake reviews and dishonestly improve or damage the ratings of target venues.
To address this problem, we propose WatchYT(Watch Yelp Timeline), a system that exploits Yelp's unique combination of location and social components to detect review campaigns. WatchYT is a robust autonomous monitoring system that combines several techniques to enable Yelp users to identify venues that are likely to have been targets of review campaigns as well as participants in fake review campaigns.
WatchYT acts in 3 steps.
|
Our latest work focuses on a unique Yelp mechanism: Yelp events. Yelp Elite events are hosted by venues chosen by Yelp and attended only by Elite reviewers. We will post more details soon. Please check back!
![]() WatchYT’s plugin functionality for Ike's Place
|
![]() Fake Miami Venue: Bob's Tacos
|
![]() Distribution of venues >4 reviews in Miami,FL
|
We have implemented WatchYT as two components. First, on the server side,a web server and on the client side,a browser extension running in the user’s browser. We implemented the web server using Apache Tomcat 6.0.35. We implemented the browser extension for the Chrome browser using HTML,CSS and Javascript.
We have developed Python web scrapers to automatically collect data from Yelp user and venue accounts. We have collected freely available proxies on the Internet and used them to request data from Yelp. To prevent having these proxies blocked by Yelp, we have changed them frequently and we have performed “polite” requests, with long inter-request intervals.