22 August 2014
  Welcome Guest
  Login | Help
General Information
Transaction Series
Related Information
Connect with WIT Press
Connect with WIT
Login ID:
Your Cart
There are 0 items in your cart. [View]

Adobe PDF Reader is required to view our papers:
Get Acrobat Reader

  Welcome to the WIT eLibrary

The home of the Transactions of the Wessex Institute collection, providing on-line access to papers presented at the Institute's prestigious international conferences and from its State-of-the-Art in Science & Engineering publications.

Paper Information

A data mining approach to analysis and prediction of movie ratings

Author(s): M. Saraee, S. White & J. Eccleston

This paper details our analysis of the Internet Movie Database (IMDb), a free, user-maintained, online resource of production details for over 390,000 movies, television series and video games, which contains information such as title, genre, box-office taking, cast credits and user's ratings. We gather a series of interesting facts and relationships using a variety of data mining techniques.

In particular, we concentrate on attributes relevant to the user ratings of movies, such as discovering if big-budget films are more popular than their low budget counterparts, if any relationship between movies produced during the "golden age" (i.e.

Citizen Kane, Itís A Wonderful Life, etc.) can be proved, and whether any particular actors or actresses are likely to help a movie to succeed.

The paper also reports on the techniques used, giving their implementation and usefulness. We have found that the IMDb is difficult to perform data mining upon, due to the format of the source data.

We also found some interesting facts, such as the budget of a film is no indication of how well-rated it will be, there is a downward trend in the quality of films over time, and the director and actors/actresses involved in a film are the most important factors to its success or lack thereof. The data used in this paper is not freely distributable, but remains copyright to the Internet Movie Database inc.

It is used here within the terms of their copying policy.

Further distribution of the source data used in this paper may be prohibited.

IMDb, Internet Movie Database, data mining, classification, movies, films....

Pages: 10
Size: 357 kb
Paper DOI: 10.2495/DATA040331



Download the Full Article

Price: US$ 0.00

This article is part of the WIT OpenView scheme and you can download the full text Adobe PDF article for FREE by clicking the 'Openview' icon below.


Send this page to a friend. Send this page to a colleague.

This paper can be found in the following book

Data Mining V

Data Mining V

Buy Book from

Download the Full Article

This article is part of the WIT OpenView scheme and you can download the full text Adobe PDF article for FREE by clicking the 'Openview' icon to the right.

Copyright© 2006 by WIT Press | About Prof Carlos Brebbia
Optimised for Microsoft Internet Explorer