# Internet Movie Database

(Redirected from IMDb)
URL IMDb homepage on August 21, 2012 imdb.com Yes Online database for movies, television, and video games Registration is optional for members to participate in discussions, comments, ratings, and voting. English, French, German, Spanish, Italian, Portuguese Amazon.com Col Needham October 17, 1990; 22 years ago 51 (April 2013)[1] Active

The Internet Movie Database (IMDb) is an online database of information related to films, television programs, and video games. This includes actors, production crew personnel, and fictional characters featured in these three visual entertainment media.

It is one of the most popular online entertainment destinations, with over 100 million unique users each month and a solid and rapidly growing mobile presence.[2] IMDb was launched on October 17, 1990, and in 1998 was acquired by Amazon.com. As of April 27, 2013, IMDb had 2,503,783 titles (includes episodes) and 5,192,493 personalities in its database,[3] as well as 43 million registered users. The website has an Alexa rank of 51.

## History

### History before website

IMDb originated with a Usenet posting by British film fan and professional computer programmer Col Needham entitled "Those Eyes", about actresses with beautiful eyes. Others with similar interests soon responded with additions or different lists of their own. Needham subsequently started a (male) "Actors List", while Dave Knight began a "Directors List", and Andy Krieg took over "THE LIST" from Hank Driskill, which would later be renamed the "Actress List". Both lists had been restricted to people who were alive and working, but soon retired people were added, so Needham started what was then (but did not remain) a separate "Dead Actors/Actresses List". The goal of the participants now was to make the lists as inclusive as possible. By late 1990, the lists included almost 10,000 movies and television series correlated with actors and actresses appearing therein.

On October 17, 1990, Needham developed and posted a collection of Unix shell scripts which could be used to search the four lists, and thus the database that would become the IMDb was born. At the time, it was known as the "rec.arts.movies movie database", but by 1993 had been moved out of the Usenet group as an independent website underwritten and controlled by Needham and personal followers. Other website users were invited to contribute data which they may have collected and verified, on a volunteer basis, which greatly increased the amount and types of data to be stored. Entire new sections were added. As the site grew hugely, full production crews, uncredited performers and other demographic data were added. Needham's group allowed some advertising to support ongoing operations of the site, including the hiring of full-time paid data managers. All the primary staff came (and still come) from the burgeoning computer industry and/or training schools and did not have extensive expertise in the visual media.[citation needed] In 1998, unable to secure sufficient funding from limited advertising, contributions and unable to raise support from the visual media industries or academia, Needham sold the IMDb to Amazon.com, on condition that its operation would remain in the hands of Needham and his small cadre of managers, who soon were able to move into full-time paid staff positions.

### On the web

The database had been expanded to include additional categories of filmmakers and other demographic material, as well as trivia, biographies, and plot summaries; the movie ratings had been properly integrated with the list data; and a centralized email interface for querying the database had been created by Alan Jay. Later in the year[when?] it moved onto the World Wide Web (a network in its infancy at that time) under the name of Cardiff Internet Movie Database.[4] The database resided on the servers of the computer science department of Cardiff University in Wales. Rob Hartill was the original web interface author. In 1994 the email interface was revised to accept the submission of all information, meaning that people no longer had to email the specific list maintainer with their updates. However, the structure remained that information received on a single film was divided among multiple section managers, the sections being defined and determined by categories of film personnel and the individual filmographies contained therein. Over the next few years, the database was run on a network of mirrors across the world with donated bandwidth.[citation needed]

The website is Perl-based.[5] As of May 2011, the site has been filtered in China for more than one year, although many users address it through proxy server or by VPN.[6]

On October 17, 2010, IMDb launched original video (www.imdb.com/20) in celebration of its 20th anniversary.[7]

### As an independent company

In 1996 IMDb was incorporated in the United Kingdom, becoming the Internet Movie Database Ltd. Founder Col Needham became the primary owner as well as the identified figurehead. General revenue for site operations was generated through advertising, licensing and partnerships.

### As Amazon.com subsidiary

In 1998, Jeff Bezos, founder, owner and CEO of Amazon.com, struck a deal with Col Needham and other principal shareholders to buy IMDb outright and attach it to Amazon as a subsidiary, private company.[8] This gave IMDb the ability to pay the shareholders salaries for their work, while Amazon.com would be able to use the IMDb as an advertising resource for selling DVDs and videotapes.

IMDb continued to expand its functionality. On January 15, 2002, it added a subscription service known as IMDbPro, aimed at entertainment professionals. IMDbPro was announced and launched at the 2002 Sundance Film Festival. It provides a variety of services including film production and box office details, as well as a company directory.

As an additional incentive for users, as of 2003, users identified as one of "the top 100 contributors" of hard data received complimentary free access to IMDbPro for the following calendar year; for 2006 this was increased to the top 150 contributors, and for 2010 to the top 250.[9] In 2008 IMDb launched their first official foreign language version with the German IMDb.de. Also in 2008, IMDb acquired two other companies, Withoutabox and Box Office Mojo.

## Television episodes

On January 26, 2006, "Full Episode Support" came online, allowing the database to support separate cast and crew listings for each episode of every television series. This was described by Col Needham as "the largest change we've ever made to our data model"[citation needed], and increased the number of titles in the database from 485,000 to nearly 755,000.[citation needed]

## Characters filmography

On October 2, 2007, the characters' filmography was added. Character entries are created from character listings in the main filmography database and as such don't need any additional verification by IMDb staff. They have already been verified when they are added to the main filmography.

## Instant viewing

On September 15, 2008, a feature was added that enables instant viewing of over 6,000 movies and television shows from CBS, Sony and a number of independent film makers, with direct links from their profiles.[10] Due to licensing restrictions, this feature is only available to viewers in the United States.[11]

## Ancillary features

### User ratings of films

As one adjunct to data, the IMDb offers a rating scale that allows users to rate films on a scale of one to ten. The rating system is recognized as being severely flawed for several reasons.[12][13]

#### Filters and weights

IMDb indicates that submitted ratings are filtered and weighted in various ways in order to produce a weighted mean that is displayed for each film, series, and so on. It states that filters are used to avoid ballot stuffing; the method is not described in detail to avoid attempts to circumvent it. In fact, it sometimes produces an extreme difference between the weighted average and the arithmetic mean. For example, Jonas Brothers: The 3D Concert Experience is considered to be the worst film with a weighted average of 1.3 as of March 2009, but has a rather ordinary arithmetic mean of 4.1.[14][15]

#### Ranking (IMDb Top 250)

The IMDb Top 250 is intended to be a listing of the top 'rated' 250 films, based on ratings by the registered users of the website using the methods described.[16] Only non-documentary theatrical releases running at least forty-five minutes with over 25,000 ratings are considered; all other products are ineligible.[17] Also, the 'top 250' rating is based on only the ratings of "regular voters". The exact number of votes a registered user would have to make to be considered to be a user who votes regularly has been kept secret. IMDb has stated that to maintain the effectiveness of the top 250 list they "deliberately do not disclose the criteria used for a person to be counted as a regular voter".[18] In addition to other weightings, the top 250 films are also based on a weighted rating formula referred to in actuarial science as a credibility formula.[19] This label arises because a statistic is taken to be more credible the greater the number of individual pieces of information; in this case from eligible users who submit ratings. IMDb uses the following formula to calculate the weighted rating:

$W = \frac{Rv + Cm}{v+m}$

where:

$W\$ = weighted rating
$R\$ = average for the movie as a number from 0 to 10 (mean) = (Rating)
$v\$ = number of votes for the movie = (votes)
$m\$ = minimum votes required to be listed in the Top 250 (currently 25,000)
$C\$ = the mean vote across the whole report (currently 7.1)

The $W\$ in this formula is equivalent to a Bayesian posterior mean (See Bayesian statistics).

The IMDb also has a Bottom 100 feature which is assembled through a similar process although only 1500 votes must be received to qualify for the list.[20]

The top 250 list comprises a wide range of films, including major releases, cult films, independent films, critically acclaimed films, silent films and non-English language films.

#### Top 20 of the 250

Rank Film Year
1. The Shawshank Redemption 1994
2. The Godfather 1972
3. The Godfather: Part Two 1974
4. Pulp Fiction 1994
5. The Good, the Bad and the Ugly 1966
6. 12 Angry Men 1957
7. The Dark Knight 2008
8. Schindler's List 1993
9. The Lord of the Rings: The Return of the King 2003
10. Fight Club 1999
11. Star Wars Episode V: The Empire Strikes Back 1980
12. The Lord of the Rings: The Fellowship of the Ring 2001
13. One Flew Over the Cuckoo's Nest 1975
14. Inception 2010
15. Goodfellas 1990
16. Star Wars Episode IV: A New Hope 1977
17. Seven Samurai 1954
18. Forrest Gump 1994
19. The Matrix 1999
20. The Lord of the Rings: The Two Towers 2002

### Message boards

One of the most used features of the Internet Movie Database is the message boards that coincide with every title (excepting, as of 2006, TV episodes[21]) and name entry, along with over 140 main boards. This section is one of the more recent features of IMDb, having its beginnings in 2001. In order to post on the message boards a user needs to "authenticate" their account via cell phone, credit card, or by having been a recent customer of the parent company Amazon.com. Message boards have expanded in recent years. The Soapbox started in 1999 is a general message board meant for debates on any subject. The Politics board started in 2007 is a message board to discuss politics, news events and current affairs as well as history and economics. Both these message boards have become the most popular message boards in IMDb, more popular on a long term basis than any individual movie message board.

## Content and format

### Data provided by subjects

In 2006, IMDb introduced its "Résumé subscription service", where actors and crew can post their own résumé and upload photos of themselves[22] for a yearly fee.[23] The base annual charge for including a photo with an account was $39.95 until 2010, when it was increased to$54.95. IMDb résumé pages are kept on a sub-page of the regular entry about that person, with a regular entry automatically created for each résumé subscriber who does not already have one.[24]

As of 2012, Resume Services is now included as part of an IMDbPro subscription, and is no longer offered as a separate subscription service.

### Copyright, vandalism, and error issues

All volunteers who contribute content to the database technically retain copyright on their contributions but the compilation of the content becomes the exclusive property of IMDb with the full right to copy, modify, and sublicense it and they are verified before posting.[25] Credit is not given on specific title or filmography pages to the contributor(s) who have provided information. Conversely, a credited text entry, such as a plot summary, may be "corrected" for content, grammar, sentence structure, perceived omission or error, by other contributors without having to add their names as co-authors. Due to the process of having the submitted data or text reviewed by a section manager, IMDb is different from database projects like Wikipedia, Discogs, or OpenStreetMap in that contributors cannot add, delete, or modify the data or text on impulse, and the manipulation of data is controlled by IMDb technology and salaried staff.[26] Nevertheless, although it is generally assumed to be reliable[N 1], IMDb has been subject to deliberate additions of false information, as acknowledged by a spokesperson in 2012: "We make it easy for users and professionals to update much of our content, which is why we have an 'edit page.' The data that is submitted goes through a series of consistency checks before it goes live. Given the sheer volume of the information, occasional mistakes are inevitable, and, when reported, they are promptly fixed. We always welcome corrections."[27]

The Java Movie Database (JMDB)[28] is reportedly creating an IMDb_Error.log file that lists all the errors found while processing the IMDb plain text files. A Wiki alternative to IMDb is omdb (Open Media Database) whose content is also contributed by users but licensed under CC-by and the GFDL. Since 2007, IMDb has been experimenting with wiki-programmed sections for complete film synopses, parental guides, and FAQs about titles as determined by (and answered by) individual contributors.

### Data format and access

IMDb does not provide an API for automated queries. However most of the data can be downloaded as compressed plain text files and the information can be extracted using the command-line interface tools provided.[29] Beside that there is the Java-based graphical user interface (GUI) application available that is able to process the compressed plain text files and allow to search and display the information.[28] This GUI application supports different languages but the movie related data is of course English as made available by IMDb. A Python package called IMDbPY can also be used to process the compressed plain text files into a number of different SQL databases, enabling easier access to the entire dataset for searching or data mining.[30]

### Film titles

The IMDb has sites in English as well as versions translated completely or in part into other languages (Portuguese, Finnish, French, German, Hungarian, Italian, Polish, Romanian and Spanish). The non-English language sites display film titles in the specified language. While originally the IMDb's English-language sites displayed titles according to their original country-of-origin language, in 2010 the IMDb began allowing individual users in the UK and USA to choose primary title display by either the original-language titles, or the US or UK release title (normally, in English).

## Criticism

In 2011, in the case of Hoang v. Amazon.com, IMDb was sued by an anonymous actress for more than US\$1 million due to IMDb revealing her age (40, at the time).[31] The actress claimed that revealing her age could cause her to lose acting opportunities.[32] Judge Marsha J. Pechman, a U.S. district judge in Seattle, dismissed the lawsuit, saying the actress had no grounds to proceed with an anonymous complaint. She re-filed and so revealed that the complainant is a Huong Hoang of Texas, who uses the stage name Junie Hoang.[33] In 2013, Pechman dismissed all causes of action except for a breach of contract claim against IMDb; a jury then found for IMDb on that claim.[34]

## Notes

1. ^ It may be assumed to be generally reliable but the IMDb doesn't claim that it is 100% accurate.

## References

1. ^ "Imdb.com Site Info". Alexa Internet. Retrieved 2013-04-22.
2. ^ "Top Ten Internet Companies | CelebJunkyz.com | Celebrity News Celebrity Gossip Celebrity Blog". CelebJunkyz.com. August 19, 2009. Retrieved August 20, 2011.
3. ^ "Stats". IMDb. Retrieved Feb 16, 2013.
4. ^ "Historical Internet Movie Database Site". Cardiff School of Computer Science & Informatics. Retrieved March 21, 2013.
5. ^
6. ^ Chacksfield, Marc (January 14, 2010). "China blocks number-one movie site IMDb". 2012 Future US, Inc.
7. ^ Ehlrich, Brenna (September 30, 2010). "IMDb Turns 20, Launches Original Video to Celebrate". mashable.com.
8. ^ "INTERNET BOOKSELLER AMAZON.COM ANNOUNCES ACQUISITION OF UNITED KINGDOM COMPANY THE INTERNET MOVIE DATABASE LTD.". IMDb via PR Newswire Europe. Retrieved January 15, 2007.
9. ^ Needham, Col (January 1, 2011). "IMDb announcement: Top 250 Contributors for 2010". IMDb Contributors Top Contributors. Retrieved August 25, 2011.
10. ^ Hoffman, Harrison (September 15, 2008). "IMDb now serves full-length videos". cnet. Retrieved September 17, 2008.
11. ^ Modine, Austin (September 16, 2008). "IMDb adds full-length streaming movies (Show your US ID card at the door)". The Register. Retrieved September 17, 2008.
12. ^ Wong, David. "IMDB". Cracked.com. Retrieved February 25, 2013.
13. ^ "Why IMDb’s Top 250 Matters...And Why It Doesn’t". Screenrant.com. April 13, 2010. Retrieved February 25, 2013.
14. ^ IMDb Charts: IMDb Bottom 100. imdb.com
15. ^
16. ^ "Top 250 movies as voted by our users". IMDb. Retrieved March 1, 2007.
17. ^ "Types of titles excluded from the Top 250". IMDb. Retrieved April 25, 2007.
18. ^ The user votes average on film X is 9.4, so it should appear in your top 250 films listing, yet it doesn't. Why?
19. ^ Norberg, Ragnar (2006). "Credibility Theory" (PDF). Encyclopedia of Actuarial Science. doi:10.1002/9780470012505.tac068. ISBN 0470846763. mirror
20. ^ "Bottom 100". IMDb. Retrieved March 1, 2007.
21. ^ Each TV episode uses the same message board for the whole series
22. ^ Lycos Europe and IMDb sign sales agreement for 9 European markets. Lycos Europe press release, July 10, 2006.
23. ^ IMDb Resume FAQ: Can I subscribe only for one month or one year?. Retrieved January 22, 2008.
24. ^ IMDb Resume FAQ: Is there any difference between a regular IMDb name page and an IMDb name page created via IMDb Resume?. Retrieved January 22, 2008.
25. ^ IMDb Copyright and Conditions of Use. imdb.com
26. ^ The Plain Text Data Files IMDb – Alternate Interfaces
27. ^ "Which A-List Star Is Hacking IMDb Pages?". Hollywoodreporter.com. November 14, 2012. Retrieved February 25, 2013.
28. ^ a b "Java Movie Database (JMDB)". Jmdb.de. Retrieved October 27, 2010.
29. ^ "Alternate Interfaces". IMDb. Retrieved January 15, 2007.
30. ^ "IMDbPY". IMDbPY. Retrieved February 14, 2011.
31. ^ Bahr, Lindsey (October 18, 2011). "Lawsuit against IMDb revealing private information". Insidemovies.ew.com. Retrieved April 25, 2013.
32. ^ "Acting unions criticise IMDb in age row". BBC. October 29, 2011. Retrieved October 29, 2011.
33. ^ "Actress Sued Amazon For Revealing Age 40 Identified As Huong Junie Hoang". News.sky.com. January 7, 2012. Retrieved April 21, 2012.
34. ^ "Actress age claim against IMDb rejected". BBC News. Retrieved 12 April 2013.