HTTP 404

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Mr. Guye (talk | contribs) at 21:36, 26 April 2014. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The 404 or Not Found error message is a HTTP standard response code indicating that the client was able to communicate with a given server, but the server could not find what was requested.

The web site hosting server will typically generate a "404 Not Found" web page when a user attempts to follow a broken or dead link; hence the 404 error is one of the most recognizable errors users can find on the web.[1]

A 404 error should not be confused with "server not found" or similar errors, in which a connection to the destination server could not be made at all. A 404 error indicates that the requested resource may be available again in the future; however, the fact does not guarantee the same content.

Overview

When communicating via HTTP, a server is required to respond to a request, such as a web browser's request for a web page, with a numeric response code and an optional, mandatory, or disallowed (based upon the status code) message. In the code 404, the first digit indicates a client error, such as a mistyped Uniform Resource Locator (URL). The following two digits indicate the specific error encountered. HTTP's use of three-digit codes is similar to the use of such codes in earlier protocols such as FTP and NNTP.

At the HTTP level, a 404 response code is followed by a human-readable "reason phrase". The HTTP specification suggests the phrase "Not Found"[2] and many web servers by default issue an HTML page that includes both the 404 code and the "Not Found" phrase.

A 404 error is often returned when pages have been moved or deleted. In the first case, a better response is to return a 301 Moved Permanently response, which can be configured in most server configuration files, or through URL rewriting; in the second case, a 410 Gone should be returned. Because these two options require special server configuration, most websites do not make use of them.

404 errors should not be confused with DNS errors, which appear when the given URL refers to a server name that does not exist. A 404 error indicates that the server itself was found, but that the server was not able to retrieve the requested page.

Custom error pages

The Wikimedia 404 message

Web servers can typically be configured to display a customised 404 error page, including a more natural description, the parent site's branding, and sometimes a site map, a search form or 404 page widget. The protocol level phrase, which is hidden from the user, is rarely customized.

Internet Explorer, however, will not display custom pages unless they are larger than 512 bytes, opting instead to display a "friendly" error page.[3] Google Chrome includes similar functionality, where the 404 is replaced with alternative suggestions generated by Google algorithms, if the page is under 512 bytes in size.[citation needed]. This goes against the recommendations of RFC 2616, which states that "user agents SHOULD display any included entity to the user".

Another problem is that if the page does not provide a favicon, and a separate custom 404 page exists, extra traffic and longer loading times will be generated on every page view.[4][5]

Tracking/Checking 404 errors

A number of tools exist that crawl through a website to find pages that return 404 status codes.[6] These tools can be helpful in finding links that exist within a particular website. The limitation of these tools is that they only find links within one particular website, and ignore 404s resulting from links on other websites. As a result, these tools miss out on 83% of the 404s on websites.[7] One way around this is to find 404 errors by analyzing external links.[8]

Another common method is tracking traffic to 404 pages using log file analysis.[9] This can be useful to understand more about what 404s users reached on the site. However, log files can be somewhat complex to review. Another method of tracking traffic to 404 pages is using JavaScript-based traffic tracking tools.[10]

404 page widgets

While many websites send additional information in a 404 error message—such as a link to the homepage of a website or a search box—some also endeavor to find the correct web page the user wanted. Extensions are available for some popular CMS to do this.[11] In Europe, the NotFound Project encourages site operators to add a snippet of code to serve customised 404 error pages which provide data about missing children.[12]

Soft 404

Some websites report a "not found" error by returning a standard web page with a "200 OK" response code; this is known as a soft 404. Soft 404s are problematic for automated methods of discovering whether a link is broken. Some search engines, like Yahoo, use automated processes to detect soft 404s.[13] Soft 404s can occur as a result of configuration errors when using certain HTTP server software, for example with the Apache software, when an Error Document 404 (specified in a .htaccess file) is specified as an absolute path (e.g. http://example.com/error.html) rather than a relative path (/error.html).[14]

Some proxy servers generate a 404 error when the remote host is not present, rather than returning the correct 500-range code when errors such as hostname resolution failures or refused TCP connections prevent the proxy server from satisfying the request. This can confuse programs that expect and act on specific responses, as they can no longer easily distinguish between an absent web server and a missing web page on a web server that is present.

In July 2004, the UK telecom provider BT Group deployed the Cleanfeed content blocking system, which returns a 404 error to any request for content identified as potentially illegal by the Internet Watch Foundation.[15] Other ISPs return a HTTP 403 "forbidden" error in the same circumstances.[16] The practice of employing fake 404 errors as a means to conceal censorship has also been reported in Thailand[17] and Tunisia.[18] In Tunisia, where censorship is reportedly severe, people have become aware of the nature of the fake 404 errors and have created an imaginary character named "Ammar 404" who represents "the invisible censor".[19]

In popular culture

[relevant?]

  • During the May 2011 Greek protests, one slogan was "Error 404: Democracy not found".[20]
  • In Tunisia political censorship led to 404 becoming so ubiquitous that Tunisian bloggers invented a character called Ammar they held responsible for its deployment.[21]
  • Renny Gleeson with Weiden and Kennedy spoke at TED in 2012. Through a PIE initiative, Portland thinkers[who?] have been exploring ways to improve the emotional response to a 404 page experience. Companies like Groupon, that specialize in digital consumerism, have begun to utilize their ideas in order to personalise the consumer experience when they have reached a 'dead-end' while searching through products. These examples of 404 pages could be considered "quirky" as they do not reflect the companies own strategies, but feature irrelevant items to lower the formality of the page.[22]
  • In the movie Four Rooms there is a segment that takes place in room 404, a room where people accidentally go to looking for a party, but finding something else instead.
  • XKCD, in reference to this phenomenon, skipped Comic #404 so that going to it manually via the URL gives a 404 error page.

Slang usage

In 2008, a study carried out by the telecommunications arm of the Post Office[23] found that "404" had become a slang synonym for "clueless" in the United Kingdom. Slang lexicographer Jonathon Green said that "404" as a slang term had been driven by the "influence of technology" and young people, but at the time, such usage was relatively confined to London and other urban areas.[23]

See also

References

  1. ^ "The 5 Most Common HTTP errors according to Google". Pingdom. 6 May 2009. Retrieved 14 June 2012.
  2. ^ "6.1.1 Status Code and Reason Phrase". W3C. Retrieved 22 June 2008.
  3. ^ "Friendly HTTP Error Pages". 18 August 2010. Retrieved 14 June 2012.
  4. ^ Heng, Christopher (7 September 2008). "What is Favicon.ico and How to Create a Favicon Icon for Your Website". thesitewizard.com. Retrieved 23 February 2011.
  5. ^ Cole, Linda (3 August 1999). "The Dastardly "favicon.ico not found" Error". Web Developers' Virtual Library. QuinStreet. Retrieved 30 February 2011. {{cite web}}: Check date values in: |accessdate= (help)
  6. ^ Smarty, Ann (7 September 2009). "3 Broken Link Checkers Reviewed And Compared". Search Engine Journal. Retrieved 7 June 2013.
  7. ^ "Sources Leading To 404s". SpringTrax. Retrieved 11 February 2013.
  8. ^ Cushing, Anne (2 April 2013). "A Data-Centric Approach To Identifying 404 Pages Worth Saving". Search Engine Land. Retrieved 7 June 2013.
  9. ^ "Tracking and Preventing 404 Errors". 404errorpages.com. Retrieved 7 June 2013.
  10. ^ "Understand 404 Errors". SpringTrax.com. Retrieved 7 June 2013.
  11. ^ Swenson, Sahala (19 August 2008). "Make your 404 pages more useful". Official Google Webmaster Central Blog. Google, Inc. Retrieved 28 August 2009.
  12. ^ "Missing children messages go on 404 error pages". BBC News. 27 September 2012.
  13. ^ "Why is your crawler asking for strange URLs that have never existed on my site?". Yahoo Ysearch Help page. Retrieved 4 September 2013.
  14. ^ "Farewell to soft 404s". Google Official Blog. Retrieved 20 September 2008.
  15. ^ "LINX Public Affairs » Cleanfeed: the facts". Publicaffairs.linx.net. 10 September 2004. Retrieved 6 March 2011.
  16. ^ "DEMON – Error 403". Retrieved 14 June 2012.
  17. ^ Sambandaraksa, Don (18 February 2009). "The old fake '404 Not Found' routine". Bangkok Post. Retrieved 12 September 2010.
  18. ^ Noman, Helmi (12 September 2008). "Tunisian journalist sues government agency for blocking Facebook, claims damage for the use of 404 error message instead of 403". Open Net Initiative. Retrieved 21 November 2010.
  19. ^ "Anti-censorship movement in Tunisia: creativity, courage and hope!". Global Voices Advocacy. 27 May 2010. Archived from the original on 7 July 2012. Retrieved 28 August 2010.
  20. ^ "Sinde recibida en San Sebastián por los "indignados" al grito de "fuera"" (in Spanish). ABC Periódico Electrónico. 6 June 2011. Retrieved 12 June 2011.
  21. ^ BenMhenni, Lina. "Ammar 4040". http://advocacy.globalvoicesonline.org. Archived from the original on 9 July 2012. Retrieved 29 June 2011. {{cite web}}: External link in |publisher= (help)
  22. ^ "From skateboarding bears to missing children: The power of the 404 Not Found error page". Metro. 6 June 2011. Retrieved 16 April 2013.
  23. ^ a b "Don't be 404, know the tech slang". BBC News Online. 10 December 2008. Retrieved 13 December 2008.

External links