Jump to content

Gopher (protocol)

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Skedaddle (talk | contribs) at 12:04, 16 June 2010 (gah, more nits). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The Gopher protocol is a TCP/IP Application layer protocol designed for distributing, searching, and retrieving documents over the Internet. Strongly oriented towards a menu-document design, the Gopher protocol was a predecessor of (and later, an alternative to) the World Wide Web. The protocol offers some features not natively supported by the Web and imposes a much stronger hierarchy on information stored on it. Its text menu interface is well-suited to computing environments that rely heavily on remote text-oriented computer terminals, common in universities at the time of its creation in 1991 until 1993.[1] Although largely supplanted by the Web in the years following, the Gopher protocol is still in use by enthusiasts, and a small population of servers remains in active maintenance.

Origins

The original Gopher system was released in late spring of 1991 by Mark McCahill, Farhad Anklesaria, Paul Lindner, Daniel Torrey, Adam Huminsky, and Bob Alberti of the University of Minnesota. Its central goals were, as stated in RFC 1436:

  • A file-like hierarchical arrangement that would be familiar to users
  • A simple syntax
  • A system that can be created quickly and inexpensively
  • Extending the file system metaphor, such as searches

Gopher combines document hierarchies with collections of services, including WAIS, the Archie and Veronica search engines, and gateways to other information systems such as ftp and Usenet.

The general interest in Campus-Wide Information Systems (CWISs)[2] in higher education at the time, and the ease with which a Gopher server could be set up to create an instant CWIS with links to other sites' online directories and resources were the factors contributing to Gopher's rapid adoption. By 1992, the standard method of locating someone's e-mail address was to find their organization's CCSO nameserver entry in Gopher, and query the nameserver.[3]

Stagnation

The World Wide Web was in its infancy in 1991, and Gopher services quickly became established. By the late 1990s, Gopher had largely ceased expanding. Several factors contributed to Gopher's stagnation:

  • In February 1993, the University of Minnesota announced that it would charge licensing fees for the use of its implementation of the Gopher server.[4] As a consequence of this, some users suspected that a licensing fee would also be charged for independent implementations.[5][6] In contrast, no such limitation has yet been imposed on the World Wide Web. The University of Minnesota later re-licensed its Gopher software under the GNU GPL.[7]
  • Gopher client functionality was quickly duplicated by early Web browsers, such as Mosaic, which subsumed the protocol as part of their functions.
  • Gopher has a more rigid structure compared to the free-form HTML of the Web. With Gopher, every document has a defined format and type, and the typical user navigates through a single server-defined menu system to get to a particular document. This can be quite different from the way a typical user might traverse documents on the Web.

Availability of Gopher today

Gopher remains in active use by its enthusiasts, and there have been attempts to revive the use of Gopher. One such attempt is the Overbite project, [8] a Firefox extension that adds better support for the protocol to the browser.

As of 2010, there are approximately 150 gopher servers indexed by Veronica-2,[9] reflecting a slow growth from 2007 when there were fewer than 100,[10] although many are infrequently updated. A handful of new servers are set up every year by hobbyists — over 50 have been set up and added to Floodgap's list since 1999[11]. A snapshot of Gopherspace as it was in 2007 was circulated on BitTorrent and is still available.[12] Due to the simplicity of the Gopher protocol, setting up new servers or adding Gopher support to browsers is often done in a tongue-in-cheek way, principally on April Fools' Day[13][14]

Some have suggested that the bandwidth-sparing simple interface of Gopher would be a good match for mobile phones and Personal digital assistants (PDAs),[15] but so far, mobile adaptations of HTML and XML and other simplified content have proved more popular. The PyGopherd server provides a built-in WML front-end to Gopher sites served with it.

Gopher support in web browsers

Mozilla Firefox 3.7 displaying the top-level menu of the Floodgap gopher server
Browser Currently Supported Supported from Supported until Notes
Internet Explorer No 1 6.0 RTM Re-enable with registry patch[16]. Always uses port 70.
Internet Explorer for Mac No 5.2.3 PowerPC-only
Mozilla Firefox Yes 0 4.0 Beta 1 Always uses port 70. (Dropped from Firefox 4.0 core onwards due to security concerns; can be added back with add-on support.[17])
SeaMonkey Yes 1.0 Current
Camino Yes 1.0 Current
OmniWeb Yes 5.9.2 Current First WebKit Browser to support Gopher[18][19]
Epiphany Yes Current
Galeon Yes Current
Konqueror Plugin kio_gopher
K-Meleon Yes Current
Lynx Yes Current Complete support
ELinks Beta[20] Build option
Safari No never
Opera No never Opera 9.0 includes a proxy capability
Google Chrome No never
Classilla Yes 9.0 Current

Gopher support was disabled in Internet Explorer versions 5.* and 6 for Windows in June 2002 by a patch meant to fix a security vulnerability in the browser's Gopher protocol handler; however, it can be re-enabled by editing the Windows registry. In Internet Explorer 7, Gopher support was removed on the WinINET level.[21]

Gopher clients

Gopher was at its height of popularity during a time when there were still many equally competing computer architectures and operating systems. As such, there are several Gopher clients available for Acorn RISC OS, AmigaOS, Atari MiNT, CMS, DOS, MacOS 7x, MVS, NeXT, OS/2 Warp, most UNIX-like operating systems, VMS, Windows 3x, and Windows 9x. GopherVR was a client designed for 3D visualization, and there is even a Gopher client MOO object. The majority of these clients are hard coded to work on TCP port 70.

Gopher to HTTP gateways

Users of Web browsers that have incomplete or no support for Gopher can access content on Gopher servers via a server gateway or proxy server that converts Gopher menus into HTML; one such proxy is the Floodgap Public Gopher Proxy. Similarly, certain server packages such as GN and PyGopherd have built-in Gopher to HTTP interfaces.

Gopher characteristics

As part of its design goals, Gopher functions and appears much like a mountable read-only global network file system (and software, such as gopherfs, is available that can actually mount a Gopher server as a FUSE resource). At a minimum, whatever a person can do with data files on a CD-ROM, they can do on Gopher.

A Gopher system consists of a series of hierarchical hyperlinkable menus. The choice of menu items and titles is controlled by the administrator of the server.

File:Floodgap gopher top menu.PNG
The top level menu of a Gopher server. Selecting the "Fun and Games" menu item...
File:Floodgap gopher fun menu.PNG
... takes the user to the "Fun and Games" menu.


File:Floodgap gopher servers menu.PNG
A Gopher menu listing other accessible servers.

Similar to a file on a Web server, a file on a Gopher server can be linked to as a menu item from any other Gopher server. Many servers take advantage of this inter-server linking to provide a directory of other servers that the user can access.

Technical details

Protocol

The Gopher protocol was first described in RFC 1436. IANA has assigned TCP port 70 to the Gopher protocol.

The protocol is simple to negotiate, making it possible to browse without using a client. A standard gopher Telnet session may therefore appear as follows:

telnet quux.org 70
Trying 64.85.160.193...
Connected to quux.org.
Escape character is '^]'.
/Reference
1CIA World Factbook     /Archives/mirrors/textfiles.com/politics/CIA    gopher.quux.org 70
0Jargon 4.2.0   /Reference/Jargon 4.2.0 gopher.quux.org 70      +
1Online Libraries       /Reference/Online Libraries     gopher.quux.org 70     +
1RFCs: Internet Standards       /Computers/Standards and Specs/RFC      gopher.quux.org 70
1U.S. Gazetteer /Reference/U.S. Gazetteer       gopher.quux.org 70      +
iThis file contains information on United States        fake    (NULL)  0
icities, counties, and geographical areas.  It has      fake    (NULL)  0
ilatitude/longitude, population, land and water area,   fake    (NULL)  0
iand ZIP codes. fake    (NULL)  0
i       fake    (NULL)  0
iTo search for a city, enter the city's name.  To search        fake    (NULL) 0
ifor a county, use the name plus County -- for instance,        fake    (NULL) 0
iDallas County. fake    (NULL)  0
Connection closed by foreign host.

Here, the client has established a TCP connection with the server on port 70, the standard gopher port. The client then sends a string followed by a carriage return followed by a line feed (a "CR + LF" sequence). This is the selector, which identifies the document to be retrieved. If the item selector were an empty line, the default directory would be selected. The server then replies with the requested item and closes the connection. According to the protocol, before the connection is closed, the server should send a full-stop (i.e., a period character) on a line by itself. However, as is the case here, not all servers conform to this part of the protocol and the server may close the connection without returning the final full-stop.

In this example, the item sent back is a gopher menu, a directory consisting of a sequence of lines each of which describes an item that can be retrieved. Most clients will display these as hypertext links, and so allow the user to navigate through gopherspace by following the links.

All lines in a gopher menu are terminated by "CR + LF", and consist of five fields: the item type as the very first character (see below), the display string (i.e., the description text to display), a selector (i.e., a file-system pathname), host name (i.e., the domain name of the server on which the item resides), and port (i.e., the port number used by that server). The item type and display string are joined without a space; the other fields are separated by the tab character.

Because of the simplicity of the Gopher protocol, tools such as netcat make it possible to download Gopher content easily from the command line:

echo jacks/jack.exe | nc gopher.example.org 70 > jack.exe

Gopher item types

Item types are described in gopher menus by a single number or (case specific) letter and act as hints to the client to tell it how to handle a specific media type in a menu, analogous to a MIME type. Every client necessarily must understand itemtypes 0 and 1. All known clients understand item types 0 through 9, g, and s, and all but the very oldest also understand file-types h and i.

A list of additional file-type definitions has continued to evolve over time, with some clients supporting them and others not. As such, many servers assign the generic 9 to every binary file, hoping that the client's computer will be able to correctly process the file.

Historically, to create a link to a Web server, "GET /" was used as a pseudo-selector to simulate an HTTP client request. John Goerzen created an addition [22] to the Gopher protocol, commonly referred to as "URL links", that allows links to any protocol that supports URLs. For example, to create a link to http://gopher.quux.org/, the item type is "h", the description is arbitrary, the item selector is "URL:http://gopher.quux.org/", and the domain and port are that of the originating Gopher server. For clients that do not support URL links, the server creates an HTML redirection page.

The master Gopherspace search engine is Veronica. Veronica offers a keyword search of all the public Internet Gopher server menu titles. A Veronica search produces a menu of Gopher items, each of which is a direct pointer to a Gopher data source. Individual Gopher servers may also use localized search engines specific to their content such as Jughead and Jugtail.

GopherVR is a 3D virtual reality variant of the original Gopher system.

Gopher server software

Because the protocol is trivial to implement in a basic fashion, there are many server packages still available, and some are still maintained.

See also

  • Veronica — the search engine system for the Gopher protocol, an acronym for "Very Easy Rodent-Oriented Net-wide Index to Computer Archives"
  • Gopher+ — early proposed extensions to the Gopher protocol
  • GopherVR
  • Jugtail — an alternative search engine system for the Gopher protocol. Jugtail was formerly known as Jughead.
  • SDF Public Access Unix Network — a non-profit organization which provides free Gopher hosting
  • Phlog — The gopher version of a weblog
  • Wide area information server — a search engine whose popularity was contemporary with Gopher

References

  1. ^ Gopher archive at quux.org (gopher link)
  2. ^ Google Groups archive of bit.listserv.cwis-l discussion
  3. ^ Google Groups archive of comp.infosystems.gopher discussion
  4. ^ http://www.funet.fi/pub/vms/networking/gopher/gopher-software-licensing-policy.ancient
  5. ^ Google Groups
  6. ^ http://groups.google.com/groups?selm=36e4c2f1.10244576@nntp.best.ix.netcom.com
  7. ^ gopher://www.michaeleshun.4t.com
  8. ^ The Overbite Project, hosted by Floodgap Systems
  9. ^ gopher://gopher.floodgap.com/0/v2/vstat
  10. ^ Kaiser, Cameron (2007-03-19). "Down the Gopher Hole". TidBITS. Retrieved 2007-03-23.
  11. ^ gopher://gopher.floodgap.com/1/new
  12. ^ http://changelog.complete.org/archives/1466-download-a-piece-of-internet-history A freely downloadable BitTorrent file containing an archive of all available Gopherspace content as of 2007 and an interview with the creators of Gopher.
  13. ^ http://www.omnigroup.com/applications/omniweb/releasenotes/
  14. ^ gopher://gopher.floodgap.com/1/new "Service note for 1 April 2009—This isn't a joke server, guys, we've been running for 10 years!"
  15. ^ Wired News: Gopher: Underground Technology
  16. ^ "Microsoft Security Bulletin MS02-047". Microsoft date=2003-02-28. Retrieved 2007-03-23. {{cite web}}: Missing pipe in: |publisher= (help)
  17. ^ "Bug 388195 - Remove gopher protocol support for Firefox". Retrieved 2010-06-15.
  18. ^ "FOR IMMEDIATE RELEASE: OmniWeb 5.9.2 now includes Gopher support!". OmniGroup. 2009-04-01. Retrieved 2009-04-03. {{cite web}}: |first= missing |last= (help)
  19. ^ "A comprehensive list of changes for each version of OmniWeb". OmniGroup. 2009-04-01. Retrieved 2009-04-03.
  20. ^ Fonseca, Jonas (24 December 2004). "elinks-users ANNOUNCE ELinks-0.10.0 (Thelma)". Linux from scratch. Retrieved 22 May 2010.
  21. ^ "Release Notes for Internet Explorer 7". Microsoft. 2006. Retrieved 2007-03-23.
  22. ^ http://gopher.quux.org/Archives/Mailing%20Lists/gopher/gopher.2002-02%7C/MBOX-MESSAGE/34

Standards

  • IANA Port Number allocations
  • RFC 1436 — The Internet Gopher Protocol (a distributed document search and retrieval protocol)
  • RFC 1580 — Guide to Network Resource Tools
  • RFC 1689 — Networked Information Retrieval: Tools and Groups
  • RFC 1738 — Uniform Resource Locators (URL)
  • RFC 1808 — Relative Uniform Resource Locators
  • RFC 2396 — Uniform Resource Identifiers (URI): Generic Syntax
  • RFC 4266 — The gopher URI Scheme

Template:Earlybrowsers