Jump to content

Ctags

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 2001:14ba:80d2:6f00::1ec (talk) at 20:06, 28 September 2018. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Ctags
Developer(s)Ken Arnold
Repository
TypeProgramming tool (Specifically: Code navigation tool)
LicenseBSD

Ctags is a programming tool that generates an index (or tag) file of names found in source and header files of various programming languages. Depending on the language, functions, variables, class members, macros and so on may be indexed. These tags allow definitions to be quickly and easily located by a text editor or other utility. Alternatively, there is also an output mode that generates a cross reference file, listing information about various names found in a set of language files in human-readable form.

The original Ctags was introduced in BSD Unix and was written by Ken Arnold, with Fortran support by Jim Kleckner and Pascal support by Bill Joy.

Editors that support ctags

Tag index files are supported by many source code editors, including:

Variants of ctags

There are a few variations of the ctags program:

Etags

Etags is the ctags utility that comes with Emacs. For those options which only make sense for vi style tag files produced by ctags, etags could not recognize them and would ignore them.

Exuberant Ctags

Exuberant Ctags, written and maintained by Darren Hiebert, was initially distributed with Vim, but became a separate project upon the release of Vim 6.[1] It includes support for Emacs compatibility.

Exuberant Ctags includes support for over 40 programming languages with the ability to add support for even more using regular expressions.

Hasktags

Hasktags creates ctags compatible tag files for Haskell source files.[2] It includes support for creating Emacs etags files.[3]

jsctags

jsctags is a ctags-compatible code indexing solution for JavaScript.[4] It is specialized for JavaScript and its packaging system CommonJS and outperforms Exuberant Ctags for JavaScript code, finding more tags than the latter.[5]

Universal Ctags

Universal Ctags is a fork of Exuberant Ctags, with the objective of continuing its development.[6]

Tags file formats

There are multiple tag file formats. Some of them are described below. In the following, <\x##> represents the byte with hexadecimal representation ##.

Ctags and Exuberant Ctags

The original ctags and Exuberant Ctags have similar file formats:[7]

Ctags

This is the format used by vi and various clones. The tags file is normally named "tags".

The tags file is a list of lines, each line in the format:

{tagname}<Tab>{tagfile}<Tab>{tagaddress}

The fields are specified as follows:

  • {tagname} – Any identifier, not containing white space
  • <Tab> – Exactly one tab character, although many versions of vi can handle any amount of white space
  • {tagfile} – The name of the file where {tagname} is defined, relative to the current directory
  • {tagaddress} – An ex mode command that will take the editor to the location of the tag. For POSIX implementations of vi this may only be a search or a line number.

The tags file is sorted on the {tagname} field which allows for fast searching of the tags file.

Exuberant Ctags

This is the format used by Vim. It can generate an original ctags file format or an extended format that attempts to retain backward compatibility.

The fields are specified as follows:

{tagname}<Tab>{tagfile}<Tab>{tagaddress}[;"<Tab>{tagfield}...]

The fields up to and including {tagaddress} are the same as for ctags above.

Optional additional fields are indicated by square brackets ("[...]") and include:

  • ;" – semicolon + double quote: Ends the {tagaddress} in a way that looks like the start of a comment to vi.
  • {tagfield}

Extended format extends the {tagaddress} field under certain circumstances to include a set of extension fields embedded in an ex comment immediately appended to the ex command, which leaves it backward-compatible with original vi implementations. These extension fields are tab separated "key:value" pairs, documented in the ctags manual.

Etags

This is the format used by Emacs etags. The tags file is normally named "TAGS".

The etags files consists of multiple sections—one section per input source file. Sections are plain-text with several non-printable ascii characters used for special purposes. These characters are represented as bracketed hexadecimal codes below.

A section starts with a two line header, one line containing a single <\x0c> character, followed by a line which consists of:

{src_file},{size_of_tag_definition_data_in_bytes}

The header is followed by tag definitions, one definition per line, with the format:

{tag_definition_text}<\x7f>{tagname}<\x01>{line_number},{byte_offset}

{tagname} (along with <\x01>) can be omitted if the name of the tag can be deduced from the text at the tag definition.

Example

Given a single line test.c source code:

#define CCC(x)

The TAGS file would look like this:

<\x0c>
test.c,21
#define CCC(<\x7f>CCC<\x01>1,0

See also

References

  1. ^ "Vim documentation: version6: ctags-gone". Retrieved 2007-04-28.
  2. ^ "GHC documentation: Other Haskell utility programs". Retrieved 2010-03-05.
  3. ^ "hasktags: Produces ctags "tags" and etags "TAGS" files for Haskell programs". Retrieved 2010-03-05.
  4. ^ "pcwalton github repository for jsctags".
  5. ^ Patrick Walton. "Introducing jsctags". Retrieved 2010-05-25.
  6. ^ "Universal Ctags Website". Retrieved 2018-08-02.
  7. ^ "Proposal for extended Vi tags file format". Retrieved 2007-06-30.