Annotation

From Wikipedia, the free encyclopedia
Jump to: navigation, search

An annotation is metadata (e.g. a comment, explanation, presentational markup) attached to text, image, or other data. Often annotations refer to a specific part of the original data.

Literature and education[edit]

Textual scholarship[edit]

Main article: Textual scholarship

Textual scholarship is a discipline that often uses the technique of annotation to describe or add additional historical context to texts and physical documents.

Student uses[edit]

Students often highlight or underline passages in books in order to refer back to key phrases easily, or add marginalia to aid studying. One educational technique when analyzing prose literature is to have students or teachers circle the names of characters and put rectangular boxes around phrases identifying the setting of a given scene.[citation needed]

Annotated bibliographies add commentary on the relevance or quality of each source, in addition to the usual bibliographic information that merely identifies the source.

Software engineering[edit]

Text documents[edit]

Markup languages like XML and HTML annotate text in a way that is syntactically distinguishable from that text. They can be used to add information about the desired visual presentation, or machine-readable semantic information.

Source control[edit]

The "annotate" function (also known as "blame" or "praise") used in source control systems such as Team Foundation Server and Subversion determines who committed changes to the source code into the repository. This outputs a copy of the source code where each line is annotated with the name of the last contributor to edit that line (and possibly a revision number). This can help establish blame in the event a change caused a malfunction, or identify the author of brilliant code.

Java annotations[edit]

Main article: Java annotation

A special case is the Java programming language, where annotations can be used as a special form of syntactic metadata in the source code.[1] Classes, methods, variables, parameters and packages may be annotated. The annotations can be embedded in class files generated by the compiler and may be retained by the Java virtual machine and thus influence the run-time behaviour of an application. It is possible to create meta-annotations out of the existing ones in Java.

Computational biology[edit]

Since the 1980s, molecular biology and bioinformatics have created the need for DNA annotation. DNA annotation or genome annotation is the process of identifying the locations of genes and all of the coding regions in a genome and determining what those genes do. An annotation (irrespective of the context) is a note added by way of explanation or commentary. Once a genome is sequenced, it needs to be annotated to make sense of it.

For DNA annotation, a previously unknown sequence representation of genetic material is enriched with information relating genomic position to intron-exon boundaries, regulatory sequences, repeats, gene names and protein products. This annotation is stored in genomic databases as Mouse Genome Informatics, FlyBase, and WormBase. Educational materials on some aspects of biological annotation from the 2006 Gene Ontology annotation camp and similar events are available at the Gene Ontology website.[2]

The National Center for Biomedical Ontology (www.bioontology.org) develops tools for automated annotation[3] of database records based on the textual descriptions of those records.

As a general method, dcGO [4] has an automated procedure for statistically inferring associations between ontology terms and protein domains or combinations of domains from the existing gene/protein-level annotations.

Imaging[edit]

In the digital imaging community the term annotation is commonly used for visible metadata superimposed on an image without changing the underlying master image, such as sticky notes, virtual laser pointers, circles, arrows, and black-outs (cf. redaction).

In the medical imaging community, an annotation is often referred to as a region of interest and is encoded in DICOM format.

Law[edit]

In the United States, legal publishers such as Thomson West and Lexis Nexis publish annotated versions of statutes, providing information about court cases that have interpreted the statutes. Both the federal United States Code and state statutes are subject to interpretation by the courts, and the annotated statutes are valuable tools in legal research.

Linguistics[edit]

In linguistics, annotations include comments and metadata; these non-transcriptional annotations are also non-linguistic. A collection of texts with linguistic annotations is known as a corpus (plural corpora). The Linguistic Annotation Wiki[5] describes tools and formats for creating and managing linguistic annotations.

See also[edit]

References[edit]

  1. ^ "JDK 5.0 Developer's Guide: Annotations". Sun Microsystems. 2007-12-18. Archived from the original on 6 March 2008. Retrieved 2008-03-05. .
  2. ^ "GO Teaching Resources". Archived from the original on 10 October 2006. Retrieved 21 September 2006. 
  3. ^ http://bioontology.stanford.edu/annotator-service
  4. ^ Fang, H.; Gough, J. (2012). "DcGO: Database of domain-centric ontologies on functions, phenotypes, diseases and more". Nucleic Acids Research 41 (Database issue): D536–D544. doi:10.1093/nar/gks1080. PMC 3531119. PMID 23161684.  edit
  5. ^ annotation.exmaralda.org