Microdata is a WHATWG HTML specification used to nest metadata within existing content on web pages. Search engines, web crawlers, and browsers can extract and process Microdata from a web page and use it to provide a richer browsing experience for users. Search engines benefit greatly from direct access to this structured data because it allows search engines to understand the information on web pages and provide more relevant results to users. Microdata uses a supporting vocabulary to describe an item and name-value pairs to assign values to its properties. Microdata is an attempt to provide a simpler way of annotating HTML elements with machine-readable tags than the similar approaches of using RDFa and microformats.
Microdata vocabularies provide the semantics, or meaning of an Item. Web developers can design a custom vocabulary or use vocabularies available on the web. A collection of commonly used markup vocabularies are provided by Schema.org schemas which include: Person, Event, Organization, Product, Review, Review-aggregate, Breadcrumb, Offer, Offer-aggregate. Major search engine operators like Google, Microsoft and Yahoo! rely on this markup to improve search results. For some purposes, an ad-hoc vocabulary is adequate. For others, a vocabulary will need to be designed. Where possible, authors are encouraged to re-use existing vocabularies, as this makes content re-use easier.
||This article may require cleanup to meet Wikipedia's quality standards. The specific problem is: bad translation from the Russian. (January 2015)|
In some cases, search engines covering specific regions may provide locally-specific extensions of microdata. For example Yandex, a major search engine in Russia, supports microformats such as hCard (company contact information), hRecipe (food recipe), hReview (market reviews) and hProduct (product data) and provides its own format for definition of the terms and encyclopedic articles. This extension was made in order to get rid of problems with transliteration between the Cyrillic and Latin alphabets. Due to this implementation of additional marking parameters and routine use of Schema's vocabulary  the indexation of information in Russian-language web-pages became considerably more successful.
itemscope– Creates the Item and indicates that descendants of this element contain information about it.
itemtype– A valid URL of a vocabulary that describes the item and its properties context.
itemid– Indicates a unique identifier of the item.
itemprop– Indicates that its containing tag holds the value of the specified item property. The property's name and value context are described by the item's vocabulary. Properties values usually consist of string values, but can also use URLs using the
aelement and its
imgelement and its
srcattribute, or other elements that link to or embed external resources.
itemref– Properties that are not descendants of the element with the
itemscopeattribute can be associated with the item using this attribute. Provides a list of element ids (not
itemids) with additional properties elsewhere in the document.
The following HTML5 markup may be found on a typical “About” page containing information about a person:
<section> Hello, my name is John Doe, I am a graduate research assistant at the University of Dreams. My friends call me Johnny. You can visit my homepage at <a href="http://www.JohnnyD.com">www.JohnnyD.com</a>. I live at 1234 Peach Drive, Warner Robins, Georgia.</section>
<section itemscope itemtype="http://schema.org/Person"> Hello, my name is <span itemprop="name">John Doe</span>, I am a <span itemprop="jobTitle">graduate research assistant</span> at the <span itemprop="affiliation">University of Dreams</span>. My friends call me <span itemprop="additionalName">Johnny</span>. You can visit my homepage at <a href="http://www.JohnnyD.com" itemprop="url">www.JohnnyD.com</a>. <section itemprop="address" itemscope itemtype="http://schema.org/PostalAddress"> I live at <span itemprop="streetAddress">1234 Peach Drive</span>, <span itemprop="addressLocality">Warner Robins</span>, <span itemprop="addressRegion">Georgia</span>. </section> </section>
The following text shows how Google parses the Microdata from the above example code. Developers can test pages containing Microdata using Google's Rich Snippet Testing Tool.
Item Type: http://schema.org/Person name = John Doe jobTitle = graduate research assistant affiliation = University of Dreams additionalName = Johnny url = http://www.johnnyd.com/ address = Item(1) Item 1 Type: http://schema.org/PostalAddress streetAddress = 1234 Peach Drive addressLocality = Warner Robins addressRegion = Georgia
The same machine-readable terms can be used not only in HTML Microdata, but also in other annotations such as RDFa or JSON-LD in the markup, or in an external RDF file in a serialization such as RDF/XML, Notation3, or Turtle.
- Servers: Google can use microdata in its result pages. It is the preferred snippet format for the Google+ social network.
- Browsers: As of December 2013[update], notable browsers that added support for the Microdata DOM API, are:
- Microdata — HTML Draft Standard
- MicroData - The Future of Search Engine Relevance and Optimization (SEO)
- Schema.org http://schema.org/
- “Distributed,” “Extensibility,” And Other Fancy Words, Dive Into HTML5
- Semantic markup deployment in Russia
- Schema.org Documentation
- Schema.org Type Hierarchy
- Schema.org Turtle RDFS Schema
- Rich snippets (microdata, microformats, RDFa), Google webmaster central
- Rich Snippet display clarification
- Types of Rich Snippets (YouTube: Google Webmasters Channel)
- Opera 11.60 for Windows changelog
- Microdata — HTML Draft Standard, WHATWG
- W3C HTML Microdata Working Group Note, W3C
- Usability testing HTML5, 2009-10-04, about how some of the design decisions for microdata were made
- Almaer, Dion (2009-05-11), Hixie discusses the addition of HTML5 "microdata", Ajaxian
- HTML5 Microdata Specs, Data-Vocabulary.org
- GetSchema Annotating HTML5 with Schema.org and Microdata. An Example Wiki.
- Live Microdata, a tool to interactively edit and extract the Microdata embedded in HTML
- Microdata generator, Web based Microdata rich snippet generator