Jump to content

Faceted search

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Dtunkelang (talk | contribs) at 03:49, 21 September 2015 (removed "relevance adjustment" section that is essentially an ad for Speedtrack.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Faceted search, also called faceted navigation or faceted browsing, is a technique for accessing information organized according to a faceted classification system, allowing users to explore a collection of information by applying multiple filters. A faceted classification system classifies each information element along multiple explicit dimensions, called facets, enabling the classifications to be accessed and ordered in multiple ways rather than in a single, pre-determined, taxonomic order.[1]

Facets correspond to properties of the information elements. They are often derived by analysis of the text of an item using entity extraction techniques or from pre-existing fields in a database such as author, descriptor, language, and format. Thus, existing web-pages, product descriptions or online collections of articles can be augmented with navigational facets.

Within the academic community, faceted search has attracted interest primarily among library and information science researchers, and to some extent among computer science researchers specializing in information retrieval.[2]

Development

The Association for Computing Machinery's Special Interest Group on Information Retrieval provided the following description of the role of faceted search for a 2006 workshop:

The web search world, since its very beginning, has offered two paradigms:

  • Navigational search uses a hierarchy structure (taxonomy) to enable users to browse the information space by iteratively narrowing the scope of their quest in a predetermined order, as exemplified by Yahoo! Directory, DMOZ, etc.
  • Direct search allows users to simply write their queries as a bag of words in a text box. This approach has been made enormously popular by Web search engines.

Over the last few years, the direct search paradigm has gained dominance and the navigational approach became less and less popular. Recently, a new approach has emerged, combining both paradigms, namely the faceted search approach. Faceted search enables users to navigate a multi-dimensional information space by combining text search with a progressive narrowing of choices in each dimension. It has become the prevailing user interaction mechanism in e-commerce sites and is being extended to deal with semi-structured data, continuous dimensions, and folksonomies.[2]

Mass market use

Faceted search has become a popular technique in commercial search applications, particularly for online retailers and libraries. An increasing number of enterprise search vendors provide software for implementing faceted search applications.

Online retail catalogs pioneered the earliest applications of faceted search, reflecting both the faceted nature of product data (most products have a type, brand, price, etc.) and the ready availability of the data in retailers' existing information-systems. In the early 2000s retailers started using faceted search. A 2014 benchmark of 50 of the largest US based online retailers reveals that despite the benefits of faceted search, only 40% of the sites have implemented it. [3] Examples include the filtering options that appear in the left column on amazon.com or Google Shopping after a keyword search has been performed.

Libraries and information science

In 1933, the noted librarian Ranganathan proposed a faceted classification system for library materials, known as colon classification. In the pre-computer era, he did not succeed in replacing the pre-coordinated Dewey Decimal Classification system.[4]

Modern online library catalogs, also known as OPACs, have increasingly adopted faceted search interfaces. Noted examples include the North Carolina State University library catalog (part of the Triangle Research Libraries Network) and the OCLC Open WorldCat system. The CiteSeerX project[5] at the Pennsylvania State University allows faceted search for academic documents and continues to expand into other facets such as table search.

See also

References