Talk:Full-text search

WikiProject Computing
This article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article has not yet received a rating on the project's quality scale.
This article has not yet received a rating on the project's importance scale.

To do

  1. Create new articles so that the ugly red links go away.
  2. Create an information retrieval category and add relevant documents to it.
  3. Check list of querying techniques to make sure it's complete (it isn't).
  4. Explain that search engines use varying methods to express boolean operators, etc.
  5. Incorporate modern information; Alta Vista? All popular search engines today work off the concept of full-text searching. —Preceding unsigned comment added by 12.35.22.253 (talk) 15:45, 15 May 2008 (UTC)

Search algorithm

Google's PageRank algorithm is referred to as a search algorithm, but isn't it a relevance ranking algorithm, not a search algorithm? Nurg 01:58, 19 March 2006 (UTC)

Agreed that it is a relevance ranking algorithm, but in Google's case it is used along with a basic vector-space model to provide a search algorithm. I would rather not make a direct reference, though; it would be better to cite citation analysis, citation index, or bibliometrics. Josh Froelich 14:15, 21 December 2006 (UTC)

MapReduce

AFAIK, MapReduce-based fulltext search as employed by e.g. Google doesn't use a classical fulltext index as described in the article. Rather, it uses massive parallelism to actually scan each document in real time as the search runs (rather than offline when an index is built), finding those that match ("map" stage), and then collects and ranks the results ("reduce" stage). The article makes it sound as if building a fulltext index were the only way to efficiently perform full-text searches over large numbers of documents. Multi io (talk) 02:30, 18 February 2010 (UTC)
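
For illustration only, the scan-at-query-time approach described in the comment above could be sketched roughly as follows. This is a toy, single-process sketch, not a description of how any real search engine works: the corpus `DOCS` and the functions `map_stage`, `reduce_stage`, and `scan_search` are hypothetical names invented for this example, and the "ranking" is just a count of query-term occurrences.

```python
from collections import Counter

# Hypothetical toy corpus: document id -> text (assumption for this sketch).
DOCS = {
    "d1": "full text search scans every word of every document",
    "d2": "an inverted index maps each word to the documents containing it",
    "d3": "search engines rank documents after they search",
}

def map_stage(doc_id, text, query_terms):
    """Scan one document; emit (doc_id, match_count) if every term occurs."""
    counts = Counter(text.lower().split())
    if all(counts[t] > 0 for t in query_terms):
        return [(doc_id, sum(counts[t] for t in query_terms))]
    return []

def reduce_stage(partial_results):
    """Collect the per-document results and rank by match count, highest first."""
    merged = [pair for part in partial_results for pair in part]
    return sorted(merged, key=lambda pair: (-pair[1], pair[0]))

def scan_search(query, docs=DOCS):
    """Run the map stage over every document (no prebuilt index), then reduce."""
    terms = query.lower().split()
    # In a real cluster each map call would run in parallel on a different
    # machine; here they simply run one after another.
    partials = [map_stage(doc_id, text, terms) for doc_id, text in docs.items()]
    return reduce_stage(partials)
```

Calling `scan_search("search")` scans all three documents and returns the matching ones ranked by count. Note that every query touches every document, which is the cost a prebuilt full-text index is meant to avoid.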

search vs. matching

The article does not make clear the distinction between boolean pattern matching in text (string matching) and information retrieval. -- JakobVoss (talk) 10:10, 11 December 2011 (UTC)
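
To make the distinction raised above concrete, here is a minimal sketch under stated assumptions. `string_match` does plain boolean substring matching, while `ranked_retrieval` is a toy information-retrieval scorer using a simple TF-IDF variant. Both function names, the sample `DOCS`, and the exact weighting formula are assumptions made up for this example, not something taken from the article.

```python
import math
from collections import Counter

# Hypothetical sample documents (assumption for this sketch).
DOCS = [
    "the cat sat on the mat",
    "cats and dogs",
    "a dog chased the cat and the other cat",
]

def string_match(pattern, docs):
    """Boolean pattern matching: a document either contains the substring or not."""
    return [i for i, d in enumerate(docs) if pattern in d]

def ranked_retrieval(query, docs):
    """Toy IR: score documents by summed TF-IDF of whole-word query terms."""
    tokenized = [d.split() for d in docs]
    n = len(docs)
    scores = []
    for i, tokens in enumerate(tokenized):
        tf = Counter(tokens)
        score = 0.0
        for term in query.split():
            # Document frequency: how many documents contain the term as a word.
            df = sum(1 for t in tokenized if term in t)
            if df:
                score += tf[term] * math.log(1 + n / df)
        if score > 0:
            scores.append((i, score))
    # Highest score first; ties broken by document position.
    return [i for i, _ in sorted(scores, key=lambda s: (-s[1], s[0]))]
```

With this data, `string_match("cat", DOCS)` returns an unordered yes/no answer that includes "cats and dogs" because the substring occurs there, whereas `ranked_retrieval("cat", DOCS)` works on whole words and orders the results so that the document mentioning "cat" twice comes first.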

"Full" vs. "Free"

This article uses the terms "full text search" and "free text search" interchangeably. Is there a difference? — Preceding unsigned comment added by 76.14.24.156 (talk) 17:42, 5 November 2012 (UTC)

This article uses "full-text" and "full text" interchangeably as well. If there is a distinction between them, it needs to be defined. — Preceding unsigned comment added by 24.67.190.178 (talk) 22:57, 4 August 2017 (UTC)

External links modified

Hello fellow Wikipedians,

I have just added archive links to one external link on Full text search. Please take a moment to review my edit. If necessary, add {{cbignore}} after the link to keep me from modifying it. Alternatively, you can add {{nobots|deny=InternetArchiveBot}} to keep me off the page altogether. I made the following changes:

Cheers. -- cyberbot II (Talk to my owner: Online) 22:11, 9 January 2016 (UTC)

History section needed

This page needs a History section. Full-text search is a candidate for the algorithm with the most far-reaching effects on human activity so far in CS history.

Encyclopedant (talk) 04:38, 28 May 2017 (UTC)

Merge with Information retrieval

This should be merged with Information retrieval (IR). Full-text search is merely an improvement over boolean keyword search that takes some meaning into account by using stemming or lemmatization. I think it sits on a continuum with boolean keyword search at one end and concept search at the other. i⋅am⋅amz3 (talk) 00:46, 18 March 2018 (UTC)
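
As a rough illustration of the difference this comment describes, the sketch below compares exact boolean keyword matching with matching on stemmed words. The stemmer here is a deliberately crude suffix stripper invented for this example (it is not the Porter stemmer or any library's algorithm), and `crude_stem`, `keyword_match`, and `stemmed_match` are hypothetical names.

```python
# Suffixes tried in order; a crude assumption, not a real stemming algorithm.
SUFFIXES = ("ing", "ed", "es", "s")

def crude_stem(word):
    """Strip one common English suffix, keeping at least three characters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

def keyword_match(query, text):
    """Boolean keyword search: every query term must appear as an exact word."""
    words = text.lower().split()
    return all(term.lower() in words for term in query.split())

def stemmed_match(query, text):
    """Same boolean test, but comparing stems instead of surface forms."""
    stems = {crude_stem(w) for w in text.split()}
    return all(crude_stem(term) in stems for term in query.split())
```

For the text "she searched the index", `keyword_match("searching", ...)` fails because the exact word is absent, while `stemmed_match("searching", ...)` succeeds because both "searching" and "searched" reduce to "search".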