Jump to content

Searx: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
No edit summary
Violarulez (talk | contribs)
I think the article should specifically say "open source" at some point. Also added in that, aside from public instances, users can run their own searx.
Line 17: Line 17:
| website = {{URL|https://searx.me/}}
| website = {{URL|https://searx.me/}}
}}
}}
'''Searx''' ({{IPAc-en|s|ɜːr|k|s}}; stylized as '''searX''') is a [[Free software|free]] [[metasearch engine]],<ref>{{cite web |url=http://www.linux-magazine.com/Issues/2017/194/Charly-s-Column-Searx |title=Peppered with Hits » Linux Magazine |last=Kühnast |first=Charly |website=Linux Magazine |language=en-US |access-date=2017-08-31}}</ref> available under the [[GNU Affero General Public License |GNU Affero General Public License version 3]], with the aim of protecting the [[privacy]] of its users.<ref>{{cite web|url=https://nakedsecurity.sophos.com/2017/08/10/self-hosted-search-option-is-a-new-approach-to-bursting-the-filter-bubble/|title=Self-hosted search option is a new approach to bursting the filter bubble|last=Bradbury|first=Danny|date=August 10, 2017|website=Naked Security|archive-url=https://web.archive.org/web/20170904154549/https://nakedsecurity.sophos.com/2017/08/10/self-hosted-search-option-is-a-new-approach-to-bursting-the-filter-bubble/|archive-date=September 4, 2017|url-status=live|access-date=August 30, 2017}}</ref><ref>{{cite web|url=https://www.maketecheasier.com/best-search-engine-for-privacy/|title=What Is the Best Search Engine for Privacy?|last=Zak|first=Robert|date=April 3, 2017|work=Make Tech Easier|archive-url=https://web.archive.org/web/20180703190837/https://www.maketecheasier.com/best-search-engine-for-privacy/|archive-date=July 3, 2018|url-status=live|access-date=August 30, 2017}}</ref><ref>{{cite web|url=http://tuxdiary.com/2014/12/22/searx/|title=Searx: self-hosted web metasearch engine|last=Sonmez|first=John|date=December 22, 2014|website=Tuxdiary|access-date=2017-08-31}}</ref> To this end, Searx does not share users' [[IP address]]es or search history with the [[search engine]]s from which it gathers results. [[HTTP cookie#Tracking|Tracking cookies]] served by the search engines are blocked, preventing [[filter bubble|user-profiling-based results modification]].<ref>{{cite news |url=http://acc.cv.ua/svit/25296-yak-zakhistiti-svoji-dani-v-interneti-11-korisnikh-dodatkiv |title=Як захистити свої дані в інтернеті: 11 корисних додатків |last=administrator |first=Acc |work=Новини АСС |access-date=2017-08-31 |language=uk-ua}}</ref><ref>{{cite news |url=http://t3n.de/news/searx-konfigurierbare-suchmaschine-729236/ |title=Searx: Die konfigurierbare Suchmaschine, die deine Privatsphäre respektiert |work=t3n News |access-date=2017-08-31 |language=de}}</ref> By default, Searx queries are submitted via [[HTTP POST]],{{efn|Except on [[Chromium (web browser)|Chromium]]-based web browsers, where [[HTTP GET]] requests are used instead.<ref name="about.html" />}} to prevent users' query keywords from appearing in webserver logs.<ref name="about.html" /><ref name=preferences /> Searx was inspired by the [[Seeks]] project,<ref name="about.html">{{cite web |url=https://github.com/searx/searx/blob/master/searx/templates/__common__/about.html |title=about.html |website=GitHub |language=en |access-date=2020-05-23}}</ref> though it does not implement Seeks' [[peer-to-peer]] user-sourced results [[Ranking (information retrieval)|ranking]].
'''Searx''' ({{IPAc-en|s|ɜːr|k|s}}; stylized as '''searX''') is a [[Free software|free]] and [[open-source]] [[metasearch engine]],<ref>{{cite web |url=http://www.linux-magazine.com/Issues/2017/194/Charly-s-Column-Searx |title=Peppered with Hits » Linux Magazine |last=Kühnast |first=Charly |website=Linux Magazine |language=en-US |access-date=2017-08-31}}</ref> available under the [[GNU Affero General Public License |GNU Affero General Public License version 3]], with the aim of protecting the [[privacy]] of its users.<ref>{{cite web|url=https://nakedsecurity.sophos.com/2017/08/10/self-hosted-search-option-is-a-new-approach-to-bursting-the-filter-bubble/|title=Self-hosted search option is a new approach to bursting the filter bubble|last=Bradbury|first=Danny|date=August 10, 2017|website=Naked Security|archive-url=https://web.archive.org/web/20170904154549/https://nakedsecurity.sophos.com/2017/08/10/self-hosted-search-option-is-a-new-approach-to-bursting-the-filter-bubble/|archive-date=September 4, 2017|url-status=live|access-date=August 30, 2017}}</ref><ref>{{cite web|url=https://www.maketecheasier.com/best-search-engine-for-privacy/|title=What Is the Best Search Engine for Privacy?|last=Zak|first=Robert|date=April 3, 2017|work=Make Tech Easier|archive-url=https://web.archive.org/web/20180703190837/https://www.maketecheasier.com/best-search-engine-for-privacy/|archive-date=July 3, 2018|url-status=live|access-date=August 30, 2017}}</ref><ref>{{cite web|url=http://tuxdiary.com/2014/12/22/searx/|title=Searx: self-hosted web metasearch engine|last=Sonmez|first=John|date=December 22, 2014|website=Tuxdiary|access-date=2017-08-31}}</ref> To this end, Searx does not share users' [[IP address]]es or search history with the [[search engine]]s from which it gathers results. [[HTTP cookie#Tracking|Tracking cookies]] served by the search engines are blocked, preventing [[filter bubble|user-profiling-based results modification]].<ref>{{cite news |url=http://acc.cv.ua/svit/25296-yak-zakhistiti-svoji-dani-v-interneti-11-korisnikh-dodatkiv |title=Як захистити свої дані в інтернеті: 11 корисних додатків |last=administrator |first=Acc |work=Новини АСС |access-date=2017-08-31 |language=uk-ua}}</ref><ref>{{cite news |url=http://t3n.de/news/searx-konfigurierbare-suchmaschine-729236/ |title=Searx: Die konfigurierbare Suchmaschine, die deine Privatsphäre respektiert |work=t3n News |access-date=2017-08-31 |language=de}}</ref> By default, Searx queries are submitted via [[HTTP POST]],{{efn|Except on [[Chromium (web browser)|Chromium]]-based web browsers, where [[HTTP GET]] requests are used instead.<ref name="about.html" />}} to prevent users' query keywords from appearing in webserver logs.<ref name="about.html" /><ref name=preferences /> Searx was inspired by the [[Seeks]] project,<ref name="about.html">{{cite web |url=https://github.com/searx/searx/blob/master/searx/templates/__common__/about.html |title=about.html |website=GitHub |language=en |access-date=2020-05-23}}</ref> though it does not implement Seeks' [[peer-to-peer]] user-sourced results [[Ranking (information retrieval)|ranking]].


Each search result is given as a direct link to the respective site, rather than a [[Internet tracking|tracked redirect link]] as used by [[Google Search|Google]]. In addition, when available, these direct links are accompanied by cached and/or proxied links that allow viewing results pages without actually visiting the sites in question. The cached links point to saved versions of a page on the [[Wayback Machine]], while the proxied links allow viewing the current live page via a Searx-based [[Web Proxy|web proxy]]. In addition to the general search, the engine also features tabs to search within specific domains: files, images, [[Information technology|IT]], maps, music, news, science, social media, and videos.<ref>{{cite news |url=https://www.techspot.com/guides/1292-web-security-anonymizer-primer/ |title=A Primer on Staying Secure and Anonymous on the Dark Web |work=TechSpot |access-date=2017-08-30 |language=en-us}}</ref><ref name="golem.de">{{cite news |url=https://www.golem.de/news/searx-0-10-0-die-eigene-suchmaschine-auf-einem-raspberry-pi-1609-123094.html |title=Searx 0.10.0: Die eigene Suchmaschine auf einem Raspberry Pi |trans-title=Searx 0.10.0: Your own search engine on a Raspberry Pi |author=Von Jan Weisensee |access-date=2017-08-31 |language=de-DE |archive-url=https://web.archive.org/web/20200807015803/https://www.golem.de/news/searx-0-10-0-die-eigene-suchmaschine-auf-einem-raspberry-pi-1609-123094.html |archive-date=2020-08-07 |date=2016-09-07 |website=[[golem.de]]}}</ref>
Each search result is given as a direct link to the respective site, rather than a [[Internet tracking|tracked redirect link]] as used by [[Google Search|Google]]. In addition, when available, these direct links are accompanied by cached and/or proxied links that allow viewing results pages without actually visiting the sites in question. The cached links point to saved versions of a page on the [[Wayback Machine]], while the proxied links allow viewing the current live page via a Searx-based [[Web Proxy|web proxy]]. In addition to the general search, the engine also features tabs to search within specific domains: files, images, [[Information technology|IT]], maps, music, news, science, social media, and videos.<ref>{{cite news |url=https://www.techspot.com/guides/1292-web-security-anonymizer-primer/ |title=A Primer on Staying Secure and Anonymous on the Dark Web |work=TechSpot |access-date=2017-08-30 |language=en-us}}</ref><ref name="golem.de">{{cite news |url=https://www.golem.de/news/searx-0-10-0-die-eigene-suchmaschine-auf-einem-raspberry-pi-1609-123094.html |title=Searx 0.10.0: Die eigene Suchmaschine auf einem Raspberry Pi |trans-title=Searx 0.10.0: Your own search engine on a Raspberry Pi |author=Von Jan Weisensee |access-date=2017-08-31 |language=de-DE |archive-url=https://web.archive.org/web/20200807015803/https://www.golem.de/news/searx-0-10-0-die-eigene-suchmaschine-auf-einem-raspberry-pi-1609-123094.html |archive-date=2020-08-07 |date=2016-09-07 |website=[[golem.de]]}}</ref>


There are many public user-run Searx instances,<ref name="Searx instances">{{cite web |url=https://searx.space/ |title=Public Searx instances |website=searx.space}}</ref> some of which are available as [[Tor (anonymity network)|Tor]] [[hidden service]]s.<ref name="Searx instances"/> Meta-Searx instances can also be used to forward the search query to a random public instance.<ref name="Searx instances" /> A public [[API]] is available for Searx,<ref>{{cite web |url=https://searx.github.io/searx/dev/search_api.html |title=Search API — searx 0.12.0 documentation |website=searx.github.io |access-date=2017-08-31}}</ref><ref>{{cite web|last=Seitz|first=Justin|date=2017-04-18|title=Building a Keyword Monitoring Pipeline with Python, Pastebin and Searx {{!}} Automating OSINT Blog|url=http://www.automatingosint.com/blog/2017/04/building-a-keyword-monitoring-pipeline-with-python-pastebin-and-searx/|access-date=2017-08-31|website=www.automatingosint.com|language=en-US}}</ref> as well as [[Firefox]] search provider plugins.<ref>{{cite web |url=https://addons.mozilla.org/en-US/firefox/search/?q=searx |title=Search results for "searx" – Add-ons for Firefox (en-US) |website=addons.mozilla.org |access-date=2019-07-15}}</ref>
Users can run private instances of Searx on their own [[localhost|computer]], but there are also many public, user-run, Searx instances,<ref name="Searx instances">{{cite web |url=https://searx.space/ |title=Public Searx instances |website=searx.space}}</ref> some of which are available as [[Tor (anonymity network)|Tor]] [[hidden service]]s.<ref name="Searx instances"/> Meta-Searx instances can also be used to forward the search query to a random public instance.<ref name="Searx instances" /> A public [[API]] is available for Searx,<ref>{{cite web |url=https://searx.github.io/searx/dev/search_api.html |title=Search API — searx 0.12.0 documentation |website=searx.github.io |access-date=2017-08-31}}</ref><ref>{{cite web|last=Seitz|first=Justin|date=2017-04-18|title=Building a Keyword Monitoring Pipeline with Python, Pastebin and Searx {{!}} Automating OSINT Blog|url=http://www.automatingosint.com/blog/2017/04/building-a-keyword-monitoring-pipeline-with-python-pastebin-and-searx/|access-date=2017-08-31|website=www.automatingosint.com|language=en-US}}</ref> as well as [[Firefox]] search provider plugins.<ref>{{cite web |url=https://addons.mozilla.org/en-US/firefox/search/?q=searx |title=Search results for "searx" – Add-ons for Firefox (en-US) |website=addons.mozilla.org |access-date=2019-07-15}}</ref>


In 2021, a fork of Searx, known as SearXNG, was created.<ref>{{Citation|title=searxng/searxng|date=2021-10-12|url=https://github.com/searxng/searxng|publisher=SearXNG|access-date=2021-10-12}}</ref>
In 2021, a fork of Searx, known as SearXNG, was created.<ref>{{Citation|title=searxng/searxng|date=2021-10-12|url=https://github.com/searxng/searxng|publisher=SearXNG|access-date=2021-10-12}}</ref>

Revision as of 21:52, 14 March 2022

Searx
Developer(s)Adam Tauber (alias asciimoo)[1]
Initial releaseJanuary 22, 2014; 10 years ago (2014-01-22)[2]
Stable release
1.1.0[3] Edit this on Wikidata / 7 August 2022; 2 years ago (7 August 2022)
Repositorygithub.com/searx/searx
Written inPython
TypeMetasearch engine
LicenseAGPL-3.0-or-later
Websitesearx.me

Searx (/sɜːrks/; stylized as searX) is a free and open-source metasearch engine,[4] available under the GNU Affero General Public License version 3, with the aim of protecting the privacy of its users.[5][6][7] To this end, Searx does not share users' IP addresses or search history with the search engines from which it gathers results. Tracking cookies served by the search engines are blocked, preventing user-profiling-based results modification.[8][9] By default, Searx queries are submitted via HTTP POST,[a] to prevent users' query keywords from appearing in webserver logs.[10][11] Searx was inspired by the Seeks project,[10] though it does not implement Seeks' peer-to-peer user-sourced results ranking.

Each search result is given as a direct link to the respective site, rather than a tracked redirect link as used by Google. In addition, when available, these direct links are accompanied by cached and/or proxied links that allow viewing results pages without actually visiting the sites in question. The cached links point to saved versions of a page on the Wayback Machine, while the proxied links allow viewing the current live page via a Searx-based web proxy. In addition to the general search, the engine also features tabs to search within specific domains: files, images, IT, maps, music, news, science, social media, and videos.[12][13]

Users can run private instances of Searx on their own computer, but there are also many public, user-run, Searx instances,[14] some of which are available as Tor hidden services.[14] Meta-Searx instances can also be used to forward the search query to a random public instance.[14] A public API is available for Searx,[15][16] as well as Firefox search provider plugins.[17]

In 2021, a fork of Searx, known as SearXNG, was created.[18]

Search engines and other settings

Across all categories, Searx can fetch search results from about 82 different engines. This includes major search engines and site-specific searches like Bing, Google, Reddit, Wikipedia, Yahoo, and Yandex.[19] The engines used for each search category can be set via a "preferences" interface, and these settings will be saved in a cookie in the user's web browser, rather than on the server side, since for privacy reasons, Searx does not implement a user login model. Other settings such as the search interface language and the search results language (over 20 languages are available) can be set the same way.[11]

In addition to the preferences cookie, it is possible on each query to modify the engines used, search categories selected, and/or languages to search in by specifying one or more of the following textual search operators before the search keywords.[20]

  • !category — Search the specified category instead of the default ones.
  • ?category — Search the specified category in addition to the default ones.
  • !engine — Search the specified engine instead of the default ones.
  • ?engine — Search the specified engine in addition to the default ones.
  • :language — Search for results in the specified language instead of the default one.

The ! and ? operators can be specified more than once to select multiple categories or engines, for example !google !deviantart ?images :japanese cow.

Instances

Any user may run their own instance of Searx,[21][22][23] which can be done to maximize privacy, to avoid congestion on public instances, to preserve customized settings even if browser cookies are cleared, to allow auditing of the source code being run, etc.[24][25][26] Users may include their Searx instances on the editable list of all public instances, or keep them private.[19][24] It is also possible to add custom search engines to a self-hosted instance that are not available on the public instances.[27]

Another reason to use different Searx instances, and/or to run one's own, is that as of 2019, Google has begun to block some instances, including some of the IP addresses used by searx.me (former instance run by the developer), from querying it, resulting in a "google (unexpected crash: CAPTCHA required)" error.[28] In response, some instances have been modified to silently skip trying to search with Google, even when it's the only engine specified.[29][30]

See also

Footnotes

  1. ^ Except on Chromium-based web browsers, where HTTP GET requests are used instead.[10]

References

  1. ^ "asciimoo (Adam Tauber)". GitHub.
  2. ^ Tauber, Adam. "searx: A privacy-respecting, hackable metasearch engine" – via PyPI.
  3. ^ "Release v1.1.0".
  4. ^ Kühnast, Charly. "Peppered with Hits » Linux Magazine". Linux Magazine. Retrieved 2017-08-31.
  5. ^ Bradbury, Danny (August 10, 2017). "Self-hosted search option is a new approach to bursting the filter bubble". Naked Security. Archived from the original on September 4, 2017. Retrieved August 30, 2017.
  6. ^ Zak, Robert (April 3, 2017). "What Is the Best Search Engine for Privacy?". Make Tech Easier. Archived from the original on July 3, 2018. Retrieved August 30, 2017.
  7. ^ Sonmez, John (December 22, 2014). "Searx: self-hosted web metasearch engine". Tuxdiary. Retrieved 2017-08-31.
  8. ^ administrator, Acc. "Як захистити свої дані в інтернеті: 11 корисних додатків". Новини АСС (in Ukrainian). Retrieved 2017-08-31.
  9. ^ "Searx: Die konfigurierbare Suchmaschine, die deine Privatsphäre respektiert". t3n News (in German). Retrieved 2017-08-31.
  10. ^ a b c "about.html". GitHub. Retrieved 2020-05-23.
  11. ^ a b "preferences - searx.me". searx.me. Archived from the original on 2018-03-20. Retrieved 2017-09-25.
  12. ^ "A Primer on Staying Secure and Anonymous on the Dark Web". TechSpot. Retrieved 2017-08-30.
  13. ^ Von Jan Weisensee (2016-09-07). "Searx 0.10.0: Die eigene Suchmaschine auf einem Raspberry Pi" [Searx 0.10.0: Your own search engine on a Raspberry Pi]. golem.de (in German). Archived from the original on 2020-08-07. Retrieved 2017-08-31.
  14. ^ a b c "Public Searx instances". searx.space.
  15. ^ "Search API — searx 0.12.0 documentation". searx.github.io. Retrieved 2017-08-31.
  16. ^ Seitz, Justin (2017-04-18). "Building a Keyword Monitoring Pipeline with Python, Pastebin and Searx | Automating OSINT Blog". www.automatingosint.com. Retrieved 2017-08-31.
  17. ^ "Search results for "searx" – Add-ons for Firefox (en-US)". addons.mozilla.org. Retrieved 2019-07-15.
  18. ^ searxng/searxng, SearXNG, 2021-10-12, retrieved 2021-10-12
  19. ^ a b Tauber, Adam (2017-08-30). "searx: Privacy-respecting metasearch engine". Retrieved 2017-08-31.
  20. ^ "Search syntax — searx 0.12.0 documentation". searx.github.io. Retrieved 2017-08-30.
  21. ^ "My Searx instance - Logan Marchione". Logan Marchione. 2015-10-18. Retrieved 2017-08-31.
  22. ^ "New fast and private searx instance in Europe for private websearches • r/privacy". Reddit. Retrieved 2017-08-31.
  23. ^ "How to setup your own privacy respecting search engine in a couple of hours with a free ssl certificate • r/privacytoolsIO". Reddit. Retrieved 2017-08-31.
  24. ^ a b "Why use a private instance? — searx 0.12.0 documentation". searx.github.io. Retrieved 2017-08-31.
  25. ^ "Privacy advantages of running my own searx instance • r/privacytoolsIO". Reddit. Retrieved 2017-08-31.
  26. ^ "Searx.me is overloaded. Privacytools.io should link to just the instances page or randomize. • r/privacytoolsIO". Reddit. Retrieved 2017-08-31.
  27. ^ "Engine overview — searx 0.12.0 documentation". searx.github.io. Retrieved 2017-08-31.
  28. ^ "Google Captcha". GitHub issues. 2016-10-12. Retrieved 2020-05-23.
  29. ^ "!google cow - searx". searx.info. Retrieved 2019-07-15. Sorry! we didn't find any results. Please use another query or search in more categories.
  30. ^ "!google cow - searx". search.disroot.org. Retrieved 2019-07-15. Sorry! we didn't find any results. Please use another query or search in more categories.