Talk:Googlebot

From Wikipedia, the free encyclopedia
Jump to: navigation, search
          This article is of interest to the following WikiProjects:
WikiProject Google (Rated Start-class, High-importance)
WikiProject icon This article is within the scope of WikiProject Google, a collaborative effort to improve the coverage of Google and related topics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
Start-Class article Start  This article has been rated as Start-Class on the project's quality scale.
 High  This article has been rated as High-importance on the project's importance scale.
 
WikiProject Computing / Software (Rated Start-class, Mid-importance)
WikiProject icon This article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
Start-Class article Start  This article has been rated as Start-Class on the project's quality scale.
 Mid  This article has been rated as Mid-importance on the project's importance scale.
Taskforce icon
This article is supported by WikiProject Software.
 

Comments[edit]

"It visits websites that change frequently, according to how frequently they change."

To me this sentence is very confusing. How about something like:

"It visits websites that change frequently, starting with the most frequently changed pages."

Taladon 18:33, 19 September 2006 (UTC)



why don't add IP range of google-bot?—Preceding unsigned comment added by 19:51, 4 May 2006 (talk) 87.1.53.187

Probably because it varies too much from place to place, perhaps?—Preceding unsigned comment added by 70.135.120.225 (talk) 09:37, 17 June 2006


the german external link is broken


Warning: GoogleBot will try to index FTP sites too - especially if there's a HTML link to an FTP resource. It is said that in FTP mode, it will respect a "/robots.txt" file, but such a resource is only defined for web servers. Server admins be warned.... 71.105.105.145 (talk) 00:43, 28 November 2013 (UTC)

How to use google user agent:[edit]

http://diorz.tuxfamily.org/index.php?id=ByPass%20Forum%20Signup —The preceding unsigned comment was added by 217.154.102.195 (talk) 12:39, 30 April 2007 (UTC).

GoogleBot and IFrames[edit]

There are discussions inquiring about how GoogleBots handle IFrames.

I've been told that IFrames cause errors in the Bots 'readings', but I've read (from uncertain sources) that the Bots are fine with IFrames. Does anyone have compelling evidence as to one or the other?

CertGuard 05:27, 21 July 2007 (UTC)

Indexing limits[edit]

I'm thinking of adding a sub topic Indexing Limits, to googlebot are there any references or research done by any good webmasters and also I guess there should be a relation between the indexing and the Google Pagerank. If there are any, kindly refer the same.Ganesh J. Acharya 06:06, 1 August 2007 (UTC)

How often does Googlebot update?[edit]

I've been curious to know how often Googlebot caches a website and once caches does it later on update previous versions. I ask this because I've found a few websites with personal information cached on older versions of the page, yet the most recent is without said information. Some people worry that their info will be made public and curiosity has me wondering how often Googlebot updates a cached page. Thanks 74.195.2.98 17:41, 10 September 2007 (UTC)

Googlebot(s) Discovered[edit]

I was browsing the net on the ole' Google.com search engine when I found a peculiar message stating: "Your IP is: 66.249.73.186" Obviously, this was not my IP. Of course, I had to look up the possessor of this IP; thus, I used Network-tools.com. From here, I simply copy and pasted the IP and voila! I had found the IP to a GoogleBot.

The following message was presented as the entire web domain and sub-domain (host name):

  IP address: 66.249.73.186
  Host name: crawl-66-249-73-186.googlebot.com
  66.249.73.186 is from United States(US) in region North America

I performed a WHOIS trace for further investigation:

  Domain Name: GOOGLEBOT.COM
  Registrar: MARKMONITOR INC.
  Whois Server: whois.markmonitor.com
  Referral URL: http://www.markmonitor.com
  Name Server: NS1.GOOGLE.COM
  Name Server: NS2.GOOGLE.COM
  Name Server: NS3.GOOGLE.COM
  Name Server: NS4.GOOGLE.COM
  Status: clientDeleteProhibited
  Status: clientTransferProhibited
  Status: clientUpdateProhibited
  Updated Date: 06-nov-2006
  Creation Date: 21-oct-1998
  Expiration Date: 20-oct-2011

--It was quite strange to find on MarkMonitor.com's "Strategic Alliances" page they had not listed Google.

Anyways, I was only opening this section to reveal an actual GoogleBot--perhaps you can find more? —Preceding unsigned comment added by 68.114.11.211 (talk) 16:07, 30 May 2008 (UTC)

Amazon et al? — Preceding unsigned comment added by 189.10.157.96 (talk) 05:39, 8 October 2012 (UTC)

Is 'upgrade' an order?[edit]

Already have a modern browser (ItaliC, not Phoenician or syllabic), it's called NoScript, imHo it might be Dante, Kafka or Joyce (GuimarÃEs Rosa I-n P:-ortuguese-br).