User:Citylover/Workinprogress/Eubot

From Wikipedia, the free encyclopedia

Suggestions for the improvement of Eubot[edit]

The text in the 3 subsections below was copied from the page Wikipedia talk:WikiProject Missing encyclopedic articles/Antarctica and then edited on this page here. -- Citylover 12:48, 10 October 2005 (UTC)

Suggestions after Eubot (under the IP-Address 81.207.25.97) redid R1-R4[edit]

Suggestions after Eubot (under the IP-Address 81.207.25.97) created new pages from R1 to R4 (between 14:36, 3 October 2005 (UTC) and 14:40, 3 October 2005 (UTC)):

When robots explore Antarctica...
Hi Eugene, thank you very much for the changes, the articles are now almost perfect. It must be a tremendously complex task to program a bot so sophisticatedly that it is able to convert a text automatically into such a complex and still reliable Wikipedia-article. In my opinion, the Eubot-generated text is now not only quantitatively, but also qualitatively so good that you could just insert it as the official article, the text is now even better than even a good editor who creates a new article by copying/pasting from the Antarctic Gazetteer. Eubot has helped to save a lot of work for all Wikipedians who let Wikipedia explore new land in Antarctica... I have just had a quick and incomplete look now on the R1-page and found almost no conversion mistakes. The only mistakes I found so far were only tiny, isolated ones (but there are probably still more, I only had a quick look so far): -- Citylover 20:51, 3 October 2005 (UTC)

  • R1, Radian Glacier: The conversion of the abbreviation VUWAE into "Victoria University of Wellington Antarctic Expedition (VUWAE)" (see R1, Radian Glacier) seems to be incorrect. Correct would be VUWAE --> "Victoria University of Wellington Antarctic Expedition (VUWAE)" Implemented. -- Citylover 14:56, 9 October 2005 (UTC)
  • R1, Razlom Point: Another mistake (maybe not even a mistake) is in R1, Razlom Point, ice shelf instead of just ice shelf Suggestion can not be implemented because of technical reasons (see below) (why that?, would it not be possible to automatically remove the redundant information again at least after it has been created?) (this suggestion affects text that is only visible when editing.) -- Citylover 14:56, 9 October 2005 (UTC)

* R1, Mount Rath: I also think that it would be better to convert all directions like NNE (those which are neither cardinal directions nor ordinal directions) not to "north-northeast", but to "north-northeast (NNE)" because not everybody knows which direction north-northeast is (see for example R1, Mount Rath). Implemented. -- Citylover 14:56, 9 October 2005 (UTC)

  • I have found more abbreviations:
  • R1, Redfearn: For example in R1, Redfearn, I have seen that after the expression ANARE follows within parentheses the explanation (wikilinked). Maybe this is intentional. If it is not intentional, I suggest that Eubot could standardize it with the other abbreviations by naming the full, wikilinked name first and only then the abbreviation, not wikilinked and in parentheses. Suggestion not implemented because of good reason (explained below). -- Citylover 14:56, 9 October 2005 (UTC)
  • R1, Reeves Peninsula: I also saw that in R1, Reeves Peninsula, the expression "Edward VII Peninsula" was just plain text and not wikilinked, althought the expression "Edward VII Peninsula" exists as an entry of its own in the ANTARCTICA.TXT-file (although I easily understand it given the extremely complex task to program these automated Wikilinks to other not even yet existing Wikipedia-articles on geographical features of Antarctica...) Implemented, well done! -- Citylover 14:56, 9 October 2005 (UTC)
  • The years are still not wikilinked (for example "1905" should be converted to "1905") . In my opinion, Eubot should first extend all 2-digit years (where 17.., 18.., 19.. or 20.. are missing) to 4-digit years and then wikilink all years. -- Citylover 21:23, 3 October 2005 (UTC)
    • Suggestion not implemented because of generally not wikilinking years-only-dates, I can understand it, at least when the full dates are wikilinked -- Citylover 14:56, 9 October 2005 (UTC)
Thanks again for the nice comments.
I've changed Eubot with respect to VUWAE, the new abbreviations (except NZGS) and "Edward VII Peninsula". Regarding that last bit: I've changed the way Eubot recognizes phrases that are to be wikified. It used to be only consecutive words with normal capitalisation; it now recognizes roman numerals as well. There could well be false positives.
I don't think wikilinking directions adds much value; one should not over-wikify articles, not every word needs to be a wikilink.
About [[ice shelf|ice shelf]]: it's just easier to program Eubot that way.

::The expanded name of ANARE just sounds wrong in many contexts, because it is a plural. And the acronym itself, "ANARE", is pronouncable, so it just sounds a lot better. So for this acronym only, I think it just sounds better the other way around.

I'm not going to change 2-digit years into 4-digits. I think that is much better done by hand: for example, "1967-80" should often be changed to "between 1967 and 1980", or "from 1967 to 1980". As for wikilinking the years: this is a controversial subject. There are those who say years should only be wikilinked in the context of a complete date. As it's easy getting false hits on this point, I've decided not to wikilink years. (I may do something with complete dates in the future, though.)

::I have now uploaded S, with the modified script. Eugene van der Pijll 21:41, 3 October 2005 (UTC)

Suggestions after Eubot redid S1-S8[edit]

Suggestions after Eubot created new pages from S1 to S8 (between 21:19, 3 October 2005 (UTC) and 22:05, 3 October 2005 (UTC)):

I have just had a quick look on the first entries of S1. Here are my observations:
-- Citylover 22:52, 3 October 2005 (UTC)

  • General remark: I only thought about wikilinking 4-letter-directions (if there are any, I am not sure) and 3-letter-directions like NNE, WSW (see boxing the compass). I also think that wikilinking 2-letter-directions like SE (southeast) or NE (northeast) or 1-letter-directions like south, north, east, west is too much. But some people do not know where "north-northeast" points to and they might not have the idea or the time to try to look up a 3-letter-direction manually in the search entry field on the left by typing it in there. If the 3-letter-directions are always wikilinked like "north-northeast", people who do not know this direction exactly would be encouraged to use this wikilink to find out where this 3-letter-direction points to. -- Citylover 22:01, 3 October 2005 (UTC)
  • I still consider this to be over-linking.
Unfortunately not implemented. Hopefully it will be implemented... -- Citylover 16:56, 9 October 2005 (UTC)
I have checked AAE again, it is Australasian now, but totally replaced instead of being kept too. I suggest that in addition to "Australasian Antarctic Expedition", "Australasian Antarctic Expedition (AAE)" is written. I strongly suggest that for all expeditions, for example for FrAE, GerAE, SovAE, AAE, the abbreviation is not only written out, but that the abbreviation is kept and written down just behind the written out expression, as this is the normal case with abbreviations and also already implemented at expeditions like for example BANZARE. -- Citylover 16:56, 9 October 2005 (UTC)
  • I added the last one as well; without a capital 'N'. I do not have a preference myself, so I'm following your opinion (below at UK-APC). the wikilinking part of Eubot gets confused if the N is not capitalized...
  • Yes. Implemented. -- Citylover 16:56, 9 October 2005 (UTC)
  • S1, Saavedra Rock: The letter "R." there is strangely placed after the name Eduardo Saavedra (although this is already the case in the original file). Maybe another part of the name of Eduardo Saavedra? Or "Rear chief army delegate" (the latter seems very unlikely though)? -- Citylover 07:02, 4 October 2005 (UTC)
  • My guess is that it is a second family name. "Eduardo Saavedra" looks like a hispanic name to me, in which case he probably has two surnames, like "E. Saavedra Rodrigues". But I cannot be sure, and it's impossible to automate this anyway. Not implemented, it is impossible to implement because it is not even clear what it should mean. -- Citylover 16:56, 9 October 2005 (UTC)
  • Done
No, not implemented, due to technical difficulties (why the difficulties?) -- Citylover 16:56, 9 October 2005 (UTC)
  • S1, Sabine Glacier: The word "glacier" is not wikilinked. In my opinion, "glacier" would be the correct version, at least the first time this word appears as an explanation for the type of geographical feature. -- Citylover 07:02, 4 October 2005 (UTC)
  • Correct. I thought I had already done this. Implemented. -- Citylover 16:56, 9 October 2005 (UTC)
  • S1, Mount Sabine: "Jan." should be written out as "January". This applies to the following 9 months (I have already added them to the list on Wikipedia:WikiProject Missing encyclopedic articles/Antarctica): -- Citylover 07:02, 4 October 2005 (UTC)
    • Jan. --> January Implemented. -- Citylover 16:56, 9 October 2005 (UTC)
    • Feb. --> February Implemented. -- Citylover 16:56, 9 October 2005 (UTC)
    • Mar. --> March Implemented. -- Citylover 16:56, 9 October 2005 (UTC)
    • Apr. --> April Implemented. -- Citylover 16:56, 9 October 2005 (UTC)
    • Aug. --> August Implemented. -- Citylover 16:56, 9 October 2005 (UTC)
    • Sept. (four letters, not "Sep."!) --> September not found yet. -- Citylover 16:56, 9 October 2005 (UTC)
    • Oct. --> October Implemented. -- Citylover 16:56, 9 October 2005 (UTC)
    • Nov. --> November Implemented. -- Citylover 16:56, 9 October 2005 (UTC)
    • Dec. --> December
      -- Citylover 22:52, 3 October 2005 (UTC)
      Implemented, September not found yet. -- Citylover 16:56, 9 October 2005 (UTC)
  • Done. And introduced wikilinking for exact dates.
I have checked it and found all months written out at least one time (except "September", which I still have to find, but I also did not find "Sep." or "Sept." which is a good sign). -- Citylover 16:56, 9 October 2005 (UTC)
  • S1, Mount Sabine: "... relatively snow-free mountain. ..." I suggest to generally wikilink the word "snow" to --> "snow". It is only a suggestion, maybe it is exaggerated. I also have the same suggestion (generally wikilink it) for "ice": "ice" --> "ice" (maybe also an exaggeration). These Wikilinks would be similar to the Wikilinks of ice cap and ice shelf, alhtough they would be Wikilinks linking to more general things. -- Citylover 10:07, 4 October 2005 (UTC)
  • Again, I think this is overdoing it; I wouldn't do it when I'd wikify the information manually, so I won't program Eubot to do it. (But if you are copying the article, and you want to insert the links, you're of course free to do so.)
Unfortunately not implemented. -- Citylover 16:56, 9 October 2005 (UTC)
  • S1, Sabrina Coast: "In recognition of Balleny's effort, Mawson retained the name of the cutter in 95E in the latter part of March 1839." The word "Mawson" stands for Sir Douglas Mawson, one of the most important explorers of Antarctica. Eubot should/could wikilink it, but there is the problem that there are several geographical features named in honor of Mawson as well. Maybe it is possible to program Eubot that it knows all these features and if the word immediately behind "Mawson" is not a word that is one of the words behind the word "Mawson" in the names of those features, then Eubot would know that "Mawson" is the person Mawson and could change it from "Mawson" to "Sir Douglas Mawson" ("Mawson" --> "Sir Douglas Mawson"). -- Citylover 10:28, 4 October 2005 (UTC)
  • It's somewhat tricky to do this automatically; and there are a lot of other names for which this should be done (Byrd, Ronne, Gerlache, Scott, to name a few). I'm not going to do this. Note that in this case just inserting brackets around the name, like so: Mawson, already leads the reader to the correct article, with a short detour, so it's not too hard to insert the link manually. And note that a reference to Mawson as "Sir" is only proper for events after he was knighted; I got the impression that the text is fairly consistent on this point. Not implemented, but it is clear why. -- Citylover 16:56, 9 October 2005 (UTC)
  • S1, Sabrina Valley: "Named in association with Sabrina Ridge (q.v.) by a University of Waikato geological party, 1978-79." (q.v.): QV Quod Vide (Latin: see which) (probably). Maybe Eubot can avoid this strange expression? -- Citylover 10:32, 4 October 2005 (UTC)
  • I'll remove those. (It's not that strange an expression; it is used in encyclopedias to refer to other articles. Kind of like a wikilink.)
Implemented, but I suggest that "q.v." is not deleted, but replaced for example by the words "see there" (an placed just behind the Wikilink Eubot creates anyway) in order not to lose the emphasis on the link provided in the original ANTARCTICA.txt-file. For example: "Named in association with Sabrina Ridge (see there) by a University of Waikato geological party, 1978-79." (although in this special case Sabrina Ridge is not wikilinked because in the same entry, Sabrina Ridge appears alreday before and it is as asual only wikilinked the first time which is OK.) -- Citylover 07:49, 10 October 2005 (UTC)
  • S1, Sabrina Island: "... in the Balleny Islands. Named after the cutter Balleny's schooner the Eliza Scott, in 1839, when the Balleny Islands were discovered. ..." I would prefer a complete sentence. Is it possible to program Eubot that it can automatically complete the sentence (in this case: "Sabrina Island was "), and change the uppercase-letter "N" to the lowercase-letter "n"? Afterwards, the sentence would be: "Sabrina Island was named after the cutter Balleny's schooner the Eliza Scott, in 1839, when the Balleny Islands were discovered." The same applies for example also to S1, Sachse Rocks (but here with "were named", not "was named"). -- Citylover 15:03, 4 October 2005 (UTC)

::::*I prefer to do this by hand. For example because of the singular/plural problem, and because sometimes the text can be better formulated like: "Something mount (...) is a mountain.... It was discovered by ..., and named for...". I know this is annoying; I've had to change all of the entries I've put into wikipedia to form running sentences; but there is no standard way to change these. Besides, this is a change about which I'm not confident that I can make it without errors. Not implemented, due to technical problems. -- Citylover 16:56, 9 October 2005 (UTC)

  • done.
No, not implemented yet. (maybe forgotten or programming mistake) -- Citylover 16:56, 9 October 2005 (UTC)
  • Maybe (but here I am really not sure which version is linguistically more modern and more common) it would be better to write "southern end/side" instead of "south end/side", "northern end/side" instead of "north end/side", "northeastern end/side" instead of "northeast end/side" and so on... For example: S1, Sack Island. -- Citylover 15:27, 4 October 2005 (UTC)
*But unfortunately it's difficult to exactly tell Eubot when to use "south" and when to use "southern"; in the original text, they are both abbreviated "S". I'm not going to try to teach Eubot the trick. Too difficult, and I'm not always sure of the best version myself. Not implemented. It think the problem here is not technical, but it is linguistically unclear which version is better and any version is at least correct. -- Citylover 16:56, 9 October 2005 (UTC)
  • S1, Sacramento Bight: Seeing this article, I would like that "bight" (analogous to "bay", "glacier" or "ice-shelf") is being automatically wikilinked by Eubot with the Wikipedia-article Bight (geography). The same applies also to the entry S1, Saint Andrews Bay: "bight" --> "bight". -- Citylover 15:39, 4 October 2005 (UTC)

::::*Done. Implemented, thank you. -- Citylover 16:56, 9 October 2005 (UTC)

  • As I said before :-), done. Except NZGS, where I don't know the correct link, so I prefer to leave it as it is.

:::I will start uploading the "T" pages soon... Eugene van der Pijll 15:59, 4 October 2005 (UTC)

Suggestions after Eubot redid T1-T3[edit]

Suggestions after Eubot created new pages from T1 to T3 (between 16:05, 4 October 2005 (UTC) at T1 and 16:11, 4 October 2005 (UTC) at T3):

I have just had a quick look on the entries of T1 - T3. Here are my observations:
-- Citylover 16:25, 4 October 2005 (UTC)

You have probably already given the answer, I probably found it above: -- Citylover 11:19, 5 October 2005 (UTC)
I added it as well; without a capital 'N'. I do not have a preference myself, so I'm following your opinion (below at UK-APC). the wikilinking part of Eubot gets confused if the N is not capitalized... -- Eugene van der Pijll (-- Citylover 12:27, 5 October 2005 (UTC))
Not implemented yet (see the discussion above.) -- Citylover 07:34, 10 October 2005 (UTC)
  • T3, Tournachon Peak: A new abbreviation found: FIDASE (I have checked the abbreviation "FIDASE" in the file ANTARCTICA.TXT. It appears quite often there, so the discussion about it makes sense). There seem to be four different possibilities to choose from. I googled all 4 of them which different results in frequency: -- Citylover 16:47, 4 October 2005 (UTC)
    • FIDASE --> "Falkland Islands and Dependencies Aerial Survey Expedition (FIDASE)" (with "and" and withouth "s")
      • 8 (eight) hits when Googling the following way: FIDASE "Falkland Islands and Dependencies Aerial Survey Expedition"
    • FIDASE --> "Falkland Islands and Dependencies Aerial Survey Expeditions (FIDASE)" (with "and" and with "s")
      • 0 (zero) hits when Googling the following way: FIDASE "Falkland Islands and Dependencies Aerial Survey Expeditions"
    • FIDASE --> "Falkland Islands Dependencies Aerial Survey Expedition (FIDASE)" (without "and" and without "s")
      • 7 (seven) hits when Googling the following way: FIDASE "Falkland Islands Dependencies Aerial Survey Expedition"
    • FIDASE --> "Falkland Islands Dependencies Aerial Survey Expeditions (FIDASE)" (without "and" and with "s")
      • 0 (zero) hits when Googling the following way: FIDASE "Falkland Islands Dependencies Aerial Survey Expeditions"
    • Additionally, I googled also the following way (without the abbreviation "FIDASE" itself): "Falkland Islands Dependencies Aerial Survey Expeditions". Interestingly, one single search result was found and led me to the site http://www.spri.cam.ac.uk/friends/polarbytes/36/ where it was written that there have been 2 different FIDAS-Expeditions at two different times (see there). In the ANTARCTICA.TXT-file, only one of the expeditions is mentioned. Both expeditions seem to be (almost?) never mentioned at the same time with the abbrevation FIDASE. Therefore, the abbreviation FIDASE should probably be written out without the letter "s" at the end of the word "expedition(s)". The questios whether the version with the word "and" or withouth the word "and" is chosen from the remaining two written out versions. When you then do an additional Googling-search without the "s", with the "and" and withouth the abbreviation "FIDASE" ("Falkland Islands and Dependencies Aerial Survey Expedition"), then 90 results are found, whereas the Googling-search without the "s", withouth the "and" and with "FIDASE" ("Falkland Islands Dependencies Aerial Survey Expedition") only finds 29 results. Therefore, probably the following version might be the best. I have already added it to the list of abbreviations on the main page Wikipedia:WikiProject Missing encyclopedic articles/Antarctica: -- Citylover 20:36, 4 October 2005 (UTC)
      • FIDASE --> "Falkland Islands and Dependencies Aerial Survey Expedition (FIDASE)"
  • T3, Trojan Range: "... is a mountain range rising to 2,135 m, extending northward from Mount Francais along the east side of Iliad Glacier, Anvers Island, in the Palmer Archipelago. ..." This page has already been created on 10:15, 29 September 2005, probably based on Eubot's work. Warofdreams, the guy who created the page, automatically changed the expression "mountain range" into "mountain range". I suggest that Eubot generally wikilinks the expression "mountain range" when it appears the first time in an entry: -- Citylover 16:47, 4 October 2005 (UTC)
  • T3, Trollkjelen Crevasse Field: "... is a crevasse field about ..." What about wikilinking "crevasse" to "crevasse"? (I suppose that you probably think that this would be overuse.) I could not find the expression "crevasse field" when looking it up. -- Citylover 16:59, 4 October 2005 (UTC)
  • T1, Tay Head: "... is a rocky headland ...": What about generally wikilinking the expression "headland", found for example in this article? -- Citylover 17:07, 4 October 2005 (UTC)
  • T1, Firth of Tay: "... is a sound, ...". I really suggest that at least in the case of the geographical feature type "sound" that the word sound is wikilinked to Sound (geography): -- Citylover 17:11, 4 October 2005 (UTC)
  • T1, Taylor Dome: "... is an elliptical ice dome ...". Here, I suggest that the expression "ice dome" is wikilinked with Ice cap: -- Citylover 17:23, 4 October 2005 (UTC)
  • T1, Taborovskiy Peak ("... is the highest peak, 2,895 m, in the ..."), but also Tambovskaya Peak, Tammann Peaks, Tanglefoot Peak, Tankobu Peak, Tanna Peak, Tanngarden Peaks, Tantalus Peak, Tarakaka Peak, Mount Tararua, Tasch Peak, Tate Peak, Taylor Peak, Te Puna Roimata Peak, Telescope Peak, Teller Peak, Tempest Peak, Tenaza Peak, Tennant Peak, Mount Tennant, Mount Tennent, Tent Peak, Terletskiy Peak, Terminal Peak, Terningen Peak, Terry Peak and Mount Theseus:
    • Maybe the word "peak" could be generally automatically wikilinked by Eubot to "peak" (but maybe you are against it, although in this particular case, the word "peak" appears really very often and while the article Peak is not (yet) excellent, the article provides at least information about what a peak is.). -- Citylover 07:45, 5 October 2005 (UTC)
  • T1, Taff y Bryn: "... is a ridgelike summit capped by dolerite (c. 1,600 m), situated ...":
    • Maybe the word "summit" could be generally automatically wikilinked by Eubot to "summit" (the article Topographical summit, which is quite a good article by the way, but maybe you are against it.). -- Citylover 20:09, 4 October 2005 (UTC)
      • Not implemented yet. -- Citylover 07:52, 10 October 2005 (UTC)
  • T1, Taygete Cone and Thimble Peak: "... is an extinct volcanic cone northeast of Alcyone Cone in the north part of The Pleiades, Victoria Land. ...":
    • Maybe the word "cone" could be generally automatically wikilinked by Eubot to "cone" (the article Volcanic cone) (But maybe you are against it.). -- Citylover 12:04, 5 October 2005 (UTC)
  • T1, Tamarus Valley("... is an ice-free valley lying south of ..."), but also Tarn Valley and Taylor Valley:
    • Maybe the word "valley" could be generally automatically wikilinked by Eubotto "valley" (quite a good article by the way, but maybe you are against it.). -- Citylover 07:45, 5 October 2005 (UTC)
  • T2, Lake Thomas: "... is a meltwater lake that is circumscribed on the northwest and northeast sides by Robertson Ridge and Clark Glacier, in Victoria Land. ...":
    • Maybe the word "lake" could be generally automatically wikilinked by Eubot to "lake" (quite a good article by the way, but maybe you are against it.). -- Citylover 15:08, 5 October 2005 (UTC)
  • T2, Thompson Peninsula: "... is a peninsula 3 miles (4.8 km) long forming the north side of the entrance to Fournier Bay, on Anvers Island in the Palmer Archipelago. ...":
    • Maybe the word "peninsula" could be generally automatically wikilinked by Eubot to "peninsula" (quite a good article by the way, but maybe you are against it.). -- Citylover 15:32, 5 October 2005 (UTC)
      • "cove" was not converted to "cove" by Eubot and "peninsula" (which should have been implemented to "peninsula") was incorrectly converted to "cove" by Eubot (look at the articles). -- Citylover 08:15, 10 October 2005 (UTC)
  • T2, Thomson Cove : "... is a cove 1 mile (1.6 km) wide, lying just north of Etienne Fjord in Flandres Bay, along the west coast of Graham Land. ...":
    • Maybe the word "cove" could be generally automatically wikilinked by Eubot to "cove" (quite a good article by the way, but maybe you are against it.). -- Citylover 16:06, 5 October 2005 (UTC)
      • "cove" was not converted to "cove" by Eubot and "peninsula" (which should have been implemented to "peninsula") was incorrectly converted to "cove" by Eubot (look at the articles). -- Citylover 08:15, 10 October 2005 (UTC)
  • T2, Thomson Massif: "'... is a rock massif in the Aramis Range, Prince Charles Mountains, from which rise Mount Sundberg and Mount McGregor. ...":
    • Maybe the word "massif" could be generally automatically wikilinked by Eubot to "massif" (an article that provides useful information by the way, but maybe you are against it.). -- Citylover 16:28, 5 October 2005 (UTC)
  • T1, Cape Tennyson: One isolated serious mistake found: When the letter "N", "W", "E" and "S" represents the misspelled initials of a name (by omitting the .) and not one of the four cardinal directions (north, west, east or south), Eubot automatically converts the letter to north/west/east/south. Is it somehow possible to program Eubot that it can distinguish when the letter "N", "W", "E" or "S" is part of a misspelled name initial and when "N", "W", "E" or "S" stands for one of the four cardinal directions? The main problem here is that the USGS actually misspelled it (omitting the .), so it is not a mistake of Eubot, but a misspelling in the original ANTARCTICA.TXT-file leading to a mistake of Eubot. -- Citylover 10:50, 5 October 2005 (UTC)
    • Not implemented yet (look at the article T1, Cape Tennyson.). You have explained below that it is technically not possible. I suggest now that you program Eubot just for this special case of the article "Cape Tennyson" (an article that has not been created yet) that it does not make this mistake at least in this article, for example "C. E Borchgrevink" --> "C. E. Borchgrevink" and only afterwards convert the "E" to "east". Like that, we have at least one serious and obvious mistake less. -- Citylover 08:51, 10 October 2005 (UTC)
  • T1, Teksla Island and Teyssier Island (but also generally in other subpages from A1 to Z1): This is again (like in the case just above) a misspelling in the original ANTARCTICA.TXT-file that leads to conversion problems. In the case of Teksla Island, the sentence generated by Eubot is now: "'''Teksla Island''' ({{coor dm|67|27|S|60|56|E|}}) is a largest [[island]] in the [[Colbeck Archipelago]] near the coast of Mac. [[Robertson Land]], 1 [[Mile|mile]] (1.6 [[Kilometer|km]]) north of [[Chapman Ridge]]. ...". Eubot obviously could not regognize that the misspelled "Mac." (which is misspelled consistently and everywhere in the file http://geonames.usgs.gov/stategaz/ANTARCTICA.TXT, even the title of the entry "Mac. Robertson Land" itself is misspelled, see also there) belongs to "Mac. Robertson Land" respectively "Mac Robertson Land". Therefore, I suggest that Eubot first of all corrects everywhere in the file the incorrectly spelled phrase "Mac. Robertson Land" to "Mac Robertson Land" and maybe afterwards, no additional special programming is necessary. Please check also the Wikipedia-article Mac Robertson Land and especially its Whatlinkshere-page (maybe some Wikilinks pointing to some of the REDIRECT-pages of Mac Robertson Land are to be fixed). -- Citylover 11:45, 5 October 2005 (UTC)
  • T1, Taff y Bryn: One abbrevation more found: "c.". I have already added it to the list of abbreviations on the main page Wikipedia:WikiProject Missing encyclopedic articles/Antarctica. This abbreviation may be a tricky one because "Dec.", "Mac.Robertson", "Mac. Robertson" or simply a sentence ending with "... c." must be ruled out (maybe the implementation is technically impossible): -- Citylover 20:20, 4 October 2005 (UTC)
    • c. --> "circa"/"approximately"/"about"/"around"/"roughly"/"vaguely" -- Citylover 20:20, 4 October 2005 (UTC)
      • Not implemented yet. -- Citylover 09:56, 10 October 2005 (UTC)
  • T1, Taylor Dome: More abbreviations found: "... The feature was delineated by the Scott Polar Research Institute (SPRI)-NSF-TUD airborne radio echo sounding program, 1967-79. ...". Eubot could be programmed that it automatically recognizes and converts these 3 abbreviations (like any other abbrevation) in the sentence above into the following sentence (by the way, the phrase "SPRI-NSF-TUD" appears quite often in the whole ANTARCTICA.TXT-file): "... The feature was delineated by the Scott Polar Research Institute (SPRI)-National Science Foundation (NSF)-Technical University of Denmark (TUD) airborne radio echo sounding program, 1967-79. ...". The 3 new abbreviations, which I have already added to the list of abbreviations on the main page Wikipedia:WikiProject Missing encyclopedic articles/Antarctica, are: -- Citylover 17:23, 4 October 2005 (UTC)
  • I just found a new abbreviation when browsing the ANTARCTICA.TXT-file (http://geonames.usgs.gov/stategaz/ANTARCTICA.TXT). It seems that Eubot already knows the abbrevtiation "RARE" (see for example T1, Table Nunatak, Mount Tchaikovsky, Mount Tanniel, Terminal Island and Themis Nunatak). Nevertheless, I have added it to the list of abbreviations on the main page Wikipedia:WikiProject Missing encyclopedic articles/Antarctica: -- Citylover 20:50, 4 October 2005 (UTC)
  • T1: Table Nunatak: New abbreviation found: USAAF. I already added it to the list of abbreviations on the main page Wikipedia:WikiProject Missing encyclopedic articles/Antarctica: -- Citylover 10:23, 5 October 2005 (UTC)
  • More new abbreviations found in the textfile ANTARCTICA.TXT (http://geonames.usgs.gov/stategaz/ANTARCTICA.TXT). I already added them to the list of abbreviations on the main page Wikipedia:WikiProject Missing encyclopedic articles/Antarctica: -- Citylover 10:23, 5 October 2005 (UTC)
Not an important abbreviation because it only appears twice. It could be kept as an abbreviation, but it was already converted by Eubot, see N2. "USMS" could also mean "United States Meteorological Service" but I am not sure myself what to think about this possibility: Is it sure that "United States Maritime Service" is the correct one? -- Citylover 10:49, 10 October 2005 (UTC)
The abbreviation "USNS" is not implemented by Eubot for now (for example in T2). I suggest that we could dare to implement "USNS" into Eubot because "U.S. Navy SEALSs (USNS)" seems to be quite unlikely in comparison to "United States Naval Ship (USNS)". (Maybe there might still be other meanings I did not find.) -- Citylover 11:13, 10 October 2005 (UTC)


  • Four general suggestions:
    • I have just added two lists for the abbreviations "DI" and "SGS" whose meaning seems to be still unknown. The idea was that the expressions which are the most likely ones are converted into the italic font format, the least likely ones are written in a "deleted" way (like this) and maybe someone will finally find out what this abbreviation really means. I have put these two lists on the main page (Wikipedia:WikiProject Missing encyclopedic articles/Antarctica) for now, but maybe it would be better to move them elsewhere (either to this talk page here or to a page to be newly created). -- Citylover 14:36, 5 October 2005 (UTC)
    • General suggestion, concerning the subpages ranging from A1, A2, A3, A4 ... to Z1: What about adding a Wikilink not only to Wikipedia:WikiProject Missing encyclopedic articles/Antarctica (as it is already now), but also to this talk page here? For example a text like this one could be written there: "== Before creating a new Wikipedia-article based on the information below, please read [[Wikipedia:WikiProject Missing encyclopedic articles/Antarctica]] and have also a look on its talk page, [[Wikipedia talk:WikiProject Missing encyclopedic articles/Antarctica]]==". Like that, perhaps more people could see our discussion here and would probably make suggestions or report mistakes themselves for Eubot. -- Citylover 18:54, 4 October 2005 (UTC)
    • I suggest that from now on, Eubot automatically recreates not just 3-6 subpages, but all subpages from Wikipedia:WikiProject Missing encyclopedic articles/Antarctica/A1 to Wikipedia:WikiProject Missing encyclopedic articles/Antarctica/Z1 because if other people are creating new articles now, they have already the new Eubot-created version available, the pages are already more standardized and less menial human work has to be done. -- Citylover 07:45, 5 October 2005 (UTC)
    • After all or at least some of the suggestions above have been implemented and all subpages from Wikipedia:WikiProject Missing encyclopedic articles/Antarctica/A1 to Wikipedia:WikiProject Missing encyclopedic articles/Antarctica/Z1 have been updated by Eubot, I suggest that the new Antarctica-related tasks that Eubot is able to perform are added to the description of Eubot on the page Wikipedia:Bots, for example with a sentence like: "[[User:Eubot|Eubot]] is updating statistics on [[Municipalities in the Netherlands|Dutch municipalities]] and has automatically pre-generated article entries about geographical features in [[Antarctica]], based on a [http://geonames.usgs.gov/stategaz/ANTARCTICA.TXT TXT-file] provided by the [[USGS]] (see also [[Wikipedia:WikiProject Missing encyclopedic articles/Antarctica]] and [[Wikipedia talk:WikiProject Missing encyclopedic articles/Antarctica]])." -- Citylover 10:23, 5 October 2005 (UTC)

I've taken into account most of your suggestions up to here, and am now uploading all of the pages, starting with U-Z. The uploading quite often goes wrong (server cannot be contacted), in which case the upload-program aborts, so it's a bit time-consuming; that's why I only updated in small amounts. After this complete upload, I'll just collect your suggestions for a few days before uploading anything again. (Unless something goes completely wrong, of course). Eugene van der Pijll 17:36, 5 October 2005 (UTC)

A few comments:

  • "mountain range" instead of "mountain range" cannot be done easily, and it's not that important, in my opinion.
Not implemented. I only thought about it because the article Mountain range is very good and there is even an own, separate category ([[Category:Mountain ranges of Antarctica]]). But if it is technically too difficult, it cannot be done. -- Citylover 17:59, 5 October 2005 (UTC)
  • When an initial "S." in a name is misspelled as "S", it gets converted to "south"; I don't think I can do anything about that, unfortunately.
    • Not implemented yet (look at the article T1, Cape Tennyson.). You say that it is technically not possible. I suggest now that you program Eubot just for this special case of the article "Cape Tennyson" (an article that has not been created yet) that it does not make this mistake at least in this article, for example "C. E Borchgrevink" --> "C. E. Borchgrevink" and only afterwards convert the "E" to "east". Like that, we have at least one serious and obvious mistake less. -- Citylover 12:22, 10 October 2005 (UTC)
  • It seems that "Mac. Robertson Land" is the official name! It was named after MacPherson Robertson, first name: "MacPherson". Apparently, his first name is abbreviated to "Mac.", with the period. See for example "Mac. Pherson school", which was named after the same man: http://www.macrob.vic.edu.au/guiding_values_history.htm .
  • "c." followed by a space and a number is alreay converted to "about".
Not implemented: Strangely, exactly this is not the case at T1, Taff y Bryn: "... dolerite (c. 1,600 m), situated ...". -- Citylover 18:10, 5 October 2005 (UTC)
  • the "the" problem: I've thought about doing this earlier, but I don't think I can do that automatically: sometimes the "the" is already there; sometimes there should not be a "the", e.g. if something is said about "a US-ANAC survey" or something like that (not a real example, but similar phrases are in the text.
Eugene van der Pijll 17:52, 5 October 2005 (UTC)
Thanks a lot. What about uploading all S and all T too with the new Eubot-configuration in order to see what worked out and what didn't? -- Citylover 17:55, 5 October 2005 (UTC)
Generally, I think that now the really big "forgotten" abbreviations have all been detected and really big mistakes (except for some Wikilinks not regognized by Eubot to geographical features within the texts, although they are actually not even mistakes) have been found and discussed. -- Citylover 18:02, 5 October 2005 (UTC)