Jump to content

Wikipedia talk:AutoWikiBrowser/Typos

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 128.138.43.231 (talk) at 04:10, 29 March 2011 (→‎bilbliography > bibliography: new section). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

hda/had Error

Hello! Not sure if the regex can be adjusted to actually catch this, but "hda" should not correct to "had" if it's in a path, e.g. "/dev/hda1/foo/" :) Avicennasis @ 03:47, 18 Adar I 5771 / 22 February 2011 (UTC)

Thanks. Which rule triggered that change (found in the Typos tab on AWB)? There might be a way to identify the path in the Wikimarkup as not-English. (Also, additions to Talk pages are not minor edits. WP:MINOR. Editors who are ignoring minor edits won't see your new question.) -- JHunterJ (talk) 11:47, 22 February 2011 (UTC)[reply]
All such uses should probably go in code tags. Rjwilmsi 13:55, 23 February 2011 (UTC)[reply]
Ah. Must've forgot to uncheck the minor box last time. :)
How about this: "english" to "English" unless in a link, e.g., http://www.israelradio.org/english.html ? I've encountered this as well. Avicennasis @ 03:07, 30 Adar I 5771 / 6 March 2011 (UTC)
I did a search for "http://www.israelradio.org/english.html" and found 5 articles on Wikipedia. I processed them all through AWB, and AWB didn't try to change "english" to "English" on any of the articles. On what article did you encounter this issue? GoingBatty (talk) 03:21, 6 March 2011 (UTC)[reply]
That was just an example - I forget the actual link. I encountered this on Wikibooks the other day. Since they don't have any AWB pages, the RegEx is loaded from here. I'll try to rescan and find out exactly what the link was. Avicennasis @ 04:35, 30 Adar I 5771 / 6 March 2011 (UTC)
Found an example on Wikibooks, the page is here, and the "typo" it finds is
  • www.stimulus.virginia.gov > www.stimulus.Virginia.gov
Not sure if that helps at all, or if it can be avoided. But there's an example of what I mean. Avicennasis @ 06:41, 30 Adar I 5771 / 6 March 2011 (UTC)
 Done - updated the "Virginia" rule with this edit. GoingBatty (talk) 14:49, 6 March 2011 (UTC)[reply]

Gaddafi

We should think about a rule for Gaddafi, because there seem to be dozens of variants out there. Just to establish some consistency, like we do with several other names. --bender235 (talk) 12:56, 23 February 2011 (UTC)[reply]

We don't establish consistency, but we do correct misspellings. Which of the dozens of variants are misspellings, and which are acceptable variants? The misspellings could be corrected to a consistent spelling (e.g., Gaddafi), but the variants should not be. Gaddafi#Name cites http://www.straightdope.com/columns/read/513/how-are-you-supposed-to-spell-muammar-gaddafi-khadafy-qadhafi; I would think those variations should be left unchanged. -- JHunterJ (talk) 13:37, 23 February 2011 (UTC)[reply]
But why not? The article's name is Muammar al-Gaddafi, and that should be the only variant on Wikipedia. Just like we have a rule for "Beijing", or "Beirut", although these also have a number of variants. --bender235 (talk) 19:27, 23 February 2011 (UTC)[reply]
I think that while we can try to be consistent with our own use, his name has so many documented spellings (as shown in the article), I imagine it would be a nightmare to make exceptions for quotations from commentators, older sources, etc. I agree with JHunter, if someone has used a recognized missspelling (ie no one uses it anywhere), revert to our form, otherwise leave other forms as is.66.80.6.163 (talk) 20:01, 23 February 2011 (UTC)(mercurywoodrose)[reply]
Color is also at "color", but we can't "correct" usages of "colour", a valid spelling of the same topic. Similarly, we can't correct "Brontosaurus" to "Apatosaurus"; WP might have a preferred spelling, but the other is not a typo or misspelling. Moammar Qaddafi is (I assume) a valid spelling of the same topic as "Muammar al-Gaddafi", not a typo or misspelling, so we don't "correct" it. If an individual AWB user wants to use it to identify non-controversial replacement opportunities and use the (non-typo) replacement tools, that would work. -- JHunterJ (talk) 21:47, 23 February 2011 (UTC)[reply]

billionaire

AWB did not catch "bilionar" as misspelling of "billionaire" here. Please add a rule. --bender235 (talk) 18:06, 25 February 2011 (UTC)[reply]

It does not appear to be a common typo.[1] -- JHunterJ (talk) 20:14, 25 February 2011 (UTC)[reply]
Still we could add the rule in case the typo occurs again. Wikipedia is an evolving resource. --bender235 (talk) 19:07, 26 February 2011 (UTC)[reply]
"To do: Remove rare words. Note that no matches today does not mean a rule is rare, since another user may have used the rule to fix many articles yesterday." Each rule consumes some resources, and the goal is not to have 100% of possible typos included at the expense of being unable to run the tool to fix any of them. -- JHunterJ (talk) 21:43, 26 February 2011 (UTC)[reply]
If a word is rare why not put it into a rolling set for each day of the month? That you can still run the tool but rare fixes will still be picked up. ϢereSpielChequers 09:57, 2 March 2011 (UTC)[reply]
You can do that yourself. The AWB ruleset I run is the basic typo rules listed here, in addition to my own set of more nuanced rules (some of which require more human discretion than is appropriate for AWB). You could do the same. Shadowjams (talk) 07:23, 17 March 2011 (UTC)[reply]

palyer --> player

palyers --> players as well. Alex (talk) 05:10, 28 February 2011 (UTC)[reply]

A WP search found only six of these typos, which I fixed manually. GoingBatty (talk) 05:53, 28 February 2011 (UTC)[reply]

Minster - Minister

I've done enough of the following manually to believe that AWB would be more efficient:

prime minster - prime minister

cabinet minster - cabinet minister

Thanks ϢereSpielChequers 18:08, 28 February 2011 (UTC)[reply]

 Done with these two edits. -- JHunterJ (talk) 19:48, 28 February 2011 (UTC)[reply]

Speed of 'Pre-'

Will the 'Pre-' regex run acceptably fast now? If not, what metric are you using? (it doesn't hang for me, at least.) PS. Thanks for the enable. – Regregex (talk) 06:33, 4 March 2011 (UTC)[reply]

Spurious endash?

Hello!

This[2] change from "-" to "–" is obviously incorrect, see WP:HYPHEN. I am not sure that it is a bug of AWB, not some hand-made rule of a particular user. I know little about AWB and therefore ask here to help fix the problem, via technical changes or maybe social interaction (I have some negative bias towards automated editors and experience some troubles communicating with them). Incnis Mrsi (talk) 16:49, 8 March 2011 (UTC)[reply]

The word "replaced" in the edit summary shows that this change is a "find and replace" rule set up by the user, not something built into AWB. But I think the change is correct. The edit summary refers to WP:ENDASH, and the change seems a correct example of point 5 there, since "World War" contains a space. -- John of Reading (talk) 18:10, 8 March 2011 (UTC)[reply]
Point 5? It looks bizarre to me and apparently contradicts to WP:HYPHEN, but it exists. Sorry for a false alarm. Incnis Mrsi (talk) 19:17, 8 March 2011 (UTC)[reply]

Is "du Pré" rule necessary

Is the "du Pré" rule really necessary? I stumbled across the rule when it wanted to make an incorrect change to blogger "Jacqueline Dupree" in Media in Washington, D.C. Thanks! GoingBatty (talk) 03:25, 9 March 2011 (UTC)[reply]

 Fixed with this edit. Kept the rule, but allowed the "Dupree" spelling. -- JHunterJ (talk) 12:19, 9 March 2011 (UTC)[reply]

Colege -> College

Alex (talk) 23:44, 12 March 2011 (UTC)[reply]

 Done, along with Colegiate → Collegiate - GoingBatty (talk) 00:22, 13 March 2011 (UTC)[reply]

"an unusually long period" → "a unusually long period" ?

Resolved

RegexTypoFix wants to change "an unusually long period" → "a unusually long period" on Fisher Hall and Marcum Center (Miami University), based on the "A ..." rule. This doesn't seem like a correct change to me. Comments? GoingBatty (talk) 03:34, 13 March 2011 (UTC)[reply]

It wants to change "an usually" to "a usually". That seems a correct change, although it seems the text may have intended unusually there. -- JHunterJ (talk) 12:09, 13 March 2011 (UTC)[reply]
Ah, the text is "an usually"! Once I changed the text to "an unusually long period", then RegexTypoFix doesn't want to change it. Thanks! GoingBatty (talk) 15:17, 13 March 2011 (UTC)[reply]

"niger" matching even when part of scientific name

Although the "Niger(ia)" rule is set up to not match scientific names, it tries to change Chlidonias niger to Chlidonias Niger on articles such as List of birds of Oregon. Could someone please see if the rule can be updated? Thanks! GoingBatty (talk) 04:36, 13 March 2011 (UTC)[reply]

It did not attempt the change when I just tried it. -- JHunterJ (talk) 12:06, 13 March 2011 (UTC)[reply]
It still does for me - I'm using AWB SVN 7634. GoingBatty (talk) 15:10, 13 March 2011 (UTC)[reply]
I'm a few builds behind, SVN 7471. -- JHunterJ (talk) 17:07, 13 March 2011 (UTC)[reply]

"Maintenance" rule does not catch "maintanance"

I fixed about 2 dozen of these by supplying my own Find/Replace. I would like to fix the "Maintenance" rule, but I get dizzy when I look at that one for more than a few seconds. Chris the speller (talk) 17:09, 15 March 2011 (UTC)[reply]

 Done with this edit. -- JHunterJ (talk) 17:16, 15 March 2011 (UTC)[reply]

I tweaked a rule -- sorry about any confusion that may have followed

I tweaked the "(In)Significant" rule, then reverted it, not because I saw anything wrong with it, but because when I reloaded AWB to retest the whole thing live, I found that no RegEx fixes were working for me. It was as if "Enable RegEx TypoFix" was unchecked. So in a near panic, I reverted the rule change. Well, after the reversion, RegEx fixes are still not working for me. I had saved my AWB settings before shutting it down, and reloaded settings after launching it again, so it's not some setting that I forgot. My own Find/Replace rules work fine, as do General Fixes, so I can work, but it's like sweeping with a smaller broom. Any ideas would be appreciated, as would a report that other editors are successfully using RegEx Typo after reloading AWB. As for the rule tweak, maybe an experienced tweaker can look it over and give an opinion. Chris the speller (talk) 22:05, 17 March 2011 (UTC)[reply]

It's working for me. I restarted and tried processing A Rocha, "typos fixed: accomodation → accommodation" -- John of Reading (talk) 22:11, 17 March 2011 (UTC)[reply]
Thanks, that's very reassuring. I feel better that it's just me. Of course, it also makes me feel a little paranoid. Chris the speller (talk) 22:21, 17 March 2011 (UTC)[reply]
Well, now *some* of the RegEx rules seem to be working for me. I'm going to reboot the whole machine (mine, not WP!), what the heck. After that, I'll retest and evaluate whether the existing two rules for "significant" are actually working better than I thought. Chris the speller (talk) 22:58, 17 March 2011 (UTC)[reply]

Only a handful of RegEx Typos work for me now, and only intermittently. I know most of you have more fun things to do, but for a change of pace, if an editor wants to help, try the following:

  1. enable Find and Replace
  2. add a Find and Replace for 'reponsible' to 'responsible'
  3. enable RegEx Typos
  4. put 'User:Chris the speller/Sandbox2' in the page list
  5. Start.

It should fix 'reponsible' if Find and Replace is working. It should also fix about 11 other misspellings on 11 other lines if Regex Typos is working. Don't bother to save it, to allow retesting. When I do it, it doesn't catch the other 11 lines. I'd love to hear how other editors fare with this. Chris the speller (talk) 01:44, 18 March 2011 (UTC)[reply]

When I try your test, Find and Replace works, but AWB AVN 7634 doesn't find any typos. However, it does find typos on articles (see my contributions) and User:GoingBatty/Sandbox2. GoingBatty (talk) 03:17, 18 March 2011 (UTC)[reply]
I am using AWB SVN 7471, just downloaded last week. Am I already 163 versions behind? How often do I need to update it? Since yours also missed all the typos on my sandbox page, it seems that mine is not the only one misfiring. I have found a few typos, but they seem to come in spurts; it finds typos in 2 or 3 pages in a row, then misses them in dozens and dozens. Thanks for giving it a try. You don't sound too worried, but I have a bad feeling about this. Chris the speller (talk) 04:04, 18 March 2011 (UTC)[reply]
I download the latest AWB snapshot as soon as it's available, usually because it fixes a bug or includes a feature request I've submitted. Maybe you should report a bug? Good luck! GoingBatty (talk) 04:16, 18 March 2011 (UTC)[reply]
The typos in User:Chris the speller/Sandbox2 are in indented paragraphs. RegExpTypoFix skips those in case the indent is marking a quotation. Take out the colons, and the typos get fixed. (Using SVN 7471). -- John of Reading (talk) 08:08, 18 March 2011 (UTC)[reply]
Thanks, John, for providing the clear and simple answer to my problem. And thanks again, Batty, for taking the time to look into it. A humbling experience; I feel like the sorcerer's apprentice. Maybe I should change the name of this talk section to "The wrong way to set up a regression test for changes to AWB Typos" ;-)       Chris the speller (talk) 14:33, 18 March 2011 (UTC)[reply]
I added a note on the project page to indicate that typos are not checked in indented paragraphs. Thanks for the info, John! GoingBatty (talk) 01:58, 19 March 2011 (UTC)[reply]

-hsi

This seems a fairly common part of a name. Be nice not to try to change it to "-his". Rich Farmbrough, 00:14, 25 March 2011 (UTC).[reply]

 Done with this edit. GoingBatty (talk) 00:50, 25 March 2011 (UTC)[reply]

bilbliography > bibliography

Fairly common typo: [3].

By the way, why have both Wikipedia:Lists_of_common_misspellings and Wikipedia:AutoWikiBrowser/Typos?

Thanks. 128.138.43.231 (talk) 04:10, 29 March 2011 (UTC)[reply]