Wikipedia talk:AutoWikiBrowser/Typos

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Shortcut:

50€ → €50 needs looking at[edit]

On the article Harry (TV gameshow) AWB tries to 'correct' a number of instances of 1 500€ (etc.) to 1 €500 i.e. it misses the leading 1 which is obviously important in the context of the typo. Jamesmcmahon0 (talk) 16:34, 28 March 2014 (UTC)

I don't even know what "241 00€" is supposed to represent; how can you expect a Typo rule to know what's going on? If the article contained properly formatted numbers instead of garbage, I think the Typo rules would work fine. Chris the speller yack 04:23, 29 March 2014 (UTC)
I agree that the article is poorly written, I assume the 241 00€ is supposed to be 241 000€ but obviously it can't be expected to fix that. I couldn't find any guidelines in MOS:CURRENCY for how to layout large amount so I would think that £1 000 000 is equally acceptable as £1,000,000. Could the rule be changed to look for groups of three numbers separated by either a space or a comma and fix accordingly? Jamesmcmahon0 (talk) 18:38, 29 March 2014 (UTC)
I think WP:MOSNUM#Grouping of digits would apply, which indicates that commas should be used as the separators. GoingBatty (talk) 21:52, 29 March 2014 (UTC)
Yep, I would agree with that, Thanks. Since that has come up, is the any way to correct 123 456 etc to 123,456 without hideous amounts of false positives? Jamesmcmahon0 (talk) 22:53, 30 March 2014 (UTC)
I think there would be lots (by our standards) of false positives, such as "the plane dropped 3 500 pound bombs on the target", which is poorly formatted and unhyphenated, but understandable by a human reader. The correction you suggest would change its meaning. I'm not in favor of risking that kind of damage. Chris the speller yack 04:23, 31 March 2014 (UTC)

──────────────────────────────────────────────────────────────────────────────────────────────────── This is a common problem with text translated from other languages, particularly if it is done by a person from their mother into English. Other languages use other delimiters in large amounts (for example German uses points where English uses commas and commas where English uses points), so a German "100.000,99" is "100 000,99" in French and 100,000.99 in English. For those who are unsure (or French ;-) ) missing out the delimiters or not converting them is the simplest solution to translating numbers. Hence this will be more of a problem of incorrectly formatted numbers with certain currency symbols than others, and the Euro is going to be one of them because some(most?) European continental languages place the currency symbols after the number. Also according to this paper different dialects of English do or do not use spaces between the currency symbol and the numeric amount. -- PBS (talk) 10:26, 18 May 2014 (UTC)

  • I've been running into the same problem, and I think my suggestion would be to use the trailing currency symbol as the litmus for reformatting a number. So "5 500$" -> "$5,500", but "5 500" goes unchanged. I've also run into other separators, like a single quote, so maybe it just needs to match a space or any punctuation. I think the way to deal with a "241 00€" is to go with "€241.00" - ie matching a trailing group of not 3 digits as decimals. I think this could be the most robust, and requires significantly less editor intervention than punting on any non-comma separator. VanIsaacWScont 08:40, 13 June 2014 (UTC)

Issue[edit]

I was just wondering, if you come across an article that finds multiple typos but one of the typos is the correct way of spelling it (e.g. a surname), how do you ignore it so it only changes the incorrect typo and doesn't change the other?--Mjs1991 (talk) 08:51, 26 May 2014 (UTC)

@Mjs1991: You could make the correct spelling a wikilink (e.g. [[Liev Schreiber]]) or use the {{not a typo}} template. GoingBatty (talk) 13:29, 26 May 2014 (UTC)

Dependant vs. dependent[edit]

Wondering if we could craft a rule to change "dependant" to "dependent" when necessary. My understanding is:

  • In British English:
    • "Dependent" means reliant on.
    • A "dependant" is a person (usually a child or a spouse).
  • In American English, you can use "dependent" for both. (copied from grammar-monster.com)

I'm thinking we could try either:

  1. change "dependant on" (but not "a dependant on") to "dependent on", OR
  2. change "is/are now/highly/very/mostly dependant on" to "is/are now/highly/very/mostly dependent on" (more adverbs as needed)

Thoughts? GoingBatty (talk) 20:04, 30 May 2014 (UTC)

1: seems to fail in BritEng for say "any dependant on some form of assistance"
2: I'm not seeing the difference in the before and after, GB. - Dank (push to talk) 10:59, 16 June 2014 (UTC)
@Dank: Oops! Fixed my typo in option 2. Thanks! GoingBatty (talk) 11:03, 16 June 2014 (UTC)
Thanks. I think #2 now fixes the false positive problem. - Dank (push to talk) 11:31, 16 June 2014 (UTC)

Casted -> Cast[edit]

Cast is an irregular verb i.e. the past tense is also cast, not casted.

I would add it to the list myself, but I don't seem to be able to edit the page.

<Typo word="Cast" find="\b([Cc])asted\b" replace="$1ast"/>

-- Niightblade (talk) 08:26, 16 June 2014 (UTC)

Works for me, regex looks good. - Dank (push to talk) 11:00, 16 June 2014 (UTC)
But Collins English Dictionary says "casted" is an adjective that means "belonging to a caste". I also found "protect the patient's casted foot". Maybe check for "casted" that is preceded by he, she, they, was, is, be, being, etc.? Chris the speller yack 14:01, 16 June 2014 (UTC)
Thanks for catching that. I don't have a preference. - Dank (push to talk) 16:27, 16 June 2014 (UTC)

Langage / Language[edit]

Do not replace langage (French) with language when it is preceded immediately by (le|les|du|des) example page: Maurice Merleau-Ponty 138.251.14.34 (talk) 12:46, 19 June 2014 (UTC)

The right way to protect foreign-language text from English spell checkers is to use the "lang" template. I have employed it in your example page. Chris the speller yack 13:42, 19 June 2014 (UTC)

Enmao -> Emmao[edit]

In this edit, Enmao, as in Wang Enmao, was changed to Emmao. I think the <Typo word="Emm-" line is the culprit. Could this be fixed? Bgwhite (talk) 04:49, 20 June 2014 (UTC)

Yes check.svg Done -- John of Reading (talk) 05:37, 20 June 2014 (UTC)

Including[edit]

I'm seeing AWB trying to 'correct' including to includeing, giving the rule includ --> include for a reason. Thank you, BethNaught (talk) 13:33, 24 June 2014 (UTC)

@BethNaught: On what page? I suspect there may be an invisible control character between "includ" and "ing", causing the software to see it as two separate words. -- John of Reading (talk) 15:24, 24 June 2014 (UTC)
Sorry I forgot to mention it earlier.. I believe, from digging in my contribs, that the page is IEC EN 61000-3-2. BethNaught (talk) 15:35, 24 June 2014 (UTC)
I retyped the words, and that has fixed it. -- John of Reading (talk) 16:59, 24 June 2014 (UTC)

european -> European in Infoboxes[edit]

In National Front (France), AWB changed the infobox parameter in {{Infobox political party}} from european to European. This caused the parameter not to show up in the article. Bgwhite (talk) 23:22, 24 June 2014 (UTC)

@Bgwhite: I can't reproduce the issue. Since AWB won't make typo changes inside templates, and your edit to the article was to remove some extra braces, my guess is that AWB was confused about where the infobox ended, and didn't realize that the |european= parameter was part of the template. Thanks for fixing the article and not saving a bad edit! GoingBatty (talk) 02:18, 25 June 2014 (UTC)
GoingBatty, ahhh, thank you for the explanation. Bgwhite (talk) 03:58, 25 June 2014 (UTC)

full time → full-time in soccer articles[edit]

I've been running across scattered instances of "full time" → "full-time" in soccer articles fairly regularly, where it is used to refer to the end of the game, eg "the score was tied 1-1 at full time". From my memory of context, I think this can be resolved by simply retaining "full time" when preceded by "at". If no one has a counter example, could someone better with regexp program this in? VanIsaacWScont 05:57, 3 July 2014 (UTC)

@Vanisaac: Yes check.svg Done - thanks! GoingBatty (talk) 21:51, 4 July 2014 (UTC)

i.e. rule doesn't skip URLs[edit]

Although the "i.e." rule was designed to skip URLs, AWB wants to change www.currahaparish.ie. to www.currahaparish.i.e. in Curraha. I tried fixing the rule, but my fix didn't work. Could someone else please take a look at this? Thanks! GoingBatty (talk) 14:42, 5 July 2014 (UTC)

Name caught in adverb spelling web[edit]

The last name "Dealy" keeps getting caught in the adverb suffix ly -> lly rule. It wouldn't get an erroneous skip of "idealy" if you match to the capitalized spelling only. VanIsaacWScont 08:17, 6 July 2014 (UTC)

@Vanisaac: Yes check.svg Done by fixing "-ally (2)". Thanks for the report! GoingBatty (talk) 13:47, 6 July 2014 (UTC)

goverment[edit]

Is it possible to add "goverment"? Poveglia (talk) 01:00, 12 July 2014 (UTC)

@Poveglia: The rule already exists - see this edit I just did. Do you see any articles where AWB isn't fixing the typo (when the typo isn't in a protected area, such as a URL or template?) Thanks! GoingBatty (talk) 03:29, 12 July 2014 (UTC)
I feel stupid now. I think I must've made a typo myself after pressing Ctrl-F. I fixed the "goverment" typos manually. Thanks, Poveglia (talk) 05:36, 13 July 2014 (UTC)