Wikipedia talk:AutoWikiBrowser/Typos

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Misspelt file names and URLs[edit]

I suggest that AWB should make sure the typo is not part of a link.

I had a couple of occasions recently where AWB did a typo fix which was perfectly valid but the correct action should have been ignoring the typo. Specifically, if the spelling mistake was within a URL or an image file name. Unfortunately, I have realised this is a bug hours after seeing the problem and the two actual examples are lost somewhere in my edits. Periglio (talk) 07:08, 16 March 2014 (UTC)

Were you using "Find & Replace" expressions that you'd set up yourself, or had you just ticked "Regex typo fixing" on the "Options" tab? Either way, the developers will need to see the diffs before they can comment with any precision. -- John of Reading (talk) 07:50, 16 March 2014 (UTC)
This was with the "Regex typo fixing" box ticked. I just happened to notice a couple during my edits which I did not save so there are no diffs. As I said, I did not think about it until long after the event, so I am unable to find the article where this took place. I have also been unable to recreate it on my user page so I apologise for assuming it was a general oversight! If I see it happen again, I will be back. Periglio (talk) 10:32, 16 March 2014 (UTC)
@Periglio: One of the items in Wikipedia:AutoWikiBrowser/Typos#To do shows one way we can update the typo rules to ignore URLs, but there are others. If you notice this behavior in the future, please post here (don't save a bad edit), so we can see how we can adjust the article and/or the typo rule. Thanks! GoingBatty (talk) 21:27, 16 March 2014 (UTC)

New rule "।"[edit]

I've just seen this new rule suggest a change at Kali, removing spaces before each "।" character - that's Unicode \u0964, not an ordinary pipe. Could Wikiuser13 (talk · contribs) or any other editor explain for me why this is a typo that needs fixing, as it's not obvious. -- John of Reading (talk) 10:33, 18 March 2014 (UTC)

Hello @John of Reading:, I knew that it is purnaviran, not pipe. In Devnagri, there is no space before purnaviram. If you know Hindi, you can see hi:विकिपीडिया:चौपाल#। (पूर्णविराम) का कौन-सा रूप सही है? at Hindi Wikipedia.--Wikiuser13 (talk | contribs) 10:39, 18 March 2014 (UTC)
Thank you, that's clearer. -- John of Reading (talk) 10:51, 18 March 2014 (UTC)

Bug with diacritics?[edit]

Sometimes when I am typo fixing, I will be presented with a page that has no changes automatically applied. However I have the skip if no typos are found setting checked so surely it should skip these pages automatically? This seems to happen on pages that use diacritics in the title (I haven't fully confirmed that). For an example try: Demographics of São Paulo you should also notice that the edit summary is given as fixing Sao Paulo to São Paulo. Jamesmcmahon0 (talk) 13:42, 28 March 2014 (UTC)

A second example:Diane de France has no changes but doesn't skip and has the edit summary; Typo fixing, typo(s) fixed: Angouleme → Angoulême Jamesmcmahon0 (talk) 13:49, 28 March 2014 (UTC)
This has come up before; see Wikipedia talk:AutoWikiBrowser/Bugs#TypoFix undoes general fixes, causing a confusing edit summary. -- John of Reading (talk) 15:08, 28 March 2014 (UTC)
Thanks, good to know the reason behind it, has been bugging me for ages. Jamesmcmahon0 (talk) 15:22, 28 March 2014 (UTC)

50€ → €50 needs looking at[edit]

On the article Harry (TV gameshow) AWB tries to 'correct' a number of instances of 1 500€ (etc.) to 1 €500 i.e. it misses the leading 1 which is obviously important in the context of the typo. Jamesmcmahon0 (talk) 16:34, 28 March 2014 (UTC)

I don't even know what "241 00€" is supposed to represent; how can you expect a Typo rule to know what's going on? If the article contained properly formatted numbers instead of garbage, I think the Typo rules would work fine. Chris the speller yack 04:23, 29 March 2014 (UTC)
I agree that the article is poorly written, I assume the 241 00€ is supposed to be 241 000€ but obviously it can't be expected to fix that. I couldn't find any guidelines in MOS:CURRENCY for how to layout large amount so I would think that £1 000 000 is equally acceptable as £1,000,000. Could the rule be changed to look for groups of three numbers separated by either a space or a comma and fix accordingly? Jamesmcmahon0 (talk) 18:38, 29 March 2014 (UTC)
I think WP:MOSNUM#Grouping of digits would apply, which indicates that commas should be used as the separators. GoingBatty (talk) 21:52, 29 March 2014 (UTC)
Yep, I would agree with that, Thanks. Since that has come up, is the any way to correct 123 456 etc to 123,456 without hideous amounts of false positives? Jamesmcmahon0 (talk) 22:53, 30 March 2014 (UTC)
I think there would be lots (by our standards) of false positives, such as "the plane dropped 3 500 pound bombs on the target", which is poorly formatted and unhyphenated, but understandable by a human reader. The correction you suggest would change its meaning. I'm not in favor of risking that kind of damage. Chris the speller yack 04:23, 31 March 2014 (UTC)

"an another"[edit]

I was doing a quick typo run-through and it changed "a another" to "an another". (talk) 17:48, 28 March 2014 (UTC)

Correctly so. Regards, Sun Creator(talk) 23:33, 28 March 2014 (UTC)
That was very unhelpful. (talk) — Preceding undated comment added 17:37, 29 March 2014 (UTC)
I can't easily create a sentence where "a another" would be valid. As we have often pointed out in other discussions, the Typo rules are meant to correct common minor mistakes in text that somewhat resembles good English. Typo rules can't fix every possible mistake, and "an another" makes no less sense than "a another", so it's not really doing any harm. When I see a Typo rule tripped up like that, I fix the sentence and continue on my merry way. Chris the speller yack 04:34, 31 March 2014 (UTC)
I'm not sure why but the BBC use "an another" a fair few times. Regards, Sun Creator(talk) 22:38, 31 March 2014 (UTC)


@Ohconfucius: My Concise Oxford lists "debut" as a valid spelling, without the accent. -- John of Reading (talk) 07:36, 5 April 2014 (UTC)

  • Many editors gnome and make that change so I thought I would put it in. But I'm fine with it not going in if it's not considered a typo. I'm always suspicious of Oxford, they allow all sorts of funny spelling ;-) -- Ohc ¡digame! 07:40, 5 April 2014 (UTC)
@Ohconfucius: On about 20 pages, when I used it, it changed 'debut' to 'Noel Coward'. OccultZone (Talk) 07:54, 5 April 2014 (UTC)
Ah, I see what it is. My bad, and I apologise. The first appearance doesn't do anything. -- Ohc ¡digame! 07:59, 5 April 2014 (UTC)