Jump to content

Talk:Softmax function

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 151.227.23.87 (talk) at 17:27, 1 August 2015 (→‎Possible?: new section). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Family of functions

From reading Serre 2005, it sounds that there are multiple definitions of softmax. Is this the case? What other representations are there? It looks like Riesenhuber and Poggio, 1999b and Yu et al., 2002 as referenced in Serre 2005 might give clues. JonathanWilliford (talk) 23:53, 11 August 2009 (UTC)[reply]

Content

Is this acceptable content? 95% of the article is a direct copy of the content from [1]. Jludwig (talk) 06:23, 10 May 2008 (UTC)[reply]

I agree with this concern. I am deleting most of the content of the article, per the concern of copyright violation. 128.197.81.32 (talk) 22:00, 30 July 2008 (UTC)[reply]

Derivation

Whats about the derivation of the softmax function? —Preceding unsigned comment added by 78.34.250.44 (talk) 21:28, 13 March 2009 (UTC)[reply]

John D Cook's definition is different from all of these

I followed the external link to the description of softmax as a substitute for maximum by John D. Cook. There, the softmax is described as

not

as in wkp. His version makes more sense to me. Can anyone corroborate me on this? I think the article needs fixing. But since there seem to be multiple definitions, it's hard to be clear.--mcld (talk) 15:31, 2 January 2014 (UTC)[reply]

I saw this on Cook's blog and I was highly surprised. Apparently this is a completely different function that is also called the softmax; I've never seen it in use. QVVERTYVS (hm?) 15:28, 5 February 2014 (UTC)[reply]
I checked Cook's blog post again, and found that he doesn't even call his function softmax; he calls it "soft maximum". Removed the link as it's quite unrelated. QVVERTYVS (hm?) 17:36, 5 February 2014 (UTC)[reply]

Cook's post is very informative on smooth maximum. There seems to be no natural setting for putting smooth maximum sub heading in soft max. Moving to a new page. — Preceding unsigned comment added by Yodamaster1 (talkcontribs) 16:50, 20 February 2015 (UTC)[reply]

Possible?

Could it ever be possible that the explanation of how the function works be any more incomprehensible?

Apparently, there's an extremely well developed culture in wikipedia of: everyone is expected to know a bunch of inscrutable variable-name conventions. Either that or writers really are convinced those are solidly established conventions of the likes of "+", "-", "x", etc. . I mean, not even "n" (usually used to mean "number of elements") is conventional enough in many cases (especially considering how often it is used to other meanings too).

I'm really fed up with this, and this article is among the most poor examples of this that I've found so far.