Wikipedia talk:WikiProject Statistics
|Main page||Talk page||Members||Templates||Resources|
|This is the talk page for discussing WikiProject Statistics and anything related to its purposes and tasks.|
|Archives: 1, 2, 3, 4, 5, 6|
|This page is of interest to the following WikiProjects:|
|Threads older than 90 days may be archived by.|
- 1 Audience considerations
- 2 Most Influential Languages in the World Economy
- 3 Request: simple better graph for Lorenz curve
- 4 Gini coefficient discussion
- 5 Differential Equation
- 6 AfC submission - 01/06
- 7 AfC submission - 04/06
- 8 Leaflet For Wikiproject Statistics At Wikimania 2014
- 9 A draft at AFC needs some specialist attention
- 10 Is this project dead?
- 11 ERROR IN CONFUSION MATRIX
I've just read the articles in Weiner and Gaussian processes. I am not a mathematician, but I am a social scientist with an interest in research methodologies. I was hoping to find a clear description of cases where an assumption of normal distribution is sound. I am working on a paper where qualitative interviews indicated heterogeneity in a key behavior, so we have looked at splitting our groups using k-means cluster analysis, based on continuous behavior observation data. We found that previously-used groupings of observations, where agents had been assumed to have homogeneous behavior, had heterogeneous behavior and that individuals clustered together in multiple equilibria. We have had a lot of push-back from the statisticians and stats-trained researchers, in the group, because they claimed at first to not understand the method and then said the findings were probably exaggerated.
I came to wikipedia with these concerns: what conceptual framework supports an assumption of homogeneity or heterogeneity? What tests are available to establish one or the other? What types of cause and effect relationships underlie equilibrium processes that exist in reality? Basically, I wanted to turn the argument around and ask them to question their assumptions in the same light they were questioning my work.
I searched the web for "empirical support for homogeneity and normal distributions" and saw the word "process" with wikipedia in the search results, and thought I was on the right track for finding information about the causal/conceptual framework, like an operational model, a process flow diagram or at least a textual description of what characteristics typify these sorts of processes, or something like that. But, I was completely unprepared to understand what I was reading. It was not helpful or useful to me at all.
I don't know in general about all of the articles in the math/stats project at Wikipedia, but these articles were not accessible to me. I think they would be inaccessible by any non-mathematician. The sort of 'text book talk' in proofs and formulas can be helpful. I've really appreciatd the project's sensitivity and specificity articles. But, in these articles there was nothing but 'text book talk'. I had no frame of reference to understand these articles.
Maybe it is my applied research background that cripples me in the more basic research and math theory arena, but it seems like the audience for wikipedia should be somewhat like that of an encyclopedia, not a text book. And definitely not an advanced undergraduate/graduate school level textbook.
So, all I can say in response to my colleagues, for now, is "your assumption contradicts the beliefs of the real people we are claiming to study" and "i've shown that there isn't a tendency toward an equilibrium between our three core behavioral indices, but toward multiple points of equilibrium". I am guessing they will reply "we know better than the people we are studying, they don't realize their equilibrium-seeking tendencies" and "all you've shown is something so confusing that we don't understand it and that you don't know how to do things the old fashioned, tried and true way".
I thought the wikipedia articles would help explain how empirical single-equilibrium processes occur, something about the standard approach for supporting an assumption of equilibrium and if and how homogeneity relates to the discussion and... And all I found were pieces written to an audience so specific that I didn't learn a single thing, although the figures did say something to me, but I can't explain what because the article didn't say.
I don't want this to be a place to settle a dogmatic/ideological score, but I do think the audience should be considered in a more meaningful way. I wanted to find information that could help me make sense of complicated math stuff, but it was over my head. I'm sorry to see that.
Most Influential Languages in the World Economy
Please comment at Talk:Linguistic demography#"Influential languages" chart on a chart which I believe should be removed from the article. Since the editor who created the chart disagrees with me, consensus is needed. Cnilep (talk) 00:59, 22 April 2014 (UTC)
Request: simple better graph for Lorenz curve
Gini coefficient discussion
Project members are invited to look at Talk:Gini coefficient#Gini in Template:infobox country and to provide input. – S. Rich (talk) 04:27, 6 May 2014 (UTC)
A lot of continuous distribution pages now have a "Different Equation" section that is rather opaque, it's just title linking to differential equations and a horribly typeset set of differential equations - no text explaining it. I don't have time to fix it up myself, so I thought I'd pass on my observation. Lucaswilkins (talk) 20:32, 12 May 2014 (UTC)
AfC submission - 01/06
AfC submission - 04/06
Leaflet For Wikiproject Statistics At Wikimania 2014
My name is Adi Khajuria and I am helping out with Wikimania 2014 in London.
One of our initiatives is to create leaflets to increase the discoverability of various wikimedia projects, and showcase the breadth of activity within wikimedia. Any kind of project can have a physical paper leaflet designed - for free - as a tool to help recruit new contributors. These leaflets will be printed at Wikimania 2014, and the designs can be re-used in the future at other events and locations.
This is particularly aimed at highlighting less discoverable but successful projects, e.g:
• Active Wikiprojects: Wikiproject Medicine, WikiProject Video Games, Wikiproject Film
• Tech projects/Tools, which may be looking for either users or developers.
• Less known major projects: Wikinews, Wikidata, Wikivoyage, etc.
• Wiki Loves Parliaments, Wiki Loves Monuments, Wiki Loves ____
• Wikimedia thematic organisations, Wikiwomen’s Collaborative, The Signpost
A draft at AFC needs some specialist attention
Is this project dead?
- Quiet, but not entirely dead. A number of topics were requests for comments at talk pages or drafts. I usually go directly to the indicated pages rather than comment here. The article you requested comments for has already been accepted, which quenches any comments at this point. I will say that the acceptance of the article was a mistake; geometric Poisson distributions (a type of compound Poisson distribution) have been around since the 70's and the present article seems a coatrack for a particular researcher's papers. --Mark viking (talk) 16:43, 29 June 2014 (UTC)
ERROR IN CONFUSION MATRIX
Hello, I just noticed an error in the confusion matrix: the denominators of FPR and FNR are switched. NB: just in the two cases at the bottom of the confusion matrix. In the list to its right things are ok. Regards, Ivo. Jul 11 2014.