A Bayesian average is a method of estimating the mean of a population using outside information, especially a pre-existing belief, which is factored into the calculation. Incorporating prior belief in this way is a central feature of Bayesian interpretation, and it is particularly useful when the available data set is small.
The Bayesian average is calculated from the prior mean m and a constant C. C is chosen based on the typical data set size required for a robust estimate of the sample mean. C is larger when the expected variation between data sets (within the larger population) is small, and smaller when the data sets are expected to vary substantially from one another.
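For a data set of n values $x_1, \dots, x_n$, the Bayesian average is

$$\bar{x} = \frac{C m + \sum_{i=1}^{n} x_i}{C + n}$$

This is equivalent to adding C data points of value m to the data set. It is a weighted average of the prior average m and the sample average, with weights C and n respectively.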
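The calculation described above, adding C pseudo-observations of value m to the data, can be sketched in Python as follows. This is a minimal illustration rather than a reference implementation: the function name `bayesian_average`, its parameter names, and the rating values are hypothetical and chosen only for the example.

```python
from typing import Sequence


def bayesian_average(values: Sequence[float], prior_mean: float, prior_weight: float) -> float:
    """Return (C*m + sum(x)) / (C + n), with m = prior_mean and C = prior_weight."""
    if prior_weight < 0:
        raise ValueError("prior_weight (C) must be non-negative")
    n = len(values)
    if prior_weight + n == 0:
        raise ValueError("need C > 0 or at least one data point")
    return (prior_weight * prior_mean + sum(values)) / (prior_weight + n)


# Hypothetical ratings on a 1-5 scale, with prior mean m = 3.0 and C = 5.
few_ratings = [5.0, 5.0]
many_ratings = [5.0] * 200

# With only two ratings the estimate stays close to the prior mean.
print(bayesian_average(few_ratings, 3.0, 5))
# With 200 ratings the data dominate and the estimate approaches the sample mean.
print(bayesian_average(many_ratings, 3.0, 5))
```

With no data at all the function returns the prior mean, and as n grows relative to C the result moves toward the sample mean, which is the weighted-average behaviour described above.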
When the $x_i$ are binary values 0 or 1, m can be interpreted as the prior estimate of a binomial probability, with the Bayesian average giving a posterior estimate for the observed data. In this case, C can be chosen based on the desired confidence interval for the sample value. For example, for rare outcomes, when m is small, choosing $C \simeq 9/m$ ensures that a 99% confidence interval has width about 2m.
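A rough numerical check of the binary case is sketched below. It assumes the normal approximation to the binomial, under which the half-width of an interval based on C observations is about z·sqrt(m(1 − m)/C), and it assumes z = 3 as a round stand-in for a 99% two-sided interval (the exact 99% quantile is about 2.576). Setting that half-width equal to m gives C close to 9/m for small m. The helper names and the counts used are hypothetical.

```python
import math


def prior_weight_for_rare_event(m: float, z: float = 3.0) -> float:
    """Assumed rule of thumb: C = z^2 / m, so the half-width is about m when m is small."""
    if not 0.0 < m < 1.0:
        raise ValueError("m must be strictly between 0 and 1")
    return z * z / m


def binary_bayesian_average(successes: int, trials: int, m: float, C: float) -> float:
    """Bayesian average for 0/1 data: (C*m + successes) / (C + trials)."""
    return (C * m + successes) / (C + trials)


m = 0.01                                # prior estimate of a rare outcome
C = prior_weight_for_rare_event(m)      # about 900 pseudo-observations

# Normal-approximation half-width for C observations at proportion m.
half_width = 3.0 * math.sqrt(m * (1.0 - m) / C)
print(C, half_width, 2.0 * half_width)  # the full width is close to 2m

# Hypothetical sample: 3 successes in 100 trials (raw rate 0.03).
print(binary_bayesian_average(3, 100, m, C))
```

Because C is large when m is small, a short run of observations moves the estimate only slightly away from the prior, which is the intended effect for rare outcomes.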