Semantic Brand Score

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
Semantic Brand Score

The Semantic Brand Score is a measure designed to assess the importance of one or more brands, in different contexts and whenever textual data (even big data) is available.[1][2] This metric has its foundations in graph theory and combines methods of text mining and social network analysis.[3] The Semantic Brand Score was developed based on the conceptualizations of brand equity proposed by Keller[4] and Aaker.[5] These well-known models inspired the measurement of a different construct on textual data: brand importance.

Brand equity is traditionally assessed through a series of models, which are often based on the administration of questionnaires to consumers or, for example, on financial evaluations. By contrast, the Semantic Brand Score is calculated on texts that potentially represent spontaneous expressions of different stakeholders: they are not subjected to direct interviews, thus reducing possible cognitive biases. The metric can be calculated, for example, by analyzing newspaper articles, consumer dialogue on online forums, or posts published on social media.

Definition and calculation[edit]


The calculation of the Semantic Brand Score requires that the analyzed texts are preliminarily transformed into networks of words, i.e. graphs in which each node represents a word. Links between words are given by their co-occurrence within a given range, or within a sentence. A pre-processing of natural language is advisable to clean up texts, for example by removing stopwords and word affixes (stemming). Consider for example the following network, obtained from the pre-processing of the sentence "The dawn is the appearance of light - usually golden, pink or purple - before sunrise.".

Word co-occurrence network (range 3 words) - ENG.jpg

Semantic Brand Score, which measures brand importance,[6] results from the standardized sum of its components: prevalence, diversity and connectivity.[7]


This dimension measures the frequency of use of a brand name, i.e. the number of times a brand is directly mentioned. Prevalence is linked to the concept of brand awareness,[4] with the idea that a brand that appears more often in a text is more familiar to that text authors. Similarly, the fact that a brand name is frequently mentioned increases its recognition and recall, for those who read it.


This dimension measures the diversity of the words associated with a brand. These are textual associations (and not mental ones as in the brand image theorized by Keller[4]), i.e. the words that are most frequently used in conjunction with a certain brand. Calculation is obtained by means of the degree centrality[8] indicator, which corresponds to the degree of the node representing the brand. Alternatively, it was suggested to calculate diversity through the measure of distinctivness centrality,[9] which gives more value to less redundant brand associations. The idea is that many distinctive textual associations make the discourse around a brand more informative, leading to greater brand strength[10] and importance.


This last dimension measures the level of connectivity of a brand with respect to general discourse, i.e. its ability to act as a bridge between other words (nodes) in the network. Ideally it represents the brokerage power of a brand, i.e. its ability to link different words, groups of words, or topics. Calculation is based on the metric of weighted betweenness centrality.[11][12]

Semantic Brand Score[edit]

The Semantic Brand Score is the standardized sum of prevalence, diversity and connectivity. The three components are all important and only together they represent the full construct of brand importance. Consider for example the case where a brand is frequently mentioned, but in a repetitive way with many posts having the same phrase "InventedCola is the best drink of all time". Prevalence in this case would be high, but diversity would be low. On the other hand, a brand frequently mentioned in a heterogeneous context would have both high prevalence and diversity. However, connectivity may still be low if the brand is discussed only as a niche of a wider discourse. When a brand is in-between different topics - it is important and acts as an intermediary for the whole context - then its connectivity is also high. The "InventedCola" brand could be central in one discourse (e.g. soft drinks) and peripheral in another (e.g. bar cocktails).

Some tutorials for the calculation of the metric using the Python programming language can be found online.[13]

Sentiment of textual brand associations[edit]

The informativeness of brand importance can be complemented by comparing its value with that of brand associations sentiment. The fact that a brand is frequently mentioned, even in diverse contexts, and is at the heart of a discourse, defines its importance. However, it may be useful to understand whether the feelings and opinions associated with it are positive or negative.

Use cases[edit]

Not only "brands"[edit]

The Semantic Brand Score can be used to measure the importance of any word, or set of words; it is therefore not limited to the analysis of brands in a strict sense. By "brand" one can also intend the name of a politician,[14] or a set of words that represent a concept (for example, the concept of "innovation" or a corporate core value).

Use cases[edit]

The measure was used to evaluate the transition dynamics that occur when a new brand replaces an old one.[6] The Semantic Brand Score is also useful to relate the importance of a brand to that of its competitors, or to analyze importance time trends of a single brand. In some applications, the measures obtained have also proved useful for forecasting purposes; for example, in the political scenario, a link has been found between brand importance of candidate names in online press and election outcomes.[15][7]

There are no limits on the text sources that can be analyzed: newspaper articles, emails, posts on online forums, blogs and social media, open text fields of interviews administered to consumers, etc.. The measure also works with different languages.

See also[edit]


  1. ^ Colladon, Andrea Fronzetti; Bella, Agostino La; Grippa, Francesca; Guardabascio, Barbara; Capano, Vincenzo D'Innella (2018). "Brand Intelligence in the Era of Big Data: Advances in the Use of the Semantic Brand Score". Poster Presented at the XXIX RSA AiIG 2018 - the Challenge of Management Engineering in a Changing Manufacturing World. doi:10.13140/rg.2.2.22783.66723.
  2. ^ Fronzetti Colladon, Andrea (2018). "Measuring Brand Importance through Semantic and Social Network Analysis: Applications of the Semantic Brand Score". XXXVIII Sunbelt Conference of the International Network for Social Network Analysis.
  3. ^ Alexandridis, Kostas; Takemura, Shion; Webb, Alex; Lausche, Barbara; Culter, Jim; Sato, Tetsu (November 2018). "Semantic knowledge network inference across a range of stakeholders and communities of practice". Environmental Modelling & Software. 109: 202–222. doi:10.1016/j.envsoft.2018.08.026.
  4. ^ a b c Keller, Kevin Lane (January 1993). "Conceptualizing, Measuring, and Managing Customer-Based Brand Equity". Journal of Marketing. 57 (1): 1–22. doi:10.1177/002224299305700101. ISSN 0022-2429. S2CID 220602603.
  5. ^ Aaker, David A. (April 1996). "Measuring Brand Equity Across Products and Markets". California Management Review. 38 (3): 102–120. doi:10.2307/41165845. JSTOR 41165845.
  6. ^ a b Fronzetti Colladon, Andrea (July 2018). "The Semantic Brand Score". Journal of Business Research. 88: 150–160. doi:10.1016/j.jbusres.2018.03.026.
  7. ^ a b Saporiti, Riccardo (14 May 2019). "Elezioni: è la Lega il brand che vale di più sui giornali". Il Sole 24 Ore - Info Data. Retrieved 21 May 2019.
  8. ^ Freeman, Linton C. (January 1978). "Centrality in social networks conceptual clarification". Social Networks. 1 (3): 215–239. CiteSeerX doi:10.1016/0378-8733(78)90021-7.
  9. ^ Fronzetti Colladon, Andrea; Naldi, Maurizio (2020-05-22). Xiao, Gaoxi (ed.). "Distinctiveness centrality in social networks". PLOS ONE. 15 (5): e0233276. arXiv:1912.03391. Bibcode:2020PLoSO..1533276F. doi:10.1371/journal.pone.0233276. ISSN 1932-6203. PMC 7244137. PMID 32442196.
  10. ^ Grohs, Reinhard; Raies, Karine; Koll, Oliver; Mühlbacher, Hans (June 2016). "One pie, many recipes: Alternative paths to high brand strength". Journal of Business Research. 69 (6): 2244–2251. doi:10.1016/j.jbusres.2015.12.037.
  11. ^ Brandes, Ulrik (May 2008). "On variants of shortest-path betweenness centrality and their generic computation". Social Networks. 30 (2): 136–145. CiteSeerX doi:10.1016/j.socnet.2007.11.001.
  12. ^ Freeman, Linton C. (March 1977). "A Set of Measures of Centrality Based on Betweenness". Sociometry. 40 (1): 35–41. doi:10.2307/3033543. JSTOR 3033543.
  13. ^ Colladon, Andrea Fronzetti (2019-04-16). "Calculating the Semantic Brand Score with Python". Medium. Retrieved 2019-04-17.
  14. ^ Guzmán, Francisco; Sierra, Vicenta (December 2009). "A political candidate's brand image scale: Are political candidates brands?". Journal of Brand Management. 17 (3): 207–217. doi:10.1057/bm.2009.19. ISSN 1350-231X. S2CID 167417115.
  15. ^ "Semantic Brand Score - Analytics Demo". Retrieved 2019-02-15.

External links[edit]