Complexity function: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
m better
abelian complexity function, cite Blanchet-Sadri & Fox (2013)
Line 21: Line 21:
For ''x'' a real number and ''b'' an integer ≥ 2 then the complexity function of ''x'' in base ''b'' is the complexity function ''p''(''x'',''b'',''n'') of the sequence of digits of ''x'' written in base ''b''.
For ''x'' a real number and ''b'' an integer ≥ 2 then the complexity function of ''x'' in base ''b'' is the complexity function ''p''(''x'',''b'',''n'') of the sequence of digits of ''x'' written in base ''b''.
If ''x'' is an [[irrational number]] then ''p''(''x'',''b'',''n'') ≥ ''n''+1; if ''x'' is [[rational number|rational]] then ''p''(''x'',''b'',''n'') ≤ ''C'' for some constant ''C'' depending on ''x'' and ''b''.<ref name=Bug91>Bugeaud (2012) p.91</ref>
If ''x'' is an [[irrational number]] then ''p''(''x'',''b'',''n'') ≥ ''n''+1; if ''x'' is [[rational number|rational]] then ''p''(''x'',''b'',''n'') ≤ ''C'' for some constant ''C'' depending on ''x'' and ''b''.<ref name=Bug91>Bugeaud (2012) p.91</ref>

The '''abelian complexity function''' similarly counts the number of occurrences of distinct factors of given length ''n'', where now we identify factors that differ only by a permutation of the positions.<ref>{{cite book | chapter=On the Asymptotic Abelian Complexity of Morphic Words | title=Developments in Language Theory. Proceedings, 17th International Conference, DLT 2013, Marne-la-Vallée, France, June 18-21, 2013 | pages=94-105 | year=2013 | doi=10.1007/978-3-642-38771-5_10 | isbn=978-3-642-38770-8 | series=Lecture Notes in Computer Science | volume=7907 | issn=0302-9743 | publisher=[[Springer-Verlag]] | location=Berlin, Heidelberg | | editor1-first=Marie-Pierre | editor1-last=Béal | editor2-first=Olivier | editor2-last=Carton | author1-first=Francine | author1-last=Blanchet-Sadri | author2-first=Nathan | editor2-last=Fox }}</ref>


==References==
==References==

Revision as of 21:41, 22 February 2014

In computer science, the complexity function of a string, a finite or infinite sequence u of letters from some alphabet, is the function pu(n) of a positive integer n that counts the number of different factors (consecutive substrings) of length n from the string u.[1][2][3][4][5]

For a string u of length at least n over an alphabet of size k we clearly have

the bounds being achieved by the constant word and a disjunctive word,[6] for example, the Champernowne word respectively.[7] For infinite words u, we have pu(n) bounded if u is ultimately periodic (a finite, possibly empty, sequence followed by a finite cycle). Conversely, if pu(n) ≤ n for some n, then u is ultimately periodic.[3][8]

An aperiodic sequence is one which is not ultimately periodic. An aperiodic sequence has strictly increasing complexity function (this is the Morse–Hedlund theorem),[9] so p(n) is at least n+1.[10]

A set S of finite binary words is balanced if for each n the subset Sn of words of length n has the property that the Hamming weight of the words in Sn takes at most two distinct values. A balanced sequence is one for which the set of factors is balanced.[11] A balanced sequence has complexity function at most n+1.[12]

A Sturmian word over a binary alphabet is one with complexity function n + 1.[13] A sequence is Sturmian if and only if it is balanced and aperiodic.[2][14] An example is the Fibonacci word.[13][15] More generally, a Sturmian word over an alphaber of size k is one with complexity n+k−1. An Arnoux-Rauzy word over a ternary alphabet has complexity 2n + 1:[13] an example is the Tribonacci word.[16]

For recurrent words, those in which each factor appears infinitely often, the complexity function almost characterises the set of factors: if s is a recurrent word with the same complexity function as t are then s has the same set of factors as t or δt where δ denotes the letter doubling morphism aaa.[17]

The topological entropy of an infinite sequence u is defined by

The limit exists as the logarithm of the complexity function is subadditive.[18][19] Every real number between 0 and 1 occurs as the topological entry of some sequence.[20]

For x a real number and b an integer ≥ 2 then the complexity function of x in base b is the complexity function p(x,b,n) of the sequence of digits of x written in base b. If x is an irrational number then p(x,b,n) ≥ n+1; if x is rational then p(x,b,n) ≤ C for some constant C depending on x and b.[6]

The abelian complexity function similarly counts the number of occurrences of distinct factors of given length n, where now we identify factors that differ only by a permutation of the positions.[21]

References

  1. ^ Lothaire (2011) p.7
  2. ^ a b Lothaire (2011) p.46
  3. ^ a b Pytheas Fogg (2002) p.3
  4. ^ Berstel et al (2009) p.82
  5. ^ Allouche & Shallit (2003) p.298
  6. ^ a b Bugeaud (2012) p.91
  7. ^ Cassaigne & Nicolas (2010) p.165
  8. ^ Allouche & Shallit (2003) p.302
  9. ^ Cassaigne & Nicolas (2010) p.166
  10. ^ Lothaire (2011) p.22
  11. ^ Allouche & Shallit (2003) p.313
  12. ^ Lothaire (2011) p.48
  13. ^ a b c Pytheas Fogg (2002) p.6
  14. ^ Allouche & Shallit (2003) p.318
  15. ^ de Luca, Aldo (1995). "A division property of the Fibonacci word". Information Processing Letters. 54 (6): 307–312. doi:10.1016/0020-0190(95)00067-M.
  16. ^ Pytheas Fogg (2002) p.368
  17. ^ Berstel et al (2009) p.84
  18. ^ Pytheas Fogg (2002) p.4
  19. ^ Allouche & Shallit (2003) p.303
  20. ^ Cassaigne & Nicolas (2010) p.169
  21. ^ Blanchet-Sadri, Francine (2013). "On the Asymptotic Abelian Complexity of Morphic Words". In Béal, Marie-Pierre; Fox, Olivier (eds.). Developments in Language Theory. Proceedings, 17th International Conference, DLT 2013, Marne-la-Vallée, France, June 18-21, 2013. Lecture Notes in Computer Science. Vol. 7907. Berlin, Heidelberg: Springer-Verlag. pp. 94–105. doi:10.1007/978-3-642-38771-5_10. ISBN 978-3-642-38770-8. ISSN 0302-9743. {{cite book}}: |author2-first= missing |author2-last= (help); Cite has empty unknown parameter: |1= (help)