Zero-width space

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Loadmaster (talk | contribs) at 21:05, 2 December 2016 (→‎Usage: Added borders to text-wrapping boxes. If there is a wikified way of doing this, please do so). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The zero-width space (ZWSP) is a non-printing character used in computerized typesetting to indicate word boundaries to text processing systems when using scripts that do not use explicit spacing, or after characters (such as the slash) that are not followed by a visible space but after which there may nevertheless be a line break. Normally, it is not a visible separation, but it may expand in passages that are fully justified.[1]

Usage

In HTML pages, the zero-width space can be used as a potential line-break in long words as an alternative to the <wbr> element. However, the zero-width space is not supported in all web browsers such as old versions of Internet Explorer (versions 6 and earlier).[2]

To show the effect of the zero-width space, the following words have been separated with zero-width spaces:

Lorem​Ipsum​Dolor​Sit​Amet​Consectetur​Adipiscing​Elit​Sed​Do​Eiusmod​Tempor​Incididunt​Ut​Labore​Et​Dolore​Magna​Aliqua​Ut​Enim​Ad​Minim​Veniam​Quis​Nostrud​Exercitation​Ullamco​Laboris​Nisi​Ut​Aliquip​Ex​Ea​Commodo​Consequat​Duis​Aute​Irure​Dolor​In​Reprehenderit​In​Voluptate​Velit​Esse​Cillum​Dolore​Eu​Fugiat​Nulla​Pariatur​Excepteur​Sint​Occaecat​Cupidatat​Non​Proident​Sunt​In​Culpa​Qui​Officia​Deserunt​Mollit​Anim​Id​Est​Laborum

And following words are not separated with these spaces:

LoremIpsumDolorSitAmetConsecteturAdipiscingElitSedDoEiusmodTemporIncididuntUtLaboreEtDoloreMagnaAliquaUtEnimAdMinimVeniamQuisNostrudExercitationUllamcoLaborisNisiUtAliquipExEaCommodoConsequatDuisAuteIrureDolorInReprehenderitInVoluptateVelitEsseCillumDoloreEuFugiatNullaPariaturExcepteurSintOccaecatCupidatatNonProidentSuntInCulpaQuiOfficiaDeseruntMollitAnimIdEstLaborum

On browsers supporting zero-width spaces, resizing the window will re-break the first text only at word boundaries, while the second text will not be broken at all.

Encoding

The zero-width space character is encoded in Unicode as U+200B ZERO WIDTH SPACE (&NegativeMediumSpace;, &NegativeThickSpace;, &NegativeThinSpace;, &NegativeVeryThinSpace;, &ZeroWidthSpace;).[3]

The TeX representation is \hskip0pt; the LaTeX representation is \hspace{0pt};[4] and the groff representation is \:.[5]

Its semantics and HTML implementation are similar to the soft hyphen.

See also

References

External links

  1. ^ The Unicode Standard 6.1, p. 366
  2. ^ "Archived copy". Archived from the original on December 14, 2010. Retrieved December 3, 2009. {{cite web}}: Unknown parameter |deadurl= ignored (|url-status= suggested) (help)CS1 maint: archived copy as title (link)
  3. ^ "General Punctuation – Unicode" (PDF). Retrieved 2013-07-20.
  4. ^ "The LaTeX Companion. Chapter 3: Basic Formatting Tools" (PDF). Retrieved 2014-02-08.
  5. ^ "groff(7) - Linux manual page". Retrieved 2014-02-08.