Insert a Unicode Line Break Value

A Unicode line break value is one of the many Unicode properties that you can insert via the Insert Token button on the Create panel.

Insert a Unicode line break value

Every Unicode code point has exactly one value for the Grapheme_Cluster_Break property. This property is part of Unicode Standard Annex 14 (UAX 14) titled “Unicode Line Breaking Algorithm”. This property is used to determine the boundaries between lines. Such a boundary is a line break opportunity. The property alone does not determine where the line break opportunities are. Rather, the rules in UAX 14 use the values that the characters before and after a position in the text have for this property to determine whether there the line may be broken at that position.

It’s not very likely that you would need to match this property with a regular expression. You could use it to implement the rules in UAX 14 using regular expressions. But most regex flavors that support this property also support \b{lb} to match an actual line break according to UAX 14. You can insert that with the Line Boundary item in the Anchor submenu of the Insert Token menu.