This document is mechanically formatted from the XML file for the LGR. It provides additional summary data and explanatory text. The XML file remains the sole normative specification of the LGR.
LGR Version | 1 |
---|---|
Unicode Version | 12.0.0 |
Number of elements in repertoire | 78 |
---|---|
Number of ranges in repertoire | 0 |
Number of code point sequences | 0 |
The following table lists the repertoire by code point (or code point sequence). The data in the Script and Name column are extracted from the Unicode character database. Where the comment in the original LGR is equal to the character name, it has been suppressed.
For any code point or sequence for which a variant is defined, the link to the associated variant set, or if mapped to itself, the variant type of that mapping is provided in the Variants column.
# | Code Point |
Glyph | Script | Name | Tags | Required Context | Variants | Comment | References |
---|---|---|---|---|---|---|---|---|---|
1 | U+002D | - | Common | HYPHEN-MINUS | not-when: hyphen-minus-disallowed | ||||
2 | U+0030 | 0 | Common | DIGIT ZERO | Common-digit | ||||
3 | U+0031 | 1 | Common | DIGIT ONE | Common-digit | ||||
4 | U+0032 | 2 | Common | DIGIT TWO | Common-digit | ||||
5 | U+0033 | 3 | Common | DIGIT THREE | Common-digit | ||||
6 | U+0034 | 4 | Common | DIGIT FOUR | Common-digit | ||||
7 | U+0035 | 5 | Common | DIGIT FIVE | Common-digit | ||||
8 | U+0036 | 6 | Common | DIGIT SIX | Common-digit | ||||
9 | U+0037 | 7 | Common | DIGIT SEVEN | Common-digit | ||||
10 | U+0038 | 8 | Common | DIGIT EIGHT | Common-digit | ||||
11 | U+0039 | 9 | Common | DIGIT NINE | Common-digit | ||||
12 | U+0062 | b | Latin | LATIN SMALL LETTER B | set 1 | Not part of repertoire | |||
13 | U+0063 | c | Latin | LATIN SMALL LETTER C | set 2 | Not part of repertoire | |||
14 | U+0064 | d | Latin | LATIN SMALL LETTER D | set 3 | Not part of repertoire | |||
15 | U+006A | j | Latin | LATIN SMALL LETTER J | set 4 | Not part of repertoire | |||
16 | U+006C | l | Latin | LATIN SMALL LETTER L | set 5 | Not part of repertoire | |||
17 | U+0070 | p | Latin | LATIN SMALL LETTER P | set 6 | Not part of repertoire | |||
18 | U+1403 | ᐃ | Canadian_Aboriginal | CANADIAN SYLLABICS I | |||||
19 | U+1405 | ᐅ | Canadian_Aboriginal | CANADIAN SYLLABICS O | |||||
20 | U+140A | ᐊ | Canadian_Aboriginal | CANADIAN SYLLABICS A | |||||
21 | U+140E | ᐎ | Canadian_Aboriginal | CANADIAN SYLLABICS WI | |||||
22 | U+141B | ᐛ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI WAA | |||||
23 | U+1424 | ᐤ | Canadian_Aboriginal | CANADIAN SYLLABICS FINAL RING | not-when: first-character-of-the-label | ||||
24 | U+1426 | ᐦ | Canadian_Aboriginal | CANADIAN SYLLABICS FINAL DOUBLE SHORT VERTICAL STROKES | not-when: first-character-of-the-label | ||||
25 | U+1427 | ᐧ | Canadian_Aboriginal | CANADIAN SYLLABICS FINAL MIDDLE DOT | not-when: first-character-or-followed-by-ineligible-chars | ||||
26 | U+1431 | ᐱ | Canadian_Aboriginal | CANADIAN SYLLABICS PI | |||||
27 | U+1433 | ᐳ | Canadian_Aboriginal | CANADIAN SYLLABICS PO | |||||
28 | U+1438 | ᐸ | Canadian_Aboriginal | CANADIAN SYLLABICS PA | |||||
29 | U+1448 | ᑈ | Canadian_Aboriginal | CANADIAN SYLLABICS Y-CREE PWAA | |||||
30 | U+1449 | ᑉ | Canadian_Aboriginal | CANADIAN SYLLABICS P | not-when: first-character-of-the-label | ||||
31 | U+144E | ᑎ | Canadian_Aboriginal | CANADIAN SYLLABICS TI | |||||
32 | U+1450 | ᑐ | Canadian_Aboriginal | CANADIAN SYLLABICS TO | |||||
33 | U+1455 | ᑕ | Canadian_Aboriginal | CANADIAN SYLLABICS TA | set 2 | ||||
34 | U+1465 | ᑥ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI TWAA | |||||
35 | U+1466 | ᑦ | Canadian_Aboriginal | CANADIAN SYLLABICS T | not-when: first-character-of-the-label | ||||
36 | U+146D | ᑭ | Canadian_Aboriginal | CANADIAN SYLLABICS KI | can-follow-1427 | set 6 | |||
37 | U+146F | ᑯ | Canadian_Aboriginal | CANADIAN SYLLABICS KO | can-follow-1427 | set 3 | |||
38 | U+1472 | ᑲ | Canadian_Aboriginal | CANADIAN SYLLABICS KA | can-follow-1427 | set 1 | |||
39 | U+1482 | ᒂ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI KWAA | |||||
40 | U+1483 | ᒃ | Canadian_Aboriginal | CANADIAN SYLLABICS K | can-follow-1427 | not-when: first-character-of-the-label | |||
41 | U+1484 | ᒄ | Canadian_Aboriginal | CANADIAN SYLLABICS KW | can-follow-1427 | not-when: first-character-of-the-label | |||
42 | U+148B | ᒋ | Canadian_Aboriginal | CANADIAN SYLLABICS CI | |||||
43 | U+148D | ᒍ | Canadian_Aboriginal | CANADIAN SYLLABICS CO | set 4 | ||||
44 | U+1490 | ᒐ | Canadian_Aboriginal | CANADIAN SYLLABICS CA | |||||
45 | U+14A0 | ᒠ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI CWAA | |||||
46 | U+14A1 | ᒡ | Canadian_Aboriginal | CANADIAN SYLLABICS C | not-when: first-character-of-the-label | ||||
47 | U+14A5 | ᒥ | Canadian_Aboriginal | CANADIAN SYLLABICS MI | |||||
48 | U+14A7 | ᒧ | Canadian_Aboriginal | CANADIAN SYLLABICS MO | |||||
49 | U+14AA | ᒪ | Canadian_Aboriginal | CANADIAN SYLLABICS MA | set 5 | ||||
50 | U+14BA | ᒺ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI MWAA | |||||
51 | U+14BB | ᒻ | Canadian_Aboriginal | CANADIAN SYLLABICS M | not-when: first-character-of-the-label | ||||
52 | U+14C2 | ᓂ | Canadian_Aboriginal | CANADIAN SYLLABICS NI | |||||
53 | U+14C4 | ᓄ | Canadian_Aboriginal | CANADIAN SYLLABICS NO | |||||
54 | U+14C7 | ᓇ | Canadian_Aboriginal | CANADIAN SYLLABICS NA | |||||
55 | U+14CF | ᓏ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI NWAA | |||||
56 | U+14D0 | ᓐ | Canadian_Aboriginal | CANADIAN SYLLABICS N | not-when: first-character-of-the-label | ||||
57 | U+14D5 | ᓕ | Canadian_Aboriginal | CANADIAN SYLLABICS LI | |||||
58 | U+14D7 | ᓗ | Canadian_Aboriginal | CANADIAN SYLLABICS LO | |||||
59 | U+14DA | ᓚ | Canadian_Aboriginal | CANADIAN SYLLABICS LA | |||||
60 | U+14EA | ᓪ | Canadian_Aboriginal | CANADIAN SYLLABICS L | not-when: first-character-of-the-label | ||||
61 | U+14EF | ᓯ | Canadian_Aboriginal | CANADIAN SYLLABICS SI | |||||
62 | U+14F1 | ᓱ | Canadian_Aboriginal | CANADIAN SYLLABICS SO | |||||
63 | U+14F4 | ᓴ | Canadian_Aboriginal | CANADIAN SYLLABICS SA | |||||
64 | U+1504 | ᔄ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI SWAA | |||||
65 | U+1505 | ᔅ | Canadian_Aboriginal | CANADIAN SYLLABICS S | when: allowed-at-initial-position-if-followed-by-specific-codepoints | ||||
66 | U+150C | ᔌ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI SPWA | |||||
67 | U+150D | ᔍ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI STWA | |||||
68 | U+150E | ᔎ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI SKWA | |||||
69 | U+150F | ᔏ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI SCWA | |||||
70 | U+1528 | ᔨ | Canadian_Aboriginal | CANADIAN SYLLABICS YI | |||||
71 | U+152A | ᔪ | Canadian_Aboriginal | CANADIAN SYLLABICS YO | |||||
72 | U+152D | ᔭ | Canadian_Aboriginal | CANADIAN SYLLABICS YA | |||||
73 | U+153D | ᔽ | Canadian_Aboriginal | CANADIAN SYLLABICS NASKAPI YWAA | |||||
74 | U+153E | ᔾ | Canadian_Aboriginal | CANADIAN SYLLABICS Y | not-when: first-character-of-the-label | ||||
75 | U+1546 | ᕆ | Canadian_Aboriginal | CANADIAN SYLLABICS RI | |||||
76 | U+1548 | ᕈ | Canadian_Aboriginal | CANADIAN SYLLABICS RO | |||||
77 | U+154B | ᕋ | Canadian_Aboriginal | CANADIAN SYLLABICS RA | |||||
78 | U+1550 | ᕐ | Canadian_Aboriginal | CANADIAN SYLLABICS R | not-when: first-character-of-the-label |
Number of variant sets | 6 |
---|---|
Largest variant set | 2 |
Ordinary Variants by Type | out-of-repertoire-var (6) blocked (12) |
The following tables list all variant sets defined in this LGR, except for singleton sets. Each table lists all variant mapping pairs of the set; one per row. Mappings are assumed to be symmetric: each row documents both forward (→) and reverse (←) mapping directions. In each table, the mappings are sorted by Source value in ascending code point order; shading is used to group mappings from the same source code point or sequence.
Where the type of both forward and reverse mappings are the same, a single value is given in the Type(s) column, otherwise the types for forward and reverse mappings, as well as comments and references are listed above one another.
A mapping where source and target are the same is reflexive. Variant sets consisting of only a single reflexive mapping are not shown as a set. Instead, the variant type of the mapping is listed in the Variants column of the Repertoire by Code Point table. Reflexive mappings that are part of a larger set are indicated with a “≡”.
In any LGR with variant specifications that are well behaved, all members within each variant set are defined as variants of each other; the mappings in each set are symmetric and transitive; and all variant sets are disjoint.
# | Source | Glyph | Target | Glyph | Type(s) | References | Comment | |
---|---|---|---|---|---|---|---|---|
1 | U+0062 | b | U+0062 | b | ≡ | out-of-repertoire-var | Out-of-repertoire | |
2 | U+0062 | b | U+1472 | ᑲ | ↔ | blocked | Cross-script homoglyph / |
# | Source | Glyph | Target | Glyph | Type(s) | References | Comment | |
---|---|---|---|---|---|---|---|---|
1 | U+0063 | c | U+0063 | c | ≡ | out-of-repertoire-var | Out-of-repertoire | |
2 | U+0063 | c | U+1455 | ᑕ | ↔ | blocked | Cross-script homoglyph / |
# | Source | Glyph | Target | Glyph | Type(s) | References | Comment | |
---|---|---|---|---|---|---|---|---|
1 | U+0064 | d | U+0064 | d | ≡ | out-of-repertoire-var | Out-of-repertoire | |
2 | U+0064 | d | U+146F | ᑯ | ↔ | blocked | Cross-script homoglyph / |
# | Source | Glyph | Target | Glyph | Type(s) | References | Comment | |
---|---|---|---|---|---|---|---|---|
1 | U+006A | j | U+006A | j | ≡ | out-of-repertoire-var | Out-of-repertoire | |
2 | U+006A | j | U+148D | ᒍ | ↔ | blocked | Cross-script homoglyph / |
# | Source | Glyph | Target | Glyph | Type(s) | References | Comment | |
---|---|---|---|---|---|---|---|---|
1 | U+006C | l | U+006C | l | ≡ | out-of-repertoire-var | Out-of-repertoire | |
2 | U+006C | l | U+14AA | ᒪ | ↔ | blocked | Cross-script homoglyph / |
# | Source | Glyph | Target | Glyph | Type(s) | References | Comment | |
---|---|---|---|---|---|---|---|---|
1 | U+0070 | p | U+0070 | p | ≡ | out-of-repertoire-var | Out-of-repertoire | |
2 | U+0070 | p | U+146D | ᑭ | ↔ | blocked | Cross-script homoglyph / |
The following table lists all top-level classes with their definition and the regular expression defining their members.
Name | Definition | Count | Members | References | Comment |
---|---|---|---|---|---|
Common-digits | Tag= Common-digit | 10 | {U+0030 U+0031 U+0032 U+0033 U+0034 U+0035 U+0036 U+0037 U+0038 U+0039} | Digits from the ASCII range; ⍟ | |
implicit | Tag= Common-digit | 10 | {U+0030 U+0031 U+0032 U+0033 U+0034 U+0035 U+0036 U+0037 U+0038 U+0039} | ||
implicit | Tag= can-follow-1427 | 5 | {U+146D U+146F U+1472 U+1483 U+1484} |
The following table lists all the top-level, or named rules defined in the LGR and indicates whether they are used as trigger in an action or as context (when or not-when) for a code point. (Any use of context rules for variants is not indicated).
Name | Regular Expression | Used as Trigger |
Used as Context |
Anchor | References | Comment |
---|---|---|---|---|---|---|
leading-combining-mark | (start) ([:class property:gc=Mn:]∪[:class property:gc=Mc:]) | True | False | False | [150] | RFC 5891 restrictions on placement of combining marks ⍟ |
hyphen-minus-disallowed | (((start))← ⚓|⚓ →((end))|((start)(any)(any)U+002D)← ⚓) | False | True | True | RFC 5891 restrictions on placement of U+002D HYPHEN-MINUS ⍟ | |
first-character-of-the-label | ((start))← ⚓ | False | True | True | Those following code points are not allowed as the initial (leading) point code of a label. | |
allowed-at-initial-position-if-followed-by-specific-codepoints | (((start))← ⚓ →((U+1450|U+144E|U+1455|U+1431|U+146D))|([:Common-digits:])← ⚓|(U+002D)← ⚓) | False | True | True | None | |
first-character-or-followed-by-ineligible-chars | (((start))← ⚓|⚓ →(^{[:class tag=can-follow-1427:]})|⚓ →((end))) | False | True | True | None |
The following table lists the actions that are used to assign dispositions to labels and variant labels, based on the specified conditions. The order of actions defines their precedence: the first action triggered by a label is the one defining its disposition.
# | Condition | Rule / Variant Set | Disposition | References | Comment | |
---|---|---|---|---|---|---|
1 | if label match | leading-combining-mark | → | invalid | labels with leading combining marks are invalid ⍟ | |
2 | if at least one variant is in | {out-of-repertoire-var} | → | invalid | any variant label with a code point out of repertoire is invalid ⍟ | |
3 | if at least one variant is in | {blocked} | → | blocked | any variant label containing blocked variants is blocked ⍟ | |
4 | if any label (catch-all) | → | valid | catch all (default action) ⍟ |