[groff] 17/28: groff_char(7): Rewrite 8-bit Latin-1 subsection.
From: G. Branden Robinson
Subject: [groff] 17/28: groff_char(7): Rewrite 8-bit Latin-1 subsection.
Date: Tue, 1 Sep 2020 07:43:08 -0400 (EDT)
gbranden pushed a commit to branch master
in repository groff.
commit 3024a3e3c0f460dc83027d1bbc0b29f38eb7e760
Author: G. Branden Robinson <g.branden.robinson@gmail.com>
AuthorDate: Tue Sep 1 20:38:03 2020 +1000
groff_char(7): Rewrite 8-bit Latin-1 subsection.
* Retitle subsection to "Eight-bit encodings and Latin-1 supplement".
* Elide references to code point numbers to make discussion
ISO/EBCDIC-agnostic.
---
man/groff_char.7.man | 35 +++++++++++++++++++----------------
1 file changed, 19 insertions(+), 16 deletions(-)
diff --git a/man/groff_char.7.man b/man/groff_char.7.man
index b5612de..a8272b5 100644
--- a/man/groff_char.7.man
+++ b/man/groff_char.7.man
@@ -474,25 +474,29 @@ falling back to basic Latin glyphs only when necessary.
.
.
.\" ====================================================================
-.SS "8-bit character codes 160 to 255"
+.SS "Eight-bit encodings and \%Latin-1 supplement"
.\" ====================================================================
.
-Code points in the range 160\[en]255 are for the most part interpreted
-as characters according to the \%Latin-1 (ISO \%8859-1) standard,
-and correspond to those in the same range of the Unicode \[lq]C1
-Controls and \%Latin-1 Supplement\[rq] code chart.
+ISO 646 is a seven-bit code encoding 128 code points;
+eight-bit codes are twice the size.
+.
+ISO 8859-1 and code page 1047 allocated the additional space to what
+Unicode calls \[lq]C1 controls\[rq]
+(control characters)
+and the \[lq]\%Latin-1 supplement\[rq].
+.
+The C1 controls are neither printable nor usable as
+.I groff
+input.
.
.
.P
-Input characters in range 128\[en]159
-(the C1 controls on non-EBCDIC hosts)
-are defined as control characters and not printable;
-nor are two characters in the \%Latin-1 supplement.
+Two non-printing characters in the \%Latin-1 supplement are handled
+specially.
.
.
.TP
-160
-.
+NBSP
encodes the no-break space,
and is mapped to
.BR \[rs]\[ti] ,
@@ -500,8 +504,7 @@ the adjustable non-breaking space escape.
.
.
.TP
-173
-.
+SHY
encodes the soft hyphen character.
.
.I groff
@@ -512,7 +515,7 @@ on input it is mapped to the hyphenation control escape,
.
.
.P
-The remaining characters in the \%Latin-1 supplement print as
+The remaining characters in the \%Latin-1 supplement represent
themselves.
.
Although they can be specified directly with the keyboard on systems
@@ -521,8 +524,8 @@ it is more portable,
both to other
.I roff
systems and to \%UTF-8 environments,
-to use their glyph names;
-see the next section.
+to use their glyph names,
+shown below.
.
.P
.TS
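
Illustration (not part of the commit): the final hunk recommends glyph names over raw eight-bit input for portability. A minimal roff sketch of what that might look like follows. The sample words are invented for the example; the escapes used (`\~`, `\%`, and the glyph names `\[ss]`, `\['e]`, `\[/o]`) are standard groff.

```
.\" Hypothetical input fragment; the sample text is invented.
.\" Glyph names replace Latin-1 supplement characters typed directly.
Stra\[ss]e, caf\['e], and sm\[/o]rrebr\[/o]d
.br
.\" \~ is the adjustable non-breaking space that an input NBSP maps to.
100\~km
.br
.\" \% is the hyphenation control escape that an input SHY maps to;
.\" inside a word it marks a permitted hyphenation point.
hyphen\%ation
```

Written this way, the input contains only basic Latin characters, so it should format the same whether the file is read as Latin-1, code page 1047, or UTF-8.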