diff options
author | wkj <devnull@localhost> | 2004-04-21 01:15:41 +0000 |
---|---|---|
committer | wkj <devnull@localhost> | 2004-04-21 01:15:41 +0000 |
commit | a31db67d14c0e50353eac3db342f3c969cabdf76 (patch) | |
tree | 0cebd9a65bab940355f5be0751f53238366555d7 /src/cmd/tcs/ex5.utf | |
parent | ba03b8910dd07511378e6f1007faf0539ed25598 (diff) | |
download | plan9port-a31db67d14c0e50353eac3db342f3c969cabdf76.tar.gz plan9port-a31db67d14c0e50353eac3db342f3c969cabdf76.tar.bz2 plan9port-a31db67d14c0e50353eac3db342f3c969cabdf76.zip |
Add tcs; it compiles in my world, but I haven't tried it in Russ's yet.
Diffstat (limited to 'src/cmd/tcs/ex5.utf')
-rw-r--r-- | src/cmd/tcs/ex5.utf | 50 |
1 files changed, 50 insertions, 0 deletions
diff --git a/src/cmd/tcs/ex5.utf b/src/cmd/tcs/ex5.utf new file mode 100644 index 00000000..b4baca0b --- /dev/null +++ b/src/cmd/tcs/ex5.utf @@ -0,0 +1,50 @@ +.tr -\(hy +.TL +Hello World +.br +or +.br +Καλημέρα κόσμε +.br +or +.br +こんにちは 世界 +.AU +Rob Pike +Ken Thompson +.AI +.MH +.AB +Plan 9 from Bell Labs has recently been converted from ASCII +to an ASCII-compatible variant of Unicode, a 16-bit character set. +In this paper we explain the reasons for the change, +describe the character set and representation we chose, +and present the programming models and software changes +that support the new text format. +Although we stopped short of full internationalization\(emfor +example, system error messages are in Unixese, not Japanese\(emwe +believe Plan 9 is the first system to treat the representation +of all major languages on a uniform, equal footing throughout all its +software. +.AE +.SH +Introduction +.PP +The world is multilingual but most computer systems +are based on English and ASCII or worse. +The pending release of Plan 9 [Pike90], a new distributed operating +system from Bell Laboratories, seemed a good occasion +to correct this chauvinism. +It is easier to make such deep changes when building new systems than +by retrofitting old ones. +.PP +The ANSI C standard [ANSIC] contains some guidance on the matter of +`wide' and `multi-byte' characters but falls far short of +solving the myriad associated problems. +We could find no literature on how to convert a +.I system +to larger character sets, although some individual +.I programs +have been converted. +This paper reports what we discovered as we +explored the problem of representing multilingual |