From ... Path: archiver1.google.com!news1.google.com!newsfeed.stanford.edu!news-spur1.maxwell.syr.edu!news.maxwell.syr.edu!newsfeed.esat.net!nslave.kpnqwest.net!nloc2.kpnqwest.net!nloc.kpnqwest.net!nmaster.kpnqwest.net!nreader2.kpnqwest.net.POSTED!not-for-mail Newsgroups: comp.lang.lisp,comp.lang.scheme Subject: Re: Wide character implementation References: <87wuw92lhc.fsf@becket.becket.net> <1016554947.964486@haldjas.folklore.ee> Mail-Copies-To: never From: Erik Naggum Message-ID: <3225568971513146@naggum.net> Organization: Naggum Software, Oslo, Norway Lines: 25 User-Agent: Gnus/5.09 (Gnus v5.9.0) Emacs/21.1 MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit Date: Tue, 19 Mar 2002 23:22:39 GMT X-Complaints-To: newsmaster@KPNQwest.no X-Trace: nreader2.kpnqwest.net 1016580159 193.71.199.50 (Wed, 20 Mar 2002 00:22:39 MET) NNTP-Posting-Date: Wed, 20 Mar 2002 00:22:39 MET Xref: archiver1.google.com comp.lang.lisp:29632 comp.lang.scheme:9510 * Sander Vesik | They use either UTF8 or UTF16 - you cannot rely on whetvere size | you pick to be suitably long forever, unicode is sort of inherently | variable-length (characters even have too possible representations | in many cases, ä and similar 8-) Variable-length characters? What the hell are you talking about? UTF-8 is a variable-length _encoding_ of characters that most certainly are intended to require a fixed number of bits. That is, unless you think the digit 3 take up only 6 bits while the letter A takes up 7 bits and the symbol ± takes up 8. Then you have variable-length characters. Few people consider this a meaningful way of talking about variable length. | Implement them as variable-length strings using say UTF-8. Also, saying | that most characters will not be wide may well be a wrong assumptin 8-) Real programming languages work with real character objects, not just UTF-8-encoded strings in memory. Acquire clue, _then_ post, OK? /// -- In a fight against something, the fight has value, victory has none. In a fight for something, the fight is a loss, victory merely relief.