more about build-index+cl-ppcre branch & encodings




On Wed, 2 Mar 2011, Robert Dodier wrote:

< I've updated my sandbox to revision 9c49048 and built Maxima.
< I'm seeing the same behavior today as I did a day or two ago;
< titles & content is displayed correctly in ISO-8859 locales,
< in UTF-8 locales, titles are correct and content is messed up.

Ok, I can confirm this: with revision 9c49048, when I set
export LC_ALL and LANG in the xterm, then the utf-8
locales are not displayed correctly. I believe this is due 
print-info-hashes not setting the external-format correctly.
(When I view es.utf8/maxima-index.lisp in emacs, the variable
buffer-file-coding-system is set to iso-latin-1-unix, the
default encoding on my system. After patching, it is utf-8-unix.)

I've patched that, and a few things Ray mentioned in this
thread. I now see what appears to be the correct characters
for both encodings and all languages and lisps in clisp, cmucl 
and sbcl and on both an xterm and gnome-terminal (debian testing
& ubuntu 10.04 less cmucl).

To be clear, with revision b5805315e8 in a freshly cloned
repo, I execute:


 ./bootstrap && \
 ./configure --enable-sbcl --enable-clisp --with-default-lisp=sbcl \ 
 --program-suffix=-post-5.23 --enable-all-lang && \
 make && \
 cd tests && \
 perl ./rtest-build-index.pl -verbose=1 -run_rtest_build_index=true \
 -lisps=sbcl:clisp #:cmucl

Leo

-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.