more about build-index+cl-ppcre branch & encodings
Subject: more about build-index+cl-ppcre branch & encodings
From: Leo Butler
Date: Tue, 8 Mar 2011 20:42:40 +0000 (GMT)
On Mon, 7 Mar 2011, Robert Dodier wrote:
< On 3/5/11, Leo Butler <l.butler at ed.ac.uk> wrote:
<
< > I've patched that, and a few things Ray mentioned in this
< > thread. I now see what appears to be the correct characters
< > for both encodings and all languages and lisps in clisp, cmucl
< > and sbcl and on both an xterm and gnome-terminal (debian testing
< > & ubuntu 10.04 less cmucl).
< >
< > To be clear, with revision b5805315e8 in a freshly cloned
< > repo, I execute:
< >
< > ./bootstrap && \
< > ./configure --enable-sbcl --enable-clisp --with-default-lisp=sbcl \
< > --program-suffix=-post-5.23 --enable-all-lang && \
< > make && \
< > cd tests && \
< > perl ./rtest-build-index.pl -verbose=1 -run_rtest_build_index=true \
< > -lisps=sbcl:clisp #:cmucl
<
< Works for me,
That's good to hear.
< but I had to update makeinfo (texinfo package)
< to 4.13, which generates UTF-8 characters when the
< documentencoding is UTF-8. (Makeinfo from texinfo 4.11
< generates ASCII approximations such as c, for cedilla.)
<
< I'm not too crazy about updating makeinfo.
I don't know: Makeinfo 4.11 is 3.5 years old and 4.13 is 2.5
years old. I think it is reasonable to expect folks to have
reasonably up-to-date kit.
< I suspect the
< existing Makefile machinery in doc/info/* which calls recode or
< iconv after generating ISO-8859 maxima.info files is a
< workaround for the inability of pre-4.13 makeinfo versions to
< actually generate UTF-8.
< Maybe instead of fudging the
< documentencoding in maxima.texi, we could
< s/coding: iso-8859-1/coding: utf-8/ in maxima.info* instead.
< That would make the declared encoding match the actual
< without requiring an update for makeinfo.
I'll look into this possibility.
<
< I like the idea of using a Lisp-based system to generate the
< help index, although I'm not convinced that the extra gyrations
< to explicitly track the encoding for an ordinary Maxima
< session are an improvement. (It is necessary to track the
< encoding when generating the indices since the maxima.info
< files have different encodings.)
I think that there is a better way to do this,
while still making it possible to set the encoding
and generate index files from different info files.
I'll see what I can do.
Leo
--
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.