ECL? (was: Re: Runtime determination of share directories?)



Michael Abshoff wrote:...

...
> Mhhh, maybe I did not name the right package? I am referring to the 
> recent work of Dieter Kaiser implementing more special functions and I 
> do recall him increasing the number of bits used internally for some 
> computations to ensure identical results on various lisps.
>
I noticed some mail about (double precision?) logarithm / gamma.  
Common Lisp allows for different float types, up to four in the 
standard: short, single, double, and long.  It is certainly possible 
to support others as well, such as double-extended or double-double 
(which is, I think, in CMU-CL).
> ......


> MPFR itself is pure C, but MPFR relies on GMP for the underlying 
> arithmetic which is partially written in assembler. But I don't see 
> how requiring MPFR and GMP present on the system would be an issue 
> since building a lisp from sources is often harder. 

The fact that Sage requires building everything from sources is 
pretty much irrelevant to Maxima.  The vast majority of Maxima users 
just download and install a working executable (plus support files) 
suitable for their machine.

If MPFR were to be made part of Maxima (which might be nice) it would 
involve some technical tricks. For example, to get the best out of MPFR 
and GMP, the install process would need to distinguish among the many 
x86 implementations and choose the optimal binary. I don't know the 
details of the current system, but the best code I have for a 
Pentium 4 does not run on an earlier Pentium.
The code for the earlier Pentium runs on the Pentium 4, but is slower.
A pure-C implementation of GMP is considerably slower still on the Pentium 4.
So a Maxima using MPFR+GMP could either use the pure-C version and be 10X 
slower, or at some stage figure out which CPU is running.
Testing such a system build on all x86 implementations would be a 
hassle.  I, of course, only care about the computer I'm usually
using.
> Making it optional and falling back to a pure lisp implementation 
> would obviously be a good idea for small devices like PDAs since I 
> guess you want to make Maxima as sleek as possible on those devices.
If I want to make my life more difficult, I can try riding a unicycle or 
maybe constructing ships in bottles.  Doing mathematics by typing with 
my thumbs and seeing the answers on a 2x2 inch screen is not so 
interesting to me.  But what do I know; I don't own an iPod.

>
> Well, take the latest official release, build it on Solaris running on 
> an x86 CPU and run make check. It failed its test suite on every x86 
> based Solaris box I tried and that is a bad, bad thing. 
If you haven't reported it, maybe they don't know.  I don't know of 
anyone (else?) using Solaris on x86, and maybe they don't either.

> If you use quaddouble to do numerical work this is less of an issue IMHO,

Huh?  You are using quaddouble but not for numerical work?  Or do you 
mean you are using quaddouble for integers only?
> but I see little benefit from getting potentially wrong results 
> anywhere from 10 to 50% more quickly than MPFR if you want identical 
> results on any platform which MPFR does deliver.
Well, consider those people who are hacking away in dungeons trying to 
get 2 processors to work together to make a program run 50% faster.  
You are saying you could do the same job with one processor, but that 
debugging the program is not worth it.
Imagine those dungeon-hackers: if they could get 1.5X from the one 
processor, and another 50% from the second, they would have
(1.5)^2 = 2.25X speedup. 
> And by the way: quaddouble is released under the BSD license by 
> researchers working for LBNL and U.C. Berkeley, but according to
>
>   http://crd.lbl.gov/~dhbailey/mpdist/
>
> one can read that
>
>
>   "Incorporating this software in any commercial product requires a 
> license agreement"
>
>
> Maybe someone ought to clue these people in what it means to release 
> software under the BSD license. And I am sure someone should point 
> them to the wikipedia page about BSD to make 100% sure that they will 
> appreciate the irony.
I have forwarded your note to one of the people :)
>
>>
>>> On the same platform using quaddouble the number of partitions for 
>>> the first five hundred integers is incorrect in about half the 
>>> cases, so any time you are using some extended-precision library 
>>> that is not rigorously proven to be correct, I get very nervous.
>>>   
>> Maybe the algorithm requires more precision?
>
> No, the problem with quaddouble is that it requires at least on x86 to 
> precisely set the FPU control word, i.e. rounding mode and so on. On 
> PowerPC or Sparc this is not possible to my recollection, but in our 
> experience arithmetic operations there also deliver less than the 212 
> bits promised. I have even seen cases where a single multiplication of 
> two numbers (and we did not attempt to hit a corner case) produced 
> results that were different in the last three or four bits.
In my own use of quaddouble from Lisp  on x86, I have to be careful that 
the fpu control word isn't messed up by other processes which may also 
be taking time from the same Lisp, and this might be the same kind of 
situation for Sparc.  x86 can do more arithmetic (fused multiply-add) in 
registers with more precision than Sparc2. I don't know about later 
Sparc or PowerPC.  In any case, quad-double does not, as I recall, 
guarantee the last few bits in the fraction, and thus if you are relying 
on them either to be correct or consistent across different 
architectures, then you should not be using this software.

I changed the code to set the rounding modes, e.g.

void c_qd_add(const double *a, const double *b, double *c) {
  fpu_fix_start(NULL);    //  ADDED: set the fpu control word

  qd_real cc;
  cc = qd_real(a) + qd_real(b);
  TO_DOUBLE_PTR(cc, c);
}

I frankly don't know if this guards the fpu control word adequately for 
all possible operating systems and architectures.
>
> If the documentation tells me that I get 212 bits of precision then it 
> should not matter which IEEE conformant CPU I am running the code on 
> (modulo compiler bugs), but quaddouble does for the purpose the Sage 
> project uses it not live up to the standard of reproducibility. 
Reproducibility down to the last few bits is, according to some people, 
not worth the slowdown.  Maxima's bigfloats are way slower than they 
could be because all basic arithmetic is done with 
round-to-nearest-even; it could probably be made 2 to 4 times faster.
Elementary functions like exp and cos are probably good to 1 unit in the 
last place, but I never proved that. Since you can always raise the 
working precision by a bunch of bits, the advantages of correct rounding 
can probably be obtained with less work.



> That does not mean it is not useful for other projects, but at least 
> Sage is not in the business of delivering potentially wrong results 
> 50% faster. 
Well, wrong is in the eye of the beholder.  Have you heard the phrase 
"close enough for government work"?

> AFAIK the issue is known to quaddouble developers and Carl Witty and I 
> discussed the possibility of attempting to fix it by working around 
> potential miscompilations last night, but this is a waste of time 
> since even if we get it to work for some examples it will still not 
> even come close to the assurance that MPFR gives me. And correctness 
> should always come before speed in any software project.
Certainly it was known in 2007 when I wrote the generic Lisp interface 
to qd.

RJF