Discussion about arrays and matrices in Maxima

Subject: Discussion about arrays and matrices in Maxima
From: Richard Fateman
Date: Tue, 19 May 2009 09:26:35 -0700
One way I have thought about addressing the overarching issue of 
compatibility vs progress is this way:

Start by writing a document describing how to use your new design.  A 
replacement for the manual,
a textbook on applications, a tutorial for students, etc.  Your book 
includes advice on how to take the (standard) Dodier-approved Maxima
and convert it into your version.

 Typically all that is needed is to insert a line in a .maximarc file 
that is read at startup time.

This documentation will include all of your improvements, as well as the 
parts of Maxima that you continue to use unchanged.

Thus you do not need a "fork".  Just a patch file.

Unfortunately this can degenerate into what we see now with operating 
systems, browsers, email-readers, editors: There is a list of
perhaps hundreds of  "options" to customize the behavior of the program, 
and no one can truly say if the program will work with all the features 
set in a particular way.

You mention (if at all) the parts that you have replaced, in some manner 
to show that they are 'deprecated'.  That is,
should not be used in new code.

It is clear that this happens in other CAS,   for example Maple has two 
packages,  linalg and LinearAlgebra  (or something like that)
that have identical capabilities.  One is better than the other.

Another example: Commercial Macsyma  has "matrice"  in addition to 
"matrix".  The documentation is online, but here is a taste of it...


MATRICE(symbol, options, object)

[MATRICE package] Creates a matrice described by the options with its
elements taken from the object and attaches the newly created matrice
to the symbol. The object may be a matrix, a two-dimensional array,
the word ZEROES (in which case the matrice will be filled with
zeroes), the pseudofunction VALUE(k) (in which case the nonzero
portion of the matrice will be filled with the value specified by k),
or a list of lists.  In the last case, the (sub)lists will be used to
fill the secondary arrays of the matrice directly so that this
operation performs the inverse of ICE2LIST.  (As a shorthand, a single
unlistified expression will abbreviate a sublist of identical copies
of this expression.)  If the object is omitted then MATRICE will call
ENTERMATRICE.  The options consist of zero or more descriptors
providing the parameters of the matrice.  If a descriptor is a
positive integer or a list containing a single positive integer then
the number will be taken for the row and column dimension of the
matrice.  If a descriptor is a list containing two positive integers
then the first integer will be taken to be the row dimension and the
second the column dimension of the matrice. (Note that if the object
that is provided to MATRICE is a matrix or a two-dimensional array
then the OBJECT's dimensions will be used for the matrice and so do
not need to be explicitly specified.)  A descriptor may also indicate
the mathematics of the matrice (REAL or COMPLEX - default:REAL), the
form of its elements (NUMERICAL, CRE [Canonical Rational Expressions]
or GENERAL_EXPRESSIONS [the default]) or its type (default is
GENERAL).  The possible types that may be specified are given in the
first column of the following table:

   GENERAL                  m x n
   HERMITIAN                n x n
  +GENERAL_BANDED           n x n*
  +HERMITIAN_BANDED         n x n
   UPPER_TRIANGULAR         m x n
   LOWER_TRIANGULAR         m x n
   UPPER_HESSENBERG         n x n
   LOWER_HESSENBERG         n x n
   SYMMETRIC                n x n    REAL HERMITIAN
  +BANDED                   n x n*   GENERAL_BANDED
   DIAGONAL                 m x n    GENERAL_BANDED(1)
   TRIDIAGONAL              n x n    GENERAL_BANDED(2)
   HERMITIAN_TRIDIAGONAL    n x n    HERMITIAN_BANDED(2)
  +SYMMETRIC_BANDED         n x n    REAL HERMITIAN_BANDED
   SYMMETRIC_TRIDIAGONAL    n x n    REAL HERMITIAN_BANDED(2)
   TRIANGULAR               m x n    UPPER_TRIANGULAR
   HESSENBERG               n x n    UPPER_HESSENBERG

  *May be m x n if the half-bandwidth is 1.

.... much more....


Mathematica has two or more matrix types, one of which is used for large 
dense numeric matrix data only.


I agree entirely with your comments on random. I have written a program 
that does something like this.  I found the lisp random number generator 
to be too slow (it is too general) and so I found another method to fill 
a ring-buffer with "enough" pseudo-random numbers for my purpose and 
then I used them one after another for my simulation.  Using a looped 
list, I could get the next number by using   (pop random_number_loop).
Of course these were not so random if you used it for long enough.
RJF



?iga Lenar?i? wrote:
> On 19. May, 2009, at 6:29 AM, Robert Dodier wrote:
>
>   
>> On 5/18/09, ?iga Lenar?i? <ziga.lenarcic at gmail.com> wrote:
>>
>>     
>>> So from my experience - it's hard to improve something in Maxima,
>>>       
>> Keep in mind that you are always free to maintain and distribute
>> your own Maxima packages (or even your own entire fork).
>> That might be a good way to publicize and refine your proposals.
>>
>> I don't mean to be difficult --- I'm generally sympathetic to your
>> proposals. Bear in mind that the wheel of Maxima grinds slowly,
>> but exceedlingly fine.
>>     
>
> Perhaps the wheel of Maxima is broken a bit :) If an improvement is  
> suggested (with lisp code) which is faster and/or more featureful and/ 
> or easier to use without breaking backwards compatibility, I think it  
> should be implemented in some form or another. For instance I have  
> ideas for some of these 'harmless' improvements:
> *) 'transpose' working on lists of lists (I find myself many times I  
> want to transform a list like [ [a1,a2,a3,..], [b1,b2,b3,...] ] into  
> [[a1,b1],[a2,b2],...] - I don't want to mess with converting to  
> matrix and back, just to use 'transpose')
> *) extending 'random' to quickly produce a list of numbers (random  
> numbers are often used in larger quantities).. random(1.0) would work  
> as it works now, while random(1.0, 100) would produce a list of 100  
> random numbers much quicker than makelist( random(1.0), i, 1, 100); -  
> also less code.
> If I write lisp code for these two cases, will that get into Maxima?  
> I don't see any potential harms in these two improvements...
> I also know that at some point Andrej V. posted a 'nintegrate'  
> function - to wrap up quadpack routines. I think users would find it  
> simpler to a) use something with 'integrate' in its name, b) to not  
> have to call different quadpack routines depending on the integration  
> bounds (finite, infinite..). It even supported 'romberg' as an  
> integration method. After some discussion (mainly about the name of  
> the function I think), nothing changed in Maxima, and now 3 years  
> later one still _has_ to call quadpack routines (which is absurd if a  
> better solution was presented). I fear this happens too often...
>
>   
>>> *) Matrices should internally represented as n-dimensional lisp
>>> arrays.
>>>       
>> I agree that a matrix should use a Lisp array for storage, but
>> I'm inclined to use a 1-d array, since it makes some restructuring
>> operations (e.g. transpose, row/column extraction) simpler.
>>
>> I can't tell if the translated Fortran code wants 1-d or 2-d arrays.
>> Fortran would be happy with 1-d, but there seems to be some
>> code to present 1-d as 2-d or vice versa, I'm not sure what's
>> going on.
>>     
>
> As far as transposition goes - in LAPACK some routines take as an  
> argument a transposition flag - so internally values are refered as a 
> [i][j] in a normal case or a[j][i] in transposed case - no actual  
> transposition is done. Maxima could have the same system  
> (transposition as a flag) but not necessarily. If a function layer is  
> written for writing and reading matrix elements internally with  
> (defun mmatrix-ref-element ..), then Maxima functions would see a  
> matrix as a transposed one or a nontransposed one without doing any  
> actual transposition on array elements. But perhaps it's simpler to  
> do actual transposition and then allow Maxima functions to work on  
> matrices with 'aref' directly?
>
> I don't see how 1-d arrays of 1-d arrays help row/column extraction -  
> one of them can be done via just returning the 1-d array, but the  
> other one must be done via 'manually' collecting the elements (so you  
> profit only on one of this cases). I have a feeling 1-d arrays of 1-d  
> arrays doesn't translate so well to 3-d matrices..
>
> Since matrix is rectangular (i.e. all rows must be of the same  
> length) I think using n-dimensional lisp arrays is better and  
> simpler. I imagine that using a 2d-array instead of and array of  
> arrays is also faster (because lisp compiler can optimize for  
> performance much more) for most operations. Also (aref (aref arry j)  
> i) is probably slower and also more complicated to write than (aref  
> arry (,i ,j)). For more than 2d matrices/arrays this is even more  
> obvious. Since Maxima is a lisp program, why not leverage all the  
> features lisp offers. Another complication is if we use general  
> arrays and 'double-float arrays. If matrices are n-dimensional lisp  
> arrays, one can easily check the type of array to decide how to treat  
> it, while in case of 1-d arrays of 1-d arrays, one has to look at the  
> type of innermost arrays!
>
> 1-d arrays of 1-d arrays (growable) could be used to implement Maxima  
> list internally with faster access in cases of long lists - but I  
> like the fact Maxima lists are represented with Lisp lists internally  
> - with some knowledge of lisp, one can easily write a function which  
> outputs a Maxima list via (cons '(mlist simp) mylist). I see this  
> ease of adding your own functions to Maxima as it's strongest  
> argument for expert use of Maxima compared to Mathematica. Virtually  
> no modification of some lisp routine is necessary to make it work in  
> Maxima, while extending MATLAB or Mathematica in a compiled language  
> (C) always requires a layer which translates your 'simple' results (a  
> C array perhaps) to something MATLAB/Mathematica can understand - and  
> this is very tiresome to do. While programs written in C have this  
> inherent problem (extending requires some layer for conversion of  
> datastructures), Maxima as a Lisp program seem to reap the benefits  
> of Lisp in this case.
>
> I see using n-dimensional arrays also as reaping the benefits of  
> using Lisp - no need to complicate things.
>
> What else do 1-d arrays of 1-d arrays offer, besides easier  
> transposition and row/column extraction? Perhaps using n-dimensional  
> arrays would pay off with better performance and cleaner, simpler code?
>
>   
>>> *) Entering matrices would probably be implemented as it is now -
>>> with matrix function.
>>>       
>> No opinion on this point.
>>     
>
> To explain what I was trying to say :)
> It would be nice to have a special syntax for entering matrices, like  
> we have for lists with [a,b,c,d] and sets {1,2,3,4}. But since we  
> have used up different parenthesis, the only way to enter a matrix is  
> to write
> matrix( [a,b], [c,d] );
> I think this is OK.
>
>   
>>> *) The only place a word 'array' should be mentioned in Maxima
>>> documentation is where implementation of matrices is explained,
>>> ideally (if current array facilities are some day removed). I think
>>> matrix (in Maxima language) is a suitable word for an array(its
>>> meaning in computer science).
>>>       
>> I'm not so sure about this. A matrix is an algebraic object, right?
>> while an array is just a storage container. I think confounding
>> fundamentally different kinds of objects for the sake of convenience
>> is a poor choice. (Incidentally I'm also opposed to treating lists
>> as synonymous with vectors --- they have different algebraic
>> properties.)
>>     
>
> Some mathematical abstractions don't translate that well into a  
> program. The best solution is to have some kind of balance between  
> 'mathematical correctnes' and usability. For instance let's take a  
> look at how sets are implemented in Maxima.
>
> The usual sets of interest in mathematics are infinite, which  
> translates very badly into a computer program. If we cover only  
> finite sets, then we are essencially working with lists (from a  
> computational point of view) that don't have elements repeated. For  
> instance we have decided to allow only finite sets because they  
> translate well into a program, but have also decided to treat sets  
> differently from lists to emphasize algebraic properties. The  
> consequences are:
> *) We have a set of functions which perform operations only on sets,  
> though a user would perhaps like to use them on lists also (without  
> conversion).
> *) Functions that are implemented to work on lists have to be  
> expanded to work on sets also. So in effect Maxima sets have less  
> functionality available than Maxima lists.
>
> For a user, who is learning high-school math, such a basic  
> implementation of sets is perhaps useful, though the same user would  
> be confused, why there are no sets of natural/real numbers he could  
> call 'length()' on and get a 'mathematical' answer...
>
> For a user, who knows his math and is using Maxima to compute  
> something, the distinction of list and sets is useless, since he  
> looks at things from a computational point of view (at least thats my  
> experience). He wants to operate on data structures, not to learn  
> what a 'set' is. Therefore having functions that work only on sets  
> and not on lists also is only an unnecessary complication for him.
>
> I see a similar situation with vectors - what does it bring to the  
> table for a user who knows his math? Vector is a mathematical  
> abstraction that doesn't translate well into a program. Since maxima  
> supports only finite-dimensional vectors written in the same basis -  
> I don't see what is gained by not seeing vectors as a list of  
> numbers. For computation I'd like to have vectors accepted as a  
> Maxima list or a Maxima matrix (preferably both are accepted) -  
> because I already know how to manipulate these two data forms. A  
> package (vect?) could be used when one wants to perform abstract  
> operation on vectors - though these best work on paper I'm afraid.
>
>   
>> A hash table is called an undeclared array in Maxima; I think that's
>> an unfortunate choice of words. I'd rather just call them hash tables
>> (and implement them with Lisp hash tables).
>>     
> I don't know about hashtables in Maxima - but in general if a Maxima  
> structure translates well into a Lisp type, then the Lisp type should  
> be used I think. In effect this means shorter/more understandable  
> code and reaping the performance of optimized Lisp implementations of  
> these types.
> Lisp complex numbers pop to mind #C(). Perhaps they are not general  
> enough for usage in Maxima?
>
>   
>> Another tangential point --- Maxima distinguishes arrays which are
>> values from arrays which are properties. I'm inclined to preserve
>> that distinction.
>>
>> Thanks for bringing up the topic. Perhaps in the near future
>> (i.e. less than 1 year) we'll really get rolling on it.
>>
>> best
>>
>> Robert Dodier
>>     
>
> I hope something will come out of this :) Lisp arrays are there to  
> use them - it would be foolish not to use them for matrices.
>
> Regards,
> Ziga Lenarcic
>
>
> _______________________________________________
> Maxima mailing list
> Maxima at math.utexas.edu
> http://www.math.utexas.edu/mailman/listinfo/maxima
>
Prev by Date: Java vs Lisp revisited
Next by Date: interesting plotting library
Previous by thread: Discussion about arrays and matrices in Maxima
Next by thread: Discussion about arrays and matrices in Maxima
Index(es):
- Date
- Thread