**Sparse Matrix Representation**

In Chapter 2, we saw that when matrices were sparse (i.e., many of the entries were zero), then much space and computing time could be saved if only the nonzero terms were retained explicitly. In the case where these nonzero terms did not form any “nice” pattern such as a’ triangle or a band, we devised a sequential scheme in which each nonzero term was represented by a structure with three data members: row, column, and value. These terms were organized sequentially. However, AS matrix operations such as addition, subtraction, and multiplication are performed, the number of nonzero terms in matrices will vary. Matrices representing partial computations (as in the case of polynomials) will be created and will have to be destroyed later to make space for additional matrices. Thus, sequential schemes for representing sparse matrices suffer from the same inadequacies as similar schemes for polynomials. In this section we study a very general linked-list scheme for sparse-matrix representation. As we have already seen, linked schemes facilitate efficient representation of varying-size structures, and here, too, our scheme overcomes the aforementioned shortcomings of the sequential representation studied in Chapter 2.

In the data representation we use, each column of a sparse matrix is represented by a circularly linked list with a head node. In addition, each row is also a circularly linked list with a head node.

Each node is represented by the class MatrixNode. This class has a field called head, which is used to distinguish between head nodes and nodes representing nonzero matrix elements. Each head node has three additional fields: down, right, and next. The total number of head nodes is max {number of rows, number of columns}. The head node for row i is also the head node for column i: The down field of a head node is used to link into a column list; the right field is used to link into a row list. The next field links the head nodes together.

All other nodes have six fields: head, row, col, down, right, and value (Figure 4.25). The down field is used to link to the next nonzero term in the same column and the right field links to the next nonzero term in the same row , then there is a node with head = FALSE, value = a j), row = i, and col = j. This node is linked into the circular linked lists for row i and column j. Hence, the node is simultaneously in two different lists.

As noted earlier, each head node is in three lists: a row list, a column list, and a list of head nodes. The list of head nodes itself has a head node that is identical to the nodes used to represent nonzero elements. The row and col fields of this node are used to store the matrix dimensions. The entire matrix is represented by class Matrix with the data member headnode, which points to this node.

The linked structure obtained for the 6 x 7 matrix, A, of Figure 4.26 is shown in Figure 4.27. Although Figure 4.27 does not show the values of the head fields, these values are readily determined from the node structure shown. For each nonzero term of A, we have one six-field node that is in exactly one column list and one row list. The head nodes are marked H0 to H6 and are drawn twice to simplify the figure. As seen in the figure, the right field of headnode is used to link into the list of head nodes. Notice that the whole matrix may be referenced through the head node, headnode , of the list of head nodes .

If we wish to represent an n x m sparse matrix with r nonzero terms, tile number of nodes needed is max{n,m} + r + 1. Although each node may require several words of memory, the total storage needed will be less than n m for sufficiently small r.

The required node structure may be defined in C++ using an anonymous union as in Program 4.30. Since all nodes contain fields head, down and right, these are declared outside the anonymous union. Head nodes also contain next while all other nodes “Contain row, col, and value. These fields are all incorporated into struct Triple. Space is allocated by the union declaration for the larger of fields next and triple (which is an object of type Triple). Because of this, fields row, col, and value are accessed indirectly through triple.:

**Sparse Matrix Input**

The first operation we shall consider is that of reading in a sparse matrix and obtaining its linked representation. We assume that the first input line consists of s.row (the number of rows), s.col (the number of columns), and s.value (the number of nonzero terms). The line is followed by s.value lines of input; each of these is a triple of the form (i, i. a;). These triples consist of the row, col, and value data members of the nonzero terms of the matrix. We also assume that these triples are ordered by rows and within rows by columns.

For example, the input for the 6 x 7 sparse matrix of Figure 4.26, which has seven nonzero terms, would take the following form: 6,7,7; 0, 2,11; 0, 5,13; 1,0,12; 1,6,14; . We shall not concern ourselves here with the actual format of this input on the input media (tape, disk, terminal, etc.) but shall assume that we have a mechanism to get the next triple (see Exercises for a possible input format). The function operator» (Program 4.31) also makes use of an auxiliary array head, of size max (s. row, s.col). head [i] is a pointer to the head node for column i and hence also for row i, permitting us efficient random access to columns while the input matrix is set up. Function operator» proceeds first by setting up all the head nodes and then by setting up each row list. simultaneously building the column lists. The next data member of head node i is used initially to keep track of the last node in column i. Eventually, in line 30.the head nodes are linked together through this data member.

**Analysis of Operator:** Since new works in a constant amount of time. all the head nodes may be set up in O(max(n,m)) time, where n is the number of rows and m the number of columns in the matrix being input. Each nonzero term can be Set up in a constant amount of time because of the use of the variable last and a random access scheme for the bottom most node in each column list. Hence. the for loop of lines 16-27 can be

carried out in O(r) time. The rest of the algorithm takes O(max(n,m» time. The total time is therefore O(max(n,m} + r) = O(n + m + r). Note that this time is asymptotically better than the input time of O(nm) for an n x m matrix using a two-dimensional array but slightly worse than the sequential sparse method of Section 2.3.

**Erasing a Sparse Matrix**

All the nodes of a sparse matrix may be returned one at a time using delete. A faster way to return the nodes is to set up an available-space list as was done for. polynomials. Assume that av points to the front of this list and that this list is linked through the data member right. Function Matrix::-Matrix 0 (Program 4.32) solves our problem in an efficient way.

**Analysis of Matrix():** Since each node is in only one row list, it is sufficient to return all the row lists of the matrix a. Each row list is circularly linked through the data member right. Thus, nodes need not be returned one by one, as a circular list can be erased in a constant amount of time. The computing time for the algorithm is readily seen to be O(n + m). Note that even if the available-space list had been linked through the data member down, erasing still could have been carried out in O(n + m) time.

The subject of manipulating sparse matrix structures is studied further in the exercises. The representation studied here is rather general. For most applications’ this generality is not needed. A simpler representation resulting in simpler algorithms is discussed in the exercises.