Name	Name	Last commit message	Last commit date
parent directory ..
.gitignore	.gitignore
CMakeLists.txt	CMakeLists.txt
Makefile	Makefile
README.md	README.md
bsrsm_vs2017.sln	bsrsm_vs2017.sln
bsrsm_vs2017.vcxproj	bsrsm_vs2017.vcxproj
bsrsm_vs2017.vcxproj.filters	bsrsm_vs2017.vcxproj.filters
bsrsm_vs2019.sln	bsrsm_vs2019.sln
bsrsm_vs2019.vcxproj	bsrsm_vs2019.vcxproj
bsrsm_vs2019.vcxproj.filters	bsrsm_vs2019.vcxproj.filters
bsrsm_vs2022.sln	bsrsm_vs2022.sln
bsrsm_vs2022.vcxproj	bsrsm_vs2022.vcxproj
bsrsm_vs2022.vcxproj.filters	bsrsm_vs2022.vcxproj.filters
main.cpp	main.cpp

rocSPARSE Level 3 BSR Triangular Solver Example

Description

This example illustrates the use of the rocSPARSE level 3 triangular solver using the BSR storage format.

This triangular solver is used to solve a linear system of the form

$$ A' X' = \alpha B'. $$

where

$A$ is a sparse triangular matrix of order $n$ whose elements are the coefficients of the equations,
given a matrix $C$, $C'$ is one of the following:
- $C' = C$ (identity)
- $C' = C^T$ (transpose $C$: $C_{ij}^T = C_{ji}$)
- $C' = C^H$ (conjugate transpose/Hermitian $C$: $C_{ij}^H = \bar C_{ji}$),
$\alpha$ is a scalar,
$X$ is a dense matrix of size $n \times nrhs$ which contains the unknowns of the system, and
$B$ is a dense matrix of size $n \times nrhs$ containing the constant terms of the equations,
the performed operation on $B$ and $X$ must be the same.

Obtaining the solution for such a system consists of finding concrete values of all the unknowns such that the above equality holds.

This is the same as solving the classical system of linear equations $A' x_i = \alpha b_i$, where $x_i$ and $b_i$ are the $i$-th rows or columns of $X$ and $B$, depending on the operation performed on $X$ and $B$. This is showcased in level 2 example bsrsv.

Application flow

Setup input data.
Allocate device memory and copy input data to device.
Initialize rocSPARSE by creating a handle.
Prepare utility variables for rocSPARSE bsrsm invocation.
Perform analysis step.
Call dbsrsm to solve $A' X' = \alpha B$
Check results. If no zero-pivots, copy solution matrix $X$ from device to host and compare with expected result.
Free rocSPARSE resources and device memory.
Print validation result.

Key APIs and Concepts

BSR Matrix Storage Format

The Block Compressed Sparse Row (BSR) storage format describes a sparse matrix using three arrays. The idea behind this storage format is to split the given sparse matrix into equal sized blocks of dimension bsr_dim and store those using the CSR format. Because the CSR format only stores non-zero elements, the BSR format introduces the concept of non-zero block: a block that contains at least one non-zero element. Note that all elements of non-zero blocks are stored, even if some of them are equal to zero.

Therefore, defining

mb: number of rows of blocks
nb: number of columns of blocks
nnzb: number of non-zero blocks
bsr_dim: dimension of each block

we can describe a sparse matrix using the following arrays:

bsr_val: contains the elements of the non-zero blocks of the sparse matrix. The elements are stored block by block in column- or row-major order. That is, it is an array of size nnzb $\cdot$ bsr_dim $\cdot$ bsr_dim.
bsr_row_ptr: given $i \in [0, mb]$
- if $0 \leq i < mb$, bsr_row_ptr[i] stores the index of the first non-zero block in row $i$ of the block matrix
- if $i = mb$, bsr_row_ptr[i] stores nnzb.
This way, row $j \in [0, mb)$ contains the non-zero blocks of indices from bsr_row_ptr[j] to bsr_row_ptr[j+1]-1. The corresponding values in bsr_val can be accessed from bsr_row_ptr[j] * bsr_dim * bsr_dim to (bsr_row_ptr[j+1]-1) * bsr_dim * bsr_dim.
bsr_col_ind: given $i \in [0, nnzb-1]$, bsr_col_ind[i] stores the column of the $i^{th}$ non-zero block in the block matrix.

Note that, for a given $m\times n$ matrix, if the dimensions are not evenly divisible by the block dimension then zeros are padded to the matrix so that $mb$ and $nb$ are the smallest integers greater than or equal to $\frac{m}{\texttt{bsr\_dim}}$ and $\frac{n}{\texttt{bsr\_dim}}$, respectively.

For instance, consider a sparse matrix as

$$ A= \left( \begin{array}{cccc:cccc:cc} 8 & 0 & 2 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 3 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 7 & 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 2 & 5 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ \hline 4 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 5 \\ 7 & 7 & 7 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ \hline 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 9 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 5 & 6 & 4 & 0 & 0 & 0 & 0 \\ \end{array} \right) $$

Taking $\texttt{bsr\_dim} = 4$, we can represent $A$ as an $mb \times nb$ block matrix

$$ A= \begin{pmatrix} A_{00} & O & O \\ A_{10} & O & A_{12} \\ A_{20} & A_{21} & O \end{pmatrix} $$

with the following non-zero blocks:

$$ \begin{matrix} A_{00}= \begin{pmatrix} 8 & 0 & 2 & 0\\ 0 & 3 & 0 & 0\\ 7 & 0 & 1 & 0\\ 2 & 5 & 0 & 0 \end{pmatrix} & A_{10}= \begin{pmatrix} 4 & 0 & 0 & 0 \\ 7 & 7 & 7 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} \\ \\ A_{12}= \begin{pmatrix} 0 & 0 & 0 & 0 \\ 5 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} & A_{20}= \begin{pmatrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 5 \\ 0 & 0 & 0 & 0 \end{pmatrix} \\ \\ A_{21}= \begin{pmatrix} 0 & 0 & 0 & 0 \\ 9 & 0 & 0 & 0 \\ 6 & 4 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} & \end{matrix} $$

and the zero matrix:

$$ O = 0_4= \begin{pmatrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} $$

Therefore, the BSR representation of $A$, using column-major ordering, is:

bsr_val = { 8, 0, 7, 2, 0, 3, 0, 5, 2, 0, 1, 0, 0, 0, 0, 0   // A_{00}
            4, 7, 0, 0, 0, 7, 0, 0, 0, 7, 0, 0, 0, 0, 0, 0   // A_{10}
            0, 5, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0   // A_{12}
            0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 5, 0   // A_{20}
            0, 9, 6, 0, 0, 0, 4, 0, 0, 0, 0, 0, 0, 0, 0, 0 } // A_{21}

bsr_row_ptr = { 0, 1, 3, 4 }

bsr_col_ind = { 0, 0, 2, 0, 1 }

rocSPARSE

rocSPARSE is initialized by calling rocsparse_create_handle(rocsparse_handle*) and is terminated by calling rocsparse_destroy_handle(rocsparse_handle).
rocsparse_pointer_mode controls whether scalar parameters must be allocated on the host (rocsparse_pointer_mode_host) or on the device (rocsparse_pointer_mode_device). It is controlled by rocsparse_set_pointer_mode.
rocsparse_direction dir: matrix storage of BSR blocks. The following values are accepted:
- rocsparse_direction_row: parse blocks by rows.
- rocsparse_direction_column: parse blocks by columns.
rocsparse_operation trans: matrix operation applied to the given matrix. The following values are accepted:
- rocsparse_operation_none: identity operation $A' = A$.
- rocsparse_operation_transpose: transpose operation $A' = A^\mathrm{T}$.
- rocsparse_operation_conjugate_transpose: conjugate transpose operation (Hermitian matrix) $A' = A^\mathrm{H}$. This operation is not yet supported.
rocsparse_mat_descr descr: holds all properties of a matrix. The properties set in this example are the following:
- rocsparse_diag_type: indicates whether the diagonal entries of a matrix are unit elements (rocsparse_diag_type_unit) or not (rocsparse_diag_type_non_unit).
- rocsparse_fill_mode: indicates whether a (triangular) matrix is lower (rocsparse_fill_mode_lower) or upper (rocsparse_fill_mode_upper) triangular.
rocsparse_[sdcz]bsrsm_buffer_size allows to obtain the size (in bytes) of the temporary storage buffer required for the rocsparse_[sdcz]bsrsm_analysis and rocsparse_[sdcz]bsrsm_solve functions. The character matched in [sdcz] coincides with the one matched in any of the mentioned functions.
rocsparse_solve_policy policy: specifies the policy to follow for triangular solvers and factorizations. The only value accepted is rocsparse_solve_policy_auto.
rocsparse_[sdcz]bsrsm_solve solves a sparse triangular linear system $A X = \alpha B$. The correct function signature should be chosen based on the datatype of the input matrix:
- s single-precision real (float)
- d double-precision real (double)
- c single-precision complex (rocsparse_float_complex)
- z double-precision complex (rocsparse_double_complex)
rocsparse_analysis_policy analysis: specifies the policy to follow for analysis data. The following values are accepted:
- rocsparse_analysis_policy_reuse: the analysis data gathered is re-used.
- rocsparse_analysis_policy_force: the analysis data will be re-built.
rocsparse_[sdcz]bsrsm_analysis performs the analysis step for rocsparse_[sdcz]bsrsm_solve. The character matched in [sdcz] coincides with the one matched in rocsparse_[sdcz]bsrsm_solve.
rocsparse_bsrsm_zero_pivot(rocsparse_handle, rocsparse_mat_info, rocsparse_int *position) returns rocsparse_status_zero_pivot if either a structural or numerical zero has been found during the execution of rocsparse_[sdcz]bsrsm_solve(....) and stores in position the index $i$ of the first zero pivot $A_{ii}$ found. If no zero pivot is found it returns rocsparse_status_success.

Demonstrated API Calls

rocSPARSE

rocsparse_analysis_policy
rocsparse_analysis_policy_reuse
rocsparse_bsrsm_zero_pivot
rocsparse_create_handle
rocsparse_create_mat_descr
rocsparse_create_mat_info
rocsparse_dbsrsm_analysis
rocsparse_dbsrsm_buffer_size
rocsparse_dbsrsm_solve
rocsparse_destroy_handle
rocsparse_destroy_mat_descr
rocsparse_destroy_mat_info
rocsparse_diag_type_non_unit
rocsparse_direction
rocsparse_direction_column
rocsparse_fill_mode_lower
rocsparse_handle
rocsparse_int
rocsparse_mat_descr
rocsparse_mat_info
rocsparse_operation
rocsparse_operation_none
rocsparse_pointer_mode_host
rocsparse_set_mat_diag_type
rocsparse_set_mat_fill_mode
rocsparse_set_pointer_mode
rocsparse_solve_policy
rocsparse_solve_policy_auto
rocsparse_status
rocsparse_status_zero_pivot
rocsparse_status_success

HIP runtime

hipFree
hipMalloc
hipMemcpy
hipMemcpyDeviceToHost
hipMemcpyHostToDevice

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bsrsm

bsrsm

README.md

rocSPARSE Level 3 BSR Triangular Solver Example

Description

Application flow

Key APIs and Concepts

BSR Matrix Storage Format

rocSPARSE

Demonstrated API Calls

rocSPARSE

HIP runtime

Files

bsrsm

Directory actions

More options

Directory actions

More options

Latest commit

History

bsrsm

Folders and files

parent directory

README.md

rocSPARSE Level 3 BSR Triangular Solver Example

Description

Application flow

Key APIs and Concepts

BSR Matrix Storage Format

rocSPARSE

Demonstrated API Calls

rocSPARSE

HIP runtime