PLASMA
2.8.0
PLASMA - Parallel Linear Algebra for Scalable Multi-core Architectures
|
int PLASMA_sgebrd_Tile_Async | ( | PLASMA_enum | jobu, |
PLASMA_enum | jobvt, | ||
PLASMA_desc * | A, | ||
float * | S, | ||
float * | E, | ||
PLASMA_desc * | T, | ||
float * | U, | ||
int | LDU, | ||
float * | VT, | ||
int | LDVT, | ||
PLASMA_sequence * | sequence, | ||
PLASMA_request * | request | ||
) |
PLASMA_sgebrd_Tile_Async - reduces a general complex M-by-N matrix A to upper or lower bidiagonal form B using a two-stage approach First stage: reduction to band bidiagonal form (orthogonal matrices Q1 and P1); Second stage: reduction from band to bidiagonal form (orthogonal matrices Q2 and P2). Let Q = Q1 * Q2 be the global left unitary transformation; Let P = P1 * P2 be the global right unitary transformation; Q**T * A * P = B. If M >= N, B is upper bidiagonal; if M < N, B is lower bidiagonal. Note: T is incomplete and contains only the block reflectors of the first stage. Therefore, Q and P can not be built completely. Non-blocking equivalent of PLASMA_sgebrd_Tile(). May return before the computation is finished. Allows for pipelining of operations at runtime.
[in] | sequence | Identifies the sequence of function calls that this call belongs to (for completion checks and exception handling purposes). |
[out] | request | Identifies this function call (for exception handling purposes). |