cuda_get

IMSL C Stat Library

cuda_get

Get the threshold used by the specified function to determine if the NVIDIA CUDA Toolkit algorithm will be used.

Synopsis

#include <imsls.h>

int imsls_cuda_get (Imsls_cuda cuda_name)

Required Arguments

Imsls_cuda cuda_name (Input)
An enumerator which specifies the IMSL function for which the threshold will be retrieved. cuda_name must be one of the values defined in Table 16.

Return Value

Returns the threshold value used to determine when the NVIDIA CUDA Toolkit version of the function specified by cuda_name will be used. A return value of zero indicates that the IMSL version of the specified function will always be used. A return value greater than zero is the threshold value being used by the function specified by cuda_name. If the problem size is greater than or equal to threshold, the NVIDIA Toolkit algorithm will be used.

See Programming Notes for Using NVIDIA® CUDA™ Toolkit for more information on NVIDIA’s CUDA Toolkit integration into IMSL C Numerical Library.

Synopsis with Optional Arguments

#include <imsls.h>

int imsls_cuda_get (Imsls_cuda cuda_name,

IMSLS_GET_DEVICE, int *idevice,

0)

Optional Arguments

IMSLS_GET_DEVICE, int *idevice (Output)
Returns a value specifying whether the NVIDIA CUDA Toolkit algorithm was used for the last call to the IMSL function specified by cuda_name. Returns 1 if the NIVIDIA Toolkit algorithm was used and 0 if the IMSL version of the algorithm was used. A value of ‑1 indicates that the IMSL function specified by cuda_name has not been used.

Description

This function returns the threshold value for a specified function. It can optionally be used to return information on the last invocation of the specified function.

Example

Function imsls_f_hypothesis_partial uses Basic Linear Algebra Subprogram SGEMM to improve performance. In this example the threshold value of the function SGEMM is manipulated to force the use of both the IMSL and NVIDIA CUDA Toolkit algorithms.

#include <imsls.h>

#include <stdio.h>

int main() {

int nreg, nobs = 16, n_independent=2, n_dependent=1,

n_coefficients=12, nh, nhp=6, idevice, iswitch;

float *coefficients, *h, z[16*11];

Imsls_f_regression *info;

float hp[72] = {0, 0, 0, 0, 0, 0,

1, 0, 0, 0, 0, 0,

0, 0, 0, 0, 0, 0,

0, 1, 0, 0, 0, 0,

0, 0, 0, 0, 0, 0,

0, 0, 1, 0, 0, 0,

0, 0, 0, 0, 0, 0,

0, 0, 0, 1, 0, 0,

0, 0, 0, 0, 0, 0,

0, 0, 0, 0, 1, 0,

0, 0, 0, 0, 0, 0,

0, 0, 0, 0, 0, 1};

float x[16][2] = {1, 1,

1, 1,

1, 1,

1, 2,

1, 2,

1, 3,

1, 3,

1, 3,

2, 1,

2, 1,

2, 2,

2, 2,

2, 2,

2, 3,

2, 3,

2, 3};

float y[16] = {19, 20, 21, 24, 26, 22, 25, 25,

25, 27, 21, 24, 24, 31, 32, 33};

int nvef[3] = {1, 1, 2};

int indef[4] = {0, 1, 0, 1};

/* Turn off printing of warning errors */

imsls_error_options(

IMSLS_SET_PRINT,

IMSLS_WARNING, 0,

0);

nreg = imsls_f_regressors_for_glm(nobs, &x[0][0], 2, 0,

IMSLS_INDICES_EFFECTS, 3, nvef, indef,

IMSLS_REGRESSORS_USER, z,

0);

coefficients = imsls_f_regression(nobs, nreg, z, y,

IMSLS_N_DEPENDENT, n_dependent,

IMSLS_REGRESSION_INFO, &info,

0);

/* Turn printing of warning errors back on */

imsls_error_options(

IMSLS_SET_PRINT,

IMSLS_WARNING, 1,

0);

/* Get the current threshold value for sgemm */

iswitch = imsls_cuda_get(

IMSLS_CUDA_SGEMM,

0);

/* Set the threshold value of sgemm to 0

so that use of the IMSL version is ensured */

imsls_cuda_set(

IMSLS_CUDA_SGEMM, 0,

0);

/* Call routine which uses SGEMM */

nh = imsls_f_hypothesis_partial(info, nhp, hp,

IMSLS_H_MATRIX, &h,

0);

/* Check to see what version of sgemm was used */

imsls_cuda_get(

IMSLS_CUDA_SGEMM,

IMSLS_GET_DEVICE, &idevice,

0);

if (!idevice)

printf("The IMSL version of SGEMM was used.\n");

else

printf("Error: The CUDA version of SGEMM was used.\n");

/* Set the threshold value to be very small to ensure the

CUDA version of SGEMM will be used. */

imsls_cuda_set(

IMSLS_CUDA_SGEMM, 2,

0);

/* Call routine which uses SGEMM */

nh = imsls_f_hypothesis_partial(info, nhp, hp,

IMSLS_H_MATRIX, &h,

0);

/* Check to see what version of sgemm was used */

imsls_cuda_get(

IMSLS_CUDA_SGEMM,

IMSLS_GET_DEVICE, &idevice,

0);

if (!idevice)

printf("Error: The IMSL version of SGEMM was used.\n");

else

printf("The CUDA version of SGEMM was used.\n");

/* Set the threshold value to the original setting */

imsls_cuda_set(

IMSLS_CUDA_SGEMM, iswitch,

0);

/* Release GPU memory */

imsls_cuda_free();

}

Output

The IMSL version of SGEMM was used.

The CUDA version of SGEMM was used.

Warning Errors

IMSLS_CUDA_ENUM_NAME	The argument specified for “cuda_name” = # is not valid.
IMSLS_CUDA_NOT_IMPLEMENTED	The specified function name does not have a CUDA implementation.
IMSLS_FCN_NOT_USED	The specified function name has not yet been used.
IMSLS_CUDA_NOT_AVAIL	The CUDA Toolkit algorithms are not implemented using this version of the library. Use the CUDA link environment variables to leverage the CUDA Toolkit algorithms.