5.7. cudaFuncAttributes Struct Reference
[Data types used by CUDA Runtime]
CUDA function attributes
Public Variables
- int binaryVersion
- int cacheModeCA
- size_t constSizeBytes
- size_t localSizeBytes
- int maxDynamicSharedSizeBytes
- int maxThreadsPerBlock
- int numRegs
- int preferredShmemCarveout
- int ptxVersion
- size_t sharedSizeBytes
Variables
- int cudaFuncAttributes::binaryVersion [inherited]
-
The binary architecture version for which the function was compiled. This value is the major binary version * 10 + the minor binary version, so a binary version 1.3 function would return the value 13.
- int cudaFuncAttributes::cacheModeCA [inherited]
-
The attribute to indicate whether the function has been compiled with user specified option "-Xptxas --dlcm=ca" set.
- size_t cudaFuncAttributes::constSizeBytes [inherited]
-
The size in bytes of user-allocated constant memory required by this function.
- size_t cudaFuncAttributes::localSizeBytes [inherited]
-
The size in bytes of local memory used by each thread of this function.
- int cudaFuncAttributes::maxDynamicSharedSizeBytes [inherited]
-
The maximum size in bytes of dynamic shared memory per block for this function. Any launch must have a dynamic shared memory size smaller than this value.
- int cudaFuncAttributes::maxThreadsPerBlock [inherited]
-
The maximum number of threads per block, beyond which a launch of the function would fail. This number depends on both the function and the device on which the function is currently loaded.
- int cudaFuncAttributes::numRegs [inherited]
-
The number of registers used by each thread of this function.
- int cudaFuncAttributes::preferredShmemCarveout [inherited]
-
On devices where the L1 cache and shared memory use the same hardware resources, this sets the shared memory carveout preference, in percent of the maximum shared memory. This is only a hint, and the driver can choose a different ratio if required to execute the function.
- int cudaFuncAttributes::ptxVersion [inherited]
-
The PTX virtual architecture version for which the function was compiled. This value is the major PTX version * 10 + the minor PTX version, so a PTX version 1.3 function would return the value 13.
- size_t cudaFuncAttributes::sharedSizeBytes [inherited]
-
The size in bytes of statically-allocated shared memory per block required by this function. This does not include dynamically-allocated shared memory requested by the user at runtime.