| /llvm-project-15.0.7/clang/tools/amdgpu-arch/ |
| H A D | AMDGPUArch.cpp | 45 std::vector<std::string> *GPUs = in iterateAgentsCallback() local 52 GPUs->push_back(GPUName); in iterateAgentsCallback() 62 std::vector<std::string> GPUs; in main() local 63 Status = hsa_iterate_agents(iterateAgentsCallback, &GPUs); in main() 68 for (const auto &GPU : GPUs) in main() 71 if (GPUs.size() < 1) in main()
|
| /llvm-project-15.0.7/clang/test/Driver/ |
| H A D | hip-output-file-name.hip | 25 // Output bundled PPE for two GPUs for mixed compilation. 30 // Output bundled PPE for two GPUs for mixed compilation with -save-temps. 35 // Output unbundled PPE for two GPUs for device only compilation. 40 // Output bundled PPE for two GPUs for device only compilation with --gpu-bundle-output. 45 // Output unbundled PPE for two GPUs for device only compilation with --no-gpu-bundle-output. 62 // Output bundled PPE for two GPUs for mixed compilation. 67 // Output bundled PPE for two GPUs for mixed compilation. 72 // Output bundled PPE for two GPUs for device only compilation with --gpu-bundle-output. 77 // Output unbundled PPE for two GPUs for device only compilation.
|
| H A D | amdgpu-mcpu.cl | 1 // Check that -mcpu works for all supported GPUs.
|
| /llvm-project-15.0.7/llvm/lib/Target/AMDGPU/ |
| H A D | R700Instructions.td | 10 // - Available to R700 and newer VLIW4/VLIW5 GPUs 11 // - Available only on R700 family GPUs.
|
| H A D | AMDGPUCallingConv.td | 1 //===---- AMDCallingConv.td - Calling Conventions for Radeon GPUs ---------===// 9 // This describes the calling conventions for the AMD Radeon GPUs.
|
| H A D | CaymanInstructions.td | 10 // family GPUs.
|
| H A D | EvergreenInstructions.td | 10 // - Available to Evergreen and newer VLIW4/VLIW5 GPUs 11 // - Available only on Evergreen family GPUs.
|
| /llvm-project-15.0.7/llvm/test/CodeGen/AMDGPU/ |
| H A D | r600-encoding.ll | 4 ; The earliest R600 GPUs have a slightly different encoding than the rest of 5 ; the VLIW4/5 GPUs.
|
| H A D | local-memory.r600.ll | 47 ; this consistently on evergreen GPUs.
|
| H A D | mad_int24.ll | 7 ; Signed 24-bit multiply is not supported on pre-Cayman GPUs.
|
| H A D | 32-bit-local-address-space.ll | 4 ; On Southern Islands GPUs the local address space(3) uses 32-bit pointers and
|
| H A D | mul_int24.ll | 8 ; Signed 24-bit multiply is not supported on pre-Cayman GPUs.
|
| /llvm-project-15.0.7/mlir/include/mlir/Dialect/AMDGPU/ |
| H A D | AMDGPU.td | 52 available on AMD GPUs, including extensions in newer GPUs. 105 intrinsics available on AMD GPUs, including extensions in newer GPUs. 144 of AMD GPUs.
|
| /llvm-project-15.0.7/llvm/docs/ |
| H A D | AMDGPUOperandSyntax.rst | 430 .. WARNING:: GFX7 does not support *xnack* feature. For availability of this feature in other GPUs,… 636 0.0 The same as integer constant 0. All GPUs 637 0.5 Floating-point constant 0.5 All GPUs 638 1.0 Floating-point constant 1.0 All GPUs 639 2.0 Floating-point constant 2.0 All GPUs 640 4.0 Floating-point constant 4.0 All GPUs 641 -0.5 Floating-point constant -0.5 All GPUs 642 -1.0 Floating-point constant -1.0 All GPUs 643 -2.0 Floating-point constant -2.0 All GPUs 644 -4.0 Floating-point constant -4.0 All GPUs
|
| H A D | CompileCudaWithLLVM.rst | 89 forwards-compatible with e.g. ``sm_35`` GPUs. 100 If you're using GPUs, you probably care about making numerical code run fast. 502 Modern CPUs and GPUs are architecturally quite different, so code that's fast 527 64-bit integer divides are much slower than 32-bit ones on NVIDIA GPUs. 533 function inlining need to be more aggressive for GPUs than for CPUs because
|
| /llvm-project-15.0.7/openmp/docs/remarks/ |
| H A D | OMP100.rst | 12 To facilitate OpenMP semantics on GPUs we provide a runtime mechanism through
|
| /llvm-project-15.0.7/mlir/docs/Dialects/ |
| H A D | GPU.md | 10 for GPUs). Its goal is to abstract away device- and driver-specific
|
| /llvm-project-15.0.7/clang/include/clang/Config/ |
| H A D | config.h.cmake | 50 /* Default architecture for OpenMP offloading to Nvidia GPUs. */
|
| /llvm-project-15.0.7/mlir/test/Integration/GPU/CUDA/TensorCore/ |
| H A D | wmma-matmul-f16.mlir | 10 // Test case to check the working of Tensor cores on Nvidia GPUs. The kernel has already
|
| /llvm-project-15.0.7/openmp/docs/ |
| H A D | SupportAndFAQ.rst | 80 available GPUs failed, you should also set: 216 Q: Does Clang support `<math.h>` and `<complex.h>` operations in OpenMP target on GPUs? 220 target regions that are compiled for GPUs.
|
| /llvm-project-15.0.7/openmp/docs/optimizations/ |
| H A D | OpenMPOpt.rst | 108 - 2019 EuroLLVM Developers’ Meeting: "Compiler Optimizations for (OpenMP) Target Offloading to GPUs…
|
| /llvm-project-15.0.7/clang/docs/ |
| H A D | HLSLSupport.rst | 128 parallel nature of GPUs in a hardware abstracted language. 132 example, register spilling is often excessively expensive on GPUs, so HLSL
|
| /llvm-project-15.0.7/openmp/libomptarget/cmake/Modules/ |
| H A D | LibomptargetGetDependencies.cmake | 17 # CUDA : required to control offloading to NVIDIA GPUs.
|
| /llvm-project-15.0.7/mlir/include/mlir/Dialect/Tosa/IR/ |
| H A D | TosaOpBase.td | 37 be deployed on a wide range of different processors, such as CPUs or GPUs,
|
| /llvm-project-15.0.7/clang/ |
| H A D | CMakeLists.txt | 334 "Default architecture for OpenMP offloading to Nvidia GPUs." FORCE) 335 message(WARNING "Resetting default architecture for OpenMP offloading to Nvidia GPUs to sm_35") 338 "Default architecture for OpenMP offloading to Nvidia GPUs.")
|