AMDGPU.td - OpenGrok history log for /llvm-project-15.0.7/llvm/lib/Target/AMDGPU/AMDGPU.td

Revision (<<< Hide revision tags) (Show revision tags >>>)	Date	Author	Comments
Revision tags: llvmorg-20.1.0, llvmorg-20.1.0-rc3, llvmorg-20.1.0-rc2, llvmorg-20.1.0-rc1, llvmorg-21-init, llvmorg-19.1.7, llvmorg-19.1.6, llvmorg-19.1.5, llvmorg-19.1.4, llvmorg-19.1.3, llvmorg-19.1.2, llvmorg-19.1.1, llvmorg-19.1.0, llvmorg-19.1.0-rc4, llvmorg-19.1.0-rc3, llvmorg-19.1.0-rc2, llvmorg-19.1.0-rc1, llvmorg-20-init, llvmorg-18.1.8, llvmorg-18.1.7, llvmorg-18.1.6, llvmorg-18.1.5, llvmorg-18.1.4, llvmorg-18.1.3, llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init, llvmorg-17.0.6, llvmorg-17.0.5, llvmorg-17.0.4, llvmorg-17.0.3, llvmorg-17.0.2, llvmorg-17.0.1, llvmorg-17.0.0, llvmorg-17.0.0-rc4, llvmorg-17.0.0-rc3, llvmorg-17.0.0-rc2, llvmorg-17.0.0-rc1, llvmorg-18-init, llvmorg-16.0.6, llvmorg-16.0.5, llvmorg-16.0.4, llvmorg-16.0.3, llvmorg-16.0.2, llvmorg-16.0.1, llvmorg-16.0.0, llvmorg-16.0.0-rc4, llvmorg-16.0.0-rc3, llvmorg-16.0.0-rc2, llvmorg-16.0.0-rc1, llvmorg-17-init, llvmorg-15.0.7, llvmorg-15.0.6, llvmorg-15.0.5, llvmorg-15.0.4, llvmorg-15.0.3, llvmorg-15.0.2, llvmorg-15.0.1, llvmorg-15.0.0, llvmorg-15.0.0-rc3, llvmorg-15.0.0-rc2, llvmorg-15.0.0-rc1, llvmorg-16-init
# 5a5439cb	21-Jul-2022	Matt Arsenault <[email protected]>	AMDGPU: Refine user-sgpr-init16-bug It only applies to gfx1100 and gfx1102, and for wave32.
# 9fa5a6b7	15-Jul-2022	Stanislav Mekhanoshin <[email protected]>	[AMDGPU] Support for gfx940 fp8 conversions Differential Revision: https://reviews.llvm.org/D129902
Revision tags: llvmorg-14.0.6, llvmorg-14.0.5, llvmorg-14.0.4, llvmorg-14.0.3, llvmorg-14.0.2, llvmorg-14.0.1, llvmorg-14.0.0, llvmorg-14.0.0-rc4, llvmorg-14.0.0-rc3, llvmorg-14.0.0-rc2, llvmorg-14.0.0-rc1, llvmorg-15-init
# 0bdaef38	24-Jan-2022	Matt Arsenault <[email protected]>	AMDGPU: Add gfx11 feature to force initializing 16 input SGPRs The total user+system SGPR count needs to be padded out to 16 if fewer inputs are enabled.
# e243ead6	18-May-2022	Joe Nash <[email protected]>	Reland [AMDGPU] gfx11 vop3dpp instructions There was an issue with encoding wide (>64 bit) instructions on BigEndian hosts, which is fixed in D127195. Therefore reland this. gfx11 adds the ability Reland [AMDGPU] gfx11 vop3dpp instructions There was an issue with encoding wide (>64 bit) instructions on BigEndian hosts, which is fixed in D127195. Therefore reland this. gfx11 adds the ability to use dpp modifiers on vop3 instructions. This patch adds machine code layer support for that. The MCCodeEmitter is changed to use APInt instead of uint64_t to support these wider instructions. Patch 16/N for upstreaming of AMDGPU gfx11 architecture Differential Revision: https://reviews.llvm.org/D126483 show more ...
# eaed07eb	06-Jun-2022	Joe Nash <[email protected]>	Revert "[AMDGPU] gfx11 vop3dpp instructions" This reverts commit 99a83b1286748501e0ccf199a582dc3ec5451ef5.
# 99a83b12	18-May-2022	Joe Nash <[email protected]>	[AMDGPU] gfx11 vop3dpp instructions gfx11 adds the ability to use dpp modifiers on vop3 instructions. This patch adds machine code layer support for that. The MCCodeEmitter is changed to use APInt i [AMDGPU] gfx11 vop3dpp instructions gfx11 adds the ability to use dpp modifiers on vop3 instructions. This patch adds machine code layer support for that. The MCCodeEmitter is changed to use APInt instead of uint64_t to support these wider instructions. Patch 16/N for upstreaming of AMDGPU gfx11 architecture Depends on D126475 Reviewed By: rampitec, #amdgpu Differential Revision: https://reviews.llvm.org/D126483 show more ...
# 18ed279a	14-Apr-2022	Joe Nash <[email protected]>	[AMDGPU] gfx11 subtarget features & early tests Tablegen definitions for subtarget features and cpp predicate functions to access the features. New Sub-TargetProcessors and common latencies. Simple [AMDGPU] gfx11 subtarget features & early tests Tablegen definitions for subtarget features and cpp predicate functions to access the features. New Sub-TargetProcessors and common latencies. Simple changes to MIR codegen tests which pass on gfx11 because they have the same output as previous subtargets or operate on pseudo instructions which are reused from previous subtargets. Contributors: Jay Foad <[email protected]> Petar Avramovic <[email protected]> Patch 4/N for upstreaming of AMDGPU gfx11 architecture Depends on D124538 Reviewed By: Petar.Avramovic, foad Differential Revision: https://reviews.llvm.org/D125261 show more ...
# 7e71a039	14-Apr-2022	Joe Nash <[email protected]>	[AMDGPU] Split FeatureAtomicFaddInsts FeatureAtomicFaddInsts is replaced with three more granular features. Contributors: Petar Avramovic <[email protected]> Patch 3/N for upstreaming of AMDG [AMDGPU] Split FeatureAtomicFaddInsts FeatureAtomicFaddInsts is replaced with three more granular features. Contributors: Petar Avramovic <[email protected]> Patch 3/N for upstreaming of AMDGPU gfx11 architecture Depends on D124537 Reviewed By: foad, #amdgpu, arsenm Differential Revision: https://reviews.llvm.org/D124538 show more ...
# 7f97ac94	19-Apr-2022	Austin Kerbow <[email protected]>	Revert "[AMDGPU] Omit unnecessary waitcnt before barriers" This reverts commit 8d0c34fd4fb66ea0d19563154a59658e4b7f35d4.
# fb8d23b8	16-Mar-2022	Jay Foad <[email protected]>	[AMDGPU] Define new feature HasFlatScratchSVSMode. NFC. This is by analogy with HasFlatScratchSTMode and is slightly more informative than using isGFX940Plus. Differential Revision: https://reviews [AMDGPU] Define new feature HasFlatScratchSVSMode. NFC. This is by analogy with HasFlatScratchSTMode and is slightly more informative than using isGFX940Plus. Differential Revision: https://reviews.llvm.org/D121804 show more ...
# c7218164	09-Mar-2022	Jay Foad <[email protected]>	[AMDGPU] Remove HasAtomicFaddInstsGFX90X and HasAtomicFaddInstsGFX940 These compound predicates are not required, since we can use a combination of setting the SubtargetPredicate (to a subtarget pre [AMDGPU] Remove HasAtomicFaddInstsGFX90X and HasAtomicFaddInstsGFX940 These compound predicates are not required, since we can use a combination of setting the SubtargetPredicate (to a subtarget predicate like isGFX940Plus) and OtherPredicates (to a list of feature predicates like HasAtomicFaddInsts) instead. NFC. Differential Revision: https://reviews.llvm.org/D121289 show more ...
# 932f6281	03-Mar-2022	Stanislav Mekhanoshin <[email protected]>	[AMDGPU] new gfx940 fp atomics Differential Revision: https://reviews.llvm.org/D121028
# 2c830c8f	01-Mar-2022	Stanislav Mekhanoshin <[email protected]>	[AMDGPU] gfx940: support V_FMAMK_F32 and V_FMAAK_F32 Differential Revision: https://reviews.llvm.org/D120769
# 8d0c34fd	25-Feb-2022	Austin Kerbow <[email protected]>	[AMDGPU] Omit unnecessary waitcnt before barriers It is not necessary to wait for all outstanding memory operations before barriers on hardware that can back off of the barrier in the event of an ex [AMDGPU] Omit unnecessary waitcnt before barriers It is not necessary to wait for all outstanding memory operations before barriers on hardware that can back off of the barrier in the event of an exception when traps are enabled. Add a new subtarget feature which tracks which HW has this ability. Reviewed By: #amdgpu, rampitec Differential Revision: https://reviews.llvm.org/D120544 show more ...
# 35ec58d8	01-Mar-2022	Stanislav Mekhanoshin <[email protected]>	[AMDGPU] gfx940 removes all image instructions Differential Revision: https://reviews.llvm.org/D120763
# 2e2e64df	28-Feb-2022	Stanislav Mekhanoshin <[email protected]>	[AMDGPU] Add gfx940 target This is target definition only. Differential Revision: https://reviews.llvm.org/D120688
# a5d4f82b	11-Feb-2022	Sebastian Neubauer <[email protected]>	[AMDGPU] Make enable-flat-scratch a subtarget feature Use a subtarget feature instead of a command line argument to reduce global state. We want to enable flat scratch for graphics in some cases and [AMDGPU] Make enable-flat-scratch a subtarget feature Use a subtarget feature instead of a command line argument to reduce global state. We want to enable flat scratch for graphics in some cases and this doesn't work well with command line options. Differential Revision: https://reviews.llvm.org/D119425 show more ...
# 4e077c0a	25-Jan-2022	Stanislav Mekhanoshin <[email protected]>	[AMDGPU] Remove feature register-banking Since RegBankReassign pass was removed this feature is not use for anything. Differential Revision: https://reviews.llvm.org/D118195
Revision tags: llvmorg-13.0.1, llvmorg-13.0.1-rc3, llvmorg-13.0.1-rc2, llvmorg-13.0.1-rc1
# d1f45ed5	11-Nov-2021	Neubauer, Sebastian <[email protected]>	[AMDGPU][NFC] Fix typos Differential Revision: https://reviews.llvm.org/D113672
# a0633f5c	08-Nov-2021	skc7 <[email protected]>	[AMDGPU] Test Commit. NFC Reviewed By: hsmhsm Differential Revision: https://reviews.llvm.org/D113379
Revision tags: llvmorg-13.0.0, llvmorg-13.0.0-rc4, llvmorg-13.0.0-rc3
# c9683926	31-Aug-2021	Joe Nash <[email protected]>	[AMDGPU] Enable ds_min/ds_max on more subtargets Adds patterns for f64 ds_min/ds_max. Shrinks HasLDSFPAtomics scope to enable f32. Reviewed By: rampitec Differential Revision: https://reviews.llvm [AMDGPU] Enable ds_min/ds_max on more subtargets Adds patterns for f64 ds_min/ds_max. Shrinks HasLDSFPAtomics scope to enable f32. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D108994 Change-Id: Id890b677841ee588b20d42b1bb3f4cdbf6e9ba1a show more ...
Revision tags: llvmorg-13.0.0-rc2, llvmorg-13.0.0-rc1, llvmorg-14-init
# 7d4baf25	22-Jul-2021	Carl Ritson <[email protected]>	[AMDGPU] Add maximum NSA size limit ISA feature Add maximum NSA size limit as an ISA feature. Use this to reduce NSA usage on GFX10.1 to avoid stability issues with 4 and 5 dwords NSA instructions. [AMDGPU] Add maximum NSA size limit ISA feature Add maximum NSA size limit as an ISA feature. Use this to reduce NSA usage on GFX10.1 to avoid stability issues with 4 and 5 dwords NSA instructions. Maintain use of longer NSA instructions on GFX10.3. Note: this also contains some minor fixes for GlobalISel which did not work correctly with non-NSA form instructions on GFX10. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D103348 show more ...
Revision tags: llvmorg-12.0.1, llvmorg-12.0.1-rc4, llvmorg-12.0.1-rc3
# c3cc9d1e	25-Jun-2021	Jay Foad <[email protected]>	[AMDGPU] Removed unused Predicate HasOffset3fBug. NFC. The predicate definition didn't make sense anyway because it was defined as being the opposite of what the name suggests.
Revision tags: llvmorg-12.0.1-rc2
# 294efbbd	08-Jun-2021	Brendon Cahoon <[email protected]>	Reland "[AMDGPU] Add gfx1013 target" This reverts commit 211e584fa2a4c032e4d573e7cdbffd622aad0a8f. Fixed a use-after-free error that caused the sanitizers to fail.
# 211e584f	08-Jun-2021	Brendon Cahoon <[email protected]>	Revert "[AMDGPU] Add gfx1013 target" This reverts commit ea10a86984ea73fcec3b12d22404a15f2f59b219. A sanitizer buildbot reports an error.
12 3 4 5 6 7 8 9