|
Revision tags: llvmorg-20.1.0, llvmorg-20.1.0-rc3, llvmorg-20.1.0-rc2, llvmorg-20.1.0-rc1, llvmorg-21-init, llvmorg-19.1.7, llvmorg-19.1.6, llvmorg-19.1.5, llvmorg-19.1.4, llvmorg-19.1.3, llvmorg-19.1.2, llvmorg-19.1.1, llvmorg-19.1.0, llvmorg-19.1.0-rc4, llvmorg-19.1.0-rc3, llvmorg-19.1.0-rc2, llvmorg-19.1.0-rc1, llvmorg-20-init, llvmorg-18.1.8, llvmorg-18.1.7, llvmorg-18.1.6, llvmorg-18.1.5, llvmorg-18.1.4, llvmorg-18.1.3, llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init, llvmorg-17.0.6, llvmorg-17.0.5, llvmorg-17.0.4, llvmorg-17.0.3, llvmorg-17.0.2, llvmorg-17.0.1, llvmorg-17.0.0, llvmorg-17.0.0-rc4, llvmorg-17.0.0-rc3, llvmorg-17.0.0-rc2, llvmorg-17.0.0-rc1, llvmorg-18-init, llvmorg-16.0.6, llvmorg-16.0.5, llvmorg-16.0.4, llvmorg-16.0.3, llvmorg-16.0.2, llvmorg-16.0.1, llvmorg-16.0.0, llvmorg-16.0.0-rc4, llvmorg-16.0.0-rc3, llvmorg-16.0.0-rc2, llvmorg-16.0.0-rc1, llvmorg-17-init, llvmorg-15.0.7, llvmorg-15.0.6, llvmorg-15.0.5, llvmorg-15.0.4, llvmorg-15.0.3, llvmorg-15.0.2, llvmorg-15.0.1, llvmorg-15.0.0, llvmorg-15.0.0-rc3, llvmorg-15.0.0-rc2, llvmorg-15.0.0-rc1, llvmorg-16-init, llvmorg-14.0.6, llvmorg-14.0.5, llvmorg-14.0.4, llvmorg-14.0.3, llvmorg-14.0.2, llvmorg-14.0.1, llvmorg-14.0.0, llvmorg-14.0.0-rc4, llvmorg-14.0.0-rc3, llvmorg-14.0.0-rc2, llvmorg-14.0.0-rc1, llvmorg-15-init, llvmorg-13.0.1, llvmorg-13.0.1-rc3, llvmorg-13.0.1-rc2, llvmorg-13.0.1-rc1, llvmorg-13.0.0, llvmorg-13.0.0-rc4 |
|
| #
ca57b80c |
| 20-Sep-2021 |
Mateja Marjanovic <[email protected]> |
Code quality: Combine V_RSQ
Combine V_RCP and V_SQRT into V_RSQ on AMDGPU for GlobalISel.
Change-Id: I93c5dcb412483156a6e8b68c4085cbce83ac9703
|
| #
db6bc2ab |
| 17-Nov-2021 |
Mirko Brkusanin <[email protected]> |
[AMDGPU][GlobalISel] Fold G_FNEG above when users cannot fold mods
If possible fold fneg into instruction above if users cannot fold mods and we know it will decrease instruction count. Follows same
[AMDGPU][GlobalISel] Fold G_FNEG above when users cannot fold mods
If possible fold fneg into instruction above if users cannot fold mods and we know it will decrease instruction count. Follows same logic as SDAG combiner in choosing opportunities to combine.
Differential Revision: https://reviews.llvm.org/D112827
show more ...
|
|
Revision tags: llvmorg-13.0.0-rc3, llvmorg-13.0.0-rc2, llvmorg-13.0.0-rc1, llvmorg-14-init, llvmorg-12.0.1, llvmorg-12.0.1-rc4, llvmorg-12.0.1-rc3, llvmorg-12.0.1-rc2, llvmorg-12.0.1-rc1 |
|
| #
fb7be0d9 |
| 27-Apr-2021 |
Petar Avramovic <[email protected]> |
AMDGPU/GlobalISel: Remove redundant G_FCANONICALIZE
Add basic version of isCanonicalized for global-isel. Copied from sdag. Add post legalizer combine that deletes G_FCANONICALIZE when its input is
AMDGPU/GlobalISel: Remove redundant G_FCANONICALIZE
Add basic version of isCanonicalized for global-isel. Copied from sdag. Add post legalizer combine that deletes G_FCANONICALIZE when its input is already Canonicalized.
Differential Revision: https://reviews.llvm.org/D96605
show more ...
|
|
Revision tags: llvmorg-12.0.0, llvmorg-12.0.0-rc5, llvmorg-12.0.0-rc4, llvmorg-12.0.0-rc3, llvmorg-12.0.0-rc2, llvmorg-11.1.0, llvmorg-11.1.0-rc3 |
|
| #
faeed774 |
| 01-Feb-2021 |
Thomas Symalla <[email protected]> |
Fixed includes.
Differential Revision: https://reviews.llvm.org/D93708
|
| #
09508d28 |
| 01-Feb-2021 |
Thomas Symalla <[email protected]> |
Reverted whitespace changes.
Differential Revision: https://reviews.llvm.org/D90968
|
|
Revision tags: llvmorg-12.0.0-rc1, llvmorg-13-init, llvmorg-11.1.0-rc2, llvmorg-11.1.0-rc1 |
|
| #
52bfb501 |
| 12-Jan-2021 |
Thomas Symalla <[email protected]> |
Formatting changes
|
| #
7d24026e |
| 12-Jan-2021 |
Thomas Symalla <[email protected]> |
Formatting changes.
|
| #
bcd6c2d2 |
| 12-Jan-2021 |
Thomas Symalla <[email protected]> |
Updating formatting changes.
|
| #
ecbed4e0 |
| 12-Jan-2021 |
Thomas Symalla <[email protected]> |
Resolve formatting changes.
|
| #
cdfd9b3b |
| 11-Jan-2021 |
Thomas Symalla <[email protected]> |
Move Combiner to PreLegalize step
|
| #
9a8da909 |
| 05-Jan-2021 |
Thomas Symalla <[email protected]> |
Reverted unintended git-format change.
|
| #
dae85e46 |
| 05-Jan-2021 |
Thomas Symalla <[email protected]> |
Fixed the lit tests and a bug in the implementation.
|
| #
88a832ae |
| 04-Jan-2021 |
Thomas Symalla <[email protected]> |
Refactored the pattern matching.
|
| #
fce3230b |
| 22-Dec-2020 |
Thomas Symalla <[email protected]> |
Added early exit.
|
| #
d722924f |
| 22-Dec-2020 |
Thomas Symalla <[email protected]> |
Added comments.
|
| #
ec043967 |
| 22-Dec-2020 |
Thomas Symalla <[email protected]> |
clang-format
|
| #
62af0305 |
| 22-Dec-2020 |
Thomas Symalla <[email protected]> |
Added clamp i64 to i16 global isel pattern.
|
| #
560d7e04 |
| 20-Jan-2021 |
dfukalov <[email protected]> |
[NFC][AMDGPU] Split AMDGPUSubtarget.h to R600 and GCN subtargets
... to reduce headers dependency.
Reviewed By: rampitec, arsenm
Differential Revision: https://reviews.llvm.org/D95036
|
| #
6a87e9b0 |
| 25-Dec-2020 |
dfukalov <[email protected]> |
[NFC][AMDGPU] Reduce include files dependency.
Reviewed By: rampitec
Differential Revision: https://reviews.llvm.org/D93813
|
|
Revision tags: llvmorg-11.0.1, llvmorg-11.0.1-rc2, llvmorg-11.0.1-rc1 |
|
| #
0031418d |
| 03-Nov-2020 |
Petar Avramovic <[email protected]> |
AMDGPU/GlobalISel: Use same builder/observer in post-legalizer-combiner
Change match/apply functions into methods of new target specific combiner helper class. Use reference to MachineIRBuilder from
AMDGPU/GlobalISel: Use same builder/observer in post-legalizer-combiner
Change match/apply functions into methods of new target specific combiner helper class. Use reference to MachineIRBuilder from helper instead of constructing new MachineIRBuilder each time new instruction needs to made. Allows correct tracking of newly created instructions.
Differential Revision: https://reviews.llvm.org/D90623
show more ...
|
|
Revision tags: llvmorg-11.0.0, llvmorg-11.0.0-rc6, llvmorg-11.0.0-rc5, llvmorg-11.0.0-rc4, llvmorg-11.0.0-rc3, llvmorg-11.0.0-rc2 |
|
| #
66ffa0e9 |
| 15-Aug-2020 |
Matt Arsenault <[email protected]> |
AMDGPU/GlobalISel: Fix using post-legal combiner without LegalizerInfo
|
|
Revision tags: llvmorg-11.0.0-rc1 |
|
| #
16bcd545 |
| 26-Jul-2020 |
Matt Arsenault <[email protected]> |
AMDGPU/GlobalISel: Mark GlobalISel classes as final
|
|
Revision tags: llvmorg-12-init, llvmorg-10.0.1, llvmorg-10.0.1-rc4, llvmorg-10.0.1-rc3, llvmorg-10.0.1-rc2 |
|
| #
db777eae |
| 12-Jun-2020 |
Matt Arsenault <[email protected]> |
AMDGPU/GlobalISel: Fix asserts on non-s32 sitofp/uitofp sources
The combine to form cvt_f32_ubyte0 was assuming the source type was always 32-bit, but this needs to tolerate any legal source type.
|
| #
e35ba099 |
| 16-Jun-2020 |
Daniel Sanders <[email protected]> |
[gicombiner] Allow generated combiners to store additional members
Summary: Adds the ability to add members to a generated combiner via a State base class. In the current AArch64PreLegalizerCombiner
[gicombiner] Allow generated combiners to store additional members
Summary: Adds the ability to add members to a generated combiner via a State base class. In the current AArch64PreLegalizerCombiner this is used to make Helper available without having to provide it to every call.
As part of this, split the command line processing into a separate object so that it still only runs once even though the generated combiner is constructed more frequently.
Depends on D81862
Reviewers: aditya_nandakumar, bogner, volkan, aemerson, paquette, arsenm
Reviewed By: arsenm
Subscribers: jvesely, wdng, nhaehnle, kristof.beyls, hiraditya, kerbowa, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D81863
show more ...
|
|
Revision tags: llvmorg-10.0.1-rc1 |
|
| #
0ba40d4c |
| 29-Mar-2020 |
Matt Arsenault <[email protected]> |
AMDGPU/GlobalISel: Combines for V_CVT_F32_UBYTE[0-3]
Ports the existing DAG combines, minus the simplify demanded bits which seems to have no equivalent now. Without these, this isn't particularly h
AMDGPU/GlobalISel: Combines for V_CVT_F32_UBYTE[0-3]
Ports the existing DAG combines, minus the simplify demanded bits which seems to have no equivalent now. Without these, this isn't particularly helpful in most of the IR sample cases.
show more ...
|