summaryrefslogtreecommitdiff
AgeCommit message (Collapse)AuthorFilesLines
5 daysradv/ci: update lists for TAHITI and Zink/Polaris10Samuel Pitoiset2-2/+9
1f4662cc4ed0c5b87479eb71e53a1320ab1b414b introduced regressions. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28795>
2024-03-19zink/ci: enable RADV_PERFTEST=shader_object for polaris10Samuel Pitoiset1-0/+1
It's passing in CI now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28273>
2024-03-08radv: allow RADV_PERFTEST=shader_object on GFX10/GFX10.3Samuel Pitoiset2-4/+3
VK_EXT_shader_object is now supported on all GPUs! It's still not enabled by default because it needs more testing and more fixing (Renoir still hangs for some reasons and Zink/RADV on Polaris10 still has flakes). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27880>
2024-02-16radv/ci: switch vkcts-polaris10 from mupuf to KWS' farmMartin Roukala (né Peres)1-1/+1
The DUT that currently hosts the Polaris10 will be used for NVK testing, so let's move the polaris10 job to another farm. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27652>
2024-02-05radv/uvd: uvd kernel checks for full dpb allocation.Dave Airlie1-0/+5
The CTS image allocation sometimes doesn't try to allocate a complete DPB, but the amdgpu kernel module checks for this, so always make the DPB max sized on uvd instances. Fixes part of video decode on Fiji/Polaris Cc: mesa-stable Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27186>
2024-01-30zink/ci: update polaris10's expectations after piglit uprevMartin Roukala (né Peres)1-0/+3
Fixes: 1c29869c206a ("Uprev Piglit to e9316bcd12544aaf7e753ce37fe50d64165d9598") Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27357>
2024-01-29radv/ci: enable RADV_PERFTEST=shader_object for vkcts-polaris10-valveSamuel Pitoiset1-0/+1
This now runs +571k tests for ESO in ~34 minutes compared to ~30 minutes without shader_object support. Definitely acceptable for a manual job. Note that the next VKCTS uprev will reduce the number of ESO tests by a lot because most of them weren't useful for coverage. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27139>
2024-01-16zink/ci: skip more arb_shader_image_load_store.* on Polaris10/Navi10Samuel Pitoiset2-0/+8
This subset seems to be the root cause of the recent GPU hangs that have been uncovered since a077c14f150 ("zink: Fix resizable BAR detection logic"). Skip it completely until the issue is correctly fixed. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27077>
2023-11-20radv: re-enable sparseResidencyImage3D on POLARIS10+Samuel Pitoiset2-3/+1
Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7214 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26260>
2023-11-20ac/surface: change tile mode for 3D PRT surfaces with bpp < 64 on GFX6-8Samuel Pitoiset1-3/+9
This is loosely based on PAL. This seems to fix 3D PRT support with RADV on Polaris10. THIN means the tile is a 2D slice. THICK means the tile is a 3D box. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26260>
2023-11-09radv+zink/ci: add polaris10 flakesEric Engestrom1-3/+1
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26135>
2023-11-09radv/ci: add polaris10 flakesEric Engestrom1-0/+1
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26135>
2023-10-30ci/zink+radv: add another flake on polarisEric Engestrom1-0/+3
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25959>
2023-10-26ac/gpu_info: remove bogus assertion about number of COMPUTE/SDMA queuesSamuel Pitoiset1-3/+0
For example, my polaris10 GPU now returns 3 compute queues. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25876>
2023-10-10zink: set ZINK_DEBUG=quiet for polaris jobsMike Blumenkrantz1-0/+1
modifiers aren't supported here, so this will otherwise spam infinitely Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25645>
2023-10-10zink/ci: remove 42 tests from the zink-radv-polaris10-fails listMartin Roukala (né Peres)1-42/+0
Not sure which MR fixed them, but I'll take these fixes! Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25629>
2023-10-04zink/ci: remove 19 tests from the zink-radv-polaris10-fails listMartin Roukala (né Peres)1-19/+0
It seems that most crashes in glgetteximage-related tests were fixed, likely by [1] \o/. [1]: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25460 Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25535>
2023-09-13zink/ci: update list of expected failures for POLARIS10/NAVI10Samuel Pitoiset2-10/+3
Fixed since piglit has been updated to f7db20b03de6896d013826c0a731bc4417c1a5a0. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25199>
2023-09-05zink: polaris ci updatesMike Blumenkrantz1-0/+8
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25056>
2023-09-04radv/ci: re-enable vkd3d-polaris10-valveSamuel Pitoiset3-7/+7
Like the vkcts job, this was disabled a while ago but it seems to be working well again. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25025>
2023-09-01ci/amd: split the polaris10 rules into one for each farmEric Engestrom2-4/+13
There is now one polaris10 in each farm, so we need two rules for which one to use. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24996>
2023-09-01radv/ci: re-enable vkcts-polaris10-valveSamuel Pitoiset4-8/+10
This was disabled a long time ago because of unknown GPU hangs during boot but it seems stable again for some reasons. This also bumps the job timeouts to make sure it will be able to finish. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24968>
2023-09-01zink/ci: add zink-radv-polaris10-valveSamuel Pitoiset4-0/+1047
This introduces Zink/RADV testing on Polaris10! Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24976>
2023-05-15amd/drm-shim: add polaris10Rhys Perry1-0/+190
Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22989>
2023-05-12amd/drm-shim: Add raphael&mendocino, polaris12 and gfx1100.Bas Nieuwenhuizen1-0/+442
Decided to follow the chip names pretty much.This set happens to be what is in my workstation currently. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22975>
2023-03-28radv/ci: update CI lists for Polaris10 and PitcairnSamuel Pitoiset2-444/+0
Recent ac/surface changes fixed those. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22127>
2023-03-06radv/ci: update CI lists for CTS 1.3.5.0 on GFX110/POLARIS10/PITCAIRNSamuel Pitoiset3-147/+590
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21689>
2023-03-03radeonsi/ci: update flakes and gfx8-polaris11 resultsMarek Olšák2-14/+14
Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21641>
2023-01-20radeonsi: report 0 block size for Polaris HEVC encodingRose Hudson1-2/+3
makes encoded videos resemble the input again :) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7992 Fixes: c4482a3c1a9 ("radeonsi/vcn: enable multi-slice encoding") Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20714>
2022-11-21aco: ensure MRT0 is written with dual source blendingRhys Perry1-0/+13
Fixes crucible test func.shader.dualsrc_mrt0_undef on polaris10. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: 22.3 mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19806>
2022-11-21radv: disable MRT compaction with dual-source blendingRhys Perry1-2/+3
Fixes crucible tests func.shader.dualsrc_mrt0_undef on navi21 and func.shader.dualsrc_mrt1_undef on polaris10. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: 22.3 mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19806>
2022-10-25radeonsi/tests: update expected results for polaris11Pierre-Eric Pelloux-Prayer1-19/+18
Using the latest released LLVM version 15.0.1. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18865>
2022-09-09radv: Switch to dynamic rendering onlyJason Ekstrand18-2034/+643
Also, update list of expected failures. dEQP-VK.image.sample_texture.*_bit_compressed_format_two_samplers_* now reliably pass on Polaris10 (GFX8) and Pitcairn (GFX6). Stoney has new failures but given there is already a lot of depth/stencil resolve failures, we shouldn't worry about them. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15587>
2022-09-07radv: only expose sparseResidencyImage3D on GFX9+Samuel Pitoiset1-1/+1
It's currently broken on Polaris10 and breaks running VKCTS entirely. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18457>
2022-09-01radv/ci: disable the POLARIS10 jobsMartin Roukala (né Peres)1-2/+2
A kernel bug makes the first-context's creation likely to hang the GPU, which most times than not end up failing the job despite multiple retries. Disable the jobs until the bug gets bisected or worked around effectively. Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18346>
2022-08-26aco: allow direct_fetch=true for vec4 VS input loadsRhys Perry1-1/+1
This seems to be a (mostly harmless) mistake from 369b8cffea2. fossil-db (navi21): Totals from 15 (0.01% of 135636) affected shaders: Instrs: 1992 -> 1999 (+0.35%) Latency: 13557 -> 13567 (+0.07%); split: -0.24%, +0.31% InvThroughput: 4059 -> 4065 (+0.15%); split: -0.20%, +0.34% Copies: 186 -> 193 (+3.76%) fossil-db (polaris10): Totals from 5 (0.00% of 135610) affected shaders: Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18225>
2022-08-22radv: rework linking shaders in NIRSamuel Pitoiset1-194/+293
Replace these ugly loops with something much cleaner and easier to maintain. There shouldn't be any functional changes. No fossils-db changes (tested on NAVI21, VEGA10 and POLARIS10). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18138>
2022-06-01radv/ci: allow polaris10 to retry more timesMartin Roukala (né Peres)1-0/+1
The Polaris10 in CI has been getting insta-hangs when starting dEQP. Let's give it 5 attempts to get its act together, as it won't affect the run time dramatically (max 5 minutes), but will provide more reliable results for developers. Tracking of hangs (and many other issues) is done through scrapping the execution logs, processing them to find these issues, then pushing the data to influxdb. This allows us to plot the failure rate over time, and see if the situation is getting better or worse. Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16793>
2022-04-25radeonsi/ci: add polaris11 baselineMarek Olšák3-0/+270
Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16112>
2022-04-18radv: fix initializing pipeline_key::topology for GFX9 and olderSamuel Pitoiset1-2/+2
This is used to determine the geometry shader info on GFX9, and it looks like it was broken for topologies that use adjacency. This is also used to remove PSIZ from shaders that don't need it. Found by inspection. fossils-db (Polaris10): Totals from 140 (0.10% of 135960) affected shaders: SGPRs: 10448 -> 9696 (-7.20%) VGPRs: 4376 -> 4264 (-2.56%) CodeSize: 164316 -> 161028 (-2.00%) Instrs: 26449 -> 25767 (-2.58%) Latency: 184448 -> 180468 (-2.16%) InvThroughput: 80772 -> 79092 (-2.08%) VClause: 337 -> 328 (-2.67%); split: -2.97%, +0.30% SClause: 859 -> 813 (-5.36%); split: -5.70%, +0.35% Copies: 1027 -> 790 (-23.08%) PreSGPRs: 2751 -> 2331 (-15.27%) PreVGPRs: 3887 -> 3836 (-1.31%) Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15948>
2022-04-13radv,ac/nir: lower global access to _amd global access intrinsicsRhys Perry4-0/+193
fossil-db (Sienna Cichlid): Totals from 400 (0.30% of 134621) affected shaders: VGPRs: 18696 -> 18688 (-0.04%) CodeSize: 2031348 -> 1946640 (-4.17%) Instrs: 374703 -> 360226 (-3.86%) Latency: 4200727 -> 4108628 (-2.19%); split: -2.20%, +0.01% InvThroughput: 1059935 -> 1029441 (-2.88%); split: -2.88%, +0.00% VClause: 5777 -> 5771 (-0.10%) SClause: 11890 -> 10891 (-8.40%); split: -8.57%, +0.17% Copies: 34035 -> 33259 (-2.28%); split: -2.98%, +0.70% Branches: 11108 -> 11100 (-0.07%); split: -0.08%, +0.01% PreSGPRs: 15999 -> 15942 (-0.36%); split: -0.44%, +0.08% PreVGPRs: 16994 -> 16970 (-0.14%) fossil-db (Polaris10): Totals from 400 (0.29% of 135668) affected shaders: SGPRs: 23799 -> 22919 (-3.70%); split: -4.30%, +0.61% VGPRs: 18480 -> 18472 (-0.04%) CodeSize: 2090316 -> 2041592 (-2.33%) Instrs: 395461 -> 385747 (-2.46%); split: -2.46%, +0.00% Latency: 5045768 -> 5020196 (-0.51%); split: -0.53%, +0.02% InvThroughput: 2694320 -> 2689886 (-0.16%); split: -0.23%, +0.07% VClause: 5982 -> 5968 (-0.23%) SClause: 12064 -> 10823 (-10.29%); split: -10.33%, +0.04% Copies: 48233 -> 48322 (+0.18%); split: -0.47%, +0.65% PreSGPRs: 16409 -> 16358 (-0.31%); split: -0.39%, +0.08% fossil-db (Pitcairn): Totals from 400 (0.29% of 135668) affected shaders: SGPRs: 22431 -> 22215 (-0.96%); split: -2.60%, +1.64% VGPRs: 18776 -> 18560 (-1.15%); split: -1.21%, +0.06% CodeSize: 2104440 -> 2017708 (-4.12%) MaxWaves: 2363 -> 2367 (+0.17%) Instrs: 413099 -> 397446 (-3.79%) Latency: 5507707 -> 5450251 (-1.04%); split: -1.12%, +0.07% InvThroughput: 2838867 -> 2786903 (-1.83%); split: -1.83%, +0.00% VClause: 10334 -> 10097 (-2.29%) SClause: 12346 -> 11005 (-10.86%); split: -10.89%, +0.02% Copies: 54034 -> 52065 (-3.64%); split: -3.99%, +0.35% PreSGPRs: 17916 -> 17857 (-0.33%); split: -0.40%, +0.07% PreVGPRs: 16917 -> 16893 (-0.14%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>
2022-04-13aco: don't expand smem/mubuf global loadsRhys Perry1-14/+51
For example, dwordx3->dwordx4 or ubyte3->dwordx2. Global loads don't have the bounds checking that buffer loads have that makes this safe. The alignment checks are added to global_load_callback() in case byte_align_loads=false, align=1 and bytes_needed=3. Without them, the callback will create a dword load. fossil-db (Sienna Cichlid): Totals from 267 (0.20% of 134621) affected shaders: CodeSize: 1603352 -> 1606568 (+0.20%) Instrs: 294946 -> 295482 (+0.18%); split: -0.00%, +0.18% Latency: 2997003 -> 2997052 (+0.00%); split: -0.02%, +0.02% InvThroughput: 526645 -> 526659 (+0.00%) SClause: 9179 -> 9185 (+0.07%); split: -0.02%, +0.09% Copies: 25363 -> 25375 (+0.05%); split: -0.08%, +0.13% Branches: 8298 -> 8299 (+0.01%) fossil-db (Polaris10): Totals from 267 (0.20% of 135668) affected shaders: CodeSize: 1636672 -> 1638756 (+0.13%); split: -0.00%, +0.13% Instrs: 308484 -> 308733 (+0.08%); split: -0.01%, +0.09% Latency: 3446045 -> 3446904 (+0.02%); split: -0.00%, +0.03% InvThroughput: 1206722 -> 1206828 (+0.01%); split: -0.00%, +0.01% SClause: 9308 -> 9311 (+0.03%); split: -0.08%, +0.11% Copies: 36933 -> 36921 (-0.03%); split: -0.08%, +0.05% fossil-db (Pitcairn): Totals from 275 (0.20% of 135668) affected shaders: SGPRs: 17616 -> 17520 (-0.54%); split: -0.64%, +0.09% VGPRs: 15428 -> 15540 (+0.73%); split: -0.23%, +0.96% CodeSize: 1885792 -> 1929120 (+2.30%); split: -0.00%, +2.30% MaxWaves: 1284 -> 1285 (+0.08%) Instrs: 368963 -> 376095 (+1.93%); split: -0.00%, +1.94% Latency: 5122922 -> 5168398 (+0.89%); split: -0.01%, +0.90% InvThroughput: 2562866 -> 2604279 (+1.62%) VClause: 9268 -> 9296 (+0.30%); split: -0.13%, +0.43% SClause: 10702 -> 10705 (+0.03%); split: -0.05%, +0.07% Copies: 48620 -> 50629 (+4.13%); split: -0.08%, +4.21% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>
2022-03-28radv,aco: lower color exports in NIRSamuel Pitoiset2-132/+172
fossils-db (Sienna Cichlid): Totals from 27108 (20.09% of 134913) affected shaders: VGPRs: 1260608 -> 1261424 (+0.06%); split: -0.00%, +0.07% CodeSize: 112795868 -> 112785892 (-0.01%); split: -0.05%, +0.04% MaxWaves: 628608 -> 628448 (-0.03%); split: +0.00%, -0.03% Instrs: 20750003 -> 20749314 (-0.00%); split: -0.01%, +0.00% Latency: 288088081 -> 288015865 (-0.03%); split: -0.06%, +0.04% InvThroughput: 53944847 -> 53961693 (+0.03%); split: -0.01%, +0.04% VClause: 396463 -> 396467 (+0.00%); split: -0.02%, +0.02% SClause: 842088 -> 842150 (+0.01%); split: -0.03%, +0.04% Copies: 1244982 -> 1259026 (+1.13%); split: -0.01%, +1.14% PreSGPRs: 1251949 -> 1251909 (-0.00%) PreVGPRs: 1099647 -> 1100879 (+0.11%); split: -0.03%, +0.14% fossils-db (Polaris10): Totals from 23928 (17.60% of 135960) affected shaders: SGPRs: 1751792 -> 1751024 (-0.04%); split: -0.05%, +0.01% VGPRs: 1098964 -> 1098556 (-0.04%); split: -0.13%, +0.09% CodeSize: 99893472 -> 99837940 (-0.06%); split: -0.06%, +0.00% MaxWaves: 138322 -> 138306 (-0.01%); split: +0.03%, -0.04% Instrs: 19213995 -> 19211980 (-0.01%); split: -0.02%, +0.01% Latency: 273026926 -> 273109402 (+0.03%); split: -0.01%, +0.04% InvThroughput: 111160907 -> 111195187 (+0.03%); split: -0.04%, +0.07% VClause: 343058 -> 343097 (+0.01%); split: -0.02%, +0.03% SClause: 802756 -> 802884 (+0.02%); split: -0.04%, +0.06% Copies: 1729387 -> 1739208 (+0.57%); split: -0.04%, +0.61% PreSGPRs: 1090264 -> 1090303 (+0.00%); split: -0.00%, +0.01% PreVGPRs: 959490 -> 960600 (+0.12%); split: -0.04%, +0.15% Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15263>
2022-03-11radv: optimize the number of loaded components for VS inputs in NIRSamuel Pitoiset1-4/+19
fossils-db (Sienna Cichlid): Totals from 3691 (2.74% of 134913) affected shaders: VGPRs: 121368 -> 121584 (+0.18%); split: -0.36%, +0.54% CodeSize: 7597912 -> 7561140 (-0.48%); split: -0.66%, +0.18% MaxWaves: 104706 -> 104772 (+0.06%) Instrs: 1441229 -> 1437652 (-0.25%); split: -0.53%, +0.28% Latency: 5500766 -> 5482101 (-0.34%); split: -0.45%, +0.11% InvThroughput: 804401 -> 797178 (-0.90%); split: -1.09%, +0.20% VClause: 25185 -> 25143 (-0.17%); split: -0.50%, +0.33% SClause: 27486 -> 27445 (-0.15%); split: -0.57%, +0.42% Copies: 143816 -> 147900 (+2.84%); split: -0.54%, +3.38% PreSGPRs: 109584 -> 110396 (+0.74%); split: -0.04%, +0.79% PreVGPRs: 95541 -> 94583 (-1.00%); split: -1.12%, +0.12% fossils-db (Polaris10): Totals from 1773 (1.30% of 135960) affected shaders: SGPRs: 80848 -> 80864 (+0.02%); split: -0.14%, +0.16% VGPRs: 56424 -> 55600 (-1.46%); split: -1.47%, +0.01% CodeSize: 1732588 -> 1696840 (-2.06%); split: -2.07%, +0.01% MaxWaves: 12103 -> 12106 (+0.02%) Instrs: 347684 -> 341597 (-1.75%); split: -1.76%, +0.01% Latency: 2542840 -> 2523946 (-0.74%); split: -0.95%, +0.21% InvThroughput: 924601 -> 905102 (-2.11%); split: -2.13%, +0.02% VClause: 9565 -> 9545 (-0.21%); split: -0.51%, +0.30% SClause: 10587 -> 10333 (-2.40%); split: -2.82%, +0.43% Copies: 19321 -> 20307 (+5.10%); split: -0.78%, +5.88% PreSGPRs: 30879 -> 30875 (-0.01%); split: -0.20%, +0.18% PreVGPRs: 41211 -> 41270 (+0.14%); split: -0.73%, +0.87% Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15317>
2022-03-08radv,aco,llvm: lower post shuffle vertex in NIRSamuel Pitoiset3-35/+34
fossils-db (Sienna Cichlid): Totals from 774 (0.57% of 134913) affected shaders: VGPRs: 26496 -> 26312 (-0.69%) CodeSize: 1825936 -> 1828812 (+0.16%); split: -0.04%, +0.20% MaxWaves: 22046 -> 22062 (+0.07%) Instrs: 347634 -> 347975 (+0.10%); split: -0.05%, +0.15% Latency: 1363949 -> 1356426 (-0.55%); split: -0.59%, +0.04% InvThroughput: 221529 -> 221380 (-0.07%); split: -0.10%, +0.04% VClause: 5682 -> 5676 (-0.11%); split: -1.46%, +1.36% SClause: 7485 -> 7411 (-0.99%); split: -1.48%, +0.49% Copies: 30481 -> 30420 (-0.20%); split: -0.51%, +0.31% PreVGPRs: 19717 -> 19656 (-0.31%) fossil-db (Polaris10): Totals from 896 (0.66% of 135960) affected shaders: SGPRs: 49824 -> 49648 (-0.35%); split: -0.39%, +0.03% VGPRs: 31040 -> 29948 (-3.52%); split: -3.62%, +0.10% CodeSize: 875960 -> 875920 (-0.00%); split: -0.06%, +0.05% MaxWaves: 6380 -> 6429 (+0.77%) Instrs: 171522 -> 171482 (-0.02%); split: -0.07%, +0.05% Latency: 1356082 -> 1334386 (-1.60%); split: -1.61%, +0.01% InvThroughput: 553389 -> 552957 (-0.08%); split: -0.08%, +0.00% VClause: 4317 -> 4244 (-1.69%); split: -2.41%, +0.72% SClause: 6157 -> 6139 (-0.29%); split: -0.45%, +0.16% Copies: 9340 -> 9235 (-1.12%); split: -1.24%, +0.12% PreVGPRs: 22366 -> 22116 (-1.12%) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15113>
2022-03-01radv: remove color exports in presence of holesSamuel Pitoiset2-5/+10
If there is holes, eg. if only MRT0 and MRT2 are exported, we have to set MRT1 to SPI_SHADER_32_R to avoid a GPU hang but the export can still be removed from the fragment shader. fossils-db (Sienna Cichlid): Totals from 565 (0.42% of 134913) affected shaders: VGPRs: 13328 -> 11456 (-14.05%) CodeSize: 613232 -> 548224 (-10.60%); split: -11.13%, +0.53% LDS: 284672 -> 296960 (+4.32%) MaxWaves: 17624 -> 17684 (+0.34%) Instrs: 113056 -> 100445 (-11.15%); split: -11.68%, +0.53% Latency: 684327 -> 639348 (-6.57%); split: -7.17%, +0.60% InvThroughput: 122877 -> 104382 (-15.05%); split: -15.18%, +0.13% VClause: 2601 -> 2323 (-10.69%); split: -10.77%, +0.08% SClause: 5629 -> 5443 (-3.30%); split: -3.91%, +0.60% Copies: 9393 -> 8720 (-7.16%); split: -8.22%, +1.05% PreSGPRs: 14623 -> 13666 (-6.54%); split: -6.76%, +0.22% PreVGPRs: 9847 -> 8503 (-13.65%) fossils-db (Polaris10): Totals from 565 (0.42% of 135960) affected shaders: SGPRs: 28064 -> 27104 (-3.42%) VGPRs: 12516 -> 10544 (-15.76%); split: -15.79%, +0.03% CodeSize: 516920 -> 456536 (-11.68%); split: -11.68%, +0.00% MaxWaves: 4369 -> 4418 (+1.12%) Instrs: 97771 -> 85903 (-12.14%); split: -12.14%, +0.00% Latency: 767482 -> 708545 (-7.68%); split: -7.97%, +0.29% InvThroughput: 280017 -> 235744 (-15.81%) VClause: 2270 -> 2090 (-7.93%); split: -8.50%, +0.57% SClause: 5185 -> 5012 (-3.34%); split: -3.86%, +0.52% Copies: 8328 -> 7555 (-9.28%); split: -9.35%, +0.07% Branches: 1143 -> 1113 (-2.62%) PreSGPRs: 13816 -> 12725 (-7.90%); split: -7.92%, +0.02% PreVGPRs: 9707 -> 8270 (-14.80%) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15108>
2022-02-22radv,aco,llvm: lower adjusting vertex alpha in NIRSamuel Pitoiset3-81/+80
Instead of duplicating the same lowering in both compiler backends. This pass will be used to do more VS input lowering. fossils-db (Polaris10): Totals from 48 (0.04% of 135960) affected shaders: VGPRs: 1692 -> 1684 (-0.47%) CodeSize: 54016 -> 53964 (-0.10%); split: -0.11%, +0.01% MaxWaves: 339 -> 341 (+0.59%) Instrs: 11260 -> 11247 (-0.12%); split: -0.13%, +0.02% Latency: 88165 -> 88113 (-0.06%); split: -0.07%, +0.01% InvThroughput: 36153 -> 36093 (-0.17%) Copies: 583 -> 568 (-2.57%) fossils-db (Pitcairn): Totals from 43 (0.03% of 135960) affected shaders: VGPRs: 1548 -> 1552 (+0.26%) CodeSize: 47900 -> 47820 (-0.17%) Instrs: 10751 -> 10731 (-0.19%) Latency: 83029 -> 82873 (-0.19%) VClause: 168 -> 164 (-2.38%) SClause: 393 -> 391 (-0.51%) Copies: 705 -> 685 (-2.84%) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15076>
2021-12-17nir/opt_access: infer CAN_REORDER for global accessRhys Perry1-10/+18
fossil-db (Sienna Cichlid): Totals from 352 (0.26% of 134621) affected shaders: VGPRs: 17240 -> 17272 (+0.19%) CodeSize: 1753640 -> 1755744 (+0.12%); split: -0.04%, +0.16% Instrs: 323190 -> 323801 (+0.19%); split: -0.03%, +0.22% Latency: 3241205 -> 3241293 (+0.00%); split: -0.10%, +0.10% InvThroughput: 568927 -> 568067 (-0.15%); split: -0.16%, +0.00% SClause: 12109 -> 10444 (-13.75%); split: -13.76%, +0.01% Copies: 27802 -> 27717 (-0.31%); split: -0.56%, +0.26% PreSGPRs: 14699 -> 14690 (-0.06%) PreVGPRs: 15793 -> 15799 (+0.04%) fossil-db (Polaris10): Totals from 348 (0.26% of 135668) affected shaders: SGPRs: 21446 -> 21574 (+0.60%); split: -0.15%, +0.75% VGPRs: 17004 -> 16996 (-0.05%); split: -0.09%, +0.05% CodeSize: 1782796 -> 1783060 (+0.01%); split: -0.03%, +0.05% Instrs: 337828 -> 337921 (+0.03%); split: -0.03%, +0.06% Latency: 3726328 -> 3726721 (+0.01%); split: -0.09%, +0.10% InvThroughput: 1307917 -> 1299841 (-0.62%); split: -0.62%, +0.00% VClause: 4327 -> 4337 (+0.23%); split: -0.09%, +0.32% SClause: 12178 -> 10529 (-13.54%); split: -13.55%, +0.01% Copies: 40227 -> 40244 (+0.04%); split: -0.19%, +0.24% PreSGPRs: 14946 -> 14937 (-0.06%) PreVGPRs: 15637 -> 15643 (+0.04%) fossil-db (Pitcairn): Totals from 351 (0.26% of 135668) affected shaders: SGPRs: 20382 -> 20619 (+1.16%); split: -0.79%, +1.95% CodeSize: 1789732 -> 1789836 (+0.01%); split: -0.04%, +0.04% MaxWaves: 1947 -> 1949 (+0.10%) Instrs: 352274 -> 352318 (+0.01%); split: -0.04%, +0.06% Latency: 4057829 -> 4058226 (+0.01%); split: -0.08%, +0.09% InvThroughput: 1332245 -> 1317578 (-1.10%); split: -1.11%, +0.01% VClause: 8581 -> 8583 (+0.02%); split: -0.13%, +0.15% SClause: 12187 -> 10552 (-13.42%); split: -13.43%, +0.02% Copies: 44906 -> 44915 (+0.02%); split: -0.24%, +0.26% PreSGPRs: 16571 -> 16562 (-0.05%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14227>
2021-12-14radv: have the null winsys set more fieldsRhys Perry1-0/+13
I copied stuff from ac_gpu_info.c until there were no Sienna Cichild or Polaris10 fossil-db changes between real hardware and RADV_FORCE_FAMILY. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14126>
2021-12-09aco: don't create unnecessary addition in indirect get_sampler_desc()Rhys Perry1-3/+4
I don't think this has any effect on GFX9+ because the addition is combined into the load. fossil-db (polaris10): Totals from 12595 (9.29% of 135627) affected shaders: SGPRs: 1054348 -> 1054860 (+0.05%); split: -0.02%, +0.07% VGPRs: 667240 -> 667320 (+0.01%); split: -0.01%, +0.02% CodeSize: 82761508 -> 82512816 (-0.30%); split: -0.30%, +0.00% MaxWaves: 62182 -> 62181 (-0.00%) Instrs: 16072934 -> 16010764 (-0.39%); split: -0.39%, +0.00% Latency: 582819635 -> 582287964 (-0.09%); split: -0.13%, +0.04% InvThroughput: 276460536 -> 276417613 (-0.02%); split: -0.06%, +0.05% VClause: 261656 -> 261654 (-0.00%); split: -0.01%, +0.01% SClause: 680952 -> 680854 (-0.01%); split: -0.05%, +0.04% Copies: 1727202 -> 1727742 (+0.03%); split: -0.12%, +0.15% Branches: 547050 -> 547033 (-0.00%); split: -0.01%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14043>