| Mesa 24.1.0 Release Notes / 2024-05-22 |
| ====================================== |
| |
| Mesa 24.1.0 is a new development release. People who are concerned |
| with stability and reliability should stick with a previous release or |
| wait for Mesa 24.1.1. |
| |
| Mesa 24.1.0 implements the OpenGL 4.6 API, but the version reported by |
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / |
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. |
| Some drivers don't support all the features required in OpenGL 4.6. OpenGL |
| 4.6 is **only** available if requested at context creation. |
| Compatibility contexts may report a lower version depending on each driver. |
| |
| Mesa 24.1.0 implements the Vulkan 1.3 API, but the version reported by |
| the apiVersion property of the VkPhysicalDeviceProperties struct |
| depends on the particular driver being used. |
| |
| SHA256 checksum |
| --------------- |
| |
| :: |
| |
| b7eac8c79244806b1c276eeeacc329e4a5b31a370804c4b0c7cd16837783f78b mesa-24.1.0.tar.xz |
| |
| |
| New features |
| ------------ |
| |
| - VK_EXT_map_memory_placed on RADV, ANV and NVK |
| - VK_KHR_shader_subgroup_rotate on RADV and ANV and NVK |
| - VK_KHR_load_store_op_none on RADV, ANV, NVK and Turnip |
| - VK_KHR_line_rasterization on RADV, ANV, NVK and Turnip |
| - VK_KHR_index_type_uint8 on RADV, ANV, NVK and Turnip |
| - VK_KHR_shader_expect_assume on all Vulkan drivers |
| - VK_KHR_shader_maximal_reconvergence on RADV, ANV and NVK |
| - VK_KHR_shader_quad_control on RADV |
| - OpenGL 4.6 on Asahi |
| - OpenGL ES 3.2 on Asahi |
| - Mali G610 and G310 on Panfrost |
| - Mali T600 on Panfrost |
| - VK_KHR_shader_subgroup_uniform_control_flow on NVK |
| - alphaToOne/extendedDynamicState3AlphaToOneEnable on RADV |
| - VK_EXT_device_address_binding_report on RADV |
| - VK_EXT_external_memory_dma_buf for lavapipe |
| - VK_EXT_queue_family_foreign for lavapipe |
| - VK_EXT_shader_object on RADV |
| - VK_EXT_nested_command_buffer on NVK and RADV |
| - VK_EXT_queue_family_foreign on NVK |
| - VK_EXT_image_drm_format_modifier on NVK |
| |
| |
| Bug fixes |
| --------- |
| |
| - anv: unbounded shader cache |
| - radv: Crash due to nir validation fail in Enshrouded |
| - bisected: turnip: deqp regressions |
| - android: sRGB configs no longer exist after !27709 |
| - [24.1-rc4] fatal error: intel/dev/intel_wa.h: No such file or directory |
| - vcn: rewinding attached video in Totem cause [mmhub] page fault |
| - When using amd gpu deinterlace, tv bt709 properties mapping to 2 chroma |
| - ci: switch from CI_JOB_JWT to id_tokens |
| - VCN decoding freezes the whole system |
| - [RDNA2 [AV1] [VAAPI] hw decoding glitches in Thorium 123.0.6312.133 after https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28960 |
| - nvk: Tracker issue for gamescope support |
| - nvk: Implement VK_EXT_image_drm_format_modifier |
| - WSI: Support VK_IMAGE_ASPECT_MEMORY_PLANE_i_BIT_EXT for DRM Modifiers in Vulkan |
| - [Regression][Bisected] EGL/Wayland: QT applications terminated by SIGSEGV (Address boundary error) when using dGPU |
| - radv: Enshrouded GPU hang on RX 6800 |
| - NVK Zink: Wrong color in Unigine Valley benchmark |
| - panforst: T604 issue with using u32 for flat varyings |
| - [anv] FINISHME: support YUV colorspace with DRM format modifiers |
| - gen9/11 test became flaky: piglit.spec.!opengl 1_4.blendminmax |
| - 24.0.6: build fails |
| - mesa 24 intel A770 KOTOR black shadow smoke scenes |
| - [bisected][regression] kitty fails to start due to \`glfwWindowHint(GLFW_SRGB_CAPABLE,true)` |
| - r600: bisected 5eb0136a3c561 breaks a number of piglits |
| - Graphical glitches in RPCS3 after updating Vulkan Intel drivers |
| - [R600] OpenGL and VDPAU regression in Mesa 23.3.0 - some bitmaps get distorted. |
| - VAAPI radeonsi: VBAQ broken with HEVC |
| - radv/video: 10-bit support |
| - radv: vkCmdWaitEvents2 is broken |
| - Zink: enabled extensions and features may not match |
| - glRasterPos: "Assertion \`prog->base_serialized_nir' failed." if a shader is loaded from the shader cache |
| - radv: mesa-9999/src/amd/vulkan/radv_image_view.c:147: radv_set_mutable_tex_desc_fields: Assertion \`(plane->surface.u.gfx9.surf_pitch * plane->surface.bpe) % 256 == 0' failed. |
| - ACO doesn't hide lds_param_load latencies |
| - ACO doesn't form a VMEM clause for image stores in one case on GFX11 |
| - r600: Valheim hangs CAYMAN gpu (regression/bisected) |
| - r600: Artifacts in Oxygen Not Included around air ducts and pipes (regression, bisected) |
| - radv: UMR wave parsing format is outdated |
| - radv: GetImageMemoryRequirements2 does not look at VkImagePlaneMemoryRequirementsInfo |
| - RADV, regression : Objects randomly appear/disappear on Unreal Engine 4 titles using D3D12 backend on Polaris |
| - mesa 23.1.0-rc3 flickering textures/lighting in Unreal 4 games Polaris10 |
| - ACO tests SIGSEGV in debian-vulkan job with LTO enabled |
| - radv: Address binding report for images is incorrect. |
| - blorp: avoid dirtying push constants in 3D |
| - anv: flaky vkd3d-proton test_buffer_feedback_instructions_sm51 |
| - FTBFS: commit aaccc25a4dd9ccfc134e51a7e81168334d63a909 broke mesa snapshot build |
| - d3d12_screen.cpp:60:10: fatal error: ShlObj.h: No such file or directory |
| - r300: crash when compiling some GSK shaders |
| - anv: vkd3d-proton test_stress_suballocation failure |
| - d3d12: Zwift renders with bad textures/lighting |
| - nir_opt_remove_phis breaks divergence analysis |
| - intel: Require 64KB alignment when using CCS and multiple engines |
| - NVK: Misrendering with Civilization 6 |
| - radv: RDR2 might need zerovram |
| - intel-clc build failure, i think? |
| - Issues rendering gtk4 window decorations on v3d on Fedora-40/mesa-24.0 |
| - clc: Failure when linking with llvm+clang 18.1 (-Dshared-llvm=disabled) |
| - LLVM-18 build issue |
| - vulkan/wsi/x11: VK_SUBOPTIMAL_KHR is never reported by the swapchain |
| - Broken vaapi encoding on Radeon RX 6900XT |
| - RUSTICL creating a shared reference to mutable static is discouraged and will become a hard error |
| - anv: GPU hang on Assassin's Creed Valhalla while running benchmark |
| - nvk: dota 2 crashes after ~5 seconds in game |
| - dzn: conflicting defines with DirectX headers 1.613.0 |
| - VAAPI: Incorrect HEVC block size reported with radeonsi |
| - radv: WWE 2K24 has very quirky DCC issues on RDNA2 |
| - anv: Dirt 5 crashes at tryCreatingPipelineStateFromCache |
| - freedreno: remove headergen2 |
| - freedreno: remove headergen2 |
| - vulkan/wsi: crash in dEQP-VK.wsi.wayland.swapchain.simulate_oom.min_image_count |
| - Document that Zink on MoltenVK is not expected to work |
| - KiCAD 3D Viewer - rounded pads rendered incorrectly (texture mapping or stencil test error) |
| - OpenSCAD rendering incorrect and inconsistent on radeonsi |
| - intel/fs: regression on MTL with 64bit values in UBO |
| - ci: split debian-build-testing? |
| - [freedreno] Black background on SuperTux Kart with postmarketOS and Oneplus 6T |
| - [radv] Half-Life Alyx renders solid black for reflective surfaces |
| - iris: iris_resource_get_handle returns wrong modifier |
| - [RX 7900 XTX] Helldivers 2 cause GPU reset |
| - radeon: Crash in radeon_bo_can_reclaim_slab |
| - regression/bisected: commit 4e3f3c10e14d8778781c81f39ced659b5ec2f148 broke mesa snapshot build |
| - RV530 renders improperly at non 4:3 resolutions. |
| - anv: new cooperative matrix failures with CTS 1.3.8.0 |
| - nvk: Missing implementation of VkImageSwapchainCreateInfoKHR and VkBindImageMemorySwapchainInfoKHR |
| - mesa > 23.1.9 [opencl,video_cards_nouveau] fails to build due to missing symbol vl_video_buffer_is_format_supported |
| - intel/meson: Make intel_stub_gpu work with \`meson devenv` |
| - Follow-up from "iris: Fix plane indexing and handling on image import" |
| - nvk,nak: Implement shaderStorageImageMultisample |
| - nvk,nak: Implement VK_KHR_shader_subgroup_uniform_control_flow |
| - \`[gfxhub0] no-retry page fault` triggered by \`AMD_TEST=testdmaperf` on gfx90c APU |
| - nvk: glcts hangs |
| - v3d: Line rendering broken when smoothing is enabled |
| - PowerVR reports minMemoryMapAlignment of 64 |
| - RADV: GPU crash when setting 'RADV_DEBUG=allbos' |
| - [intel] mesa ftbfs with time_t64 |
| - d3d12_resource.cpp:307:49: error: no matching function for call to 'ID3D12Heap::GetDesc()' |
| - radv regression between a337a0c8072d0be487e43c2b7b132e003c6d5a5e and 83f741124b66818053b6b1b2f7e42f5217a27004 |
| - [build failure] [armhf] - error: #error "_TIME_BITS=64 is allowed only with _FILE_OFFSET_BITS=64" |
| - R400 should have native support for sin/cos in VS |
| - [radv] Crash when VkGraphicsPipelineCreateInfo::flags = ~0u |
| - intel: all workarounds disabled with ATS skus |
| - vulkan: GPL now broken |
| - Gen4 assertion \`force_writemask_all' failed. |
| - src/gallium/auxiliary/rtasm/rtasm_x86sse.c:198:10: runtime error: store to misaligned address 0x7fabba0cd011 for type 'int', which requires 4 byte alignment |
| - [radv] Holographic projection texture glitch in Rage 2 |
| - RustiCL: Callbacks are not called upon errors |
| - MTL: regressions in vulkancts due to BO CCS allocations |
| - zink: spec\@ext_external_objects\@vk-image-overwrite fail |
| - vaapi: radeonsi: surface_region.{x,y} is not honored in processing when source is RGB |
| - nvk: Implement VK_EXT_shader_object |
| - nvk: Implement VK_EXT_graphics_pipeline_library |
| - turnip: UBWC disabled for MSAA |
| - KHR-Single-GL46.arrays_of_arrays_gl.AtomicUsage fails on MTL |
| - GTF-GL46.gtf42.GL3Tests.texture_storage.texture_storage_texture_as_framebuffer_attachment fails on MTL |
| - nvk: Implement VK_KHR_maintenance5 |
| - [intel][anv][build][regression] - genX_grl.h:27:10: fatal error: grl/grl_cl_kernel.h: No such file or directory |
| - RX 6600 VDPAU not recognizing HEVC_MAIN_10 correctly |
| - Running an app on another AMD GPU (offload, DRI_PRIME) produces corrupted frames on Wayland. |
| - regression in radeonsi since 9aa205668bcbf701f8f694551c284cd8e4cc17a3 (crashes in vbo_save_playback_vertex_list) |
| - clang/libclc related Mesa build failures |
| - Ninja Install Error |
| - anv: add a dri config to enable implicit fencing on external memory interop |
| - VDPAU declares a texture as "immutable" without also setting its ImmutableLevels attribute. |
| - Segfault in glsl_to_nir.cpp nir_visitor::visit when assigning interface block |
| - [rusticl]WARNING: Project targets '>= 1.1.0' but uses feature deprecated since '1.0.0': module rust has been stabilized. drop "unstable-" prefix from the module name |
| - RX6600 hardware HEVC video decode fails for VDPAU but works for VA-API. (Can lock up GPU!) |
| - Rusticl panics when getting program build logs using opencl.hpp |
| - ue5 game issues lighting Rog Ally 7080u (z1e) |
| - Missing textures in RoboCop: Rogue City with mesh shaders enabled |
| - Intel/anv: Allow pre-compiled shader caches to be reused across multiple devices |
| - radv: Multiview PSO forgets to export layer in some cases. |
| - -Dintel-rt=enabled fails to build on 32-bit |
| - MTL: regressions in vulkancts due to BO CCS allocations |
| - intel: build failures |
| - regression/bisected commit 4de62731f4db56360026cbb6a3b8566f86f22466 broke HW acceleration in the Google Chrome |
| - i386 intel build failure: meson.build:45:6: ERROR: Unknown variable "prog_intel_clc". |
| - rusticl: clEnqueueFillBuffer (among others) fails on buffers created from GL object. |
| - MTL raytracing regression |
| - [ANV/DG2] Unexpectedly slow replay of RenderDoc frame capture of Resident Evil 4 Remake |
| - zink: flickering artifacts in Selaco |
| - [ADL] gpu hang on dEQP-VK.synchronization.internally_synchronized_objects.pipeline_cache_graphics |
| - Turnip spam on non-turnip devices |
| - Intermittent compiler failures when building valhall tests |
| - panfrost: graphical artifacts on T604 (T600) |
| - Dying Light native artifacts on Intel A770 |
| - r300: Amnesia: The Dark Descent heavy corruption |
| - [ANV/DG2] Age of Empires IV fullscreen "banding" artefacts |
| - [mtl][anv] dEQP-VK.pipeline.monolithic.depth.format.d32_sfloat.compare_ops.* failures when run multithreaded |
| - [mtl][anv] flaky tests in pipeline.monolithic.extended_dynamic_state*stencil_state_face* series |
| - Broken colors/dual-source blending on PinePhone (Pro) since 23.1.0 |
| - r600/sfn: "Indexed locks of kcache banks 14 and 15 are ignored" in the ALU clause documentation |
| - turnip: Logarithmic-time subgroup reductions using brcst.active and getlast.w8 instructions on a6xx gen4+ |
| - GTF-GL46.gtf42.GL3Tests.texture_storage.texture_storage_compressed_texture_data regression |
| - microsoft/compiler: Missing globally-coherent logic |
| - Regression between 23.0.4 and 23.1.0: texture glitches in osgEarth |
| - [Broadcom] Warning when runnin every OpenGL game on Vulkan using ZINK |
| - radeonsi unsynchronized flips/tearing with KMS DRM rendering on 780M |
| - radeonsi has an unchecked hard dependency on libelf |
| - DR crashes with mesa 24 and rusticl (radeonsi) |
| - Piglit tests assert on gen9 with zink |
| - vlc crashes when playing 1920x1080 video with Radeon RX6600 hardware acceleration and deinterlacing enabled. |
| - [radeonsi] Regression: graphical artifacting on water texture in OpenGOAL |
| - Assertion when creating dmabuf-compatible VkImage on Tigerlake |
| - Palworld fails to launch on Intel Arc unless "force_vk_vendor" is set to "-1". |
| - panfrost: implement line smoothing |
| - r300: backend DCE fails in piglit glsl-vs-copy-propagation-1.shader_test |
| - [AMDGPU RDNA3] Antialiasing is broken in Blender |
| - MTL: vulkan cooperative matrix tests gpu hang on MTL |
| - nvk: Implement VK_KHR_zero_initialize_workgroup_memory |
| - Assassin's Creed Odyssey wrong colors on Arc A770 |
| - VAAPI: EFC on VCN2 produces broken H264 video and crashes the HEVC encoder |
| - etnaviv, modesetting, and glxgears |
| - The Finals fails to launch with DX12 on Intel Arc unless "force_vk_vendor" is set to -1. |
| - nvk: \`VK_KHR_zero_initialize_workgroup_memory` and \`VK_KHR_shader_subgroup_extended_types` not marked as complete in features.txt |
| - nvk: Implement variableMultisampleRate |
| - VA-API CI tests freeze |
| - radv: games render with garbage output on RX5600M through PRIME with DCC |
| - Warning when use ALIGN over uint64_t and uintptr_t |
| |
| |
| Changes |
| ------- |
| |
| Adrian Perez de Castro (1): |
| |
| - Revert "egl/wayland: Remove EGL_WL_create_wayland_buffer_from_image" |
| |
| Agate, Jesse (6): |
| |
| - amd/vpelib: Studio Range Handling |
| - amd/vpelib: White Screen Fix |
| - amd/vpelib: VPT Failing Test Cases |
| - amd/vpelib: VPE integration for HLG |
| - amd/vpelib: Add PQ Norm to VPE interface |
| - amd/vpelib: Refactor norm factor logic |
| |
| Alan Liu (4): |
| |
| - radeonsi/vpe: Add environment variable to set embbuf number |
| - radeonsi/vpe: Don't map and unmap emb_buffer every time in process_frame |
| - amd/vpelib: remove unused header file |
| - radeonsi/vpe: support vpe 1.1 |
| |
| Alejandro Piñeiro (3): |
| |
| - broadcom/compiler: fix coverity warning (unitialized pointer read) |
| - v3dv/bo: use mtx_lock/unlock on cache_init too |
| - v3dv: expose VK_EXT_depth_clip_enable |
| |
| Alexandre Marquet (1): |
| |
| - pan/mdg: quirk to disable auto32 |
| |
| Alyssa Rosenzweig (328): |
| |
| - nir/lower_ssbo: rewrite |
| - nir/lower_blend: return progress |
| - nir/lower_io_to_temporaries: return prog |
| - nir/lower_clip_cull_distance_arrays: return prog |
| - nir: return prog from drawpixels |
| - nir/lower_bitmap: return prog |
| - nir/lower_alpha_test: rewrite with intrinsics_pass |
| - nir/lower_point_size_mov: return prog |
| - nir/lower_passthrough_edgeflags: return progress |
| - nir/lower_io_arrays_to_elements: return prog |
| - nir/lower_flatshade: fix metadata |
| - glsl: return progress in point size linking |
| - glsl: don't use NIR_PASS_V |
| - glsl: fix metadata in gl_nir_zero_initialize_clip_distance |
| - mesa/st: return progress in st_nir_lower_wpos_ytransform |
| - mesa/st: use instructions_pass for plane lowering |
| - mesa/st: return progress lowering builtins |
| - mesa/st: don't use NIR_PASS_V |
| - agx: rm deadcode |
| - agx: ingest undefs in the backend |
| - agx: stop lowering in opt loop |
| - agx: only lower vars to ssa once |
| - agx: fix metadata in layer lowering |
| - agx: unset silly nir opts |
| - agx: return progress from passes |
| - asahi: return progress from passes |
| - asahi: avoid silly internal NIR_PASS in gs lowering |
| - asahi: don't use NIR_PASS for removing entrypoints |
| - asahi: don't use NIR_PASS_V |
| - nir/passthrough_gs: plug leak |
| - compiler,gallium: move u_decomposed_prim to common |
| - nir/passthrough_gs: flesh out gs_in_prim |
| - compiler: add a vs.tes_agx bit |
| - asahi: add more uapi stubs |
| - asahi: gut macOS related code |
| - asahi: lower poly stipple |
| - asahi: Implement skeleton for tessellation |
| - asahi: fix metadata for images with VS lowered to GS |
| - asahi: implement VBO robustness |
| - asahi: implement reset queries |
| - asahi: enable robustness |
| - asahi: fix unbound ssbos |
| - asahi: optimize more when linking libagx |
| - asahi: decode uniform_high records |
| - agx: implement load_subgroup_invocation |
| - agx: lower more subgroups |
| - agx: introduce ballot pseudo |
| - agx: fuse ballot+cmp |
| - nir: add active_subgroup_invocation_agx sysval |
| - agx: implement active_subgroup_invocation_agx |
| - agx: optimize first_invocation |
| - agx: optimize vote_eq |
| - asahi: fix prim restart unrolling with indirects |
| - asahi: delete bogus assertion |
| - asahi: plug passthrough tcs leak |
| - asahi: rework meta shader infra |
| - asahi: plug geometry shader leaks |
| - asahi: plug pre-gs leak |
| - asahi: plug early_serialized_nir leak |
| - asahi: plug so target leak |
| - asahi: plug glsl type leak |
| - asahi: plug geometry heap leak |
| - asahi: fix UB in qbo's |
| - agx: add some more bitop tests |
| - Revert "asahi: don't canonicalize nans/flush denorms when copying" |
| - asahi: sync with query mismatches |
| - asahi: enable tcs caching |
| - asahi: don't sync for uninitialized buffer |
| - asahi: fix valid buffer tracking for SSBO/image/XFB |
| - asahi: handle read-only SSBOs |
| - asahi: honour discard_whole | persistent |
| - agx: only run early tests if needed |
| - docs/asahi: fix strided linear note |
| - ail: add tests for linear<-->twiddled copies |
| - ail: port tiling routines to c++ |
| - ail: use template for tiled memcpy |
| - agx: don't inline imms into stack_store |
| - agx: optimize b2x(inot) |
| - agx: reassociate bcsel with ior/iand |
| - asahi: implement pipeline stats as a checkbox |
| - asahi: log geometry shaders separate from xfb |
| - asahi: don't use util_resource_size |
| - asahi: fix vbo dirty track |
| - asahi: force inline ppp update logic |
| - asahi: skip set if tested |
| - asahi: rm dead |
| - asahi: track bit count, not word count |
| - asahi: enable compblit behind dbg flag |
| - asahi: allow disk cache with compblit |
| - asahi: assert invariant |
| - asahi: drop silly else |
| - asahi: rewrite queries |
| - asahi: split up stage uniform upload |
| - asahi: dirty track stage uniforms |
| - asahi: collapse stage uniform upload |
| - asahi: optimize "no changes" case |
| - asahi: optimize no changes descriptor case |
| - asahi: move some code into dirty tracking |
| - asahi: drop any_draws |
| - asahi: fix instance count with indirect draw |
| - asahi: collapse if |
| - asahi: hoist xfb code |
| - asahi: hoist layer id code |
| - asahi: rm blank |
| - asahi: track batches with incoherent writes |
| - asahi: optimize memory_barrier |
| - asahi,agx: use intrinsics pass |
| - agx: clamp register file based on workgroup size |
| - agx: improve scratch size accounting |
| - asahi: add has_scratch to shader key |
| - agx: set nr_preamble_gprs for preamble scratch |
| - asahi: allocate preamble scratch |
| - agx: allow 16-bit immediate on stack load/store |
| - agx: print register vectors |
| - agx: introduce "memory variables" |
| - agx: add spill/fill lowering pass |
| - agx: unit test memory parallel copies |
| - agx: unit test spill/fill lowering |
| - agx: add parallel copy printing |
| - agx: add =spill debug option |
| - asahi: bump max threads per wg |
| - asahi: drop xfb hack |
| - asahi: allow vertex/geom/tess side effects |
| - agx: fix buffer overflow with varying slots |
| - asahi,agx: use hw clip distance |
| - asahi: fix dirty tracking issue |
| - asahi: rip out existing MDI+GS implementation |
| - libagx: fix buggy align macro |
| - asahi: make GS flatshade_first more dynamic |
| - libagx: use native static_assert on host |
| - libagx: use real PACKED macro |
| - libagx: static assert some sizes |
| - libagx: generalize vertex_id_for_topology |
| - asahi: simplify IA mode handling |
| - asahi: add shader_info::outputs for gs lower |
| - asahi: add geometry parameters for separable GS |
| - asahi: rework shader stage handling a bit |
| - asahi: separate GS from VS |
| - asahi: rm arrayed output lowering |
| - asahi: allow bindful GS textures |
| - asahi: shrink GS key |
| - asahi: infer stage in descriptor update |
| - asahi: be a bit more methodical with shader stages |
| - nir: rm load_vert_id_in_prim_agx |
| - asahi: allow lowering bindings after lowering textures |
| - asahi: collapse indirection with GS |
| - asahi: support stage override in sysval lower |
| - asahi: set gs_grid[0] even for direct draws |
| - asahi: use load_instance_id in gs lowering |
| - asahi: fix vertex out size calc |
| - asahi: invert geometry shaders |
| - asahi: implement GS disk caching |
| - asahi: rm dead |
| - asahi: simplify expressions involving xfb |
| - asahi: avoid silly psiz writes even with gs |
| - asahi: eliminate tri fan % |
| - asahi: make provoking vertex dynamic |
| - asahi: make gs topology dynamic |
| - asahi: support GS in shaderdb |
| - asahi: always support ARB_clip_control |
| - asahi: make clip_halfz dynamic |
| - asahi: rm ia key |
| - agx: remove discard -> zs_emit lower |
| - agx: rm dead sample count argument |
| - agx: call agx_nir_lower_sample_mask earlier |
| - agx: rm unused backend nr_samples |
| - agx: rm unused opt_ixor_bcsel |
| - agx: sink wait_pix |
| - asahi: Implement ARB_texture_barrier by decompression |
| - asahi: quelch gcc warning |
| - agx: rm ridiculous dependency |
| - agx: decouple compiler from genxml |
| - agx: use #pragma once |
| - asahi/lib: use #pragma once |
| - ail: use #pragma once |
| - asahi: use #pragma once |
| - asahi: clean up format table renderability |
| - asahi: split out genxml/ directory |
| - agx: move SSBO lowering |
| - agx: call texture lowering in the driver |
| - agx: move texture lowering into lib |
| - agx: decouple from libagx |
| - asahi: reorder compiler before clc |
| - asahi: precompile helper program |
| - agx: add "is helper program?" key bit |
| - asahi: advertise GL4.6 and ES3.2 |
| - docs: update for GL4.6 and ES3.2 on asahi |
| - vulkan: add vk_index_type_to_restart helper |
| - tu: use vk_index_to_restart |
| - anv,hasvk: use vk_index_to_restart |
| - util/hash_table: add u64 foreach macro |
| - util/ralloc: add memdup |
| - treewide: use ralloc_memdup |
| - panfrost: Add a library to build CSF command streams |
| - panfrost: Add support for the CSF job frontend |
| - nir/opt_shrink_vectors: hoist alu helpers |
| - nir/opt_shrink_vectors: shrink some intrinsics from start |
| - util: add _mesa_hash_table_u64_num_entries |
| - nir/print: do not print empty lists on intrinsics |
| - util/hash_table: add DERIVE macro |
| - panfrost: derive ht |
| - asahi: derive ht |
| - nvk: derive ht |
| - radeonsi: derive ht |
| - v3d: derive ht |
| - glsl_types: derive ht |
| - asahi: bump maximum samplers for Blender |
| - asahi: allow more samplers for shaderdb |
| - asahi: move more code out of agx_preprocess_nir |
| - asahi/lib: fix overread with stateful |
| - asahi: fix overread with samplers |
| - asahi: clarify how unroll index buffers are offsetted |
| - asahi: zero more in the unroll path |
| - asahi: fix unit mismatch with unroll path |
| - asahi: fix stage accounting for meta compute shaders |
| - asahi: export build_meta_shader |
| - asahi: add flush_query_writers helper |
| - asahi: add helper to classify queries |
| - asahi: accelerate QBO copies |
| - asahi: fix depth bias interactions with points/lines |
| - asahi: implement CDM stream linking for GS |
| - asahi: be robust against tess batch changes |
| - asahi: stop merging VS and TCS |
| - asahi: drop TCS key |
| - asahi: drop asahi_vs_next_stage |
| - libagx: improve static assert message |
| - asahi/clc: fix mem leaks |
| - agx/opt_cse: alloc less |
| - agx: fix stack smash with spilling |
| - agx: fix allocating phi sources past the reg file |
| - agx: add more asserts |
| - agx: add num_successors helper |
| - agx: fix 16-bit mem swaps |
| - agx: scalarize vector phis |
| - agx: allow vector phis to pass validation |
| - agx: assert phis don't have .kill set |
| - agx: fix bogus implicit cast with 2d msaa arrays |
| - agx: sink harder |
| - agx: implement live range splits of phis |
| - agx: don't leak shuffle copies |
| - agx: add more iterator macros |
| - agx: add temp_like helper |
| - agx: add before_function cursor |
| - agx: add limit for max sources per non-phi |
| - agx: coalesce phi webs |
| - agx: try to coalesce moves |
| - agx: drop scratch regs for spilling |
| - agx: validate phi sources for consistency |
| - agx: add SSA reindexing pass |
| - agx: add SSA repair pass |
| - agx: add Braun-Hack spiller pass |
| - agx: switch to Braun-Hack spiller |
| - agx: use dense reg_to_ssa map |
| - agx: make add_successor public |
| - agx: add helpers for multiblock unit tests |
| - agx: add tests for SSA repair |
| - agx: move spill/fills accounting to shaderdb |
| - agx: enable indirect temps |
| - agx: generalize remat code |
| - agx: implement get_sr remat |
| - asahi: use less bindless samplers |
| - agx: add more shaderdb stats |
| - agx: fix lowering uniforms with abs/neg |
| - agx: restrict high uniforms with textures |
| - agx: extract "accepts uniform?" ISA query |
| - agx: model 64-bit uniform restriction on ALU |
| - agx: extract agx_is_float_src |
| - agx: promote constants to uniforms |
| - agx: compact 32-bit constants |
| - agx: test constant compaction |
| - agx: implement load_subgroup_id |
| - libagx: polyfill glsl ballot() |
| - libagx: accelerate restart unroll across a subgroup |
| - libagx: accelerate prim restart unroll across wg |
| - libagx: deal with silly NIR |
| - libagx: parallelize prefix sum over 1024 threads |
| - agx: use funop short form |
| - agx: split select opt into its own pass |
| - agx: vectorize uniform_store |
| - agx: start a crude cycle model |
| - agx/opt_preamble: improve preamble cost function |
| - agx/opt_preamble: restrain ourselves |
| - agx/opt_preamble: preamble cycle estimates |
| - agx/opt_preamble: improve rewrite cost est |
| - docs/asahi: document UVS |
| - nir: add offset to load_coefficients_agx |
| - nir: add intrinsics for lowered VS outputs |
| - asahi: add agx_push_packed |
| - asahi: drop =varyings debug |
| - asahi: extract agx_cf_binding |
| - agx: explicitly assign coeff registers |
| - agx: pack indirect CF |
| - agx: handle indirect varyings |
| - asahi: advertise indirect fs inputs |
| - agx: rm unnecessary iter hack |
| - agx: pack indirect st_vary |
| - agx: inline imm into st_vary |
| - asahi: rewrite varying linking |
| - asahi: drop now-empty base key |
| - asahi: make point size replacement dynamic |
| - asahi: stop using GLSL indirect lowering |
| - agx/lower_vbo: dce as we go |
| - asahi: drop dead linked_so code |
| - asahi: use ht derive more |
| - asahi: fix _packed USC structs |
| - asahi: delete layer id code |
| - asahi: don't set writes_memory for tib spilling |
| - agx: optimize out wait_pix in some cases |
| - agx: inline sampler states |
| - agx: always reserve sampler #0 for txf |
| - asahi: fix bit sizes in point sprite lower |
| - nir: add samples_log2_agx sysval |
| - nir: add export/load_exported_agx intrinsics |
| - agx: wire up samples_log2 sr |
| - agx: generalize preloaded cache |
| - agx: implement exports |
| - agx: document non-monolithic ABI |
| - asahi: add agx_usc_push_packed helper |
| - asahi: constify agx_build_tilebuffer_layout |
| - asahi: don't allocate tib space for gaps |
| - nir: add intrinsics for non-monolithic agx shaders |
| - agx: drop shader stage assertion |
| - asahi: static assert blend key size |
| - agx: add agx_shader_part data structure |
| - agx: add main_size info |
| - asahi: add fast linker |
| - asahi/clc: stop padding binaries |
| - asahi: switch to VS/FS prolog/epilog system |
| |
| Amber (3): |
| |
| - tu: wideLines support for a7xx. |
| - tu: Add MESA_VK_DYNAMIC_RS_LINE_WIDTH to tu_rast_state. |
| - tu: re-emit vertex buffer on MESA_VK_DYNAMIC_VI_BINDINGS_VALID dirty. |
| |
| Amber Harmonia (1): |
| |
| - freedreno/common: Fix register stomper ranges for A7XX |
| |
| Andres Calderon Jaramillo (1): |
| |
| - radeonsi: get enc/dec caps from kernel only on amdgpu |
| |
| Antoine Coutant (2): |
| |
| - clc: retrieve libclang path at runtime. |
| - drisw: fix build without dri3 |
| |
| Anton Bambura (2): |
| |
| - panfrost: Enable Mali-T600 |
| - docs/panfrost: Document Mali-T600 support |
| |
| Antonio Gomes (6): |
| |
| - mesa/st: Skip querying PCI values in interop_query_device_info if version >= 4 |
| - rusticl/gl: Bump mesa_glinterop_device_info to version 4 |
| - gallium: Add new PIPE_CAP_CL_GL_SHARING |
| - iris: Set PIPE_CAP_CL_GL_SHARING to true |
| - radeonsi: Set PIPE_CAP_CL_GL_SHARING to true |
| - rusticl/device: Verify for PIPE_CAP_CL_GL_SHARING when enabling gl_sharing |
| |
| Arthur Huillet (1): |
| |
| - nvk: remove useless MME scratch 26 usage |
| |
| Asahi Lina (17): |
| |
| - asahi: libagx: introduce AGX_STATIC_ASSERT |
| - agx: Rename some SRs |
| - nir: Add AGX-specific helper opcodes |
| - agx: Hook up AGX helper NIR intrinsics |
| - agx: Hook up helper intrinsics into CL |
| - agx: Add scaffolding to build the helper shader at device init |
| - agx: compiler: Add fence_helper_exit_agx barrier |
| - agx: compiler: Export scratch size to the driver |
| - agx: compiler: Enable stack_adjust |
| - asahi: libagx: Move PACKED and GLOBAL macros to libagx.h |
| - asahi: cmdbuf: Fix scratch bucket offset/size |
| - asahi: Implement scratch allocation |
| - asahi: scratch: Add feature to debug core IDs |
| - asahi: Hook up scratch |
| - asahi: Allocate scratch for shaders |
| - asahi: Enable scratch debugging |
| - asahi: batch: Trace before waiting for syncobj |
| |
| Assadian, Navid (2): |
| |
| - amd/vpelib: Apply inverse gamut remap to background |
| - amd/vpelib: Use uint64 for buffer size |
| |
| Axel Davy (5): |
| |
| - frontend/nine: Fix ff ps key |
| - frontend/nine: Fix programmable vs check |
| - frontend/nine: Fix missing light flag check |
| - frontend/nine: Fix destruction race |
| - frontend/nine: Reset should EndScene |
| |
| Bas Nieuwenhuizen (10): |
| |
| - util/disk_cache: Add marker on cache usage. |
| - radv: Remove ray_launch_size_addr_amd system value. |
| - radv: Add winsys argument to buffer map/unmap. |
| - radv/winsys: Use radv_buffer_map wrapper. |
| - radv/amdgpu: Use mmap directly. |
| - radv: Support for mapping a buffer at a fixed address. |
| - radv: Implement reserving the VA range on unmap. |
| - radv: Expose VK_EXT_map_memory_placed. |
| - radv: Fix differing aspect masks for multiplane image copies. |
| - radv: Use zerovram for Enshrouded. |
| |
| Benjamin Lee (14): |
| |
| - nak: support predicate swaps on SM50 |
| - nak: support predicate sel on SM50 |
| - nak: fix frnd on SM50 |
| - nak: implement FSWZADD on SM50 |
| - nak: implement FLO on SM50 |
| - nak: fix iabs on SM50 with an explicit i2i op |
| - nak: implement rro op on SM50 |
| - nak: use rro when emitting mufu on SM50 |
| - nak: implement kill op on SM50 |
| - nak: implement cs2r op on SM50 |
| - nak: handle nop ops from NAK IR on SM50 |
| - nak: fix lod mode encoding for SM50 tld op |
| - nak: fix tex offset encoding on SM50 |
| - nvk: disable shaderResourceMinLod on pre-sm70 |
| |
| Benjamin Tissoires (3): |
| |
| - CI: add mr-label-maker.yml config |
| - .mr-label-maker.yml: fix wrong label |
| - CI: add a test for checking the validity of .mr-label-maker.yml |
| |
| Biju Das (1): |
| |
| - gallium: Add Renesas rzg2l-du DRM entry point |
| |
| Blisto (1): |
| |
| - driconf: set vk_x11_strict_image_count for Atlas Fallen Vulkan |
| |
| Bob Beckett (2): |
| |
| - panfrost: Add an entry for panthor in the renderonly_drivers[] array |
| - panfrost: Add the gallium glue to get panfrost loaded when panthor is detected |
| |
| Boris Brezillon (193): |
| |
| - panvk: Fix tracing |
| - panvk: Fix access to unitialized panvk_pipeline_layout::num_sets field |
| - panfrost: Kill unused forward declarations in pan_texture.h |
| - panfrost: Add a per-gen panfrost_format_from_pipe_format() helper |
| - panfrost: Add a per-gen panfrost_blendable_format_from_pipe_format() helper |
| - panfrost: Make panfrost_format_to_bifrost_blend() a per-gen helper |
| - panfrost: Add panfrost_[blendable]_format_table() helpers |
| - panfrost: Move panfrost_is_yuv() to pan_format.h |
| - panfrost: Move YUV-debugging out of panfrost_new_texture() |
| - panfrost: Stop passing a panfrost_device to panfrost_new_texture() |
| - panfrost: Don't pass a panfrost_device to panfrost_format_supports_afbc() |
| - panfrost: Don't pass a panfrost_device to panfrost_afbc_can_tile() |
| - panfrost: Stop passing a panfrost_device to pan_blend_get_internal_desc() |
| - panfrost: Stop exposing pan_blend_create_shader() |
| - panfrost: Stop passing a panfrost_device to pan_blend_create_shader() |
| - panfrost: Stop passing a panfrost_device to pan_inline_rt_conversion() |
| - panfrost: Make the pan_blend logic panfrost_device-agnostic |
| - panfrost: Get rid of unused panfrost_device arguments in pan_blitter.c |
| - panfrost: Pass the tile buffer budget through pan_fb_info |
| - panfrost: Pass the sample position array through pan_fb_info |
| - panfrost: Pass no_hierarchical_tiling info through pan_tiler_context |
| - panfrost: Pass tiler heap info through pan_tiler_context |
| - panvk: Inline pan_wls_mem_size() |
| - panfrost: Make pan_desc.{c,h} panfrost_device agnostic |
| - panfrost: Drop unused panfrost_device forward declaration in pan_shader.h |
| - panfrost: Make pan_layout.c panfrost_device agnostic |
| - panfrost: Make pan_sample.c panfrost_device agnostic |
| - panfrost: Make pan_encoder.h panfrost_device agnostic |
| - panfrost: Remove unused header inclusions from pan_blitter.h |
| - panfrost: Make pan_blitter.h includable from non per-gen files |
| - panfrost: Make pan_blitter.{c,h} panfrost_device agnostic |
| - panfrost: Make pan_indirect_dispatch panfrost_device agnostic |
| - panfrost: Make pan_pool.h panfrost_{device,bo} agnostic |
| - panfrost: Make pan_props.c panfrost_device agnostic |
| - panfrost: Make pan_texture.{c,h} panfrost_bo agnostic |
| - panfrost: Make pan_desc.{c,h} panfrost_bo agnostic |
| - panfrost: Remove uneeded pan_device.h inclusions |
| - panfrost: Make panfrost_texfeatures.c panfrost_device agnostic |
| - panfrost: Make pan_perf panfrost_device agnostic |
| - panfrost: Add a helper to retrieve a panfrost_bo from a pan_kmod_bo |
| - panvk: Get rid of unused pdev arguments passed to some meta helpers |
| - panvk: Stop passing panfrost_device around in internal meta helpers |
| - panvk: Store various physical device properties at the physical_device level |
| - panvk: Use vk_device::drm_fd instead of going back to the physical device |
| - panvk: Move panfrost_device and panvk_meta to panvk_device |
| - panvk: Add a decode context at the panvk_device level |
| - panvk: Instantiate our own blitter/blend_shader caches |
| - panvk: Add pan_kmod_{vm,dev} objects to panvk_device |
| - panvk: Add the concept of private BO |
| - panvk: Transition panvk_pool to panvk_priv_bo |
| - panvk: Transition panvk_descriptor_set to panvk_priv_bo |
| - panvk: Transition panvk_pipeline to panvk_priv_bo |
| - panvk: Transition panvk_{image,buffer}_view to panvk_priv_bo |
| - panvk: Track blit src/dst using pan_kmod_bo objects |
| - panvk: Keep a ref to a pan_kmod_bo in panvk_image |
| - panvk: Keep a ref to a pan_kmod_bo in panvk_buffer |
| - panvk: Keep tiler_heap and sample_positions BOs at the panvk_device level |
| - panvk: Move away from panfrost_{bo,device} |
| - panfrost: Move pan_{bo,device}.{c,h} to the gallium driver dir |
| - panfrost: Clamp the render area to the damage region |
| - panfrost: v4 doesn't have Blend descriptors |
| - panfrost: Pad compute jobs with zeros on v4 |
| - pan/va: Add missing valhall_enums dep to valhall_disasm |
| - pan/kmod: Fix typo in pan_kmod_vm_op_check() helper |
| - pan/kmod: Add a PAN_KMOD_VM_FLAG_TRACK_ACTIVITY flag |
| - pan/kmod: Reject pre 1.1 panfrost kernel drivers |
| - panfrost: Rework the way we compute thread info |
| - panfrost: Prepare support for GPU variants |
| - pan/perf: Reject panthor kernel driver |
| - drm-uapi: Add panthor uAPI |
| - pan/kmod: Add a backend for panthor |
| - panfrost: Add v10 support to libpanfrost |
| - pan/genxml: Various CS related improvements in v10.xml |
| - pan/decode: Introduce the concept of usermode queue |
| - panfrost: Don't allocate a tiler heap buffer on v10+ |
| - pan/genxml: Make sure pan_pack() evaluates 'dst' only once |
| - panfrost: Relax position result alignment constraint on v10+ |
| - panfrost: Add arch-specific context init/cleanup hooks |
| - panfrost: Add a panfrost_context_reinit() helper |
| - panfrost: Add a cleanup_batch() method to panfrost_vtable |
| - panfrost: Enable v10 in the gallium driver |
| - panfrost: Advertize G610 support |
| - panfrost: Advertize G310 support |
| - panfrost: Update the release note to mention G310/G610 addition |
| - vk/meta: Add the PUSH_DESCRIPTOR_BIT flag when creating blit pipeline layouts |
| - vk/meta: Fix base_type selection in build_{clear,blit}_shader() |
| - panvk: Fix call ordering in panvk_DestroyDevice() |
| - panvk: clang-format the source files |
| - panvk: Kill the panvk_pack_color() prototype |
| - panvk: Add VKAPI_{ATTR,CALL} specifiers to all panvk-specific entrypoints |
| - panvk: Do not handle invalid NULL memory object in BindImageMemory2() |
| - panvk: Get rid of unused panvk_image_get_plane_size() helper |
| - panvk: Get rid of the custom device lost handling |
| - panvk: Fix allocation scope of command buffer sub-objects |
| - panvk: Add missing util_dynarray_init() in panvk_cmd_open_batch() |
| - panvk: Don't open-code panvk_cmd_open_batch() in CmdBeginRenderPass2() |
| - panvk: Don't allocate a TEXTURE descriptor in CreateImageView() |
| - panvk: s/panvk_event_op/panvk_cmd_event_op/ |
| - panvk: Allocate descriptor set arrays using vk_multialloc_zalloc() |
| - panvk: Don't pass a device where we don't need one |
| - panvk: Get rid of unused panvk_cmd_buffer fields |
| - panvk: Kill panvk_{Create,Destroy}SamplerYcbcrConversion() |
| - panvk: Drop panvk_framebuffer |
| - panvk: Get rid of panvk_pipeline_cache |
| - panvk: Make panvk_buffer_view inherit from vk_buffer_view |
| - panvk: Make panvk_device_memory inherit from vk_device_memory |
| - panvk: Make pan_AllocateMemory() robust to errors |
| - panvk: Add extra checks to panvk_MapMemory() |
| - panvk: Implement {Map,Unmap}Memory2KHR |
| - panvk: Make panvk_sampler inherit from vk_sampler |
| - panvk: Fix GetPhysicalDeviceProperties2() to report accurate info |
| - panvk: Get rid of fields we already have in vk_xxx objects |
| - panvk: Disable global offset on varying and non-VS attribute descriptors |
| - panfrost: Move the image attribute offset adjustment to a NIR pass |
| - panvk: Implement dynamic rendering entry points |
| - nir: Extend nir_get_io_offset_src_number() to support load_push_constant |
| - nir: Extend nir_lower_mem_access_bit_sizes() to support push constants |
| - pan/bi: Lower push constant accesses |
| - pan/bi: Lower load_push_constant with dynamic indexing |
| - pan/bi: Update the push constant count when emitting load_push_constant |
| - panvk: Move some macros to panvk_macros.h |
| - panvk: Move image related definitions to panvk_image.{h,c} |
| - panvk: Move the VkBuffer logic to its own source file |
| - panvk: Move the VkBufferView logic to its own file |
| - panvk: Move the VkDeviceMemory logic to panvk_device_memory.{c,h} |
| - panvk: Move the VkSampler logic to its own file |
| - panvk: Move panvk_pipeline definition to panvk_pipeline.h |
| - panvk: Move VkImageView logic to its own source files |
| - panvk: Move the VkEvent logic to panvk_event.{c,h} |
| - panvk: Move panvk_descriptor_{set,pool} definitions to panvk_descriptor_set.h |
| - panvk: Move VkDescriptorSetLayout logic to panvk_descriptor_set_layout.{c,h} |
| - panvk: Move VkPipelineLayout logic to its own file |
| - panvk: Move shader related definitions to panvk_[vX\_]shader.{c,h} |
| - panvk: Kill panvk_[vX\_]cs.{c,h} |
| - panvk: Move panvk_{draw,dispatch}_info definitions to panvk_vX_cmd_buffer.c |
| - panvk: Move the VkCommandPool logic to panvk_cmd_pool.{c,h} |
| - panvk: Move VkQueue logic to panvk_[vX\_]queue.{c,h} |
| - panvk: Add a panvk_arch_dispatch_ret() variant |
| - panvk: Make the device creation/destruction per-arch |
| - panvk: Move the VkInstance logic to panvk_instance.{c,h} |
| - panvk: Move the VkPhysicalDevice logic to panvk_physical_device.{c,h} |
| - panvk: Move panvk_meta definitions to panvk_meta.h |
| - panvk: Move panvk_device definition to panvk_device.h |
| - panvk: Move the panvk_cmd_buffer definitions in panvk_cmd_buffer.h |
| - panvk: Move the panvk_priv_bo logic to panvk_priv_bo.{c,h} |
| - panvk: Move panvk_wsi definitions to panvk_wsi.h |
| - panvk: Kill panvk_private.h |
| - panvk: Make panvk_buffer_view per-gen |
| - panvk: Make panvk_image_view per-gen |
| - panvk: Make panvk_sampler a per-gen |
| - panvk: Make panvk_cmd_buffer per-gen |
| - panvk: Make panvk_shader per-gen |
| - panvk: Make panvk_descriptor_set per-gen |
| - panvk: Make panvk_descriptor_set_layout per-gen |
| - panvk: Make panvk_pipeline per-gen |
| - panvk: Make panvk_queue per-gen |
| - panvk: Make panvk_pipeline_layout per-gen |
| - panvk: Fix attach-less rendering |
| - panvk: Fix the colorAttachmentCount check in begin_rendering_init_fbinfo() |
| - pan/bi: Support fragment store_output() with a non-zero offset |
| - panvk: Don't assume VkGraphicsPipelineCreateInfo::pColorBlendState != NULL |
| - pan/bi: Allow subpass sampler dims |
| - panvk: Fix input attachment support |
| - panvk: Fill pan_tls_info::wls::instances |
| - panvk: Make sure the sample_pattern is set in the tiler descriptor |
| - panvk: We don't support resolve operations yet |
| - pan/bi: Extend bi_emit_texc() to support wider direct tex/sampler idx |
| - panvk: Don't assume pViewportState != NULL |
| - panvk: Fix img2buf copies with image X offset not aligned on 16 pixels |
| - panvk: Fix has_non_vs_attribute() test in panvk_draw_prepare_vs_attribs() |
| - panvk: Make sure we pick a valid wrap_mode_r value for unnormalizedCoordinates |
| - panvk: Fix depth/stencil image views |
| - panvk: Make sure we have a decode context created when we need one |
| - panvk: Don't advertize vertex_buffer cap on sRGB formats |
| - panvk: Swizzle the border color on v7 when the format is BGR |
| - panvk: Re-order things in panvk_physical_device_init() |
| - panvk: Fill maxCustomBorderColorSamplers |
| - panvk: Skip tiler jobs when the vertex shader doesn't write the position |
| - panvk: Make sure we use the proper format for views of depth+stencil images |
| - panvk: Abort on fault when PANVK_DEBUG=sync |
| - panvk/ci: Make sure we catch GPU faults |
| - panvk/ci: Enable dEQP-VK.pipeline.monolithic.* |
| - panvk: Add support for KHR_push_descriptor |
| - panvk/ci: Re-enable copy_and_blit tests |
| - panvk: Stop declaring one push constant array per graphics stage |
| - panvk: Pass the push constant array to draw/dispatch calls |
| - panvk: Stop lowering push constant loads to UBO loads |
| - panvk: Dissociate UBO and push_constant emission |
| - nir/lower_blend: Fix nir_blend_logicop() for 8/16-bit integer formats |
| - panfrost: do not write outside num_wg_sysval |
| - panfrost: Add the BO containing fragment program descriptor to the batch |
| - pan/kmod: Fix a syncobj leak in the panthor backend |
| - pan/kmod: Make default allocator thread-safe |
| |
| Boyuan Zhang (5): |
| |
| - radeonsi/vcn: only use multi slices reflist when available |
| - meson: bump the minimal required vdpau version to 1.4 |
| - ac/gpu_info: Add vcn dec and enc version query |
| - radeonsi/vcn: choose rc_per_pic by encode verison |
| - radeonsi/vcn: mark rc_per_pic as obsoleted |
| |
| Błażej Szczygieł (2): |
| |
| - gallivm/ssbo: replace run time loop by compile time loop |
| - gallivm/ssbo: mask offset with exec_mask instead of building the 'if' |
| |
| Caio Oliveira (268): |
| |
| - intel/compiler/xe2: Implement instruction compaction for DPAS. |
| - intel/compiler: Add couple of tests for fs_combine_constants |
| - intel/compiler: Fix rebuilding the CFG in fs_combine_constants |
| - intel: Use an intel enum for cmat scope |
| - intel/compiler: Enable lower_rotate_to_shuffle in subgroup lowering |
| - anv: Advertise VK_KHR_shader_subgroup_rotate |
| - iris: Remove unused brw_* includes |
| - iris: Remove prototypes for unsupported Gfx versions |
| - iris: Remove unused paramater |
| - iris: Call blorp_finish() when destroying context |
| - crocus: Call blorp_finish() when destroying context |
| - intel/compiler: Rename brw_image_param to isl_image_param |
| - intel/compiler: Rename BRW_WM_MSAA_* enums to INTEL_MSAA_* |
| - intel/compiler: Rename BRW_TESS_* enums to INTEL_TESS_* |
| - intel/compiler: Rename DISPATCH_MODE_* enums to INTEL_DISPATCH_MODE_* |
| - intel/compiler: Rename brw_vue_map to intel_vue_map |
| - intel/compiler: Rename brw_cs_dispatch_info to intel_cs_dispatch_info |
| - intel/compiler: Move disassemble functions to own header file |
| - intel/compiler: Include brw_disasm_info.h where its used |
| - intel/compiler: Merge intel_disasm.[ch] into corresponding brw files |
| - intel: Rename i965_{asm,disasm} tools to brw_{asm,disasm} |
| - intel/blorp: Don't require specific prog_data type in callback |
| - intel/blorp: Remove brw\_ prefix when not applicable |
| - intel/blorp: Simplify blorp_compile_fs() interface |
| - intel/blorp: Simplify blorp_compile_cs() interface |
| - intel/blorp: Use a struct to return blorp_compile_*() results |
| - intel/blorp: Remove outdated reference in comment |
| - intel/blorp: Move brw_blorp_get_urb_length helper |
| - intel/blorp: Avoid brw types in blorp_priv.h |
| - intel/blorp: Move brw_compiler.h include to where is needed |
| - intel/blorp: Use a Meson dependency for blorp |
| - intel: Add missing dependencies on blorp |
| - intel/decoder: Move decoder to a separate module |
| - intel/compiler: Collect NIR-only passes in intel_nir.h |
| - intel/compiler: Rename the passes and files related to intel_nir.h |
| - intel/compiler: Rename brw_gfx_ver_enum.h to intel_gfx_ver_enum.h |
| - intel: Remove brw\_ prefix from process debug function |
| - intel/isl: Include compiler generic header |
| - anv: Remove lower_atomics from storage image lowering opts |
| - iris: Remove no-ops from storage image lowering |
| - intel/compiler: Use "intel" prefix for walk_order enum |
| - iris: Add stage to iris_compiled_shader |
| - iris: Don't use prog_data to guard 3DSTATE_CONSTANT_* code |
| - iris: Reduce dependency on brw_*_prog_data structs |
| - iris: Take ownership of prog_data when applying it |
| - iris: Use uint32_t instead of brw_param_builtin |
| - iris: Move compiler creation to iris_program.c |
| - iris: Add IRIS_MAX_* constants to replace BRW_MAX_* usage |
| - iris: Add helper to access use_tcs_multi_patch |
| - iris: Add helper for indirect_ubos_use_sampler |
| - iris: Move iris_get_compiler_options to iris_program.c |
| - iris: Include brw_compiler.h only when needed |
| - intel/meson: Remove usage of meson.source_root and meson.build_root |
| - intel/meson: Fix warning about broken str.format |
| - intel/elk: Fork Gfx8- compiler by copying existing code |
| - intel/elk: Compile ELK library, tests and tools |
| - intel/elk: Remove compiler specific devinfo hash |
| - intel/elk: Remove a bunch of files that don't apply for Gfx8- |
| - intel/elk: Use common code in intel/compiler |
| - intel/elk: Remove stages not used in Gfx8- |
| - intel/elk: Remove DPAS lowering |
| - intel/elk: Rename files to use elk prefix |
| - intel/elk: Rename header guards |
| - intel/elk: Update doxygen-like file comments |
| - intel/elk: Rename C++ namespace |
| - intel/elk: Rename symbols |
| - intel/elk: Don't include elk_eu_defines.h in elk_nir.h |
| - intel/elk: Create separate header for opcodes |
| - intel/blorp: Move brw specific code to a separate file |
| - intel/blorp: Explicitly include brw_compiler.h header |
| - intel/blorp: Add ELK support |
| - intel/blorp: Remove Gfx9+ references in elk code |
| - intel/decoder: Add ELK support |
| - crocus: Use ELK compiler |
| - hasvk: Use ELK compiler |
| - iris: Rename screen->compiler to screen->brw |
| - iris: Use ELK compiler for Gfx8 |
| - intel/tools: Add ELK support for aubinator |
| - intel/tools: Add ELK support for aubinator_error_decode |
| - intel/tools: Add ELK support for intel_hang_replay |
| - intel/tools: Add ELK support for aubinator_viewer |
| - intel/tools: Add ELK support for intel_hang_viewer |
| - intel: Use _brw suffix for genX headers that rely on brw |
| - intel/meson: Rename libintel_compiler to libintel_compiler_brw |
| - intel/tools: Add extra compiler device sha only for Gfx9+ |
| - intel/elk: Move nir_options to its own c/h file pair |
| - intel-clc: Use correct set of nir_options when building for Gfx8 |
| - intel/elk: Use anonymous namespace in fs_combine_constants |
| - intel/elk: Remove tests for Gfx9+ |
| - intel/brw: Remove assembler tests for Gfx8- |
| - intel/brw: Remove EU compaction tests for Gfx8- |
| - intel/brw: Remove EU validation tests for Gfx8- |
| - intel/brw: Remove pass test cases for Gfx8- |
| - intel/brw: Assert Gfx9+ |
| - intel/compiler: Remove has_render_target_reads from wm_prog_data |
| - intel/brw: Remove Gfx8- passes from optimize() |
| - intel/brw: Pull opt_copy_propagation out of fs_visitor |
| - intel/brw: Pull opt_cmod_propagation out of fs_visitor |
| - intel/brw: Pull opt_saturate_propagation out of fs_visitor |
| - intel/brw: Pull dead_code_eliminate out of fs_visitor |
| - intel/brw: Pull opt_combine_constants out of fs_visitor |
| - intel/brw: Pull opt_cse out of fs_visitor |
| - intel/brw: Pull bank_conflicts out of fs_visitor |
| - intel/brw: Pull peephole_sel out of fs_visitor |
| - intel/brw: Pull redundant_halt out of fs_visitor |
| - intel/brw: Pull opt_algebraic out of fs_visitor |
| - intel/brw: Pull split/compact virtual_grf opts out of fs_visitor |
| - intel/brw: Pull opt_split_sends out of fs_visitor |
| - intel/brw: Pull opt_zero_samples out of fs_visitor |
| - intel/brw: Pull eliminate_find_live_channel out of fs_visitor |
| - intel/brw: Pull remove_extra_rounding_modes out of fs_visitor |
| - intel/brw: Pull register_coalesce out of fs_visitor |
| - intel/brw: Pull lower_constant_loads out of fs_visitor |
| - intel/brw: Pull lower_pack out of fs_visitor |
| - intel/brw: Pull lower_simd_width out of fs_visitor |
| - intel/brw: Pull lower_barycentrics out of fs_visitor |
| - intel/brw: Pull lower_logical_sends out of fs_visitor |
| - intel/brw: Pull fixup_nomask_control_flow out of fs_visitor |
| - intel/brw: Pull lower_integer_multiplication out of fs_visitor |
| - intel/brw: Pull lower_sub_sat out of fs_visitor |
| - intel/brw: Pull lower_derivatives out of fs_visitor |
| - intel/brw: Pull lower_regioning out of fs_visitor |
| - intel/brw: Pull fixup_sends_duplicate_payload out of fs_visitor |
| - intel/brw: Pull lower_uniform_pull_constant_loads out of fs_visitor |
| - intel/brw: Pull lower_find_live_channel out of fs_visitor |
| - intel/brw: Pull lower_load_payload out of fs_visitor |
| - intel/brw: Use references for a couple of backend_shader passes |
| - intel/brw: Simplify OPT macro usage in fs_visitor::optimize |
| - intel/brw: Pull fixup_3src_null_dest out of fs_visitor |
| - intel/brw: Pull emit_dummy_memory_fence_before_eot out of fs_visitor |
| - intel/brw: Pull emit_dummy_mov_instruction out of fs_visitor |
| - intel/brw: Pull lower_scoreboard out of fs_visitor |
| - intel/brw: Pull optimize() out of fs_visitor |
| - intel/brw: Move optimize and small optimizations to brw_fs_opt.cpp |
| - intel/brw: Move virtual GRF opts into their own file |
| - intel/brw: Move fs algebraic to its own file |
| - intel/brw: Move small lowering passes into brw_fs_lower.cpp |
| - intel/brw: Move lower_integer_multiplication to its own file |
| - intel/brw: Expose flag_mask/bit_mask fs helpers |
| - intel/brw: Move lower_simd_width to its own file |
| - intel/brw: Move workarounds to a separate file |
| - intel/blorp: Remove Gfx8- references in BRW code |
| - intel/brw: Move brw_compile_* functions out of vec4-specific files |
| - intel/brw: Move type_size_* functions out of vec4-specific file |
| - intel/brw: Always use scalar shaders |
| - intel/brw: Remove vec4 backend |
| - intel/brw: Remove now unused vec4-only opcodes |
| - intel/brw: Remove unused legacy shader stages |
| - intel/brw: Remove Gfx8- code from disassembler |
| - intel/brw: Remove Gfx8- code from assembler |
| - intel/brw: Remove Gfx8- code from brw_compile_* functions |
| - intel/brw: Remove Gfx8- code from scheduler |
| - intel/brw: Remove Gfx8- code from register allocator |
| - intel/brw: Remove Gfx8- code from thread payload |
| - intel/brw: Remove Gfx8- code from NIR conversion |
| - intel/brw: Remove Gfx8- code from lower storage image pass |
| - intel/brw: Remove Gfx8- code from lower logical sends |
| - intel/brw: Remove Gfx8- code from generator |
| - intel/brw: Remove Gfx8- code from backend passes |
| - intel/brw: Remove Gfx8- code from EU compaction |
| - intel/brw: Remove Gfx8- code from IR performance analysis |
| - intel/brw: Remove Gfx8- code from EU emission |
| - intel/brw: Remove Gfx8- code from EU validation |
| - intel/brw: Remove Gfx8- code from NIR passes |
| - intel/brw: Remove Gfx4-5 manual compression selection |
| - intel/brw: Remove Gfx8- code from EU codegen helpers |
| - intel/brw: Remove Gfx8- code from NIR options |
| - intel/brw: Remove Gfx8- code from register type helpers |
| - intel/brw: Remove Gfx8- specific EU inst helpers |
| - intel/brw: Remove Gfx8- code from inst FC and F macros |
| - intel/brw: Replace inst F8 macro with F macro |
| - intel/brw: Remove Gfx8- code from inst F20 macros |
| - intel/brw: Remove Gfx8- code from inst FD20 and FV20 macros |
| - intel/brw: Remove Gfx8- code from inst FI macros |
| - intel/brw: Remove Gfx8- code from inst BRW_IA*_ADDR_IMM macros |
| - intel/brw: Remove Gfx8- code from inst FFDC, FDC and FD macros |
| - intel/brw: Update comments for FK macro |
| - intel/brw: Replace inst FF macro with F or F20 macros |
| - intel/brw: Remove F16TO32 and F32TO16 opcodes |
| - intel/brw: Remove Gfx8- code from builder |
| - intel/brw: Remove Gfx8- code from fs_inst |
| - intel/brw: Remove Gfx8- code from VUE map |
| - intel/brw: Remove Gfx8- code from SIMD lowering |
| - intel/brw: Remove Gfx8- code from visitor |
| - intel/brw: Remove Gfx8- remaining opcodes |
| - intel/brw: Remove MRF type |
| - intel/brw: Inline brw_nir_apply_sampler_key code |
| - intel/brw: Remove unused attrib workarounds |
| - intel/brw: Remove edgeflag_is_last VS parameter |
| - intel/brw: Remove Gfx8- fields from \*_prog_key structs |
| - intel/brw: Remove Gfx8- fields from \*_prog_data structs |
| - intel/brw: Use a single register set |
| - intel/brw: Remove runtime_check_aads_emit |
| - intel/brw: Remove automatic_exec_sizes |
| - intel/brw: Use fs_visitor instead of backend_shader in various passes |
| - intel/brw: Fold fs_instruction_scheduler into instruction_scheduler |
| - intel/brw: Change cfg_t to refer to fs_visitor |
| - intel/brw: Move dump_* functions into fs_visitor |
| - intel/brw: Fold backend_shader into fs_visitor |
| - intel/brw: Remove extra stage_prog_data field in fs_visitor |
| - intel/brw: Remove brw_shader.h |
| - intel/meson: Add dependencies for brw and elk |
| - intel/compiler: Remove nir_print_instr hack in disasm_info |
| - intel/brw: Use C++ for brw_disasm_info.c |
| - intel/brw: Hide the definition of cfg_t et al from C code |
| - intel/brw: Use fs_inst in cfg_t |
| - intel/brw: Use fs_inst explicitly in various passes |
| - intel/brw: Use fs_inst in disasm_annotate() |
| - intel/brw: Move functions from backend_instruction into fs_inst |
| - intel/brw: Fold backend_instruction into fs_inst |
| - intel/brw: Remove typedefs from fs_builder |
| - intel/brw: Fold backend_reg into fs_reg |
| - intel/brw: Simplify usage of reg immediate helpers |
| - intel/compiler: Fix SIMD lowering when instruction needs a larger SIMD |
| - intel/elk: Remove split sends |
| - intel/elk: Remove DPAS opcode |
| - intel/elk: Remove BTD and RT opcodes |
| - intel/elk: Remove DP4A opcode |
| - intel/elk: Remove ROR and ROL opcodes |
| - intel/elk: Remove IADD3 opcode |
| - intel/elk: Remove EU compaction logic for Gfx9+ |
| - intel/elk: Remove encoding for Gfx9+ |
| - intel/elk: Remove SYNC opcode and SWSB annotations |
| - intel/elk: Remove Gfx12 SFIDs and related LSC code |
| - intel/elk: Remove Gfx9+ sampler messages and modes |
| - intel/elk: Rename symbols for A64 OWord Block R/W messages |
| - intel/elk: Remove Gfx9+ dataport messages |
| - intel/elk: Remove FB_READ opcodes |
| - intel/elk: Remove Gfx12.5 URB message |
| - intel/elk: Remove ex_desc and ex_mlen from elk_inst |
| - intel/elk: Remove Xe2 logical sends lowering |
| - intel/elk: Remove unused sources from ELK_SHADER_OPCODE_SEND |
| - intel/elk: Remove unused SEND features |
| - intel/elk: Remove validation code for Gfx9+ |
| - intel/elk: Remove Gfx9+ from nir conversion |
| - intel/elk: Remove Gfx9+ from compile/run functions |
| - intel/elk: Remove FB_WRITE_LOGICAL_SRC_SRC_STENCIL |
| - intel/elk: Remove Gfx9+ from passes |
| - intel/elk: Remove Gfx9+ from thread payload |
| - intel/elk: Remove Gfx9+ from EU emission |
| - intel/elk: Remove coarse pixel handling |
| - intel/elk: Remove Gfx9+ from FS generator |
| - intel/elk: Remove Gfx9+ from Reg related code |
| - intel/elk: Remove Gfx9+ from asm grammar |
| - intel/elk: Remove Gfx9+ from disasm |
| - intel/elk: Remove Gfx9+ from NIR auxiliary code |
| - intel/elk: Remove use_tcs_multi_patch |
| - intel/elk: Remove Gfx9+-only passes |
| - intel/elk: Remove uses of intel_device_info_is_9lp() |
| - intel/elk: Remove remaining Gfx9+ code |
| - intel/elk: Remove multi-polygon support |
| - intel/elk: Clean up unused code in elk_compiler.h |
| - intel/brw: Use hstride instead of stride for accumulator |
| - intel/brw: Use helper to create accumulator register |
| - intel/brw: Fix validation of accumulator register |
| - anv: Enable VK_KHR_shader_maximal_reconvergence |
| - intel/tools: Make intel_stub_gpu work when using meson devenv |
| - intel/brw: Implement quad_vote_any and quad_vote_all |
| - intel/brw: Use predicates for quad_vote_any and quad_vote_all when available |
| - anv: Enable VK_KHR_shader_quad_control |
| - intel/brw: Handle Xe2 in brw_fs_opt_zero_samples |
| - intel/brw: Remove vestiges of sources on IF opcode, only valid on Gfx6 |
| - intel/brw: Add a src array for the common case in fs_inst |
| - intel/brw: Refactor FS validation macros |
| - intel/brw: Remove two duplicated validate calls in optimizer |
| - intel/brw: Move validate out of fs_visitor |
| - intel/brw: Support FIXED_GRF when generating code for CLUSTER_BROADCAST |
| - intel/brw: Lower VGRFs to FIXED_GRFs earlier |
| |
| Casey Bowman (1): |
| |
| - anv: Override VendorID for Hitman 3 |
| |
| Charlie Turner (2): |
| |
| - amd, radeonsi: Lower minimum supported video dimensions for AV1 |
| - {vulkan,radv,anv}/video: fix issue in H264 scaling lists derivation |
| |
| Chia-I Wu (7): |
| |
| - radv: fix pipeline stats mask |
| - meson: fix a build ereror |
| - radv: hide the sparse queue when radv_legacy_sparse_binding |
| - radv: hide the sparse queue on older kernels |
| - radv: set VK_SYNC_FEATURE_GPU_MULTI_WAIT |
| - aco: fix nir_op_pack_32_4x8 handling |
| - radv: fix 2d/3d image copy on compute queue |
| |
| Chris Rankin (4): |
| |
| - vdpau: Declare texture object as immutable using helper function. |
| - vdpau: Refactor query for video surface formats. |
| - meson: bump the minimal required vdpau version to 1.5 |
| - frontends/vdpau: Add support for VDPAU AV1 decoding. |
| |
| Christian Duerr (1): |
| |
| - panfrost: Fix dual-source blending |
| |
| Christian Gmeiner (100): |
| |
| - .gitignore: Add .venv folder |
| - etnaviv/isa: Add missing dep of encode.py/decode.py calls on isa.py |
| - isaspec: encode.py: Include assert.h |
| - isaspec: encode.py: Include util/log.h |
| - etnaviv: Remove no_oneconst_limit from etna_inst |
| - isaspec: encode: Constify encode.type |
| - isaspec: encode: Constify bitset_params |
| - etnaviv: Remove not used etna_assemble_set_imm(..) |
| - etnaviv: Fix how we determine the max supported number of varyings |
| - etnaviv: isa: Remove duplicate #instruction-alu-atomic |
| - etnaviv: isa: Add dsx and dsy opcodes |
| - etnaviv: isa: Add frc opcode |
| - etnaviv: isa: Add norm_dp2, norm_dp3 and norm_dp4 opcodes |
| - etnaviv: isa: Add bit_extract opcode |
| - etnaviv: isa: Correct dp2 opcode |
| - etnaviv: isa: Add branch_any opcode |
| - etnaviv: isa: Name cond enum value 22 |
| - etnaviv: isa: Add movai opcode |
| - etnaviv: isa: Add bit_rev opcode |
| - etnaviv: isa: Add texldb opcode |
| - etnaviv: isa: Add texldl opcode |
| - etnaviv: isa: Add texldd opcode |
| - etnaviv: isa: Remove note about GC3000 |
| - etnaviv: isa: Add div opcode |
| - etnaviv: isa: Reorder instructions |
| - etnaviv: isa: Rename reg_group u2 to u |
| - etnaviv: isa: Add internal register group |
| - etnaviv: isa: Add movar opcode |
| - etnaviv: isa: Move {TEX_SWIZ} |
| - etnaviv: isa: Correct SRC0_AMODE |
| - etnaviv: isa: Correct #instruction-cf-src1-src2 bitset name |
| - etnaviv: isa: Correct #instruction-alu-no-dst-maybe-src1-src2 name |
| - etnaviv: isa: Correct #instruction-alu-no-dst-has-src0-src1 expr name |
| - etnaviv: isa: Combine branch and branch_if |
| - etnaviv: isa: Support unary branch instruction |
| - etnaviv: isa: Support unary texkill instruction |
| - etnaviv: isa: Support multiple encodings for texldl |
| - etnaviv: isa: Fix #instruction-tex-src0-src1-src2 bitset |
| - etnaviv: isa: Support multiple encodings for texldb |
| - isaspec: Remove not used isa_decode_hook |
| - isaspec: deocde: Hide all the internals ISA details |
| - isaspec: decode: Add isa specific functions |
| - isaspec: decode: Make isa_decode_bitset(..) private |
| - freedreno/isa: Rework meson dependency for libir3decode |
| - etnaviv: isa: Rework meson dependency for libetnaviv_decode |
| - isaspec: deocde: Make isa_bitset arrays static |
| - isaspec: deocde: Make isa_decode_field(..) private |
| - isaspec: decode: Add libisaspec |
| - isaspec: deocde: Remove generic functions from public interface |
| - etnaviv: isa: Define a dontcare bit in atomic instructions |
| - etnaviv: isa: Add name attributes |
| - etnaviv: isa: Generate c header containing enums |
| - etnaviv: isa: Generate opcode enum |
| - etnaviv: isa: Add an empty libetnaviv_encode |
| - etnaviv: Link against libetnaviv_encode |
| - etnaviv: Move struct etna_inst to src/etnaviv |
| - etnaviv: isa: Make use of generated enums |
| - etnaviv: isa: Add rouding to etna_inst |
| - etnaviv: Set dst.use for MOVAR |
| - etnaviv: isa: Add encode support |
| - etnaviv: isa: Add isa_assemble_instruction(..) |
| - etnaviv: Switch to isa_assemble_instruction(..) |
| - etnaviv: Move swizzle related macros to scr/etnaviv |
| - etnaviv: Switch to macros from isa.h |
| - etnaviv: Remove isa.xml.h |
| - etnaviv: Do not set tex.amode for rounding |
| - ci/etnaviv: Remove duplicates |
| - ci/etnaviv: Do not skip tex-miplevel piglits |
| - etnaviv: Remove offline shader compiler |
| - etnaviv: Introduce common etna_core_info |
| - etnaviv: drm: Make use of etna_core_info |
| - etnaviv: drm: Add etna_gpu_get_core_info(..) |
| - etnaviv: Switch to etna_core_info |
| - etnaviv: Move hw header to common place |
| - etnaviv: Introduce etna_feature enum |
| - etnaviv: common: Add feature bitset |
| - etnaviv: drm: Initialize etna_core_info based on kernel features |
| - etnaviv: Switch to etna_core APIs |
| - etnaviv: drm: Query some id values in etna_gpu_new(..) |
| - etnaviv: hwdb: Import gc_feature_database from NXP |
| - etnaviv: hwdb: Import gc_feature_database from Amlogic |
| - etnaviv: hwdb: Import gc_feature_database from ST |
| - ci: Install python3-pycparser in build container |
| - etnaviv: hwdb: Generate hwdb.h |
| - etnaviv: hwdb: Add etna_query_feature_db(..) |
| - etnaviv: drm: Make use of hwdb |
| - etnaviv: common: Add enum etna_core_type |
| - etnaviv: common: Add some limit values |
| - etnaviv: hwdb: Fill limits |
| - etnaviv: drm: Fill limits |
| - etnaviv: Copy values from etna_core_info |
| - etnaviv: drm: Remove fallback value for ETNA_GPU_NUM_CONSTANTS |
| - etnaviv: Drop not needed check if seamless cube map is supported |
| - etnaviv: hwdb: Drop stdint.h dependency |
| - nvk: Remove duplicate DRM_NODE_RENDER check |
| - meson: Add missing newline at eof |
| - etnaviv: Switch to etna_core_disable_feature(..) |
| - etnaviv: Fix disabling of features |
| - etnaviv: drm: Drop NPU-related params |
| - clc: Always use spir for 32 bit |
| |
| Collabora's Gfx CI Team (4): |
| |
| - Uprev Piglit to e9316bcd12544aaf7e753ce37fe50d64165d9598 |
| - Uprev Piglit to 2a1c49a81cd9a6bf5d0c3a9b87225be94771ca96 |
| - Uprev Piglit to 1e631479c0b477006dd7561c55e06269d2878d8d |
| - Uprev Piglit to dd6f7eaf82e8dd442da28b346c236141cbcce0b1 |
| |
| Connor Abbott (56): |
| |
| - freedreno: Add a7xx crashdump-related registers and enums |
| - ir3/ra: Add specialized shared register RA/spilling |
| - ir3: Set branchstack earlier |
| - ir3: Rewrite (jp) and branchstack handling |
| - ir3: Calculate physical edges correctly |
| - ir3: Fix comment thinko |
| - ir3/ra: Fix bug with collect source handling |
| - tu: Add more info to ldg inline uniform path |
| - ir3/a7xx: Fix load_global_ir3 with immediate offset |
| - ir3: Initial support for pushing globals with ldg.k |
| - tu: Follow pipeline compatibility rules for dynamic descriptors |
| - tu: Reenable MSAA UBWC on a6xx gen1 |
| - tu: Enable UBWC for SNORM formats on a740+ |
| - tu: Enable UBWC for storage images on a7xx |
| - vk/graphics_state: Remove bogus assert in CmdSetSampleMaskEXT |
| - vk/graphics_state: Add stubs required by VK_EXT_shader_objects |
| - freedreno/afuc: Decode (peek) modifier |
| - freedreno/afuc: Add missing ALU encode case for bic |
| - freedreno/afuc: Bump max instructions for a7xx |
| - freedreno/afuc: Fix setbit/clrbit parsing |
| - freedreno/afuc: Use left recursion in parser |
| - freedreno/afuc: Improve jump table handling |
| - freedreno/afuc: Add .align directive |
| - freedreno/afuc: Add more general T_IDENTIFIER in lexer |
| - freedreno/afuc: Add support for multiple sections when assembling |
| - freedreno/afuc: Allow -e option on a7xx |
| - freedreno/afuc: Emulate THREAD_SYNC on a660 |
| - freedreno/afuc: Run entire bootstrap routine |
| - freedreno/afuc: Add a7xx test case |
| - freedreno/afuc: Add magic control reg values for a740 |
| - freedreno/afuc: Add section on reassembling firmwares and relocations |
| - freedreno/a7xx: Add CP_CCHE_INVALIDATE |
| - tu: Implement CCHE invalidation |
| - nir/divergence_analysis: Add ir3-specific intrinsics |
| - nir/divergence_analysis: Add uniform_load_tears option |
| - nir/divergence_analysis: Fix load_view_index divergence in VS |
| - ir3: Allow single-predecessor phis |
| - ir3: Run divergence analysis at the end |
| - ir3: Remove loop shared copy check |
| - ir3: Use divergence analysis for (jp) and physical CFG |
| - freedreno/afuc: Switch to using the GPU ID in the firmware |
| - freedreno/afuc: Add a7xx new-style branch instructions |
| - freedreno/afuc: Add initial support for a750 |
| - freedreno: Make has_ibo_ubwc a7xx specific |
| - freedreno,tu: Disable UBWC for storage images on a750 |
| - ir3/legalize: Fix intra-block state propagation with loops |
| - ir3: Rewrite nop insertion |
| - docs/android: Fix example meson cross file |
| - docs/android: Improve instructions for replacing driver |
| - ir3: Don't use non-contiguous component masks for FS |
| - ir3: Don't pack FS inlocs |
| - freedreno/a7xx: Register updates from kgsl |
| - ir3: Add scan_clusters_macro to ir3_valid_flags() |
| - ir3: Add scan_clusters.macro to is_subgroup_cond_mov_macro() |
| - ir3/ra: Don't demote movmsk instructions to non-shared |
| - docs/android: Fix example meson cross file again |
| |
| Constantine Shablia (6): |
| |
| - panvk: implement vkGetBufferDeviceAddress |
| - panvk: advertise bufferDeviceAddress |
| - vulkan/runtime: fix typo |
| - mesa: fix typo |
| - pan/bi: fix 1D array tex coord lowering |
| - panfrost: report correct MAX_VARYINGS |
| |
| Corentin Noël (37): |
| |
| - zink: Avoid the use of negative array offsets |
| - zink: Use memmove when dealing with overlapping memory |
| - glsl: Make sure to not cast ir_dereference_variable into ir_variable |
| - glsl: Make sure that the variable is a ir_variable before unreferencing it |
| - zink: Initialize zink_shader_object |
| - zink: Initialize zink_bindless_descriptor to zero on creation |
| - zink: Initialize pipe_query_result |
| - zink: Do not shadow the variable ret |
| - zink: Avoid variable shadowing everywhere |
| - zink: Only call reapply_color_write if EXT_color_write_enable is available |
| - ci_run_n_monitor: Allow the upstream format to not exist |
| - zink: use symbolic values instead of 0 |
| - zink: do not use undefined stage mask if on missing KHR_synchronization2 |
| - glsl: Ensure that we are dealing with ir_variable and ir_rvalue |
| - venus: sync protocol for VK_EXT_attachment_feedback_loop_layout |
| - venus: enable VK_EXT_attachment_feedback_loop_layout |
| - zink: Return early if the file descriptor could not have been duplicated/acquired |
| - ci: Update virglrenderer and crosvm |
| - zink: Make wrap_surface return a zink_ctx_surface directly |
| - zink: Use an intermediary variable for create_surface |
| - zink: Separate the template from the wrapped surface |
| - zink: Return early if the source could not have been acquired |
| - zink: Move zink_surface_destroy before zink_create_surface |
| - zink: Make sure to not leak anything on surface creation failure |
| - zink: Change zink_get_surface to return a zink_surface |
| - zink: Add error logging on surface creation failure |
| - st_pbo/compute: Use the correct structure type when allocating a specialized key |
| - zink: Make sure to initialize all the fields of VkMemoryBarrier |
| - dri/kopper: Assume a non-null drawable in flush_frontbuffer |
| - zink: Removed unused function |
| - zink: Removed unused num_texel_buffers member |
| - zink: Removed unused push_valid member |
| - zink: Remove ctx from zink_gfx_program |
| - ci: Change propagated variables into an array |
| - ci: Add VK_DRIVER_FILES passthrough from jobs to tests |
| - ci: Allow to pass LIBGL_ALWAYS_SOFTWARE to the guest environment |
| - ci: Add zink-venus-lvp job |
| |
| Daniel Almeida (17): |
| |
| - nak/sm50 add support for suld |
| - nak/sm50: add support for suatom |
| - nak/sm50: add support for isberd |
| - nak: sm50: add support for OpOut |
| - nak: sm50: fadd: ensure src[0] is in a register |
| - nak/sm50: legalize: display instruction on panic |
| - nak/sm50: add support for brev |
| - nak: sm50: fix some legalization issues |
| - nak/sm50: add a memstream abstraction |
| - nak/sm50: add an annotate debug flag |
| - nak/sm50: support annotations through OpAnnotate |
| - nak/sm50: sprinkle OpAnnotate in optimization passes |
| - meson,ci: Add the paste crate |
| - nil: Add the start of a Rust library |
| - nil: Rewrite nil_format in rust |
| - nil: Re-implement nil_image in Rust |
| - nil: Rewrite the TIC code in Rust |
| |
| Daniel Schürmann (52): |
| |
| - aco/insert_exec_mask: unify exec restore code after divergent control flow |
| - aco/insert_exec_mask: replace phi for loop restore mask with explicit copies |
| - aco/insert_exec_mask: only create loop phis for exec mask if necessary |
| - aco: give spiller more room to assign spilled SGPRs to VGPRs |
| - spirv: Fix SpvOpExpectKHR |
| - vulkan: enable VK_KHR_shader_expect_assume |
| - spirv: Update headers and grammar JSON |
| - aco/insert_exec_mask: Fix unconditional demote at top-level control flow. |
| - aco/insert_exec_mask: tiny refactor |
| - aco: always terminate quads if they have been demoted entirely |
| - aco/insert_exec_mask: Reduce latency when switching to WQM. |
| - spirv: implement SPV_KHR_maximal_reconvergence |
| - aco: enable WQM if demote is used with maximal reconvergence |
| - radv: enable VK_KHR_shader_maximal_reconvergence |
| - spirv: implement SPV_KHR_quad_control |
| - radv: enable VK_KHR_shader_quad_control |
| - radv: fix initialization of radv_shader_layout->use_dynamic_descriptors |
| - aco: rematerialize constants in every basic block during optimizer |
| - aco: reorder code and use namespaces in aco_interface.cpp |
| - aco/util: small_vec few additions |
| - aco: use small_vec as Block::edge_vec for predecessors and successors |
| - aco/spill: refactor SSA repairing |
| - aco/spill: don't allocate extra spill_id for phi operands in add_coupling_code() |
| - aco/spill: add spills_entry interferences only when necessary |
| - aco/spill: refactor adding spilled vars into separate function add_to_spills() |
| - aco/spill: keep live-out variables spilled at branch blocks |
| - aco/spill: don't prefer to spill phis at merge blocks |
| - aco/spill: add interferences with variables spilled at loop headers |
| - aco/spill: avoid re-spilling loop-carried variables in process_block() |
| - aco/spill: avoid re-spilling loop-carried variables in add_coupling_code() |
| - aco/spill: keep loop-carried variables spilled at loop headers |
| - aco/spill: keep loop-carried variables spilled at merge blocks |
| - aco/spill: select more loop-carried variables to be spilled |
| - aco/spill: keep loop variables spilled during nested loops |
| - aco: use instr_class::branch to identify SOPP branches |
| - aco: remove SOPP_instruction::block member |
| - aco: unify different SALU types into single struct SALU_instruction |
| - aco/builder: use accessor functions instead of casting to subtypes |
| - aco: change return type of create_instruction() to Instruction* |
| - aco: defer instruction size from aco::Format in create_instruction() |
| - aco: remove create_instruction() template parameter |
| - aco: move create_instruction() to aco_ir.cpp |
| - aco/spill: Fix assertion for nested loops |
| - aco/spill: pass live_vars to spill_ctx |
| - aco/spill: compute live-in variables from live-out |
| - aco/spill: maintain valid live vars at any point |
| - aco/spill: use live variables instead of next_use_distances in add_coupling_code() |
| - aco/spill: gather information about average use distances |
| - aco/spill: use average use distances in process_block() |
| - aco/spill: use average use distances in init_live_in_vars() for merge blocks |
| - aco/spill: use average use distances to spill loop variables |
| - aco/ra: fix kill flags after renaming fixed Operands |
| |
| Daniel Stone (50): |
| |
| - egl: Return BAD_CONFIG when robust access unsupported |
| - st/dri: Use correct pipe_resource for GL texture image export |
| - dri: Redeclare __DRI_IMAGE_FORMAT_* as PIPE_FORMAT_* |
| - st/dri2: Remove __DRI_IMAGE_FORMAT conversion |
| - st/dri2: Pass pipe_format to driCreateConfigs |
| - st/dri2: Use u_format to get config format information |
| - util: Add util_format_get_component_shift |
| - st/dri: Remove format tables from driCreateConfigs |
| - st/dri: Completely remove mesa_format from config setup |
| - st/dri: Add transient HAS_ZS() helper |
| - st/dri: Rework depth/stencil format selection |
| - st/dri: Use pipe_format for Z/S modes |
| - st/dri: Check format properties from format helpers |
| - st/dri: Store pipe_format in gl_config |
| - egl/wayland: Remove format-query fallback |
| - st/dri: Reuse stored renderbuffer format |
| - st/dri: Reuse stored texture format |
| - dri/kopper: Move format -> FourCC translation up a level |
| - dri/kopper: Add translations for sRGB formats |
| - dri/kopper: Reorder format tables |
| - dri/kopper: Flatten pipe_format/DRIImage/FourCC conversion |
| - egl/wayland: Query image FourCC for linear copies |
| - egl/wayland: s/DRI_IMAGE_FORMAT/pipe_format/g |
| - egl/wayland: Add opaque-equivalent FourCCs |
| - egl/wayland: Fix EGL_EXT_present_opaque |
| - egl/wayland: Use pipe_format to look up configs |
| - egl/wayland: Use FourCC to look up wl_buffer support |
| - egl/wayland: Add helper to check server format support |
| - egl/wayland: Use helper to look up visual |
| - egl/wayland: Eliminate double loop for configs |
| - egl/wayland: Simplify alternate-format fallback for configs |
| - egl/wayland: Remove WL_SHM_* format listings |
| - egl/wayland: Use pipe_format for format names |
| - egl/wayland: Remove shift/size masks |
| - egl: Fail display creation if no EGLConfigs created |
| - egl/wayland: Remove check for EGLConfig presence |
| - egl/gbm: Remove check for EGLConfig presence |
| - egl/x11: Remove check for EGLConfig presence |
| - egl/android: Remove check for EGLConfig presence |
| - egl/{surfaceless,device}: Remove check for EGLConfig presence |
| - egl: Automatically set EGLConfig ID |
| - egl: Use pipe_format for pbuffer configs |
| - gbm/dri: Query DRIImage for FourCC directly |
| - gbm: Remove hardcoded color-channel data |
| - egl/android: Remove hard-coded color-channel data |
| - egl/x11: Compare config shifts/sizes locally |
| - egl: Remove shifts/sizes from dri2_add_config argument |
| - st/dri: Use pipe_format from config directly |
| - egl/dri: Use pipe_format instead of DRI_IMAGE_FORMAT |
| - egl/wayland: Remove EGL_WL_create_wayland_buffer_from_image |
| |
| Danylo Piliaiev (37): |
| |
| - freedreno/replay: Delete all buffers after each submission |
| - freedreno/replay: Correctly free iova on msm backend |
| - freedreno/replay: Add WSL backend for Windows |
| - ir3: Fix "print" meta instruction synchronization |
| - ir3: Add fullsync and fullnop ir3 dbg options for over-syncing |
| - freedreno/replay: Make meta "print" instruction take any number of regs |
| - tu: Do not print anything on systems without Adreno GPU |
| - tu/a7xx: Make A7XX_RB_UNKNOWN_8E06 value configurable per-gen |
| - tu: Define and set to zero all SP_*_VGPR_CONFIG regs |
| - ir3: Add ldg.k instruction |
| - tu/a7xx: Correctly set A7XX_HLSQ_UNKNOWN_A9AE.SYSVAL_REGS_COUNT |
| - tu/a7xx: Do not preload shaders, HW does it by default |
| - tu: Use SS6_INDIRECT consts upload path for 3d blits |
| - turnip,ir3/a750: Implement consts loading via preamble |
| - tu: Use SS6_INDIRECT for VS params |
| - turnip,ir3/a750: Implement inline uniforms via ldg.k |
| - tu/a750: Consider vertex attr buff in gmem allocation |
| - freedreno,tu: Move varying interp and varying repl modes to xml |
| - freedreno/devices: Update magic regs for a7xx |
| - tu: Exclude more a7xx regs from stomping |
| - tu: Add workaround for D3D11 games accessing UBO out of bounds |
| - tu/a7xx: Write even more magic regs to fix rendering issues on Android |
| - tu: Do not emit zero-sized fs params |
| - freedreno/a7xx: Fix base_align for non-UBWC depth-stencil |
| - tu/autotuner: Use CP_EVENT_WRITE7 for submission fence |
| - tu: Update prim restart state when we switch from/to indexed draw |
| - tu: Fix dynamic state not always being emitted |
| - meson: Correctly get sizeof_pointer with cross-compilers |
| - freedreno/devices: Do not write to 8E79 on a750, KGSL has it protected |
| - freedreno/replay: Use real queueid for submissions and waits |
| - freedreno,tu/a7xx: Add PC_TESS_PARAM_SIZE and PC_TESS_FACTOR_SIZE |
| - tu: Update RP state depending on pipeline in first RP draw |
| - tu: Emit non-draw-state state at the first draw call |
| - freedreno/devices: Add A740v3 from Quest 3 |
| - util/vma: Add function to get max continuous free size |
| - freedreno/replay: Allocated maximum available size for cs overriding |
| - ir3: Do not set clip/cull mask if no one writes clip/cull |
| |
| Dario Mylonopoulos (1): |
| |
| - llvmpipe: fixed race condition in lp_rast_destroy that causes a crash on windows |
| |
| Dave Airlie (46): |
| |
| - vulkan/video: drop unused function. |
| - vulkan/video: rename some of the parameter tracking structs. |
| - vulkan/video: start to wrap the video structs for deep copies. |
| - vulkan/video: start deep copying the parameters structures |
| - vulkan/video: constify the encoding apis. |
| - radv/video: refactor sq start/end code to avoid decode hangs. |
| - radv: don't submit empty command buffers on encoder ring. |
| - gallivm: fix coroutines with llvm 18 |
| - gallivm: passing fp16_split_fp64 to fp16 lowering. |
| - nvk: allow 3d compressed textures |
| - nvk: mem cannot be null in binding buffers/images. |
| - zink: use sparse residency for buffers. |
| - vulkan: update registry/includes to 1.3.277 |
| - vulkan/video: add AV1 decode support to common code |
| - radv: fix correct padding on uvd |
| - radv: init decoder ip block earlier. |
| - radv/uvd: uvd kernel checks for full dpb allocation. |
| - radv: don't submit 0 length on UVD either. |
| - egl: don't bind zink under dri2/3 |
| - glx/dri3: handle zink fallback if loader picks it. |
| - loader: handle picking zink for nouveau for certain GPUs. |
| - nouveau/winsys: fix bda heap leak. |
| - nvk: fix dri options leak. |
| - egl/dri2: if zink is preferred from dri3 skip dri2 paths. |
| - radv/video: fix filling out decode operations. |
| - radv/video: use vcn ip version in more places. |
| - radv: rename it_ptr to it_probs_ptr in advance of adding av1 |
| - radv/video: use proper struct sizes for decoder structs. |
| - radv/video: add VK_KHR_video_decode_av1 support. |
| - nvk: free leaked cmd_buffer descriptors state. |
| - nvk: only unmap heap bos that were mapped |
| - nvk: enable a mappable bar heap when rebar is disabled. |
| - radv/video: fix h265 decode with unaligned w/h |
| - mesa: reorder st context teardown |
| - vulkan/video: copy the profile over for h264 encode. |
| - radv/video: export unified queue header/tail functions. |
| - radv: add direct cs emit for a dword. |
| - radv: add encoder queue support pieces and encoder queries. |
| - radv/video: add parameter patching calls. |
| - radv/video: add initial support for encoding with h264. |
| - radv/video: add h265 encode support |
| - radv/video: enable video encoding behind perftest flag |
| - radv/video: handle encode control parameters better. |
| - radv/video: don't advertise timestamp bits for decode/encode |
| - egl/dri2: don't bind dri2 for zink |
| - radv/video/encode: fix quality params on v2 hw. |
| |
| David (Ming Qiang) Wu (1): |
| |
| - frontends/va: make vlVaSyncSurface blocking |
| |
| David Heidelberg (57): |
| |
| - ci/deqp: uprev deqp-runner for Linux too to 0.18.0 |
| - ci/lima: update expectations, failing tests are being skipped |
| - ci: bump kernel to 6.6.12, modularize i915, add Transparent Huge Pages |
| - ci: shorter kernel tag, included Vivante NPU patches |
| - ci: disable Valve farm in Keywords |
| - ci: bump libdrm to 2.4.120 |
| - ci/VK-GL-CTS: add patches to fix dEQP-VK.glsl.derivate crashes |
| - ci: Valve farm (Keywords location) works again |
| - meson: upgrade zlib wrap to 1.3.1 |
| - util: use crc32_z instead of crc32 and bump zlib dep to 1.2.9 |
| - ci: bump kernel to 6.6.16 + enable X2APIC |
| - ci/freedreno: add fail found by new Piglit |
| - ci/etnaviv: update expectations |
| - ci: temporarily disable Collabora farm |
| - ci: enable Collabora farm |
| - ci: re-enable Collabora farm after maintenance |
| - ci/intel: decompose anv-tgl-test so we can specify custom devices for TGL |
| - ci/intel: add acer-cp514-2h-11{30,60}g7-volteer |
| - ci/intel: move machine definition to the intel-tgl-skqp job |
| - ci/intel: split asus-cx9400-volteer into acer-cp514-2h-11{30,60}g7-volteer |
| - drm-shim: Avoid invalid file and time bits combination |
| - intel/tools: avoid invalid time and file bits combination |
| - ci/deqp: backport Implement support for the EGL_EXT_config_select_group extension GL-CTS patch |
| - ci/freedreno: update expectations comment |
| - ci/deqp: add EGL patch for correct suite (GLES, not GL) |
| - nine: convert licenses block to SPDX |
| - nine: fill missing licenses headers and copyrights |
| - nine: drop useless and a bit too long line |
| - ci: uprev kernel to 6.6.21 |
| - ci/freedreno: disable workarounds for Adreno 618, 630, and 660 |
| - ci/freedreno: mark fails resolved by "drm/msm/gem: Add metadata uapi" |
| - ci: reduce irrelevant output to a simple list of libraries |
| - util: move gen_zipped_file into generic util and rename to gen_zipped_xml_file |
| - ci/r300: implement rules for d3d9 testing |
| - ci/svga: add missed test and gl-rules include |
| - r300: convert to SPDX license block and fix small typos |
| - r300: add missing licence to the r300_public.h |
| - r300: add missing copyright header |
| - docs: we support EGL 1.5 for a long time |
| - ci/amd: meld radv-traces into radv-raven-traces |
| - ci/amd: drop old PIGLIT_REPLAY_DESCRIPTION_FILE surpassed by PIGLIT_TRACES_FILE |
| - frontend/nine: fix typos |
| - r600: update licensing to SPDX header |
| - r600: add license header to r600_formats.h |
| - r600: add license info to the r600_opcodes.h |
| - r600: add license information to the sfn_shader_gs.h |
| - r600: fix typos |
| - ci: disable sona devices, all devices are offline |
| - ci/intel: sona device_type is back online |
| - ci: temporarily disable Android test builds |
| - ci: disable Igalia farm |
| - meson: implement split-debug |
| - freedreno/ci: move the disabled jobs from include to the main file |
| - ci/deqp: correct EGL_EXT_config_select_group detection |
| - egl/x11: Move RGBA visuals in the second config selection group |
| - winsys/i915: depends on intel_wa.h |
| - subprojects: uprev perfetto to v45.0 |
| |
| David Rosca (31): |
| |
| - radeonsi/vcn: Fix H264 slice header when encoding I frames |
| - frontends/va: Fix updating AV1 rate control parameters |
| - radeonsi/vcn: Don't reinitialize encode session on bitrate/fps change |
| - frontends/va: Only set VP9 segmentation fields when segmentation is enabled |
| - frontends/va: Separate QP for I/P/B frames |
| - radeonsi/vcn: Use temporal_layer_index to select temporal layer |
| - radeonsi/vcn: Implement separate QP for I/P/B frames |
| - radv/video: Set maxActiveReferencePictures to 16 for H264/5 |
| - frontends/vdpau: Fix cdef strengths and lr_unit_shift in AV1 decode |
| - frontends/vdpau: Support creating VDP_CHROMA_TYPE_420_16 surfaces |
| - radv/video: Fix setting slice QP |
| - radv/video: Set correct bitstream buffer size |
| - radv/video: Set VBV buffer size and level |
| - radv/video: Select temporal layer when encoding each frame |
| - radv/video: Set maxSublayerCount to 4 for H265 |
| - radv/video: Avoid resetting rate control every frame |
| - radv/video: Implement per picture type min/max QP |
| - radv/video: Set correct bit depth and format for 10bit input |
| - radv/video: Check encode profiles and bit depth in capabilities query |
| - radv/video: Report maxBitrate in encode capabilities |
| - radeonsi/vcn: Allocate session buffer in VRAM |
| - radeonsi/vcn: Fix 10bit HEVC VPS general_profile_compatibility_flags |
| - radeonsi/vcn: Only enable VBAQ with rate control mode |
| - frontends/va: Fix AV1 slice_data_offset with multiple slice data buffers |
| - Revert "radeonsi/vcn: AV1 skip the redundant bs resize" |
| - frontends/va: Only increment slice offset after first slice parameters |
| - radeonsi: Update buffer for other planes in si_alloc_resource |
| - frontends/va: Store slice types for H264 decode |
| - radeonsi/vcn: Ensure DPB has as many buffers as references |
| - radeonsi/vcn: Allow duplicate buffers in DPB |
| - radeonsi/vcn: Ensure at least one reference for H264 P/B frames |
| |
| David Stern (1): |
| |
| - vulkan/wsi/x11: Explicitly discard errors from xcb_present_pixmap. |
| |
| David Tobolik (1): |
| |
| - rusticl: implement cl_khr_suggested_local_work_size |
| |
| Derek Foreman (10): |
| |
| - egl/wayland: Fix possible buffer leak |
| - loader/wayland: Add named queue fallback |
| - egl/wayland: Give names to our Wayland event queues |
| - vulkan/wsi/wayland: Give names to our Wayland event queues |
| - vulkan/wsi/wayland: Remove confusing comment |
| - vulkan/wsi/wayland: Adjust presentation id locking |
| - vulkan/wsi/wayland: Use wl_display_dispatch_queue_timeout |
| - vulkan/wsi/wayland: More descriptive name for swapchain queue |
| - vulkan/wsi/wayland: Fix use after free |
| - vulkan/wsi/wayland: Remove unused get_min_image_count_for_mode_group |
| |
| Dmitry Baryshkov (11): |
| |
| - freedreno/drm: don't crash for unsupported devices |
| - freedreno/regs: define the wide bus enable bit in DSI_VID_CFG0 |
| - freedreno/registers: fix generation dependencies |
| - freedreno/registers: add missing copyright imports |
| - freedreno/registers: inline mdp4_csc group |
| - freedreno/registers: fix WB doffsets array in mdp5.xml |
| - freedreno/registers: support processing display display headers |
| - freedreno/registers: limit the rules schema |
| - freedreno/registers: drop unsupported features from schema |
| - freedreno/rnn: drop headergen2 |
| - freedreno/rnn: drop custom aprintf function |
| |
| Dmitry Osipenko (2): |
| |
| - virtio/vdrm: Fix lockup in vdrm_host_sync() |
| - iris: Use Mesa internal drm-uapi headers |
| |
| Dylan Baker (8): |
| |
| - intel/vulkan: assume() that we don't use "ISL_NUM_FORMATS" |
| - intel/hasvk: assume() we don't get ISL_NUM_FORMATS |
| - meson: drop intel-cl deprecation of 'false' |
| - meson: rework intel-rt option to be a feature |
| - meson: Allow building intel-clc for the host if it can be run |
| - intel/brw: track last successful pass and leave the loop early |
| - nvk: drop meson version check that is always true |
| - nouveau: require cbindgen >= 0.25 |
| |
| Echo J (9): |
| |
| - nvk: Set ICD version to 1.3 |
| - nvk: Implement the VR-related display extensions |
| - nak: Rip out a few dead_code statements |
| - nvk: Add NVK to the Vulkan device name |
| - nvk: Advertise VK_VALVE_mutable_descriptor_type |
| - nvk: Implement calibrated timestamps |
| - vulkan: Add implicit pipeline caching support |
| - nvk: Use implicit pipeline cache |
| - nvk: Don't advertise residencyAlignedMipSize on MaxwellB+ |
| |
| Emma Anholt (2): |
| |
| - ci: Add full-run xfails missed in the 1.3.7.0 CTS update. |
| - ci: Disable VK full runs that time out since 1.3.7.0 (hasvk, anv-tgl, a630) |
| |
| Emmanuel Vadot (1): |
| |
| - util: Allow kcmp on FreeBSD |
| |
| Eric Engestrom (282): |
| |
| - VERSION: bump to 24.1 |
| - docs: reset new_features.txt |
| - docs: update calendar for 24.0.0-rc1 |
| - ci: make sure we evaluate the python-test rules first |
| - docs: fix syntax highlighting on non-code text snippet |
| - docs: fix syntax highlighting on shell commands |
| - ci/deqp: ensure that in \`default` builds, wayland + x11 + xcb are all built |
| - zink+anv/ci: add known failures |
| - ci: fix job dependency error in MRs for bin/ci/* scripts |
| - nouveau/ci: don't run nouveau (gl) tests on nvk changes |
| - amd/ci: simplify deqp config |
| - amd/ci: add flakes seen today |
| - docs: update calendar for 24.0.0-rc2 |
| - zink+radv/ci: drop duplicates flakes lines |
| - CODEOWNERS: add myself as as person of contact for CI changes |
| - CODEOWNERS: remove myself as a person of contact for a few things |
| - radv/ci: sort navi21 flakes |
| - amd/ci: add flakes seen today |
| - amd/ci: consider much more of dEQP-VK.query_pool.statistics_query.host_query_reset.* to be flaky |
| - r300/ci: add flakes |
| - ci/deqp: backport fix for zlib.net not allowing tarball download anymore |
| - rpi3/ci: update piglit & deqp expectations |
| - rpi4/ci: skip more of the dEQP-VK.ssbo.phys.layout.* tests that timeout occasionally |
| - rpi3/ci: add flake seen today |
| - rpi4/ci: add timeouts seen today |
| - rpi5/ci: add flake seen today |
| - docs: add release notes for 23.3.4 |
| - docs: update calendar for 23.3.4 |
| - docs: add sha256sum for 23.3.4 |
| - docs: update calendar for 24.0.0-rc3 |
| - ci_run_n_monitor: drop always-true condition |
| - ci_run_n_monitor: allow passing multiple targets |
| - ci/deqp: fix default target check when target is not specified |
| - ci/deqp: simplify version log dump |
| - ci/deqp: avoid storing the huge list of vk tests on android builds |
| - ci/deqp: move editable part to the top of the file |
| - ci/deqp: split vk and gl builds |
| - ci/deqp: drop the implicit DEQP_TARGET; explicitly set \`default` in VK builds |
| - ci/deqp: only compile EGL tests in GL builds, not VK builds |
| - ci/deqp: only compile the test binaries that are relevant to the build |
| - ci/deqp: only keep the mustpass lists that are relevant to the build |
| - ci: bump the image tags to rebuild all the deqp variants |
| - Revert "bin/ci: Add GitLab basic token validation" |
| - Reapply "bin/ci: Add GitLab basic token validation" |
| - util: rename __check_suid() to __normal_user() |
| - tree-wide: use __normal_user() everywhere instead of writing the check manually |
| - zink+anv/ci: add a couple more flakes |
| - util: simplify logic in __normal_user() |
| - util: check for setgid() as well in __normal_user() |
| - ci: always skip dEQP-VK.info.device_extensions |
| - vk/util: fix 'beta' check for physical device features |
| - vk/util: fix 'beta' check for physical device properties |
| - ci: when specifying a driver remove all other ones |
| - docs: update calendar for 24.0.0 |
| - docs: add release notes for 24.0.0 |
| - docs: add sha256sum for 24.0.0 |
| - docs/release-calendar: add planned 24.0.x bugfix releases |
| - docs: add release notes for 23.3.5 |
| - docs: update calendar for 23.3.5 |
| - docs: add sha256sum for 23.3.5 |
| - v3d-rpi4-gl: reduce the parallelism from 10 to 8 |
| - docs/calendar: add 24.1 branchpoint and release schedule |
| - ci: drop dash in image tags dates |
| - ci: enforce maximum image tag length |
| - ci: reduce maximum image tags length from 30 to 20 |
| - ci: explain purpose of the word after the date in image tags |
| - panfrost: fix UB caused by shifting signed int too far |
| - ci_run_n_monitor: avoid spamming a ton of "new status: created" for all the jobs at the beginning |
| - ci: build panvk in debian-vulkan job |
| - nouveau/tests: fix null dereference |
| - ci: build nvk in debian-vulkan job |
| - v3dv/ci: test the WSI on rpi4 and rpi5 |
| - radv: enable VK_EXT_headless_surface on all platforms except Windows |
| - v3dv: enable VK_EXT_headless_surface on all platforms except Windows |
| - tu: enable VK_EXT_headless_surface on all platforms except Windows |
| - anv: enable VK_EXT_headless_surface on all platforms except Windows |
| - hasvk: enable VK_EXT_headless_surface on all platforms except Windows |
| - dzn: enable VK_EXT_headless_surface on all platforms except Windows |
| - nvk: enable VK_EXT_headless_surface on all platforms except Windows |
| - panvk: enable VK_EXT_headless_surface on all platforms except Windows |
| - vn: enable VK_EXT_headless_surface on all platforms except Windows |
| - lvp: enable VK_EXT_headless_surface on all platforms except Windows |
| - pvr: enable VK_EXT_headless_surface on all platforms except Windows |
| - ci_run_n_monitor: warn user if they forgot to push the branch |
| - ci_run_n_monitor: add some types for gitlab objects |
| - ci_run_n_monitor: update job when it goes through enable_job() |
| - ci_run_n_monitor: add method to get a pipeline job by its id |
| - ci_run_n_monitor: track new job when retrying a job |
| - ci_run_n_monitor: refresh job state when starting it |
| - gitlab_gql: print error returned by server in --print-merged-yaml |
| - ci_run_n_monitor: implicitly include \`parallel:` jobs |
| - ci_run_n_monitor: print the target regex before adding the X/N bit |
| - docs: add release notes for 24.0.1 |
| - docs: add sha256sum for 24.0.1 |
| - docs: add release notes for 23.3.6 |
| - docs: update calendar for 23.3.6 |
| - docs: add sha256sum for 23.3.6 |
| - docs: update calendar for 24.0.1 |
| - ci_run_n_monitor: explain why/when there might be no tracked remote |
| - ci_run_n_monitor: allow detached heads as well |
| - docs: add release notes for 24.0.2 |
| - docs: add sha256sum for 24.0.2 |
| - docs: update calendar for 24.0.2 |
| - ci_run_n_monitor: fix handling of optional jobs again |
| - ci_run_n_monitor: read job logs as utf-8 |
| - vk/util: trivial cleanups in vk_icd_gen.py |
| - vk/util: print a nice error in vk_icd_gen.py when VK_HEADER_VERSION is not defined |
| - ci/android: use a specific version of android-cuttlefish |
| - ci: document which image tags to bump when touching build-mold.sh |
| - ci: uprev mold to the latest release |
| - ci/image-tags: move KERNEL_ROOTFS_TAG to group the test images together |
| - ci/deqp: only apply the android patches to the android build |
| - ci/deqp: build deqp-egl using mold as well |
| - ci/deqp: make deql-egl for android less of a special case |
| - ci/deqp: control the GL release independently of VK |
| - ci/deqp: control the GLES release independently of GL |
| - r300/ci: group tex-miplevel-selection flakes together |
| - r300/ci: add another tex-miplevel-selection flake |
| - iris/ci: add pbuffer flakes for amly, same as apl and glk |
| - panfrost/ci: skip dEQP-GLES31.functional.copy_image.non_compressed.* on t760 as they hang |
| - rpi3/ci: update expectations for vc4-rpi3-gl-piglit-full:arm32 2/4 |
| - freedreno/ci: add another a618 flake |
| - zink+anv: update expectations |
| - r300/ci: add flakes |
| - radeonsi/ci: add vangogh piglit flake |
| - zink+radv: update navi31 expectations (one test fixed) |
| - softpipe: update expectations |
| - ci/deqp: drop zlib url patch |
| - ci/deqp: split vk/gl/gles patches |
| - ci/deqp-runner: inline never-used DEQP_VARIANT variable |
| - ci/deqp: use the proper gl/gles releases for deqp-gl*, deqp-gles*, deqp-egl |
| - ci/venus-lavapipe: drop unused DEQP_VER that's being overwritten by DEQP_SUITE anyway |
| - ci/lavapipe: fold \`DEQP_VER: vk` and drop .deqp-test-vk |
| - docs: delay 24.1 branchpoint by 2 weeks |
| - vk/update-aliases: drop VK_ERROR\_ prefix substitution |
| - ci/deqp-runner: do a release build instead of debug |
| - ci/deqp-runner: set android rust target in the caller (debian/x86_64_test-android.sh) |
| - ci/deqp-runner: bring "install from crate" & "install from git" to feature parity |
| - ci/deqp-runner: update repo url |
| - ci/deqp-runner: fix list of image tags to update |
| - ci/image-tags: re-generate all the images building deqp-runner |
| - docs: add release notes for 24.0.3 |
| - docs: add sha256sum for 24.0.3 |
| - ci/deqp: document which build produces which binary |
| - ci: include all the src/\**/gitlab-ci.yml files |
| - nouveau: add missing vl lib |
| - nouveau/ci: fix yaml indentation |
| - nouveau/ci: only trigger jobs for relevant changes |
| - Revert "nouveau: add missing vl lib" |
| - ci/deqp: backport fix for dEQP-VK.wsi.direct_drm.* bug |
| - vc4/ci: add flake |
| - radeonsi/ci: udpate expected failures |
| - r300: mark new fails |
| - v3dv/ci: update expectations |
| - v3d/ci: mark spec\@ext_framebuffer_blit\@fbo-blit-check-limits as fixed |
| - vc4/ci: add another \`spec\@!opengl 1.1\@depthstencil-default_fb-drawpixels` flake |
| - vc4/ci: add another \`spec\@arb_vertex_buffer_object\@vbo-subdata-many draw` flake |
| - v3dv/ci: mark the \`dEQP-VK.wsi.*.maintenance1.deferred_alloc.*` flakes seen so far as happening on all platforms |
| - v3dv/ci: add other flakes seen during nightly run |
| - ci: fix shader-db job existence condition |
| - v3dv/ci: assume dEQP-VK.wsi.wayland.swapchain.simulate_oom.* have been fixed |
| - v3dv/ci: add more flakes |
| - v3dv/ci: assume list of dEQP-VK.wsi.*.maintenance1.present_modes.* flakes is the same between xcb & xlib and between rpi4 & rpi5 |
| - ci: enable MESA_VK_ABORT_ON_DEVICE_LOSS globally |
| - ci/deqp-runner: split gl & gles groups to use the correct binary |
| - ci/deqp-runner: print deqp-gles version log as well |
| - ci: deduplicate converting the current job runtime into %M:%S |
| - ci: convert the job start date into a timestamp only once |
| - ci: simplify unnecessarily complex printf |
| - radv/ci: sort tahiti flakes |
| - radv/ci: add a bunch of flakes seen recently |
| - v3dv/ci: track regression |
| - rpi/ci: add flakes |
| - radv/ci: add more flakes |
| - v3dv/ci: add more flakes |
| - docs: update calendar for 24.0.3 |
| - docs: update calendar for 24.0.4 |
| - docs: add release notes for 24.0.4 |
| - docs: add sha256sum for 24.0.4 |
| - v3dv/ci: another batch of flakes |
| - radv/ci: another batch of flakes |
| - radv/ci: another batch of flakes |
| - radv/ci: dEQP-VK.spirv_assembly.type.vec4.i8.mod_geom Fail -> Crash on tahiti |
| - ci: don't run rustfmt on every core change |
| - ci_run_n_monitor: explain how to pass multiple targets without having to use regexes |
| - rpi/ci: another batch of flakes |
| - docs: mesa also implements gles 3.0+ |
| - docs/egl: various wording improvements |
| - ci: take kws farm offline |
| - ci: restore kws farm |
| - radv/ci: simplify tahiti flakes list |
| - ci: fold .test-check into its only user, python-test |
| - ci: run python-test when editing the CI itself |
| - ci: run python-test automatically only in merge pipelines |
| - docs/macos: drop reference to former github mirror |
| - docs/nir: vec4 reference |
| - docs/envvars: fix reference |
| - docs/isl: fix references to ISL_AUX_USAGE_CCS_* |
| - docs/isl: stop trying to link to classic drivers code |
| - docs/isl: VK_FORMAT_xxx_PACKEDn is not a real format, don't try to link to it |
| - docs/isl: fix enum references |
| - docs: fix inline c identifier reference -> inline code |
| - isl: fix inline c identifier reference -> inline code |
| - nir: add missing stdint include |
| - docs/anv: fix envvar documentation |
| - docs/nvk: fix envvar documentation |
| - ci: mark vmware farm as offline |
| - ci: add missing rule to disable vmware farm |
| - ci: raise the log level threshold of spirv logs |
| - docs/envvars: document some vulkan loader env vars |
| - docs: replace references to the deprecated VK_ICD_FILENAMES with the new VK_DRIVER_FILES |
| - docs: replace references to the deprecated VK_INSTANCE_LAYERS with the new VK_LOADER_LAYERS_ENABLE |
| - docs/zink: format the envvar value as code instead of plain text |
| - meson: add VK_DRIVER_FILES to devenv, alongside the old VK_ICD_FILENAMES |
| - ci: drop unused VK_ICD_FILENAMES passthrough from jobs to tests |
| - ci: use the new VK_DRIVER_FILES env var |
| - ci/deqp: backport fix for dEQP-VK.pipeline.*.render_to_image.*.huge.* |
| - ci: fix nightly build |
| - ci: fix nightly build (v2) |
| - ci/llvmpipe: make sure manual jobs don't auto-retry |
| - ci/llvmpipe: fix out of date fails list |
| - ci/lavapipe: fix out of date fails list |
| - ci/lavapipe: skip test that sometimes times out |
| - ci: add nightly full run of llvmpipe |
| - ci: add nightly full run of lavapipe |
| - gallium/dri: reuse existing meson variables |
| - meson: regroup glvnd lines to get an easier-to-review diff in the next commit |
| - meson: turn \`glvnd` option into a feature |
| - ci: explicitly disable glvnd to avoid regression when making it auto |
| - meson: auto-enable glvnd when libglvnd is installed |
| - mr-label-maker: include \*/gitlab-ci-inc.yml in GitLab CI changes |
| - mr-label-maker: be explicit about the various CI files |
| - docs: add release notes for 24.0.5 |
| - docs: update calendar for 24.0.5 |
| - docs: add sha256sum for 24.0.5 |
| - ci: delete mistaken duplicate llvmpipe-{fails,skips}.txt |
| - etnaviv: avoid re-defining prog_python |
| - egl: drop dead dri2_dpy param in dri2_wl_visual_idx_from_config() |
| - lavapipe: add 1 new failure and 1 new timeout since CTS uprev to 1.3.8.0 |
| - vk/overlay-layer: drop unused imports |
| - vk/overlay-layer: fix None checks |
| - vk/overlay-layer: simplify print and make it more readable |
| - docs/rusticl: add an intro explaining what Rusticl is |
| - wsi/x11: drop unused param in x11_present_to_x11_sw() |
| - radv: initialize a couple of variables |
| - util: simplify loop logic in util_format_get_first_non_void_channel() |
| - util/futex: replace double-cast check with a simple sign check |
| - docs/ci: explain how gitlab considers "changes" when pushing on a fork branch |
| - rpi5/ci: sort flakes |
| - rpi5/ci: add flakes from last night's run |
| - rpi4/ci: sort flakes |
| - rpi4/ci: add new flakes from last night's run |
| - radeonsi/ci: update vangogh expectations after piglit uprev |
| - llvmpipe/ci: update expectations after piglit uprev |
| - VERSION: bump for 24.1.0-rc1 |
| - .pick_status.json: Update to 4660ee1deaace6457bf5fbf3fc8810e4a2453cb5 |
| - ci: fix container rules on release branches and tags |
| - .pick_status.json: Update to 84632dce93f44e8d88cda47648cfd4cc0958918f |
| - .pick_status.json: Update to 8248cc0bf45d0d7558cc3d77a63dcd078a96aa66 |
| - ci: pass MESA_VK_ABORT_ON_DEVICE_LOSS through to the DUT |
| - .pick_status.json: Update to 86281ef15fca378ef48bcb072a762168e537820d |
| - .pick_status.json: Update to 47f6e24ad5dfcb59dd1511800aee8c56b4f8fee4 |
| - meson: simplify \`-gsplit-dwarf` compiler argument check |
| - meson: move tsan-blacklist.txt to build-support with the other build support files |
| - VERSION: bump for 24.1.0-rc2 |
| - .pick_status.json: Update to 603982ea802b3846e91a943b413a7baf430e875d |
| - .pick_status.json: Update to 569c2fcf952a3ec13ddf77c0058e769bf68f3aaf |
| - .pick_status.json: Update to 9666756f603f0285d8a93ef93db1c7ec702b671f |
| - .pick_status.json: Update to b8e79d2769b4a4aed7e2103cf0405acc5bdadb86 |
| - VERSION: bump for 24.1.0-rc3 |
| - .pick_status.json: Update to 18c53157318d6c8e572062f6bb768dfb621a55fd |
| - .pick_status.json: Update to 406dda70e7c9baa59c975eb64025e7c3b210c3bc |
| - .pick_status.json: Update to 5502ecd7716045e76f13f007a4aa5f5653c80ecd |
| - util/format: add missing null check in util_format_is_srgb() |
| - .pick_status.json: Update to d516721cd0cb16d0b601c42c01de0fdcc4ae887b |
| - .pick_status.json: Update to aa9244c8f6bfa3fb33cf233104b00fc44fc9459f |
| - .pick_status.json: Mark a45f1990860db3a8da6d7251bb627a314dfb8423 as denominated |
| - VERSION: bump for 24.1.0-rc4 |
| - .pick_status.json: Update to b2282e3a571f18b48b8b717ec32da1d0ed93f1b5 |
| - .pick_status.json: Update to 471ac97a4af751226bc51076130deae252bb481e |
| - .pick_status.json: Update to 2487a875527f636565a7b39036690fbf7c5d46db |
| - .pick_status.json: Update to 3584fc64828ad2ad4d486572ec915aab8321aadd |
| |
| Eric R. Smith (13): |
| |
| - panfrost: fix panfrost drm-shim |
| - panfrost: add lowering pass for multisampled images |
| - panfrost: support multi-sampled image load/store |
| - panfrost: protect alpha calculation from accessing non-existent component |
| - panfrost: make drm-shim work again for panfrost |
| - panfrost: make sure blends always have 4 components |
| - panfrost: mark indirect compute buffer as read |
| - gallium: handle copy_image of depth textures |
| - panfrost: fix polygon offset calculation for floating point Z |
| - panfrost: fix a GPU/CPU synchronization problem |
| - panfrost: mark separate_stencil as valid when surface is valid |
| - panfrost: fix an incorrect stencil clear optimization |
| - panfrost: add a barrier when launching xfb jobs in CSF |
| |
| Erico Nunes (2): |
| |
| - Revert "ci: lima farm is down" |
| - ci: enable shader-db on lima |
| |
| Erik Faye-Lund (32): |
| |
| - panfrost: add support for forcing sample-counts |
| - panfrost: pass reduced primitive type instead of points |
| - panfrost: add line_smooth shader-key and lowering |
| - panfrost: clean up active_prim update |
| - panfrost: implement line-smoothing |
| - mesa/main: add support for EXT_texture_storage |
| - mesa: fix error-handling for ETC2/RGTC textures |
| - glapi: move EXT_texture_storage to the right position |
| - targets/va: override LIBVA_DRIVERS_PATH in devenv |
| - mesa/main: fix _mesa_base_tex_format for BGRA |
| - mesa/main: mark GL_BGRA as color-renderable |
| - mesa/main: mark GL_BGRA8_EXT as color-renderable |
| - mesa/main: work around chrome/firefox bug |
| - mesa/main: allow GL_BGRA for FBOs |
| - panvk: do not handle illegal null |
| - glsl: Make error_value a real ir_rvalue type |
| - panfrost: give afbc-packing its own flag |
| - panfrost: add driconf infrastructure |
| - panfrost: add pan_force_afbc_packing driconf |
| - mesa: prefer read-format of RG for snorm |
| - gallium: remove always-false parameter |
| - panvk: use integers instead of strings |
| - panfrost: silence compiler warning |
| - panfrost: add tiler-heap driconfs |
| - panvk: wire up version-overriding |
| - panfrost: implement a driver-specific max-miplevel |
| - panfrost: use perf_debug_ctx instead of perf_debug |
| - panfrost: perf_debug_ctx -> perf_debug |
| - panfrost: use util_debug_message for perf_debug |
| - panfrost: do not deref potentially null pointer |
| - panfrost: correct first-tracking for signature |
| - panvk: avoid dereferencing a null-pointer |
| |
| Erik Kurzinger (2): |
| |
| - wsi/wayland: don't use explicit sync with sw |
| - wsi/x11: support explicit sync |
| |
| Faith Ekstrand (284): |
| |
| - nvk: Add an explicit mapping from shader stages to cbuf bindings |
| - nvk: Return an nvk_cbuf_map from nvk_lower_nir() |
| - nvk: Use s instead of set_idx in CmdBindDescriptorSets |
| - nvk: Rework descriptor set binding |
| - nvk: Make dynamic cbuf indices relative to the descriptor set |
| - nvk: Handle missing descriptor sets in nvk_nir_lower_descriptors |
| - nvk: Invalidate state after secondary command buffers |
| - nvk: Set a minimum of one patch control point |
| - nak: Disallow gl_FragData and set MRT correctly |
| - nak: Add explicit padding to nak_shader_info |
| - nvk: Emit SET_ANTI_ALIAS at draw time when no render targets are bound |
| - nvk: Move SET_HYBRID_ANTI_ALIAS_CONTROL to draw time |
| - nvk: Advertise variableMultisampleRate and EDS3RasterizationSamples |
| - nvk: Add a couple more features to features.txt |
| - nak: Stop passing --explicit-padding to bindgen |
| - nak: Implement nir_op_pack_half_2x16_rtz_split |
| - nak: Implement nir_op_ufind_msb_rev |
| - nak: Rename OpBrev to OpBRev |
| - nak: Implement nir_op_bfm |
| - nouveau/mme/fermi: Stop truncating iadd immediates |
| - nouveau/mme: Stop using isaspec |
| - nvk: Set framebufferIntegerColorSampleCounts |
| - nvk: Unref shaders on pipeline free |
| - nvk: Add a #define for max shared memory size |
| - nvk: Properly configure the min/max shared mem size |
| - nvk: Implement VK_KHR_zero_initialize_workgroup_memory |
| - nir,spirv: Add support for SPV_NV_shader_sm_builtins |
| - nak: Add support for SPV_NV_shader_sm_builtins |
| - nvk: Advertise VK_NV_shader_sm_builtins |
| - nvk/draw: Map cbuf slots to shaders, not cbuf_maps |
| - nak: Refactor shader upload math |
| - nvk: Wire up nir_opt_large_constants |
| - nak: Enable NIR fuse_ffmaN |
| - nak: Legalize OpBMsk |
| - nvk: Don't exnore ExternalImageFormatInfo |
| - nvk: Set maxInlineUniformTotalSize |
| - nak: Fix TCS output reads |
| - anv: Add helpers for getting the surface state from an image view |
| - anv: Advertise VK_EXT_attachment_feedback_loop_layout |
| - nak: Choose S2R vs CS2R based on sysval index |
| - nak: Add a source barrier intrinsic |
| - nak: Loop to ensure we get accurate shader clocks |
| - nvk: Stop requiring dedicated allocations |
| - nvk: Advertise Vulkan 1.3 |
| - nvk: Do a second submit to check for errors in the sync case |
| - nvk: Whitespace fixes |
| - nvk: Disable all cbufs in nvk_queue_init_context_draw_state() |
| - nvk: Call lower_compute_system_values after zer_initialize_workgroup_memory |
| - nak/nir: Stop lowering load_local_invocation_index |
| - nil: Set the level offset to 0 in nil_image_for_level |
| - nvk: Fix whitespace in nvk_image.c |
| - nouveau/winsys: Re-order channel creation |
| - nouveau/winsys: Allow only allocating a subset of engines |
| - nvk/queue: Pull DRM specfc stuff into nvk_queue_drm.c |
| - nvk/queue: Refactor the push builder a bit |
| - nvk: Move the nouveau_ws_context to nvk_queue |
| - nvk: Add an array of queue families to nvk_physical_device |
| - nvk/queue: Rework context state init |
| - nvk/queue: Only initialize the necessary engines |
| - nvk: Use VM_BIND for contiguous heaps instead of copying |
| - nvk: Only map heaps that explicitly request maps |
| - nvk: Add an upload queue |
| - nvk: Add an upload queue to nvk_device |
| - nvk: Use the upload queue for shader uploads |
| - nvk: Don't set CONSTANT_BUFFER_SELECTOR with a zero size |
| - nvk/heap: Use nvk_heap_bo::addr instead of bo->offset |
| - nvk/heap: Rework over-allocation |
| - nvk: Convert shader addresses to offsets in nvk_shader.c |
| - vulkan: Update XML and headers to 1.3.278 |
| - nvk: Use nouveau_ws_bo_new_mapped() for descriptors |
| - nouveau/winsys: Add a fixed_addr to nouveau_ws_bo_map |
| - nvk: Implement VK_EXT_map_memory_placed |
| - nvk: Invalidate the texture cache before MSAA resolves |
| - nvk: Don't use WAIT_AVAILABLE in nvk_upload_queue_sync |
| - drm-uapi: Sync nouveau_drm.h |
| - nouveau/winsys: Add a vram_used query |
| - nvk: Add a nouveau_ws_device to nvk_physical_device |
| - nvk: Add a hand-rolled nvk_memory_heap struct |
| - nvk: Use 3/4 of total system memory for the VRAM heap |
| - nvk: Add an available query to nvk_memory_heap |
| - nvk: implement EXT_memory_budget |
| - nouveau/winsys: Getch the BAR size from the kernel |
| - nvk/heap: Upload shaders on the CPU when we have a map |
| - nvk: Upload shaders on the CPU when we have ReBAR |
| - nvk: Expose a host-visible VRAM type when we have REBAR |
| - nvk: Only expose VK_KHR_present_id/wait when we have WSI |
| - nvk: Advertise VK+KHR_incremental_present |
| - nil: Add PIPE_FORMAT_R5G5B5A1_UNORM |
| - nak: Add writes_point_size to nak_shader_info |
| - nvk: Handle missing gl_PointSize in the last geometry stage |
| - nvk/copy: Handle VK_REMAINING_ARRAY_LAYERS |
| - vulkan/meta: Handle VK_REMAINING_ARRAY_LAYERS in blit and resolve |
| - nvk: Use VkPipelineCreateFlags2 flag names |
| - nvk: Advertise VK_KHR_maintenance5 |
| - vulkan: Add a vk_get_subgroup_size() helper |
| - vulkan: Move the descriptor set limit to vk_limits.h |
| - vulkan: Add runtime code for VK_EXT_shader_object |
| - vulkan: Add a vk_render_pass_state_has_attachment_info() helper |
| - vulkan: Rework vk_render_pass_state::attachments |
| - vulkan: Add a new dynamic state for render pass attachments |
| - vulkan: Add a vk_pipeline base struct |
| - vulkan: Add push constant ranges to vk_pipeline_layout |
| - vulkan: Add a BLAKE3 hash to vk_descriptor_set_layout |
| - vulkan: Add generic graphics and compute VkPipeline implementations |
| - nvk: Populate vk_descriptor_set_layout::blake3 |
| - nvk/shader: Refactor some helpers |
| - nvk: Move populate_fs_key to nvk_shader.c |
| - nvk: Pass an array of descriptor sets to nvk_lower_nir |
| - nvk: Move nir_lower_patch_vertices to nvk_lower_nir() |
| - nvk: Use vk_render_pass_state::attachments for write masks |
| - nvk: Switch to shader objects |
| - nvk: Advertise VK_KHR_graphics_pipeline_library |
| - nvk: Advertise VK_EXT_shader_object |
| - nak: Implement nir_op_iadd3 on SM70+ |
| - nir: Add an imad opcode |
| - nak: Move NAK_FS_OUT_COLOR next to the enum |
| - nak: Add support for imad on Volta+ and enable it in simple cases |
| - nvk: Advertise a CTS version of 1.3.7.3 |
| - nvk: Drop the non-conformant warning on Turing-Ada |
| - nvk: Don't print the NVK_I_WANT_A_BROKEN_VULKAN_DRIVER warning in release builds |
| - meson: Rename nouveau-experimental to nouveau and build by default on x86 |
| - vulkan/pipeline: Whitespace fix |
| - vulkan/pipeline: Handle fully compiled library shaders properly |
| - nvk: Advertise VK_KHR_pipeline_library |
| - docs/nvk: Update the conformance status section |
| - docs/nvk: Update the NVK_DEBUG docs |
| - docs/nvk: Document NAK_DEBUG |
| - nil: Enable A8_UNORM for storage buffers |
| - vulkan/pipeline: Always init pipeline cache objects |
| - nak: Fix printing of OpIsberd |
| - nak/sm50: Fix encoding of immediates in OpFFma |
| - nak/sm50: Use OpBfe instead of OpBRev for nir_op_find_lsb |
| - nak: Support F2I for 8-bit integers on SM50 |
| - nvk: Return os_page_size for minMemoryMapAlignment |
| - nouveau: Import g_nv_name_released.h from NVIDIA OGK |
| - nvk: Report official GPU names from NVIDIA when we have them |
| - nvk: Use row_stride_B instead of width for render and copies |
| - nil: Rework tiling calculations |
| - nil: Add a concept of width to tile sizes |
| - nil: Add a concept of sliced 3D image views |
| - nvk: Use "real" 3D image views |
| - nvk/queue: Add a push_bind helper |
| - nvk: Refactor opaque image binds |
| - nvk/queue: Add support for non-opaque sparse binds |
| - nak: Rename resident to fault |
| - nak: Plumb is_sparse through from NIR for texture ops |
| - nak/nir: Add sparse support to shrink_image_load() |
| - nak: Wire up sparse residency for texture ops |
| - nil: Fix a typo in a comment |
| - nvk: Document the register name for the helper load workaround |
| - nvk: Always wait for the FALCON in set_priv_reg |
| - nvk: Disable the Out Of Range Address exception |
| - nvk: Drop a bunch of dev->pdev and just use pdev |
| - nvk: Add and use more cmd_buffer_*_cls helpers |
| - nvk: Replace more dev->pdev with nvk_device_physical() |
| - nvk: Drop nvk_device::pdev |
| - zink: Remove interpolateAtSample() when not multi-sampling |
| - nil: Move Z slice offset calculations to a helper |
| - nvk: Add a nil_image helper variable in BeginRendering |
| - nvk: Manually offset array and Z slices in BeginRendering |
| - nil: Advertise support for PIPE_FORMAT_R5G6B5_UNORM |
| - nil: Whitespace fix |
| - nil: Add support for larger textures on Pascal+ |
| - nil: Add a helper to view a MSAA image as samples |
| - nil: Expose nil_pix_extent_sa() |
| - nvk: Use a HW generation names instead of chipsets |
| - nvk: Stop pretending to handle Intel image intrinsics |
| - nvk: Use different descriptor layouts for storage vs. sampled images |
| - nvk: Implement shaderStorageImageMultisample |
| - zink: Rework sparse texture lowering |
| - nvk: Ignore rasterizationSamples when handling sampleShadingEnable |
| - nvk: Always set SET_ATTRIBUTE_POINT_SIZE |
| - Revert "nvk: Enable VK_KHR_shader_subgroup_uniform_control_flow" |
| - nvk: Move the mutableDescriptorType enable |
| - nir: Take a nir_def in nir_goto_if() |
| - nir/print: Inline print_ssa_use() |
| - nir/builder: Correctly handle decl_reg or undef as the first instruction |
| - nir: Improve the comment for nir_block::imm_dom |
| - nir: Add a sort_unstructured_blocks() helper |
| - nir: Validate that unstructured blocks are in reverse PDFS order |
| - nir/lower_reg: Remove dead reg_decl intrinsics |
| - nir/lower_reg: Support unstructured control-flow |
| - nir/repair_ssa: Support unstructured control-flow |
| - nir/gather_types: Support unstructured control-flow |
| - nir: Mark divergent regs in phis_to_regs_block() |
| - nir: Add a lower_terminate_to_demote pass |
| - nak: Add a copy_fs_outputs_nv intrinsic |
| - nak: Move barrier removal into its own pass |
| - nak: Add a condition to bar_break_nv |
| - nak/nir: Add a control-flow lowering pass |
| - nak: Add more NIR wrappers for walking the NIR CFG |
| - nak: Add NIR helpers for jump instructions |
| - nak: Add helpers for emitting jumps |
| - nak: Handle unstructured NIR |
| - nak: Use the new lowering pass on SM70+ |
| - nak: Remove the old barriers pass |
| - nak/nir: Use nir_lower_terminate_to_demote() |
| - nvk: Advertise VK_KHR_shader_maximal_reconvergence |
| - nvk: Advertise VK_KHR_shader_subgroup_uniform_control_flow |
| - nak/nir: Emit nir_intrinsic_ald_nv directly for system values |
| - nak/nir: Rename load_interpolated_input |
| - nak/nir: Add a load_fs_input hepler for flat inputs |
| - nak/nir: Emit nir_intrinsic_ipa_nv directly for FS system values |
| - nak/nir: Use nir_io_semantics for varyings and attributes |
| - nak: Break lower_fs_inputs into its own file |
| - nak/nir: Clean up lower_fs_inputs a bit |
| - nak: Call nir_lower_io_to_temporaries for FS outputs |
| - nak/nir: Use nir_io_semantics for FS outputs |
| - nak: Drop lower_io_arrays_to_elements_no_indirects for FS outputs |
| - nak: Simplify over-all I/O lowering |
| - nak: Don't write undefined FS outputs |
| - nak: Plumb through LDC modes |
| - nak: Implement load_ubo with an indirect cbuf index |
| - nvk: Support VkBindMemoryStatusKHR |
| - nvk: Advertise VK_KHR_maintenance6 |
| - nir: Delete the rest of the CF list when adding a halt |
| - nak: Don't do a scope break cascade for nir_jump_halt |
| - nil: Add a CSV version of the format table |
| - nil: Re-organize the format table |
| - nil: Switch to using the CSV generated table |
| - nil: Drop bogus color formats from non-renderable luminance/alpha formats |
| - nil: Remove 2-bit SNORM from the format table |
| - nil: Drop unneeded types from formats |
| - nvk: Use the page-aligned BO size for the descriptor pool |
| - nvk: Use a VMA heap for descriptor memory |
| - nvk: Use a linked list for descriptor sets in a pool |
| - nvk: Add a _pad field to nvk_cbuf |
| - nvk: Delete dead descriptor code |
| - nvk: Add a _pad field to nvk_fs_key |
| - nvk: Add a bunch of -Wpadded errors |
| - vulkan: Add a bunch of -Wpadded errors |
| - nouveau: Move .rustfmt.toml from NAK to src/nouveau/ |
| - nouveau: Use hyphenated arguments to class_parser.py |
| - nouveau/headers: Add initial Rust bindings |
| - nouveau/headers: Add Rust bindings for texture headers |
| - ci: Add cbindgen to the build images |
| - nil: Move nil_tic_format to nil_format_table.h |
| - nil: Move to a single header file |
| - nvk: Stop using nvk_extent4d short names |
| - nil: Rename nil_tiling::gob_height_8 to gob_height_is_8 |
| - nak/bitview: Add a SetField<f32> implementation |
| - nil: Delete unused USAGE bits |
| - nil: Make the Rust library the primary build target |
| - nil: Add Extent/Offset4D::new() helpers |
| - nil: Drop the nil_extent/offset4d() helpers |
| - nil: Take a format in el_to_B() |
| - nil: Enforce units via Rust types |
| - nil: cbindgen is required |
| - nvk: Improve the unsupported handle type error |
| - nvk: Restrict shaderFloat16 to Ampere+ for now |
| - nouveau/headers: Move the classes into a submodule for Rust |
| - nouveau/headers: Generate Rust for QMDs |
| - nak: Add helpers for filling QMDs |
| - nvk: QMDs are 64 dwords |
| - nvk: Use the NAK helpers to fill QMDs |
| - nouveau: Import the hwref headers from Nvidia OGK |
| - nouveau/headers: Add the MMU headers to the Rust crate |
| - nil: Use the enums from the hwref headers for PTE kinds |
| - nil: add s8 pte kind |
| - nil: Be more speicific about Maxwell in the format table |
| - nil: Advertise S8_UINT on MaxwellB+ |
| - nvk: Hash ycbcr conversions in the descriptor set layout hash |
| - nvk: Re-emit sample locations when rasterization samples changes |
| - nvk/meta: Restore set_sizes[0] |
| - nvk/upload_queue: Only upload one line of data |
| - vulkan/wsi: Bind memory planes, not YCbCr planes. |
| - nvk: Improve the GetMemoryFdKHR error |
| - nouveau/winsys: Take a reference to BOs found in the cache |
| - nouveau/winsys: Make BO_LOCAL and BO_GART separate flags |
| - nvk: Allow GART for dma-bufs |
| - nil: Use the right PTE kind for Z32 pre-Turing |
| - nvk: Set color/Z compression based on nil_image::compressed |
| - nil: Default to NV_MMU_PTE_KIND_GENERIC_MEMORY on Turing+ |
| - nvk: Allow VK_IMAGE_ASPECT_MEMORY_PLANE_0_BIT |
| - drm-uapi: Sync nouveau_drm.h |
| - nouveau/winsys: Add back nouveau_ws_bo_new_tiled() |
| - nvk: Support image creation with modifiers |
| - nvk: Set tile mode and PTE kind on dedicated dma-buf BOs |
| - nvk: Implement DRM format modifier queries |
| - nvk: Advertise VK_EXT_queue_family_foreign |
| - nvk: Advertise VK_EXT_image_drm_format_modifier |
| - nvk/wsi: Advertise modifier support |
| - zink: Set workarounds.can_do_invalid_linear_modifier for NVK |
| - nvk/meta: Save and restore set_dynamic_buffer_start |
| |
| Felix DeGrood (6): |
| |
| - driconf: Change vendorid on Palworld for Intel |
| - driconf: Fake vendorid for RDR2 |
| - mesa-overlay: defer listening to socket until first frame |
| - driconf: add SotTR DX12 to Intel XeSS workaround |
| - iris: Increase target batch size to 128 KB |
| - intel/ds: add pipe control reasons to perfetto flushes |
| |
| Francisco Jerez (36): |
| |
| - intel/fs: Use full 32-bit sample masks when immediate. |
| - intel/eu/validate: SEND instructions don't have immediate encodings on Gen12+. |
| - intel/eu/gfx12.5+: Don't fail validation with ARF register restriction error for indirect addressing. |
| - intel/compiler/xe2: Add Xe2 bounds to FF() macro. |
| - intel/compiler/xe2: Implement codegen of general instruction controls. |
| - intel/compiler/xe2: Implement codegen of 2-source instruction operands. |
| - intel/compiler/xe2: Implement codegen of indirect immediates. |
| - intel/compiler/xe2: Implement codegen of three-source instructions. |
| - intel/compiler: Add assume() checks to brw_compact_inst_(set\_)bits(). |
| - intel/compiler/xe2: Implement codegen of compact instructions. |
| - intel/compiler/xe2: Implement instruction compaction. |
| - intel/compiler/xe2: Fix for NibCtrl field removal. |
| - intel/compiler/xe2: Fix for the removal of most predication modes. |
| - intel/compiler/xe2: Add extra flag registers. |
| - intel/compiler/xe2: Fix for the removal of AccWrCtrl. |
| - intel/ir/xe2+: Add support for 32 SBID tokens to performance model. |
| - intel/fs/xe2+: Disable bank conflict mitigation pass for now. |
| - intel/eu/xe2+: Translate brw_reg fields in REG_SIZE units to physical 512b GRF units during codegen. |
| - intel/fs: Set the default execution group to 0 when not representable by the platform. |
| - intel/fs: Emit QUAD_SWIZZLE instructions with WE_all for derivative lowering. |
| - intel/fs/xe2+: Allow SIMD16 MULH instructions. |
| - intel/brw/xe2: Render target reads have been removed from the hardware. |
| - intel/brw/xe2+: Update encoding of FB write descriptor message control. |
| - intel/brw/xe2+: Update encoding of FB write extended descriptor. |
| - intel/brw/xe2+: Double allowed SIMD width of FB write SEND messages. |
| - intel/brw/xe2+: Allow FS stencil output in SIMD16 dispatch mode. |
| - intel/brw/xe2+: Allow dual-source blending in SIMD16 mode. |
| - intel/blorp/xe2+: Don't use replicated-data clears. |
| - intel/brw/gfx12: Setup PS thread payload registers required for ALU-based pixel interpolation. |
| - intel/brw/xe2+: Setup PS thread payload registers required for ALU-based pixel interpolation. |
| - iris/xe2+: Disable coherent framebuffer fetch. |
| - intel/brw/xehp+: Replace lsc_msg_desc_dest_len()/lsc_msg_desc_src0_len() with helpers to do the computation. |
| - intel/eu/xehp+: Don't initialize mlen and rlen descriptor fields from lsc_msg_desc*(). |
| - intel/brw/xehp+: Drop redundant arguments of lsc_msg_desc*(). |
| - intel/fs/gfx20+: Implement sub-dword integer regioning restrictions. |
| - intel/fs/gfx20+: Handle subdword integer regioning restrictions in copy propagation. |
| |
| Frank Binns (3): |
| |
| - CODEOWNERS: update Imagination maintainers |
| - pvr: fix up some includes |
| - pvr: split out device info into per GPU headers |
| |
| Friedrich Vock (27): |
| |
| - radv/rt: Add workaround to make leaves always active |
| - radv: Fix shader replay allocation condition |
| - nir: Make is_trivial_deref_cast public |
| - nir: Handle casts in nir_opt_copy_prop_vars |
| - radv/amdgpu: Fix build on BSD |
| - winsys/amdgpu: Fix build on BSD |
| - util: Provide a secure_getenv fallback for platforms without it |
| - vulkan: Use secure_getenv for trigger files |
| - aux/trace: Guard triggers behind __normal_user |
| - vtn: Use secure_getenv for shader dumping |
| - mesa/main: Use secure_getenv for shader dumping |
| - radv: Use secure_getenv in radv_builtin_cache_path |
| - radv: Use secure_getenv for RADV_THREAD_TRACE_TRIGGER |
| - util/disk_cache: Use secure_getenv to determine cache directories |
| - radv/rt: Write inactive node data in ALWAYS_ACTIVE workaround |
| - radv/rt: Optimize update shader VGPR usage |
| - radv,driconf: Enable active AS leaf workaround for Jedi Survivor |
| - radv/rt: Handle monolithic pipelines in capture/replay |
| - vulkan/runtime: Allow more than 8 DRM devices |
| - radv: Set SCRATCH_EN for RT pipelines based on dynamic stack size |
| - radv/rt: Fix frontface culling with emulated RT |
| - radv/rt: Force active leaves for every updateable accel struct |
| - radv,driconf: Remove active accel struct workaround |
| - radv: Only enable SEs that the device reports |
| - radeonsi: Only enable SEs that the device reports |
| - aco/tests: Insert p_logical_start/end in reduce_temp tests |
| - aco/spill: Insert p_start_linear_vgpr right after p_logical_end |
| |
| Ganesh Belgur Ramachandra (1): |
| |
| - compiler,glsl: fix warning when -finstrument-functions is used |
| |
| Georg Lehmann (60): |
| |
| - aco: reassign split vector to SOPC |
| - aco: stop scheduling at p_logical_end |
| - nir: add ballot_relaxed and as_uniform intrinsics |
| - aco: implement as_uniform and ballot_relaxed |
| - ac/llvm: implement as_uniform and ballot_relaxed |
| - nir: add lowering for boolean shuffle |
| - radv: lower boolean shuffle |
| - radeonsi: lower boolean shuffle |
| - aco: remove boolean shuffle isel |
| - aco: fix printing dpp8 |
| - aco: validate v_permlane opsel correctly |
| - aco: support v_permlane64_b32 |
| - aco/gfx11: use v_nop to resolve VcmpxPermlaneHazard |
| - aco/gfx11: resolve VcmpxPermlaneHazard for v_permlane64 |
| - aco: implement rotate |
| - radv: enable VK_KHR_shader_subgroup_rotate |
| - radv: report rotate subgroup feature bits |
| - anv: report rotate subgroup feature bits |
| - aco/gfx11+: disable v_pk_fmac_f16_dpp |
| - aco: add packed fma dpp note to README-ISA |
| - aco: don't remove branches that skip v_writelane_b32 |
| - aco/print_ir: don't use alloca for input modifiers |
| - aco: print neg prettier for packed math |
| - aco: don't print hi() for permlane opsel |
| - aco: print permlane16 bc/fi |
| - aco: print exec/vcc_lo/hi for single dword access |
| - aco/gfx11+: limit hard clauses to 32 instructions |
| - radv/gfx11+: add rtwave32 perftest option |
| - aco: use fmamk/ak instead of fma with inline constant for more VOPD |
| - nir: remove rotate scope |
| - nir/divergence_analysis: fix subgroup mask |
| - aco: create pseudo instructions with correct struct |
| - aco/post-ra: rename overwritten_subdword to allow additional uses |
| - aco/post-ra: assume scc is going to be overwritten by phis at end of blocks |
| - aco: store if pseudo instr needs scratch reg |
| - aco/post-ra: track pseudo scratch sgpr/scc clobber |
| - aco/ssa_elimination: check if pseudo scratch reg overwrittes regs used for v_cmpx opt |
| - aco/builder: improve v_mul_imm for negative imm |
| - aco/builder: use 24bit mul if low bits of imm are zero |
| - aco/optimizer: combine v_mul_i32_i24 and add to mad |
| - aco: avoid full 32bit imul for uniform reduce/scan |
| - aco: don't combine mul+add_clamp to mad_clamp |
| - aco/ra: use SDWA for 16bit instructions when the second byte is blocked |
| - aco/vn: remove instruction hash templates |
| - aco: use v1 definition for v_interp_p1lv_f16 |
| - aco/assembler: add vintrp high_16bit support |
| - aco: swap opsel and wait_exp for vinterp |
| - aco: support high_16bits FS IO |
| - aco/tests: add assembler tests for interp high_16bits |
| - aco/gfx9: all non legacy opsel instructions only write 16bits |
| - aco: use v_interp_p2_f16 opsel |
| - aco: add ra test for hi v_interp_p2_f16 |
| - radv: sink alu |
| - radv: move alu |
| - nir: don't try to optimize exclusive min/max scan to inclusive |
| - nir: rename to nir_opt_16bit_tex_image |
| - ac/nir: add ac_nir_opt_pack_half |
| - radv: use ac_nir_opt_pack_half |
| - radv, radeonsi: don't use D16 for f2f16_rtz |
| - zink: use bitcasts instead of pack/unpack double opcodes |
| |
| George Ouzounoudis (1): |
| |
| - vulkan: Fix null pointer dereferencing on sample locations state |
| |
| Gert Wollny (76): |
| |
| - r600: lower dround_even also on hardware that supports fp64 |
| - virgl: Use better reporting for mirror_clamp features |
| - ci: Upref virglrenderer |
| - zink: Factor out create buffer from resource_object_create |
| - zink: shorten lifetime of success variable in resource_object_create |
| - zink: Factor out create sampler conversion in resource_object_create |
| - zink: factor out get_format_feature_flags in resource_object_create |
| - zink: factor out get_image_memory_requirement in resource_object_create |
| - zink: reduce number of #ifdefs in resource_create_object |
| - zink: extract get_export_flags from resource_object_create |
| - zink: extract function allocate_bo from resource_create_object |
| - zink: redesign the allocation try loop to test all heaps |
| - zink: extract function create_image from resource_object_create |
| - zink: extract function update_alloc_info from resource_object_create |
| - zink: extract update_obj_info from resource_object_create |
| - zink: extract debug_resource_mem from resource_object_create |
| - zink: drop duplicate assignment to obj->alignment |
| - zink: extract allocate_bo_and_update_obj from resource_object_create |
| - zink: Move more code to create_image and create_buffer |
| - zink: simplify call to get_export_flags |
| - zink: remove duplicate arguments and use VkMemoryRequirements locally |
| - zink: use enums as return values in resource_object_create |
| - radv: Fix compilation with gcc-13 and tsan enabled |
| - nir/lower_int64: Fix compilation with gcc-13 and tsan enabled |
| - nir/builder: Fix compilation with gcc-13 when tsan is enabled |
| - zink: Fix return type and values of create_buffer and create_images |
| - zink: extract check_unordered_exec from zink_get_cmdbuf |
| - zink: remove duplicate check and assignment in zink_resource_image_needs_barrier |
| - zink: extract emit_memory_barrier from zink_resource_image_barrier |
| - zink: extract emit_memory_barrier::for_buffer from zink_resource_buffer_barrier |
| - zink: extract update_unordered_access_and_get_cmdbuf |
| - zink/sync: remove duplicate assignments in UNSYNCHRONIZED case |
| - zink: move zink_resource_copies_reset out of exportable_lock |
| - zink: remove invalid scope in bo allocation loop |
| - r600: handle indirect access to kcache 14 and 15 |
| - zink/nir_to_spirv: emit ViewportIndex cap also for inputs |
| - zink: use only ZINK_BIND_DESCRIPTOR |
| - zink: decrease aggressiveness of increasing descriptor data space adaptive |
| - zink/nir-to-spirv: Make sure sampleid for InterpolateAtSample is int |
| - nir-to-spirv: Cast SSBO input pointer when needed |
| - zink: set handle type also for user memory |
| - zink: acquire - maybe clear timeout after waiting for presentation fence |
| - nir_to_spirv: Allow LOD for external images |
| - zink: ctx->last_fence really wants to be a batch_state, so accomodate it |
| - zink: another fence that is better off as a batch state |
| - ntv: remove store_def_raw |
| - ntv: remove store_ssa_def |
| - ntv: pass def->index to store_def |
| - ntv: simplify increasing the number of dest componnets for sparse tex |
| - zink/ntv: introduce structure using the source params |
| - zink/ntv: extract get_tex_srcs |
| - zink/ntv: use new struct to pass texture parameters |
| - zink/ntv: extract find_sampler_and_texture_index |
| - zink/ntv: simplify evaluation of sampled_type |
| - zink/ntv: extract get_texture_load |
| - zink/ntv: extract get_texop_dest_type |
| - zink/ntv: Extract move_tex:proj_into_coord |
| - zink/ntv: replace if-chain with switch in emit_tex |
| - zink/ntv: extract picking the image to load from |
| - zink/ntv: extract emit_tex_readop as function that reads texture pixel data |
| - zink/ntv: pull result out of cases and use a common store_def |
| - zink: if AcquireNextImageKHR fails with VK_NOT_READY or VK_TIMEOUT retry |
| - meson: Add blacklist when compiling with tsan |
| - futex: disable futexes when compiling with tsan |
| - util/u_queue: read fence->signalled locked with TSAN |
| - tsan-blacklist: ignore race when reading lp_fence signalled status |
| - llvmpipe: Don't emit certain debug code when TSAN is enabled |
| - tsan-blacklist: Ignore race in get_max_abs_timeout_ns |
| - tsan-blacklist: surpress two race conditions in TC |
| - r600/sfn: Add array element parent also to array |
| - r600/sfn: Use dependecies to order barriers and LDS/RAT instructions |
| - r600/sfn: call nir_lower_doubles explicitely |
| - r600/sfn: when emitting fp64 op2 groups pre-load values |
| - r600/sfn: Don't put b2f64 conversion into ALU group |
| - zink/kopper: Wait for last QueuePresentKHR to finish before acquiring for readback |
| - mesa/st: don't use base shader serialization when uniforms are not packed |
| |
| Guilherme Gallo (25): |
| |
| - ci/lava: Turn the r8152 issue check into a counter |
| - ci/lava: Detect r8152 issue during boot phase |
| - ci/lava: Detect hard resets during test phase |
| - bin/ci: Don't submit jobs on integration test |
| - ci/lava: Ignore DUT feedback messages |
| - ci/lava: Fix the integration test |
| - bin/ci: Propagate the token to GitlabGQL |
| - bin/ci: Move get_token_from_default_dir to common |
| - bin/ci: Refactor read_token function |
| - bin/ci: Add GitLab basic token validation |
| - ci/lava: Broader R8152 error handling |
| - radv+zink/ci: Update xfiles based on nightly run |
| - radv/ci: Update xfiles based on nightly run |
| - v3d/ci: Update xfiles based on nightly run |
| - freedreno/ci: Update xfiles based on nightly run |
| - etnaviv/ci: Update xfiles based on nightly run |
| - r300/ci: Update xfiles based on nightly run |
| - ci/a618: Rebalance a618-limozeen jobs |
| - ci/a618: Add zink-tu-a618-full |
| - ci/lava: A few formatting cleanups |
| - ci/lava: Introduce unretriable exception handling |
| - ci/lava: Don't run jobs if the remaining execution time is too short |
| - ci/lava: Fix how exception entry in structured log |
| - ci: Add S3 id_token for all jobs |
| - ci: Use id_tokens for JWT auth |
| |
| Gurchetan Singh (6): |
| |
| - mesa/util: Check __ANDROID__ when for detecting for Android |
| - mesa/util: add <linux/fcntl.h> |
| - mesa/util: use DETECT_OS_ANDROID in anon_file.c |
| - mesa/vulkan: use a simpler path for header in trampoline gen |
| - mesa/vulkan: use DETECT_OS_ANDROID |
| - vk_image.c: #ifndef _WIN32 --> DETECT_OS_LINUX + DETECT_OS_BSD |
| |
| Haihao Xiang (1): |
| |
| - anv: Fix typo in transition_color_buffer |
| |
| Hannes Mann (2): |
| |
| - gallium/pipe: Add contiguous planes per-surface attribute |
| - frontends/va: Only export one handle for contiguous planes |
| |
| Hans-Kristian Arntzen (20): |
| |
| - wsi/x11: Remove unused vk_alpha in get_dri3_modifiers. |
| - wsi/x11: Compare modifiers before signalling SUBOPTIMAL. |
| - wsi/x11: Add drirc option to ignore SUBOPTIMAL. |
| - wsi/x11: Add workaround for Detroit Become Human. |
| - wsi/x11: Rewrite implementation to always use threads. |
| - wsi/x11: Implement VK_EXT_swapchain_maintenance1. |
| - wsi/x11: Keep track of multiple presentation requests. |
| - wsi/x11: Make chain->status atomic. |
| - wsi/wl: Refactor out code to update current present ID. |
| - wsi/wl: Improve fallback for present_wait. |
| - wsi/common: Allow KHR_present_wait on WL. |
| - wsi/x11: Disable vk_xwayland_wait_ready by default on most drivers. |
| - wsi/x11: Rewrite logic for how we consider minImageCount. |
| - radv: export multiview in VS/TES/GS for depth-only rendering |
| - wsi/wl: Fix deadlock in dispatch_queue_timeout. |
| - wsi/wayland: Replace surface pilfer with retired bool. |
| - wsi/wayland: Init outstanding list earlier. |
| - wsi/x11: Return OUT_OF_DATE on sw resize. |
| - vulkan/runtime: Check correct callback list for binding report. |
| - radv: Store range rather than bo_size in VkBuffer/VkImage. |
| |
| Helen Koike (6): |
| |
| - ci/ci_run_n_monitor: move get_gitlab_pipeline_from_url() to gitlab_common |
| - ci/ci_gantt_chart: add tool to analyse pipeline execution time |
| - ci/ci_gantt_chart: add timeout vertical line |
| - ci/ci_gantt_chart: add option to save output to a file |
| - ci/ci_gantt_chart: show duration on hover |
| - ci/ci_post_gantt: add script that post gantt to Marge's messages |
| |
| Hsieh, Mike (4): |
| |
| - amd/vpelib: add new tf enum and add flag for geometric scaling |
| - amd/vpelib: skip gamma remap and cs conversion when geometric scaling |
| - amd/vpelib: geometric scaling fix |
| - amd/vpelib: Add UID for 3d Lut and control logic |
| |
| Hyunjun Ko (3): |
| |
| - anv/video: fix out-of-bounds read |
| - anv/video: fix scan order for scaling lists on H265 decoding. |
| - anv/video: Fix to set correct offset and size for parsing h265 slice header. |
| |
| Iago Toral Quiroga (49): |
| |
| - broadcom/compiler: fix incorrect flags setup in non-uniform if path |
| - broadcom/compiler: fix incorrect flags update for subgroup elect |
| - broadcom/compiler: add new SFU instructions in V3D 7.x |
| - broadcom/compiler: don't move subgroup reduction instructions above setmsf |
| - broadcom/compiler: support subgroup ballot |
| - broadcom/compiler: support subgroup shuffle |
| - broadcom/compiler: support subgroup vote |
| - broadcom/compiler: support subgroup quad |
| - v3dv: expose more subgroup features on V3D 7.x |
| - broadcom/compiler: be more careful with unifa in non-uniform control flow |
| - brodcom/compiler: implement non-compute TSY barrier |
| - broadcom/compiler: support subgroup reduction operations from fragment shaders |
| - v3dv: allow subgroup operations in fragment shaders |
| - broadcom/compiler: fix lane selection for subgroups in fragment shaders |
| - v3d,v3dv: fix BO allocation for shared vars |
| - v3dv: fix subpass clear with draw call for multi-layered framebuffers |
| - v3dv: always set view index before drawing |
| - v3dv: fix copying v3dv_end_query_info into primaries from secondaries |
| - v3dv: refactor checking and adding pending jobs |
| - v3dv: add a helper to constrain clip window to render area |
| - v3dv: add helper to check if we need to use a draw for a depth/stencil clear |
| - v3dv: add helper to build a render pass for dynamic rendering |
| - v3dv: add a helper to setup a framebuffer for dynamic rendering |
| - v3dv: add a vk_render_pass_state to pipelines |
| - v3dv: don't assume that pipelines have a render pass |
| - v3dv: implement vkCmdBeginRendering and vkCmdEndRendering |
| - v3dv: implement dynamic rendering resume/suspend |
| - v3dv: rename SECONDARY job type to INCOMPLETE |
| - v3dv: fix resume address patching for secondary command buffers |
| - v3dv: handle render pass continue flag with dynamic passes |
| - v3dv: also emit subpass clears with secondary command buffers |
| - v3dv: enable VK_KHR_dynamic_rendering |
| - broadcom/ci: skips for tests that don't check for extension support correctly |
| - broadcom/ci: add new expected test failures |
| - broadcom/ci: add a test that fails only in CI |
| - broadcom/ci: add skips for unsupported features |
| - v3dv: fix image creation when exceeding maxResourceSize |
| - v3d: implement fix for GFXH-1602 |
| - broadcom/compiler: fix workaround for GFXH-1602 |
| - v3dv: require multisync kernel |
| - v3dv: drop single sync kernel interface |
| - v3dv: add a v3dv_job_clone helper |
| - v3dv: fix job pointers from cloned CLs |
| - v3dv: store the offset of the BRANCH instruction in a CL |
| - v3dv: fix job suspend with command buffer simultaneous use flag |
| - broadcom/compiler: enable perquad with uses_wide_subgroup_intrinsics |
| - v3d/simulator: size counter_values array correctly on V3D 7.x |
| - broadcom/ci: document external causes for some CTS 1.3.8 failures |
| - v3dv: fix VK_KHR_vertex_attribute_divisor |
| |
| Ian Romanick (54): |
| |
| - nir: Minor clean up in nir_alu_srcs_negative_equal |
| - intel/compiler: Disable DPAS instructions on MTL |
| - intel/compiler: Use u_foreach_bit64 in brw_get_compiler_config_value |
| - intel/compiler: Track lower_dpas flag in brw_get_compiler_config_value |
| - intel/compiler: Track mue_compaction and mue_header_packing flags in brw_get_compiler_config_value |
| - intel/fs: Fix shift counts for 8- and 16-bit types |
| - intel/rt: Don't directly generate umul_32x16 |
| - intel/compiler/xe2: Update get_sampler_lowered_simd_width |
| - intel/fs: Move opcode modification before the switch that emits srcs |
| - intel/compiler/xe2: Use new sample_*_mlod messages |
| - nir: Pack texture LOD and array index to a single 32-bit value |
| - intel/compiler/xe2: Emit texture instructions w/ combined LOD and array index |
| - intel/compiler/xe2: Set SIMD mode for sampler messages |
| - nir: Add documentation for subgroup\_.._mask |
| - intel/fs: Delete stale comment in nir_intrinsic_ballot implementation |
| - nir: Mark nir_intrinsic_load_global_block_intel as divergent |
| - intel/fs: Enable nir_opt_uniform_atomics in all shader stages |
| - intel/fs: Use constant of same type to write flag |
| - intel/fs: Add fast path for ballot(true) |
| - nir: Initial framework for optimizing uniform subgroup operations |
| - intel/fs: Use nir_opt_uniform_subgroup |
| - nir: Optimize uniform iadd, fadd, and ixor reduction operations |
| - nir: Optimize uniform vote_all and vote_any |
| - i915: Fix value returned for PIPE_CAP_MAX_TEXTURE_CUBE_LEVELS |
| - intel/brw: Silence "statement may fall through" warning |
| - intel/brw: Correctly dump subnr for FIXED_GRF in INTEL_DEBUG=optimizer |
| - intel/compiler: Enforce 64-bit RepCtrl restriction in eu_validate |
| - intel/brw: Integer multiply w/ DW and W sources is not commutative |
| - intel/brw: Combine constants for src0 of integer multiply too |
| - intel/brw: Combine constants for src0 of POW instructions too |
| - intel/brw: Avoid a silly add with zero in assign_curb_setup |
| - intel/fs: Don't allow 0 stride on MOV destination |
| - intel/brw/xe2: Correctly disassemble RT write subtypes |
| - intel/brw: Fix handling of accumulator register numbers |
| - intel/brw: Allow SIMD16 F and HF type conversion moves |
| - intel/brw: Remove last vestiges of could_coissue |
| - intel/brw: Clear write_accumulator flag when changing the destination |
| - intel/brw: Use enums for DPAS source regioning |
| - nir: intel/brw: Change the order of sources for nir_dpas_intel |
| - intel/brw/xe2+: DPAS must be SIMD16 now |
| - intel/brw/xe2+: Use phys_nr and phys_subnr in DPAS encoding |
| - intel/brw/xe2: Update brw_nir_analyze_ubo_ranges to account for 512b physical registers |
| - intel/brw/xe2: Update uniform handling to account for 512b physical registers |
| - intel/compiler: Ensure load_barycentric_at_sample and load_interpolated_input remain together |
| - intel/brw: Don't call nir_opt_remove_phis before nir_convert_from_ssa |
| - intel/elk: Don't call nir_opt_remove_phis before nir_convert_from_ssa |
| - intel/brw: Delete stray nir_opt_dce |
| - intel/elk: Delete stray nir_opt_dce |
| - intel/brw/xe2+: Implement Wa 22016140776 |
| - intel/brw/xe2+: Only apply Wa 22016140776 to math instructions |
| - intel/brw: Fix handling of cmat_signed_mask |
| - nir: intel/brw: Remove cmat_signed_mask from dpas_intel intrinsic |
| - intel/brw: Fix optimize_extract_to_float for i2f of unsigned extract |
| - intel/elk: Fix optimize_extract_to_float for i2f of unsigned extract |
| |
| Isaac Marovitz (1): |
| |
| - asahi: Add >16 Sampler Access for Ryujinx |
| |
| Iván Briano (10): |
| |
| - anv: flush query clears for all gens |
| - anv, hasvk: pMutableDescriptorTypeLists can be out of range on pool creation |
| - compiler/types: fix serialization of cooperative matrix |
| - intel/cmat: fix stride calculation in cmat load/store |
| - nir/algebraic: avoid double lowering of some fp64 operations |
| - nir/lower_doubles: preserve sign of zero if we are asked to |
| - nir/lower_doubles: preserve NaN when asked to do so |
| - anv, hasvk: check requirements for USAGE_INPUT_ATTACHMENT properly |
| - anv: check requirements for VK_IMAGE_USAGE_FRAGMENT_SHADING_RATE |
| - anv: fix casting to graphics_pipeline_base |
| |
| JCWasmx86 (1): |
| |
| - meson: Fix invalid kwarg name |
| |
| Jan Beich (1): |
| |
| - util: mimic KCMP_FILE via KERN_FILE on DragonFly and FreeBSD |
| |
| Jani Nikula (1): |
| |
| - docs: fix doc build 'intel/dev/intel_device_info_gen.h' file not found |
| |
| Javier Martinez Canillas (2): |
| |
| - clc: silence a warn_unused_result |
| - gallium: Add ssd130x to the list of kmsro drivers |
| |
| Jesse Natalie (115): |
| |
| - ci/windows: Update WARP to 1.0.9 NuGet |
| - mesa: Consider mesa format in addition to internal format for mip/cube completeness |
| - ci/windows: Rev Vulkan SDK and piglit |
| - d3d12: Set up spirv-as and fix expectations |
| - microsoft/compiler: Declare shader model 6.8 / validator 1.8 |
| - microsoft/compiler: Handle comparison bias/gradient sampling |
| - dzn: Add a debug option to enable experimental shader models |
| - microsoft/compiler: Add feature flags for new comparison sampling ops |
| - dzn: Implement maintenance3 VariableDescriptorCountLayoutSupport |
| - dzn: Fix enhanced barrier layout for depth blits |
| - dzn: Handle VkBindImageMemorySwapchainInfoKHR |
| - dzn: Disable depth/stencil for partial binding from dynamic rendering |
| - spirv2dxil: Fix the spirv2dxil command line tool |
| - spirv2dxil: Handle aliasing/overlapping UBO/SSBO variables |
| - util: Detect arm64ec as aarch64 (and x86_64) |
| - glsl: Work around MSVC arm64 optimizer bug |
| - dzn: Don't set view instancing mask until after the PSO |
| - dzn: Fix path passed to CreateDeviceFactory |
| - d3d12: Fix path passed to CreateDeviceFactory |
| - microsoft/compiler: Use double pack/unpack instead of int for reduce ops on doubles |
| - dzn: Add a stencil blit fallback |
| - dzn: Add missing condition to immutable sampler init loop |
| - dzn: Add missing blit source barriers for enhanced barriers |
| - microsoft/compiler: Respect ACCESS_COHERENT in UAV variable data |
| - microsoft/compiler: Add a pass for promoting ACCESS_COHERENT on loads/stores |
| - spirv2dxil: Lower the Vulkan memory model and coherent loads/stores |
| - dzn: Add missing handling of VK_PIPELINE_STAGE_2_DRAW_INDIRECT_BIT |
| - dzn: Add barrier to copy source for DispatchIndirect copies |
| - dzn: Support non-static samplers for meta |
| - dzn: Add a debug flag for forcing off native view instancing |
| - dzn: Don't resolve for RESOLVE_MODE_NONE |
| - dzn: Use correct format for depth/stencil resolves |
| - dzn: Use blits for all non-averaging resolves |
| - microsoft/compiler: Only use simplified subgroup ID algorithm for compute |
| - d3d12: Subgroup ballot |
| - microsoft/compiler: Relax assert for SPIR-V barriers |
| - spirv2dxil: Remove dead branches early during shader compilation |
| - spirv2dxil: Trivial fixes for tessellation shaders |
| - dzn: Simultaneous-access is mutually exclusive with MSAA |
| - dzn: Fix tessellation shader insertion into PSO desc |
| - dzn: Add a driconf option to disable dzn for specific apps and use it for RDR2 |
| - microsoft/compiler: For emulating scan, ensure all threads are active when reading cross-lane |
| - microsoft/compiler: Fix wave size control for SM6.6+ |
| - microsoft/compiler: Fix wave size control for SM6.8+ |
| - wgl: Support a single-buffered winsys framebuffer |
| - wgl: Flush frontbuffer when calling swapbuffers on single-buffered fb |
| - wgl: Add no-gdi-single-buffered and gdi-double-buffered PFDs |
| - wgl: Enable WGL_ARB_pixel_format_float |
| - wgl: Add HDR pixel formats |
| - winsys/d3d12: Support single-buffered mode |
| - d3d12: Support R16G16B16A16_FLOAT display targets |
| - microsoft/compiler: Fix SM6.6 non-bindless handle annotation for UAV counter |
| - dzn: Fix conditions for barrier in texture-converting copy case |
| - wgl: Check for stw_device->screen before trying to destroy it |
| - spirv2dxil: Set push constant register space to nonzero |
| - microsoft/compiler: Remove deref load/store/atomic ops that statically go out of array bounds |
| - microsoft/compiler: Remove code after discard/terminate in later optimization steps |
| - wgl: Initialize DEVMODE struct |
| - d3d12: Point sprite lowering pass needs to handle arrays |
| - nir_lower_tex_shadow: For old-style shadows, use vec4(result, 0, 0, 1) |
| - spirv2dxil: Support buffer_device_address |
| - dzn: Support bufferDeviceAddress |
| - wgl: Delete unused context param to swap |
| - wgl: Check for null before dereferencing ctx in swap |
| - nir_tests: Add /bigobj when compiling with MSVC |
| - dzn: Include vulkan_core.h instead of vulkan.h in the device enum header |
| - dzn: Initialize memoryTypeBits for querying properties on imported handles |
| - microsoft/compiler: domainLocation component index needs to be i8 |
| - microsoft/compiler: Disable GS streams workaround for validator 1.8 |
| - ci/windows: Update DirectX-Headers, Agility SDK, zlib, DXC, and WARP |
| - ci/debian: Update DirectX-Headers |
| - nir: Handle ptr_as_array for build_deref_follower |
| - microsoft/compiler: Don't store static-indexing handles that are dynamically emitted |
| - microsoft/clc: When possible, compute a part-constant "pointer" value for kernel inputs |
| - microsoft/compiler: Simplify code emitting CL globals |
| - clc: Move libclc helpers back to microsoft/clc |
| - microsoft/clc: Add linkage capability to libclc build to silence warning |
| - microsoft/clc: Adjust order of UAV binding assignment |
| - microsoft/clc: Install clon12compiler |
| - wgl: The default swap interval is supposed to be 1 |
| - d3d12: Fix d3d12_lower_triangle_strip if multiple vars are in a single location |
| - microsoft/compiler: When sorting variables, put unused variables last |
| - microsoft/compiler: Move kill-unused/undefined varying pass from spirv to common |
| - microsoft/compiler: Simplify I/O component type enum handling |
| - microsoft/compiler: Expect front-facing var as an input |
| - microsoft/compiler: Improve linking helpers |
| - microsoft/compiler: Don't duplicate work from gather_info in var sorting |
| - d3d12: Move some lowering passes to pre-variant |
| - d3d12: Lower uniforms to UBO by nir options |
| - d3d12: Minor logging improvements |
| - d3d12: Fix var splitting pass writemasks |
| - d3d12: Explicitly add tess factor vars to tess signatures |
| - d3d12: Forward front-facing for passthrough GS |
| - d3d12: Capture always_active_io in varying data |
| - d3d12: Use TES inputs rather than VS outputs for TCS variant key |
| - d3d12: Add primitive ID sysval to input bitmask (for GS in) |
| - d3d12: Gather info less and before the final compilation steps |
| - d3d12: Remove variables instead of adding them for linking |
| - d3d12: Don't compile useless variants during shader creation |
| - microsoft/compiler: Add a fractional var mask for variable sorting |
| - d3d12: Set fractional var masks |
| - d3d12: Add a debug flag for loading WinPixGpuCapturer.dll |
| - ci/windows: Bump Agility SDK to 1.613.2 for ExecuteIndirect validation fix |
| - microsoft/compiler: Handle base vertex/instance sysvals as DXIL intrinsics |
| - spirv2dxil: Support passing first vertex / base instance to DXIL backend |
| - spirv2dxil: Output more specific metadata for whether draw sysvals are needed |
| - dzn: Delete dzn structs for indirect draw args and use D3D ones |
| - dzn: Query options21 |
| - dzn: Understand whether first-vertex and base-instance are needed for a pipeline |
| - dzn: Update pipeline cache params to take all options into account |
| - dzn: Rework indirect drawing keys for shaders and command signatures |
| - dzn: Add a hash table of command signatures with non-default strides |
| - dzn: Don't copy app indirect args if we don't need to |
| - glsl: Use a stable attr sort for VS in / FS out |
| - d3d12: Include <shlobj.h> with lowercase name |
| |
| Job Noorman (43): |
| |
| - tu: support l1 dcache size on musl |
| - ir3: fix setting shared flag on parallel copy arguments |
| - ir3: optimize read_first.macro to a mov |
| - ir3: fix printing of brcst.active and quad_shuffle |
| - ir3: optimize subgroup operations using brcst.active |
| - ir3: set reconvergence for scan_clusters.macro |
| - ir3: add disassembly for flat.b |
| - ir3: update a0/a1 users when cloning instructions |
| - ir3: fix alignment of spill slots |
| - ir3: validate instruction block pointer |
| - ir3: add terminators to blocks |
| - ir3: fix instruction count before kill_sched |
| - ir3: print branch sources |
| - ir3: remove OPC_B and brtype from cat0 |
| - ir3: remove comp1/2 from cat0 |
| - ir3: allow liveness calculation for different register types |
| - ir3: allow finding SSA uses for a subset of uses |
| - ir3: implement RA for predicate registers |
| - ir3: validate no registers are invalid after RA |
| - ir3: integrate predicates into RA validation |
| - ir3: optimize bitwise ops that can directly write predicates |
| - ir3: insert predicate conversions after their source |
| - ir3: fold negations into cmps.ne zero |
| - nir: add search helper is_only_used_by_if |
| - ir3: fold and/or and negations into branches |
| - freedreno/ci: Update pixmark-piano-v2 hash |
| - ir3: fix freeing incorrect register in loops |
| - ir3: fix returning false instead of NULL |
| - freedreno/registers: fix installation of schema |
| - zink: print shaderdb info via debug message callback |
| - ir3: calculate SSA uses at the start of predicates RA |
| - ir3: fix finding uses of reloaded defs in predicates RA |
| - ir3-disasm: run clang-format |
| - ir3-disasm: remove unused #includes |
| - ir3-disasm: add options to specify GPU by chip ID or name |
| - ir3-disasm: add option to disassemble hex number |
| - freedreno,computerator: support initialization of buffers |
| - ir3: remove unnecessary tessellation epilogue |
| - ir3: model predt/predf without sources |
| - ir3: add support for precolored sources in predicate RA |
| - ir3: add support for predication |
| - freedreno/drm-shim: add a730, a740, and a750 |
| - freedreno/drm-shim: remove duplicate entry for a630 |
| |
| Jonathan Gray (4): |
| |
| - intel/dev: update DG2 device names |
| - intel/dev: update DG2 device names |
| - intel/dev: update DG2 device names |
| - intel/dev: 0x7d45 is mtl-u not mtl-h |
| |
| Jonathan Marek (1): |
| |
| - tu/a750: Basic a750 support |
| |
| Jordan Justen (26): |
| |
| - intel/dev/common: Add xe2 support to get_l3_list() |
| - intel/dev: Add ARL platform enums |
| - intel/dev: Add intel_device_info_is_mtl_or_arl() |
| - intel/l3: Define l3 config for ARL |
| - iris: Extend MTL modifiers to ARL devices |
| - intel/i915: ARL also supports the set-PAT uapi |
| - intel/dev: Define engine prefetch for ARL |
| - isl: Define MOCS for ARL |
| - isl: Handle ARL in isl_drm_modifier_get_score() |
| - intel/compiler: Lower DPAS instructions on ARL except ARL-H |
| - anv/drirc: Extend option to disable FCV optimization to ARL |
| - anv/query: Follow MTL code paths on ARL |
| - intel/dev: Add device info for ARL |
| - intel/compiler: Set branch shader required-width as 16 for xe2 |
| - intel/compiler: Implement nir_intrinsic_load_topology_id_intel for xe2 |
| - intel/compiler: Verify SIMD16 is used for xe2 BTD/RT dispatch |
| - intel/dev: Add 2 additional ADL-N PCI ids |
| - intel/compiler: Adjust fs_visitor::emit_cs_terminate() for Xe2 |
| - intel/dev: Adjust device strings for ATS-M devices |
| - intel/dev: Add ATS-M PCI ID for Data Center GPU Flex 170G |
| - intel/compiler/fs: Restore SIMD32 restriction for ray_queries on Xe2 |
| - intel/compiler: nib_ctrl no longer exists on Xe2+ |
| - intel/dev/mesa_defs.json: Add LNL WA entries |
| - intel/dev: Add 0x56be and 0x56bf DG2 PCI IDs |
| - intel/dev: Change ATS-M 0x56c2 string from 170G to 170V |
| - intel/brw: Avoid getting a stride of 0 for nir_intrinsic_exclusive_scan |
| |
| Jose Maria Casanova Crespo (7): |
| |
| - ci: Adds /usr/local/bin to PATH at piglit-traces.sh |
| - v3d: Fix indentation at v3d_flush_jobs_writing_resource |
| - v3d: Only flush jobs that write texture from different job submission. |
| - v3d: Early return if job is not writing the resource |
| - v3d: Implement GL_ARB_texture_barrier |
| - broadcom/compiler: needs_quad_helper_invocation enable PER_QUAD TMU access |
| - ci: re-enable Igalia farm |
| |
| Joshua Ashton (34): |
| |
| - winsys/amdgpu: Hook up guilt to amdgpu_ctx_set_sw_reset_status |
| - winsys/amdgpu: Limit usage of query_reset_state2 |
| - radv/amdgpu: Handle -ENODATA and -ETIME from cs_submit |
| - radv: Mark device loss if QueueSubmit failed immediately |
| - radv: Remove check_status |
| - radv/amdgpu: Remove ctx_query_reset_status |
| - radv: Add radv_get_tdr_timeout_for_ip helper |
| - radv: Ensure vkGetQueryPoolResults returns in finite-time |
| - android: Use system = 'android' in crossfile |
| - meson: Enable zink in gallium_drivers by default |
| - meson: Enable d3d12 in gallium_drivers by default on Windows |
| - anv: Enable EXT_swapchain_maintenance1 |
| - v3dv: Enable EXT_swapchain_maintenance1 |
| - lavapipe: Enable EXT_swapchain_maintenance1 |
| - v3dv: Enable EXT_swapchain_colorspace |
| - lavapipe: Enable EXT_swapchain_colorspace |
| - wsi: Pass wsi_drm_image_params to wsi_configure_native_image |
| - wsi: Pass wsi_drm_image_params to wsi_configure_prime_image |
| - wsi: Add explicit_sync to wsi_image_info |
| - wsi: Add explicit_sync to wsi_drm_image_params |
| - build: Add linux-drm-syncobj-v1 wayland protocol |
| - wsi: Track if timeline semaphores are supported |
| - wsi: Add acquired member to wsi_image |
| - wsi: Track CPU side present ordering via a serial |
| - wsi: Get timeline semaphore exportable handle types |
| - wsi: Add common infrastructure for explicit sync |
| - ci: Bump wayland-protocols version to 1.34 |
| - ci: Bump DEBIAN_BASE_TAG for now |
| - meson: Update wayland-protocols wrap to 1.34 |
| - meson: Bump wayland-protocols requirement to 1.34 |
| - wsi: Implement linux-drm-syncobj-v1 |
| - tu: Expose VK_EXT_surface/swapchain_maintenance1 |
| - radv: Enable KHR_video_queue if encode is enabled |
| - radv: Properly initialize imageCreateFlags in GetPhysicalDeviceVideoFormatPropertiesKHR |
| |
| José Expósito (2): |
| |
| - zink: add render-passes HUD query |
| - meson: Update proc_macro2 meson.build patch |
| |
| José Roberto de Souza (88): |
| |
| - intel/isl/xe2: Disable route of Sampler LD message to LSC |
| - anv: Fix PAT entry for userptr in integrated GPUs |
| - intel/genxml/xe2: Remove L3ALLOC |
| - intel/dev: Reduce usage of intel_device_info_compute_system_memory() |
| - intel: Make memory heaps consistent between KMDs |
| - anv: Fix calculation of syncs required in Xe KMD |
| - iris: Avoid read of uninitialized value in blorp_clear_stencil_as_rgba() |
| - iris: Fix return of iris_wait_syncobj() |
| - iris: Wait for drm_xe_exec_queue to be idle before destroying it |
| - intel/common: Add functions to handle async vm bind |
| - anv: Start to use intel_bind_timeline |
| - iris: Start to use intel_bind_timeline |
| - anv: Switch to truly asynchronous VM binding in Xe KMD |
| - iris: Switch to truly asynchronous VM binding in Xe KMD |
| - intel: Fix intel_get_mesh_urb_config() |
| - anv: Drop include to common/i915/intel_gem.h |
| - intel/common: Fix location of C++ support macro in intel_gem.h |
| - intel: Remove circular dependency between intel/dev and intel/common |
| - intel/common: Add intel_engines_supported_count() |
| - anv: Use intel_engines_supported_count() |
| - iris: Use intel_engines_supported_count() |
| - intel: Sync i915_drm.h |
| - intel/common: Implement i915_engines_is_guc_semaphore_functional() |
| - intel: Sync xe_drm.h |
| - intel/common: Implement xe_engines_is_guc_semaphore_functional() |
| - iris: Fix iris_batch_is_banned() check |
| - anv: Use DRM_XE_VM_BIND_OP_UNMAP_ALL to unbind whole bos |
| - docs/anv: Add recommended GuC firmware version |
| - iris: Set (EXEC_OBJECT_SUPPORTS_48B_ADDRESS | EXEC_OBJECT_PINNED) in a single place |
| - iris: Remove iris_bo::kflags |
| - iris: Move i915 set and get tiling uAPI calls to i915 specific code |
| - iris: Remove more i915_drm.h includes from common code |
| - intel: Move intel_define.h to i915/intel_define.h |
| - intel/common: Remove more i915_drm.h includes from common code |
| - intel/tools/error_decode: Add function to try to open error dump file |
| - intel/tools/error_decode: Simply error message handling |
| - intel/tools/error_decode: Add support to search for Xe KMD error dumps |
| - intel/tools/error_decode: Detect and split error dump file parsing by KMD |
| - intel: Sync xe_drm.h |
| - anv/xe: Add VMs to error dump |
| - iris/xe: Add VMs to error dump |
| - intel/tools/error_decode: Move code that can be shared between i915 and Xe error decoders |
| - intel/tools/error_decode: Parse Xe KMD error dump file |
| - intel/tools: Fix compilation in 32 bits |
| - intel/nullhw: Fix 32bits compilation warnings |
| - iris: Add IRIS_HEAP_DEVICE_LOCAL_CPU_VISIBLE_SMALL_BAR heap type |
| - iris: Force lmem cpu accessible for bos with clear-color |
| - iris/xe: Consider pat_index while unbinding the bo |
| - anv: Call flush_pipeline_select_gpgpu() for compute engines in compute code paths |
| - anv: Skip cmd_buffer_emit_bt_pool_base_address() in blitter and video engines |
| - intel: Drop pre-production steppings |
| - anv: Fix Xe KMD userptr unbind |
| - intel/dev: Nuke 'ver == 10' check |
| - intel/dev: Nuke display_ver |
| - intel: Enable Xe KMD support by default |
| - iris: Set BO_ALLOC_NO_SUBALLOC when allocating bo for slab |
| - anv: Replace the 2 sparse booleans by 1 enum |
| - anv: Set VK_QUEUE_PROTECTED_BIT during queue families initialization |
| - anv: Set VM control to true in Xe KMD |
| - intel/tools/error_decode: Fix parsing in Xe decoder |
| - intel/tools/error_decode: Add function to print batch in Xe decoder |
| - intel/tools/error_decode: Parse HW context in Xe decoder |
| - iris: Move tiling_to_modifier() implementation to i915 folder |
| - iris: Remove i915_drm.h include from iris_indirect_gen.c |
| - intel/decoder: Fix binding table pointer entry being marked as invalid |
| - anv: Set STATE_COMPUTE_MODE mask bit when zeroing compute mode |
| - intel/genxml: Add more instdone registers |
| - intel/genxml/gfx125: Fix definition of INTERFACE_DESCRIPTOR_DATA::Thread group dispatch size |
| - intel/genxml/xe2: Update definition of INTERFACE_DESCRIPTOR_DATA |
| - anv: Create protected engine context when i915 supports vm control |
| - anv: Remove protected memory types from default_buffer_mem_types |
| - intel/tools/error2hangdump: Print out_filename when failed to open it |
| - intel/tools/error2hangdump: Replace drm_i915_gem_engine_class by intel_engine_class |
| - intel/tools: Move Xe KMD error decode functions to a separated file |
| - intel/tools: Move ascii85_decode_char() to error_decode_lib |
| - intel/tools: Move more Xe KMD error decode functions to error_decode_xe_lib |
| - intel/tools/error2hangdump: Move code that will be shared with Xe parser to error2hangdump_lib |
| - intel/tools/error2hangdump: Move i915 parser to a function |
| - intel/tools/error2hangdump: Add Xe KMD support |
| - anv: Add missing ANV_BO_ALLOC_INTERNAL |
| - iris: Add comments to BO_ALLOC flags |
| - iris: Avoid creation of slabs and cache buckets of lmem heaps in integrated gpus |
| - iris: Avoid allocation of not needed iris_bucket_cache |
| - intel/tools/aubinator_error_decode: Move definition of option_color to header |
| - intel/decoder: Add intel_print_group_custom_spacing() |
| - intel/tools: Parse INSTDONE registers in Xe KMD error dump |
| - intel: Sync xe_drm.h |
| - intel/dev: Read GFX IP version during runtime |
| |
| Juan A. Suarez Romero (32): |
| |
| - Revert "v3d: use kmsro to create drm screen on real hw" |
| - v3d: show warning on creating a v3d screen on real hw |
| - v3d/vc4/ci: reset the list of timeout tests |
| - Revert "v3d: show warning on creating a v3d screen on real hw" |
| - broadcom/simulator: protect simulator BO rallocs with mutexes |
| - v3d/ci: run OpenGL 3.1 tests |
| - v3dv/ci: increase timeout for full jobs in 30min |
| - ci: disable Igalia farm |
| - Revert "ci: disable Igalia farm" |
| - Revert "ci: disable Igalia farm" |
| - v3d/ci: update expected results |
| - v3d/ci: update expected list |
| - Revert "v3d/ci: update expected list" |
| - vc4/ci: update expected list |
| - v3d/ci: add new failures |
| - v3dv/ci: update expected list |
| - v3dv/ci: remove crashes from expected list |
| - v3d,v3d: use new simulator |
| - v3dv: disable Early Z for multisampled 16-bit depth buffers |
| - v3d: disable Early Z for multisampled 16-bit depth buffers |
| - broadcom/compiler: fix SFU check for 7.1 |
| - v3dv: mark some promoted extensions as supported |
| - v3d: add load_fep_w_v3d intrinsic |
| - v3d: fix line coords with perspective projection |
| - compiler,gallium: move u_reduced_prim to common |
| - v3dv: assume that rasterization state can be NULL |
| - v3dv: enable smooth line rendering |
| - broadcom/ci: add new expected failures |
| - v3d: configure polygon mode when enabled |
| - broadcom/ci: update expected results |
| - v3dv/ci: update expected list |
| - nir/lower_clip: update inputs/ouputs read/written bitmask |
| |
| Juston Li (13): |
| |
| - venus: refactor query feedback cmds |
| - venus: acquire mutex when recycling query feedback cmds |
| - venus: free query batches for VK_COMMAND_POOL_RESET_RELEASE_RESOURCES_BIT |
| - venus: add comments for query feedback batch free list |
| - venus: recycle linked query feedback immediately during submission |
| - venus: handle empty resolved query feedback list |
| - venus: fix image reqs cache store locking |
| - venus: extract cache hash/equals functions into common |
| - venus: image format properties cache |
| - venus: move feedback on empty last batch to prior batch |
| - venus: fix VkDeviceGroupSubmitInfo cmd counts from feedback |
| - venus: extend device format prop cache with VkFormatProperties3 |
| - Revert "zink: store last pipeline directly for zink_gfx_program::last_pipeline" |
| |
| Kai Wasserbäch (2): |
| |
| - fix(FTBFS): clc: adapt to new LLVM 19 DiagnosticHandlerTy |
| - fix(FTBFS): clover: adapt to new LLVM 19 DiagnosticHandlerTy |
| |
| Karol Herbst (131): |
| |
| - nak/algebraic: merge run and main function |
| - nak/algebraic: write code to an output file |
| - clc: use spirv triple starting with llvm-17 |
| - clc: add support for the native spir-v backend |
| - rusticl/kernel: run opt/lower_memcpy later to fix a crash |
| - rusticl/kernel: add a few comments in regards to pass ordering |
| - rusticl/kernel: no need to reset the scratch size anymore |
| - nir/printf: remove treat_doubles_as_floats |
| - clc: require LLVM-14 |
| - clc: merge blocks handling optional features |
| - clc: require LLVM-15 |
| - nir: rework and fix rotate lowering |
| - rusticl/program: rework debug logging option |
| - rusticl/spirv: do not attempt to parse spirv after failed link |
| - rusticl/spirv: use bool::then_Some inside SPIRVBin::link |
| - rusticl/program: add clc_validator_options helper function |
| - rusticl/program: add debug option to validate internal spirvs |
| - nak/opt_out: fix comparison in try_combine_outs |
| - nak: simplify phi_dsts |
| - nak: make it compile with clippy |
| - rusticl/meson: use rust_abi instead of rust_crate_type |
| - rust/spirv: fix clippy lint on unneeded late initialization |
| - rusticl/kernel: check that local size on dispatch doesn't exceed limits |
| - nak/meson: specify rust flags globally and allow some clippy lints |
| - nak: fix clippy::extra_unused_lifetimes warnings |
| - nak: fix clippy::mem_replace_with_default warnings |
| - nak: fix clippy::useless_conversion warnings |
| - nak: fix clippy::needless_lifetimes warnings |
| - nak: fix clippy::needless_borrow warnings |
| - nak: fix clippy::while_let_loop warnings |
| - nak: fix clippy::match_like_matches_macro warnings |
| - nak: fix clippy::needless_return warnings |
| - nak: fix clippy::redundant_closure warnings |
| - nak: fix clippy::unwrap_or_default warnings |
| - nak: fix clippy::manual_while_let_some warnings |
| - nak: fix clippy::clone_on_copy warnings |
| - nak: fix clippy::single_match warnings |
| - rusticl/util: add a wrapper around "thread-safe" C types |
| - rusticl/mesa/device: convert to ThreadSafeCPtr |
| - rusticl/mesa/screen: convert to ThreadSafeCPtr |
| - rusticl/mesa: add thread-safe wrapper for pipe_image_views |
| - rusticl/context: store SVM pointers as usize |
| - rusticl/gl: mark GLCtxManager as Send + Sync |
| - rusticl/mem: make Mem Send/Sync by storing mapping ptrs as usize |
| - rusticl/program: mark NirKernelBuild as Send and Sync |
| - meson: remove opencl-external-clang-headers option and rely on shared-llvm |
| - clc: force fPIC for every user when using shared LLVM |
| - nir/lower_cl_images: record image_buffers and msaa_images |
| - rusticl/mem: properly handle buffers |
| - rusticl/mem: support GL_TEXTURE_BUFFER |
| - rust/api: add RustTypes enum |
| - rusticl/util: support nested structs in offset_of! |
| - rusticl/api: allow CLObjectBase to be placed anywhere |
| - rusticl/icd: move get_ref_vec_from_arr into the Rusticl type |
| - rusticl/icd: move refcnt() and get rid of needless atomic ops |
| - rusticl/icd: move retain() and release() |
| - rusticl/icd: move get_arc_vec_from_arr and rename it |
| - rusticl/icd: fold leak_ref into its only consumer |
| - rusticl/icd: move get_ref() |
| - rusticl/device: deduplicate devices with sorting |
| - rusticl/icd: move from_arc() and rename it |
| - rusticl/event: drop from_cl_arr and use arcs_from_arr |
| - rusticl/icd: move get_arc() and rename it |
| - rusticl/icd: split Arc part out of CLObject into new trait |
| - rusticl/device: get rid of pointless Arc overhead |
| - rusticl/icd: actually allow dispatching CL types |
| - rusticl/mem: split into Buffer and Image |
| - rusticl/mem: use pattern matching in is_parent_buffer |
| - rusticl/mem: move fill methods into concrete types |
| - core/memory: drop Arc for &Arc<Queue> function parameters |
| - rusticl/mem: move map methods into concrete types |
| - rusticl/mem: move shadow sync methods into concrete types |
| - rusticl/mem: split unmap into Buffer and Image versions |
| - rusticl/mem: move copy and write buffer impls into Buffer |
| - rusticl/mem: split read_to_user_rect into Buffer and Image versions |
| - rusticl/mem: split write_from_user_rect into buffer and image |
| - rusticl/mem: move copy_to_rect into Buffer |
| - rusticl/mem: split copy_to into Buffer and Image |
| - rusticl/mem: split Buffer::copy_to into Buffer and Image versions |
| - rusticl/mem: split Image::copy_to into Buffer and Image versions |
| - rusticl/mem: get rid of pixel_size |
| - rusticl/mem: move tx_image into Image |
| - rusticl/mem: fold tx_raw into tx |
| - rusticl/image: call tx on the parent buffer directly |
| - rusticl/mem: move is_parent_buffer into Image |
| - rusticl/mem: move tx into Buffer |
| - rusticl/mem: remove get_res |
| - rusticl/mem: move comment describing how mapping works |
| - rusticl/mem: reorganize Image::map |
| - rusticl/mem: move MemBase::map into the users |
| - rusticl/mem: move tx_raw_async methods into Buffer and Image |
| - rusticl/mem: move Buffer and Image specific fields into the subtypes |
| - rusticl/mem: reorganize MemBase::from_gl a little |
| - rusticl/mem: move pipe_image_host_access into Image |
| - rusticl/kernel: recalculate scratch and shared memory after opts |
| - rusticl/program: fix CL_PROGRAM_BINARIES for devs with no builds |
| - meson/rusticl: import rust instead of unstable-rust |
| - clc: include opencl-c.h for extensions needing it |
| - meson: do not pull in clc for clover |
| - intel: Only build shaders with anv and iris |
| - zink: lower unaligned memory accesses |
| - rusticl/context: complete conversion of SVM pointers to usize |
| - rusticl/memory: store host_ptr as usize |
| - rusticl/memory: make closures Send and Sync |
| - rusticl/event: make EventSig Send + Sync |
| - rusticl/spirv: mark SPIRVBin as Send and Sync |
| - rusticl/kernel: make it Send and Sync |
| - rusticl/icd: verify all cl classes are Send and Sync |
| - rusticl/meson: remove -Aclippy::arc-with-non-send-sync flag |
| - rusticl/kernel: make builds private |
| - rusticl/event: we need to call the CL_COMPLETE callback on errors as well |
| - rusticl/kernel: assign sampler locations before DCEing variables |
| - rusticl/device: support query_memory_info to retrieve available memory |
| - drm-uapi: Sync nouveau_drm.h |
| - nvk: use c.get_supported_arguments for compiler flags |
| - nouveau: import libdrm_nouveau |
| - nouveau: call glsl_type_singleton_init_or_ref earlier |
| - nouveau/drm: drop immediate parameter from nouveau_pushbuf_new |
| - nouveau/drm: rely on nouveau_pushbuf::channel being always set |
| - nouveau/drm: drop unsued chan argument from nouveau_pushbuf_kick |
| - nouveau/drm: remove nouveau_client::id |
| - rusticl/util: make create_pipe_box indepentend of pipe_box's field types |
| - meson: fix link failure with llvm-18 |
| - rusticl/program: handle -cl-no-subgroup-ifp |
| - nouveau: fix potential double-free in nouveau_drm_screen_create |
| - nir: fix nir_shader_get_function_for_name for functions without names. |
| - rusticl: use stream uploader for cb0 if prefered |
| - rusticl/icd: remove CLObject |
| - event: break long dependency chains on drop |
| - rusticl/mesa/context: flush context before destruction |
| - nir/lower_cl_images: set binding also for samplers |
| |
| Kenneth Graunke (76): |
| |
| - iris: Don't return timestamps modulo 36-bits |
| - intel/dev: Fix typo (ajust -> adjust) |
| - iris: Implement query_memory_info() on discrete cards |
| - intel/nir: Pass devinfo and prog_data to brw_nir_lower_cs_intrinsics |
| - intel: Add driver support for hardware generated local invocation IDs |
| - intel: Use hardware generated compute shader local invocation IDs |
| - driconf: Advertise GL_EXT_shader_image_load_store on iris for SVP13 |
| - iris: Implement INTEL_DEBUG=heaps |
| - intel/fs: Don't include sync.nop in instruction count statistics |
| - intel/fs: Don't rely on CSE for VARYING_PULL_CONSTANT_LOAD |
| - intel/brw: Delete enum brw_urb_write_flags |
| - intel/brw: Delete more unused defines |
| - intel/brw: Delete legacy SFIDs |
| - intel/brw: Delete SIMD4x2 URB opcodes |
| - intel/brw: Delete more unused compression stuff |
| - intel/brw: Delete SINCOS |
| - intel/brw: Delete constant_buffer_0_is_relative |
| - intel/brw: Delete compiler->supports_shader_constants |
| - intel/brw: Delete enum gfx6_gather_sampler_wa |
| - intel/brw: Delete brw_wm_prog_key::line_aa |
| - intel/brw: Delete unnecessary brw_wm_prog_data fields |
| - intel/brw: Delete some swizzling functions |
| - intel/brw: Delete brw_eu_util.c |
| - intel/brw: Change unit tests to use TEX_LOGICAL instead of TEX |
| - intel/brw: Delete SHADER_OPCODE_TXF_CMS[_LOGICAL] |
| - intel/brw: Delete SHADER_OPCODE_TXF_UMS |
| - intel/brw: Allow CSE on TXF_CMS_W_GFX12_LOGICAL |
| - intel/brw: Delete legacy texture opcodes |
| - intel/brw: Mark FIND[_LAST]_LIVE_CHANNEL as not writing the flag |
| - intel/brw: Replace CS_OPCODE_CS_TERMINATE with SHADER_OPCODE_SEND |
| - intel/brw: Avoid copy propagating any fixed registers into EOTs |
| - intel/brw: Handle SHADER_OPCODE_SEND without src[3] in copy prop |
| - intel/brw: Add assertions that EOT messages live in g112+ |
| - intel/brw: Copy the smaller payload in fixup_sends_duplicate_payload |
| - intel/brw: Make register coalescing obey the g112-g127 restriction |
| - intel/brw: Call constant combining after copy propagation/algebraic |
| - intel/brw: Remove SIMD lowering to a larger SIMD size |
| - intel/brw: Unindent code after previous change |
| - iris: Fix tessellation evaluation shaders that use scratch |
| - intel/brw: Emit better code for read_invocation(x, constant) |
| - iris: Remove suballocation in iris_flush_resource() |
| - iris: Eliminate prototype introduced in the previous patch |
| - ra: Add debug functions for printing spill costs and benefits |
| - intel/fs: Avoid generating useless UNDEFs for every SSA def |
| - intel/brw: Split out 64-bit lowering from algebraic optimizations |
| - intel/brw: Don't consider UNIFORM_PULL_CONSTANT_LOAD a send-from-GRF |
| - intel/brw: Eliminate top-level FIND_LIVE_CHANNEL & BROADCAST once |
| - intel/brw: Fix check for 64-bit SEL lowering types |
| - intel/brw: Assert that min/max are not happening in 64-bit SEL lowering |
| - intel/brw: Use correct execution pipe for lowering SEL on DF |
| - intel/brw: Unify DF and Q/UQ lowering for MOV |
| - Revert "intel/brw: Don't consider UNIFORM_PULL_CONSTANT_LOAD a send-from-GRF" |
| - intel/brw: Fix opt_split_sends() to allow for FIXED_GRF send sources |
| - intel/brw: Fix register coalescing's LOAD_PAYLOAD dst offset handling |
| - intel/brw: Fix destination stride assertion in copy propagation |
| - intel/brw: Allow changing types for LOAD_PAYLOAD with 1 source |
| - intel/brw: Delete brw_fs_lower_minmax |
| - anv, hasvk: Save the original instance ID |
| - anv, hasvk: Move multiview remapping loop below output stores |
| - anv, hasvk: Fix nir_lower_multiview to re-emit outputs before EmitVertex |
| - intel/brw: Stop checking mlen on math opcodes in CSE pass |
| - intel/brw: Rearrange fs_inst fields |
| - intel/brw: Fix generate_mov_indirect to check has_64bit_int not float |
| - intel/brw: Fix lower_regioning for BROADCAST, MOV_INDIRECT on Q types |
| - intel/brw: Update comments for indirect MOV splitting |
| - intel/brw: Don't mention gfx7 limitations in shuffle comments |
| - intel/brw: Drop dead CHV checks. |
| - intel/brw: Drop align16 support in brw_broadcast() |
| - intel/brw: Drop gfx7 scratch message setup code |
| - intel/brw: Delete if_depth_in_loop |
| - intel/brw: Delete fs_visitor::vgrf helper |
| - intel/brw: Drop default size of 1 from bld.vgrf() calls |
| - intel/brw: Use SHADER_OPCODE_SEND for coherent framebuffer reads |
| - intel/brw: Replace FS_OPCODE_LINTERP with BRW_OPCODE_PLN |
| - intel/brw: Make an fs_builder::SYNC helper |
| - isl: Set MOCS to uncached for Gfx12.0 blitter sources/destinations |
| |
| Konrad Dybcio (1): |
| |
| - freedreno/registers: Add some HWCG regs |
| |
| Konstantin (8): |
| |
| - util/printf: Include stdio.h |
| - util/printf: Expose util_printf_prev_tok |
| - ac/debug: Handle the output of recent umr versions |
| - radv/debug: Canonicalize shader addr |
| - radv: Canonicalize addresses in radv_find_shader |
| - radv/debug: Try to find unbound shaders |
| - radv/debug: Dump descriptor binding information |
| - ac/parse_ib: Always print the value of the whole register |
| |
| Konstantin Seurer (105): |
| |
| - nak/repair_ssa: Remap PHI sources as well |
| - ac/llvm: Enable helper invocations for quad OPs |
| - radv: Vectorize load_global_constant |
| - lavapipe: Fix DGC vertex buffer handling |
| - gallivm: Use saturating fpto*i conversions |
| - lavapipe: Mark vertex elements dirty if the stride changed |
| - lavapipe: Report the correct preprocess buffer size |
| - radv: Implement NIR debug printf |
| - llvmpipe: Stop refcounting sample functions |
| - llvmpipe: Compile sample functioins on demand |
| - radv/rt: Use doubles inside intersect_ray_amd_software_tri |
| - llvmpipe: Fix building with llvm11 |
| - nir/print: Don't print shared_size twice |
| - nir/print: Rename workgroup-size to workgroup_size |
| - radv/radix_sort: clang-format |
| - radv: Reduce the amount of radv_device_to_handle calls |
| - radv: Make radv_write_user_event_marker non-static |
| - radv: Emit user events during acceleration structure builds |
| - radv: Skip unused acceleration structure build paths |
| - radv/sqtt: Set SeparateCompiled for monolithic RT pipelines |
| - radv/sqtt: Handle ray tracing pipelines with no traversal shader |
| - radv/rt: Lower ray payloads like hit attribs |
| - radv/rra: Rename rra_chunk_type to rra_chunk_version |
| - radv/rra: Use memcpy for chunk descriptions |
| - radv/rra: Remove useless variable |
| - radv/rra: Refactor error handling |
| - radv/rra: Dump basic ray history tokens |
| - docs: Document RADV_RRA_TRACE_HISTORY_SIZE |
| - radv/rra: Implement ahit/isec counters |
| - amd/common: Use the correct register table for GFX10_3 |
| - radv: Wire up ac_gather_context_rolls |
| - zink: Always set mfence->submit_count to the fence submit_count |
| - Revert "zink: always force flushes when originating from api frontend" |
| - llvmpipe: Use full subgroups when possible |
| - gallivm: Consider the initial mask when terminating loops |
| - lavapipe: Advertise VK_KHR_shader_maximal_reconvergence |
| - ci: Update llvmpipe trace checksums |
| - ac/parse_ib: Add and use print_addr |
| - ac/parse_ib: Dump the ADDR field of PKT3_SET_BASE |
| - ac/parse_ib: Annotate addresses with UAF/OOB info |
| - ac/parse_ib: Handle 32bit PKT3_DISPATCH_INDIRECT addrs |
| - ac/parese_ib: Handle more packets |
| - radv/rra: Avoid reading past the ray history buffer |
| - radv/meta: Add shader - device mapping for radv_build_printf |
| - vulkan/cmd_queue: Implement CmdBuildAccelerationStructuresKHR |
| - lavapipe: Implement VK_KHR_acceleration_structure |
| - lavapipe: Add ray traversal code |
| - lavapipe: Implement VK_KHR_ray_query |
| - lavapipe: Advertise VK_KHR_deferred_host_operations |
| - lavapipe: Advertise VK_KHR_acceleration_structure |
| - lavapipe: Advertise VK_KHR_ray_query |
| - lavapipe/ci: Document ray query failures |
| - docs: Document lavapipe ray tracing features |
| - vulkan: Implement DebugMarkerSetObjectNameEXT |
| - radv/rt: Implement RADV_DEBUG=shaderstats |
| - radv/rt: Add radv_ray_tracing_stage_info |
| - radv/rt: Fixup constant args |
| - aco: Only fix used variables to registers |
| - radv/rt: Avoid passing unused data to the next stage |
| - radv/rt: Inline constant trace_ray srcs into the traversal shader |
| - radv/rt: Inline constant information about ray flags |
| - radv/rt: Fix raygen_imported condition |
| - zink: Handle aoa derefs of images |
| - ac: Annotate context rolls |
| - ac/parse_ib: Replace the parameter list with ac_ib_parser |
| - ac/parse_ib: Implement annotations |
| - radv: Add support for IB annotations |
| - radv: Add an IB annotation layer |
| - ac: Improve context roll readability |
| - radv: Use radv_buffer_map for parsing IBs |
| - radv/rt: Use 32-bit offsets for load_sbt_entry |
| - radv: Skip more acceleration structure build markers |
| - radv/printf: Use fprintf instead of printf |
| - nir/print: Fix printing booleans with bit_size>1 |
| - nir/serialize: Encode data for temporaries |
| - nir: Add lavapipe ray tracing intrinsics |
| - llvmpipe: Fix function call handling |
| - lavapipe: Add lvp_spirv_to_nir |
| - lavapipe: Make lvp_shader_init non-static |
| - lavapipe: Make lvp_create_pipeline_nir non-static |
| - lavapipe: Lower mem_constant variables |
| - lavapipe: Defer binding compute state |
| - lavapipe: Remove unused ray tracing variables |
| - lavapipe: Add more ray tracing helpers |
| - lavapipe: Pass lvp_ray_flags into lvp_aabb_intersection_cb |
| - lavapipe: Use the pipeline type in get_pcbuf_size |
| - lavapipe: Inline fill_ubo0 |
| - lavapipe: Add an api_stage parameter to update_pcbuf |
| - lavapipe: Fix a memory leak in lvp_push_internal_buffer |
| - lavapipe: Implement VK_KHR_ray_tracing_pipeline |
| - lavapipe: Implement KHR_ray_tracing_maintenance1 |
| - lavapipe: Implement VK_EXT_pipeline_library_group_handles |
| - lavapipe: Implement VK_KHR_ray_tracing_position_fetch |
| - radv: Destroy leaf_updateable_pipeline |
| - lavapipe: Handle accel struct queries in handle_copy_query_pool_results |
| - lavapipe: Implement ray_tracing_maintenance1 queries |
| - lavapipe: Do nort use NIR_PASS during lowering |
| - lavapipe: Handle multiple planes in GetDescriptorEXT |
| - lavapipe: Explicitely support ycbcr formats |
| - Revert "gallivm/ssbo: mask offset with exec_mask instead of building the 'if'" |
| - radv: Handle all dependencies of CmdWaitEvents2 |
| - nir/print: Do not access invalid indices of load_uniform |
| - radv: Fix radv_shader_arena_block list corruption |
| - radv: Remove arenas from capture_replay_arena_vas |
| - radv: Zero initialize capture replay group handles |
| |
| Krzysztof Kurek (1): |
| |
| - panfrost: fix shift overflow in \`bi_fold_constant` |
| |
| Leo Liu (2): |
| |
| - radeonsi: fix video processing path without VPE enabled |
| - ac/gpu_info: Fix broken UVD firmware query |
| |
| Lepton Wu (1): |
| |
| - llvmpipe: Set "+64bit" for X86_64 |
| |
| Lin, Ricky (1): |
| |
| - amd/vpelib: Rename the parameters of init vpe function |
| |
| Lionel Landwerlin (186): |
| |
| - anv: fix disabled Wa_14017076903/18022508906 |
| - intel/aux_map: fix fallback unmapping range on failure |
| - anv: hide vendor ID for The Finals |
| - intel/decoder: make vertex data decoding optional |
| - intel/decoder: don't ignore BT entries at offset 0 |
| - intel/genxml: add CCS_INSTDONE register |
| - intel/genxml: add GAM done register description |
| - intel/hang_viewer: add aux-tt view |
| - anv: export descriptor flushing functions |
| - anv: fix include guards |
| - anv: fix missing header |
| - anv: move generated draw flush helper to its own file |
| - anv: move draw commands to their own file |
| - anv: move compute/ray-tracing commands to their own file |
| - anv: rename video command file |
| - nir/alu_srcs_negative_equal: bail earlier if possible |
| - nir/comparison_pre_tests: update expectations |
| - anv: using a single struct for kernel upload |
| - anv: fix pipeline executable properties with graphics libraries |
| - isl: add print helpers for debug |
| - anv: implement undocumented tile cache flush requirements |
| - anv: reorder anv_astc_emu.c |
| - anv: remove unused perfetto declarations |
| - anv: rename layers entrypoints |
| - anv: add BO flag for internal driver allocations |
| - anv: track total state stream allocated blocks from the pool |
| - anv: track imported ray tracing pipeline groups |
| - anv: initial RMV support |
| - vulkan/runtime: handle new image layout |
| - anv: don't prevent L1 untyped cache flush in 3D mode |
| - anv: promote EXT_index_type_uint8 to KHR |
| - anv: promote EXT_line_rasterization to KHR |
| - anv: promote EXT_load_store_op_none to KHR |
| - anv: add missing alignment for AUX-TT mapping |
| - intel/ds: track predication of blorp operations |
| - vulkan/runtime: add helper to query attachment layout |
| - anv: ensure consistent layout transitions in render passes |
| - anv: add check that in renderpass barriers apply to attachments |
| - anv: handle image feedback loop usage |
| - anv: implement VK_EXT_attachment_feedback_loop_dynamic_state |
| - anv/hasvk: don't report error when intel_get_device_info_from_fd fails |
| - anv: factor out aux-tt binding logic for future reuse |
| - anv: rename aux_tt image field |
| - anv: retain ccs image binding address |
| - anv: fix transfer barriers flushes with compute queue |
| - vulkan/runtime: handle new dynamic states for attachment remapping |
| - docs/features: drop gen8+/gen9+ on Anv |
| - docs/features: synchronize new features for Anv |
| - vulkan/multialloc: bump max number to 16 |
| - vulkan/runtime: rework VK_KHR_dynamic_rendering_local_read state tracking |
| - anv: reduce cache flushing for indirect commands on Gfx12.5+ |
| - anv: don't unmap AUX ranges at BO delete |
| - isl: printout sparse usage |
| - isl: add a no-aux-align usage flag |
| - anv: move ALLOC_HOST_CACHED_COHERENT as define |
| - anv: use address helper to compute address u64 value |
| - intel/aux_map: add BSpec reference |
| - intel/aux_map: add helper to compute offset in aux data |
| - anv: re-introduce BO CCS allocations |
| - intel/dev: fix missing dependency on generated packing heaers |
| - anv: factor out post submit queue debug code |
| - intel/fs: indent lowering code to make it more readable |
| - intel/fs: rerun divergence prior to lowering non-uniform interpolate at sample |
| - anv: fix incorrect flushing on shader query copy |
| - meson: add a new option to enable intel-clc without building RT shaders |
| - intel/compiler: make default NIR compiler options visible |
| - intel-clc: move ISA generation to its own function |
| - intel/clc: add ability to output NIR |
| - intel-clc: print text input |
| - genxml: enable opencl code generation |
| - genxml: generate opencl packing headers |
| - genxml: remove NDEBUG_UNUSED |
| - intel/ds: new tracepoints for generated commands |
| - meson: add option to install intel-clc |
| - ci: build a host version of mesa for cross builds |
| - anv: rewrite internal shaders using OpenCL |
| - intel/shaders: add iris variant of indirect draws generation shader |
| - intel/shaders: enable gfx8 support |
| - iris: make binding table shifting values available outside iris_state.c |
| - iris: make KSP helper available outside iris_state.c |
| - iris: make URB programming available outside iris_state.c |
| - iris: factor out index buffer emission |
| - iris: add an option for not emit draw parameters |
| - iris: enable generated indirect draws |
| - meson: enforce build of intel-clc with anv/iris |
| - anv: remove redundant asserts |
| - anv: don't allocate aux padded BOs with host pointers |
| - anv: fix buffer marker cache flush issues on MTL |
| - anv: enable query clear/copy using shaders on MTL/ARL |
| - anv: fixup push descriptor shader analysis |
| - anv: factor out descriptor buffer flushing |
| - anv: reenable ANV_ALWAYS_BINDLESS |
| - anv: remove unused definition |
| - anv: fix Wa_16013994831 macros |
| - anv: fix emission of Wa_14015055625 |
| - genxml: generate opencl temporary variables with private qualifier |
| - intel/clc: lower temp function/shader variables together |
| - intel/clc: workaround LLVM17 opaque pointers |
| - anv: disable Wa_16013994831 |
| - ci/anv: add more testing for optimization paths |
| - intel/ci: bump anv/tgl fraction to 6 |
| - intel/nir: only consider ray query variables in lowering |
| - anv: limit depth flush on dynamic render pass suspend |
| - anv: add missing generated file dep |
| - anv: optimize push descriptor updates |
| - anv: add new heap/pool for descriptor buffers |
| - anv: create new helper for small allocations |
| - anv: add a second dynamic state heap for descriptor buffers |
| - anv: move aux-tt to general state pool |
| - anv: allocate slice_hash for descriptor buffer |
| - anv: allocate border colors for descriptor buffers |
| - anv: allocate fsr states for descriptor buffer |
| - anv: implement data write entry points for EXT_descriptor_buffer |
| - anv: compute a sampler hash based on parameters |
| - anv: add embedded sampler parameters in descriptor set layout hash |
| - intel/fs: add plumbing for embedded samplers |
| - nir: add additional flag to resource_intel for embedded samplers |
| - anv: add embedded sampler support |
| - anv: add new helper to update binding table pool offset |
| - anv: add descriptor set layout support for descriptor buffers |
| - anv: add pipeline/shader support for descriptor buffers |
| - anv: handle push descriptor writes with descriptor buffers |
| - anv: implement descriptor buffer binding |
| - anv: disable mutable combined image/sampler in descriptor buffer |
| - anv: expose VK_EXT_descriptor_buffer |
| - anv: fix non matching image/view format attachment resolve |
| - anv: fix helper usage for CmdUpdateBuffer() |
| - anv: remove some wrapping around mmap |
| - anv: add support for VK_EXT_map_memory_placed |
| - anv: delay internal shader upload to when needed |
| - anv: fix companion command buffer initialization |
| - anv: fix incorrect ISL usage in buffer view creation |
| - anv/iris/blorp: use the right MOCS values for each engine |
| - anv: try to keep the pipeline in GPGPU mode when buffer transfer ops |
| - anv: don't copy the null descriptor from the GPU memory |
| - intel/fs: fixup sampler header message |
| - anv: return unsupported for FSR images on Gfx12.0 |
| - intel/fs: remove some unused send helpers |
| - anv: ignore descriptor alignment for inline uniforms |
| - intel/fs: bump max simd size of some messages for xe2 |
| - anv: track embedded sampler counts in layouts |
| - anv: allocate pipeline bindings tables dynamically on the heap |
| - anv: avoid partially compiled warning with GPL |
| - blorp: handle a few allocation failure cases |
| - anv: fix invalid border color free |
| - anv: fix block pool allocation failure |
| - anv: fix temporary state pool allocation failures |
| - anv: fix bitfield checks in gfx runtime flushing |
| - anv: fix query clearing with blorp compute operations |
| - blorp: add support for cached dynamic states |
| - anv: reduce blorp dynamic state emissions |
| - anv: optimize emission of dynamic state with blorp |
| - anv: fix protected memory allocations |
| - anv: pull surface state copies for secondary in one loop |
| - anv: disable protected content around surface state copies |
| - anv: disable generated draws in protected command buffers |
| - anv: update protection fault property |
| - anv: fix incorrect blorp dynamic state heap usage |
| - intel/fs: printout a couple of more late compile steps |
| - intel/fs: fixup instruction scheduling last grf write tracking |
| - anv: add missing data flush out of L3 for transform feedback writes |
| - anv: mark descriptors & pipeline dirty after blorp compute |
| - isl: set NullPageCoherencyEnable for depth/stencil sparse surfaces |
| - anv: only check patch_control_points changes in runtime flush |
| - anv: increase maxResourceDescriptorBufferRange on DG2+ |
| - anv: reuse vk_common_GetImageSubresourceLayout |
| - anv: move all format props checks to anv_get_image_format_properties() |
| - drirc: rename hasvk only option |
| - vulkan: track compression control flags on vk_image |
| - anv: implement VK_EXT_image_compression_control |
| - anv: disable capture replay with descriptor buffer |
| - anv: remove useless dynamic state allocation for samplers |
| - anv: add capture/replay support for image with descriptor buffers |
| - anv: add capture/replay support for buffer with descriptor buffers |
| - anv: add a new reserved pool for capture/release |
| - anv: enable shader border color capture/replay |
| - anv: enable capture/replay with descriptor buffers |
| - anv: disable dual source blending state if not used in shader |
| - intel/brw: fixup wm_prog_data_barycentric_modes() |
| - anv: fixup alloc failure handling in reserved_array_pool |
| - anv: fix leak of custom border colors |
| - anv: fix ycbcr plane indexing with indirect descriptors |
| - brw: add more condition for reducing sampler simdness |
| - anv: fix push constant subgroup_id location |
| - nir/divergence: add missing load_printf_buffer_address |
| - anv: use weak_ref mode for global pipeline caches |
| |
| Louis-Francis Ratté-Boulianne (5): |
| |
| - panfrost: factor out method to check whether we can discard resource |
| - panfrost: add copy_resource flag to pan_resource_modifier_convert |
| - panfrost: add can_discard flag to pan_legalize_afbc_format |
| - panfrost: Legalize before updating part of a AFBC-packed texture |
| - panfrost: Add AFBC packing support for RG formats |
| |
| Luc Ma (1): |
| |
| - gallium/u_blitter: Fix a few uninitialized fb_state |
| |
| Luca Bacci (1): |
| |
| - meson,windows: Use relative paths in Vulkan ICD manifest files |
| |
| Lucas Fryzek (19): |
| |
| - egl/wayland/sw: don't invert y \`wl_surface_damage_buffer` |
| - drisw/winsys: Flip y coordinate when creating pipe boxes |
| - drisw: clamp damage region to texture bounds |
| - llvmpipe: explicitly reject (most) yuv formats |
| - gallium: Add dmabuf arg to memory fd allocation API |
| - llvmpipe: Implement dmabuf handling |
| - drisw: reuse kopper image extension vtable if modifiers/dmabuf is supported |
| - llvmpipe: conditionally export PIPE_CAP_DMABUF |
| - lavapipe: support VK_EXTERNAL_MEMORY_HANDLE_TYPE_DMA_BUF_BIT_EXT |
| - lavapipe: EXT_external_memory_dma_buf |
| - llvmpipe: make it possible to import and bind unbacked resources |
| - lavapipe: include drm_fourcc.h |
| - lavapipe: check drm modifier info during image create |
| - lavapipe: EXT_image_drm_format_modifier |
| - venus/ci: Add patch for modifiers test to check import/export bits |
| - lp: Wrap udmabuf usage in HAVE_LIBDRM ifdef |
| - ci/lp: Remove ext buffer YUV tests from fails |
| - llvmpipe: Only return null resource handle when dt is not mapped |
| - llvmpipe: Only use udmabuf if header is found |
| |
| Lucas Stach (18): |
| |
| - etnaviv: disable 64bpp render/sampler formats |
| - etnaviv: track resource sharing |
| - etnaviv: only add shared resources to implicit flush list |
| - etnaviv: implicitly update shared texture resources |
| - etnaviv: don't use int filter for depth textures |
| - etnaviv: tex_desc: emit texture comparator |
| - etnaviv: fix fixpoint conversion of negative values |
| - ci/etnaviv: update expectations |
| - etnaviv: fix depth writes without testing |
| - etnaviv: rs: take src dimensions into account when increasing height alignment |
| - etnaviv: use correct blit box sizes when copying resource |
| - etnaviv: fix separate depth/stencil clears |
| - etnaviv: trigger TS derivation after slow clear |
| - etnaviv: split TS and non-TS RS clear commands |
| - etnaviv: ci: update expectation with fixed depth/stencil clears |
| - etnaviv: rs: treat depth-only clear to X8Z24 surfaces as full clear |
| - ci/etnaviv: update expectation after piglit uprev |
| - etnaviv: flip the switch on MSAA support |
| |
| Luigi Santivetti (1): |
| |
| - pvr: return the OS page size for minMemoryMapAlignment |
| |
| Lynne (1): |
| |
| - radv/av1: limit profile and bit depth to supported values |
| |
| M Henning (10): |
| |
| - nvk: Don't clobber vb0 after repeated blits |
| - nak: Remove assert on nir->info.outputs_written |
| - nvk: Early-out impossible descriptor allocations |
| - nak: Fix ldg/stg/atomg encoding to use globalmem |
| - nak: Set fewer bits in writes_color |
| - nak: Use undef for unset FSOut components |
| - nak: Remove old union_find implementation |
| - nak: Rewrite union_find and use it in repair_ssa |
| - nak: Count GLOBAL_SIZE_OFFSET in bytes, not words |
| - nvk: Don't use a descriptor cbuf if it's too large |
| |
| Manuel Stoeckl (1): |
| |
| - util/disk_cache: try getenv(HOME) before getpwuid->pw_dir |
| |
| Marcin Ślusarz (1): |
| |
| - intel/compiler/xe2: fix decoding of sampler simd mode |
| |
| Marek Olšák (244): |
| |
| - gallium/u_vbuf: replace unnecessary dst_index with "i" |
| - gallium: remove unbind_trailing_count from set_vertex_buffers |
| - cso: don't unbind vertex buffers when enabling/disabling u_vbuf |
| - winsys/amdgpu: merge loops decrementing num_active_ioctls & unreferencing bufs |
| - winsys/amdgpu: cosmetic touchups |
| - winsys/amdgpu: don't clear buffer list elements after IB submission |
| - winsys/amdgpu: add more fence_reference helpers |
| - winsys/amdgpu: don't clear fence list elements after IB submission |
| - winsys/amdgpu: remove misplaced duplicated comment |
| - winsys/amdgpu: represent IB_MAX_SUBMIT_DWORDS in bytes |
| - winsys/amdgpu: represent max_ib_size_dw in bytes |
| - winsys/amdgpu: cosmetic touchups around IB sizes |
| - amd: unify NIR options between RADV and radeonsi |
| - ac/nir: don't write TCS outputs to memory if no_varying is set |
| - ac/nir: rename clipdist_enable_mask -> clip_cull_dist_mask |
| - ac/nir: optimize out multiplications in small line culling |
| - ac/nir: simplify code at the beginning of ac_nir_gs_shader_query |
| - ac,radeonsi: emulate GS primitive pipeline stat on gfx11 because of culling |
| - radeonsi: report more detailed output stats for shader-db |
| - radeonsi: expose shader profiles to other .c files |
| - radeonsi: don't use staging uploads for buffers & shaders with all VRAM visible |
| - radeonsi: deduplicate gfx10_ngg_get_vertices_per_prim / get_num_vert_per_prim |
| - radeonsi: change GS_STATE_PROVOKING_VTX_INDEX to 1 bit PROVOKING_VTX_FIRST |
| - radeonsi: split si_update_ngg_prim_state_sgpr into 2 functions |
| - radeonsi: pack GS_STATE_ESGS_VERTEX_STRIDE better to save 2 bits |
| - radeonsi: remove no-op additions for viewport0_y_inverted |
| - radeonsi: remove unused preloaded instance_divisor_constbuf |
| - radeonsi: rename \*trivial_vs_prolog -> \*trivial_vs_inputs |
| - radeonsi/gfx11: clean up MAX_ALLOWED_TILES_IN_WAVE programming |
| - radeonsi/ci: update gfx11 flakes |
| - radeonsi/gfx11: flush DB before Z/S clear to work around dEQP failures |
| - radeonsi: don't flush CS before and after every blitter invocation |
| - mesa,gallium: move the thread scheduler to src/util |
| - gallium: rename PIPE\_.._PIN_THREADS_TO_L3_CACHE -> .._UPDATE_THREAD_SCHEDULING |
| - st/mesa: rename ST_L3_PINNING_DISABLED -> ST_THREAD_SCHEDULER_DISABLED |
| - util: add mesa_pin_threads environment variable that sets a static affinity mask |
| - glthread: apply the thread scheduling policy when the context is created |
| - glthread: apply the thread scheduling policy when a batch executes synchronously |
| - gallium/hud: add "csv" option to print values to stdout as CSV |
| - nir: remove INTERP_MODE_COLOR |
| - nir: relax validation failure for generic TCS outputs with no_varying |
| - nir: remove and replace underused option pack_varying_options |
| - nir: replace lower_io_variables with a GLSL NIR flag |
| - nir: add a lower_mediump_io callback into options |
| - nir: add vertex divergence into nir_divergence_analysis |
| - winsys/amdgpu: fix a race condition when reading ws->num_buffers |
| - winsys/amdgpu: add real buffers of slab entries in the CS thread |
| - winsys/amdgpu: change the signature of amdgpu_add_bo_fences_to_dependencies |
| - winsys/amdgpu: move code out of amdgpu_add_bo_fences_to_dependencies for reuse |
| - winsys/amdgpu: merge 2 loops iterating over slab entries in amdgpu_cs_submit_ib |
| - winsys/amdgpu: merge 2 loops iterating over sparse BOs in amdgpu_cs_submit_ib |
| - winsys/amdgpu: merge 2 loops iterating over real BOs in amdgpu_cs_submit_ib |
| - winsys/amdgpu: skip code checking RADEON_USAGE_SYNCHRONIZED for slabs |
| - winsys/amdgpu: simplify amdgpu_do_add_buffer to remove memset |
| - winsys/amdgpu: don't ref/unref slab BOs in amdgpu_cs_submit_ib |
| - radeonsi: use num_vertex_buffers instead of ARRAY_SIZE |
| - radeonsi/ci: add gfx11 flakes |
| - gallium: always set vertex elements before setting vertex buffers |
| - gallium/u_blitter: set take_ownership=true for set_vertex_buffers |
| - st/mesa: set take_ownership=true for set_vertex_buffers in st_draw_quad |
| - gallium/util: add take_ownership parameter into util_draw_vertex_buffer |
| - st/mesa: set take_ownership=true for util_draw_vertex_buffer in st_DrawTex |
| - st/mesa: set take_ownership=true for set_vertex_buffers in st_pbo_draw |
| - gallium/hud: set take_ownership=true for set_vertex_buffers |
| - cso: remove CSO_UNBIND_VERTEX_BUFFER0 |
| - gallium/u_threaded: remove the count=0 path from tc_call_set_vertex_buffers |
| - gallium/u_threaded: allow drivers to change tc_call_set_vertex_buffers function |
| - gallium: remove take_ownership from set_vertex_buffers, assume it's true |
| - gallium/noop: don't leak resources due to take_ownership |
| - radeonsi,aco: remove the VS prolog |
| - gallium/u_threaded: expose helpers for filling set_vertex_buffers externally |
| - st/mesa: rename attribs -> arrays in st_atom_array to indicate non-zero strides |
| - st/mesa: do (inputs_read & enabled_arrays) outside setup_arrays |
| - st/mesa: do (inputs_read & ~enabled_arrays) outside st_setup_current |
| - st/mesa: move a piece of _mesa_draw_array_attrib out of the loop in setup_arrays |
| - st/mesa: cosmetic touchups in st_atom_array.cpp |
| - st/mesa: change the update enum of vertex elements |
| - st/mesa: move st_update_functions into st_context |
| - st/mesa: constify the pipe_draw_info parameter and remove obsolete comments |
| - mesa: inline {Create,Draw}GalliumVertexState callbacks |
| - mesa: inline _mesa_set_vao_immutable |
| - mesa: add gl_vertex_array_object::NonIdentityBufferAttribMapping |
| - util/idalloc: make deleting invalid IDs a no-op |
| - mesa: remove unused _mesa_HashTable code |
| - mesa: clean up unnecessary _mesa_HashTable locked/unlocked wrappers |
| - mesa: re-format main/hash.h, move inlines to the end, some code to main/hash.c |
| - mesa: fold _mesa_HashDeleteAll into _mesa_DeleteHashTable |
| - mesa: remove _mesa_HashTable::InDeleteAll |
| - st/mesa: merge 3 unlikely blocks in _mesa_get_bufferobj_reference |
| - st/mesa: remove !obj checking in _mesa_get_bufferobj_reference when it's useless |
| - mesa: fix incorrect _mesa_HashInsertLocked parameter in _mesa_EndList |
| - mesa: use util_idalloc_alloc_range for _mesa_HashFindFreeKeyBlock |
| - winsys/amdgpu: convert amdgpu_cs.c to .cpp |
| - winsys/amdgpu: enable unlimited number of parallel queues for VCN |
| - util/idalloc: optimize foreach by tracking the greatest non-zero element |
| - mesa: declare _mesa_HashTable::id_alloc as non-pointer |
| - mesa: declare _mesa_HashTable inside structures instead of as a pointer |
| - mesa: remove isGenName parameter from _mesa_HashInsert |
| - mesa: use util_idalloc_foreach for looping in _mesa_HashTable |
| - mesa: replace _mesa_HashTable::ht with util_sparse_array for faster lookups |
| - d3d12: make DrawTransformFeedback not depend on the vertex buffer offset |
| - mesa: don't use the slow VAO path except for drivers that want to use it |
| - st/mesa: add VAO fast path C++ template variants for st_update_array callback |
| - st/mesa: optimize st_update_arrays using lots of C++ template variants |
| - glthread: re-enable thread scheduling in st/mesa when glthread is disabled |
| - glthread: use _mesa_glthread_fence_call() instead of duplicating that code |
| - glthread: add no_error variants of glDrawElements* |
| - glthread: add no_error variants of glDrawArrays* |
| - glthread: remove cmd_size from constant-sized calls |
| - glthread: clean up how vertex stride is packed |
| - glthread: pack "size" in Pointer calls as 16 bits |
| - mesa: deduplicate get_index_size_shift code |
| - mesa: deduplicate is_index_type_valid code |
| - glthread: pack the primitive type to 8 bits |
| - glthread: pack the index type to 8 bits |
| - glthread: rewrite glDrawElements call packing |
| - glthread: rewrite glDrawArrays call packing |
| - glapi: fix type names for glthread and handle all types |
| - glthread: sort fixed-sized parameters before returning them |
| - glthread: move global marshal_XML.py functions into class marshal_function |
| - glthread: precompute fixed_params and variable_params lists |
| - gltrhead: merge 3 blocks conditional on marshal_sync in print_async_body |
| - glthread: separate unmarshal function generation into print_unmarshal_func |
| - glthread: separate marshal code generation into print_marshal_async_code |
| - glthread: remove "if True" from print_marshal_async_code |
| - glapi: pass pointer size to python for glthread from meson |
| - glthread: pack glVertexAttribPointer calls better |
| - glthread: fix multi draws with a negative draw count |
| - glthread: pack uploaded user vertex buffers and offsets better |
| - glthread: deduplicate batch finalization code |
| - glthread: don't check cmd_size for small variable-sized calls |
| - glthread: use marshal_count instead of count for more functions |
| - glthread: rewrite glBindBuffer packing |
| - glthread: add a packed variant of glDrawElements with 16-bit count and indices |
| - glthread: add a packed version of DrawElementsUserBuf |
| - glthread: generate packed versions of gl*Pointer/Offset calls |
| - amd: update addrlib |
| - mesa: deduplicate initialization of gl_pixelstore_attrib |
| - mesa: move struct gl_pixelstore_attrib into glthread.h |
| - glthread: track glPixelStore(GL_UNPACK_*) |
| - glthread: execute small glBitmap asynchronously |
| - glthread: execute small glDrawPixels asynchronously |
| - glthread: invert _mesa_glthread_has_no_{un}pack_buffer by removing the negation |
| - amd/registers: add correct gfx11.x enums for BINNING_MODE |
| - radeonsi: disable binning correctly on gfx11.5 |
| - radeonsi/gfx11: fix programming of PA_SC_BINNER_CNTL_1.MAX_ALLOC_COUNT |
| - radeonsi/gfx10.3: add a GPU hang workaround for legacy tess+GS |
| - radeonsi: allocate only one set of tessellation rings per device |
| - radeonsi/gfx11: program the attribute ring right before draws |
| - radeonsi: program tessellation rings right before draws |
| - radeonsi/gfx11: program SAMPLE_MASK_TRACKER_WATERMARK optimally for APUs |
| - ac: use the gfx11 shadowed register tables for gfx11.5 |
| - radeonsi/gfx11: add missing DCC_RD_POLICY setting |
| - radeonsi: add radeonsi_cache_rb_gl2 option enabling GL2 caching for CB and DB |
| - nir/divergence_analysis: change function prototypes |
| - nir/divergence_analysis: load_primitive_id is convergent within a primitive |
| - nir/divergence_analysis: load_instance_id is convergent within a primitive |
| - nir/divergence_analysis: handle derefs of system values |
| - nir: print nir_io_semantics::invariant |
| - nir: add nir_block::divergent to indicate a divergent entry condition |
| - ac/llvm: fix SSBO bounds checking by using raw instead of struct opcodes |
| - radeonsi: fix the DMA compute shader |
| - radeonsi: don't test so many wave limits for AMD_TEST=testdmaperf |
| - nir: add a utility computing post-dominance of SSA uses |
| - nir: add nir_opt_varyings, new pass optimizing and compacting varyings |
| - nir/tests: add tests for nir_opt_varyings |
| - radeonsi: set the lower_mediump_io callback for GLSL |
| - radeonsi: set trivial NIR options for nir_opt_varyings |
| - radeonsi: enable uniform propagation for varyings except VP/Energy |
| - radeonsi: add test failures due to incorrect tests for nir_opt_varyings |
| - st/mesa: get dual slot input info from NIR if IO is lowered |
| - st/mesa: lower sysvals slightly sooner |
| - st/mesa: skip a few NIR passes that don't work with lowered IO |
| - glsl/linker,st/mesa: enable nir_opt_varyings and lower IO in the linker |
| - amd/ci: update stoney results |
| - r300: port scanout pitch alignment from the DDX to fix DRI3 |
| - r300: enable tiling for scanout to fix DRI3 performance |
| - radeonsi/ci: run GLCTS, ESCTS, and dEQP from the glcts directory |
| - radeonsi/ci: update failures |
| - Unbreak Viewperf by reverting "util: use crc32_z instead of crc32 and bump zlib dep to 1.2.9" |
| - gallium: use u_box_3d to initialize pipe_box instead of non-designated initializers |
| - gallium: increase the size of pipe_box y, height fields to allow bigger textures |
| - nir: rename AMD XFB intrinsics to \*_gfx11_amd |
| - nir,amd: add nir_intrinsic_load_debug_log_desc_amd and its use |
| - aco: implement aco_is_gpu_supported using switch statement |
| - aco: add a helper printing shader asm by disassembling via LLVM |
| - ac/llvm: remove remnants of gfx10 NGG streamout |
| - radeonsi: implement the shader debug log from ac_nir_store_debug_log_amd |
| - nir/validate: validate interp_mode of load_barycentric_* |
| - nir/lower_io: add nir_io_semantics::interp_explicit_strict |
| - nir/validate: validate more fields of nir_io_semantics |
| - tgsi_to_nir: translate TG4 |
| - nir/opt_varyings: don't generate IO with unsupported bit sizes |
| - nir/opt_varyings: simplify nir_io_semantics::num_slots of directly-indexed slots |
| - nir/opt_varyings: handle load_input_vertex |
| - ac/surface: add radeon_surf::thick_tiling |
| - ac/nir: allow 16-bit results for resinfo |
| - ac/llvm: simplify extracting an element in get_image_coords |
| - ac/llvm: add support for 16-bit coordinates (A16) for image (non-sampler) opcodes |
| - ac/llvm: allow image loads to return less than 4 components, trim DMASK |
| - ac/llvm: remove handling of input and output loads/stores that are lowered |
| - ac/llvm: remove unused fields of ac_shader_abi |
| - ac/llvm: simplify the optimization barrier and apply it to the whole vector |
| - ac: add helper ac_get_ip_type_string to remove duplication |
| - nir: add more build helpers |
| - nir: allow FP16 in nir_format_linear_to_srgb |
| - nir: add nir_intrinsic_optimization_barrier_sgpr_amd |
| - nir: change "user_data_amd" sysval from 4 to 8 components |
| - nir/use_dominance: set the root as post-dominator of unmovable instructions |
| - util: add new format helpers |
| - util: import pipe_box and its helpers |
| - ac/llvm: fix assertions for texture instructions with 16-bit LOD bias |
| - ac/llvm: always trim components of texture instructions, trim DMASK |
| - ac/surface: constify and reindent NIR meta address-from-coord function params |
| - radeonsi/ci: update gfx11 failures |
| - radeonsi/gfx11: don't prefetch constants in binaries into the instruction cache |
| - radeonsi/gfx11: enable DCC fast clears for 8-bit and 16-bit formats |
| - radeonsi: use the same nir_lower_subgroups_options as RADV |
| - radeonsi: add the radeonsi_optimize_io option into the shader cache key |
| - radeonsi: check has_stable_pstate in the winsys |
| - radeonsi: move TCS epilog key bits to the key->ge.opt section |
| - radeonsi: fix initialization of occlusion query buffers for disabled RBs |
| - radeonsi: don't expose samples_identical and don't lower FMASK if it's disabled |
| - radeonsi: allow input NIR to use descriptors in image opcodes |
| - radeonsi: move blitter resource_copy_region implementation to si_gfx_copy_image |
| - radeonsi: move blitter clear_render_target impl into si_gfx_clear_render_target |
| - radeonsi: preserve NaNs in draw-based resource_copy_region |
| - radeonsi: use simpler UINT fallback formats for draw-based resource_copy_region |
| - radeonsi: remove si_use_compute_copy_for_float_formats |
| - radeonsi: change allow_flat_shading to make it a single condition |
| - radeonsi: don't call resource_copy_region in pipe->blit |
| - radeonsi/gfx11: implement DCC clear to "single" for fast non-0/1 clears |
| - radeonsi: disable VRS flat shading for selected 8xMSAA and thick tiling cases |
| - radeonsi: don't use si_get_flush_flags() for flushing images |
| - radeonsi: don't flush CB in si_launch_grid_internal_images if not needed |
| - radeonsi: don't flush CB and DB if there have been no draw calls |
| - radeonsi: enable fast FB clears for conditional rendering |
| - radeonsi: make clear_render_target clear DCC directly instead of via pipe->clear() |
| - radeonsi: don't add whether NIR is used into the shader key |
| - radeonsi: only expose 8 EQAA samples due to shader limitations |
| - radeonsi: always run nir_opt_16bit_tex_image |
| - radeonsi: use ip_type in debug code instead of hardcoding GFX |
| - radeonsi: implement user_data_amd for 5, 6, and 7 components correctly |
| - util: shift the mask in BITSET_TEST_RANGE_INSIDE_WORD to be relative to b |
| |
| Mark Collins (32): |
| |
| - tu/kgsl: Fix sync_wait'd FD in kgsl_syncobj_wait |
| - tu/a7xx: Update CCU layout logic for A7XX |
| - tu: Allow GMEM on A7XX when TU_DEBUG=gmem |
| - tu: Set A7XX registers in \`tu6_tile_render_begin` |
| - tu: Set \`CP_THREAD_CONTROL::CONCURRENT_BIN_DISABLE` in A7XX HW init |
| - tu: Only set PC/VFD PWR_CNTL regs on A6XX |
| - tu: Use \`CP_SET_PSEUDO_REG` for A7XX VSC stream regs |
| - tu/autotune: Use \`CP_EVENT_WRITE7::ZPASS_DONE` on A7XX |
| - tu: Set \`RB_UNKNOWN_88E4` for A7XX event blits |
| - freedreno/devices: Update A7XX tile values |
| - tu: Use full size color CCU in sysmem mode |
| - tu: Update CCU layout selection logic for seperate stencil stores |
| - tu: Allow event blit to resolve depth stencil formats |
| - tu: Fix 2D blit path for GMEM stores on A7XX |
| - tu: Use \`Z24_UNORM_S8_UINT_AS_R8G8B8A8` for A7XX GMEM D24S8 blits/clear |
| - tu: Disable LRZ properly on A7XX |
| - tu: Set RB_CCU_CNTL during HW init on A7XX |
| - tu: Fix CP_BLIT sync on A7XX |
| - tu: Clear \`VSC_UNKNOWN_0D08` on A7XX |
| - tu: Add blit cache flushing for input attachments |
| - tu: Unconditionally enable GMEM on A7XX |
| - fd/replay: Fix wrbuffer name extraction |
| - fd/replay: Dump wrbuf into cwd rather than exe directory |
| - fd/replay: Clamp dumped wrbuf to buffer size |
| - fd/replay: Clear wrbufs after submitting cmdstreams for DRM |
| - fd/replay: Add wrbuf support for KGSL/DXG |
| - fd/replay: Error when VMA AS allocation fails |
| - fd/replay+rddecompiler: Add option to clear wrbufs at start |
| - fd/rddecompiler: Disable IR3 cache for replay context |
| - fd/decode: Build generate_rd executable rather |
| - fd/replay: Use generate_rd as default CS generator |
| - fd/decode: Fix "OPTSIONS" typo in help messages |
| |
| Mark Janes (18): |
| |
| - hasvk: add missing linker arguments |
| - util: add parson for handling json files |
| - intel/dev: specify struct intel_device_info type details in python |
| - intel/dev: generate declarations for struct intel_device_info |
| - intel/tools: add intel device meson dependencies |
| - intel/dev: implement json serialization for intel_device_info |
| - intel/dev/tools: add json as an output format for intel_dev_info |
| - intel/tools: load json device info in drm_shim |
| - intel/dev: improve meson invocation for intel_device_info gen |
| - intel/compiler: generate a hash function to use with the shader cache |
| - iris: use device info sha in device renderer string |
| - anv: use intel_device_info to set device UUID |
| - intel/tools: move intel_dev_info to intel/tools |
| - intel/tools: add shader compiler hash key to json devinfo format |
| - pan/va: Add missing valhall_enums dep to bifrost_tests |
| - intel/dev: declare workarounds required by ATSM platforms |
| - intel/dev: remove pci revision from shader cache key |
| - intel/compiler: drop unused ray-tracing fields from cache hash |
| |
| Martell Malone (3): |
| |
| - nine: r500 under 20 fragments cap is a warning |
| - nine: detect emulation fallback of d3d coordinates |
| - nine: update verbiage for enduser device messages |
| |
| Martin Krastev (5): |
| |
| - svga/ci: land vmware mesa-ci lava farm |
| - svga/ci: workaround vmware farm's inability to use public DNS 8.8.8.8 |
| - svga/ci: re-enable vmware farm |
| - svga/ci: add two new piglit flakes to svga |
| - svga/ci: disable vmware farm |
| |
| Martin Roukala (né Peres) (10): |
| |
| - radeonsi/ci: update vangogh's expectations after piglit uprev |
| - zink/ci: update navi31's expectations after piglit uprev |
| - zink/ci: update polaris10's expectations after piglit uprev |
| - radv/ci: switch vkcts-polaris10 from mupuf to KWS' farm |
| - radv/ci: add a vkcts-tahiti job |
| - radv/ci: add a vkd3d-tahiti job |
| - ci/b2c: rename .b2c-test-{vk,gl} to .b2c-x86_64-test-{vk,gl} |
| - ci/b2c: rename .deqp-test-valve into .b2c-deqp-test |
| - ci/b2c: allow setting the DTB to be used |
| - ci/valve: remove the traces runner |
| |
| Mary Guillemard (37): |
| |
| - nouveau: nvidia_header: Add AMPERE_A in vk_push_print |
| - nouveau: nvidia_header: Add TURING_COMPUTE_A and AMPERE_COMPUTE_A in vk_push_print |
| - nouveau: nvidia_header: Add AMPERE_COMPUTE_B in vk_push_print |
| - nouveau: nvidia-headers: Add compute array parsing to class_parser.py |
| - nouveau: nvidia-headers: Add nv_push_dump tool |
| - nouveau: mme: Add a dumper |
| - agx: Add more bitops in agx_bitop_table |
| - agx: Remove and/or/xor pseudo ops |
| - agx: Fuse not into and/or/xor |
| - agx: Add a bitop optimizer pass |
| - pan/bi: assert indices when offsets are present in bi_emit_tex_valhall |
| - pan/lib: Remove variables in blitter |
| - pan/bi: Rework indices for tex on Valhall |
| - pan/bi: Rework indices for image on Valhall |
| - pan/bi: Rework indices for attributes on Valhall |
| - pan/bi: Lower ubo table in indices for Valhall |
| - panfrost, pan/lib: Move pan_resource_table to panfrost |
| - nvk: Always copy conditional rendering value before compare |
| - drm-shim: Add io region handling in mmap |
| - panfrost: Add support for Panthor in drm-shim |
| - docs: Document Mali-G610 in drm-shim section |
| - panfrost: group up stubbed params in drm-shim |
| - nouveau: Add support for TERT opcodes in vk_push_print |
| - nouveau: Fix NINC TERT handling in vk_push_print |
| - nak: Support unaligned swizzles in 8/16 bits vec srcs |
| - nak: move folding logic to Src::fold_imm |
| - nak: Add F16 and F16v2 sources |
| - nak: Improve copy propagation pass to handle F16 |
| - nak: Add 16-bits float operations |
| - nvk: Advertise shaderFloat16 |
| - nvk: Allow various alu op to be vectorized for 2xfp16 |
| - nak: Allow SHF to use immediate encoding for shift |
| - panvk: Return os_page_size for minMemoryMapAlignment |
| - panvk: Fix driver UUID not being filled |
| - panvk: Move to vk_properties |
| - panvk: Advertise VK_KHR_driver_properties |
| - nak: Pass has_mod to all form of src2 requiring it |
| |
| Mary Strodl (2): |
| |
| - rusticl: set OCL_ICD_VENDORS as directory, not file |
| - NirShader: don't fail on null constant_buffer |
| |
| Matt Turner (5): |
| |
| - util: Add DETECT_ARCH_HPPA macro |
| - util/tests: Disable half-float NaN test on hppa/old-mips |
| - meson: Limit intel_vk_rt to x86_64 |
| - anv/drirc: Add option to control implicit sync on external BOs |
| - intel: Build float64 shader only for Vulkan |
| |
| Matthew Waters (1): |
| |
| - teximage: allow glCopyTex{Sub}Image[123]D into R/RG textures with OpenGL ES 2.0 |
| |
| Max R (11): |
| |
| - d3d10umd: Fix compilation |
| - winsys/gdi: Handle R8G8B8 formats |
| - winsys/gdi: Custom acquisition of hDC |
| - d3d10umd: Use flush_frontbuffer for Present |
| - virgl: Fix compilation on MSVC |
| - virgl: Fix crash when no VE bound |
| - virgl: Implement PIPE_QUERY_GPU_FINISHED |
| - virgl: Allow importing resources without known templ |
| - virgl: Pass cmd_buf to flush_frontbuffer |
| - d3d10umd, meson: Allow naming d3d10umd DLLs |
| - d3d10umd: Rename d3d10sw target to d3d10umd |
| |
| Michel Dänzer (2): |
| |
| - egl/wayland: Flush after blitting to linear copy |
| - wsi/wayland: Dispatch event queue in wsi_wl_swapchain_queue_present |
| |
| Mike Blumenkrantz (313): |
| |
| - vk/cmdbuf: add back deleted maint6 workgraph bits |
| - lavapipe: use pushconstants2 for dgc |
| - lavapipe: fix devenv icd filename |
| - zink: fix separate shader patch variable location adjustment |
| - lavapipe: delete extra descriptor buffer layout validation |
| - zink: use local screen variable in surface creation |
| - zink: hook up maint6 |
| - zink: use maint6 for multi-layer compressed surface creation |
| - zink: set more dynamic states when using shader objects |
| - lavapipe: KHR_dynamic_rendering_local_read |
| - zink: always map descriptor buffers as COHERENT |
| - zink: fix descriptor buffer unmaps on screen destroy |
| - lavapipe: RM2024 extension promotions |
| - zink: add a tu flake |
| - zink: prune dmabuf export tracking when adding resource binds |
| - zink: fix sparse bo placement |
| - zink: zero allocate resident_defs array in ntv |
| - zink: move sparse lowering up in file |
| - zink: run sparse lowering after all optimization passes |
| - zink: add back (safe) optimizations after sparse lowering |
| - zink: split out sparse_residency_code_and lowering |
| - mesa: plumb errors through to texture allocation |
| - zink: adjust swizzled deref loads by the variable component offset |
| - nir/lower_io: fix handling for compact arrays with indirect derefs |
| - zink: only add arrays to indirect non-tcs variables |
| - zink: promote a conditional on gfx shader destroy |
| - zink: clamp zink_gfx_lib_cache::stages_present for generated tcs |
| - zink: promote gpl libs freeing during shader destroy out of prog loop |
| - zink: don't add VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT for sparse textures |
| - zink: add a ci skip |
| - ci: bump VVL to snapshot-2024wk06 |
| - zink: update vvl expectations |
| - mesa: check driver format support for certain GetInternalformativ queries |
| - zink: always enable glsl_correct_derivatives_after_discard |
| - zink: add a750 baseline |
| - zink: delete maxDescriptorBufferBindings checks |
| - zink: flag the use_img as unsync access with buf2img copies, not the swapchain |
| - zink: pre-check formats for samplecount support |
| - zink: validate sample count on image create |
| - zink: add an assert for dummy fb surface creation |
| - zink: compute bo unique_id on use, not creation |
| - zink: avoid infinite recursion on (very) small BAR systems in bo alloc |
| - egl/x11/swrast: deduplicate ANGLE_sync_control_rate enablement |
| - drisw: hook up EXT_buffer_age |
| - drisw/egl: delete unused buffer age handling |
| - vk/wsi/x11/sw: use swapchain depth for putimage |
| - mesa: add more driver support checks for more format queries |
| - zink: add checks/compat for low-spec descriptor buffer implementations |
| - zink: add a second fence disambiguation case |
| - zink: force host-visible allocations for MAP_COHERENT resources |
| - zink: hook up KHR_dynamic_rendering_local_read |
| - zink: use KHR_dynamic_rendering_local_read |
| - ci: make clang-format job warn on failure instead of killing the pipeline |
| - zink: handle stencil_fallback in zink_clear_depth_stencil |
| - zink: don't destroy the current batch state on context destroy |
| - zink: only scan active batch states for free states if > 1 exist |
| - zink: fix longstanding issue with active batch state recycling |
| - zink: assert that batch_id is valid in zink_screen_check_last_finished() |
| - zink: move flagging rp_changed in zink_update_fbfetch() to caller |
| - zink: don't pre-init dummy fbfetch surface when missing nullDescriptor feature |
| - zink: also set null fbfetch surfaces when no fb surface is bound |
| - zink: break out null fbfetch init for descriptor buffer |
| - zink: create/resize dummy surfaces on-demand |
| - zink: start out with 256x256 sized dummy surfaces |
| - zink: don't pre-init null fbfetch info |
| - zink: clamp in_rp clears to fb size |
| - zink: fix (dynamic rendering) execution of scissored clears during flush |
| - zink: fix swapchain readback conditional |
| - zink: lock buffer age when chundering swapchain for readback |
| - zink: flag acquired swapchain image as readback target on acquire, not present |
| - zink: make kopper_swapchain_image::acquired the resource that acquired it |
| - zink: add a swapchain readback case for reading differently-acquired image |
| - zink: make readback attempts count towards ZINK_READBACK_THRESHOLD |
| - zink: update swapchain readback cache on create |
| - zink: set and manage a flag indicating that swapchain readback needs updating |
| - zink: only update swapchain readback cache on create if necessary |
| - zink: only update swapchain readback cache when necessary |
| - zink: use new flag to determine whether swapchain readback cache is usable |
| - zink: update nv blob baseline |
| - zink: add nvk baseline |
| - ci: disable clang-format job |
| - zink: apply all storage memory masks to control barriers if no modes are specified |
| - zink: emit SpvCapabilityImageMSArray for ms arrayed storage images |
| - zink: null out bo usage when allocating from slab |
| - zink: fix unsynchronized read-mapping of device-local buffers |
| - zink: delete unused buffer map conditional |
| - zink: force max buffer alignment on return ptrs for mapped staging buffers |
| - gallium: add a nboxes param to flush_frontbuffer |
| - winsys/sw: propagate nboxes to displaytarget_display() |
| - drisw: plumb through a swapBuffersWithDamage interface |
| - egl/wayland/sw: move swrast_update_buffers() directly into swapbuffers |
| - egl/wayland/sw: move dri2_wl_swrast_commit_backbuffer() directly into swapbuffers |
| - egl/wayland: unify back/current swapping between zink and swrast |
| - egl/wayland/sw: split out surface attach from dri2_wl_swrast_commit_backbuffer() |
| - egl/wayland/sw: call dri2_wl_swrast_attach_backbuffer() before swap |
| - egl/wayland/sw: trigger damage from put_image2 |
| - egl/wayland/sw: move partial->full copy promotion to swapbuffers |
| - egl/wayland/sw: fix no-op updating of current backbuffer |
| - egl/wayland/sw: pass damage region through from put_image2 to wl_surface_damage |
| - egl/wayland/sw: clamp putimage geometry to surface size |
| - drisw/xlib: loop over all the boxes in display() hook |
| - drisw/winsys: loop over all the boxes in display() |
| - drisw: pass all frontend swapbuffer damage rects through |
| - egl/kopper: plumb through SwapBuffersWithDamage |
| - egl/kopper: advertise EXT_swap_buffers_with_damage only in non-sw mode |
| - egl/wayland: split out kopper vtable |
| - egl/wayland: add a separate hook for kopper buffer age |
| - egl/wayland: split out kopper swapbuffers functions |
| - egl/kopper: call swrast buffer age query for kopper+swrast |
| - kopper: set drawable buffer age |
| - egl/wayland/kopper: actually call kopper swapbuffer functions |
| - egl/wayland: split out kopper update_buffers |
| - egl/wayland: delete swrast references to zink |
| - zink: fix stencil-only blitting with stencil fallback |
| - zink: make zink_kopper_present_info public |
| - zink: use a slab allocator for zink_kopper_present_info |
| - zink: hook up VK_KHR_incremental_present |
| - zink: use VK_KHR_incremental_present to propagate damage rects |
| - zink: hook up KHR_partial_update |
| - vulkan/dispatch_table: add an uncompacted version of the table |
| - zink: use uncompacted vk_dispatch_table |
| - egl/dri2: use the right egl platform enum |
| - glx: only print zink failure-to-load messages if explicitly requested |
| - zink: stop enabling EXT_conservative_rasterization |
| - lavapipe bump descriptor buffer address space limits |
| - zink: fix PIPE_CAP_MAX_SHADER_PATCH_VARYINGS |
| - zink: call CmdSetRasterizationStreamEXT when using shader objects |
| - nvk: bump NVK_PUSH_MAX_SYNCS to 256 |
| - zink: update nvk baseline |
| - util/blitter: iterate samples in stencil_fallback |
| - mesa: fix CopyTexImage format compatibility checks for ES |
| - zink: update nvk baseline with nvk changes |
| - driconf: add radv_zero_vram for Crystal Project (1637730) |
| - zink: update nv baseline |
| - zink: track whether shaders use load_barycentric_at_sample |
| - zink: apply zink_shader::uses_sample to fs variant updating |
| - zink: destroy batch states after copy context |
| - zink: set VkExternalMemoryBufferCreateInfo for opaque fds too |
| - zink: simplify vb masking on bind |
| - mesa: force rendertarget usage on required-renderable formats |
| - zink: try getting sparse page size again without storage bit on fail |
| - u/inlines: constify util_res_sample_count() |
| - zink: only add STORAGE bit for sparse images based on multisample usage |
| - zink: nvk baseline updates |
| - zink: set the sparse format usage flags directly based on queried props |
| - zink: delete faked_e5sparse |
| - zink: rename optimal_key in update_gfx_program_optimal() |
| - zink: use the sanitized key in update_gfx_program_optimal() |
| - zink: always sync and replace separable progs even with ZINK_DEBUG=noopt |
| - zink: add even more strict checks for separate shader usage |
| - zink: be even stricter with shader object usage about blocking invalid usage |
| - zink: remove stale comments for DRLR usage |
| - zink: add a pass to strip out multisample storage image ops |
| - zink: don't deref swapchain image array with UINT32_MAX |
| - zink: handle image_deref_samples when stripping MS image instrs |
| - zink: iterate all the modes when doing separate shader fixups |
| - mesa/st: add 'base_serialized_nir' |
| - mesa/st: add is_draw_shader param to st_finalize_nir |
| - mesa/st: when creating draw shader variants, use the base nir and skip driver opts |
| - mesa/st: use sanitized shader keys for feedback draws |
| - zink: do io fixup on patch variables too |
| - zink: defer present barrier to flush if a clear is pending |
| - zink: clamp present region size |
| - zink: clamp swapchain renderarea instead of asserting |
| - zink: set dynamic rendering color attachment layouts |
| - radv: inline radv_device_fault_detection_enabled |
| - ci: bump VVL to v1.3.281 |
| - nir/divergence: add zink intrinsics |
| - nir/opt_varyings: update alu type when rewriting src/dest for moved ops |
| - zink: only check that CUBE_COMPATIBLE for images doesn't subtract flags |
| - zink: don't use set_foreach_remove with dmabuf_exports |
| - zink: make descriptor pool creation more robust |
| - zink: fix shaderdb pipeline compile |
| - zink: delete some ntv dead code |
| - zink: always sort io variables by location after re-creating them |
| - zink: use outputs_written mask to detect edge flag usage |
| - zink: update xfb info after lower_to_scalar |
| - zink: run scan_nir before variable rework |
| - zink: apply component offset for CLIP/CULL DIST1 location derefs |
| - zink: manually calc clip/cull distance sizes |
| - zink: add a helper to detect clip/cull dist locations |
| - zink: always use shader sizes for clip/cull dist variables |
| - zink: fix generated variable expansion |
| - zink: check for arrayness rather than tess io vars for indirect array vars |
| - zink: track a mask of arrayed io locations on shaders |
| - zink: call gather_info during shader creation |
| - zink: always check patch io during rework_io_vars |
| - zink: don't clobber indirect array reads with missing components |
| - zink: fix io slot calculation for vertex inputs in add_derefs |
| - zink: fix add_derefs case for compact arrays |
| - zink: only use location_frac for deref array indexing for compact variables |
| - llvmpipe: fix DRAW_USE_LLVM=0 |
| - nir/lower_wpos_ytransform: move new value load to start of function, reuse |
| - nir/lower_wpos_ytransform: reuse input zw components for fragcoord rewrite |
| - nir/lower_wpos_ytransform: update comment to reflect variable usage |
| - nir/lower_wpos_ytransform: scalarize emit_wpos_adjustment |
| - nir/lower_wpos_ytransform: fix for lowered io |
| - glsl: handle xfb resources for spirv before running varying opts |
| - mesa: clamp binary pointer in ShaderBinary if length==0 |
| - gallium: rework PIPE_CAP_POINT_SIZE_FIXED |
| - zink: delete some maintenance5 psiz pruning |
| - zink: fix add_derefs for partial interp loads of derefs |
| - zink: assert that ntv interp handling isn't doing implicit component expansion |
| - egl/x11: disable swapbufferswithdamage for zink without kopper |
| - glx/egl: fix LIBGL_KOPPER_DISABLE |
| - glsl: set PSIZ bit in outputs_written when injecting a 1.0 psiz write |
| - nir/lower_clamp_color_outputs: fix use with lowered io |
| - nir/lower_flatshade: break out location checking |
| - nir/lower_flatshade: fix with lowered io |
| - nir/lower_alpha_test: fix use with lowered io |
| - nir/lower_two_sided_color: rework for lowered io |
| - nir/lower_drawpixels: fix for lowered io |
| - nir/lower_clip_disable: fix for lowered io |
| - nir/lower_point_size_mov: rework. |
| - nir/lower_point_size_mov: fix for lowered io |
| - nir/texcoord_replace: fix scalarized io handling |
| - nir/dominance: fix comment |
| - drisw: reorder image extensions |
| - sw_winsys: add displaytarget_create_mapped |
| - winsys/null: implement displaytarget_create_mapped |
| - winsys/drisw: implement displaytarget_create_mapped |
| - winsys/drisw: implement dmabuf handling |
| - lavapipe: add a function for asserting external memory handle types |
| - winsys: add WINSYS_HANDLE_TYPE_UNBACKED |
| - winsys: add more stride members to winsys_handle |
| - lavapipe: EXT_queue_family_foreign |
| - lavapipe: rework mem handle type assert to handle dmabuf |
| - lavapipe: handle drm image format queries |
| - lavapipe: handle drm image imports |
| - docs: update lavapipe features |
| - nir: add compact_arrays to nir_shader_compiler_options |
| - nir/gather_info: fix gathering for compact arrayed builtins |
| - zink: set compact_arrays in compiler options |
| - microsoft/compiler: set compact_arrays in compiler options |
| - lavapipe: don't clamp index buffer size for null index buffer draws |
| - v3d: set use_clipdist_array=true for lower_clip? |
| - nir/lower_clip: surgerize for lowered io |
| - nir/lower_clip: handle scalarized io |
| - zink: block LA formats with srgb |
| - llvmpipe: clamp 32bit query results to low 32 bits rather than MIN |
| - lavapipe: clamp 32bit query results to low 32 bits rather than MIN |
| - agx: set compact_arrays in compiler options |
| - v3d: set compact_arrays in compiler options |
| - intel: set compact_arrays in compiler options |
| - freedreno: set compact_arrays in compiler options |
| - glsl: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and check compact_arrays |
| - ttn: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and check compact_arrays |
| - glsl: move an assert from st_context over to avoid using PIPE_CAP_NIR_COMPACT_ARRAYS |
| - mesa: delete LowerCombinedClipCullDistance from consts |
| - st/program: stop using PIPE_CAP_NIR_COMPACT_ARRAYS and use compact_arrays |
| - nine: stop checking PIPE_CAP_NIR_COMPACT_ARRAYS and use compact_arrays |
| - gallium: delete PIPE_CAP_NIR_COMPACT_ARRAYS |
| - zink: set indirect io compiler flags |
| - zink: set lower_to_scalar |
| - zink: rework rework_io_vars |
| - zink: set nir_io_glsl_lower_derefs in compiler options |
| - zink: add a pass to fix vertex input locations |
| - zink: enable opt_varyings with ZINK_DEBUG=ioopt |
| - zink: ci updates |
| - nir/remove_unused_io_vars: check all components to determine variable liveness |
| - ci: kill piano trace globally |
| - nir: print i/o variables in location order |
| - lavapipe: disable stencil test if no stencil attachment |
| - egl: fix defines for zink's dri3 check |
| - egl/android: fix zink loading |
| - egl: use os_get_option for MESA_LOADER_DRIVER_OVERRIDE |
| - zink: disable buffer reordering correctly on shader image binds |
| - nir/print: stop trying to match i/o vars using base/driver_location |
| - zink: add ZINK_DEBUG=nopc to completely disable precompilation |
| - zink: destroy shaderdb pipelines |
| - zink: add VK_PIPELINE_CREATE_CAPTURE_STATISTICS_BIT_KHR for shaderdb |
| - brw/lower_a2c: fix for scalarized fs outputs |
| - zink: copy shader name when copying shader info |
| - zink: run nir_lower_io_to_scalar (mostly) unconditionally and earlier |
| - zink: vectorize io loads/stores when possible |
| - zink: ci updates |
| - zink: prune some piglit cts fails |
| - loader: delete unused param from pipe_loader_vk_probe_dri() |
| - glx: fix some indentation |
| - glx: add an 'implicit' param to createScreen |
| - glx: pass implicit load param through allocation |
| - dri: plumb a 'implicit' param through createNewScreen interfaces |
| - gbm: plumb an 'implicit' param through device creation |
| - frontends/dri: plumb an 'implicit' param through screen init |
| - pipe-loader: plumb a flag for implicit driver load through screen creation |
| - zink: don't print error messages when failing an implicit driver load |
| - glx: silence more implicit-load zink errors |
| - mesa/st: don't use serialized_nir for cached shaders |
| - zink: make NOREORDER mode context-based |
| - zink: disable command reordering for compute-only contexts |
| - nir: store variable names to io instrs during io lowering |
| - nir/lower_io_to_scalar: preserve variable names when splitting io |
| - nir/clone: preserve intrinsic name field across clones |
| - nir/print: print io instr->name if available |
| - zink: preserve/merge variable names when generating new variables |
| - glthread: check for invalid primitive modes in DrawElementsBaseVertex |
| - zink: reconstruct features pnext after determining extension support |
| - zink: prune zink_shader::programs under lock |
| - zink: fully wait on all program fences during ctx destroy |
| - kopper: fix bufferage/swapinterval handling for non-window swapchains |
| - zink: slightly better swapinterval failure handling |
| - kopper: don't set drawable buffer age |
| - zink: clean up accidental debug print |
| - egl/x11: disable dri3 with LIBGL_KOPPER_DRI2=1 as expected |
| - zink: add a batch ref for committed sparse resources |
| - u_blitter: stop leaking saved blitter states on no-op blits |
| - freedreno/replay: use inttypes format string for 64bit |
| - frontends/dri: only release pipe when screen init fails |
| - frontends/dri: always init opencl_func_mutex in InitScreen hooks |
| - zink: clean up semaphore arrays on batch state destroy |
| - egl/dri2: fix error returns on dri2_initialize_x11_dri3 fail |
| - nir/lower_aaline: fix for scalarized outputs |
| - nir/linking: fix nir_assign_io_var_locations for scalarized dual blend |
| |
| Mike Hsieh (1): |
| |
| - amd/vpelib: Add param check for geometric scaling and refactor |
| |
| Mohamed Ahmed (14): |
| |
| - nil: change image_level_size() to take tiling in account |
| - nil: Add helper function to get tile size in pixels |
| - nil: Add helpers for conversion from pixel values to tiles |
| - nil: Expose tiling_extent_B() |
| - nil: Add support for sparse resident images |
| - nvk: add sparse queries |
| - nvk: enable sparse residency features |
| - nak: wire up shader resource residency intrinsics |
| - nak: wire up sparse image loads |
| - nvk: advertise shader resource residency |
| - nil: Add a nil_image::compressed bit |
| - nil: Add some helpers for DRM format modifiers |
| - nil: Support creating images with DRM modifiers |
| - nvk: enable rendering to DRM_FORMAT_MOD_LINEAR images |
| |
| Mykhailo Skorokhodov (2): |
| |
| - egl/wayland: Fix sRGB format look up for config |
| - ci/lima: expect fail of window_8888_colorspace_srgb on wayland |
| |
| Nanley Chery (13): |
| |
| - iris: Don't memset the extra_aux memory range |
| - iris: Don't memset CCS on integrated gfx12 |
| - iris: Enable pass-through state init for gfx12 CCS |
| - isl: Pick a better initial state for zeroed MCS |
| - iris: Copy main ISL surf when reallocating in place |
| - iris: Report the correct modifier for Tile4 images |
| - iris: Use resource_get_param in resource_get_handle |
| - intel/isl: Remove inconsistency when choosing Tile64 |
| - intel/isl: Remove inconsistency when encoding Tile64 |
| - intel/isl: Remove a CCS_D check from gfx12+ code |
| - intel/isl: Enable a 64KB alignment WA for flat-CCS |
| - intel/isl: Use Tile64 to align images for CCS WA |
| - intel/isl: Disable miptails to align LODs for CCS WA |
| |
| Neil Armstrong (1): |
| |
| - freedreno: Add a750 clock gating control related registers |
| |
| Nikita Popov (1): |
| |
| - Pass no-verify-fixpoint option to instcombine in LLVM 18 |
| |
| Oskar Viljasaar (3): |
| |
| - vulkan/properties: Start looping from the next member in GPDP2 |
| - tu: Use common physical device properties infrastructure |
| - compiler/types: Fix glsl_dvec*_type() helpers |
| |
| Patrick Lerda (16): |
| |
| - glsl/nir: fix gl_nir_cross_validate_outputs_to_inputs() memory leak |
| - r300: fix vertex_buffer related refcnt imbalance |
| - r300: fix r300_destroy_context() related memory leaks |
| - r300: fix memory leaks when register allocation fails |
| - r300: fix constants_remap_table memory leak |
| - radeonsi/gfx10: fix main_shader_part_ngg_es memory leak |
| - r300: enable R400 cos and sin hardware vertex shader opcodes |
| - ac/llvm,radeonsi: fix memory leaks triggered by ac_nir_translate() errors |
| - r300: fix NIR passes regression |
| - r300: fix constants_remap_table memory leak related to the dummy shader path |
| - r300: fix r300_draw_elements() behavior |
| - panfrost: remove panfrost_create_shader_state() related dead code |
| - gallium/auxiliary/vl: fix typo which negatively impacts the src_stride initialization |
| - clover: fix pipe_box update regression |
| - clover: fix memory leak related to optimize |
| - r600: fix vertex state update clover regression |
| |
| Paul Gofman (3): |
| |
| - glsl: allow out arrays in #110 with allow_glsl_120_subset_in_110 |
| - driconf: add a workaround for Joe Danger 2 |
| - driconf: add a workaround for Joe Danger |
| |
| Paulo Zanoni (35): |
| |
| - zink: fix bind size handling in buffer_bo_commit() |
| - anv/sparse: add an extra step before anv_sparse_bind_resource_memory() |
| - anv/sparse: allow binding operations to match the resource size |
| - anv+zink/ci: remove recently fixed tests from the crash list |
| - anv/sparse: don't issue a single bind operation per vm_bind ioctl |
| - anv/sparse: leave the semaphore waits and signals to the vm_bind ioctl |
| - anv/sparse: don't use the bind_timeline when doing sparse binding |
| - anv: change the vm_bind-related kmd_backend vfuncs to return VkResult |
| - anv: add an anv_pipe_bits bit to allow invalidating the TLB |
| - anv/trtt: invalidate the TLB after writing TR-TT entries |
| - anv/trtt: update GFX_TRTT_VA_RANGE for LNL |
| - anv: don't leak device->vma_samplers |
| - anv: set shaderFloat64 to true when fp64_workaround_enabled |
| - driconf/anv: set fp64_workaround_enabled to DIRT 5 |
| - anv/xe: don't leak xe_syncs during trtt submission |
| - anv/xe: don't overwrite the result from vk_sync_wait() |
| - vulkan: don't zero-initialize STACK_ARRAY()'s stack array |
| - anv, iris: add missing CS_STALL bit for GPGPU texture invalidation |
| - anv: reduce struct anv_image_memory_range from 32 to 24 bytes |
| - vulkan: reduce struct vk_object_base by 8 bytes |
| - anv/sparse: remove useless isl_surf_get_tile_info() call |
| - anv/sparse: remove unnecessary popcount assertions |
| - anv/sparse: adjust sparse_bind_image_memory debug messages |
| - anv/sparse: remove unused dump_vk_sparse_memory_bind() |
| - anv/sparse: replace device->using_sparse with device->num_sparse_resources |
| - anv/sparse: rework anv_free_sparse_bindings() error handling |
| - anv/xe: extract anv_vm_bind_to_drm_xe_vm_bind() |
| - anv/xe: add a 'flags' parameter to the vm_bind() kmd_backend function |
| - anv/xe: slightly improve error handling for the vm_bind ioctl |
| - anv/xe: assert we're using drm_syncobjs only once |
| - anv/xe: de-duplicate xe_exec_fill_sync() |
| - anv/xe: rename and refactor xe_exec_fill_sync() |
| - anv/sparse: fail the right way in anv_GetDeviceImageSparseMemoryRequirements() |
| - anv: const-correct anv_{image,buffer}_is_sparse() |
| - isl: add ISL_TILING_64_XE2 to isl_tiling_to_name() |
| |
| Pavel Ondračka (38): |
| |
| - r300: fix reusing of color varying slots for generic ones |
| - r300: skip draw if vertex shader does not write gl_Position |
| - r300/ci: switch to deqp-runner suite |
| - r300/ci: add the KHR gles2 tests |
| - r300/ci: move streaming-texture-leak from fails to skips |
| - r300: fix writemask for nir_intrinsic_load_ubo_vec4 |
| - r300: skip backend DCE for vertex shaders |
| - r300: remove R3xx/R4xx backend absulute modifier lowering |
| - r300/ci: add dEQP on RV380 |
| - r300: remove backend SLE and SGT support |
| - r300: add r300_is_only_used_as_float helper |
| - r300: optimize out more modifiers produced later |
| - r300: lower comparison ops early in NIR |
| - r300: remove SGE, SNE, SLT, SGE lowering in the backend |
| - r300: remove the remaining of backend constant folding |
| - r300: remove backend support for SUB |
| - r300/ci: update piglit fails |
| - r300: remove compiler tests |
| - r300/ci: add two more observed piglit flakes |
| - r300: fix vs output register indexing |
| - r300: add explicit flrp lowering |
| - ci: install xwayland in x86_64_test-gl |
| - ci: build nine in debian-testing |
| - ci: build nine tests |
| - r300/ci: enable nine tests |
| - r300: explicitly check if sin/cos input is already in correct range |
| - r300: move sin/cos input fixups to finalize_nir |
| - r300: remove some late NIR passes |
| - nir/lower_vec_to_regs: always set cursor before inserting decl_reg |
| - r300: check for the extra restrictions on presubtract swizzles |
| - r300: move presubtract pass later |
| - r300: optimize swizzle for inline constants |
| - r300: inline unoptimized_ra ntr option |
| - r300: get rid of the unused ubo_vec4_max ntr option |
| - r300: remove the ntr lower_cmp option |
| - r300: move lower_fabb option out of the options struct |
| - r300: remove nir_to_rc_options wrapper |
| - r300/ci: failures list update |
| |
| Peyton Lee (6): |
| |
| - radeonsi/vpe: remove wait source surface fence and while loop |
| - radeonsi/vpe: disable info log |
| - radeonsi/vpe: move flush to si_vpe_processor_end_frame |
| - radeonsi/vpe: support multi-buffer |
| - radeonsi/vpe: pre-allocate stream struceutre |
| - radeonsi/vpe: add support for p010 |
| |
| Philip Rebohle (1): |
| |
| - radv: Remove dead shared variables after optimization loop. |
| |
| Philipp Zabel (6): |
| |
| - rusticl: work around reference-to-mutable-static warnings |
| - etnaviv: common: Add PIPE_3D feature bit |
| - etnaviv: Avoid duplicate query of ETNA_GPU_FEATURES_0 parameter |
| - etnaviv: hwdb: Add VIP_V7 and NN_XYDP0 feature bits |
| - etnaviv: Add nn_core_version field to etna_specs |
| - etnaviv/nn: Extend post-multiplier for v8 architecture |
| |
| Pierre-Eric Pelloux-Prayer (21): |
| |
| - radeonsi: compute epitch when modifying surf_pitch |
| - Revert "ci/radeonsi: disable VA-API testing on raven" |
| - radeonsi: emit cache flushes before draw registers |
| - radeonsi: adjust flags for si_compute_shorten_ubyte_buffer |
| - winsys/amdgpu: use syncobj rather than amdgpu fence |
| - ac, radeonsi: remove has_syncobj, has_fence_to_handle |
| - radeonsi: try to disable dcc if compute_blit is the only option |
| - meson: require libelf when radeonsi is built |
| - egl/drm: flush before calling get_back_bo |
| - radv: don't remove the blit queue from the device queues |
| - winsys/amdgpu: unmap user fence BO before destroy |
| - winsys/amdgpu: remove unused amdgpu_fence_is_syncobj |
| - wsi/wl: flush connection on swapchain failure |
| - mesa: deal with vbo_save_vertex_list::modes being NULL |
| - wsi/wl: check wsi_wl_surface's validity before use |
| - egl/wayland: use __DRI_IMAGE_PRIME_LINEAR_BUFFER in get_back_bo |
| - winsys/radeon: pass priv instead NULL to radeon_bo_can_reclaim |
| - radeonsi: preserve alpha if needed in kill_ps_outputs_cb |
| - amd: fix addrlib regression |
| - aco: don't use python 3.7+ feature in aco_opcodes.py |
| - radv: don't use python 3.9 feature in radv_annotate_layer_gen.py |
| |
| Qiang Yu (1): |
| |
| - radeonsi: split RADEON_USAGE_NEEDS_IMPLICIT_SYNC into CB and DB flags |
| |
| Ray Smith (2): |
| |
| - panfrost: Don't try to set bifrost blendable format on midgard |
| - panfrost: Fix format tables for v4 and v5 |
| |
| Rhys Perry (84): |
| |
| - radv: do nir_shader_gather_info after radv_nir_lower_rt_abi |
| - nir/lower_non_uniform: set non_uniform=false when lowering is not needed |
| - nir/lower_shader_calls: remove CF before nir_opt_if |
| - aco: fix labelling of s_not with constant |
| - aco: add VOPD format |
| - aco: add VOPD statistic |
| - aco: refactor schedule_ilp main loop |
| - aco: implement VOPD scheduler |
| - aco: enable VOPD scheduler |
| - aco: fix >8 byte linear vgpr copies |
| - aco/tests: fix to_hw_instr.swap_linear_vgpr |
| - aco: refactor create_vopd_instruction |
| - aco: swap operands to create VOPD instructions |
| - aco: turn v_mov_b32 into addition to create VOPD instructions |
| - aco: improve printing of VOPD instructions |
| - aco/tests: add tests for VOPD operand swapping |
| - aco/tests: use raw strings in form_hard_clauses.nsa |
| - radv: support minmax filter for more formats |
| - aco/ra: don't initialize assigned in initializer list |
| - aco/ra: fix GFX9- writelane |
| - aco: don't combine linear and normal VGPR copies |
| - aco/ra: disable p_start_linear_vgpr allocation hint |
| - aco: allow p_start_linear_vgpr to use multiple operands |
| - aco: require linear vgpr uses to be late kill |
| - aco: only allow linear vgpr kills in top-level blocks |
| - aco/ra: constify various RegisterFile |
| - aco/ra: move parallelcopy creation into helper |
| - aco/ra: change get_reg_bounds() helper |
| - aco/ra: rework linear VGPR allocation |
| - aco/ra: disable live range splitting of linear vgprs |
| - aco/ra: emit linear VGPR parallel copy separately |
| - aco/tests: add tests for linear VGPR register allocation |
| - aco: optimize for purely linear VGPR copies |
| - nir/algebraic: don't create 64-bit min/max/ior if lowered |
| - nir/algebraic: remove duplicated iand(ien, ine)/ior(ieq, ieq) patterns |
| - nir/algebraic: optimize 64-bit comparisons with zero'd halves to 32-bit |
| - nir/lower_int64: allow 64-bit comparisons when lowering minmax |
| - nir/search: fix nir_replace_instr() debug code |
| - aco: don't pass constant to is_overwritten_since() |
| - radv: don't advertise DGC with LLVM |
| - radv: stop using 5/8 component SSBO stores |
| - radv,aco: allow VS prologs to increase VGPR usage |
| - aco: don't reuse misaligned attribute destination VGPRs in VS prologs |
| - aco/util: add small_vec |
| - radv: use dual_color_blend_by_location with Half-Life Alyx |
| - aco/cssa: reset equal_anc_out if merging fails |
| - aco/cssa: update comments |
| - aco: fix GFX6 buffer_load_dwordx4 opcode number |
| - aco: rename opcode->instruction |
| - aco: refactor VOPC opcode list |
| - aco: use single tuple for all opcode numbers |
| - aco: use op() |
| - aco: move dot/wmma instructions into VOP3P list |
| - aco: unify MIMG opcode lists |
| - aco/gfx11: fix scratch ST mode assembly |
| - aco: split instruction assembly into functions |
| - aco: always emit float mode for merged shaders compiled separately |
| - aco: avoid breaking clauses with waitcnt |
| - nir: add mqsad_4x8, shfr and nir_opt_mqsad |
| - aco: implement mqsad_4x8 and shfr |
| - ac/llvm: implement mqsad_4x8 and shfr |
| - amd: set has_shfr32=true |
| - radv: optimize msad_4x8 to mqsad_4x8 |
| - radv: memset radv_pipeline_cache_object data |
| - nir: add nir_remove_after_cf_node helper |
| - aco: remove unreachable merge blocks |
| - aco: ensure loop exits exist in NIR |
| - aco: save/reset/combine has_divergent_continue in uniform branches |
| - nir,aco: add test intrinsics |
| - aco/tests: add isel test helpers |
| - aco/tests: add control flow tests |
| - aco: assume no unreachable blocks |
| - aco: don't include the clause in VMEM_CLAUSE_MAX_GRAB_DIST |
| - aco: remove occupancy check in dealloc_vgprs() |
| - aco/tests: don't assume constructor order |
| - aco/tests: remove LLVM 11 code |
| - radv: cache RT stage info |
| - aco: include LDSDIR in latency/etc stats |
| - aco: make store clauses more aggressively |
| - aco: schedule LDSDIR instructions |
| - aco: schedule LDS instructions |
| - aco: split vop3p results |
| - aco/waitcnt: fix DS/VMEM ordered writes when mixed |
| - aco: create lcssa phis for continue_or_break loops when necessary |
| |
| Rob Clark (31): |
| |
| - freedreno/a6xx: fix comment |
| - freedreno/registers: Pass full args to dump_c() |
| - freedreno/registers: De-duplicate xml_reg_files |
| - freedreno/registers: Don't re-parse files |
| - freedreno/registers: Generate copyright comment blurb |
| - freedreno/registers: Add basic kernel header support |
| - freedreno/registers: A couple newline changes |
| - tu/drm/virtio: Fix dmabuf import |
| - freedreno/drm: Submit should hold ref to device |
| - freedreno/drm: Fix teardown crash harder |
| - freedreno/decode: Fix prefetch handling for IB1 crash |
| - freedreno: Fix MSAA z/s layout in GMEM |
| - freedreno/crashdec: Find potential fault buffers |
| - tu: Give suballoc bo's a name |
| - freedreno/a6xx: Add dual_color_blend_by_location |
| - freedreno/a6xx: Fix z/s preserving sysmem clear blit |
| - freedreno/pps: Don't re-init perfcntrs |
| - freedreno: Add bo usage hints |
| - freedreno/drm: Add perfetto memory tracing |
| - tu: Add perfetto memory tracing |
| - pps: Enable memory traces |
| - pps: Config tweaks to avoid loosing traces |
| - freedreno/registers: Add license header |
| - egl/android: Fix gl_config dereference |
| - freedreno/drm/virtio: Fix deadlock on exit |
| - freedreno+virgl: Add missing driconf |
| - freedreno: Update a618 xfails |
| - ci: Add deqp fix for pipeline_statistics_3 tests |
| - tu: Fix a6xx lineWidthGranularity |
| - egl/android: Fix sRGB visuals |
| - freedreno/ir3: Fix ldg/stg offset |
| |
| Robert Beckett (1): |
| |
| - vulkan/wsi: fix force_bgra8_unorm_first |
| |
| Robert Mader (5): |
| |
| - crocus: Support offset query for multi-planar planes |
| - panfrost: Use pipe resource helper |
| - egl: Implement EGL_EXT_config_select_group |
| - egl: Implement EGL_MESA_x11_native_visual_id |
| - egl/x11: Allow all RGB visuals to match 32-bit RGBA EGLConfigs |
| |
| Robin Kertels (3): |
| |
| - nvk: Enable EXT_nested_command_buffer. |
| - nak: Enable lowering rotate to shuffle. |
| - nvk: Advertise VK_KHR_shader_subgroup_rotate. |
| |
| Rohan Garg (35): |
| |
| - anv: refactor emit_dynamic_buffer_binding_table_entry |
| - isl,blorp,anv: introduce ISL_TILING_64_XE2 for Xe2+ platforms |
| - anv: untyped data port flush required when a pipeline sets the VK_ACCESS_2_SHADER_STORAGE_READ_BIT |
| - anv: factor out common code for determining surface usage from a VkDescriptorType |
| - anv: cleanup duplicate robustness flag calculations |
| - anv: add a command streamer stall on Xe2+ when switching pipelines |
| - intel/compiler: Xe2+ can do URB load/store with a byte offset |
| - anv: drop duplicated 3DSTATE_SLICE_TABLE_STATE_POINTERS emission |
| - anv, blorp: Set COMPUTE_WALKER Message SIMD field |
| - intel/genxml: update PIPE_CONTROL so that we can decode it on the CCS |
| - iris,anv: WA 1509820217 is no impact for Xe2+ |
| - intel/brw: Use the dimensions supplied in the instruction |
| - intel/brw: Cleanup send generation |
| - intel/brw: Update written size depending on the LSC message |
| - intel/brw: Set the right cache control bits for xe2 |
| - intel/brw: Adjust src1 length bits for xe2+ |
| - anv,blorp: implement restrictions from WA 1406738321 |
| - anv: 3D surfaces have fewer layers for higher miplevels |
| - isl: enable CCS for 3D surfaces on gen12.5 and above |
| - intel/brw: account for sources when determining if a operation uses half floats |
| - intel/brw: Xe2+ can do SIMD16 for extended math on HF types |
| - intel/brw: update disassembly for MATH pipe |
| - intel/brw: adjust the copy propgation pass to account for wider GRF's on Xe2+ |
| - intel/brw: minor rework to de duplicate variable assignment |
| - intel/brw: Handle typed surface and atomic messages for xe2+ |
| - intel/brw: Lower DWORD scattered read writes to lsc |
| - intel/eu/validate: Allow SIMD16 for mixed mode float operations on xe2+ |
| - iris: slow clear higher miplevels on single sampled 8bpp resources that have TILE64 |
| - intel/blorp: add fast clear rectangle dimensions for single sampled TILE64 CCS surfaces |
| - isl: allow CCS on single sampled TILE64 surfaces |
| - anv: Enable HiZ on multi-LOD depth buffers. |
| - anv: use u_foreach_bit to iterate over the the view mask like we do for transition_clear_color |
| - anv: formatting fix when printing pipe controls |
| - anv: allocate space for generated indirect draw id's using the temporary allocation helper |
| - Revert "iris: slow clear higher miplevels on single sampled 8bpp resources that have TILE64" |
| |
| Roland Scheidegger (2): |
| |
| - auxiliary/draw: fix streamout overflow calculation |
| - auxiliary/rtasm: fix unaligned stores |
| |
| Romain Naour (1): |
| |
| - glxext: don't try zink if not enabled in mesa |
| |
| Ruijing Dong (6): |
| |
| - radeonsi/vcn: data structure av1 enc long term reference. |
| - radeonsi/vcn: vcn4 av1 long term ref support |
| - frontends/va: get av1 encoding ref frame infos for L0. |
| - radeonsi/vcn: add enc surface alignment caps |
| - frontends/va: add surface alignment attribute |
| - radeonsi/vcn: update to use correct padding size. |
| |
| Ryan Neph (3): |
| |
| - venus: fix shmem leak on vn_ring_destroy |
| - virgl: use PIPE_MAX_SAMPLERS in bind_samplers_states |
| - venus: reclaim signal semaphore feedback resources for wasteful clients |
| |
| Sagar Ghuge (28): |
| |
| - intel/fs: Track instance id in gs_thread_payload |
| - vulkan/runtime: Track VkSharingMode in vk_image |
| - anv: Disable compression if we have concurrent sharing mode |
| - intel/compiler/xe2: Handle 6-bit message type for Gfx20+ |
| - intel/compiler: Add texture operation lowering pass |
| - intel/compiler: Use nir_tex_src_backend1 to pack LOD and array index |
| - nir: Drop intel specific lowering code |
| - intel/compiler: Lower texture operation to combine LOD and AI |
| - intel/dev: Update max_subslices_per_slice comment |
| - intel/compiler: Fix disassembly of URB message descriptor on Xe2+ |
| - anv: Drop warnings for engine initialization failure |
| - anv: Set timestampValidBits to 64bits |
| - intel/compiler: Trim vector properly till array index |
| - intel/compiler: Adjust sample_b parameter according to new layout |
| - intel/compiler: Pack LOD/bias and array index on TG4 messages |
| - intel/compiler: Pack texture LOD and offset to a single 32-bit value |
| - intel/compiler: Add helper method to decide if header is required |
| - intel/compiler: Add gather4_i/l/[_c]/b sampler message |
| - intel/compiler: Add texture gather offset LOD/Bias message support |
| - nir: Allow nir_texop_tg4 in implicit derivative |
| - intel/compiler: Enable packing of offset with LOD or Bias |
| - anv: Implement VK_AMD_texture_gather_bias_lod |
| - anv/xe: Consider pat_index while unbinding the bo |
| - anv: Fix typo in DestinationAlphaBlendFactor value |
| - anv: Use appropriate argument format for indirect draw |
| - isl: Update isl_swizzle_supports_rendering comment |
| - isl: Update shader channel select for missing components |
| - intel/compiler: Disassemble mlen/rlen/ex_mlen in units of registers |
| |
| Saleemkhan Jamadar (1): |
| |
| - radeonsi/vcn: set jpeg reg version for gfx 1151 |
| |
| Samuel Pitoiset (419): |
| |
| - radv: constify stages in radv_rt_fill_group_info() |
| - radv/rt: re-use radv_ray_tracing_stage::sha1 for hashing RT pipelines |
| - radv: correctly return VK_ERROR_OUT_OF_DEVICE_MEMORY when mapping a BO fails |
| - radv/nir: pass radv_shader_stage to some radv_nir_xxx() functions |
| - radv/nir: remove useless struct for nir_shader typedef |
| - radv: remove one unused parameter in radv_fill_shader_info_ngg() |
| - radv: move radv_pipeline_key::mesh_fast_launch_2 to the per-device cache key |
| - radv: add radv_shader_stage_key to radv_shader_stage |
| - radv: use radv_shader_stage_key directly with pre-existing fields |
| - radv: add optimisations_disabled to radv_shader_stage_key |
| - radv: remove unecessary radv_nir_compiler_options::key |
| - radv: remove unused lower_rt_instruction_monolithic_state::key |
| - radv: stop passing the pipeline key when compiling compute/rt shaders |
| - radv: re-organize radv_pipeline_key |
| - radv: add vertex_robustness1 to radv_shader_stage_key |
| - radv: introduce radv_graphics_state_key |
| - zink/ci: skip more arb_shader_image_load_store.* on Polaris10/Navi10 |
| - radv: add keep_statistic_info to radv_shader_stage_key |
| - radv: add shader_version to radv_shader_stage_key |
| - radv: pass radv_shader_stage_key to radv_pipeline_stage_init() |
| - radv: make sure to retain shaders key for imported shaders with GPL |
| - radv: cleanup radv_generate_pipeline_key() |
| - radv: add radv_pipeline_get_shader_key() |
| - radv/rt: cleanup radv_parse_rt_stage() |
| - radv: hash radv_shader_stage_key |
| - radv: stop hashing radv_pipeline_key for compute/rt pipelines |
| - radv: remove the pipeline key for compute pipelines |
| - radv: remove the pipeline key for ray tracing pipelines |
| - radv: remove an extra new line in radv_shader.h |
| - radv: pass radv_graphics_state_key to radv_hash_shaders() |
| - radv: remove radv_generate_pipeline_key() |
| - radv: rename radv_pipeline_key to radv_graphics_pipeline_key |
| - radv: delay emitting streamout enable at draw time |
| - aco: silent checking if clrxdisasm is available |
| - radv: fix indirect dispatches on the compute queue on GFX7 |
| - radv: fix indirect draws with NULL index buffer on GFX10 |
| - radv: remove unused parameter to gather_shader_info_mesh() |
| - radv: add a per-stage key field for mesh shaders with a task shader |
| - vulkan: bump headers/registry to 1.3.276 |
| - lavapipe: fix build since vulkan spec update |
| - vulkan: promote VK_EXT_line_rasterization to KHR |
| - vulkan: promote VK_EXT_index_type_uint8 to KHR |
| - radv: add a helper for binding the custom blend mode |
| - radv: add a helper to get the VGT_GS_OUT value |
| - radv: prevent accessing NULL pipelines when emitting VBO with ESO |
| - radv: re-emit the TCS epilog when a new TCS is bound |
| - radv: enable prologs/epilogs in-memory cache for shader objects |
| - radv: add required NV entrypoints for VK_EXT_shader_object |
| - radv: initialize default dynamic state when beginning a new cmdbuf |
| - radv: add radv_shader_stage::next_stage field |
| - radv: add radv_shader_layout::dynamic_offset_count |
| - radv: add support for creating/destroying shader objects |
| - radv: make some pipeline graphics helpers non-static for ESO |
| - radv: add support for binding/emitting shader objects |
| - radv: advertise VK_EXT_shader_object on GFX6-8 |
| - radv: advertise VK_KHR_load_store_op_none |
| - radv: promote VK_EXT_line_rasterization to KHR |
| - radv: advertise VK_KHR_line_rasterization |
| - radv: promote VK_EXT_index_type_uint8 to KHR |
| - radv: advertise VK_KHR_index_type_uint8 |
| - radv: use device->vk.enabled_features instead of iterating twice |
| - radv: fix segfault when getting device vm fault info |
| - radv/ci: enable RADV_PERFTEST=shader_object for vkcts-polaris10-valve |
| - radv: refactor gfx103_pipeline_emit_vgt_draw_payload_cntl() |
| - radv: refactor gfx103_pipeline_emit_vrs_state() |
| - radv: use the non-emitted graphics pipeline for the needed dynamic states |
| - radv: fix the late scissor emission workaround with ESO on GFX9 |
| - radv: set NGG fields in vgt_shader_key for ESO on GFX10+ |
| - radv: do not ignore RADV_DYNAMIC_FRAGMENT_SHADING_RATE for ESO on GFX10.3+ |
| - radv: emit more default states for ESO on GFX10.3+ |
| - radv: export alpha-to-coverage via MRTZ for ESO on GFX11 |
| - radv: fix detecting invalid binaries with ESO |
| - radv: fix emitting tess domain origin for merged TES+GS on GFX9 |
| - radv: emit required programming for tess on GFX10+ in radv_emit_hw_vs() |
| - radv: rebind mesh/task shaders when restoring meta context |
| - radv: determine next stage for mesh/task with ESO |
| - radv: ignore unneeded dynamic states with mesh shaders and ESO |
| - radv: determine the last VGT api stage with mesh shaders and ESO |
| - radv: bind and emit mesh/task shaders with ESO |
| - radv: prevent crashes when a task shader is compiled unlinked with ESO |
| - radv: init the shader key in radv_shader_stage_init() for ESO |
| - radv: add support for VK_SHADER_CREATE_NO_TASK_SHADER_BIT_EXT |
| - radv: add a helper to know if device fault detection is enabled |
| - radv: refactor dumping GPU hang reports by using chunks |
| - radv: add support for keeping GPU hang reports in memory |
| - radv: export GPU hang reports through VK_EXT_device_fault |
| - radv: enable deviceFaultVendorBinary if RADV_DEBUG=hang is set |
| - radv: remove radv_graphics_state_key::dynamic_patch_control_points |
| - radv: determine the workgroup size for TCS earlier |
| - radv: set the default workgroup size for VS as LS |
| - radv: constify radv_device in radv_emit_shader_pointer() |
| - radv: check active NIR stages before trying to merge shaders on GFX9+ |
| - radv: only merge shader info stages if both stages exist on GFX9+ |
| - radv: rework shader arguments for separate compilation of VS+TCS on GFX9+ |
| - radv: always mark drawid/base_instance used with ESO |
| - radv: force TCS stage for VS as LS compiled separately on GFX9+ |
| - radv: always emit PGM_RSRC1_HS when emitting the TCS epilog state |
| - radv: add support for emitting VS+TCS compiled separately on GFX9+ |
| - radv: do not allow to enable VK_EXT_shader_object with LLVM |
| - radv: add a workaround for mipmaps and minLOD on GFX6-8 |
| - radv/sqtt: fix describing queue submits for RGP |
| - radv: limit maxIndirectCommandsTokenCount to 512 |
| - radv: remove one indentation level in radv_fill_shader_info_ngg() |
| - radv: squash GFX10/GFX10.3 NGG restrictions in the same condition |
| - radv: always set GS as NGG if present on GFX11 |
| - radv: use next_stage to determine the NGG stage |
| - radv: check for MESA_SHADER_TESS_EVAL in radv_fill_shader_info_ngg() |
| - radv: determine the ES stage earlier when processing binary config |
| - radv: determine the workgroup size for GS non-NGG earlier |
| - radv: set the default workgroup size for VS/TES as ES |
| - radv: change the user SGPR idx of AC_UD_TES_STATE |
| - radv: add a new user SGPR for the ESGS ring item size |
| - radv/nir: lower esgs_vertex_stride for GS compiled separately on GFX9+ |
| - radv: rework shader arguments for separate compilation of VS+GS on GFX9+ |
| - radv: declare streamout buffers for VS+GS compiled separately on GFX9+ |
| - radv: force GS stage for VS as ES compiled separately on GFX9+ |
| - radv: add support for emitting VS+GS compiled separately on GFX9+ |
| - radv/ci: remove VKD3D_CONFIG=dxr11 for navi21/navi31 |
| - radv: remove unused radv_indirect_command_layout::state_offset |
| - radv: only load 3x32-bit elements when emitting draws with mesh shader |
| - docs: fix RADV_DEBUG=nonggc description |
| - radv: add RADV_DEBUG=nongg_gs for GFX10/GFX10.3 |
| - radv: add radv_disable_ngg_gs and enable it for Persona 3 Reload |
| - radv: fix RGP barrier reason for RP barriers inserted by the runtime |
| - radv: force GS stage for TES as ES compiled separately on GFX9+ |
| - radv: declare streamout buffers for TES+GS compiled separately on GFX9+ |
| - radv: declare AC_UD_TES_STATE for separate compilation of GS on GFX9+ |
| - radv: bind the vertex input SGPR only for relevant stages |
| - radv: add support for emitting TES+GS compiled separately on GFX9+ |
| - radv: allow RADV_PERFTEST=shader_object on GFX9/VEGA10 |
| - radv/ci: enable RADV_PERFTEST=shader_object on VEGA10 |
| - radv: cleanup radv_shader_combine_cfg_vs_tcs() |
| - radv: fix emitting VS prologs for merged shaders compiled separately on GFX10+ |
| - radv: clear RADV_CMD_DIRTY_SHADERS when resetting the shader object state |
| - radv: clear the custom blend mode when resetting gfx pipeline state |
| - radv: fix re-emitting DB_RENDER_CONTROL when resetting gfx pipeline state |
| - radv: make sure to reset the GS copy shader with ESO |
| - radv: fix selecting shader variants with ESO |
| - radv: fix setting the rasterized primitive for ESO |
| - radv: enable GS_FAST_LAUNCH=2 by default for RDNA3 APUs (Phoenix) |
| - radv: only configure {XYZ_DIM,DRAW_INDEX}_REG for mesh shaders if enabled |
| - radv: re-enable GS_FAST_LAUNCH=2 by default on GFX11 |
| - radv: stop using conditional rendering internally when preprocessing DGC |
| - radv: disable conditional rendering if enabled when preprocessing DGC |
| - radv: pass the ES stage when emitting geometry shader with ESO |
| - radv: determine the ES stage for merged NGG shaders compiled separately |
| - radv: prefix radv_vgt_shader_key::streamout with ngg |
| - radv: set radv_vgt_shader_key::ngg_streamout for ESO |
| - radv: determine the number of invocations only for VS/TES as NGG |
| - radv: store the number of outputs for VS/TES as NGG |
| - radv: use radv_shader_info for computing NGG LDS layout |
| - radv: fix a compilation warning in radv_bind_graphics_shaders() |
| - spirv: only consider IO variables when adjusting patch locations for TES |
| - radv: move mesh_fast_launch_2 to radv_physical_device |
| - radv: initialize disk cache slightly later when creating a physical device |
| - radv: introduce a per physical device cache key |
| - radv: fix binary shaders compatibility with ESO |
| - radv: fix indirect dispatches on compute queue with conditional rendering on GFX7 |
| - radv: remove the union in radv_shader_object |
| - radv: fix a big memleak with VK_EXT_shader_object |
| - radv: free NIR shaders when creating linked shaders with ESO |
| - radv: simplify binding the GS copy shader with ESO |
| - radv: rename radv_emit_shaders() to radv_emit_graphics_shaders() |
| - radv: simplify emitting VGT_ESGS_RING_ITEMSIZE for ESO |
| - radv: re-emit more states when a shader compiled separately is bound |
| - radv: only enable emulated mesh/task shader invocations on GFX10.3 |
| - radv: add support for mesh primitives queries on GFX11 |
| - radv: add support for task shader invocations queries on GFX11 |
| - radv: remove a TODO about adding mesh/task queries on GFX11 |
| - radv: store/reset conditional rendering user info in the helpers |
| - radv: add support for conditional rendering on the compute queue with DGC |
| - radv: remove unused parameter in gfx10_get_ngg_query_info() |
| - radv: do not set gs.has_pipeline_stat_query twice for NGG GS |
| - radv: use so.num_outputs to determine if NGG shaders need XFB queries |
| - radv: determine NGG query info before linking shader info |
| - radv: pass gfx10_ngg_info to gfx10_get_ngg_info() |
| - radv: pass radv_shader_info to gfx10_get_ngg_info() |
| - radv: determine NGG culling info before NGG info |
| - radv: compute NGG scratch LDS base in gfx10_get_ngg_info() |
| - radv: compute the total LDS usage in gfx10_get_ngg_info() |
| - radv: disable VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 with minmax filter on GFX6 |
| - radv/ci: enable RADV_PERFTEST=shader_object for vkcts-tahiti-valve |
| - radv: clean up MAX_ALLOWED_TILES_IN_WAVE programming |
| - radv: add missing RADV_DEBUG_NO_NGG_GS to the physical device cache key |
| - radv: fix conditional rendering with direct mesh+task draws and multiview |
| - radv: move conditional rendering for compute in radv_cmd_state |
| - radv: get the pipeline layout info from the push constant token with DGC |
| - radv: add a helper to calculate the compute resource limits |
| - radv: add a function to get compute pipeline metadata for DGC |
| - radv: add support for VK_PIPELINE_CREATE_INDIRECT_BINDABLE_BIT_NV |
| - radv: implement vkGetPipelineIndirectXXX() for DGC |
| - radv: implement vkCmdUpdatePipelineIndirectBufferNV() |
| - radv: implement indirect compute pipeline binds with DGC |
| - radv: handle indirect pipeline binds with scratch and DGC |
| - radv: force shader BOs to be local BOS with DGC indirect compute pipelines |
| - radv: enable deviceGeneratedComputePipelines |
| - radv: fix conditional rendering on compute queue on GFX6 |
| - radv: add missing conditional rendering for indirect dispatches on GFX6 |
| - radv: add a helper to emit PKT3_COND_EXEC |
| - radv: add a new user SGPR for NGG shaders compiled separately with ESO |
| - radv: lower lds_ngg_{gs_out_vertex_base,_scratch_base} with ESO |
| - radv: add support for emitting NGG shaders with ESO |
| - radv: allow RADV_PERFTEST=shader_object on GFX11 |
| - radv: enable radv_zero_vram for RAGE2 |
| - radv: preserve streamout_buffers user SGPR for VS/TES + GS compiled separately |
| - radv: always use ace_cs for the gang CS variable |
| - radv: refactor emitting the view index for task shaders |
| - radv: allocate a 32-bit value for the MEC fw bug with indirect mesh+task earlier |
| - radv: stop passing radv_cmd_buffer to draw functions with task shaders |
| - radv/ci: remove RT tests from the VANGOGH skip list |
| - radv/ci: remove dEQP-VK.robustness.* from the VANGOGH skip list |
| - radv: disable NGG in more situations with ESO on GFX10/GFX10.3 |
| - radv: implement has_vgt_flush_ngg_legacy_bug for ESO |
| - radv: allow RADV_PERFTEST=shader_object on GFX10/GFX10.3 |
| - radv,aco: stop duplicating PS/TCS epilog fields |
| - radv: add a helper to emit PS/TCS epilogs |
| - radv/ci: enable RADV_PERFTEST=shader_object for VKCTS jobs on GFX10+ |
| - radv/ci: mark dEQP-VK.shader_object.binding.mesh_swap_task as flake on NAVI21 |
| - radv: stop using the custom blend mode for PS epilogs |
| - radv: re-emit RB+ state with PS epilogs only when the col format changes |
| - radv: fix emitting default blend state for PS without epilogs and ESO |
| - radv: allow RADV_PERFTEST=shader_object on all GFX9 GPUs |
| - radvi/ci: enable RADV_PERFTEST=shader_object for RENOIR |
| - util/u_debug: fix parsing of "all" again |
| - ci: use Linux kernel 6.6 for RADV |
| - radv/ci: update list of flakes for VKCTS jobs |
| - zink/ci: update list of flakes for RADV jobs |
| - ac/nir: fix exporting NGG streamout outputs with implicit PrimId from VS/TES |
| - radv: fix determining if PrimId is used for merged shaders compiled separately |
| - radv: determine radv_vgt_shader_key::has_ngg_xxx with the last VGT shader |
| - radv: rework generating vgt_shader_key for pipelines |
| - radv: fix wave32 support with ESO |
| - radv: add helpers to bind the GS copy shader and the RT prolog |
| - radv: fix RADV_PERFTEST=dmashaders with ESO |
| - radv: emit VGT_GS_OUT_PRIM_TYPE as part of the dynamic primitive topology |
| - radv: disable binning correctly on GFX11.5 |
| - radv: fix programming of PA_SC_BINNER_CNTL_1.MAX_ALLOC_COUNT on GFX11 |
| - radv: program SAMPLE_MASK_TRACKER_WATERMARK optimally for GFX11 APUs |
| - radv: add a GPU hang workaround for legacy tess+GS for GFX10.3 |
| - radv: fix occlusion queries with MSAA and no attachments |
| - radv: add radv_force_pstate_peak_gfx11_dgpu and enable it for Helldivers 2 |
| - zink/ci: enable RADV_PERFTEST=shader_object for polaris10 |
| - radv: add a workaround for null IBO on GFX6 |
| - zink/ci: update CI lists |
| - radv: always export MRTZ in FS epilogs with ESO on GFX11 |
| - radv: trigger a new PS epilog when the framebuffer is dirty with ESO |
| - zink/ci: allow RADV_PERFTEST=shader_object on NAVI31 |
| - radv: invalidate L2 metadata for VK_ACCESS_2_MEMORY_READ_BIT |
| - radv: make sure to disable NGG culling with TES when the FS stage is unknown |
| - zink/ci: enable RADV_PERFTEST=shader_object for NAVI10/VANGOGH |
| - radv/rmv: add missing logging when sparse BOs are destroyed |
| - radv/rmv: add missing logging when events are destroyed |
| - radv/rmv: fix logging of per-queue destroyed BOs |
| - radv/rmv: fix logging sparse residency |
| - radv/winsys: move BO size to radeon_winsys_bo |
| - radv/rmv: remove BO size parameter in radv_rmv_log_bo_allocate() |
| - radv: make some create resources helpers static |
| - radv/rmv: remove unnecessary is_internal parameter to some helpers |
| - radv: add radv_bo_{create,destroy}() helpers |
| - radv/rmv: prevent logging BOs allocated in GDS/OA domains |
| - radv/rmv: log allocated/destroyed BOs in radv_buffer_{create,destroy}() |
| - radv: add radv_bo_virtual_bind() helper |
| - radv: fix conditional rendering with mesh+task and multiview (again) |
| - radv: remove useless RADV_DEBUG=nomemorycache |
| - radv: implement alpha-to-one |
| - radv: advertise alphaToOne |
| - radv: advertise extendedDynamicState3AlphaToOneEnable with ACO |
| - docs: add alpha-to-one features for RADV |
| - radv: rename radv_physical_device variables to pdev everywhere |
| - radv/winsys: rename gpu_info to pci_ids in the null winsys |
| - radv: rename radeon_info variables to gpu_info everywhere |
| - radv: rename radv_physical_device::rad_info to info |
| - radv: remove radv_device::physical_device |
| - radv: remove radv_queue::device |
| - radv: remove radv_cmd_buffer::device |
| - radv: remove radv_device::instance |
| - radv: remove radv_physical_device::instance |
| - radv: declare radv_cmd_update_descriptor_xxx() in radv_descriptor_set.h |
| - radv: declare format related functions in radv_formats.h |
| - radv: pass a radv_physical_device to radv_use_llvm_for_stage() |
| - radv: move radv_device_supports_etc() to radv_physical_device.c |
| - radv: move some VK_DEFINE_NONDISP_HANDLE_CASTS to radv_descriptor_set.h |
| - radv: add radv_sampler.h |
| - radv: add radv_event.h |
| - radv: add radv_buffer_view.h |
| - radv: add radv_buffer.h |
| - radv: add radv_video.h |
| - radv: add radv_image.h |
| - radv: add radv_image_view.h |
| - radv: add radv_query.h |
| - radv: add radv_perfcounter.h |
| - radv: add radv_device_generated_commands.h |
| - radv: enable radv_zero_vram for Red Dead Redemption 2 |
| - vulkan/debug_utils: add a helper for reporting address binding |
| - radv: implement VK_EXT_device_address_binding_report |
| - radv: advertise VK_EXT_device_address_binding_report |
| - radv: move radv_prim_vertex_count to si_cmd_buffer.c |
| - radv: move radv_userdata_locations to radv_shader_args.h |
| - radv: move radv_shader_{layout,stage} to radv_shader.h |
| - radv: add radv_device_memory.h |
| - radv: add radv_instance.h |
| - radv: add radv_queue.h |
| - radv: add radv_physical_device.h |
| - radv: add radv_rra.h |
| - radv: add radv_device.h |
| - radv: add radv_pipeline_cache.h |
| - radv: add radv_pipeline.h |
| - radv: add radv_pipeline_compute.h |
| - radv: add radv_pipeline_rt.h |
| - radv: add radv_pipeline_graphics.h |
| - radv: add radv_wsi.h |
| - radv: add radv_sqtt.h |
| - radv: add radv_shader_object.h |
| - radv: add radv_spm.h |
| - radv: add radv_cmd_buffer.h |
| - radv: add radv_rmv.h |
| - radv: add radv_cp_reg_shadowing.h |
| - radv: add radv_printf.h |
| - radv: move radv_get_tdr_timeout_for_ip() to radv_query.h |
| - radv: move radv_queue_ring() to radv_queue.c |
| - radv: add radv_nir_to_llvm.h |
| - radv: add radv_android.h |
| - radv: add radv_shader_info.h |
| - radv: move CP DMA related code to radv_cp_dma.c/h |
| - radv: move more cmd buffer related code to radv_cmd_buffer.c |
| - radv: merge radv_write_guardband() with radv_emit_guardband_state() |
| - radv: merge radv_write_scissors() with radv_emit_scissor() |
| - radv: move radv_get_viewport_xform() to radv_pipeline_graphics.c |
| - radv: move radv_create_gfx_config() to radv_device.c |
| - radv: move radv_emit_{compute,graphics}() to radv_queue.c |
| - radv: move code related to sample positions to radv_device.c |
| - radv: rename si_cmd_buffer.c to radv_cs.c |
| - radv: remove unused radv_printflike() |
| - radv: remove pre-declarations needed for WSI entrypoints |
| - radv: remove remaining forward declarations and comments in radv_private.h |
| - radv: replace RADV_FROM_HANDLE by VK_FROM_HANDLE |
| - radv: add missing endif comment for some headers |
| - radv: rename remaining phys_dev occurrences to pdev |
| - radv: replace radv_minify() by u_minify() |
| - radv: replace align_{u32,u64}() by align{64}() |
| - radv: replace align_u32_npot() by ALIGN_NPOT |
| - radv: replace radv_float_to_{u,s}fixed() by util_{un}signed_fixed() |
| - util: add util_is_aligned() |
| - radv: replace radv_is_aligned() by util_is_aligned() |
| - radv: move RADV_SUPPORT_CALIBRATED_TIMESTAMPS to radv_physical_device.c |
| - radv: move RADV_API_VERSION to radv_instance.h |
| - radv: move CLOCK_MONOTONIC_RAW define to radv_physical_device.h |
| - radv: move RADV_USE_WSI_PLATFORM define to radv_wsi.h |
| - radv: remove radv_private.h |
| - radv: make radv_get_vgt_index_size() static |
| - radv: move radv_get_user_sgpr() to radv_shader.c |
| - radv: move radv_queue_family_to_ring() to radv_queue.c |
| - radv: remove old comment in radv_cs.c |
| - radv: move radv_printf_data to radv_printf.h |
| - radv: make sure the heap budget is less than or equal to the heap size |
| - radv: use SPDX-License-Identifier |
| - radv: enable VK_EXT_shader_object by default |
| - aco: use SPDX-License-Identifier |
| - ci: uprev vkd3d-proton to c3b385606a93baed42482d822805e0d9c2f3f603 |
| - docs: mark VK_KHR_maintenance6 as DONE for RADV |
| - radv: determine if the cache is disabled at device creation time |
| - radv: add skip_shaders_cache also for compute/rt pipelines |
| - radv: stop using a graphics pipeline for generating the graphics key |
| - radv/rt: constify device in radv_init_rt_stage_hashes() |
| - radv/rt: handle creation feedback like graphics/compute pipelines |
| - radv/rt: stop passing pCreateInfo to radv_ray_tracing_pipeline_cache_search() |
| - radv/rmv: fix missing image bind logging for WSI images |
| - radv: fix missing addr binding report for WSI image binds |
| - radv: fix addr binding report for disjoint image binds |
| - radv/rmv: fix image binds logging for disjoint images |
| - radv: add a helper to set image bindings |
| - radv: fix missing unbind report when an image is destroyed |
| - radv: fix missing unbind report when a buffer is destroyed |
| - radv/rt: remove dead code about intersection shaders in radv_pipeline_get_shader_key() |
| - radv: add a helper for hashing pipelines |
| - radv: rework and add a helper for hashing a compute pipeline |
| - radv: stop ignoring shader stages that don't need to be imported with GPL |
| - radv: add missing SQTT markers when an indirect indexed draw is used with DGC |
| - radv/rt: use radv_pipeline_hash_shader_stage() |
| - radv/rt: stop computing unused hash for the traversal shader |
| - radv: use canonicalized VA for VM fault reports |
| - radv: simplify importing pipeline layout with GPL |
| - radv: return early when PS is NULL in radv_pipeline_init_blend_state() |
| - radv: simplify checking for PS epilogs in radv_pipeline_init_blend_state() |
| - radv: remove unused parameter in radv_skip_graphics_pipeline_compile() |
| - radv: simplify the check for exporting multiview in the last VGT stage |
| - radv/rt: remove unnecessary pipeline parameter to radv_rt_fill_group_info() |
| - radv/rt: remove unnecessary pipeline parameter to radv_generate_rt_shaders_key() |
| - radv/rt: initialize shader group capture/replay in a separate function |
| - radv/rt: rework handle_from_stages to pass hashes directly |
| - radv/rt: insert shaders to cache right after they are compiled |
| - radv/rt: add radv_rt_pipeline_compile() |
| - radv: clear color attachments without exports before compaction |
| - ci: uprev CTS to vulkan-cts-1.3.8.0 |
| - radv/ci: add one more flake since CTS 1.3.8.0 for RENOIR |
| - radv/ci: update lists for TAHITI and Zink/Polaris10 |
| - radV/ci: reduce the parallelism of navi21 to 3 |
| - radv: fix waiting for occlusion queries on GFX6-8 |
| - radv: return per plane requirements for disjoint images |
| - zink/ci: update CI lists since piglit uprev |
| - radv/rt: remove unnecessary param to radv_ray_tracing_pipeline_cache_insert() |
| - radv/rt: move radv_ray_tracing_pipeline::sha1 to radv_pipeline |
| - radv: use radv_pipeline::sha1 for graphics/compute pipelines |
| - radv: rework pipeline cache search helpers |
| - radv: add RADV_DEBUG=psocachestats to report per-pipeline cache hits/misses |
| - vulkan: pass cmdbuf level to vk_command_buffer_ops::create() |
| - radv/amdgpu: do not use IB2 for nested command buffers |
| - radv: track if nested command buffers uses indirect draws |
| - radv: advertise VK_EXT_nested_command_buffer |
| - ac,radeonsi: add helpers to compute the number of tess patches/lds size |
| - radv: rework the number of tess patches computation |
| - ac: allow to use 64K of LDS for tessellation on GFX9+ |
| - ci: uprev CTS to 1.3.8.2 |
| - radv: fix image format properties with fragment shading rate usage |
| - radv: remove bogus VkShaderCreateInfoEXT::flags being 0 assert for compute |
| - radv: allow 3d views with VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT |
| - radv: mark some formats as unsupported on GFX8/CARRIZO |
| - radv: set image view descriptors as buffer for non-graphics GPU |
| - radv: only set ALPHA_IS_ON_MSB if the image has DCC on GFX6-9 |
| |
| Sathishkumar S (5): |
| |
| - ac/gpu_info: query the number of ip instance |
| - radeonsi/vcn: avoid hard-coding the number of jpeg instance |
| - ac/gpu_info: fix regression in vulkan hw decode |
| - radeonsi/vcn: use num_instances from radeon_info |
| - ac/gpu_info: update multimedia info |
| |
| Sean Anderson (2): |
| |
| - gallium: lima: Don't round height when checking alignment |
| - Add Xilinx ZynqMP KMSRO entrypoint |
| |
| Sebastian Wick (4): |
| |
| - radeonsi: Destroy queues before the aux contexts |
| - util: Add timespec_sub_saturate to avoid negative time for deadlines |
| - loader/wayland: Add fallback wl_display_dispatch_queue_timeout |
| - vulkan/wsi/wayland: Use dispatch_queue_timeout in acquire_next_image |
| |
| Sergi Blanch Torne (10): |
| |
| - ci: disable Collabora's farm due to maintance |
| - Revert "ci: disable Collabora's farm due to maintance" |
| - ci: disable Collabora's farm due to maintance |
| - Revert "ci: disable Collabora's farm due to maintance" |
| - ci: Nightly run expectations update |
| - ci: disable Collabora's farm due to maintance |
| - Revert "ci: disable Collabora's farm due to maintance" |
| - Uprev Piglit to f7ece74a107a2f99b2f494d978c84f8d51faa703 |
| - ci: kernel stored in a different s3 bucket |
| - ci: identify and label S3 buckets |
| |
| Shih, Jude (4): |
| |
| - amd/vpelib: Need a debug flag to support 2tap downscaling |
| - amd/vpelib: Add VPE prefix on API to avoid naming conflict |
| - amd/vpelib: Solve the downscaling problem for 2 tap |
| - amd/vpelib: Solve link error due to missing static for one function |
| |
| Sil Vilerino (25): |
| |
| - d3d12: Do not assume multi-subregion support when querying for driver encode support |
| - d3d12: Implement cap for PIPE_VIDEO_CAP_ENC_INTRA_REFRESH |
| - d3d12: Clean up H264 video decode interlaced code path |
| - frontends/va, d3d12: Fix PIPE_VIDEO_SLICE_MODE_MAX_SLICE_SICE -> PIPE_VIDEO_SLICE_MODE_MAX_SLICE_SIZE typo |
| - d3d12: d3d12_video_encoder_negotiate_current_h264_slices_configuration to use correct mode when intra-refresh is on |
| - d3d12: Do not use PIPE_BIND_DISPLAY_TARGET for d3d12_video_buffer |
| - d3d12: AV1 encode - Configure CQP using qp and new qp_inter parameters |
| - d3d12: H264 encode - Update CQP using current frame type as per VA frontend change |
| - d3d12: HEVC encode - Update CQP using current frame type as per VA frontend change |
| - frontend/va: Support media only post proc without compositor using shaders or surfaces |
| - frontend/va: Use get_resources in VaDeriveImage for media only devices without get_surfaces support |
| - d3d12: Add partial media, compute, graphics support with CORE and GENERIC feature levels |
| - d3d12: Refactor graphics functions from context and blit to separate files |
| - d3d12: Add GetDesc wrapper for ID3D12Heap |
| - d3d12: Only check D3D12_FEATURE_DATA_PLACED_RESOURCE_SUPPORT_INFO for D3D_FEATURE_LEVEL_1_0_GENERIC |
| - d3d12: Video Encode - Add driver workaround for rate control reconfiguration |
| - d3d12: Implement PIPE_VIDEO_CAP_ENC_SURFACE_ALIGNMENT |
| - d3d12: Bump directx-headers dependency to v613 |
| - d3d12: Support H264 slice L0/L1 active number override |
| - d3d12: Support HEVC slice L0/L1 active number override |
| - d3d12: Fix leak of batch->bos on video-only builds |
| - d3d12: Fix leak dxil_module::serialized_dependency_table |
| - d3d12: Fix util_blitter_destroy destruction ordering |
| - vl_win32_screen_create: Take ownership of winsys injected to created d3d12_screen |
| - d3d12/ci: Add vainfo with appverifier CI check |
| |
| Simon Ser (2): |
| |
| - egl/wayland: ensure wl_drm is available before use |
| - egl/wayland: explain why implicit modifier downgrade is allowed |
| |
| Stéphane Cerveau (1): |
| |
| - vulkan/video: hevc: b-frames can be reference or not |
| |
| Surafel Assefa (1): |
| |
| - radeonsi: Adds return on failure to get plane info |
| |
| Sviatoslav Peleshko (6): |
| |
| - nir: Use alu source components count in nir_alu_srcs_negative_equal |
| - anv,driconf: Add sampler coordinate precision workaround for AoE 4 |
| - driconf: Apply dual color blending workaround to Dying Light |
| - anv: Store host-located copy of NULL surface state for faster memcpy |
| - dri: Flush the context after flush_resource when creating shareable image |
| - anv: Fix descriptor sampler offsets assignment |
| |
| Tapani Pälli (35): |
| |
| - hasvk: remove cmd_buffer_ray_query_globals function decl |
| - hasvk: remove gfx9 specific code from emit_sample_pattern |
| - hasvk: remove softpin (GFX_VERx10 >= 90) related code |
| - hasvk: remove gfx9 specific cs stall from emit_ps_depth_count |
| - anv: check for wa 16013994831 in emit_so_memcpy_end |
| - iris: expand pre-hiz data cache flush to gfx >= 125 |
| - anv: expand pre-hiz data cache flush to gfx >= 125 |
| - iris: replace constant cache invalidate with hdc flush |
| - anv: move \*bits_for_access_flags to genX_cmd_buffer |
| - anv: use workaround framework for Wa_22018402687 |
| - intel/blorp: add a TODO note about stencil buffer resolve |
| - intel: refactor urb configuration, add intel_urb_config |
| - intel/common: provide a helper for urb setup comparison |
| - blorp/crocus: refactor blorp_emit_urb_config |
| - iris: implement Wa_16014912113 |
| - anv: implement Wa_16014912113 |
| - blorp: implement Wa_16014912113 callback for drivers |
| - anv: flush tile cache independent of format with HIZ-CCS flush |
| - anv: revert cache flushing changes for indirect commands |
| - intel/blorp: disable use of REP16 independent of format |
| - iris: make sure DS and TE are sent in pairs on >= gfx125 |
| - iris: make sure aux is disabled for external objects |
| - anv: make sure aux is disabled for memory objects |
| - hasvk: make sure aux is disabled for memory objects |
| - crocus: make sure aux is disabled for memory objects |
| - anv: use workaround framework for Wa_16013000631 |
| - anv: setup distribution granularity with Wa_14019166699 |
| - iris: refactor function that checks primitive id usage |
| - iris: setup distribution granularity with Wa_14019166699 |
| - anv: disable fcv optimization on >= gfx125 |
| - intel/blorp: remove unused blorp batch flag |
| - intel/compiler: add assert for Wa_22017182272 |
| - anv: add dirty tracking for push constant data |
| - iris: change stream uploader default size to 2MB |
| - anv: skip gfx push constants alloc optimization on gfx9/11 |
| |
| Tatsuyuki Ishi (8): |
| |
| - radv: Recompute max_waves after postprocessing RT config |
| - radv: never set DISABLE_WR_CONFIRM for CP DMA clears and copies |
| - util: Optimize mesa_hex_to_bytes |
| - radv: Add radv_spirv_to_nir_options that summarize early gfx states. |
| - radv: Rename cache_search_nir to cache_lookup_nir_handle. |
| - radv: Re-sort RADV_PERFTEST env vars in docs |
| - radv: Implement NIR caching behind RADV_PERFTEST=nircache. |
| - radv: Remove radv_queue::device again |
| |
| Teng, Jin Chung (1): |
| |
| - d3d12: HEVC Encode - Query slice config mode based on user slice setting |
| |
| Thomas H.P. Andersen (6): |
| |
| - nvk: promote VK_EXT_index_type_uint8 to KHR |
| - nvk: promote VK_EXT_line_rasterization to KHR |
| - nvk: promote load_store_op_none to KHR |
| - docs: update features.txt for nvk |
| - nvk: support driconf option force_vk_vendor |
| - driconf: override vendor id for X4 Foundations on NVK |
| |
| Thong Thai (2): |
| |
| - radeonsi/vcn: remove EFC support for renoir |
| - frontends/va/postproc: do not use efc if image is to be translated |
| |
| Timothy Arceri (78): |
| |
| - glsl: add nir version of validate_geometry_shader_emissions() |
| - glsl: use nir version of geom stream validation |
| - glsl: remove now unused GLSL IR validate_geometry_shader_emissions() |
| - glsl: don't tree graft globals |
| - Revert "ci: Enable GALLIUM_DUMP_CPU=true only in the clang job" |
| - glsl: add basic params for AMD_gpu_shader_half_float extension |
| - glsl: add half float support to the parser |
| - glsl: add explicit half float conversion support |
| - glsl: update assert to allow for half float support |
| - glsl: add glsl_type_is_float_16() helper |
| - glsl: add implicit half float conversions |
| - glsl: add ubo packing support for half floats |
| - glsl: skip conversion of half float back to float for GL queries |
| - glsl: add some new helpers for half float builtin functions |
| - glsl: add half float angle and trigonometry functions |
| - glsl: add half float exponential functions |
| - glsl: add f2f16() helper to ir_builder |
| - glsl: add half float support for common functions |
| - glsl: add support for half float packing functions |
| - glsl: add half float geometric functions |
| - glsl: add half float matrix functions |
| - glsl: add half float vector relational functions |
| - glsl: allow half float varyings |
| - glsl: add half float interpolation functions |
| - glsl: add half float derivative functions |
| - glsl: add half float AMD_shader_trinary_minmax functions |
| - compiler/types: Add a contains_32bit helper |
| - gallium: add PIPE_CAP_FP16 for AMD_gpu_shader_half_float |
| - glsl: add missing error check for half float varying |
| - nir: allow gather info to handle nir_deref_type_array_wildcard |
| - glsl: support array wildcards in lower named interface blocks |
| - glsl: split var copies before lowering named interfaces |
| - glsl: fix potential crash in expression flattening |
| - glsl: move some lowering to the compiler |
| - glsl_to_nir: merge function param handling |
| - glsl_to_nir: support conversion of struct/array function params |
| - glsl_to_nir: support conversion of struct/array function returns |
| - glsl_to_nir: support conversion of opaque function params |
| - glsl: don't inline functions in glsl ir |
| - nir: add some nir_parameter fields |
| - glsl: add missing define to linker_util.h |
| - glsl: add nir version of function recursion detection |
| - glsl: move function inlining out of glsl_to_nir() |
| - glsl: make use of nir recursion detection |
| - glsl: implement nir version of lower discard flow |
| - glsl: make use of nir lower discard flow |
| - glsl: remove now unused glsl ir lower discard pass |
| - glsl: make an explicitly safe version of visit_exec_list() |
| - glsl_to_nir: never convert instructions after jump |
| - glsl: remove unrequired do_lower_jumps() call |
| - glsl: move invariant builtin validation to the nir linker |
| - nir: add max_array_access data field |
| - nir: add implicit_sized_array data field |
| - glsl: add resize_tes_inputs() to the nir linker |
| - nir: add variable field from_ssbo_unsized_array |
| - glsl: don't remove redefined per vertex block |
| - glsl: add nir implemenation of block validation |
| - glsl: switch to NIR block validation |
| - glsl: call new nir resize_tes_inputs() pass |
| - glsl: remove now unused resize_tes_inputs() |
| - glsl: remove now unused glsl ir block validation |
| - glsl: move some linking calls to gl_nir_link_glsl() |
| - glsl: switch verify_subroutine_associated_funcs() to nir |
| - nir: add subroutine fields to nir_function |
| - glsl: move link_assign_subroutine_types() to the nir linker |
| - glsl: move check_explicit_uniform_locations() to NIR linker |
| - glsl: move mode_string() to helper |
| - glsl: add some data members to nir_variable |
| - glsl: make validate_intrastage_arrays() usable across files |
| - glsl: move cross_validate_uniforms() to the nir linker |
| - glsl: use shader info to store gs verts |
| - glsl: use info from shader when linking |
| - glsl: move validate_{stage}_shader_executable() to the nir linker |
| - glsl: remove now unused do_dead_functions() |
| - glsl: remove FragDepthLayout field |
| - glsl: remove ActiveStreamMask field |
| - glsl: remove UsesEndPrimitive field |
| - glsl: inline _mesa_copy_linked_program_data() |
| |
| Timur Kristóf (121): |
| |
| - radv: Correctly select SDMA support for PRIME blit. |
| - nir: Fix divergence of reductions. |
| - nir: Fix divergence analysis of load_patch_vertices_in. |
| - nir: Cleanup divergence analysis for mesh shaders. |
| - nir: Clean up divergence analysis for TES patch input loads. |
| - aco: Eliminate SCC copies when possible. |
| - radv: Lower mesh shader draw ID to zero when they have a task shader. |
| - radv: Extract input and output stride info to new functions. |
| - radv: Use mapped driver locations for determining I/O strides. |
| - aco: Allow passing constant operand to is_overwritten_since. |
| - radv/llvm: Remove dead code. |
| - radv: Allow NGG culling with LLVM. |
| - compiler: Add helper for counting tess level components. |
| - ac/nir/tess: Always record tess level info and use it at the end. |
| - ac/nir/tess: Don't record mapped tess level location. |
| - ac/nir/tess: Split tess factor write into multiple functions. |
| - ac/nir/tess: Emit tess factor output independently of whether it can be passed by registers. |
| - ac/nir/tess: Refactor how the end of HS is emitted. |
| - aco: Use common helper for counting tess level components. |
| - aco: Use tess factors when TCS jumps to epilog. |
| - radv: Declare tess_lvl_in/out args for TCS epilogs. |
| - radv: Always pass tess factors to epilogs in registers. |
| - radv, aco: Delete now dead TCS epilog code. |
| - nir: Add two new AMD specific tess intrinsics. |
| - radeonsi: Implement new intrinsics for monolithic shaders. |
| - radv: Copy TES primitive mode to TCS info. |
| - radv: Implement new tess intrinsics. |
| - radv: Call nir_opt_dead_cf in radv_optimize_nir_algebraic. |
| - ac/nir/tess: Emit tess factor stores based on new intrinsics. |
| - radv: Completely delete TCS epilogs. |
| - radv, aco: Remove the code that jumped to RADV's TCS epilogs. |
| - ac/llvm, radeonsi: Handle tess_rel_patch_id in common code. |
| - radeonsi: Put HS output count in TCS offchip layout, not patch data offset. |
| - radeonsi: Implement dynamic TCS intrinsics for non-monolithic shaders. |
| - radeonsi: Delete TCS epilogs entirely. |
| - aco: Delete all TCS epilog code. |
| - radeonsi: Add number of VS outputs to TCS output layout. |
| - radeonsi: Remove tess bits from VS state. |
| - radeonsi: Use one more bit for number of patches in TCS offchip layout. |
| - ac/nir/tess: Remove dead code that was meant for epilogs. |
| - radv: Add number of LS and HS outputs to tcs_offchip_layout. |
| - radv: Change input patch size in TCS offchip layout to match RadeonSI. |
| - radv: Change number of patches in TCS offchip layout to match RadeonSI. |
| - radv: Include output patch size in TCS offchip layout. |
| - radv: Reuse TCS offchip layout between TCS and TES. |
| - nir/gather_info: Record per-primitive outputs without variables. |
| - nir: Record per-primitive inputs without variables. |
| - nir/recompute_io_bases: Sort per-primitive PS inputs last. |
| - ac/nir: Introduce ac_nir_calc_io_offset_mapped. |
| - ac/nir/tess: Load tess factors from variable when they are passed in registers. |
| - ac/nir/tess: Clarify when a TCS output is stored in LDS or VRAM. |
| - ac/nir/tess: Return undef when loading an unwritten TCS output. |
| - ac/nir/tess: Map TCS LDS IO locations without gaps. |
| - ac/nir/tess: Calculate reserved LDS outputs based on IO info. |
| - ac/nir/tess: Remove superfluous args for reserved TCS outputs. |
| - ac/nir/tess: Clarify when VS-TCS I/O can use registers. |
| - radv: Only add extra dword to LS-HS stride when there are LS outputs. |
| - radv: Pass key structures to gather intrinsic info. |
| - radv: Extract gather_load_vs_input_info function. |
| - radv: Slightly refactor gather_intrinsic_store_output_info. |
| - radv: Record PS input clip/cull mask instead of number. |
| - radv: Use NIR IO semantics to determine GS output info. |
| - radv: Add helper for determining per-attribute vertex buffer descriptors. |
| - radv: Add helper to determine usage of VS prologs. |
| - radv: Remove unused VS input usage mask. |
| - radv: Use NIR IO semantics to determine VS input info. |
| - radv: Use IO semantic location for shader output info. |
| - aco/optimizer_postRA: Remove a check from SCC no-compare optimization. |
| - radv: Use NIR IO semantics to determine FS input info. |
| - radv: Remove I/O variables after nir_lower_io. |
| - radv: Slightly refactor the determination of max_ps_params. |
| - radv: Increase maximum allowed PS params for enabling NGG culling. |
| - radv: Remove unused gfx_level from gfx10_emit_ge_pc_alloc. |
| - ac/nir/ngg: Don't create dummy output variable for primitive ID. |
| - ac/nir/ngg: Use IO semantics for determining instance rate inputs. |
| - ac/nir/ngg: Rename confusing driver_location variable in mesh shader lowering. |
| - radv: Use NIR IO semantics for VS input location mapping. |
| - radv: Don't set driver locations for mesh shaders. |
| - radv: Don't set driver locations for FS outputs. |
| - radv: Don't set driver locations for last pre-rasterization stage. |
| - radv: Keep track of TCS outputs that need LDS. |
| - radv: Remove dead code for creating per-patch IO mask. |
| - radv: Add radv_gather_unlinked_io_mask to shader info header. |
| - radv: Always use fixed I/O locations for TCS outputs in VRAM. |
| - radv: Clean up gathering linked I/O info. |
| - nir/print: Print per-primitive and explicit strict IO info. |
| - nir/recompute_io_bases: Fix per-primitive inputs. |
| - nir/gather_info: Clear per-primitive I/O masks at the beginning. |
| - nir/lower_io_to_scalar: Support explicit (and per-vertex) FS inputs. |
| - nir/lower_io_to_scalar: Support per-primitive outputs. |
| - nir/opt_varyings: Allow optimizing primitive ID for MS -> FS. |
| - nir/opt_varyings: Support per-primitive I/O. |
| - nir/opt_varyings: Fix explicit and per-vertex FS inputs. |
| - nir/opt_varyings: Add early return when producer stage is task. |
| - nir/opt_varyings: Only propagate constant MS outputs, not other uniforms. |
| - nir/opt_varyings: Debug print during relocate_slot. |
| - nir/opt_varyings: Fix relocate_slot so it doesn't mix up 32-bit and 16-bit I/O. |
| - nir/opt_varyings: Add workaround for RADV mesh shader multiview. |
| - ac/nir/ngg: Remove support for loading mesh shader outputs. |
| - ac/nir/ngg: Refactor MS primitive indices for scalarized IO. |
| - ac/nir/ngg: Slightly refactor mesh shader cull flag stores. |
| - ac/nir/ngg: Use just one IO semantics variable in MS output store. |
| - ac/nir/ngg: Refactor update_ms_output_info. |
| - ac/nir/ngg: Refactor MS output store into two functions. |
| - ac/nir/ngg: Split 16-bit MS output stores by components. |
| - ac/nir/ngg: Enable packing 16-bit mesh shader outputs. |
| - radv: Run DCE before deleting I/O variables. |
| - radv: Only consider interpolated inputs as 16-bit float. |
| - radv: Refactor emitting PS input types. |
| - radv: Remove superfluous bool arg from slot_to_ps_input. |
| - radv: Allow using high 16 bits of PS input slots. |
| - radv: Rename per_vertex_shaded_mask to explicit_strict_shaded_mask. |
| - radv: Rename LDS related variables in get_tcs_num_patches. |
| - radv: Calculate VRAM tess patch size independently of LDS size. |
| - ac/nir/tess: Split I/O mapping to two functions. |
| - ac/nir/tess: Use LDS IO mapping when loading tess levels from LDS. |
| - ac/nir/ngg: Implement packed 16-bit VS/TES outputs in non-dedicated slots. |
| - ac/nir/ngg: Implement packed 16-bit GS outputs in non-dedicated slots. |
| - ac/nir/lower_legacy_vs: Implement packed 16-bit VS/TES outputs in non-dedicated slots. |
| - ac/nir/lower_legacy_gs: Implement packed 16-bit GS outputs in non-dedicated slots. |
| - ac/nir/ngg: Fix packing 16-bit MS outputs. |
| |
| Tomeu Vizoso (20): |
| |
| - ci: disable Igalia farm |
| - gallium/util: Fix pipe_buffer_copy |
| - mesa: Import TensorFlow Lite headers |
| - teflon: Initial commit |
| - etnaviv: Update headers from rnndb |
| - etnaviv: Add a bunch of new params for NPUs |
| - etnaviv: Don't emit boilerplate for compute only contexts |
| - etnaviv: Use NN cores to accelerate convolutions |
| - etnaviv: Use TP cores to accelerate tensor transformations |
| - teflon: Add table with known supported models to docs |
| - etnaviv: Don't init the blitter in compute-only contexts |
| - etnaviv/nn: Implement zero run length encoding of weights |
| - teflon: Enable convolutions with number of output channels not divisible by 8 |
| - etnaviv/nn: Ensure tile_y is > 0 |
| - etnaviv/nn: Fix calculation of remaining out channels |
| - etnaviv/nn: Move unused field to its right place in the struct |
| - etnaviv/nn: Enable image cache |
| - etnaviv/nn: Don't shortcut ZRL bits calculation |
| - etnaviv/nn: Keep track of the sign bit when decrementing to zero |
| - etnaviv/nn: Make parallel jobs disabled by default |
| |
| Tranquillity Codes (1): |
| |
| - intel: Skip ioctls for querying device info when hardware is unsupported |
| |
| Valentine Burley (27): |
| |
| - tu: Promote VK_EXT_index_type_uint8 to KHR |
| - tu: Promote VK_EXT_load_store_op_none to KHR |
| - tu: Promote VK_EXT_line_rasterization to KHR |
| - docs: Update features.txt for anv, nvk and tu |
| - nvk: Enable VK_KHR_shader_subgroup_uniform_control_flow |
| - nvk: Advertise VK_KHR_vertex_attribute_divisor |
| - nvk: Reorder device features |
| - tu: Implement VK_KHR_map_memory2 |
| - tu: Advertise VK_KHR_vertex_attribute_divisor |
| - tu: Reorder device features |
| - nvk: Fix missing implementation of creating images from swapchains |
| - nvk: Expose VK_EXT_display_control |
| - nvk: Expose VK_EXT_surface/swapchain_maintenance1 |
| - nvk: Expose VK_EXT_swapchain_colorspace |
| - docs/features: Add missing VK_EXT_surface/swapchain_maintenance1 entry |
| - tu/rmv: Remove tu_rmv_DebugMarkerSetObjectNameEXT |
| - nvk: Trivially expose three VK_GOOGLE extensions |
| - tu: Expose VK_KHR_surface_protected_capabilities |
| - tu: Trivially expose three VK_GOOGLE extensions |
| - docs: Update features.txt for tu |
| - docs: Update features.txt and new_features.txt for anv and nvk |
| - nvk: Add support for version 2 of all descriptor binding commands |
| - tu: Move tu_BindImageMemory2() to tu_image.cc |
| - tu: Replace TU_HAS_SURFACE with TU_USE_WSI_PLATFORM |
| - tu: Fix missing implementation of creating images from swapchains |
| - tu: Replace TU_FROM_HANDLE with VK_FROM_HANDLE |
| - drm-shim: Stub syncobj reset ioctl |
| |
| Vasily Khoruzhick (4): |
| |
| - lima: ppir: always use vec4 for output register |
| - lima: ppir: use dummy program if FS has empty body |
| - lima: gpir: abort compilation if load_uniform instrinsic src isn't const |
| - lima: update expected CI failures |
| |
| Vignesh Raman (5): |
| |
| - ci: Add kmod |
| - ci: disable Collabora's farm due to maintenance |
| - Split debian-build-testing job |
| - ci: Implement support for replaying ANGLE restricted traces |
| - ci: handle missing dri libraries during listing |
| |
| Vinson Lee (2): |
| |
| - intel/disasm: Remove duplicate variable reg_file |
| - intel/clc: Fix file descriptor leak |
| |
| Visan, Tiberiu (1): |
| |
| - amd/vpelib: revert SRGB to 709 |
| |
| Vlad Schiller (2): |
| |
| - pvr: Implement VK_EXT_memory_budget |
| - pvr: Implement VK_KHR_index_type_uint8 |
| |
| Yifan Zhang (2): |
| |
| - amd: Add code to enable gfx11.5.1 |
| - radv: initialize video decoder for GFX11.5.1 |
| |
| Yiwei Zhang (105): |
| |
| - venus: avoid redundant layout transition for optimal internal layout |
| - venus: populate oom from ring submit alloc failures |
| - vulkan/wsi/wayland: fix returns and avoid leaks for failed swapchain |
| - venus: ensure object id is unique |
| - venus: fix pipeline layout lifetime |
| - venus: drop some redundant comment |
| - venus: fix pipeline derivatives |
| - venus: fix to respect the final pipeline layout |
| - venus: allow tls ring submission to utilize the entire ring shmem |
| - venus: default to enable GPL |
| - venus: force async pipeline create on threads creating descriptor pools |
| - venus: use obj handle instead of id in device memory report |
| - anv: refactor wsi_memory_allocate_info handling |
| - anv: optimize the implicit fencing support of external memory |
| - anv: extend implicit fencing support for case requiring implicit write |
| - vulkan/util: drop redundant code gen from vk_extensions_gen.py |
| - vulkan/runtime: refactor to use DETECT_OS_ANDROID instead of ANDROID |
| - v3dv: refactor to use DETECT_OS_ANDROID instead of ANDROID |
| - venus: refactor to use DETECT_OS_ANDROID instead of ANDROID |
| - hasvk: refactor to use DETECT_OS_ANDROID instead of ANDROID |
| - anv: refactor to use DETECT_OS_ANDROID instead of ANDROID |
| - radv: refactor to use DETECT_OS_ANDROID instead of ANDROID |
| - turnip: refactor to use DETECT_OS_ANDROID instead of ANDROID |
| - egl: refactor to use DETECT_OS_ANDROID instead of ANDROID |
| - gallium: refactor to use DETECT_OS_ANDROID |
| - util: refactor to use DETECT_OS_ANDROID |
| - meson: drop -DANDROID |
| - venus: update tracepoints to align with later optimizations |
| - venus: fix the cmd stride used for qfb recording |
| - venus: rewrite fence feedback interception to minimize batches |
| - venus: refactor to add vn_cached_storage |
| - venus: use vn_cached_storage for vn_queue_submission allocs |
| - venus: misc cleanups for queue submission |
| - venus: simplify feedback types tracking during submission |
| - venus: massive feedback renamings for consistency and clarity |
| - venus: refactor to add vn_queue_submission_setup_batch |
| - venus: simplify to drop the struct vn_feedback_cmds accessor |
| - venus: refactor semaphore feedback |
| - venus: add vn_set_temp_cmd helper to initialize feedback batch cmd |
| - venus: fix to ensure sfb cmds can get recycled |
| - venus: mandate a few venus capsets long required before 1.0 |
| - venus: sync protocol for VK_KHR_fragment_shading_rate |
| - venus: add VK_KHR_fragment_shading_rate |
| - vulkan: fix runtime libraries' dep against generated headers |
| - venus: fix ffb batch prepare for a corner case and avoid a memcpy UB |
| - vulkan: remove unused wsi_common_entrypoints include and dep |
| - vulkan: properly ensure wsi_entrypoints header gen order |
| - vulkan: remove header files from lib source files |
| - vulkan: refactor the runtime header gen order dependency |
| - anv/hasvk: default image_read_without_format to true |
| - venus: qfb to track cmd handle directly |
| - venus: combine query record and reset |
| - venus: massive qfb renamings |
| - venus: minor cmd count related refactors |
| - venus: drop vn_get_temp_cmd_ptr |
| - venus: simplify vn_cmd_reset and apply more code sharing |
| - venus: refactor query record recycle |
| - venus: rewrite qfb vn_feedback helpers |
| - venus: refactor vn_queue_submission_add_query_feedback |
| - venus: add vn_queue_submission_get_resolved_query_records |
| - venus: optimize to further batch query records |
| - venus: roundtrip now belongs to ring |
| - venus: minor naming cleanups |
| - venus: ensure shmem is attached to renderer before use for guest vram |
| - venus: avoid excessive ring notifications |
| - venus: further reduce idle timeout from 5ms to 1ms |
| - venus: add enum vn_relax_reason |
| - venus: avoid constant busy wait for query result waiting |
| - venus: deprecate unused perf env vars |
| - venus: decorate cmd enqueue macro internals with compiler hints |
| - venus: add a more relaxed polling strategy |
| - venus: cleanup 2 TODOs from 1.3 support |
| - venus: remove obsolete TODOs |
| - venus: use STACK_ARRAY to simplify modifier query |
| - venus: use STACK_ARRAY to simplify BindBufferMemory2 |
| - venus: use STACK_ARRAY to simplify BindImageMemory2 |
| - venus: use STACK_ARRAY to simplify render pass creation |
| - venus: use STACK_ARRAY to simplify physical device enumeration |
| - venus: use STACK_ARRAY to simplify set layout creation |
| - venus: use STACK_ARRAY to simplify sync wait |
| - venus: rely on enum vn_descriptor_type for internal trackings |
| - venus: move async_set_allocation check outside helpers |
| - venus: set alloc to skip earlier for reserved and invalid bindings |
| - venus: optimize mutable state restore |
| - venus: misc set alloc and cleanup refactors |
| - venus: drop vn_should_sanitize_descriptor_set_writes |
| - venus: refactor descriptor set update and push |
| - venus: use STACK_ARRAY to simplify descriptor set update and push |
| - venus: use more relaxed profile for TLS ring seqno wait |
| - venus: avoid the redundant template entry |
| - venus: fix to drop an extra ; |
| - venus: simplify push descriptor update with template |
| - venus: optimize set update template data population |
| - venus: simplify need and ignore rules for desc image info |
| - venus: use STACK_ARRAY to simplify set template update and push |
| - venus: clean up legacy descriptor update template bits |
| - venus: fix swapchain image memory bind |
| - venus: fix VkDeviceGroupSubmitInfo::deviceMask for feedback cmds |
| - venus: avoid client allocators for ring internals |
| - venus: fix to destroy all pipeline handles on early error paths |
| - turnip: msm: clean up iova on error path |
| - turnip: msm: fix racy gem close for re-imported dma-buf |
| - turnip: virtio: fix error path in virtio_bo_init |
| - turnip: virtio: fix iova leak upon found already imported dmabuf |
| - turnip: virtio: fix racy gem close for re-imported dma-buf |
| |
| Yogesh Mohan Marimuthu (6): |
| |
| - winsys/amdgpu: sws instead of ws for amdgpu_screen_winsys |
| - winsys/amdgpu: rws instead of ws for radeon_winsys |
| - winsys/amdgpu: aws instead of ws for amdgpu_winsys |
| - winsys/amdgpu: use _destroy_locked() for failure to create winsys |
| - winsys/amdgpu: remove tab space |
| - winsys/amdgpu: add more comments for winsys create in header file |
| |
| Yonggang Luo (37): |
| |
| - util: Add function util_is_power_of_two_nonzero_uintptr and macro IS_POT_NONZERO |
| - asahi,panfrost: Use IS_POT_NONZERO to replace util_is_power_of_two_nonzero for different size |
| - treewide: Use util_is_power_of_two_nonzero{64|_uintptr} when needed |
| - svga: Cleanup duplicate ALIGN macro defines |
| - nouveau: Use align64 instead of ALIGN over input layer_size_B |
| - treewide: Use align64 instead of ALIGN for 64 bit value parameter |
| - util: Update ALIGN prototype to match align |
| - compiler/spirv: The spirv shader is binary, should write in binary mode |
| - compiler/spirv: There is not need unqualify const in function vtn_string_literal |
| - compiler/spirv: vtn_add_printf_string support for handling OpBitcast |
| - zink: Update zink-anv-tgl flakes |
| - treewide: Remove vulkan/runtime vulkan/util prefix in include path |
| - freedreno/vulkan: Use vk_dynamic_graphics_state_init instead of direct assignment |
| - vulkan/runtime: Mark vk_default_dynamic_graphics_state to be private |
| - Revert "meson/vulkan/util: allow venus to drop compiler deps" |
| - vulkan: allow building venus without libcompiler |
| - glx: Remove DEBUG code in xfont.c |
| - panfrost/shared: avoid use gallium helper in pan_minmax_cache.* |
| - panfrost/meson: remove redundant gallium include from meson files |
| - treewide: Replace the invalid usage #if DEBUG with #ifdef DEBUG |
| - util: Cleanup strtod.(h|c) by introduce _mesa_get_locale |
| - meson: Extract with_mesa_debug and with_mesa_ndebug for latter usage |
| - meson: Define MESA_DEBUG for latter usage |
| - treewide: Replace usage of macro DEBUG with MESA_DEBUG when possible |
| - meson: Remove the non-used -DDEBUG manually |
| - intel/meson: Remove redundant inc_gallium |
| - radv: Remove redundant inc_gallium |
| - radv: Rename src/amd/vulkan/vk_format.h to src/amd/vulkan/radv_formats.h |
| - vulkan: Move vk_format_is_alpha and vk_format_is_alpha_on_msb into vk_format.h from pvr |
| - pvr: inline and remove vk_format_get_channel_width |
| - pvr: Merge imagination/vulkan/vk_format.h into imagination/vulkan/pvr_formats.h |
| - pvr: Add pvr\_ prefix for vk_format_* functions in pvr_formats.h |
| - util: Fixes futex_wait on win32 |
| - util: futex_wait use TIME_MONOTONIC on win32 for consistence with other platform |
| - util: Turn futex_wake parameter to int32_t for consistence across platforms |
| - broadcom/common: Now "util/box.h" is under src, so remove the FIXME |
| - nouveau: Fixes error: unused import: \`crate::nvh_classes_cl906f::\*` |
| |
| Yusuf Khan (4): |
| |
| - nvk: remove some dead code files |
| - nvk: fix valve segfault from setting a descriptor set from NULL |
| - crocus: fix potential null pointer dereference if transfer_mapping fails |
| - nouveau: Fix crash when destination or source screen fences are null |
| |
| Zack Rusin (1): |
| |
| - svga: Fix instanced draw detection |
| |
| Zan Dobersek (11): |
| |
| - freedreno: add fd_rd_output facilities for gzip-compressed RD dumps |
| - tu/msm: fix RD_CHIP_ID size used when dumping RD |
| - tu: tu_device should clean up its global bo |
| - vulkan/rmv: enable logging miscellaneous internal resources |
| - tu: add RMV support |
| - freedreno/fdl: avoid overflow in layout size computations |
| - tu: fix memory leaks in tu_shader |
| - fd: enable prefixing the RD output filename |
| - tu/autotune: use SAMPLE_COUNT_END_OFFSET when writing the ending sample count |
| - tu: RB_SAMPLE_COUNT_ADDR is also used on a7xx |
| - tu/query: improve CP_EVENT_WRITE7::ZPASS_DONE usage |
| |
| antonino (1): |
| |
| - zink: plug leak in \`zink_create_quads_emulation_gs` |
| |
| chyyran (1): |
| |
| - util/format/fxt1: include "u_format_pack.h" instead of "util/format/u_format_pack.h" |
| |
| daoxiang.gong (1): |
| |
| - zink - Fix for minLod and maxLod when mipmap filter is disabled |
| |
| duncan.hopkins (19): |
| |
| - compiler/clc: fix compiler issue on MacOS with st_mtim[e] in stat. |
| - egl: MacOS platform guard around pthread_condattr_setclock() |
| - egl: Added DRI3 code guards. |
| - egl: Changed EGLNativeDisplayType size check to make sure it is big enough instead of exactly the same size. |
| - gallium/dri: Switch xf86drm.h for util/libdrm.h to allow for the no-op shim to be used. |
| - gallium/dri: Added XCB dependency to frontends/dri/libdrm build. Fix header issues with xcb.h being used. |
| - util: Updated util/libdrm.h stubs with drmGetMagic() |
| - dri: guarded DRI code. |
| - glx: Switched DRI2 functions over to use Apple specific alternatives and extension name. |
| - meson: relaxed some meson restrictions on MacOS/Apple allowing for wider build support. |
| - apple: Extended Apple feature support using GLX_USE_APPLE. |
| - apple: Meson defines GLX_USE_APPLE to allow for Gallium drivers to work on MacOS. |
| - zink: Fixed header location and compiling issue with [[deprecated]] from newer MoltenVK versions. |
| - zink: use portability EXT on Apple. |
| - zink: stopped the use of VkFormatProperties3 if the reported API is less than 1.3 or VK_KHR_format_feature_flags2 not present. |
| - zink: removed \`MESA_PRIM_QUADS` from the supported \`PIPE_CAP_SUPPORTED_PRIM_MODES`. |
| - zink: Avoid issues when kopper tries using XCB WSI on Apple. |
| - zink/apple: added \`moltenvk-dir` search to allow MoltenVK to be sourced from brew. |
| - zink/apple: update docs to reflect the current status of Zink on macOS. |
| |
| nyanmisaka (2): |
| |
| - frontends/va: Report vendor and device ID through VADisplayPCIID |
| - radeonsi/uvd_enc: update to use correct padding size |
| |
| qbojj (1): |
| |
| - vulkan: Fix calculation of flags in vk_graphics_pipeline_state_fill |
| |
| thfrwn (1): |
| |
| - mesa: fix off-by-one for newblock allocation in dlist_alloc |