| Mesa 25.2.0 Release Notes / 2025-08-06 |
| ====================================== |
| |
| Mesa 25.2.0 is a new development release. People who are concerned |
| with stability and reliability should stick with a previous release or |
| wait for Mesa 25.2.1. |
| |
| Mesa 25.2.0 implements the OpenGL 4.6 API, but the version reported by |
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / |
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. |
| Some drivers don't support all the features required in OpenGL 4.6. OpenGL |
| 4.6 is **only** available if requested at context creation. |
| Compatibility contexts may report a lower version depending on each driver. |
| |
| Mesa 25.2.0 implements the Vulkan 1.4 API, but the version reported by |
| the apiVersion property of the VkPhysicalDeviceProperties struct |
| depends on the particular driver being used. |
| |
| SHA checksums |
| ------------- |
| |
| :: |
| |
| SHA256: 7c726b21c074d14d31d253d638b741422f3c0a497ce7f1b4aaaa917d10bd8d4f mesa-25.2.0.tar.xz |
| SHA512: f36e4f9f619becfc89f9cf3704d21f5f1532f1e658808fd30c0edce8c28cd65dc6578a21333e896aa5eaccf09da99efd9127a8d3d46681e8360f618cf63bab94 mesa-25.2.0.tar.xz |
| |
| |
| New features |
| ------------ |
| |
| - removed clover frontend |
| - VK_EXT_vertex_input_dynamic_state/vertexInputDynamicState on panvk |
| - pushDescriptor on panvk |
| - VK_EXT_vertex_input_dynamic_state on panvk |
| - VK_EXT_vertex_attribute_divisor on panvk |
| - supportsNonZeroFirstInstance on panvk |
| - GL_ARB_blend_func_extended on v3d |
| - dualSrcBlend on v3dv |
| - VK_KHR_maintenance4 on panvk/v10+ |
| - VK_KHR_maintenance5 on panvk/v10+ |
| - VK_EXT_direct_mode_display on panvk |
| - VK_EXT_extended_dynamic_state[2] on panvk |
| - Vulkan 1.2 on panvk/v10+ |
| - VK_KHR_shader_quad_control on panvk/v10+ |
| - multiDrawIndirect on panvk/v10+ |
| - VK_KHR_draw_indirect_count on panvk/v10+ |
| - VK_KHR_shader_integer_dot_product on panvk |
| - VK_KHR_shader_terminate_invocation on panvk |
| - VK_EXT_shader_demote_to_helper_invocation on panvk |
| - VK_EXT_shader_replicated_composites on panvk |
| - VK_EXT_depth_bias_control on panvk |
| - VK_KHR_shader_bfloat16 on anv/gfx125+ and radv/gfx12+ |
| - VK_KHR_robustness2 on RADV |
| - VK_EXT_zero_initialize_device_memory on RADV and NVK |
| - VK_EXT_primitive_topology_list_restart on panvk |
| - VK_EXT_image_2d_view_of_3d on panvk |
| - VK_EXT_texel_buffer_alignment on panvk |
| - cl_khr_kernel_clock on freedreno, iris, llvmpipe, nvc0, panfrost, radeonsi and zink with llvm-19 or newer |
| - GL_KHR_texture_compression_astc_hdr on panfrost and asahi |
| - cl_ext_buffer_device_address on iris, llvmpipe, radeonsi and zink |
| - Completed OpenCL 2.0 coarse grain buffer SVM support for iris and radeonsi |
| - VK_EXT_shader_subgroup_ballot on panvk |
| - VK_EXT_shader_subgroup_vote on panvk |
| - Vulkan video support on GFX12 (RDNA4) for RADV |
| - VK_KHR_load_store_op_none on panvk |
| - VK_EXT_load_store_op_none on panvk |
| - VK_EXT_scalar_block_layout on radv/gfx6 |
| - VK_EXT_inline_uniform_block on panvk |
| - cl_khr_fp16 on asahi, freedreno, llvmpipe, panfrost, radeonsi and zink |
| - GL_ARB_shader_clock on panfrost/v6+ |
| - VK_EXT_texture_compression_astc_hdr on panvk |
| - MSAA with 8 and 16 sample counts on panfrost |
| - cl_khr_spirv_queries |
| - VK_EXT_depth_clamp_zero_one on panvk |
| - VK_KHR_depth_clamp_zero_one on panvk |
| - VK_EXT_depth_clip_control on panvk |
| - VK_KHR_maintenance9 on RADV |
| - VK_KHR_unified_image_layouts on RADV (RDNA3+), tu and panvk |
| - VK_KHR_vulkan_memory_model on panvk |
| - Vulkan 1.3 on panvk/v10+ |
| - VK_KHR_present_wait2 |
| - VK_EXT_descriptor_indexing on panvk/v10+ |
| - cl_khr_priority_hints |
| - VK_KHR_maintenance6 on panvk/v10+ |
| - CL_sRGBA and CL_sBGRA images |
| - cl_khr_extended_bit_ops |
| - VK_KHR_shader_clock on panvk |
| - GL_EXT_shader_realtime_clock on panfrost/v6+ |
| - VK_KHR_shader_float8 on RADV (RDNA4+) |
| - GL_EXT_window_rectangles on r600 |
| - GL_EXT_shader_image_load_store on r600/evergreen+ |
| - GL_EXT_shader_clock on all drivers that support GL_ARB_shader_clock |
| - CL_UNORM_INT_101010_2 and cl_ext_image_unorm_int_2_101010 |
| - timestamps on panvk/v10+ |
| - VK_KHR_calibrated_timestamps on panvk/v10+ |
| - VK_EXT_non_seamless_cube_map on panvk |
| - VK_ARM_shader_core_properties on panvk/v10+ |
| - removed X11 DRI2 support |
| - removed EGL_MESA_drm_image |
| - removed pre-dmabuf wl_drm support |
| - deprecated EGL_WL_bind_wayland_display |
| - VK_KHR_shader_atomic_int64 on panvk/v10+ |
| - VK_EXT_host_image_copy on RADV (RDNA1+) |
| - VK_KHR_cooperative_matrix on nvk/turing+ |
| - VK_KHR_host_image_copy on panvk |
| - Vulkan 1.4 on panvk/v10+ |
| |
| |
| Bug fixes |
| --------- |
| |
| - [RADV] Graphical glitches in Ghost of Tsushima on Polaris |
| - radv: RT regressions |
| - macOS: use of undeclared identifier 'free_zombie_glx_drawable' |
| - macOS: Undefined symbols "_pipe_loader_drm_probe_fd", referenced from: _dri_get_drm_device_info in libdri.a[7](dri_util.c.o) |
| - Segfault when activating DPMS on i915 hardware |
| - nvk Blackwell support |
| - radv: regression: commit a7291074c800 break lighting in Like a Dragon: Infinite Wealth |
| - [bisected regression] Latest mesa-git keeps /dev/dri/renderD129 always open with new applications even though they don't use it at all |
| - High GPU usage when using Zink for eglgears_x11 (on X11) |
| - Segfault in X11 image acquire code with timeout=0 |
| - Race condition with timeline semaphores |
| - Crash from iris_set_sampler_views in chromium/chrome with accelerated video decoding |
| - rusticl: aco: LLVM outperforms ACO in clpeak for \`short` benchmarks on hawaii |
| - rusticl: aco: Performance regression in clpeak for char benchmarks on hawaii |
| - zink on tu assertion failing doing shader-db runs. |
| - nir_algebraic silently ignores operand conditions in some cases |
| - lavapipe: valgrind triggers errors with CTS unit tests when creating a vulkan device |
| - radv: gfx12 RGP captures don't support instruction timings for graphics pipelines |
| - radv: more glcts fails KHR-GL46.shading_language_420pack.initializer_list_initializer* |
| - radv: regression in KHR-GL46.gpu_shader5_gl.float_encoding |
| - !36097 breaks Xwayland (& others) |
| - radeonsi: Broken VAAPI video color conversion |
| - Gallium HUD broken since !34054 |
| - RADV indirect buffer size limitation |
| - v3dv: regression in vkAllocateMemory importing gbm bo |
| - etnaviv: instruction count regression due to 2193ae0253e5 (etnaviv: compiler: Call nir_lower_alu_width(..)) |
| - etnaviv: instruction count regression due to 2193ae0253e5 (etnaviv: compiler: Call nir_lower_alu_width(..)) |
| - Vulkan WSI (and zink) use threads on X11 even when the X connection isn't thread-safe |
| - agx: Crashes in BO management in CEF/Chromium when resizing browser surface |
| - RADV/NAVI31: spec\@glsl-1.50\@gs-max-output,Crash - Assertion \`max_esverts >= max_verts_per_prim && max_gsprims >= 1' |
| - mediafoundation: DX12 encoder fails at D3D12 OpenSharedHandle with E_INVALIDARG |
| - [regression][bisected] [FirePro W4100]: crashing/rebooting |
| - zink: regression with spec\@arb_indirect_parameters\@conditional-render and RADV |
| - Descriptor set layout with binding flags fails due to indices not matching bindings |
| - piglit bindless texture tests crash |
| - ir3,turnip: Think about better heuristics of THREAD64 vs THREAD128 for fragment shader |
| - regression - e0cdf4df causes much lower pp512 on llama.cpp |
| - [radeonsi] Artifacts in Team Fortress 2 (bisected) |
| - eglgears_wayland segfault on zink+nvk with PRIME |
| - vn_renderer_virtgpu.c:13:10: fatal error: 'xf86drm.h' file not found |
| - error[E0425]: cannot find value \`MAP_DROPPABLE` in module \`linux_raw_sys::general` |
| - brw: mad instruction printing broken on Gfx11 |
| - Add something simiar to draw_bkp for compute dispatch |
| - Add something simiar to draw_bkp for compute dispatch |
| - broadcom: use VARYING_BIT_* macros more |
| - radv: RGB9E5 rendering does not ignore alpha write mask |
| - Confidential issue #13431 |
| - Confidential issue #13432 |
| - dzn: dzn_meta_init may return VK_SUCCESS when failing, leading to application crashes |
| - ci: artifacts have weird filenames |
| - anv: Add support for Float16 as cooperative matrix destination |
| - brw_nir_lower_cmat produces invalid NIR for OpVariable initializers |
| - src/asahi/lib/meson.build:65:52: ERROR: Unknown variable "inc_virtio_vdrm". |
| - turnip: loadOp must be deferred |
| - hasvk_tests: ../src/vulkan/runtime/vk_log.c:40: vk_object_to_device: Assertion \`obj->device' failed. |
| - radv: android: Why \`VkNativeBufferANDROID::handle::numFds` must be 1 in function \`radv_image_from_gralloc` |
| - Single-slice fallback in st_CompressedTexSubImage doesn't work |
| - Regression: Mesa 25.1.1 causes ground texture flickering in DOTA 2 |
| - GPU process crash via WebGPU shader - wild-deref in Mesa try_opt_exclusive_scan_to_inclusive |
| - rusticl: Meson: \`rust.bindgen()` does not pass edition before 1.7.0 resulting in build failures on some configurations. |
| - vkGetPhysicalDeviceImageFormatProperties2 not correctly implemented for VK_FORMAT_G8_B8R8_2PLANE_420_UNORM + VK_IMAGE_CREATE_EXTENDED_USAGE_BIT |
| - ci_run_n_monitor broken |
| - RADV regression: make a parameter to specify preferred memory heap |
| - anv, bmg: Striped pattern on ground in Heroes of Valor |
| - New Mesa drivers failing to launch some flatpak apps |
| - PanVK: Vkmark - CS_FAULT.EXCEPTIONS when running specifically vkmark -b clear |
| - [ANV][LNL/BMG] - The Last of Us™ Part II Remastered (2531310) - Multicolored dots present on some objects |
| - Oddworld: Strangers Wrath bad shading on NPC chickens |
| - RADV: Unimplemented instrinsic instr when creating a pipeline with a task shader |
| - FTBFS LLVM21 CLC error: no matching function for call to ‘clang::TextDiagnosticPrinter |
| - ci: update GL CTS |
| - mesa:freedreno / cffdump-shadow failure |
| - [anv] [feature request] support for emulated shaderStorageImageReadWithoutFormat |
| - v3d crashes on Raspberry Pi 5 if no monitor connected |
| - aco/gfx12: don't insert waitcnts between d16 loads that write to separate register halves |
| - [mediafoundation] [rfc] Ideas for single pass build compatibility |
| - anv: Support fast clears in anv_CmdClearDepthStencilImage |
| - 25.1.2 - instant crash on upgrade, very first attempt at a new surface |
| - [ANV][LNL] - Sid Meier's Civilization VII (1295660) - Linux version hangs when starting the "Graphics Benchmark". |
| - Steam game recording on Intel ANV resulting in green corrupted image due to bug with VK_FORMAT_G8_B8R8_2PLANE_420_UNORM rendering |
| - Confidential issue #13281 |
| - anv, regression: Missing terrain in It Takes Two on BMG |
| - "breaking-limit" benchmark will Freeze before starting. |
| - rusticl: fails to build on non-Linux |
| - rusticl: Assertion failed for ACO and stuck for LLVM (radeonsi) |
| - rusticl: CL_MEM_HOST_READ_ONLY cl_image2d reads zeroes from host on AMD Vega8 |
| - 9070 XT: Missing VK_KHR_video_decode_queue |
| - Bug (bad code optimization?) in the GLSL (compute) shader compiler |
| - nvk: Broken lighting in Trine 5 |
| - RX9070 hard crash with Mafia Definitive Edition |
| - [RADV] mesa-git: GPU hangs in Oblivion Remaster |
| - [Intel Arc a770] Sky: Children of the Light, meshes are visible around text and spirits |
| - RADV: Potential bug with vulkan fragment shader interpolation (on outputs from mesh shaders?) |
| - In the game "Foundation" a buildings areas of effect is missing |
| - ANV: Dota 2 May 22 2025 update crashing in vkCmdBindDescriptorSets with no validation error |
| - RustiCL / fdo: infinite loop in compiler on astype OpenCL test |
| - Vulkan Video engages during playback of format which is not supported by my Fiji GPU |
| - radv/video: naluSliceSegmentEntryCount is ignored |
| - Weston hangs on virtio (non-accelerated version) on QEMU with Mesa main |
| - KVM/qemu: GDM fails to start / gnome-shell crashes after update to mesa-dri-drivers-25.0.3 |
| - radv: Copying from non-resident sparse buffer hangs on GFX9 |
| - Follow-up from "ci: Use test-* containers as the base for rootfs on LAVA & baremetal" |
| - DRI_PRIME env var on EGL Surfaceless platform is ineffective |
| - radv: VK_EXT_primitive_topology_list_restart bug |
| - [RADV][GFX9] Recent Mesa-git broken on AMD Vega 64 with ring sdma0 timeouts when launching DXVK games |
| - ACO: IR Validation error "SDWA operand selection size" triggered by compute shader on VEGA20 |
| - RADV: Gibberish output with llama.cpp (Vulkan compute) on Radeon VII (Vega20) with Mesa 25.1.0, works on 25.0.5 |
| - main fails to build with missing defines in d3d12 video code |
| - Flickering in Control after mesa-git-25.2.0_devel.205323.7c4f501e99d.d41d8cd |
| - regression with mr 34601 on the witcher 3 |
| - Blending broken in game SkinDeep |
| - Radeon R5 (Mullins) H264 VA-API encoding acceleration doesn't work |
| - ci/debian: arm64 does not respect \`DEBIAN_BUILD_TAG` |
| - nvk: lib_rs_gen.py requires \`rustfmt` |
| - radv: vkd3d-proton test failure with predication + EXT_dgc |
| - nvk: Implement VK_EXT_zero_initialize_device_memory |
| - mesa-25.0.4 aborts Xserver due to ACO "Unsupported opcode" v_lshlrev_b16 |
| - In SkinDeep, GL_LINES causes GL_INVALID_OPERATION with radeonsi and llvmpipe |
| - radv: UB and artifacts when copying a \`COMBINED_IMAGE_SAMPLER` with an immutable sampler |
| - RADV: Dynamic state multiple viewport corruption |
| - intel: Improve color fast-clear handling |
| - Follow-up from "intel/isl: Add and use a multi-engine surf usage bit" |
| - [drm:amdgpu_uvd_cs_pass2 [amdgpu]] \*ERROR* )Handle 0x48780001 already in use! |
| - glGetInternalformativ returns incorrect information for GL_STENCIL_INDEX8 |
| - RadeonSI: Psychonauts rendering regression since !29895 |
| - [r600g] Rejected CS when using dolphin's GPU texture decoder |
| - radeonsi: Assertion \`src_bit_size == bit_size' failed. when running without MESA_GLSL_DISABLE_IO_OPT=1 |
| - radeonsi vdpau + Packed YUY2 = assert |
| - Indiana Jones and The Great Circle, Graphical corruption on 9070 XT. |
| - glPushAttrib/glPopAttrib broken with glColorMaterial and ligthing |
| - FTBFS (LLVM 21): ../src/compiler/clc/clc_helpers.cpp:867:60: error: ‘std::shared_ptr<clang::TargetOptions> clang::CompilerInvocationBase::TargetOpts’ is protected within this context |
| - RADV:RX 9070:Mesa-25.0.5 GTA 5 Enhanced GPU HANG |
| - [Feature request] Add support for VK_EXT_map_memory_placed vulkan extension on hk |
| - [ANV/DG2] nvpro-samples/vk_raytracing_tutorial_KHR/ray_tracing_reflections renders not all reflections |
| - venus: Missing VK_KHR_swapchain |
| - radv: Flickering in Kingdom Come: Deliverance II |
| - RADV regression causes severe glitches in Hunt Showdown 1896 on Polaris |
| - Z-Fighting in Tomb Raider IV - VI Remastered Linux |
| - [anv] VK_ERROR_DEVICE_LOST on Linux 6.13.8 while playing Dota 2 on Intel Graphics |
| - Request for coop_matrix and bfloat16 (ML/AI related) VK exts and VK_KHR_shader_clock for Asahi HoneyKrisp driver (if/when possible/supported in HW) |
| - Variable Rate Shading (VRS) produces very aliased results on RADV with an AMD gpu |
| - WWE 2k23 small "artifacts" |
| - RadeonSI - ACO does not spill enough SGPRs despite low VGPR pressure in some scenarios |
| - [ANV][LNL] - Jusant (1977170) - Game crashes with an Unreal Engine error at launch |
| - RADV: Performance Regression (~18%) on Vega 64 (GFX9) in Cyberpunk 2077 caused by commit 08918f08805f (MR 34361) |
| - [macOS] glapi_gentable.c:46:10: fatal error: 'glapitable.h' file not found |
| - radeonsi: CL conformance test \`vector_swizzle` fails since 177427877bb50ad7ba24abfa13e55a2684d804df |
| - commit 90faadae regression - Warhammer 40,000: Space Marine 2 crashing |
| - RADV: GPU hangs always at certain places in the game "Sniper Elite: Resistance" |
| - Random mesa crashes in kwin_wayland on a 6600XT |
| - Patch to fix clinfo on rusticl |
| - radv/aco: Ghost of Tsushima hangs and causes gpu resets on RDNA 3 GPU |
| - mesa-vulkan-driver-git.x86_64 causes strange colored rectangle artifacts in Final Fantasy XIV |
| - NVK - Out of video memory error when starting Marvel Rivals |
| - NVK - Out of video memory error when starting Marvel Rivals |
| - radv/rt ray tracing performance regression in control found in mesa git |
| - NAK: copy-prop is failing to fold fabs into fadd |
| |
| |
| Changes |
| ------- |
| |
| Aaron Ruby (1): |
| |
| - gfxstream: Prune the wait/signal semaphore lists on queue submissions |
| |
| Adam Jackson (12): |
| |
| - loader: Use RTLD_LOCAL not RTLD_GLOBAL |
| - glx: Don't try to dlopen ourselves |
| - util: Remove unused USED macro |
| - util/cpu: Remove util_cpu_caps_t::family |
| - util/cpu: Teach the Linux code about getauxval() |
| - rusticl: Add RUSTICL_FEATURES=intel |
| - rusticl: Enable cl_intel_subgroups if you ask nicely |
| - vtn: Handle SPV_KHR_uniform_group_instructions |
| - vtn: (Silently) handle FunctionParameterAttributeNo{Capture,Write} |
| - vtn/opencl: Handle OpenCLstd_F{Min,Max}_common |
| - rusticl: Implement cl_intel_required_subgroup_size |
| - meson: Require LLVM 8 or newer |
| |
| Aditya Swarup (2): |
| |
| - anv: Disable fast clear when surface width is 16k |
| - iris: Disable fast clear when surface width is 16k |
| |
| Adrián Larumbe (12): |
| |
| - pan/kmod: Add BO-labeling kernel module operation |
| - drm-uapi: Sync panthor_drm.h |
| - drm-uapi: Sync panfrost_drm.h |
| - pan/kmod: Add Panthor BO Labeling IOCTL support |
| - pan/kmod: Add Panfrost BO Labeling IOCTL support |
| - panfrost: Fix Panfrost BO leak in error handling path |
| - panfrost: Label kernel BO's for newly created objects |
| - panfrost: Flag resources with imported BOs |
| - panfrost: Expand BO label for panfrost resources |
| - mesa: Implement label sharing from GL objects with UM drivers |
| - panfrost: Implement pipe screen's resource_label interface |
| - panfrost: Add missing util_range_init() |
| |
| Agate, Jesse (1): |
| |
| - amd/vpelib: Add macro to perform ceil division |
| |
| Albert Liu (1): |
| |
| - gfxstream: Add \`VK_EXT_memory_budget` to allowlists. (mesa3d side) |
| |
| Alejandro Piñeiro (2): |
| |
| - v3d/compiler: properly handle the RA debug option |
| - nir/lower_clip: update comment |
| |
| Aleksi Sapon (3): |
| |
| - lp: fix gnu-empty-initializer warning |
| - draw: fix prim_info.start for linear_run_elts |
| - draw: remove unused prim_flags from run_linear_elts |
| |
| Alessandro Astone (3): |
| |
| - android: Link with libc++fs up to SDK 35 |
| - asahi: Do not require fopencookie on Android |
| - asahi: Avoid AND-ing with string literal in static_assert |
| |
| Alyssa Rosenzweig (176): |
| |
| - hk: fix underbinding scratch |
| - hk: fix tessellation + clipper queries |
| - hk: fix null FS corner cases |
| - hk: drop FS null checks |
| - hk: reindent/unscope |
| - hk: fill sparse.write with nonzero values |
| - ail: generalize ail_space_bits |
| - ail: support twiddled |
| - asahi: pack sample count in s/w texture descriptor |
| - asahi: extend tile width/height in texture desc |
| - asahi: generalize tiling checks |
| - asahi: generalize compression check |
| - asahi: identify ZLS tiling bits |
| - asahi: plumb ZLS tiling bits |
| - hk: plumb ZLS tiling |
| - asahi: identify ZLS compress load/store bits |
| - asahi,hk: factor out zls_control pack helper |
| - asahi: let booleans be your guide |
| - hk: handle HIC with twiddled |
| - agx: model sources as late-kill in demand calcs |
| - agx: late-kill sources |
| - agx: early-kill sources only if it won't shuffle |
| - agx: remove silly cls argument |
| - hk: fix patch count = 0 handling |
| - agx: delete more inots |
| - asahi: fix possible null deref |
| - nir: factor out nir_verts_in_output_prim helper |
| - agx: add agx_is_shader_empty helper |
| - hk: optimize out empty dispatches |
| - asahi: optimize out empty dispatches |
| - libagx: factor out _libagx_end_primitive |
| - agx/nir_lower_gs: optimize static topologies |
| - agx/nir_lower_gs: clean comment |
| - hk: bump wg sizes for geometry shader |
| - asahi,hk: use indirect-local dispatches for GS |
| - agx/nir_lower_gs: clean up state/info duplication |
| - agx/nir_lower_gs: remove silly comment |
| - agx/nir_lower_gs: clean up more state->info |
| - agx/nir_lower_gs: privatize lower_output_to_var_state |
| - agx/nir_lower_gs: drop silly fwd decl |
| - agx/nir_lower_gs: use common nir_verts_in_output_prim |
| - agx/nir_lower_tess: drop pointless helpers |
| - agx/nir_lower_tess: drop dead todo |
| - libagx: use #pragma once in geometry.h |
| - libagx: drop libagx_popcount |
| - libagx: drop libagx_sub_sat |
| - util: optimize bitcount on OpenCL |
| - asahi: clang-format |
| - agx: use abi.h defines |
| - agx: plumb vertex_id_zero_base |
| - agx/nir_lower_gs: fix type confusion |
| - agx/nir_lower_gs: avoid redundant sr read's |
| - agx/nir_lower_gs: don't use nir_def_rewrite_uses |
| - agx/nir_lower_gs: rework gs rast shape handling |
| - agx/nir_lower_gs: compact static topologies |
| - agx/nir_lower_gs: bound static topologies |
| - libagx: do not include heap in geometry params |
| - libagx: rename agx_geometry_state to agx_heap |
| - libagx: use common heap alloc for tessellator |
| - asahi: do not use "Null" layout |
| - hk: advertise VK_KHR_shader_quad_control |
| - nir: add nir_lower_default_point_size pass |
| - agx/nir_lower_gs: use nir_lower_default_point_size |
| - agx/nir_lower_tess: use nir_lower_default_point_size |
| - hk: use nir_lower_default_point_size |
| - vtn: remove spurious texel buffer warning |
| - nir/lower_printf: fix vectors with nir_printf_fmt |
| - vtn: ignore SpvDecorationFPFastMathMode |
| - meson: make ffs available in OpenCL code |
| - docs/features: add honeykrisp |
| - vulkan: pass enabled features to drivers |
| - hk: make hk_lower_nir static |
| - hk: gate min LOD emulation on the feature |
| - asahi: clang-format |
| - hk: gate custom border colour emulation on feature |
| - asahi: advertise ASTC HDR formats |
| - nir: generalize bitfield_reverse bit size |
| - nir/lower_alu: compact bitcount lowering |
| - nir/lower_int64: add bitfield_reverse lowering |
| - asahi: drop dead surface stuff |
| - asahi: clang-format |
| - agx: fix sample_mask packing overflow |
| - agx: add XML description of the AGX2 ISA |
| - agx: add instruction encoding cases |
| - agx: add XML-based disassembler |
| - agx: drop encoding masks |
| - hk: clean out unused driconf |
| - asahi: fix shader-db with fbfetch |
| - hk: enable ETC, ASTC |
| - hk: enable another format |
| - hk: clarify A8_UNORM issue |
| - hk: fix texture state count |
| - nir/opt_preamble: hoist reorderable SSBO loads on AGX |
| - nir/opt_sink: sink agx backfacing |
| - nir: model dynamic uniform layout on hk |
| - agx: add headers for uniform abi |
| - asahi: clean up fs prolog pass |
| - asahi: use #defines for prologs/epilogs |
| - asahi: use #defines for uniforms |
| - hk: use #defines for uniforms |
| - hk: don't advertise b4g4r4a4 format |
| - hk: make image heap / root uniform dynamic |
| - hk: stop reserving pile of uniform regs for literally no reason |
| - hk: stop uploading blend constants |
| - hk: don't give DXVK fake min/max |
| - hk: make border colour lint quieter |
| - hk: disable custom border for proton |
| - hk: optimize !largePoints |
| - hk: drop psiz writes with !largePoints |
| - asahi,hk: fix ts allocation for more shader types |
| - ail: generalize solid mode values |
| - ail,asahi,hk: distinguish logical vs allocation compression |
| - libagx: add agx_2d grid helper |
| - libagx: export coordinate twiddle routine |
| - libagx: add fast-clear kernels |
| - hk: implement vkCmdClear*Image with fast-clears |
| - hk: use fast-clears for empty render passes |
| - hk: merge adjacent CDM control streams |
| - nir/lower_blend: fix snorm factor clamping |
| - hk: enable snorm rendering |
| - nir: plumb load_global_bounded |
| - hk: optimize load_global_bounded |
| - hk: sink SSBO loads |
| - nir,hk: sink lowered UBOs |
| - hk: use new path for !soft fault UBO |
| - nir/search_helpers: add has_multiple_uses helper |
| - agx: optimize sign fix ups |
| - agx: defer nir_opt_idiv_const |
| - hk,asahi: move scratch BO to common |
| - asahi: use zero/scratch page for null textures |
| - hk: eliminate null_sink |
| - asahi,hk,libagx: drop null checks |
| - libagx: optimize xfb offset pointers |
| - agx/nir_lower_gs: rework XFB prim calculation |
| - agx/nir_lower_gs: CLify more XFB code |
| - agx/nir_lower_gs: remove random idiv const call |
| - libagx: port pre-GS to CL |
| - hk: fix gs static topology + indirect draw |
| - nir/opt_algebraic: add trichotomy identity |
| - nir: add intrinsics for geometry shader lowering |
| - nir: add rasterization_stream sysval |
| - libagx: add agx_vdm_barrier |
| - asahi: clang-format |
| - asahi: flush around XFB |
| - asahi,hk: significantly rework GS |
| - hk: advertise more GS features |
| - hk: optimize point size writes with GS/TS |
| - nir/lower_gs_intrinsics: drop stuff added for AGX |
| - nir: rename AGX geom/tess intrinsics |
| - treewide: use nir_break_if |
| - treewide: use nir_break_if with named if |
| - nir/lower_system_values: optimize global ID |
| - nir/lower_tex: optimize LOD bias lower for txl |
| - treewide: use VARYING_BIT_* |
| - nir: fuse ffma even with float controls |
| - nir/opt_algebraic: optimize signed pow in Control |
| - nir/lower_alu: optimize min/max signed zeros |
| - agx: lower alu after scalarizing |
| - nir/opt_preamble: add "register class" concept |
| - nir/rewrite_image_intrinsic: handle explicit coord |
| - nir/rewrite_image_intrinsic: handle non-derefs |
| - nir: fix AGX intrinsic flag |
| - agx: add stats for preamble inst |
| - nir,agx: switch to bindless_image_agx intrinsic |
| - agx: handle image_sparse_load |
| - agx: model scoreboard slot for state stores |
| - agx: plumb texture state store instruction |
| - agx: add bindless texture promotion support |
| - asahi/clc: promote bindless textures |
| - hk: promote bindless textures |
| - asahi: use nir_rewrite_image_intrinsic |
| - agx: fix incorrect 16-bit promotions with comparisons |
| - asahi: fix disasm of large shaders |
| - nir/lower_tex: revert "optimize LOD bias lower for txl" |
| - agx: make sure denorm flushing really happens |
| - asahi: enable virtgpu support |
| |
| Antonino Maniscalco (2): |
| |
| - zink: wait for sparse queue to go idle |
| - freedreno/decode: expose lookback as a cli option |
| |
| Antonio Ospite (29): |
| |
| - meson: bump default value of platform-sdk-version to Android 14 |
| - ci/android: only use custom kernel for venus GPU_MODEs |
| - ci/deqp: force overwriting exiting files when compressing with zstd |
| - ci/android: specify EXCLUDE_FILTERS after INCLUDE_FILTERS when launching Android CTS |
| - ci/android: strip tailing spaces in Android CTS expectation files |
| - ci/android: show how to add more Android CTS test cases |
| - ci/android: fix printing the original GLES and Vulkan versions |
| - ci/android: get only the first device from eglinfo and vulkaninfo |
| - ci/android: show also the device name when printing GLES and VK info |
| - ci/android: silence mesa error about \`//.cache` |
| - ci: set up XDG_RUNTIME_DIR to be under /tmp which is tmpfs |
| - ci/android: skip org.skia.skqp.SkQPRunner#UnitTest_BulkTextureRectTest |
| - ci/android: skip org.skia.skqp.SkQPRunner#UnitTest_SurfaceResolveTest |
| - ci/android: configure both hard limits and soft limits |
| - ci/android: use newer cuttlefish images |
| - ci/android: use \`curl -O` to avoid having to specify the same filename again |
| - ci/android: test that all available modules ran in android-cts-runner.sh |
| - ci/android: get aapt from the build-tools from Google instead of Debian |
| - ci/android: trust system GLES if ANGLE_TAG is not specified |
| - ci/android: support issuing custom commands before running Android CTS |
| - ci/android: download mesa artifacts in a more robust way |
| - ci/android: download S3_ANDROID_ARTIFACT_NAME in cuttlefish-runner.sh |
| - ci: fix using _section_switch and _uncollapsed_section_switch |
| - ci/android: allow unbound ARTIFACTS_BASE_URL in android-cts-runner.sh |
| - ci/android: have a cleaner structure for results from Android CTS |
| - ci/android: update comment about ANDROID_CTS_MODULES |
| - build: stop calling unreachable() without arguments |
| - util: avoid calling UNREACHABLE(str) macro without arguments |
| - libcl: avoid calling UNREACHABLE(str) macro without arguments |
| |
| Arunpravin Paneer Selvam (1): |
| |
| - amdgpu: Add queue id support to the user queue wait IOCTL |
| |
| Asahi Lina (4): |
| |
| - asahi: Add refcnt assert to agx_bo_unreference() |
| - asahi: Add BO handles to resource debug |
| - asahi: Mark KMS exported resource BOs as shared |
| - asahi: Ensure shared BOs have a prime_fd |
| |
| Ashley Smith (10): |
| |
| - panvk: Add support for VK_KHR_multiview on v7 |
| - panvk: Expose support for multiview on v7 |
| - mesa: Add support for GL_EXT_shader_realtime_clock |
| - panfrost: Fix shader_clock support for v6+ |
| - panvk: enable VK_KHR_shader_clock |
| - panfrost: Enable GL_EXT_shader_realtime_clock on panfrost v6+ |
| - mesa: Add support for GL_EXT_shader_clock |
| - bi/va: Add instructions required for KHR_shader_atomic_int64 |
| - pan/va: Add support for 64-bit atomic operations |
| - panvk: Enable VK_KHR_shader_atomic_int64 on Valhall |
| |
| Autumn Ashton (2): |
| |
| - tu: Implement VK_KHR_unified_image_layouts |
| - radv: Fix handling of NULL pColorAttachmentLocations in vkCmdSetRenderingAttachmentLocations |
| |
| Bas Nieuwenhuizen (4): |
| |
| - radv: Add support for not having gfx queues. |
| - radv: Don't init gfx preamble without gfx support. |
| - radv: Add CDNA register settings. |
| - radv: Use packed invocation ids for newer CDNA. |
| |
| Bo Hu (8): |
| |
| - gfxstream: update sudbdecode snapshot code |
| - gfxstream:: update code gen for vkResetCommandPool |
| - gfxstream: [vulkan snapshot]: update codegen for vkCmdBindPipeline |
| - gfxstream: [vulkan snapshot]: update to support vkCmdCopyBuffer |
| - gfxstream: [vulkan snapshot]: update codegen to handle failed api calls |
| - gfxstream: [vulkan snapshot]: update codegen for vkUpdateDescriptorSetWithTemplateSized |
| - gfxstream: update codegen for vkQueueFlushCommandsGOOGLE |
| - gfxstream: [vulkan snapshot]: update codegen for vkCmdCopyImageToBuffer |
| |
| Boris Brezillon (95): |
| |
| - panvk: Advertise support for VK_EXT_vertex_input_dynamic_state |
| - panvk: Set .pushDescriptor=true |
| - panvk/csf: Pass less arguments to emit_vs_attrib() |
| - panvk/csf: Fix instance attribute offseting |
| - panvk: Advertise support for VK_EXT_vertex_attribute_divisor |
| - panvk: Set supportsNonZeroFirstInstance=true |
| - panvk: Advertise support for VK_KHR_maintenance4 |
| - panvk: Implement CmdBindIndexBuffer2() |
| - panvk: Pass a const image to is_disjoint() |
| - panvk: Don't pass a dev to panvk_image_init() |
| - panvk: Call vk_image_finish() in GetDeviceImageMemoryRequirements() |
| - panvk: Implement GetImageSubresourceLayout2 and GetDeviceImageSubresourceLayoutKHR() |
| - pan/format: Disable image storage on A8_UNORM |
| - panvk: Advertise support for VK_KHR_maintenance5 |
| - panvk: Implement CmdBindVertexBuffers2() |
| - panvk: Advertise support for VK_EXT_extended_dynamic_state[2] |
| - panvk: Advertise support for VK_KHR_shader_terminate_invocation |
| - pan/bi: Fix and improve the !abs && !coarse case in bi_emit_derivative() |
| - pan/bi: Lower is_helper_invocation |
| - pan/{bi,va}: Lower terminate to demote |
| - panvk: Advertise VK_EXT_shader_demote_to_helper_invocation support |
| - pan/format: Make pan_format::bind a 5-bit field |
| - pan: Drop the format argument passed to drm_is_mtk_tiled() |
| - pan/format: Change the way we expose supported modifiers |
| - pan: Rename/move panfrost_format_supports_mtk_tiled() |
| - pan: Consolidate AFBC helpers in a single source file |
| - pan: Consolidate AFRC helpers in a single source file |
| - pan/layout: Refactor WSI explicit image layout support |
| - pan: Draw a clear line between image layout, image and texture helpers |
| - pan: Use a consistent pan\_ prefix across src/panfrost/* |
| - pan: Add a pan_image_mip_level_size() helper and use it for 3D attribute initialization |
| - pan/image: Get rid of pan_image_mem::offset |
| - pan/layout: Split pan_image_layout |
| - pan/texture: s/pan_texture_emit/pan_sampled_texture_emit/ |
| - pan/texture: Don't mix image and buffer views |
| - pan/format: Draw a clear line between native and emulated modifiers |
| - pan/afbc: Add helpers to query AFBC header alignment requirements |
| - pan/afrc: Fix pan_format_supports_afrc() |
| - pan/afrc: Let's not pretend we support AFRC(YUV) |
| - pan/afrc: Reject AFRC(compressed) |
| - pan/layout: Drop the AFBC(compressed) tests in AFBCFormatInvariant |
| - pan/layout: Check the wsi_layout consistency in wsi_row_pitch_to_row_stride() |
| - pan/layout: Don't align WSI import offset |
| - pan/layout: Add an option to enforce WSI row pitch at layout init time |
| - pan/layout: Don't base the WSI row_pitch calculation on the image width |
| - util/format: Add missing entries for {R8_B8G8,G8_B8R8}_422_UNORM |
| - pan/layout: Test WSI import behavior on all supported format/mods |
| - panfrost: Add a debug flag to enforce WSI row pitch at import time |
| - util/format: Handle emulated YUV formats in util_format_get_plane_{height,width}() |
| - pan/image: Teach pan_image/layout about planar images |
| - panfrost: Draw a clear line between AFBC and MTK_TILED conversion shaders |
| - panfrost: Make the MTK detiling more robust |
| - pan/image: Start checking image view consistency |
| - pan/format: Add an entry for BGR8 to the blendable table |
| - panvk: Make panvk_device.{c,h} panvk_queue agnostic |
| - pan/genxml: Make sure we catch overflows on 32-bit integer fields |
| - panfrost: Make sure we don't use AFBC on resources that are too big |
| - panfrost: Let ::can_create_resource() report when the layout init failed |
| - panvk: Make sure we don't use AFBC on images that are too big |
| - panvk: Lower maxImageDimension{2D,3D,Cube} to match the HW caps |
| - panvk: Check the return of pan_image_layout_init() |
| - pan/layout: Fix size_B calculation for AFBC(3D) |
| - pan/layout: Fix get_plane_blocksize() |
| - pan/layout: Relax alignment constraints on pre-v7 for imports of linear/u-tiled |
| - pan/layout: Use uint64_t types for offsets and sizes |
| - pan/layout: Drop pan_image_slice_layout::afbc::{stride_sb,nr_sblocks} |
| - pan/genxml: Reconcile AFBC RT properties naming on v5+ |
| - pan/afbc: Fix header alignment requirement on Midgard |
| - pan/afxc: s/pan_format_supports_afxc/pan_afxc_supports_format/ |
| - pan/genxml: Introduce per-type plane descriptors |
| - pan/texture: Split the texture payload emission logic per modifier |
| - pan/genxml: Get rid of the Plane descriptor |
| - pan/desc: Split the attachment descriptor emission per modifier |
| - pan/image: Get rid of pan_iview_get_surface() |
| - pan/layout: Get rid of pan_image_surface_{offset,stride}() |
| - pan/afbc: Add the pan_afbc_{super,render}block_size_el() helpers |
| - pan/format: De-duplicate get_plane_blocksize() |
| - pan/layout: Split the logic per modifier |
| - pan/layout: Get rid of pan_image_[render]block_size_el() |
| - pan/layout: Document the strict property |
| - pan/layout: Fix WSI.Import test |
| - pan/layout: Don't mix AFBC and non-AFBC properties |
| - panfrost: Allow AFBC(3D) on Valhall |
| - pan: Add the concept of modifier handler |
| - pan: Make it so all pan_image_xxx helpers get passed an image |
| - pan/layout: Interleave header/body in AFBC(3D) |
| - pan/genxml: Fix the texture dimension field on v9+ |
| - pan/genxml: Add missing upper-32 bits of the buffer size on v12+ |
| - pan/genxml: Rework the RT/ZS emission logic |
| - pan/layout: Allow bigger size/surface stride on v12+ |
| - panvk: Bump the max image size on v11+ |
| - panfrost: Force strict import rules for AFBC(YUV) |
| - panvk: Make imports strict at all times except for AFBC(RGB) |
| - pan/kmod: Explicitly set the user MMIO offset when the KMD supports it |
| - pan/kmod: Expose the raw GPU ID through pan_kmod_dev_props |
| |
| Boyuan Zhang (10): |
| |
| - pipe: add boolean for CENC standard |
| - radeonsi: add header file for cenc parameters |
| - radeonsi/vcn: add subsample for cenc |
| - radeonsi/vcn: add drm key for cenc |
| - radeonsi/vcn: add drm keyblob for cenc |
| - frontends/va: enable cenc for protected playback |
| - pipe: add gaps_in_frame for h264 |
| - frontends/va: get gaps_in_frame for h264 dec |
| - radeon/vcn: add gaps_in_frame flag to h264 sps |
| - ci/fluster: remove 3 pass cases resulted by gaps_in_frame |
| |
| Brian Paul (14): |
| |
| - svga: handle null target pointers in svga_set_stream_output_targets() |
| - svga: increase SVGA_MAX_FRAMEBUFFER_DEFAULT_SAMPLES to 16 |
| - svga: 80-column wrapping and misc code cleanups |
| - svga: handle NULL surface in svga_surface_needs_propagation() |
| - svga: remove unneeded assignments in svga_set_framebuffer_state() |
| - gallium/util: minor clean-ups in u_framebuffer.c |
| - svga: whitespace clean-ups in svga_screen_cache.h |
| - svga: asst. clean-ups in svga_surface.c |
| - svga: rework framebuffer state |
| - svga: stop using pipe_context::create_surface() |
| - svga: assorted code clean-ups |
| - svga: assorted code clean-ups in svga drm code |
| - gallivm: always use mkdir() instead of std::filesytem::create_directory() |
| - util: add/use new os_mkdir() function |
| |
| Caio Oliveira (36): |
| |
| - spirv, radv, intel: Add NIR intrinsic for cmat conversion |
| - spirv, nir: Delay calculation of shared_size when using explicit layout |
| - spirv, nir: Allow non-Aliased workgroup memory blocks |
| - hk: Don't expect garbage on shared_size |
| - intel/executor: Reorganize -h and --help |
| - intel/executor: Fix bfloat example for converting F to packed BF |
| - util: Add functions to convert float to/from bfloat16 |
| - nir: Add opcodes for BFloat16 |
| - brw, nir: Use glsl_base_type instead of nir_alu_type for \@dpas_intel |
| - brw: Implement support for BFloat16 ALU opcodes |
| - brw: Consider bfloat16 in copy propagation |
| - brw: Consider bfloat16 in lower regioning pass |
| - brw: Consider bfloat16 in lower simd width pass |
| - brw/cmat: Store more information about cmat slices |
| - brw/cmat: Extract emit_packed_alu1() function |
| - brw/cmat: Support different src/dst packing factors in emit_packed_alu1 |
| - brw/cmat: Implement conversion from/to BFloat16 |
| - spirv: Move Convert opcodes handling to its own function |
| - spirv: Refactor to use glsl_type to pick ALU ops |
| - spirv: Add bfloat16 support to SpecConstantOp |
| - spirv: Implement Conversions to/from bfloat16 |
| - spirv: Use bfdot for SpvOpDot with BFloat16 |
| - intel: Add support for BFloat16 as cooperative matrix source |
| - intel: Don't require dpas_intel src2 to match destination |
| - brw/disasm: Don't print src1 information for SEND gather |
| - brw: Fix MAD instruction usage in spilling logic |
| - intel/executor: update SFID names in macros to match recent changes |
| - intel/executor: allow single line comments in macro lines |
| - brw: Use the right width in brw_nir_apply_key for BS shaders |
| - brw: Don't use simd_select for BS shaders |
| - spirv: Add more restrictions around Blocks |
| - iris: Refactor BRW_KEY_INIT to get an iris_prog_base_key |
| - intel: Add INTEL_DEBUG=no-vrt |
| - brw: Add scoreboard test for edge case involving baked dependency |
| - brw: Fix comparison with unordered_mode when making baked dependency |
| - brw: Remove extra iteration on instructions from brw_opt_address_reg_load |
| |
| Calder Young (11): |
| |
| - iris: Fix accidental writes to global dirty bit instead of local |
| - iris: set dependency between SF_CL and CC states |
| - intel_aux_map: Avoid creating new table pages when removing |
| - anv: Support render to aspect other than IMAGE_ASPECT_COLOR_BIT |
| - anv: Support multi-planar formats in anv_formats_are_compatible |
| - anv: Fix typo when checking format's extended usage flag |
| - iris: Fix issue with conditional dispatching |
| - intel/genxml: Update AVP instructions for Gfx125 and Xe2 |
| - anv: Add support for AV1 video decoding on Gfx125 and Xe2 |
| - anv: Fix tiling for H.265 and VP9 video surfaces on GFX 12.5+ |
| - isl: Set tiling requirements for video surfaces |
| |
| Caleb Callaway (9): |
| |
| - driconf: Jusant needs force_vk_vendor=-1 on Intel devices |
| - intel/perf: intel_monitor documentation tweak |
| - intel/debug: shader dump filter |
| - intel/compiler: fix SHA generation for shader replace |
| - anv: ANV_DEBUG_WAIT_FOR_ATTACH |
| - docs: Add CPU frequency governor perf tip |
| - anv: Increase max VBs to 33 on Gen11+ |
| - iris: re-emit push constants at compute batch start |
| - iris: ISP invalidate at end of compute batches |
| |
| Caterina Shablia (5): |
| |
| - panvk: ensure we don't go over the hardware FAU limit |
| - panvk: bump maxPushConstantSize to 256 |
| - panvk: prepare for multiple queue types |
| - panvk: move panvk_image_plane_bind to the beginning of the file |
| - panvk: add panvk_as_alloc and panvk_as_free helpers |
| |
| Chan, Roy (1): |
| |
| - amd/vpelib: fix doxgen warnings |
| |
| Chang, Tomson (1): |
| |
| - amd/vpelib: Update headers |
| |
| Charlotte Pabst (1): |
| |
| - mesa: clear program info when updating program string |
| |
| Chen, Phoebe (1): |
| |
| - amd/vpelib: Fix memory leak from segment ctx |
| |
| Chia-I Wu (7): |
| |
| - panvk: check for CS error |
| - panvk: remove debug_syncobjs |
| - panvk: propagate CS error in barrier sync wait |
| - panvk: define more utrace tracepoints |
| - panvk/csf: add panvk_cache_flush_is_nop |
| - panvk/csf: prepare for indirect dispatch tracing |
| - panvk/csf: add more utrace tracepoints |
| |
| Christian Gmeiner (47): |
| |
| - etnaviv: isa: Add txf instruction |
| - etnaviv: nir: Legalize txf lod src |
| - etnaviv: nir: Add support for txf texture operation |
| - etnaviv: nir: Use nir_shader_tex_pass(..) |
| - etnaviv: nir: Legalize txd derivatives src's |
| - etnaviv: compiler: Handle f2f16 and f2f32 |
| - etnaviv: compiler: Call nir_lower_alu_width(..) |
| - etnaviv: compiler: Enable more pack/unpack lowerings |
| - lima: Move fdot lowering from NIR to lima |
| - zink: Fix NIR validation error in cubemap-to-array lowering |
| - asahi: Don't use deprecated NIR_PASS_V macro |
| - util/perf: Change _mesa_trace begin functions to return void * |
| - util/perf: Add sysprof integration |
| - treewide: Do not use NIR_PASS_V for nir_divergence_analysis(..) |
| - vc4: Use nir_shader_intrinsics_pass(..) for vc4_nir_lower_io(..) |
| - vc4: Use nir_shader_intrinsics_pass(..) for vc4_nir_lower_blend(..) |
| - lima: Convert to use nir_shader_intrinsics_pass |
| - asahi/lib: Don't use deprecated NIR_PASS_V macro |
| - nir/inline_uniforms: Convert to use nir_shader_intrinsics_pass(..) |
| - zink: fixup_io_locations: Preserve all metadata |
| - zink: Don't use deprecated NIR_PASS_V macro anymore |
| - st/mesa: Remove NIR_PASS_V usage |
| - ir2: Remove NIR_PASS_V usage |
| - crocus: Return progress for crocus_lower_storage_image_derefs(..) |
| - crocus: Don't use deprecated NIR_PASS_V macro |
| - etnaviv: tex_desc: Add seamless cube map support |
| - nouveau/nir: Return progress for nv_nir_move_stores_to_end(..) |
| - nouveau/nir: Don't use deprecated NIR_PASS_V macro |
| - etnaviv: isa: Add texldgpcf instruction |
| - etnaviv: nir: Legalize txd comparator |
| - etnaviv: Push nir_tex_instr down to etna_emit_tex(..) |
| - etnaviv: nir: Use texldgpcf opcode for shadow texture gradients |
| - etnaviv: nir: Add assertion to prevent src2 conflicts in texture instructions |
| - etnaviv: nir: Use texldlpcf opcode for shadow texture look-ups with explicit LOD |
| - etnaviv: nir: Use nir offset lowering |
| - etnaviv: nir: Move pre-halti5 tex lowering |
| - etnaviv: nir: Use nir offset lowering for txb |
| - nir: Unvendor sampler_lod_parameters(_pan) |
| - etnaviv: nir: Support nir_intrinsic_load_sampler_lod_parameters |
| - etnaviv: nir: Add nir_texop_tex offset lowering |
| - etnaviv: nir: Add nir_texop_txl offset lowering |
| - etnaviv: nir: Add nir_texop_txd offset lowering |
| - etnaviv: nir: Enable vectorization with 4-component width limit |
| - etnaviv: nir: Preserve dot product instructions |
| - mesa: Include mask value in glStencilMask VERBOSE_API debug output |
| - etnaviv: blt: Clear only requested color buffers |
| - etnaviv: rs: Clear only requested color buffers |
| |
| Christoph Neuhauser (2): |
| |
| - egl: Add EGL_EXT_device_query_name and EGL_EXT_device_persistent_id |
| - egl: Fix DRI utility function compilation on macOS |
| |
| Christoph Pillmayer (36): |
| |
| - panfrost: Enable more than 16 varyings on v9+ |
| - panfrost: Remove fixed_varying_mask from uncompiled_shader |
| - panfrost: Use LD_VAR_BUF if possible on v9+ |
| - panfrost/ci: Update spec\@ext_transform_feedback\@max-varyings result |
| - panfrost: Make ls tracker mandatory in cs_builder |
| - panvk: Change pending_stores from bitset to bool |
| - panvk: Add automatic load/store cs WAITs |
| - panvk: Remove explicit LS waits |
| - panfrost: Remove unused ls_sb_slot from contexts |
| - panvk: Remove unused path in flush_tiling |
| - panvk: Implement VK_EXT_inline_uniform_block |
| - panvk: Advertise VK_EXT_inline_uniform_block |
| - panvk: Fix start label position in cs_while_start |
| - panvk: Add support for 64bit conditionals in CSF |
| - panvk: Use ls_tracker in cs_if_else |
| - panvk: Manually flush loads in panvk_cmd_draw_indirect |
| - panvk: Move cs_wait_slot(s) and cs_flush_loads/stores |
| - panvk: Fix ls_tracker usage in cs_loop |
| - panvk: Add tests for ls tracker behavior in cs_loop |
| - panvk: Add cs_single_link_list |
| - panvk: Fix occlusion query oq_chain order |
| - panvk: Handle occlusion queries + multiview |
| - panvk: Propagate occ query state from secondary to primary |
| - panvk: Pull CSF barrier logic into separate function |
| - panvk: Rename reset_oq_batch |
| - panvk: Add conditional wait in emit_barrier |
| - panvk: Move get_subqueue_stages to header |
| - panvk: Add timestamp write and reset |
| - panvk: Add timestamp copy |
| - panvk: Advertise timestamp support |
| - panvk: Handle IUBs in decoder |
| - panvk: Implement VK_KHR_calibrated_timestamps |
| - panvk: Advertise VK_KHR_calibrated_timestamps |
| - panvk: Fix IUB desc type check |
| - panvk: Allow invalid enum values in get_image_format_properties |
| - vk/sync: Pass dependencyFlags in vk_common_CmdPipelineBarrier |
| |
| Christopher Gautier (2): |
| |
| - panfrost, panvk: Use arch version 9 to distinguish between Bifrost & Valhall |
| - panfrost, panvk: The size of resource tables needs to be a multiple of 4. |
| |
| Collabora's Gfx CI Team (12): |
| |
| - Uprev ANGLE to f355e2b37ed43939e2753fc7dacccf75abb4c1a3 |
| - Uprev Piglit to c50d9aa54f85e0af9d72fab86c73f89356d96399 |
| - Uprev Piglit to 1498c397ea35119692b579dd6f523de4651c663f |
| - Uprev ANGLE to 3540a326ec8497700523eb2d3eca90ae21806e8e |
| - Uprev ANGLE to db33baf4eb0d7954f0110cddc30acb9cdc12e2d4 |
| - Uprev ANGLE to db71e8fa7c26d18f76d7b9e9474447b20f1c73b3 |
| - Uprev Piglit to 1767af745ed96f77b16c0c205015366d1fbbdb22 |
| - Uprev ANGLE to ec4d8f8e4d3f1a9e64430abd8b6b436253732adf |
| - Uprev Piglit to c32e5cc717e846ffa43408174181dc16a31d3df0 |
| - Uprev ANGLE to b1359277066648c3577ea8b57cc5e0bccee7b6cc |
| - Uprev Piglit to 685ea49b47ae52560975cfcdb0ea0d4d27fead16 |
| - Uprev Piglit to a0a27e528f643dfeb785350a1213bfff09681950 |
| |
| Connor Abbott (25): |
| |
| - tu: Fix flushing when using a staging buffer for copies |
| - freedreno: Add compute_lb_size device info |
| - freedreno/a6xx: Define CONSTANTRAMMODE |
| - freedreno/a6xx, turnip: Set CONSTANTRAMMODE correctly |
| - ir3: Take LB restriction on constlen into account on a7xx |
| - freedreno: Remove compute_constlen_quirk |
| - freedreno: Fix CP_RESET_CONTEXT_STATE bitfield names |
| - tu: Use safe-const GS variant for VPC binning state |
| - tu: Add perf debug for FDM being disabled |
| - tu: Remove outdated comment |
| - tu: Add tu_bo_init_new_cached() helper |
| - tu: Use cached coherent memory when possible for query pools |
| - tu: Use a temporary descriptor to patch input attachments |
| - tu: Make sure to re-emit viewports if per_view_viewport changes |
| - tu: Re-emit viewports/scissors when has_fdm changes |
| - util/u_trace: Add u_trace_move() |
| - tu: Give renderpass events a separate trace buffer |
| - tu: Remove now-redundant tu_trace_render_pass_start() |
| - tu: Don't leave tile store CS writeable |
| - tu/cs: Assert that the CS is not writeable when resetting |
| - tu: Don't allow changing writeableness in a condition |
| - tu: Add debug flag to force disable FDM |
| - vk/runtime: Handle VK_PIPELINE_CREATE_2_PER_LAYER_FRAGMENT_DENSITY_BIT_VALVE |
| - tu: Split out viewport faking from per-view viewports |
| - tu: Implement VK_VALVE_fragment_density_map_layered |
| |
| Corentin Noël (23): |
| |
| - mesa: enable GL name reuse for virgl |
| - vtest: Silence some coverity issues |
| - vtest: Be more resilient when a resource creation has failed |
| - virgl: Ensure to not overflow when encoding string marker |
| - virgl: Bail out on file descriptor duplication failure |
| - virgl: Avoid possible double free when destroying the hw resource |
| - llvmpipe: Avoid function name leak on fail paths |
| - llvmpipe: Asserts that the format at least has one non-void channel |
| - llvmpipe: Check allocated memory instead of argument |
| - llvmpipe: Do bitwise operation on unsigned constants |
| - llvmpipe: Close mem_fd or dmabuf_fd on error |
| - llvmpipe: Check for negative size before mapping the memory |
| - llvmpipe: Make sure to not divide by zero on debug output |
| - llvmpipe: Make sure to not divide by zero on debug output |
| - ci: Update CrosVM and Virglrenderer |
| - virgl: Free resource memory on creation failure |
| - egl/wayland: Plug leak when implicit modifiers are unsupported |
| - glx: Free all populated fields of the glx_display on creation failure |
| - mesa/st: Fix memory leak in st_create_common_variant |
| - mesa/st: Fix leak in st_create_fp_variant |
| - venus: Fix leak in syncobj creation failure path |
| - virgl: Add more Gallium formats to the list |
| - virgl: Add more Gallium formats to the list |
| |
| Daniel Lang (2): |
| |
| - etnaviv: hwdb: update gc_feature_database from ST |
| - etnaviv: hwdb: update gc_feature_database from NXP |
| |
| Daniel Schürmann (45): |
| |
| - nir: add option to move terminate{_if} out of loops |
| - radv: move terminate{_if} out of loops. |
| - aco: remove block_kind_continue_or_break workaround and tests |
| - aco/isel: remove check for empty exec mask on uniform continues |
| - aco/isel: assert that terminate intrinsics don't appear in loops |
| - nir/tests: add tests for nir_move_terminate_out_of_loops |
| - aco/insert_exec_mask: don't create unnecessary loop-header and -exit phis after demote |
| - aco: rename aco_instruction_selection_setup.cpp -> aco_isel_setup.cpp |
| - aco: move instruction selection files to /compiler/instruction selection/ subfolder |
| - aco/isel: move some helper functions into a separate file |
| - aco/isel: move control-flow helper functions into separate file |
| - aco/isel: move select_trap_handler_shader() into separate file |
| - aco/isel: move select_rt_prolog() into separate file |
| - aco/isel: move select_vs_prolog() into separate file |
| - aco/isel: move select_ps_prolog() into separate file |
| - aco/isel: move select_ps_epilog() into separate file |
| - aco/isel: rename aco_instruction_selection.cpp -> aco_isel_nir.cpp |
| - aco/isel: move visit_alu_instr() into separate file |
| - aco/isel: move visit_intrinsic() into separate file |
| - aco: simplify Operand() constructor |
| - aco/live_var_analysis: set same lateKill flags for same operands |
| - aco: introduce concept of vector-aligned Operands |
| - aco/print_ir: only print 'lateKill' if requested via print_kill flag |
| - aco/print_ir: print parenthesis around vector-aligned operands |
| - aco/assembler: support vector-aligned operands on MIMG instructions |
| - aco/ra: don't use kill-flags as indicator in get_reg_create_vector() |
| - aco/ra: only change registers of already handled operands in update_renames() |
| - aco/ra: Always rename copy-kill operands, even if the temporary doesn't match |
| - aco/ra: handle register assignment of vector-aligned operands |
| - aco/ra: add affinities for MIMG vector-aligned operands |
| - aco/validate: validate register assignment of vector-aligned operands |
| - aco/isel: use vector-aligned operands for image_bvh64_intersect_ray |
| - aco/ra: always fill moved operands when handling vector-operands |
| - aco/ra: always set fill_operands=true when handling operands |
| - nir: remove recursive check in nir_lower_phis_to_scalar() |
| - nir/lower_phis_to_scalar: remove exec_list dead_instrs |
| - nir: add nir_vectorize_cb callback parameter to nir_lower_phis_to_scalar() |
| - aco/isel: refactor emission of bitwise logical operations |
| - aco: allow subdword vector-definitions on some VALU instructions |
| - aco: allow vectorized nir_op_mov |
| - radv: don't lower subdword phis to scalar |
| - radv: vectorize some integer arithmetic and bcsel with scalar condition |
| - aco: split vectorized bcsel and bitwise logic VGPR definitions |
| - aco/isel: allow to select SGPR defs for vectorized bcsel and logical operations |
| - aco/ra: always use bytes for register stride requirements |
| |
| Daniel Stone (33): |
| |
| - ci/piglit: Use structured tagging for Piglit |
| - ci: Add firmware to test-base container |
| - ci: Add packages for hardware CI testing |
| - ci: Add ci-kdl to test-base container |
| - ci/panfrost: Really document T860 array flakes |
| - ci/bare-metal: Remove remnants of old bare-metal setups |
| - ci/panfrost: Demote T720 to nightly runs only |
| - ci: Bump v6.14 kernel for updated Panthor scheduler |
| - ci: Remove baremetal rules special case |
| - ci/baremetal: Fix broken dependency declaration |
| - ci/freedreno: Fix dependencies for trace jobs |
| - ci/windows: Use build rules for the build job |
| - ci: Only run rustfmt when necessary |
| - ci: Skip check-only container jobs for pre-merge |
| - ci: Use placeholder-job for sanity |
| - ci: Fix missing pipelines on user pipelines in MRs |
| - egl/wayland: Remove unnecessary plane validity check |
| - egl/wayland: Move EGL components to wayland-drm |
| - dri2: Remove format components mapping |
| - dri: Garbage-collect old DRI interfaces |
| - x11: Remove DRI2 support |
| - egl: Remove EGL_MESA_drm_image |
| - vulkan: Remove build-system remnants of wl_drm support |
| - egl/wayland: Remove support for non-dmabuf wl_drm |
| - egl/wayland: Use DRM formats rather than wl_drm |
| - dri: Remove dri2_from_names |
| - dri: Remove getBuffersWithFormat |
| - dri: Remove DRI2 loader extension |
| - dri: Remove remnants of TexBuffer extension |
| - egl/wayland: Move bind_wayland_display to legacy build option |
| - dri: Convert DRI_IMAGE_FORMAT to pipe_format |
| - dri: Convert pipe_format <-> FourCC lookup to a table |
| - dri: Expand pipe_format <-> FourCC lookup table |
| |
| Danylo Piliaiev (39): |
| |
| - ir3: Move nir_intrinsic_barycentric_sysval to common ir3 |
| - ir3,tu,freedreno: Allow more tex coord interpolations for prefetch |
| - tu,freedreno: Don't fallback to LINEAR with DRM_FORMAT_MOD_QCOM_COMPRESSED |
| - ir3: VARYING_SLOT_LAYER output is used for binning |
| - tu: Don't enable secondary command buffer tracepoint by default |
| - tu: Add total renderpasses,dispatches to cmdbuf tracepoint |
| - freedreno/percntr: Expose LRZ derived counters |
| - freedreno/regs: Document fast border color in sampler |
| - tu: Implement fast border colors in sampler |
| - tu: Remove builtin border color logic |
| - freedreno/a6xx: Implement fast border colors in sampler |
| - tu/lrz: Fix DONT_CARE not resulting in disabled LRZ |
| - tu/lrz: Fix NOLRZ dbg option |
| - tu/lrz: Call tu_lrz_write_disable_reason once per RP |
| - tu/lrz: Disable LRZ if RP writes depth but doesn't set on GPU dir |
| - tu: Fix disable_fs state update condition |
| - tu/lrz: Consider FS depth layout when gl_FragDepth is written |
| - tu/lrz: Add tu_ignore_frag_depth_direction driconf |
| - tu/lrz: Don't disable LRZ test for blend+depth write |
| - tu: Don't disable EARLY_Z if SampleMask is written without d/s write |
| - nir: Add option to not lower gl_InstanceIndex |
| - tu,freedreno: Use HW option to auto add base instance to instance id |
| - tu: Fix state.lrz.force_late_z condition not being updated |
| - freedreno/regs: A6XX_EARLY_LRZ_LATE_Z is really A6XX_EARLY_Z_LATE_Z |
| - tu: Use EARLY_Z_LATE_Z even when LRZ is disabled |
| - tu: Use EARLY_Z_LATE_Z with alpha-to-coverage |
| - tu: Occlusion query counting should happen after FS that kills |
| - tu: Use EARLY_Z when there is no depth/stencil tests |
| - freedreno: Use fast variants of {BC4/BC5}_SNORM formats |
| - tu: Rename u_trace renderpass iterators |
| - tu: Fix tracepoints with renderpasses spanning several cmdbufs |
| - tu: Suballocate u_trace buffers to avoid allocations on submit |
| - tu: Cache copy timestamp cs to avoid allocations on submit |
| - tu: Add MESA_TRACE_FUNC to submit and BO alloc paths |
| - tu: Add info about usage to command buffers tracepoint |
| - tu/perfetto: Ignore events with zero duration |
| - freedreno/regs: update A2D definitions |
| - tu: Use safe-const binning VS when safe-const full VS is used |
| - tu: Fix nullptr dereference in cmd_buffer tracepoint |
| |
| Dave Airlie (45): |
| |
| - nouveau/headers: Add stub blackwell class headers |
| - nvk: Fix compute class comparison in dispatch indirect |
| - nvk: Don't emit MME FIFO config on Blackwell+ |
| - nvk: Add hopper priv registers |
| - nak: Use TexOffsetMode for all texture ops |
| - nvk/headers: sync up the latest blackwell compute class headers |
| - Revert "hasvk/elk: stop turning load_push_constants into load_uniform" |
| - nak/qmd: Move slm size to a separate macro |
| - nvk: bindless cbufs on hopper/blackwell are different. |
| - nak: add support for cbuf2 format to non uniform lowering. |
| - amd: move vp9 probs table to common code. |
| - vulkan/video: add vp9 decode |
| - ac/vcn: add new firmware flag to pass uncompresed header offset. |
| - radv/video: add KHR_video_decode_vp9 support. |
| - radv/video: add some missing hevc header bits |
| - radv/video: port hevc slice header encoding from radeonsi |
| - anv: only expose VK_KHR_cooperative_matrix on devices with hw instructions. |
| - radv/video: move session defines to header as encode needs more. |
| - vulkan/video: add support for AV1 encoding to runtime |
| - radv/video: add support for AV1 encoding |
| - nvk/nil: add texture header v2 support for hopper/blackwell |
| - nvk/nil: Use PTE_KIND_GENERIC_MEMORY for everything on Blackwell+ |
| - nouveau/headers: add a single field to skip parsing headers. |
| - nvk: add clc86f class for hopper host methods. |
| - nvk: handle hopper invalidate sequence for memory barrier |
| - nvk: SET_REFERENCE is not legal anymore. |
| - nvk: invalidate raster cache after setting shading rate |
| - nvk: Add hopper shared memory window alignment |
| - nak/sm70: fix imnmx on blackwell. |
| - nvk/sm90: fix su atom vs atom encoding |
| - nvk/qmd: fix shared memory size calcs for blackwell. |
| - nouveau/winsys: Add blackwell support |
| - nouveau/headers: generate c56f host methods. |
| - nvk/hopper: set texture header version |
| - nvk/nak: handle sm100 lod mode encoding. |
| - nouveau: workaround linear/z rendering interaction |
| - nak: Add misc Blackwell tex encoding bits |
| - nouveau/headers: add a hacked up clc9b5 dma-copy |
| - nvk: add support for separate depth/stencil for blackwell |
| - nak/sm70: allow first parameter of hfma2 to be non-reg |
| - nak: add divergent attribute and wrapper for nir_load_sysval_nv |
| - nak: add uniform support for s2r |
| - nak: don't set the divergent flag on uniform sysvals |
| - nak: disable imma 8x8x16 on Blackwell+ |
| - spirv: move cmat store barrier after the store. |
| |
| David Heidelberg (2): |
| |
| - docs: Drop distro unmaintained and deprecated file. |
| - docs: Rename distro to distribution |
| |
| David Neto (3): |
| |
| - mesa: flush stderr when dumping nir validation errors |
| - spirv: spirv-to-c-array: use '-' to specify stdin |
| - nir: Support printing cmat constants |
| |
| David Rosca (74): |
| |
| - radeonsi/vcn: Fix decode target index for H264 interlaced streams |
| - radeonsi/vce: Move all code to radeon_vce.c |
| - radeonsi/uvd_enc: Move all code to radeon_uvd_enc.c |
| - frontends/vdpau: Fix creating surfaces with 422 chroma |
| - frontends/va: Correctly derive HEVC StCurrBefore, StCurrAfter and LtCurr |
| - radeonsi/vcn: Stop using HEVC direct reflist |
| - radeonsi/uvd: Stop using HEVC direct reflist |
| - frontends/va: Only keep current slice RefPicList for HEVC |
| - pipe/video: Remove unused UseRefPicList |
| - frontends/va: Set HEVC NumShortTermPictureSliceHeaderBits |
| - pipe/video: Remove st_rps_bits and UseStRpsBits |
| - radeonsi/uvd_enc: Remove unused stream_handle |
| - radeonsi/vcn: Remove unused stream_handle for encode |
| - radeonsi/vcn: Stop using stream handle for decode |
| - ac/uvd: Add ac_uvd_alloc_stream_handle |
| - radv/video: Use ac_uvd_alloc_stream_handle |
| - radeonsi/video: Use ac_uvd_alloc_stream_handle |
| - radeonsi/vce: Fix bitstream buffer size |
| - radeonsi/vce: Only send one task per IB |
| - radeonsi/vce: Fix output quality and performance in speed preset |
| - radv/video: Support encoding multiple slices |
| - radv/video: Remove carrizo workaround from VCN decode |
| - radv/video: Limit 10bit H265 decode support to stoney and newer |
| - radeonsi/vcn: Get rid of not_referenced |
| - radeonsi/vcn: Remove carrizo workaround |
| - radeonsi/vcn: Support disabling HEVC dependent slice segments |
| - frontends/va: Fix H264 top/bottom is reference flags |
| - radeonsi/vcn: Use picture fence in JPEG decode |
| - radv: Use RADEON_SURF_VIDEO_REFERENCE for video DPB images |
| - radv/video: Set correct minCodedExtent for encode |
| - radv/video: Enable decode on VCN5 |
| - radv/video: Add VCN5 encode support |
| - radv/video: Add radv_enc_h264/5_pic_type |
| - radv/video: Set all pic params for H264 encode refs |
| - radeonsi/vcn: Fix encoding multiple tiles with recent VCN4 firmwares |
| - ac/gpu_info: Add vcn_fw_revision |
| - radv/video: Only enable VP9 decode with supported firmware |
| - radv/video: Fix indentation |
| - radv/video: Prefer visible VRAM for host visible context buffers |
| - radeonsi/vpe: Change some error messages to warnings |
| - vulkan/video: Add bit depth to session and session params |
| - frontends/va: Fix SyncSurface when used to sync coded buffer |
| - radeonsi/vcn: Stop forcing OBU frame for first frame on VCN4 |
| - frontends/va: Fix RefRpsIdx calculation in st_ref_pic_set |
| - radeonsi/video: Fix RefRpsIdx calculation in st_ref_pic_set |
| - radeonsi/video: Fix copy paste errors in HEVC header encoder |
| - radeonsi/video: Fix assert in radeon_bs_code_ns |
| - frontends/va: Fix leaking fences in GetImage/PutImage |
| - radeonsi/vcn: Allow B-frames as L0 reference |
| - radv/video: Add couple missing encode flags and stdSyntaxFlags |
| - ac/surface: Support RADEON_SURF_FORCE_SWIZZLE_MODE on gfx12 |
| - radv: Don't allow linear tiling for video DPB images |
| - radv: Enable tiling for video images on VCN5 |
| - radv/video: Add support for decode tier3 |
| - radeonsi/video: Set correct minimum size for VP9 decode |
| - radv/video: Set correct H264/5 decode minCodedExtent |
| - radv/video: Set correct VP9 decode minCodedExtent |
| - Revert "radeonsi/vcn: Stop using stream handle for decode" |
| - radeonsi/vcn: Correctly handle tile swizzle |
| - radv/video: Fix encode when using layered source image |
| - radeonsi/uvd: Set H264 gaps_in_frame_num_value_allowed_flag |
| - radv/video: Add bit depth and profile check for AV1 encode |
| - radv/video: Add bit depth and profile check for VP9 decode |
| - radv/video: Set encodeInputPictureGranularity for AV1 encode |
| - radv/video: Send slice control, spec misc and deblocking params every frame |
| - radv/video: Add more encode session params overrides |
| - radv/video: Fix encode bitstream buffer offset and alignment |
| - radv/video: Fix setting H265 encode cu_qp_delta on VCN2 |
| - radv/video: Fix session_init and rc_per_pic on VCN2 |
| - radv/video: Use the new defines for H264 SPS info flags |
| - frontends/va: Add H264 encode more_rbsp_data PPS flag |
| - radeonsi/vcn: Use more_rbsp_data flag for H264 PPS encode |
| - radeonsi: Add missing DEBUG_NAMED_VALUE_END to radeonsi_shader_debug_options |
| - vulkan/video: Fix h265 level values |
| |
| Derek Foreman (22): |
| |
| - loader: Move the wayland protocol build into loader |
| - wsi/wayland: Add a bool to track the presentation id fallback path |
| - wsi/wayland: Count outstanding feedbacks |
| - wsi/wayland: Refactor some buffer management code into loader |
| - wsi/wayland: Move perfetto flow_ids into loader_wayland_buffer |
| - wsi/wayland: Move buffer name string into common code |
| - wsi/wayland: Refactor some surface management code into loader |
| - wsi/wayland: Early return from tracing function |
| - wsi/wayland: Move presentation tracing code into loader |
| - loader/wayland: Move acquisition time tracking into perfetto flows |
| - egl/dri/wayland: Use loader_wayland_buffer abstraction |
| - egl/dri/wayland: Use loader_wayland_surface abstraction |
| - egl/dri/wayland: Refactor throttle code |
| - egl/dri/wayland: Refactor buffer wait out of get_back_bo() |
| - egl/dri/wayland: Use presentation feedback to track frame delivery |
| - egl/dri/wayland: Add some arbitrary perfetto trace points |
| - egl/dri/wayland: Fix memory leak in wayland buffer handling |
| - vulkan/wsi: Add basic support for PresentWait2 |
| - wsi/wayland: Add support for PresentWait2 |
| - wsi/display: Add support for PresentWait2 |
| - wsi/x11: add support for PresentWait2 |
| - vulkan: Enable PresentWait2 on many physical devices |
| |
| Dmitry Baryshkov (5): |
| |
| - mesa-clc: add an option to force inclusion of OpenCL headers |
| - rusticl/device: relax some params for embdded profile |
| - ir3: enable lower_pack_64_4x16 |
| - meson: stop building XA by default |
| - freedreno: replace fixed array for globabl_bindings with dynamic array |
| |
| Dmitry Osipenko (3): |
| |
| - util/disk_cache: Re-enable multi-file cache by default |
| - virtio/vpipe: Correct vdrm_vpipe_connect() definition |
| - venus: Fix vn_CreateImage crashing with debug build of venus |
| |
| Doug Nazar (1): |
| |
| - glx: Fix memory leak with software dri |
| |
| Dylan Baker (9): |
| |
| - blorp: cast uint32_t -> int64_t to avoid potential overflow |
| - anv: avoid potential integer overflow |
| - anv: attempt to make coverity happy |
| - anv: Add comment why we overmap and then unmap a region |
| - meson: update b_sanitize check for Meson >= 1.8 |
| - meson: use feature options for LLVM |
| - meson: change RadV dependency on LLVM to an ACO tests dependency |
| - meson: use .enable_if for intel tools dependency on expat |
| - meson: set the \`legacy-x11` option as deprecated |
| |
| Echo J (2): |
| |
| - subprojects: Don't ignore Meson wrappers in gitignore |
| - subprojects: Don't use native option for rustc-hash dependency |
| |
| Ella Stanforth (17): |
| |
| - asahi/lib: Move alpha_to_one and alpha_to_coverage lowering to common code. |
| - v3d/compiler: Fixup output types for all 8 outputs |
| - v3d/compiler: Only lower logic ops for color buffers that exist |
| - nir/alpha_to_coverage: Add an intrinsic for better dithering |
| - v3d/compiler: Implement software blend lowering |
| - v3d: Implement dual source blending |
| - v3dv: Implement dual source blending |
| - v3d/ci: move depthstencil-default_fb-drawpixels-24_8 samples=4 to flakes |
| - nir/printf: break out va_list handling |
| - nir/printf: add new helper to printf at a specific pixel. |
| - v3d/compiler: remove int/uint tracking |
| - v3d/compiler: remove return_channels from the shader key |
| - v3d/compiler: remove unused texture swizzle |
| - v3d/compiler: remove num_samplers_used from shader key |
| - v3d/compiler: use mask for 16bit and 32bit return values |
| - v3d/compiler: remove requirement for format information for fbfetch |
| - v3d/compiler: Fix ub when using memcmp for texture comparisons. |
| |
| Emma Anholt (69): |
| |
| - wsi: Label the WSI blit command buffer with a name. |
| - zink: Add debug names to our command buffers. |
| - perfetto: Create a common MesaRenderpassIncrementalState. |
| - perfetto: Move the debug_markers to the incremental state. |
| - intel/ds: Move "have we already sent initial state?" into the helper. |
| - perfetto: Allow tracepoint args to map to native perfetto fields. |
| - anv/ds: Associate the VkCommandBuffer some anv-only renderstage events. |
| - perfetto: Add helpers for passing VkDebugUtilsObjectNameInfoEXT to perfetto. |
| - anv/ds: Forward VkDebugUtilsObjectNameInfoEXT to perfetto. |
| - intel/ds: Fix formatting of stage index. |
| - tu/perfetto: Attach VkCommandBuffer handles to the GPU events. |
| - tu/perfetto: Move "have we already sent initial state?" into the helper. |
| - tu/perfetto: Forward VkDebugUtilsObjectNameInfoEXT to perfetto. |
| - tu: Disable Z reads for always/never. |
| - u_trace: Fix payload refcounting in u_trace_clone_append(). |
| - ir3: Enable nir_opt_combine_barriers. |
| - ir3: Enable nir_opt_acquire_release_barriers. |
| - ci/tu: Add a frequent a618 flake I've noticed. |
| - tu,freedreno: Rename the MRT enable_blend field to blend_reads_dest. |
| - tu,freedreno/a6xx: Give the FS_CTRL UNK24 bit a name: INOUTREGOVERLAP. |
| - tu: Add a link to my branch for stencil read optimization. |
| - intel: Use the common NIR lowering for fquantize2f16. |
| - nir: Split nir_load_frag_coord_zw to separate z/w intrinsics. |
| - nir: Add SYSTEM_VALUE_FRAG_COORD_Z/W. |
| - nir: Make pixel_coord/frag_coord_zw be peephole-able sysvals. |
| - nir: Add support for load_frag_coord_zw to nir_opt_fragdepth. |
| - intel/shim: Report support for PXP status. |
| - intel/shim: Report I915_PARAM_HAS_GEN7_SOL_RESET. |
| - intel/fs: Don't bother checking if load_frag_coord uses interpolation. |
| - intel/elk: Save the UW pixel x/y as a temp on gfx6+. |
| - intel/elk: Save the UW pixel x/y as a temp. |
| - intel/elk: Lower load_frag_coord to load_{pixel_coord,frag_coord_z/w} in NIR. |
| - intel/elk: Use the pixel_coord UW x/y values for noncoherent FB reads. |
| - intel/elk: Reduce this->pixel_x/y usage in gfx4 interp setup. |
| - intel/elk: Retire the global float pixel_x/y values. |
| - intel/elk: Fix some tabs in gen4 URB setup. |
| - intel/elk: Move pre-gen6 gl_FragCoord.w -> interpolation lowering to NIR. |
| - intel/elk: Move pre-gen6 smooth interpolation 1/w multiply to NIR. |
| - intel/elk: Use pixel_z for gl_FragCoord.z on pre-gen6. |
| - intel/elk: Move wpos_w setup right into nir_intrinsic_load_frag_w. |
| - mesa: Retire the OptimizeForAOS code. |
| - vc4: Drop VS ucp lowering code. |
| - gallium: Remove the need for set_clip_state() on !PIPE_CAP_CLIP_PLANES. |
| - v3d: Stop advertising support for HW clip planes. |
| - ci/tu: Add some recent flakes and consolidate lists into regexes. |
| - ci/tu: Don't forget to report flakes on a660, too. |
| - ci/tu: Sync up a660's flakes with a618 where they seem to match up. |
| - ci/tu: Fix a typo in the flakes list, add a couple more tess/gs flakes. |
| - nir: Add a pass to reassociate multiplication of mat*mat*vec. |
| - ir3: Enable NIR matrix reassociation. |
| - freedreno/a2xx: Disable interpolated input intrinsics. |
| - freedreno/a2xx: Dump the intrinsic name instead of a number when compile failing. |
| - freedreno/a2xx: Share the shader state create/delete functions. |
| - turnip: Fix subpass depth/stencil change detection. |
| - turnip: Move end-of-subpass resolves to a helper function. |
| - turnip: Emit tile stores at subpass end time. |
| - turnip: Share gmem allocations between attachments. |
| - tu: Block ib2 skipping for pre-final subpass resolves. |
| - glx: clean up a litttle remains of DRI2 support. |
| - egl/x11: Match loader_destroy_screen_resources() with init(). |
| - dri2: Finish removing the now-unused dep_xcb_dri2. |
| - ir3: Store the sha1 string of the compiled shader in the shader info. |
| - tu: Include the CS binary sha1 in CS dispatch tracepoints. |
| - tu: Add tracepoints around draws, with shader sha1s. |
| - ir3: Consider double_threadsize in the shader's sha1. |
| - ir3: Dump max_waves and double_threadsize with shader native code. |
| - ir3: Prioritize max waves over doubling threadsize for non-fp16 shaders. |
| - ir3: Clarify a bit of the CS threadsize logic. |
| - i915: Avoid calling drm_intel_get_aperture_sizes(). |
| |
| Eric Engestrom (298): |
| |
| - VERSION: bump to 25.2 |
| - docs: reset new_features.txt |
| - docs: update calendar for 25.1.0-rc1 |
| - docs: update calendar for 25.0.4 |
| - docs: add release notes for 25.0.4 |
| - docs: add sha sum for 25.0.4 |
| - meson: remove duplicate \`deprecated` for \`gallium-xa` option |
| - meson: remove duplicate \`deprecated` for \`power8` option |
| - ci: give high priority to post-merge jobs as well |
| - pick-ui: make \`Backport-to: 25.0` backport to 25.0 \*and more recent release branches* |
| - ci: bump apitrace |
| - glx: drop dead GL_LIB_NAME define |
| - ci/vkd3d: drop unused 32-bit build |
| - turnip/ci: drop skip of test_vrs_depth_write_dxbc as it no longer hangs |
| - ci: uprev vkd3d-proton |
| - aco: help clang 20 do some additions and subtractions |
| - ci: bump to fedora 42 |
| - pick-ui: add missing dependency |
| - docs: update calendar for 25.1.0-rc2 |
| - ci/vkd3d: drop misguided "something failed" error message |
| - ci/vkd3d: fix test failure list when no test failed |
| - ci/vkd3d: quieten the mesa version check |
| - ci/vkd3d: only keep logs of failed tests |
| - ci/vkd3d: fix RESULTS_DIR variable name |
| - ci/vkd3d: treat reading undefined variables as an error |
| - ci/vkd3d: fail shell pipeline when part of it fails |
| - ci: drop dead wgetrc as there are no uses of wget in our CI images |
| - ci: use curl instead of wget in download-git-cache.sh |
| - ci/build: drop b2c files from the builds artifacts |
| - ci/build: drop expectation files from python-artifacts |
| - ci/build: drop lava scripts from builds artifacts |
| - ci/build: drop unused VERSION from python-artifacts |
| - ci/build: split git commit sha command out of echo |
| - ci/build: rename variable to avoid changing the meaning of existing variable |
| - ci/build: drop unnecessary shellcheck disables |
| - ci/test: fix annoying yamllint warning about 2 spaces before comments |
| - amd/ci: ci yaml indentation |
| - radv/ci: move radv-kabini-vkd3d out of gitlab-ci-inc.yml |
| - ci/test: rename .b2c-vkd3d-proton-test to .test-vkd3d-proton |
| - ci/ci-tron: default HWCI_TEST_SCRIPT to deqp-runner, as it's almost always what's run |
| - ci/test: make generic fdo runner test jobs use the S3 artifacts too |
| - ci/build: drop install.tar from gitlab artifacts |
| - ci: take google-freedreno farm offline |
| - ci: use https:// to install ci-fairy |
| - ci: rename misleading \*-postmerge stages to \*-nightly |
| - amd/ci: disable retry on nightly radeonsi-vangogh-glcts-full job |
| - amd/ci: document regression in e612e840...e210b79c |
| - docs: update calendar for 25.1.0-rc3 |
| - docs: update calendar for 25.0.5 |
| - docs: add release notes for 25.0.5 |
| - docs: add sha sum for 25.0.5 |
| - ci: drop tracking of removed folder |
| - radv/ci: document flakes seen recently |
| - radeonsi/ci: document flakes seen recently |
| - turnip/ci: document flakes seen recently |
| - zink+radv/ci: document flakes seen recently |
| - zink+nvk/ci: document flakes seen recently |
| - lavapipe/ci: document flakes (including a flaky timeout) seen recently |
| - etnaviv/ci: document two fixed tests |
| - llvmpipe/ci: document regression in a02b6e6b...676e26ae |
| - ci: pass vk_require_etc2 from radv jobs through to the duts |
| - v3dv/ci: fix malformatted flakes line |
| - vc4/ci: document flakes seen over the last 7 days |
| - v3d/ci: document flakes seen over the last 7 days |
| - v3dv/ci: document flakes seen over the last 7 days |
| - turnip/ci: document a750 fixed tests |
| - radv/ci: add recent flakes |
| - etnaviv/ci: add recent flakes |
| - r300/ci: add recent flakes |
| - lavapipe/ci: add recent flakes |
| - docs: update calendar for 25.1.0 |
| - docs: add release notes for 25.1.0 |
| - docs: add sha sum for 25.1.0 |
| - docs: add calendar for 25.1.x releases |
| - ci/b2c: drop dead /runner-before-script.sh code |
| - ci/vkd3d: stop supressing vulkaninfo errors |
| - etnaviv/ci: split model and revision tags |
| - etnaviv/ci: introduce common .austriancoder-ci-tron |
| - ci/ci-tron: deduplicate setting the runner tags for most jobs |
| - amd/ci: simplify radeonsi-vangogh-glcts jobs definition |
| - radv/ci: inline .vkcts-test-valve |
| - radv/ci: fix inheritance/override order |
| - zink+turnip/ci: fix inheritance/override order |
| - ci/ci-tron: add check that the required variables are set |
| - ci/ci-tron: print all the variables |
| - ci/build: stop uploading python-artifacts to gitlab artifacts |
| - radv/ci: increase swap size on kabini to 16g |
| - radv/ci: set swap size for all amd jobs to 16g |
| - radv/ci: rename .test-radv to .ci-tron-test-radv |
| - ci/ci-tron: uprev b2c to 0.9.15.1 |
| - amd/ci: split .ci-tron-test-amdgpu from .ci-tron-test-radv |
| - amd/ci: set kernel for amdgpu jobs in .ci-tron-test-amdgpu |
| - ci/ci-tron: document which kernel version is provided in a given b2c release |
| - ci/ci-tron: drop special case for default x86_64 kernel (= uprev to 6.13.7) |
| - zink+nvk/ci: document more flakes |
| - nvk/ci: tighten job timeouts a little bit |
| - ci-tron: drop default timeouts |
| - nvk/ci: drop CI_TRON_TIMEOUT__BOOT_CYCLE__MINUTES |
| - turnip/ci: drop CI_TRON_TIMEOUT__BOOT_CYCLE__MINUTES |
| - radv/ci: move the timeout from polaris10 job template to polaris10 job |
| - radv/ci: drop unnecessary CI_TRON_TIMEOUT__BOOT_CYCLE__MINUTES |
| - radeonsi/ci: document regression |
| - radeonsi/ci: document flakes |
| - radv/ci: document flakes |
| - broadcom/ci: document fixed tests |
| - lavapipe/ci: replace large (and growing) list of flakes with a regex |
| - lavapipe/ci: document flakes |
| - docs/linkcheck: print summary of problems found |
| - docs/conf.py: fix python formatting (whitespace changes only) |
| - docs/linkcheck: ignore a couple more domains blocking the linkcheck user-agent |
| - docs/linkcheck: ignore loging wall for broadcom |
| - docs/ci: allow running linkcheck in pre-merge pipelines |
| - docs/release-calendar: add 25.2 branchpoint & rc dates |
| - docs: update calendar for 25.0.6 |
| - docs: add release notes for 25.0.6 |
| - docs: add sha sum for 25.0.6 |
| - docs/release-calendar: fix release numbers |
| - r300/ci: switch radeon.ko jobs to common kernel (6.13.7) |
| - ci/ci-tron: replace crude dump of env vars with reproducible job script |
| - vmware/ci: move jobs to nightly until farm is stabilized |
| - ci: clear S3_JWT_FILE_SCRIPT as it also contains the S3_JWT |
| - ci: avoid unsetting S3_JWT too early in container image builds |
| - ci/container: fix whitespace for yamllint |
| - ci: uprev vkd3d |
| - docs: update calendar for 25.1.1 |
| - docs: add release notes for 25.1.1 |
| - docs: add sha sum for 25.1.1 |
| - radv/ci: document fixed angle test |
| - radeonsi/ci: document fixed piglit tests |
| - r300/ci: document fixed test |
| - amd/ci: document radeonsi flakes |
| - amd/ci: document radv flakes |
| - rpi/ci: document flakes |
| - rpi/ci: skip timing out test |
| - lavapipe/ci: document flakes |
| - lavapipe/ci: skip timing out tests |
| - delete xa leftovers |
| - amd/ci: uprev amdgpu.ko jobs to kernel 6.14.8 |
| - ci: uprev vkd3d |
| - ci/vkd3d: collapse section by default |
| - gallium: drop dynamic pipe-loader leftovers |
| - docs: update calendar for 25.0.7 |
| - docs: add release notes for 25.0.7 |
| - docs: add sha sum for 25.0.7 |
| - ci: drop unneeded printing of \`pass`/\`fail` alongside the exit_code |
| - svga/ci: fix job stages |
| - svga/ci: fix indentation |
| - ci: improve names of push-to-upstream-{default,staging}-branch pipelines |
| - ci: improve name of fork pipelines |
| - ci: improve name of merge-request pipelines |
| - ci: improve explanation of the pre-merge/merge-request pipelines |
| - ci: inline the only anchor reference that was on its own line |
| - docs: update calendar for 25.1.2 |
| - docs: add release notes for 25.1.2 |
| - docs: add sha sum for 25.1.2 |
| - docs: update calendar for 25.1.3 |
| - docs: add release notes for 25.1.3 |
| - docs: add sha sum for 25.1.3 |
| - radv/ci: document flakes seen over the last week |
| - radeonsi/ci: document flakes seen over the last week |
| - broadcom/ci: document flakes seen over the last week |
| - lavapipe/ci: document flakes seen over the last week |
| - nvk/ci: add missing VKD3D_PROTON_TAG |
| - turnip/ci: add missing VKD3D_PROTON_TAG |
| - lavapipe/ci: add missing VKD3D_PROTON_TAG |
| - ci/piglit: avoid copying /piglit across filesystems when unused |
| - radeonsi/ci: update glcts expectations |
| - etnaviv/ci: document new failure |
| - freedreno/ci: document fixed & new failures |
| - nvk+zink/ci: document new failures |
| - lavapipe/ci: document new failure |
| - lavapipe/ci: skip timing out tests |
| - radeonsi/ci: document flakes seen recently |
| - radv/ci: document flakes seen recently |
| - broadcom/ci: document flakes seen recently |
| - freedreno/ci: document flakes seen recently |
| - lavapipe/ci: document flakes seen recently |
| - radv+zink/ci: document flakes seen recently |
| - broadcom/ci: catch more similar flakes |
| - lavapipe/ci: catch all the dEQP-VK.synchronization2.timeline_semaphore.* flakes |
| - ci_run_n_monitor: fix padding in links |
| - meson: drop unnecessary \`-D legacy-x11=none` |
| - docs: update calendar for 25.1.4 |
| - docs: add release notes for 25.1.4 |
| - docs: add sha sum for 25.1.4 |
| - hasvk: only run tests on x86 (32 & 64 bit) |
| - turnip/ci: document vkd3d regression |
| - radeonsi/ci: reduce coverage of radeonsi-vangogh-glcts |
| - meson: include VkLayer_INTEL_nullhw in the devenv |
| - meson: include VkLayer_MESA_device_select in the devenv |
| - meson: include VkLayer_MESA_overlay in the devenv |
| - egl/meson: simplify glvnd json configuration |
| - egl/meson: reuse existing glvnd_vendor_name variable |
| - egl/meson: move glvnd config file after libEGL |
| - egl/meson: allow passing a full path to the glvnd json file |
| - egl/meson: include glvnd in the devenv |
| - intel/compiler tests: rewrite subprocess handling in run-test.py |
| - intel/compiler tests: fix path-to-string conversion |
| - intel/compiler tests: fix "is there something after the options" check |
| - intel/compiler tests: fix variable type for getopt_long() return value |
| - lavapipe/ci: fix flakes regex |
| - lavapipe/ci: remove duplicate flakes line |
| - lavapipe/ci: document recent flakes |
| - radeonsi/ci: document recent flakes |
| - radv/ci: document recent flakes |
| - broadcom/ci: document recent flakes |
| - zink/ci: document recent flakes |
| - r300/ci: update expectations and document recent flakes |
| - lavapipe/ci: drop redundant \`stage:` |
| - lavapipe/ci: drop unneeded MESA_VK_IGNORE_CONFORMANCE_WARNING |
| - lavapipe/ci: promote vkd3d job from nightly to merge pipelines |
| - lavapipe/ci: drop redundant .lavapipe-rules in .lavapipe-test |
| - lavapipe/ci: move job timeout definition to jobs |
| - lavapipe/ci: explicitly state in the job name which test is being run |
| - docs: drop outdated line about meson version |
| - meson: drop error for meson < 1.3 since we require 1.3+ already |
| - meson: bump required meson version to 1.4 |
| - meson: allow "venus without virgl" builds |
| - meson: fix vdrm inclusion logic |
| - virtio: move inc_virtio up one folder |
| - meson: split subdir for virtio/vdrm and virtio/vulkan |
| - freedreno/drm: fix libvdrm inclusion logic |
| - radv/ci: sort previously added flakes |
| - radeonsi/ci: document flakes |
| - radv/ci: document flakes |
| - nvk/ci: document flakes |
| - zink+radv/ci: document flakes |
| - zink+nvk/ci: document flakes |
| - zink+lavapipe/ci: document flakes |
| - broadcom/ci: document flakes |
| - ci/build: use !reference to build scripts instead of yaml anchors |
| - ci: make sure that sanity job must pass before running other jobs |
| - ci: fix yamllint warnings about whitespace |
| - broadcom/ci: move job timeout definitions to jobs |
| - ci-tron: add ci-tron-arm64-test-{asan,ubsan}-{gl,vk} job templates |
| - docs: update calendar for 25.1.5 |
| - docs: add release notes for 25.1.5 |
| - docs: add sha sum for 25.1.5 |
| - bin/symbols-check: add missing newline before function |
| - bin/symbols-check: fix fields length condition before accessing fields |
| - bin/symbols-check: ignore \`nm` lines that don't have a symbol name |
| - bin/symbols-check: ignore version of platform symbols |
| - bin/symbols-check: sort platform symbols |
| - bin/symbols-check: document new platform symbols exported since symbols-check was broken |
| - meson: only run symbols-check if \`nm` is available |
| - freedreno/ci: fix a750-piglit-cl rules |
| - freedreno: rename \`.gitlab-ci` folder to \`tests` |
| - panfrost/meson: drop invalid C-only -Wno-override-init from C++ args |
| - docs/perfetto: update link to documentation |
| - broadcom/ci: add definitions for ci-tron jobs |
| - broadcom/ci: add ci-tron variant of each job |
| - broadcom/ci: skip tests that sometimes time out |
| - panfrost/meson: drop invalid C-only -Wno-override-init from C++ args |
| - crocus/ci: disable jobs on anholt farm |
| - ci: document anholt farm as decommissioned |
| - hasvk/ci: disable jobs on anholt farm |
| - nvk/ci: document usual run time for ad106 job |
| - vc4/ci: sort dEQP-GLES2 flakes |
| - broadcom/ci: document recent flakes |
| - radeonsi/ci: sort stoney flakes |
| - radeonsi/ci: document recent flakes |
| - radv/ci: document recent flakes |
| - zink+radv/ci: document recent flakes |
| - zink+lavapipe/ci: document recent flakes |
| - ci: uprev apitrace |
| - docs: improve "backport MR" instructions |
| - etnaviv/ci: document fixed tests |
| - zink+nvk/ci: document crash->fail change from !36031 |
| - zink+radv/ci: document new failures |
| - lavapipe/ci: document one fixed and two new failures |
| - radeonsi/ci: document recent flakes |
| - radv/ci: document recent flakes |
| - broadcom/ci: document recent flakes |
| - zink+radv/ci: document recent flakes |
| - llvmpipe/ci: document recent flakes |
| - lavapipe/ci: document recent flakes |
| - turnip+zink/ci: add piglit to the a750 job |
| - ci: fix priority of nightly pipeline & staging pipeline jobs |
| - nvk/ci: document vkd3d regression |
| - zink+nvk/ci: fix mistake in yesterday's crash->fail improvement update |
| - freedreno/ci: document recent flakes |
| - zink+radv/ci: document recent flakes |
| - ci/piglit: drop LAVA variable from non-LAVA jobs |
| - ci/piglit: provide default device name |
| - ci/piglit: provide default results file name |
| - ci/lava: inherit .piglit-traces-test in .lava-piglit-traces and deduplicate configs |
| - ci: move \`script:` override from .piglit-traces-test to llvmpipe-traces |
| - VERSION: bump for 25.2.0-rc1 |
| - .pick_status.json: Update to f4166ab1e1d6dec2355c0d861c75319248d4de08 |
| - .pick_status.json: Mark 80be9153f9debe9021c425c3b10de7804e4a0d7e as denominated |
| - ci: fix rustfmt job rules |
| - ci: fix rustfmt job rules (one more case) |
| - VERSION: bump for 25.2.0-rc2 |
| - .pick_status.json: Update to f4436d606e0ae76ef6092b850cf79fa3e014f61e |
| - .pick_status.json: Mark f4596e78283aa9124ad3358d488167c8a72d6176 as denominated |
| - VERSION: bump for 25.2.0-rc3 |
| - .pick_status.json: Update to 20dab5f819f7016b8d7f4e88927855a0e5ff7a61 |
| - meson: fix VkLayer_MESA_device_select in the devenv |
| |
| Eric R. Smith (14): |
| |
| - pan/layout: Use _{B,el,px,sb} suffixes to clarify things |
| - panfrost, panvk: fix G31 use of SHADER_MODE_EARLY_ZS_ALWAYS |
| - panfrost, panvk: spread hierarchy mask bits out when max_levels < 8 |
| - util/format: fix some missing cases in util_format_get_plane_format |
| - pan/afbc: Teach AFBC about YUV formats |
| - panfrost: add support for AFBC modifiers on YUV formats |
| - util/format: add support for YU08 and YU10 formats |
| - gallium/st: Teach the state tracker about YU08 and YU10 formats |
| - dri2: Teach the DRI layer about YU08 and YU10 |
| - pan: Teach libpanfrost about YU08/Y010 |
| - panvk: add a driConf to force enable atomics in shaders |
| - panfrost: spill registers in SSA form |
| - panfrost: fix SSA register allocation |
| - panvk: use minimum attachment size for frame buffer size |
| |
| Erico Nunes (9): |
| |
| - lima: drop unused fields from lima_context_framebuffer |
| - lima: drop tiled_w and tiled_h from lima_surface |
| - lima: handle luminance and intensity pixel formats |
| - lima: move reload flags from surface to resource |
| - lima: drop util_framebuffer_init usage |
| - lima: drop lima_surface |
| - lima: fix array limit in texture mipmap descriptor |
| - lima: ppir: fix check for discard_block in optimization |
| - lima: ppir: fix store_output optimization for modifiers |
| |
| Erik Faye-Lund (82): |
| |
| - panvk: support vulkan 1.2 on v10+ |
| - panvk: re-enable KHR_shader_quad_control on v10+ |
| - docs/features: mark off missing panvk extensions |
| - pan/ci: add fragment_out flakes from t860 to t760 |
| - gallium/aux: move util_pipe_tex_to_tgsi_tex to u_blitter.c |
| - r600: ingest u_blitter_clear_buffer |
| - gallium/aux: fixup bad indent |
| - pan/ci: remove no longer failing test |
| - panvk: expose EXT_texel_buffer_alignment support |
| - mesa/main: remove non-existing function prototype |
| - panfrost: v4 does not support 16x msaa |
| - panfrost: do not try to use 4x4 tiles on v4 gpus |
| - panfrost: do not calculate max-msaa on v4 |
| - panfrost: re-align table |
| - panfrost: properly compute tile-buffer requirements |
| - panfrost: change tie-breaking rule for 16x MSAA |
| - panfrost: pass quirks through __VA_ARGS__ |
| - panfrost: add max_4x_msaa quirk |
| - util/format: add ASTC HDR formats |
| - util/format: add util_format_is_astc_hdr()-helper |
| - mesa/main: support astc_hdr formats |
| - gallium/st: enable KHR_texture_compression_astc_hdr when supported |
| - panfrost: correct texfeat-bit for ASTC LDR |
| - panfrost: set decode_hdr when needed |
| - pan/lib: wire up ASTC HDR formats |
| - mesa/main: fixup bad indent |
| - mesa/main: use _mesa_is_desktop_gl_compat-helper |
| - mesa/main: use _mesa_has_Foo-helpers |
| - mesa/main: do not allow non-gles1 enums in gles1 |
| - mesa/main: fix ARB_texture_float quirk |
| - mesa/main: clean up invalid pname-check |
| - mesa/main: do not allow desktop enums on gles |
| - mesa/main: fix texture view enum checks |
| - mesa/main: add _mesa_has_texture_multisample-helper |
| - mesa/main: correct error-checks for GL_TEXTURE_2D_MULTISAMPLE_ARRAY |
| - mesa/main: fix multisample enum checks |
| - mesa/main: fix error-handling for gles1/2 |
| - mesa/main: fix enum errors |
| - vulkan/format: map ASTC HDR formats |
| - panvk: add EXT_texture_compression_astc_hdr support |
| - panfrost: do not double-insert shader into hash-table |
| - panfrost: plug leak of modifier conversion shaders |
| - panfrost: remove surface create/destroy hooks |
| - lima: drop R16G16B16_FLOAT support |
| - lima: add driconf infrastructure |
| - lima: make fp16 render-targets opt-in with driconf |
| - upanfrost: make 128-bit opt-in with driconf on v4 |
| - docs/features: add GL_EXT_shader_realtime_clock |
| - Revert "mesa: limit number of error raised by invalid GL_TEXTURE_MAX_ANISOTROPY_EXT" |
| - mesa/main: introduce MAX_SAMPLES define |
| - mesa/main: search all the way to MAX_SAMPLES |
| - st/mesa: search for smallest supported sample-count |
| - mesa/main: correct internalformat query condition |
| - mesa/main: use _mesa_check_sample_count instead of open-coding error |
| - mesa/main: add missing error-check for samples |
| - mesa/main: more careful ext checking |
| - mesa/main: contain mutlisample advanced limitations |
| - panfrost: enable 8x and 16x msaa modes when supported |
| - meson: use literal false instead of string |
| - panfrost/ci: fix sorting of fails |
| - panfrost/ci: add new half-float fails |
| - pan/bi: rework memory barriers |
| - pan/bi: handle barriers with NONE scope |
| - panvk: optimize barriers |
| - panvk: enable KHR_vulkan_memory_model |
| - panvk: expose vulkan 1.3 support |
| - docs/features: mark vk 1.3 as complete on panvk/v10+ |
| - panfrost/ci: explain g52 fails |
| - panfrost/ci: add new 8x/16x msaa fails on t760 |
| - panfrost/ci: bump timeout for panfrost-t760-gles |
| - panfrost/ci: fix up msaa fails... again |
| - panfrost: disable 16x MSAA on t760 before r1p0 |
| - panvk: fixup bad indent |
| - panvk: pass flags to panvk_sampler_fill_desc |
| - panvk: implement VK_EXT_non_seamless_cube_map support |
| - st/pbo: use sized nir-types for download-path |
| - panfrost: limit sample_shading to bifrost and later |
| - mesa/st: do not check single-sampled for max_samples |
| - Revert "lima: make fp16 render-targets opt-in with driconf" |
| - Revert "upanfrost: make 128-bit opt-in with driconf on v4" |
| - pan/ci: remove non-existent flag from PAN_MESA_DEBUG |
| - docs/features: add missing panvk extension |
| |
| Eve (1): |
| |
| - radv: add RADV_PERFTEST option to turn off gtt spilling |
| |
| Faith Ekstrand (236): |
| |
| - nak/legalize: Take a RegFile in copy_alu_src_and_lower_fmod |
| - nak/legalize: Take a RegFile in copy_alu_src_and_lower_ineg() |
| - nak/sm70: Fix the bit74_75_ar_mod assert |
| - nak/sm20: Add initial SM20 encoding |
| - nak/sm20: Add float ops |
| - nak/sm20: Add integer ops |
| - nak: Lower 64-bit shifts in NIR on Kepler A and earlier |
| - nak: Record and set DOES_FP64 in the SPH |
| - nak/sm20: Add conversion ops |
| - nak/nir: Use Maxwell input interpolation for SM20+ |
| - nak/sm20: Add shader I/O ops |
| - nak/sm20: Add control-flow ops |
| - nak/nir: Use Kepler texture source ordering on SM30 |
| - nak/sm20: Add texture ops |
| - nak/sm20: Add more memory ops |
| - nak/sm20: Add subgroup ops |
| - nak/sm20: Add double ops |
| - nak/sm20: Implement OpBar |
| - nvk: Return an error for Kepler storage images instead of asserting |
| - nak/sm20: Improve folding of ffma and dfma |
| - nak/sm20: Use the correct index field for OpS2R |
| - nak: Handle OpFRnd in is_fp64() |
| - nak/sm20: Fix legalization of float source types |
| - nak/sm20: Use the immediates instead of rZ in OpShfl |
| - nvk: Maxwell+ is now conformant |
| - docs/nvk: Update conformance and hardware support information |
| - nak/sm20: Call copy_alu_src_if_fabs() first |
| - nak/sm20: Fix legalization of IAdd and IMul |
| - nak: Match on the SrcRef directly in Src::is_nonzero() |
| - nak/sm20: Use SrcRef::as_u32() |
| - nak: Take a SrcType in Src::as_u32() |
| - nak: Fold source modifiers in Src::as_u32() |
| - nak: Handle SrcType::F64 in Src::is_fneg_zero() |
| - nak: Use as_u32() directly in Src::is_fneg_zero() |
| - nak: Fold Src::fold_imm() into the legalization pass |
| - nak: Replace Src::new_zero() with a ZERO constant |
| - nak: Move SSAValue and friends to a new ssa_value.rs file |
| - nak: Print the % for SSA predicates |
| - nak: Set lower_pack_64_4x16 |
| - nak: Stop relying on nir_lower_pack |
| - nvk: Stop printing errors for invalid dma-buf image queries |
| - docs/nvk: Improve the NVK docs page |
| - docs/nvk: Add section about NVK+Zink |
| - docs/systems: Point people at the NVK page first |
| - nak: sm32: Add Texture ops |
| - nak: Improve documentaiton for SSAValue and SSARef |
| - nak: Improve documentation on RegFile |
| - nak: Improve documentation for PerRegFile |
| - nak: Improve documentation for RegRef |
| - nak: Fix some rusdoc warnings |
| - compiler/rust: Better document CFG |
| - nak/sm70: Increase the number of UGPRs on Blackwell+ to 80 |
| - nak/sm70: Properly encode ldc on Blackwell+ |
| - nak/hw_tests: Copy data stride and invocations to avoid cbuf sources |
| - nak/lower_copy: Implement copy from CBuf as ldc on Blackwell+ |
| - nak/copy_prop: Don't propagate cbufs into ALU on Blackwell+ |
| - nak/sm70: Fix bra offset encoding for for Hopper+ |
| - nak/sm70: Use rZ for the 3rd source of lea when .hi is not set |
| - nak/sm70: imnmx takes and returns more predicates on Blackwell+ |
| - nak/nir/lower_tex: Use nir_tex_instr_add_src() |
| - nak/sm70: Don't set a predicate destination on redg |
| - nak/sm20: Remove some unnecessary Option<> |
| - nak: Fix OpShf folding for shift >= 64 |
| - nak: Add HW tests for OpShr and OpShl |
| - nak: Reduce shift immediates instead of adding copies |
| - nak: Add a helper for reducing OpShfl lane and c immediates |
| - nak: Add a helper to reduce OpPrmt sel immediates |
| - nvk: Implement VK_EXT_zero_initialize_device_memory |
| - nak: Use Default::default() for BitSet constructors |
| - compiler/rust,nak: Rename BitSet::get() to contains() |
| - compiler/rust/bitset: Make BitSet a generic container type |
| - nak/liveness: Use an SSA value as the last_use HashMap key |
| - nak/liveness: Use typed bitsets |
| - nak/repair_ssa: Rename Phi to PhiTracker |
| - nak: Use op instead of phi when referring to OpPhiSrcs or OpPhiDsts |
| - nak: Add a Phi struct type |
| - nak/dce: Use BitSet for live phis and SSA values |
| - nvk: Disable R64_[US]INT formats when shaderImageInt64Atomics isn't supported |
| - nak/sm20: Don't allow 64-bit atomics |
| - nvk: 64-bit atomics exist on Kepler B+ |
| - nouveau/mme: Don't install the HW tests |
| - nak/sm20: Encode OpViLd instead of OpIsberd |
| - nak/sm20: Add encodings for OpLdSharedLock and OpStSCheckUnlock |
| - nak/sm20: Fix the encoding of fset |
| - nak/sm20: Fix a comple misplaced bits |
| - nvk: Allocate the correct VAB size on Kepler |
| - nak/sm20: Encode OpAL2P |
| - bitview: Implement SetFieldU64 for all BitViewMutable |
| - nak: Scalarize non-constant ald/ast on Kepler |
| - nvk: Reserve a sampler for TXF on Kepler |
| - nvk: Drop the union from nvk_buffer_view |
| - nil: Align linear image strides to the image alignment |
| - nouveau: Move bitview to a common rust util folder |
| - nvk: Lower images to addresses on Kepler |
| - nak/sm20: Encode surface address ops |
| - nak/sm20: Encode OpSuLdGa and OpSuStGa |
| - nvk: Use NAK by default on Kepler |
| - nvk: Don't disable features based on NVK_USE_NAK |
| - nvk: Remove codegen support |
| - nvk: Inline nvk_compile_nir_with_nak() |
| - nvk: Only pass texture handles to NAK |
| - nvk: Stop doubling root descriptors |
| - nak: Unconditionally call lower_io_to_temporaries in preprocess_nir |
| - nouveau: Move codegen back to the src/gallium/drivers/nouveau |
| - nvk/nvkmd: Add a force_mem_to_gart() helper |
| - nvk: Don't assert memory placement on import |
| - nvk: Only allow importing mappable dma-bufs to HOST_VISIBLE types |
| - nvk: Disallow GetMemoryFdProperties with OPAQUE_FD_BIT |
| - nak: Handle suld.constant on Turing and Volta in legalization |
| - nak/sm50: Use MemScope::CTA for constant image loads |
| - nak: Set MemOrder::Constant for CAN_REORDER image loads on all hardware |
| - nak: Set cache ops on global memory access on Kepler |
| - nak: Set cache ops on surface load/store ops |
| - nak/sm20: pixld has a predicate destination |
| - nvk: Don't use INVALIDATE_TEXTURE_DATA_CACHE_NO_WFI on Kepler |
| - nak: Plumb the ShaderModel through to Ld/StCacheOp::select() |
| - nak: Print cache ops on suldga and sustga |
| - nak: Handle MemScope::System on Kepler |
| - nak/sm50: Encode cache ops on Maxwell |
| - nak/sm50: Use ld.ci for constant loads |
| - nvk/lower_descriptors: Drop buffer_address_to_ldcx_handle() |
| - nvk: Rework non-uniform access lowering |
| - nvk: Run nir_opt_non_uniform_access for UBOs on Turing+ |
| - nvk: Claim shaderUniformBufferArrayNonUniformIndexingNative |
| - vulkan/pipeline: Hash null descriptor robustness info |
| - nak/sm70: Fix r2ur for Blackwell+ |
| - nak/qmd: Allow program addresses to be shifted |
| - nak/qmd: Add QMD version 5.0 for Blackwell+ |
| - nvk: Add a new arena data structure |
| - nvk/heap: Use an nvk_mem_arena |
| - nvk/descriptor_table: Add a couple new helpers |
| - nvk: Use an nvk_arena for nvk_descriptor_table |
| - nvk: Add an nvk_mem_stream struct |
| - nvk/upload_queue: Use an nvk_mem_stream internally |
| - nvk/queue: Rename submit_simple() to push() |
| - nvk: Use an nvk_mem_stream for nvk_queue_push() |
| - nvk: Set memory windows in engine init |
| - nvk: Don't set queue lost from nvk_queue_push() |
| - nvk: Drop nvk_buffer_address() |
| - nil: Rework the descriptor interface |
| - nil: Don't use Fermi bits in the Maxwell null descriptor |
| - nvk: Add an nvk_sampler_header struct and return it by value |
| - nir/lower_input_attachments: Stop assuming tex src indices |
| - nir/lower_input_attachments: Don't ignore tex coordinates |
| - nir: Add a new load_input_attachment_coord intrinsic |
| - nvk: Add a load_root_table() helper |
| - nvk: Switch to nir_intrinsic_load_input_attachment_coord |
| - nvk: Lower input attachments earlier |
| - vulkan: Add a vk_sampler_state struct |
| - vulkan/sampler: Add border color swizzle info to vk_sampler_state |
| - nvk: Use vk_sampler_state for nvk_sampler_get_header() |
| - vulkan: Drop vk_sampler_init() |
| - nir: Move nir_steal_tex_src() to nir.h |
| - nir: Add more tex_src helpers |
| - nvk: Use nir_tex_get/steal_src in nvk_nir_lower_descriptors() |
| - nil: Add more GOBType enums |
| - nil: Use GOBType::choose() to pick gob kinds for modifiers |
| - nil: Match on gob types in the tiled image copy code |
| - nil: Add new GOBTypes for blackwell+ |
| - nil/copy: Add a simpler CopyGOBLines trait |
| - nil: Improve Tiling and GOBType documentation |
| - nil: Fix the GOBType::TuringColor2D description |
| - nil: Rename a few GOB types |
| - nak/hw_tests: Align the shared memory window to 4GB on Hopper+ |
| - nvk/bitview: Add a new ToFieldBits trait |
| - nvk/bitview: Add a set_field2() trait and method |
| - nak: Use the new set_field2() helper for a few things |
| - nil: Use set_field2() for a few descriptor cases |
| - nak: Use set_ureg_src() for Blackwell texture offsets and handles |
| - nak: Rework printing of TexLodMode |
| - nak: Add better is_blackwell*() helpers |
| - nak: Tell NIR to lower invalid implicit LODs |
| - nak: Add a new TexDerivMode enum and plumb it through |
| - nak: Add a TexDerivMode to OpFSwz and OpFSwzAdd |
| - nak: Use fswzadd.ndv on Blackwell+ |
| - nil: Reorder the checks in GOBType::choose() again |
| - nil: Add a new GOBType for Z24 on Blackwell+ |
| - nil: Remove some unused renames from cbindgen.toml |
| - nvk/copy: Set the gob_kind for tiled copies on Blackwell+ |
| - nil/copy: Base swizzling on the per-plane pipe_format |
| - nvk: Use util_format_stencil_only() |
| - nvk: Allow sparse binding of YCbCr images |
| - nvk: Allocate separate planes for depth and stencil on Blackwell+ |
| - nak: Don't se .yld or .reuse_mask on Blackwell+ |
| - nak: I/O offsets are unsigned when combined with RZ |
| - nvk: Pass the pipe_format to copy_remap_format() |
| - nak: Surface handles are not allowed to be rZ |
| - nak: Encode ureg handle sources for surface ops on Blackwell |
| - ci: Bump libwayland to 1.24.0 |
| - ci: Build weston 14.0.1 from source |
| - ci: Add a weston.ini |
| - ci: Upgrade XWayland to 24.1.8 |
| - zink: Clean up file descriptor closing in export_dmabuf_semaphore() |
| - zink: Add a resource_get_dma_buf() helper |
| - glx: Drop LIBGL_DRI3_DISABLE |
| - zink/kopper: Don't recycle unused acquire semaphores |
| - docs: Improve the docs for LIBGL_KOPPER_DRI2 |
| - loader: Report DRI_PRIME errors earlier |
| - kopper: Dead code get_image() and get_image_shm() |
| - egl,glx,kopper: Switch back to DRIkopperLoaderExtension::GetDrawableInfo |
| - egl: Add a shared kopper_pbuffer_loader_extension |
| - egl/x11: Stop mixing kopper and swrast extensions |
| - egl/x11: Always select Zink when requested |
| - egl/x11: Move the multibuffers check into dri2_initialize_x11() |
| - egl/x11: Rework LIBGL_KOPPER_DRI2 checks |
| - egl/x11: Only use kopper vtables when kopper is enabled |
| - egl/x11: Add a separate dri2_initialize_x11_kopper helper |
| - egl/wayland: Move kopper higher up the file |
| - egl/wayland: Only use kopper calbacks if kopper is enabled |
| - egl/wayland: Use the DRM path for Zink when !ForceSoftware |
| - egl/wayland: Delete the kopper checks from the kopper paths |
| - egl/surfaceless: Select kopper based on dri2_dpy->kopper |
| - egl: Rename dri2_detect_swrast() and also detect kopper |
| - egl: Choose kopper based on driver name rather than Options.Zink |
| - egl/x11: Allow the loader to choose zink |
| - gallium/dri3: Drop loader_dri3_buffer::size |
| - vulkan/wsi/x11: Refuse to connect to thread-unsafe Displays |
| - nak/sm20: TexDepBar::textures_left is 6 bits |
| - nvk: Kepler is now Vulkan 1.2 conformant |
| - nouveau: Import the Blackwell 3D class headers from NVIDIA |
| - meson: Require libX11 >= 1.8 |
| - meson: Only build src/x11 if with_platform_x11 |
| - x11: Move loader_x11.c/h to x11_dri3.c/h |
| - x11: Add an x11_xlib_display_is_thread_safe() helper |
| - glx: Refuse to initialize if Display is not thread-safe |
| - egl/x11: Refuse to initialize if Display isn't thread-safe |
| - dri: Get rid of __DRIbackgroundCallableExtension |
| - nak: Wire up the mma predicate on Hopper+ |
| - nir/instr_set: Rework tex instr hash/compare |
| - nvk: Bump the conformance version to 1.4.3 |
| - nvk: Add an nvk_is_conformant() helper |
| - loader: Ignore NOUVEAU_USE_ZINK on Hopper+ |
| - vulkan: Rename a bunch of vk_sync_timeline helpers |
| - vulkan: Hold a reference to pending vk_sync_timeline_points |
| - vulkan/wsi/x11: Handle VK_NOT_READY in AcquireNextImage() |
| - nvk: Blackwell is now Vulkan 1.4 conformant |
| |
| Felix DeGrood (2): |
| |
| - anv: Do conservative oversubscription of pages to 2MB |
| - intel/tools: add intel_measure.py |
| |
| Feng Jiang (2): |
| |
| - virgl: Make max_hw_atomic_counter_buffers less than PIPE_MAX_HW_ATOMIC_BUFFERS |
| - mesa/st: Fix potential array out-of-bounds in st_bind_hw_atomic_buffers() |
| |
| Frank Binns (1): |
| |
| - pvr: add missing refcounting for descriptor set layouts |
| |
| GKraats (1): |
| |
| - EGL: legacy-x11=dri2 should support hardware driver |
| |
| Ganesh Belgur Ramachandra (1): |
| |
| - radeonsi: change do_update_shaders boolean to a bitmask |
| |
| Georg Lehmann (134): |
| |
| - nir/opt_algebraic: generalize fmax(fadd(a, b), 0.0) to fsat by not requiring fneg |
| - nir/opt_algebraic: optimize fmax(ffma(a, b, c), 0.0) to fsat |
| - nir/opt_algebraic: turn exact fmin(1.0, a) into fsat if a is not NaN and not negative |
| - nir: add range analysis for ffmaz |
| - nir: add is_a_number analysis for ffma |
| - nir: improve fadd is_a_number analysis by using the range |
| - nir/opt_algebraic: disable fsat(a + 1.0) opt if a can be NaN |
| - aco: set opsel_hi to 1 for WMMA |
| - nir,amd: add neg_lo/hi modifiers to cmat_matmul_amd |
| - radv: apply fneg/fabs modifiers to wmma |
| - aco/insert_NOPs: allow WMMA with constant C matrix |
| - aco/isel: create WMMA with constant C matrix if possible |
| - aco/insert_exec: only restore wqm mask after control flow if necessary |
| - aco/insert_exec: reset temporary when recreating wqm mask from exact mask |
| - radv/nir/lower_cmat: use radv_nir_cmat_bits consistently |
| - radv/nir/lower_cmat: use cmat_mul instead of duplicating hw details for type conversion |
| - radv/nir/lower_cmat: tightly pack 8bit gfx11 acc matrix |
| - aco/insert_exec: disable empty quads when leaving divergent control, even if not top level |
| - radv,aco,nir: keep the A and B base type for cmat_muladd_amd |
| - aco: support bf16 wmma |
| - radv/nir/opt_cmat: support bfloat16 |
| - radv/nir/lower_cmat: handle bf16 conversions |
| - nir: add an option to lower bf2f and f2bf |
| - ac/nir: set lower_bfloat16_conversions |
| - nir: add bfdot2_bfadd and use it for lowering bfdot if supported |
| - aco: support nir_op_bfdot2_bfadd |
| - ac/llvm: support nir_op_bfdot2_bfadd |
| - ac/nir: enable nir_op_bfdot2_bfadd |
| - radv/gfx12+: enable VK_KHR_shader_bfloat16 |
| - aco/optimizer: remove label_vopc |
| - aco/optimizer: remove label_vop3p |
| - aco/optimizer: remove label_add_sub |
| - aco/optimizer: remove label_dpp8 and label_dpp16 |
| - aco/optimizer: remove label_f2f32 |
| - aco/optimizer: remove label_minmax |
| - aco/optimizer: remove label_vec |
| - aco/optimizer: remove label_split |
| - aco/optimizer: remove label_bitwise |
| - aco/optimizer: remove label_mul |
| - aco/optimizer: add semantic aliases for info.instr |
| - aco/optimizer: store parent_instr for all temps |
| - aco: replace novalidateir with novalidate debug option |
| - aco/optimizer: validate context data |
| - aco/optimizer: apply f2f32 without label_usedef |
| - aco/optimizer: label p_extract as usedef |
| - aco/optimizer: apply dpp without label_usedef |
| - aco/optimizer: create fma without label_usedef |
| - aco/optimizer: remove label_usedef |
| - aco/optimizer: remove instr_usedef_labels completely |
| - gallium: remove left over clover files |
| - nir/opt_algebraic: some bitfield_select optimizations |
| - radeonsi: always lower alu bit sizes |
| - aco: add type information for operands/definitions |
| - aco: swap operands without instructions |
| - aco/optimizer: optimize packed fneg with negative constant |
| - aco/optimizer: only use get_operand_size for alu |
| - aco: replace get_operand_size with get_operand_type |
| - aco: don't use constant_bits when not applying constants |
| - aco: assume sram ecc is enabled on Vega20 |
| - radeonsi: remove unscaled poly offset support |
| - zink: remove unscaled poly offset support |
| - r600: remove unscaled poly offset support |
| - nvc0: remove unscaled poly offset support |
| - gallium: remove polygon_offset_units_unscaled pipe cap |
| - gallium: remove tgsi_div pipe cap |
| - docs/gallium: remove some nine mentions |
| - aco: clamp exponent of 16bit ldexp |
| - radv: expose scalarBlockLayout on GFX6 |
| - nir/opt_if: don't replace constant uses with other uniform values |
| - nir/opt_if: limit rewrite_uniform_uses iand recursion |
| - radv: don't accidentally expose samplerFilterMinmax through Vulkan 1.2 |
| - aco: do not use v_cvt_pk_u8_f32 for f2u8 |
| - aco: remove p_v_cvt_pk_u8_f32 |
| - aco: add a readme entry for v_pk_cvt_u8_f32 |
| - spirv: use feq for OpIsInf |
| - vulkan: update headers/xml for 1.4.318 |
| - radv/nir/lower_cmat: use common matrix layout on gfx12 |
| - nir/opt_intrinsic: fix inclusive scan rewrite with multiple uses |
| - aco: allow nir divergence to be printed again |
| - radv: dump NIR for executable info after nir_to_asm |
| - aco: optimize boolean phi with empty else block |
| - compiler: add float8 glsl types |
| - util: add float8 conversion functions |
| - nir: add float8 conversion opcodes |
| - spirv: vtn_has_decoration helper |
| - spirv,nir: emit saturating float8 cmat convert |
| - spirv: support float8 conversions |
| - spirv: create float8 types |
| - spirv: support float8 spec constant op |
| - spirv: support float8 capabilities |
| - aco: emit float8 wmma |
| - aco/lower_to_hw: support saturating fp8 conversions |
| - aco: select fp32 to float8 conversions |
| - nir,aco: optimize FP16_OFVL pattern created by vkd3d-proton |
| - aco/isel: fix get_alu_src with 8bit vec2 source |
| - aco: select float8 to fp32 conversions |
| - radv: vectorize float8 conversions |
| - radv/nir/lower_cmat: handle float8 conversions |
| - radv: expose VK_EXT_shader_float8 |
| - nir: add cmat_transpose |
| - spirv: implement CooperativeMatrixConversionsNV |
| - radv/nir/lower_cmat: implement use conversions/transpose |
| - radv: advertise VK_NV_cooperative_matrix2/cooperativeMatrixConversions behind an env var |
| - radv/ci: test VK_NV_cooperative_matrix2 |
| - nir/loop_analyze: always consider comparisions between induction var and constant free |
| - nir/loop_analyze: handle vector selections properly |
| - nir/loop_analyze: consider movs/vecs free |
| - radv/nir/lower_cmat: fix gfx11 B->ACC conversion |
| - radv/nir/lower_cmat: share cmat_load/cmat_store code |
| - radv/nir/lower_cmat: set optimal load/store alignment |
| - radv/nir/lower_cmat: use nir_src_as_deref |
| - nir/shrink_vectors: shrink larger vectors too |
| - nir/opt_shrink_vectors: also split vecs into distinct smaller vecs if possible |
| - radv: keep fp8 conversions vectorized |
| - aco/optimizer: generalize p_create_vector of split vector opt |
| - nir/opt_sink: don't assume moving conversion can't increase register pressure |
| - nir,radv: add an option to not move 8/16bit vecs |
| - nir/opt_algebraic: optimize 16bit vec2 comparison followed by b2i16 using usub_sat |
| - aco/isel: implement 16bit vec2 shifts |
| - aco: remove unused swap_srcs from emit_vop3p_instruction |
| - aco/vn: remove dead instructions early |
| - radv/nir/lower_cmat: use explicit shift when calculating gfx12 wave64 layout |
| - aco: add a dedicated pass for better float MODE insertion |
| - aco: only insert fp mode when needed |
| - ac/llvm: support vec2 extract |
| - ac/nir: lower uniform extract_i8/u8 to 32bit |
| - aco: vectorize 16bit extracts |
| - aco: vectorize conversions from 8bit to 16bit |
| - radv/nir/lower_cmat: use v_permlanex16_b32 instead of ds_swizzle_b32 for GFX11 ACC->B |
| - radv/nir/lower_cmat: convert matrix use in smaller type |
| - nir,aco: add byte_perm_amd |
| - radv/nir/lower_cmat: vectorize GFX11 ACC -> B conversion |
| - radv/nir/lower_cmat: vectorize GFX11 B -> ACC conversion |
| - nir/opt_remove_phis: skip unreachable phis |
| |
| Gert Wollny (6): |
| |
| - r600/sfn: dump the lowered shader when translation fails |
| - mesa/st: Re-instate test for shader_clock cap |
| - r600: Correct nir_indirect_supported_mask |
| - r600: remove the use of separate nir options for fs |
| - r600/sfn: Add support for indirect VS input read |
| - r600/sfn: catch remaining indirect inputs when lowering GS input arrays |
| |
| Gorazd Sumkovski (1): |
| |
| - panfrost: Fix incorrect condition in assert |
| |
| Guilherme Gallo (34): |
| |
| - ci: Extract target job handling in CI monitor script |
| - bin/ci: crnm: Improve job enabling robustness |
| - bin/ci: crnm: Improve timer display formatting |
| - bin/ci: crnm: Fix job duration calculation |
| - ci: bump apitrace version |
| - ci: Update build-apitrace.sh header with the right tag |
| - ci: Update setup-rootfs.sh for test-* containers |
| - ci: Keep important packages for rootfs |
| - ci: Add rootfs export script |
| - ci/lava: Improve timeout estimation logic for case/suite runs |
| - ci/lava: Parametrize message burst length on unit tests |
| - ci/lava: make overlay->compression optional |
| - bin/ci: crnm: Sanitize n_colums value |
| - ci/panfrost: Reuse file list YAML anchors |
| - ci/build: Remove CPP_ARGS var duplication |
| - ci/lava: SSH tweaks |
| - ci/android: Add gitlab section for logs uploads |
| - ci/android: disable errexit in trap function |
| - ci/android: reduce launch_cvd memory limit to 4GB by default |
| - ci/lava: Fix type hint errors in GitlabSection |
| - ci/lava: Style fixes in test_lava_log |
| - ci/lava: Don't fail if the section times mismatches |
| - ci/lava: Update image tag to apply the changes |
| - ci/turnip: Update a couple of flaky tests on a660-vk |
| - ci: Simplify filter_env_vars using indirect expansion |
| - ci/lava: Avoid eval when generating env script |
| - ci/android: export container for DUTs |
| - ci/virtio: Add an Android Venus job on Intel Comet Lake |
| - ci/android: Store stripped CTS on S3 |
| - ci/android: Disable zipbomb detection for CTS |
| - ci/android: Use structured tag for Android CTS version |
| - ci/android: Add a job to run CTS on Intel CML |
| - ci/android: Move ANDROID_CTS_MODULES to build script |
| - ci/android: Add dEQP module for CML cuttlefish test |
| |
| Gurchetan Singh (30): |
| |
| - gfxstream: make sure by default descriptor is negative |
| - gfxstream: add VulkanMapper |
| - gfxstream: get rid of logspam in virtualized case |
| - gfxstream: fix missing include |
| - gfxstream: fix suballocation logic using u_mm |
| - gfxstream: add virgl_hw.h in VirtGpu.h header |
| - gfxstream: fix opening virtgpu DRM |
| - gfxstream: fix virtio-gpu on Goldfish interactions |
| - subprojects: fix missing space between colon and license string |
| - subprojects: update quote to 1.0.35 |
| - subprojects: update syn to 2.0.87 |
| - subprojects: add more syn features |
| - subprojects: add cfg-if |
| - subprojects: add bitflags |
| - subprojects: add libc |
| - subprojects: add errno |
| - subprojects: add linux-raw-sys |
| - subprojects: add rustix |
| - subprojects: add log |
| - subprojects: add thiserror |
| - subprojects: add remain |
| - subprojects: add zerocopy |
| - mesa: import mesa3d_util |
| - mesa: import mesa3d_protocols |
| - mesa: import virtgpu kumquat |
| - mesa: import virtgpu_kumquat_ffi |
| - mesa: add option to enable virtgpu_kumquat FFI for gfxstream |
| - gfxstream: proper depedendences in non virtgpu-kumquat case |
| - util: rust: update to rustix 1.0.7 |
| - mesa: subprojects: remove linux-raw-sys backend and use libc |
| |
| Han, Szu Chih (2): |
| |
| - amd/vpelib: add VPE_TF_SRGB |
| - amd/vpelib: add TF_BT709 |
| |
| Hans-Kristian Arntzen (3): |
| |
| - radv: Consider that DGC might need shader reads of predicated data. |
| - radv: Timestamps are not valid on dedicated sparse queue. |
| - ac/nir: Avoid 0/0 when computing texel buffer size on Polaris. |
| |
| Hoe Hao Cheng (9): |
| |
| - zink: fix win32 detection in zink_device_info |
| - zink: make zink_device_info look for KHR versions of promoted exts |
| - zink: apply core functions to extension functions |
| - zink: apply EXT-suffixed functions to their unsuffixed counterpart |
| - zink/codegen: add indentation to the mako templates |
| - zink/codegen: support double-loading dynamic properties arrays |
| - zink: remove fixup_driver_props |
| - zink/codegen: check for core promotion using the latest registry entry |
| - zink/codegen: init properties struct fully for double-loaded extensions |
| |
| Hyunjun Ko (9): |
| |
| - vulkan/video: Fix wrong parsing for H265 decoding |
| - anv: Set tc/beta offset according to the flag from PPS. |
| - anv: Always allocate cdf tables when independent profiles provided |
| - anv: Allocate MV buffers enough for AV1 decoding. |
| - anv: Fix to set CDEF filter flag correctly. |
| - intel/genxml: fix HCP_VP9 commands |
| - anv: add default vp9 tables from FFMpeg and intel-vaapi-driver |
| - anv: Initial support for VP9 decoding |
| - anv: enable the KHR_video_decode_vp9 extension |
| |
| Iago Toral Quiroga (4): |
| |
| - frontend/dri: don't call set_damage_region with a null resource |
| - v3d: fix incorrect resource unref |
| - broadcom/compiler: handle moving last ubo load in the block correctly |
| - v3dv: switch to using nir_intrinsic_load_input_attachment_coord |
| |
| Ian Romanick (19): |
| |
| - brw/algebraic: Greatly simplify brw_opt_constant_fold_instruction |
| - brw/cmod: Delete some stale comment text |
| - brw/algebraic: Convert some NOT to MOV |
| - brw/cmod: Remove special handling of NOT |
| - brw/cmod: Fix some errors when propagating from CMP to ADD.SAT |
| - brw/cmod: Don't propagate from CMP to possible Inf + (-Inf) |
| - brw/cmod: Allow integer CMP to ADD propagation only for Z and NZ |
| - brw/reg_allocate: Optimize spill offset calculation using more SIMD8 |
| - brw/reg_allocate: Optimize spill offset calculation using integer MAD |
| - nir/algebraic: Optimize some open-coded extract_i8 |
| - nir/algebraic: Generalize an existing bfi(a, 0, ...) pattern |
| - brw: Fully write temporary destinations |
| - brw/nir: Use nir_opt_reassociate_matrix_mul |
| - brw: Only apply GRF 127 send workaround to Gfx9 |
| - brw/reg_allocate: Require SIMD32 for destination / source interference on Xe2 |
| - brw/reg_allocate: Only add interference for the source with the hazard |
| - brw/reg_allocate: Adjust source / destination hazard conditions for broadcast |
| - brw/reg_allocate: Check source / destination hazard for all larger SIMD |
| - nir/print: Don't segfault checking has_debug_info |
| |
| Icenowy Zheng (4): |
| |
| - zink: Do not use demote on IMG blobs |
| - zink: don't assert geometryShader for IMG proprietary driver |
| - zink: reject IMG blob < 24.1\@6554834 unless enforced |
| - Revert "zink: reject Imagination proprietary driver w/o geometryShader" |
| |
| Iván Briano (14): |
| |
| - anv: expose promoted KHR_depth_clamp_zero_one |
| - brw: make HALT instruction act as barrier in new CSE pass |
| - anv, hasvk: ignore QFOT if both src and dst queue families are equal |
| - anv: vkCmdTraceRays* are not covered by conditional rendering |
| - intel/genxml: update some instructions for Xe2+ |
| - brw: implement load_input_vertex intrinsic |
| - brw: check if the FS needs vertex_attributes_bypass to be set |
| - brw/anv: add provoking vertex to fs_msaa_flags |
| - brw: add lowering passes for FS barycentric inputs |
| - anv: set HW state for fragment shader barycentric |
| - anv: enable VK_KHR_fragment_shader_barycentric |
| - anv: don't report custom sample locations for sample count 1 |
| - anv: move view_usage check to before setting the protected bit on it |
| - intel: Re-disable ray tracing on 32 bits |
| |
| James Price (1): |
| |
| - spirv: Fix cooperative matrix in OpVariable initializer |
| |
| Janne Grunau (7): |
| |
| - venus: Do not use instance pointer before NULL check |
| - venus: virtgpu: Require stable wire format |
| - asahi: build asahi_clc for -Dtools=asahi |
| - asahi: Drop unnecessary idep_mesaclc dependency |
| - panfrost: build panfrost_compile for -Dtools=panfrost |
| - ci: Switch cross-builds to '-D tools=panfrost' |
| - gallium/dril: Add entrypoint for apple (asahi) kms driver |
| |
| Jason Macnak (12): |
| |
| - gfxstream: Update to new logging |
| - gfxstream: Update iostream header |
| - gfxstream: Update codegen to use gfxstream_common_base |
| - gfxstream: Update codegen after many aemu -> gfxstream changes |
| - gfxstream: Update codegen for aemu -> gfxstream stream interface |
| - gfxstream: Remove old include |
| - gfxstream: Update gfxstreaml vk xml location |
| - gfxstream: Update GfxApiLogger namespace and header file |
| - gfxstream: Update codegen license |
| - gfxstream: use VkSnapshotApiCallHandle outside of VkReconstruction |
| - gfxstream: Fix strict aliasing violations in VkDecoder |
| - gfxstream: Update snapshot decoder to handle imageless framebuffer |
| |
| Jayanth Vutukuri (1): |
| |
| - Add libzstd static library. |
| |
| Jesse Natalie (21): |
| |
| - microsoft/compiler: Force load_output => undef in tess_ctrl main func |
| - d3d12: Add tc memory throttles |
| - d3d12: Minor fixes to residency algorithm when eviction is needed |
| - winsys/d3d12: Support no-alpha formats through the DXGI swapchain path |
| - d3d12: Handle sampler view creation on B8G8R8X8 |
| - d3d12: Handle a null threaded context |
| - nir_gather_output_deps: Fix incorrect enum in switch |
| - formats: Cast enum to int before shifting |
| - microsoft/compiler: Cast one enum to another instead of to int to resolve warning |
| - d3d12: Optimize redundant flushes |
| - d3d12: Fix 'surface conversion' path for RTVs |
| - microsoft/compiler: Fix PSV0 for validator 1.8 |
| - microsoft/compiler: Clamp bias to DXIL valid range |
| - d3d12: Fail-fast on PSO creation failures |
| - microsoft/compiler: Separate 'advanced texture ops' from SM6.7 |
| - dzn: Roll up initialization failure in dzn_meta_init |
| - d3d12: Store fence FD type in the fence |
| - d3d12: Fix video fence lifetime issues |
| - gallium/video: Frontends separate in_fence and out_fence |
| - mediafoundation: Fix fence handling |
| - mediafoundation: Use C++ brace initialization instead of C-style |
| |
| Jesse.Zhang (1): |
| |
| - winsys/amdgpu: Add support for queue priority in Mesa |
| |
| Jianxun Zhang (14): |
| |
| - intel/dev: Differentiate displayable PAT entry of compression (xe2) |
| - iris: New compressed heaps for scanout buffers (xe2) |
| - anv: Use different PAT entries for compressed resources |
| - isl: Add Xe2 CCS modifiers (xe2) |
| - iris: Align size of compressed scanout buffers to 64KB on BMG (xe2) |
| - iris: Add new modifiers into code (xe2) |
| - iris: Choose PAT entry on imported buffers (xe2) |
| - iris: Update conditions when flushing resource (xe2) |
| - iris: Refactor function iris_flush_resource |
| - iris: Enable Xe2 modifier (xe2) |
| - anv: Treat imported compressed buffers as displayable (xe2) |
| - anv: Align size of compressed scanout buffers to 64KB on BMG (xe2) |
| - anv: Don't choose compression modifier when aux is disabled |
| - anv: Enable compression on CCS modifiers (xe2) |
| |
| Job Noorman (61): |
| |
| - nir/opt_shrink_vectors: enable for load_ubo_vec4 |
| - ir3: use opt_shrink_vectors |
| - ir3/isa: add nop encoding for bary.f/flat.b |
| - ir3: don't use VS input regs for binning variant |
| - nir/lower_io_to_vector: remove can_read_output assert |
| - tu: scalarize IO before linking |
| - ir3: calculate sstall/systall across blocks |
| - ir3: add mergedregs to ir3_compiler |
| - ir3/legalize: add cycle to ir3_legalize_state |
| - ir3/legalize: normalize nop state at block start |
| - ir3/legalize: remove ctx argument from delay_calc/update |
| - ir3/legalize: extract ir3_required_sync_flags helper |
| - ir3/legalize: extract sync_update helper |
| - ir3/legalize: add ir3_init_legalize_state helper |
| - ir3/legalize: extract ir3_merge_pred_legalize_states helper |
| - ir3/legalize: remove unused parameter from delay_update |
| - ir3/legalize: add ir3_update_legalize_state helper |
| - ir3/legalize: add syncs based on previous instr to sync_update |
| - ir3/legalize: apply ss/sy to state in sync_update |
| - ir3/legalize: make ir3_legalize_state and helpers public |
| - ir3/postsched: use legalize state for delay/sync calc |
| - freedreno/ci: update expectations |
| - freedreno/drm-shim: add support for MSM_PARAM_UCHE_TRAP_BASE |
| - ir3: don't free constant_data after assembling |
| - Revert "ir3: optimize SSBO offset shifts for nir_opt_offsets" |
| - ir3: don't vectorize 8bit SSBO accesses |
| - ir3/lower_io_offsets: set progress when scalarizing UAV loads |
| - ir3: optimize more before opt_preamble |
| - ir3: move collect/split helpers to ir3.{c,h} |
| - ir3: add support for collects in ir3_get_src_shared |
| - ir3: flatten nested collects |
| - ir3: add helpers to handle 64b values |
| - ir3: add ir3_split_off_scalar helper |
| - ir3/lower_preamble: add support for 64b values |
| - ir3: add support for u2u64 |
| - ir3: don't vectorize 64b values |
| - ir3: use ir3_64b helpers where possible |
| - ir3: ingest global addresses as 64b values from NIR |
| - nir: remove unused global_atomic(_swap)_ir3 intrinsics |
| - ir3: enable nir_opt_uniform_subgroup |
| - freedreno/decode: don't count alias regs for GPR stats |
| - ir3/legalize: don't force (ss) after predt |
| - ir3/legalize: propagate force_ss/sy across blocks |
| - ir3/legalize: apply initial GS/TCS sync in legalize |
| - ir3: add helper to apply half shared mov quirk |
| - ir3: support half regs for read_{first,cond,getlast} |
| - ir3/a7xx: disable half shared mov quirk |
| - ir3/parser: add uinteger helper |
| - ir3/isa: add isaspec definition for movs |
| - ir3: make backend aware of movs |
| - ir3: add codegen for movs |
| - ir3/cf: add support for movs |
| - ir3/isa: ignore bit 54 in alias encoding |
| - ir3/merge_regs: fix merge set alignment check |
| - ir3: add subreg move optimization |
| - ir3/legalize: emit predication quirk nops in next block |
| - ir3/legalize: merge predication quirk nops with existing nops |
| - mailmap,bin/people.csv: fix my preferred email address |
| - freedreno/computerator: add support for UBOs |
| - ir3/legalize: prevent infinite loop when inserting (ss)nop |
| - nir/opt_uniform_subgroup: use ballot_bit_count |
| |
| John Anthony (10): |
| |
| - panvk: Enable VK_EXT_direct_mode_display |
| - panvk: fix maxInlineUniformTotalSize |
| - panvk: add panvk_get_iub_desc_count and panvk_get_iub_size |
| - panvk: fix allocated size for sets with variable count iub |
| - panvk: use variable count when initializing iub buffer descriptor |
| - panvk: count iubs correctly when checking if layout is supported |
| - panvk: handle 0 count for vdc when checking if layout is supported |
| - panvk: report the maximum supported size for a variable count iub |
| - pan/lib: Add some shader core properties to model table |
| - panvk: Enable VK_ARM_shader_core_properties |
| |
| Jonathan Gray (3): |
| |
| - vulkan: add missing include for FALLTHROUGH |
| - util: fix OpenBSD/powerpc64 build |
| - intel/dev: update BMG device names |
| |
| Jordan Justen (9): |
| |
| - intel/dev: Add BMG PCI IDs 0xe220-0xe223 |
| - intel/dev: Add PTL PCI IDs 0xb084-0xb087 |
| - intel/dev: Update names for BMG G31 PCI IDs |
| - anv: Set Xe3 as supported |
| - intel/dev: Enable PTL PCI IDs (without INTEL_FORCE_PROBE) |
| - intel/dev: Add WCL platform enum |
| - intel/dev/mesa_defs.json: Add WCL WA entries |
| - intel/dev: Add WCL device info |
| - intel/dev: Add WCL PCI IDs |
| |
| Jose Maria Casanova Crespo (22): |
| |
| - vc4: Remove offset from vc4_surface |
| - vc4: Remove tiling from vc4_surface |
| - vc4: Stop calling deprecated util_framebuffer_init |
| - vc4: Remove struct vc4_surface and create/destroy surface functions |
| - v3d: Remove swap_rb logic from v3d_surface |
| - v3d: Remove rt_format from v3d_surface |
| - v3d: Remove never used offset from v3d_surface |
| - v3d: Remove tiling from v3d_surface |
| - v3d: Remove padded_height_of_output_image_in_uif_blocks from v3d_surface |
| - v3d: Remove internal_type and internal_bpp from v3d_surface |
| - v3d: Remove separate_stencil from v3d_surface |
| - v3d: Stop calling deprecated util_framebuffer_init |
| - v3d: Remove struct v3d_surface and create/destroy surface functions |
| - v3d: Add missing newline char on perf debug message |
| - v3d: Only apply TLB load invalidation on first job after FB state update |
| - v3d: Force job submit if the number of attached BOs is over 2048 |
| - v3d: Rename MAX_JOB_SCISSORS to V3D_JOB_MAX_SCISSORS |
| - v3d: Avoid fast TLB blit if reused job doesn't store the color buffer |
| - v3d: avoid submit of supertile coordinates on jobs without rasterization |
| - v3d: fix support for no buffer object bound |
| - v3d: Fix depth resource invalidation with separate_stencil |
| - v3dv: Do not increase TFU READAHEAD for imported buffers size |
| |
| Joshua Duong (1): |
| |
| - gfxstream: VkDecoder: break from seqno logic when puid dies. |
| |
| José Roberto de Souza (49): |
| |
| - iris: Fix IRIS_HEAP_SYSTEM_MEMORY_CACHED_COHERENT slab parent allocation |
| - intel: Program XY_FAST_COLOR_BLT::Destination Mocs for gfx12 |
| - intel: Fix the MOCS values in XY_FAST_COLOR_BLT for Xe2+ |
| - intel: Fix the MOCS values in XY_BLOCK_COPY_BLT for Xe2+ |
| - intel: Add has_partial_mmap_offset to intel_device_info |
| - gallium: Remove pb_buffer.h include from pb_slab.h |
| - util: Move pb_slab from gallium to util |
| - anv: Export anv_bo_is_small_heap() |
| - anv: Move VMA alignment requirements to its own function |
| - anv: Add the base infrastructure to support memory pool |
| - anv: Implement anv_slab_bo and enable memory pool |
| - anv: Add support for ANV_BO_ALLOC_AUX_CCS in anv_slab_bo |
| - anv: Add support for ANV_BO_ALLOC_DESCRIPTOR_POOL in anv_slab_bo |
| - anv: Add support for ANV_BO_ALLOC_DYNAMIC_VISIBLE_POOL in anv_slab_bo |
| - anv: Skip anv_bo_pool if memory pool is enabled |
| - anv: Add support for batch buffers in anv_slab_bo in Xe KMD |
| - anv: Add support for batch buffers in anv_slab_bo in i915 |
| - anv: Remove useless if block |
| - anv: Always grow fixed address pools by 2MB in platforms that there is a performance gain |
| - anv: Align size of bos larger than 1MB to 64k to enable 64k pages |
| - iris: Remove iris_slab_free cast |
| - anv: Fix assert failure in discrete GPUs when allocating a LMEM+SMEM slab parent |
| - anv: Remove ANV_BO_ALLOC_HOST_CACHED from ANV_BO_ALLOC_MAPPED assert() on anv_device_alloc_bo() |
| - intel/tools: Fix batch buffer decoder |
| - anv: Reduce memory pool usage in MTL and ARL |
| - anv: Implement missing part of Wa_1604061319 |
| - anv: Enable preemption due 3DPRIMITIVE in GFX 12 |
| - iris: Restrict platforms that needs Wa_1604061319 |
| - anv: Drop '#if GFX_VERx10 >= 125' inside of '#if GFX_VERx10 >= 125' |
| - anv: Remove a '#if GFX_VER >= 30' block inside of a else of '#if GFX_VERx10 >= 125' |
| - intel: Return PTL stepping |
| - intel/tools: Replace error_decode_xe_read_hw_sp_or_ctx_line() by a more generic function |
| - intel/tools: Rename and better detect topic changes |
| - intel/tools: Skip dump of binaries in unknown sections |
| - Revert "anv: Enable preemption due 3DPRIMITIVE in GFX 12" |
| - intel/common: Add function to compute optimal compute engine async thread limits |
| - intel/common: Use as much as possible spec recommended values for compute engine async thread limits |
| - iris: Emit STATE_COMPUTE_MODE before COMPUTE_WALKER when new async compute limits are needed |
| - blorp: Emit STATE_COMPUTE_MODE before COMPUTE_WALKER |
| - anv: Emit STATE_COMPUTE_MODE before COMPUTE_WALKER when new async compute limits are needed |
| - anv: Read the correct register for aux table invalidation when in GPGPU mode in render engine |
| - anv: Flush before invalidate aux map in copy and video engines |
| - anv: Do not emit batch_emit_fast_color_dummy_blit() for video engine |
| - anv: Reduce compiled code for Wa_16018063123 |
| - iris: Reduce compiled code for Wa_16018063123 |
| - anv: Decode and print async submit batch when debug flag is set |
| - iris: Program DispatchWalkOrder and ThreadGroupBatchSize with optimized values for regular computer walkers |
| - intel/blorp: Program DispatchWalkOrder and ThreadGroupBatchSize with optimized values for regular computer walkers |
| - anv: Program DispatchWalkOrder and ThreadGroupBatchSize with optimized values for regular computer walkers |
| |
| Juan A. Suarez Romero (24): |
| |
| - v3d/vc4/ci: update fraction and parallel values |
| - v3d/vc4/ci: Add -gl suffix to the GL suite names |
| - Revert "ci: disable Igalia's farm" |
| - ci/baremetal: fix ubsan gl target |
| - broadcom/ci: update expected results |
| - v3d/v3dv/ci: update expected results |
| - broadcom/ci: update test expected results |
| - v3dv: free indirect CSD job with queue cpu |
| - gallium/util: fix num primitives for line loops |
| - broadcom/v3dv: skip implicit synchronization tests |
| - broadcom/ci: add skqp testing |
| - broadcom/ci: update expected test results |
| - vc4: return progress on custom nir lowering |
| - vc4: don't use deprecated NIR_PASS_V macro |
| - v3d/compiler: don't use deprecated NIR_PASS_V macro |
| - v3dv: don't use deprecated NIR_PASS_V macro |
| - v3d,v3dv: set max supertiles to 256 |
| - v3dv: consider render area when configuring supertiles |
| - vc4,v3d: update expected results |
| - vc4: free RA interference graph on failure |
| - vc4/ci: disable skqp job |
| - broadcom/ci: update expected results |
| - broadcom/ci: unlock some CI-Tron jobs |
| - docs/features: GL_ARB_timer_query implemented for v3d |
| |
| Julia Zhang (1): |
| |
| - radeonsi: small fixes of radeonsi renderstage |
| |
| Juston Li (3): |
| |
| - driconf: enable custom_border_colors_without_format for ANGLE-on-anv |
| - anv/android: refactor anb resolve to fix align assertion |
| - anv: fix uninitialized mutex lock in anv_slab_bo_deinit() |
| |
| K900 (1): |
| |
| - gfxstream: fix build on 32-bit |
| |
| Kai Wasserbäch (1): |
| |
| - fix(FTBFS): clc: switch to new non-owned \`TargetOptions` for LLVM 21 |
| |
| Karmjit Mahil (6): |
| |
| - tu: Fix segfault in fail_submit KGSL path |
| - anv,tu: Bypass RMV pcie_family_id check |
| - tu: Add "check_cmd_buffer_status" debug option |
| - tu,freedreno: Add pkt_field_{get,set} helper macro |
| - tu: Use A2D for float16 formats |
| - tu: Remove handling of float16 in the 3D blit path |
| |
| Karol Herbst (132): |
| |
| - ac/llvm: use mul24 intrinsics |
| - rusticl/device: fix panic when disabling 3D image write support |
| - rusticl: allow packagers to enable radeonsi by default |
| - nir_lower_mem_access_bit_sizes: fix negative chunk offsets |
| - nak: fix handling of delays > 15 |
| - r600: fix r600_buffer_from_user_memory for rusticl |
| - r600: remove all clover related code |
| - radeonsi: remove more clover related code |
| - nv50: move pipe_grid_info::input into the driver |
| - nvc0: remove support for pipe_grid_info::input |
| - gallium: remove pipe_binary_program_header |
| - gallium: remove pipe_compute_caps::max_block_size_clover |
| - gallium: remove pipe_compute_caps::max_threads_per_block_clover |
| - gallium: remove pipe_compute_caps::ir_target |
| - gallium: remove pipe_compute_caps::images_supported |
| - gallium: remove pipe_compute_caps::max_private_size |
| - gallium: remove pipe_compute_caps::max_input_size |
| - gallium: remove pipe_compute_state::req_input_mem |
| - gallium: remove pipe_context::set_compute_resources and PIPE_BIND_COMPUTE_RESOURCE |
| - gallium: remove pipe_grid_info::pc and PIPE_SHADER_IR_NATIVE |
| - gallium: remove pipe_grid_info::input |
| - iris: parse global bindings for every gen |
| - iris/xe: fix compute shader start address |
| - iris/xe: take the grids variable_shared_mem into account |
| - nir: add nir_opt_algebraic_integer_promotion |
| - radeonsi: fix variable_shared_size assert in si_switch_compute_shader |
| - nir: fix use-after-free on function parameter names |
| - vtn: fix use-after-free on function parameter names |
| - frontend/dri: add wrapper around pipe_loader_get_driinfo_xml |
| - glx,egl: use driGetDriInfoXML |
| - glx,egl: drop libpipe_loader_dynamic dependency |
| - pipe-loader: remove libpipe_loader_dynamic |
| - gallium: remove dynamic pipe-loader |
| - clc: wire up cl_khr_kernel_clock |
| - rusticl: generate bindings for llvm version macros |
| - rusticl: support cl_khr_kernel_clock |
| - include: sync CL headers |
| - gallium: add fixed address resource API |
| - gallium: new VM interfaces for SVM |
| - zink: set unordered_read/write after buffer_barrier in set_global_binding |
| - zink: implement resource_get_address |
| - lp: implement resource_get_address |
| - rusticl/mesa: wrap new VM interfaces |
| - rusticl/mesa: wrap new VM interfaces SVM edition |
| - rusticl/mesa: add util_vma_heap wrapper |
| - rusticl/kernel: rework validation in clSetKernelExecInfo |
| - rusticl: implement cl_ext_buffer_device_address |
| - rusticl: move SVM allocation into core |
| - rusticl/kernel: add an SVM kernel argument value |
| - rusticl: add support for coarse-grain buffer SVM |
| - rusticl: add memory debugging |
| - include: remove C++ OpenCL headers |
| - llvmpipe remove llvmpipe_screen::allow_cl |
| - rusticl/kernel: implement CL_INVALID_ARG_VALUE for image args in clSetKernelArg |
| - rusticl/memory: properly set pipe_image_view::access |
| - rusticl/kernel: rework unsafe block inside clSetKernelArg |
| - ac/nir: fix unaligned single component load/stores |
| - rusticl: fix compilation on non Linux systems |
| - rusticl: move unmap out of debug_assert_eq statement |
| - rusticl: check the returned pointer of mmap |
| - freedreno/ci: add CL fp16 skips |
| - util/u_printf: support printing fp16 values |
| - clc: support fp16 spec constants |
| - vtn/opencl: support fp16 builtins |
| - nir/scale_fdiv: handle fp16 fdiv |
| - rusticl: enable proper fp16 support |
| - clc: use new createTargetMachine overload with llvm-21 |
| - clc: fix DiagnosticOptions related build failure with llvm-21 |
| - rusticl/mesa: catch invalid use of resource_assign_vma in debug builds |
| - iris: implement resource_get_address |
| - iris: implement SVM interfaces |
| - include: update CL headers |
| - rusticl/device: refactor Device::new |
| - rusticl: implement cl_khr_spirv_queries |
| - ac/llvm: fix bitfield ops |
| - rusticl: implement cl_khr_priority_hints |
| - rusticl/image: fix sub-buffer images |
| - rusticl/image: enable sRGB support |
| - vtn: mark BitInstructions cap as supported |
| - clc: support cl_khr_extended_bit_ops |
| - llvmpipe: more bitfield_extract lowering |
| - agx: enable bitfield lowering for cl_khr_extended_bit_ops |
| - ir3: enable bitfield lowering for cl_khr_extended_bit_ops |
| - panfrost: lower bitfield_extract8|16 |
| - r600: lower bitfield_extract8|16 |
| - v3d: lower bitfield_extract8|16 |
| - zink: check for VK_KHR_maintenance9 |
| - zink: lower bitfield operations without maintenance9 |
| - rusticl: support cl_khr_extended_bit_ops |
| - include: sync OpenCL headers |
| - rusticl/icd: use default value for cl_icd_dispatch |
| - rusticl: cl_khr_icd 2.0.0 |
| - radeosi: assert addresses are not NULL in a couple of places |
| - radeonsi: implement resource_get_address |
| - radeonsi: implement SVM interfaces |
| - rusticl: allow packagers to enable freedreno by default |
| - rusticl/queue: signal events directly on panics |
| - rusticl/event: change Queue argument to Context in EventSig |
| - rusticl/event: check the queue timed to detect enabled profiling in call |
| - rusticl/event: convert queue to Weak reference |
| - rusticl/queue: do not block when dropping a queue |
| - rusticl/queue: use let else statement when receiving new events |
| - rusticl/queue: reuse vector between queue iterations |
| - rusticl/queue: fix wrong_self_convention and needless_borrow clippy warnings |
| - clc: add support for cl_ext_image_unorm_int_2_101010 |
| - rusticl/formats: support CL_UNORM_INT_101010_2 |
| - rusticl/formats: support cl_ext_image_unorm_int_2_101010 |
| - rusticl: use image_copy_buffer |
| - rusticl: reset VMA address of resource inside SVMAlloc::drop |
| - rusticl/mesa: add PipeResource::new_ref |
| - rusticl: drop Arc around PipeResource |
| - doc/features.txt: synchronize OpenCL extension list with official spec |
| - docs/features.txt: entirely rework OpenCL feature 2.x reporting |
| - nak: Provide our own callback to nir_lower_phis_to_scalar |
| - nak: stop lowering all phis to 32 bits. |
| - nak: keep phis vectorized if they fit into a register |
| - nv50,nvc0: Set the texture for 3D blitter surfaces |
| - nvc0: remove dead surface tracking code |
| - nouveau: move util_framebuffer_init into the driver |
| - nak: fix MMA latencies for Ampere |
| - nak: Add cooperative matrix lowering pass |
| - nak: support faster back to back latencies for MMA |
| - nvk: add support for 16x8x16 IMMA on Ampere+ |
| - vtn/opencl: set exact on all ffmas and mads |
| - zink: disallow intensity buffer images |
| - zink: disable shader images for intensity formats |
| - rusticl/mem: relax flags validation for clGetSupportedImageFormats |
| - rusticl/queue: do not return event status errors on flush/finish |
| - zink: properly unbind sampler views with imported 2D resource |
| - rusticl/mesa: use pipe_sampler_view_reference |
| - rusticl/queue: clear shader images when destroying queues |
| - nv50: fully migrate away from util_framebuffer_init |
| |
| Kenneth Graunke (5): |
| |
| - nir: Add a new optimization for acquire/release atomics & barriers |
| - anv: Use the new nir_opt_acquire_release_barriers pass |
| - brw: Refactor copy propagation checks for EOT send restrictions |
| - brw: Fix units in copy propagation EOT restriction size calculation |
| - brw: Update copy propagation into EOT sends handling for Xe2 units |
| |
| Kevin Chuang (4): |
| |
| - intel/compiler: Use 24bits for hit_kind on Xe3+ |
| - intel/compiler/rt: Calculate barycentrics on demand |
| - intel/bvh: Compile and adapt bvh shaders separately into Xe1/2 and Xe3+ |
| - intel/bvh/debug: Adapt instance leaf dumping to support 64-bit RT |
| |
| Konrad Dybcio (1): |
| |
| - freedreno: Add initial A702 support |
| |
| Konstantin Seurer (80): |
| |
| - util: Add BITSET_EXTRACT |
| - vulkan: Add a vk_device parameter to get_encode_key |
| - vulkan: Add vk_ir_header::dst_leaf_node_offset |
| - ac: Add rt_version |
| - aco,nir: Add support for new GFX12 ray tracing instructions |
| - radv: Refactor create_bvh_descriptor |
| - radv/rra: Move gfx10_3 specific code to a new file |
| - radv/rra: Set rra_accel_struct_header::rtip_level |
| - radv/bvh: Document GFX12 BVH encoding |
| - radv/bvh: Add helpers for encoding |
| - radv: Use the BVH8 format on GFX12 |
| - radv: Optimize the gfx12 encode shader |
| - lavapipe: Fix ray tracing position fetch with multiple geometries |
| - radv: Return VK_ERROR_INCOMPATIBLE_DRIVER for unsupported devices |
| - util: Add util_format_is_int64 |
| - llvmpipe: Implement 64-bit image operations |
| - llvmpipe: Disable 64-bit integer formats for vertex fetch |
| - gallium: Handle 64bit textures in the SW clear fallback path |
| - lavapipe: Implement 64-bit image clears |
| - lavapipe: Advertise VK_EXT_shader_image_atomic_int64 |
| - llvmpipe: Handle nir_tex_src_min_lod |
| - lavapipe: Advertise shaderResourceMinLod |
| - lavapipe: Implement KHR_shader_quad_control |
| - lavapipe: Implement VK_EXT_fragment_shader_interlock |
| - gallivm/nir/aos: Remove left over debug print |
| - gallivm: Use divergent sources for more subgroup ops |
| - lavapipe/ci: Update expected vkd3d-proton fails |
| - vulkan: Introduce VK_BUILD_FLAG for specializing BVH build shaders |
| - vulkan: Add helpers for creating bvh build pipelines/layouts |
| - radv: Use build flags instead of defines |
| - vulkan: Add acceleration structure update keys |
| - radv: Use a specialized shader for in place updates |
| - radv: Refactor the update scratch layout code |
| - vulkan/bvh: Add type information for vk_bvh_geometry_data |
| - vulkan: Add more information to BVH update callbacks |
| - radv: Make radv_update_memory non-static |
| - radv: Use subgroup OPs for BVH updates on GFX12 |
| - nir/print: Add a get_name helper |
| - nir/print: Use get_name for types |
| - nir: Print struct type declarations |
| - radv: Flush L2 on GFX12 when binding an update pipeline |
| - radv/bvh: Make sure the AABB is written before internal_ready_count |
| - radv/bvh: Remove some unused variables |
| - radv/bvh: Fix comment |
| - lavapipe: Fail device enumeration when DRAW_USE_LLVM=0 is set |
| - llvmpipe: Compile texture fetch functions on demand |
| - llvmpipe: Compile fewer unnecessary functions |
| - llvmpipe: Compile size query functions on demand |
| - llvmpipe: Make more texture state static with bindless |
| - llvmpipe: Reduce the size of lp_descriptor |
| - radv/bvh: Fix updating empty bvhs |
| - vulkan: Allow reserving scratch memory for encode passes |
| - vulkan: Move the build options to the accel struct header |
| - vulkan: Replace get_*_key with get_build_config |
| - vulkan: Remove bvh_state::leaf_node_size |
| - vulkan: Pass a structure to most BVH build callbacks |
| - spirv: Move the shader_call_data workaround above nir_validate_shader |
| - nir: Add nir_lower_halt_to_return |
| - gallivm: Implement demote and lower terminate in nir |
| - gallivm/nir/soa: Fix typo |
| - gallivm/nir/soa: Include helper invocations in first_active_invocation |
| - gallivm/nir/soa: Cast divergent->uniform at the consumer |
| - gallivm/nir/soa: Call lp_build_opt_nir |
| - ci: Update trace checksums |
| - lavapipe: Suspend conditional rendering around ray tracing dispatches |
| - lavapipe: Advertise VK_EXT_tooling_info support |
| - docs: Document some lavapipe extensions |
| - gallium: Add back pipe_rasterizer_state::offset_units_unscaled |
| - llvmpipe: Implement pipe_rasterizer_state::offset_units_unscaled |
| - llvmpipe: Improve depth bias rounding workaround |
| - lavapipe: Implement VK_EXT_depth_bias_control |
| - llvmpipe: Use the correct field to decide if coroutines are used |
| - lavapipe: Implement VK_KHR_workgroup_memory_explicit_layout |
| - vulkan/cmd_queue: Fix indentation a bit |
| - vulkan/cmd_queue: Free pNext |
| - ci/llvmpipe: Update expectations |
| - llvmpipe: Move allocation out of compile_sample_functions |
| - llvmpipe: Allocate some stuff on demand |
| - radv/rra/gfx10_3: Fix acceleration structure addresses |
| - radv: Initialize base IDs when doing a BVH update with src!=dst |
| |
| Kovac, Krunoslav (2): |
| |
| - amd/vpelib: Update comment and fix some code alignment issues. |
| - amd/vpelib: Fix CodeQL issues Pt1 |
| |
| Krunoslav Kovac (1): |
| |
| - amd/vpelib: Remove RGBE_ALPHA |
| |
| Lars-Ivar Hesselberg Simonsen (21): |
| |
| - panfrost: Add pan_unpack to ForEachMacros |
| - pan/texture: Correctly handle slice stride for MSAA |
| - pan/texture: Set plane size to slice size |
| - pan/genxml/v10: Add minus1 mod for plane width/height |
| - pan/genxml/v12: Add minus1 mod for plane width/height |
| - pan/genxml/v13: Add minus1 mod for plane width/height |
| - pan/texture/v10+: Set width/height in the plane descs |
| - panvk/v9+: Set up limited texture descs for storage use |
| - panvk: Request resources during subqueue init |
| - panvk/v10+: Remove unnecessary alloc in dispatch_precomp |
| - panvk/v10+: Limit direct dispatch WLS allocation |
| - panvk/jm: Apply direct dispatch WLS instance limit |
| - panfrost: Apply direct dispatch WLS instance limit |
| - panvk: Fix unsupported LogicOp NOOP blending |
| - panvk: Add FS read support for PrimitiveID |
| - panvk: Skip barrier QFOT if src_qfi equals dst_qfi |
| - pan/genxml/v9+: Add support for NullDescriptor decode |
| - panvk/v10+: Implement nullDescriptor support |
| - panvk/v9+: Handle nullDescriptor for texture/image builtins |
| - panvk/v10+: Advertise nullDescriptor support |
| - panvk: Advertise VK_KHR_unified_image_layouts |
| |
| Leder, Brendan Steve (1): |
| |
| - amd/vpelib: Make BG gen stream idx adjustable |
| |
| Lina Versace (3): |
| |
| - anv: Assert that only external images have private bindings |
| - anv: Fix comment about external queue transitions |
| - anv: Enable VK_EXT_external_memory_acquire_unmodified |
| |
| LingMan (4): |
| |
| - entaviv/isa: Silence warnings about non snake case names |
| - meson: Streamline silencing of warnings in bindgen generated code |
| - rusticl: Initialize NIR option structs with default values |
| - docs/rusticl: Update documented version requirements for meson and bindgen |
| |
| Lionel Landwerlin (101): |
| |
| - intel: fixup a few debugging option checks |
| - anv: use companion batch for operations with HIZ/STC_CCS destination |
| - anv: update Wa_22019225126 check |
| - anv: add ability to mmap at offset |
| - anv: force fragment shader execution when occlusion queries are active |
| - intel: fix null render target setup logic |
| - anv: promote VK_EXT_robustness2 to VK_KHR_robustness2 |
| - brw: add pre ray trace intrinsic moves |
| - compiler: add VARYING_BIT_CULL_PRIMITIVE |
| - compiler: add VARYING_BIT_PRIMITIVE_INDICES |
| - anv: remove tbimr workaround check |
| - anv: lower input vertices for TCS unconditionally |
| - brw: add helpers to check if a fragment shader execution is dynamic |
| - brw: store input_vertices on tcs_prog_data |
| - anv: switch to use the tcs_prog_data for dynamic input vertices |
| - anv: switch to brw helpers to figure out if a fragment is dynamic |
| - brw: improve VUE printout |
| - brw: add new helper to print out FS URB setup |
| - brw: move helper to brw_nir.c |
| - brw: use newer NIR constructs |
| - brw/nir: use a new intrinsic for fs_msaa_flag |
| - anv/brw: shrink FS varying payload |
| - anv: tidy up (CLIP|SBE)_MESH emission |
| - brw: use VARYING_BIT_* macros more |
| - brw: document some brw_wm_prog_data fields |
| - brw: fix brw_nir_move_interpolation_to_top |
| - anv/brw: use separate_shader to deduced MUE compaction |
| - brw/nir: add intrinsics to read attribute payload register indirectly |
| - brw: add documentation pointers to FS attribute layout |
| - intel: prepare VUE layout for more than 2 layouts |
| - intel: introduce new VUE layout for separate compiled shader with mesh |
| - anv/brw: handle pipeline libraries with mesh |
| - panvk/ci: add more flaky tests |
| - vulkan/runtime: store index of the push descriptor in pipeline layout |
| - vulkan/runtime: track dynamics descriptor in a set layout |
| - vulkan/runtime: fixup assert with link_geom_stages |
| - vulkan/runtime: add a multialloc variant for pipeline create |
| - anv: enable preemption setting on command/batch correctly |
| - anv: add a comment about Wa_14016820455 |
| - anv/brw: stop turning load_push_constants into load_uniform |
| - hasvk/elk: stop turning load_push_constants into load_uniform |
| - anv: avoid 64bit atomics emulation on Xe2+ |
| - anv: don't use pipeline layout at descriptor bind |
| - radv: rename radv_lower_terminate_to_discard for wider use |
| - anv: add support for lower_terminate_to_discard workaround |
| - anv: workaround Sky: Children of the Light |
| - intel: remove GRL/intel-clc |
| - brw: don't generate invalid instructions |
| - brw: fix brw_nir_fs_needs_null_rt helper |
| - nir/opt_algebraic: extend lowering for (i|u)bitfield_extract |
| - anv: fix pool allocation failure reporting |
| - brw: implement read without format lowering |
| - anv: add support for shaderStorageImageReadWithoutFormat through emulation |
| - anv: simplify RENDER_SURFACE_STATE parsing |
| - spirv: bump headers |
| - vulkan: dump headers/registry to 1.4.317 |
| - ci: add intel format emulation testing |
| - brw: enable more lowering for bitfield manipulation at non 32bit sizes |
| - anv: allow device creation with no queue |
| - anv: implement VK_KHR_maintenance9 |
| - anv: missing bit from maintenance9 |
| - anv: handle REMAINING_LAYERS with 3D images & maintenance9 |
| - docs/features: update anv support |
| - anv: fix valgrind warning |
| - anv: fix R64 format support reporting |
| - anv: pass image usage/flags to anv_get_image_format_features2 |
| - anv: report color/storage features on YCbCr images with EXTENDED_USAGE |
| - ci/zink: add the same glx\@glx-tfp flake on ADL |
| - ci/zink: add validation error |
| - anv: only use compressed memory types on Xe2+ |
| - intel: fix monitor build dependencies |
| - nir/divergence: add missing intel intrinsics |
| - brw: print descriptor & extended descriptors |
| - intel: replace RANGE_BASE by BASE for uniform block loads |
| - nir: add new intel ssbo intrinsics |
| - nir/opt_offsets: add support for intel intrinsics |
| - brw: add new helper for immediate integer register with type |
| - isl: handle DISABLE_AUX in get_mcs_surf |
| - anv: fix clears on single aspect of YCbCr images |
| - anv: fix sampler hashing in set layouts |
| - intel/ci: document a couple of vkd3d failures |
| - brw: fix set_range on load_per_primitive_output |
| - brw: fix vertex attribute offset computation |
| - anv: make Wa_18019110168 deal with dynamic provoking vertex |
| - anv/brw: move Wa_18019110168 handling to backend |
| - brw: extract out attribute register remapping |
| - brw: store the remapping table for wa_18019110168 in constant data |
| - brw: move primitive_id_index field in fs_msaa |
| - brw: make a helper for vertex attribute offset computation |
| - brw: handle wa_18019110168 with independent shader compilation |
| - anv: add support for handling wa_18019110168 with gfx-libs |
| - brw: remove debug printf |
| - brw: fix non constant BTI accesses with offsets |
| - anv: rework embedded sampler hashing |
| - anv: do not rely on sampler objects for pipeline compilation |
| - intel/genxml: rename body field |
| - genxml: fix 3DSTATE_TE definition on Gfx12.[05] |
| - brw: fix NIR metadata invalidation with closest-hit shaders |
| - brw: fixup source depth enabling with coarse pixel shading |
| - brw: fixup coarse_z computation |
| - anv: fix wsi image aliasing |
| |
| Lorenzo Rossi (32): |
| |
| - nak: Add OpTexDepBar |
| - nak: Add a new OpFSwz and use it for derivatives on Kepler |
| - nak: Use s2r for SV_CLOCK on Kepler |
| - nvk: nak: Add OpViLd support |
| - nak: Initial SM32 support |
| - nvk: Clean up boilerplate around complex NIR flags |
| - nak: sm32: Fix wrong dsetp encoding |
| - nak/from_nir: Fix fquantize crash on < SM70 |
| - nak/opt_copy_prop: convert b2i-i2b into copy |
| - nak/sm32: Fix shfl.up register alignment |
| - nak/sm32: Fix various encoding bugs |
| - nvk: Don't advertise BAR memory for Kepler cards |
| - nvk: Fix local memory loads in Kepler shader header |
| - nak/sm20: Fix encoding panic for OpFAdd |
| - nir,nak: Add KeplerB shared atomics intrinsics and lowering |
| - nak: Add OpLdSharedLock and OpStSCheckUnlock |
| - nak/sm32: Add encodings for OpLdSharedLock and OpStSCheckUnlock |
| - nil: Add helpers for filling out Fermi/Kepler surface info |
| - nir,nak: Add NV-specific image intrinsics |
| - nak: Add an image address lowering pass |
| - nvk: Use nil_su_info for image descriptors on Kepler |
| - nak: Add surface address ops |
| - nak: Remove unused intrinsic image_load_raw_nv |
| - nak/sm32: Encode surface address ops |
| - nak/sm20: Encode imadsp |
| - nak: Add tests for imnmx and dsetp |
| - nak: Remove unreachable code in sm50.rs |
| - nak/sm32: Remove unnecessary NOP filling |
| - nak: Specialize MAX_INSTR_DELAY by ShaderModel |
| - nak: Add real instruction dependencies for Kepler |
| - nak: Add forward dataflow algorithm |
| - nak/kepler: Add texdepbar insertion pass |
| |
| Loïc Molinari (16): |
| |
| - panfrost: Optimize AFBC-P offsets computation |
| - util: Add AArch64 support to util_streaming_load_memcpy() |
| - panfrost: Use util_streaming_load_memcpy() to copy AFBC superblocks |
| - panfrost: Fix modifier conversion shaders search key |
| - panfrost: Fix AFBC packing |
| - panfrost: Disable AFBC tiled layout with driconf option |
| - panfrost: Improve AFBC packing terms |
| - panfrost: Test mipmap chain validity in panfrost_should_pack_afbc() |
| - panfrost: Make panfrost_pack_afbc() static |
| - panfrost: Async AFBC packing |
| - panfrost: Async AFBC packing debug info |
| - panfrost: Improve AFBC header block accesses |
| - panfrost: Get AFBC-P payload layout on the CPU |
| - panfrost: Optimize pan_afbc_payload_layout_packed() for AArch64 |
| - panfrost: Optimize pan_afbc_payload_layout_packed() for AArch32 |
| - panfrost: Test pan_afbc_payload_layout_packed() |
| |
| Lucas Fryzek (2): |
| |
| - anv: Add format conversion for AHARDWAREBUFFER_FORMAT_YCbCr_P010 |
| - lp: Don't allocate sampler functions if count is 0 |
| |
| Lucas Stach (33): |
| |
| - etnaviv: don't pretend to support TS for array or 3D textures |
| - etnaviv: drop ts_offset from etna_surface |
| - etnaviv: add resource render compatible check |
| - etnaviv: move TS allocation to resource allocation |
| - etnaviv: add screen spec bit for PE multitiling |
| - etnaviv: drop unused member PE_RT_COLOR_ADDR from compile framebuffer state |
| - etnaviv: stop tracking color buffer address information multiple times |
| - etnaviv: stop tracking ZS buffer address information multiple times |
| - etnaviv: don't prepare relocs in etna_surface |
| - etnaviv: drop superfluous PE alignment check |
| - etnaviv: drop precomputed offset from etna_surface |
| - etnaviv: stop caching RS clear commands on surface |
| - etnaviv: return render resource from render_handle_incompatible if it exists |
| - etnaviv: move render compatible handling from surface to resource |
| - etnaviv: rs: stop using etna_surface |
| - etnaviv: blt: stop using etna_surface |
| - etnaviv: state: stop using etna_surface |
| - etnaviv: get rid of etna_surface |
| - etnaviv: use direct BLT/RS blit hook for internal copies |
| - etnaviv: use most recent shadow of resources as blit source/target |
| - etnaviv/ci: update expection after blit fixes |
| - etnaviv: expose ARB_seamless_cube_map |
| - docs/etnaviv: add/fix supported texture extensions |
| - etnaviv: use TX_SEAMLESS_CUBE to determine seamless cubemap support |
| - etnaviv: don't advertise anisotropic filtering on GPUs with NO_ANISTRO_FILTER |
| - Revert "etnaviv/ci: document new failure" |
| - etnaviv: improve dither enable conditions |
| - etnaviv/ci: update expectation after dither fixes |
| - etnaviv: remove obsolete two-sided stencil comment |
| - etnaviv: drop obsolete question about alpha/stencil test interaction |
| - etnaviv: simplify stencil ref front/back selection condition |
| - etnaviv: allow 0 back stencil valuemask on new cores |
| - etnaviv/ci: drop fails due to ignored stencil valuemask |
| |
| Ludvig Lindau (1): |
| |
| - panvk: Fix ls_tracker usage in cs_maybe |
| |
| Luigi Santivetti (2): |
| |
| - vulkan/util: add vk_realloc2 |
| - Revert "pvr: Implement VK_EXT_memory_budget" |
| |
| M Henning (5): |
| |
| - nv50_ir_ra: Remove an assign to lval->compMask |
| - nv50_ir_ra: Add constraint movs for split-of-split |
| - nv50_ir_ra: Use propagated compMask for reg offset |
| - nv50_ir_ra: Don't coalesce mixed constraints |
| - nv50_ir: Remove dead variable |
| |
| Marek Olšák (265): |
| |
| - ac/gpu_info: clean up ac_get_hs_info, use standard terms like workgroup |
| - ac/gpu_info: add total_tess_ring_size |
| - radv: move the tess factor ring after the tess offchip ring |
| - ac/gpu_info: move HS info into radeon_info |
| - ac: use HS offchip wg size from radeon_info in ac_compute_num_tess_patches |
| - ac: minor cleanup of ac_compute_num_tess_patches |
| - ac/gpu_info: print tessellation ring info |
| - ac/nir: don't include TCS offchip size in LDS_SIZE |
| - ac/gpu_info: compute the tess factor ring size proportionately to the CU count |
| - ac/gpu_info: reduce the tess offchip ring size and compute it proportionately |
| - ac/nir: remove shader_info parameter from ac_nir_compute_tess_wg_info |
| - ac/nir: don't store tess levels for TES in TCS if no_varying is set |
| - ac/gpu_info: rename tess ring variables, fold double_offchip_wg |
| - radv: fix incorrect patch_outputs_read for TCS with dynamic state |
| - nir: add gathering passes that gather which inputs affect specific outputs |
| - nir/opt_varyings: group TES inputs based on whether they are used by POS or VAR |
| - radeonsi: initialize use_ngg* sooner |
| - radeonsi: enable nir_io_compaction_groups_tes_inputs_into_pos_and_var_groups |
| - amd: stop using CLEAR_STATE on gfx11 |
| - ac: remove gfx11_emulate_clear_state |
| - nir: add shader_info::tess::tcs_cross_invocation_outputs_written |
| - nir: split \*_accessed_indirectly* bitmasks into \*_read/written_indirectly* |
| - ac/nir/tess: remove unused variables |
| - ac/nir/tess: don't pass nir_intrinsic_instr to VMEM IO calc helpers |
| - ac/nir/tess: don't pass nir_intrinsic_instr to hs_output_lds_offset |
| - radeonsi: remove dead gfx12 tess code from a gfx6-11 function |
| - aco: remove unused aco_shader_info::tcs_offchip_layout |
| - ac: adjust maximum HS workgroup size |
| - nir/opt_vectorize_io: fix a failure when vectorizing different bit sizes |
| - nir: fix gathering color interp modes in nir_lower_color_inputs |
| - nir: remove unnecessary (nir_shader \*) typecasts |
| - nir: handle ibfe/ubfe in nir_def_bits_used |
| - nir: handle bit shifts by constants in nir_def_bits_used |
| - nir: handle mul24 in nir_def_bits_used |
| - nir: handle u2u/i2i recursively in nir_def_bits_used |
| - nir: handle extract opcodes recursively in nir_def_bits_used |
| - nir: handle iand/ior opcodes recursively in nir_def_bits_used |
| - nir: handle mov and bcsel in nir_def_bits_used |
| - ac/nir: fix export_ps_outputs not preserving divergence metadata |
| - radeonsi: don't check sel->nir in si_init_shader_selector_async |
| - radeonsi: restructure si_get_shader_variant_info |
| - radeonsi: remove si_shader_info::uses_indirect_descriptor |
| - radeonsi: determine uses_vmem_load_other more accurately |
| - radeonsi: don't return progress from run_pre_link_optimization_passes |
| - radeonsi: always gather shader variant info |
| - radeonsi: move gathering VMEM information into si_get_shader_variant_info |
| - radeonsi: gather VS system value usage from shader variants |
| - radeonsi: gather uses_gs_state_provoking_vtx_first/outprim from the shader |
| - radeonsi: gather writes_z/stencil/sample_mask as shader variant info |
| - radeonsi: gather uses_discard from shader variants |
| - radeonsi: don't use si_shader_info in si_parse_next_shader_property |
| - radeonsi: remove si_shader_info::writes_position |
| - radeonsi: move xfb fields from si_shader_info to shader variant info |
| - radeonsi: implement remove_streamout in si_nir_kill_outputs |
| - radeonsi: don't declare GDS size for LLVM |
| - radeonsi: use info.num_streamout_vec4s instead of si_shader_uses_streamout |
| - radeonsi: use a simpler way to gather enabled_streamout_buffer_mask |
| - radeonsi: rename num_stream_output_components -> num_gs_stream_components |
| - radeonsi: inline shader_info in si_shader_info, keep only what's used |
| - radeonsi: move NIR passes from si_shader.c into their own files |
| - radeonsi: move shader info structures into new file si_shader_info.h |
| - nir: move nir_lower_color_inputs into radeonsi |
| - radeonsi: move si_nir_mark_divergent_texture_non_uniform to its own file |
| - radeonsi: move shader variant info and spi_ps_input_ena code into its own file |
| - radeonsi: move shaders args initialization into its own file |
| - radeonsi: add struct si_temp_shader_variant_info |
| - radeonsi: move si_gs_output_info into si_temp_shader_variant_info |
| - mesa: remove the old GL name (ID) allocator |
| - nir/tcs_info: gather for all patch outputs whether they're written by all invocs |
| - nir/tcs_info: gather which patch outputs are only read/written by invoc 0 |
| - nir/tcs_info: use range analysis to determine the range of tess levels |
| - winsys/amdgpu: print an error when we fail to allocate VA |
| - winsys/amdgpu: use alt_fence for all video queues |
| - winsys/amdgpu: fix running out of 32bit address space with high FPS |
| - winsys/amdgpu: add enums for queues using the fence rings |
| - winsys/amdgpu: add a high priority gfx queue |
| - winsys/amdgpu: set the priority for gfx user queues |
| - winsys/amdgpu: pass PIPE_CONTEXT_* flags to ctx_create |
| - winsys/amdgpu: fall back to a normal priority without root in the winsys |
| - glsl: fix sampler and image type checking in lower_precision |
| - glsl,gallium: add an option not to lower mediump tex & image dst |
| - radeonsi: enable 16-bit ALU, LDS, uniforms on gfx8 |
| - nir: change the type of shader_info::patch_* fields to 32 bits |
| - nir: always index SSA defs before printing |
| - nir/xfb_info: add new fields to describe 16-bit XFB better |
| - nir/opt_clip_cull_const: support GS |
| - nir: remove unused nir_io_semantics::invariant |
| - mesa: strengthen the condition that triggers generating VS with edge flags |
| - amd: replace most u_bit_consecutive* with BITFIELD_MASK/RANGE |
| - ac/nir: implement load_subgroup_id/local_invocation_index for TCS on gfx6-10.x |
| - ac/nir/tess: add if/endif for HS threads in NIR instead of ACO/LLVM |
| - ac/nir/tess: adjust memory layout of TCS outputs to have aligned store offsets |
| - radv,radeonsi: merge PATCH_CONTROL_POINT & OUT_PATCH_CP into 1 field |
| - radv,radeonsi: make TCS_OFFCHIP_LAYOUT_NUM_PATCHES not off by one |
| - radv,radeonsi: precompute and pass TCS per-vertex output stride via a user SGPR |
| - ac/nir/tess: allow passing explicit patch_offset to VMEM/LDS offset calculations |
| - ac/nir/tess: write TCS per-vertex outputs to memory as vec4 stores at the end |
| - ac/nir/tess: execute the tess level workgroup vote on all chips |
| - ac/nir/tess: if all tess levels are 0, skip per-vertex TCS output stores |
| - ac/nir/tess: indent a block for nir_if |
| - ac/nir/tess: move LDS and VMEM output masks into a new info structure |
| - radv,radeonsi: use ac_nir_tess_io_info for LDS size computation |
| - nir: add shader_info::tess::tcs_*outputs_read_by_tes* |
| - ac/nir/tess: apply no_varying to ac_nir_tess_io_info |
| - ac/nir/tess: stop using tes_inputs_read / tes_patch_inputs read for TCS & TES |
| - ac/nir/tess: compute the number of remapped VRAM outputs in common code |
| - radeonsi: replace tess_levels_written_for_tes mask with a count |
| - ac/nir/tess: rewrite tess level tracking, don't use LDS for more cases |
| - ac/nir/tess: inline mask helpers |
| - ac/nir/tess: use if-ladder to determine valid tess level components for the vote |
| - ac/nir/tess: unify computing LDS output patch size, minimize LDS bank conflicts |
| - ac/nir/tess: write TCS patch outputs to memory as vec4 stores at the end |
| - ac: set LDS limit for TCS to 32K for all chips |
| - ac: move tcs_offchip_layout into ac_shader_args |
| - ac/nir/tess: remove parameter from and simplify hs_per_patch_output_vmem_offset |
| - egl: export GL-CL interop functions from libEGL_mesa.so |
| - ac/llvm: allocate LLVM PS output variables on demand |
| - radeonsi: restructure si_emit_clip_regs |
| - radeonsi: support 8 instead of 6 clip planes with gl_ClipVertex |
| - radeonsi: use AC_EXP_PARAM_UNDEFINED for clarity |
| - radeonsi: remove a non-trivial optimization that doesn't do much |
| - radeonsi: don't refer to removed GTF CTS tests |
| - radeonsi: fix AMD_DEBUG=usellvm |
| - radeonsi: don't clamp the shadow comparison value for nir_texop_lod on gfx8-9 |
| - radeonsi/ci: update gfx8 failures |
| - radeonsi: add a workaround for a DrawTransformFeedback issue on gfx11.5 |
| - nir: add shader_info::prev_stage |
| - glsl: set prev/next_stage according to the new definition |
| - nir/opt_varyings: completely exclude mediump from type changes |
| - nir/opt_varyings: set prev_stage/next_stage if they are NONE and validate them |
| - nir/opt_vectorize_io: convert bool merge_low_high_16_to_32 to an enum |
| - nir/opt_vectorize_io: don't vectorize 16-bit IO to vec8 - it's illegal |
| - nir/opt_vectorize_io: fix vectorizing 16-bit XFB |
| - nir/xfb_info: don't merge incompatible XFB outputs to fix mediump |
| - nir: add nir_clear_mediump_io_flag |
| - glsl/spirv: link XFB before prelink_lowering |
| - nir: add shader_info::prev_stage_has_xfb |
| - nir/opt_vectorize_io: work around a 16-bit IO bug for RADV |
| - nir: return progress from nir_group_loads, nir_inline_uniforms |
| - radeonsi: fix tess regression with latest LLVM git |
| - radeonsi: fix printing asm for the first shader of merged shaders for ACO |
| - nir: rename nir_lower_io_to_scalar_early -> nir_lower_io_vars_to_scalar |
| - nir: move nir_lower_io_vars_to_scalar into its own file |
| - nir: rename nir_lower_io_to_vector -> nir_opt_vectorize_io_vars |
| - nir: rename nir_vectorize_tess_levels -> nir_lower_tess_level_array_vars_to_vec |
| - nir: move nir_io_add_intrinsic_xfb_info into its own file |
| - nir: move nir_io_add_const_offset_to_base into its own file |
| - nir: move nir_assign_var_locations to freedreno (its only use) |
| - nir: move lots of code from nir_lower_io.c into new nir_lower_explicit_io.c |
| - nir: remove unused nir_force_mediump_io & nir_unpack_16bit_varying_slots |
| - nir: move nir_recompute_io_bases into its own file |
| - nir: rename nir_lower_clip_cull_distance_arrays -> nir_lower_clip_cull_distance_array_vars |
| - nir: rename nir_lower_io_arrays_to_elements -> nir_lower_io_array_vars_to_elements |
| - nir: rename nir_lower_io_to_temporaries -> nir_lower_io_vars_to_temporaries |
| - ac/nir/lower_ngg: never export edge flags via position exports |
| - ac/nir: remove the done parameter from ac_nir_export_position |
| - ac/nir: lower ClipVertex before all position exports |
| - ac/nir/lower_ngg_gs: build streamout after lowering intrinsics |
| - ac/nir: add an option write_pos_to_clip_vertex to clip against POS |
| - ac/nir: add an option to pack clip/cull distance components to remove holes |
| - ac/nir: add LDS layout info for GSVS and XFB to ac_nir_prerast_per_output_info |
| - ac/nir: add an option not to gather values in ac_nir_gather_prerast_store_output_info |
| - ac/nir/lower_ngg_gs: split lower_ngg_gs_intrinsic into gathering and lowering |
| - radv: always use the ngg_lds_layout SGPR |
| - radv: call gfx10_get_ngg_info after NIR lowering |
| - ac/nir/lower_ngg: return LDS size for NGG VS and TES from the pass |
| - ac/nir/lower_ngg_gs: return LDS size from the pass |
| - ac/nir/lower_ngg: pack GS outputs and XFB outputs in LDS optimally |
| - ac/nir/lower_ngg: rename user_clip_plane_enable_mask -> cull_clipdist_mask |
| - ac/nir/lower_ngg_gs: cull against clip/cull distances & clip planes in GS |
| - ac/nir/lower_ngg: add & use new scalar helpers for GS loads/stores |
| - ac/nir/lower_ngg: add & use new scalar helpers for XFB loads/stores |
| - ac/nir/lower_ngg: forward constant GS & XFB output components from stores to loads for LDS |
| - ac/nir/lower_ngg: rename clip_cull_dist_mask and use it correctly |
| - ac/nir: rename clip_cull_mask parameter to clearer export_clipdist_mask |
| - ac/nir/lower_ngg: add an option not to export cull distances if the shader culls them |
| - ac/nir: remove ngg_scratch LDS ABI, allocate it in the lowering pass |
| - ac: add NGG subgroup size computation from radeonsi |
| - ac: add legacy GS subgroup size computation from radeonsi |
| - ac/nir: remove no-op loop from ac_nir_create_gs_copy_shader |
| - ac/nir: return the GS copy shader from ac_nir_lower_legacy_gs |
| - ac/nir: move gs_output_component_mask_with_stream to prerast utils |
| - ac/nir: switch legacy GS lowering to ac_nir_prerast_out completely |
| - ac/nir: remove kill_pointsize and kill_layer options from lowering passes |
| - ac/nir: rename force_center_interp_no_msaa to msaa_disabled |
| - ac/nir: eliminate sample_id/sample_pos if MSAA is disabled |
| - ac/nir: use u_foreach_bit more |
| - etnaviv,r600,v3d,virgl: report correct nir_options::support_indirect_* |
| - agx,freedreno,intel,lima,panfrost,svga,virgl,zink: fix supports_indirect_inputs |
| - nir: remove nir_shader_compiler_options::lower_all_io_to_temps |
| - glsl: fix a possible crash in gl_nir_lower_xfb_varying |
| - glsl: don't lower inputs to temps unconditionally |
| - glsl: don't lower outputs to temps unconditionally |
| - glsl: don't call nir_split_var_copies in preprocess_shader |
| - glsl: don't call nir_lower_global_vars_to_local twice in preprocess_shader |
| - nir: invert the meaning of has_indirect_* flags in nir_lower_io_passes |
| - gallium/noop: fix a crash in sampler_view_release |
| - nir: remove deprecated nir_io_dont_optimize |
| - nir/lower_io: validate that location and num_slots fit in the bitfields |
| - nir: print lowp/mediump/highp next to deref types |
| - nir: add nir_shift_channels helper |
| - nir: change nir_lower_mem_access_bit_sizes to an intrinsics pass |
| - glsl: only set has_transform_feedback_varyings for the last pre-FS shader |
| - glsl: check against varying limits using NIR shader_info after nir_opt_varyings |
| - glsl: remove gl_nir_opt_dead_builtin_varyings |
| - nir: add new pass nir_lower_io_indirect_loads |
| - nir: add new pass nir_opt_move_to_top |
| - radeonsi: use nir_opt_move_to_top for input loads |
| - nir: switch indirect IO load lowering to nir_lower_io_indirect_loads for GLSL |
| - glsl: fix MESA_GLSL_DISABLE_IO_OPT env var |
| - ac/nir: fix indexing GS inputs with non-constant vertex index on gfx9-11 |
| - zink/ci: add post-merge failures from a previous MR |
| - radv: use shared ac_ngg_compute_subgroup_info |
| - radv: use shared ac_legacy_gs_compute_subgroup_info |
| - radv: set the maximum possible workgroup size for legacy GS before linking |
| - radv: call radv_get_legacy_gs_info after ac_nir_lower_legacy_gs |
| - ac/nir: return GSVS emit sizes from legacy GS lowering and simplify shader info |
| - ac/nir: remove unnecessary 16-bit handling from pre-rast GS and XFB loads/stores |
| - ac/nir: fix mediump XFB |
| - ac/nir: handle VARYING_SLOT_VARn_16BIT the same as other slots |
| - radv: cull against clip and cull distances in the shader |
| - radv: enable W/front/back face NGG culling with multiple viewports |
| - radv: rework radv_link_shaders_info as as not be called in a loop |
| - radv: enable NGG culling for GS |
| - radv: don't include positions exports in pipeline executable stats |
| - radv: compute the number of position outputs after compilation |
| - radv: don't export cull distances if the shader culls against them |
| - radv: enable nir_opt_clip_cull_const for GS too |
| - radv: pack clip and cull distance outputs for both legacy and NGG pipelines |
| - radv: stop using LLVM LDS linking logic |
| - radeonsi: add si_shader_variant_info::clip/culldist_mask |
| - radeonsi: simplify old_vs & old_ps checking in si_update_shaders |
| - radeonsi: gather nr_pos_exports from the final NIR |
| - radeonsi: support 8 non-ClipVertex clip planes instead of 6 |
| - radeonsi: use nir_opt_clip_cull_const |
| - radeonsi: pack clip/cull distance export components |
| - radeonsi: reduce the size of 2 fields in si_shader_variant_info |
| - radeonsi: enable culling against clip/cull distances and clip planes in GS |
| - radeonsi: cull against cull distances in the shader and don't export them |
| - radeonsi: move gfx10_shader_ngg.c contents into si_shader.c |
| - radeonsi: use si_assign_param_offsets for legacy GS too |
| - radeonsi: don't count outputs with GS streams > 0 for outputs_written_before_ps |
| - radeonsi: remove unused output_type and output_usage from si_shader_info |
| - radeonsi: remove gs_input_verts_per_prim from si_shader_info |
| - radeonsi: enable 16-bit mediump IO for PS outputs only, and VS->PS with env var |
| - radeonsi: stop using LLVM LDS linking logic for the GS out LDS offset |
| - radeonsi: remove now unused LLVM LDS logic for NGG |
| - radeonsi: determine compute shader LDS size from NIR instead of LLVM |
| - radeonsi: add a comment about early prim exports |
| - radeonsi: remove all uses of NIR_PASS_V |
| - radeonsi: use shader_info::next_stage correctly |
| - aco: remove unused aco_symbol_lds_ngg_gs_out_vertex_base |
| - ac/llvm: don't declare LDS as an array for HS & GS & CS, use IntToPtr(0) |
| - ac/llvm: remove LDS linking code |
| - ac/llvm: rename misnamed get_memory_ptr -> get_shared_mem_ptr |
| - ac/nir: remove pack_clip_cull_distances option |
| - ac/nir: rename ac_nir_get_lds_gs_out_slot_offset -> ac_nir_get_gs_out_lds_offset |
| - ac/nir: remove redundant option dont_export_cull_distances |
| - nir/opt_varyings: use nir_scalar |
| - nir/opt_varyings: optimize the consumer after constant propagation and dedupli. |
| - nir/lower_io: validate locations more accurately |
| - glsl: remove most IO optimizations that are replaced by nir_opt_varyings |
| - nir/opt_vectorize_io: optionally vectorize loads with holes |
| - aco: implement upcasting 16-bit types for 32-bit color buffers in PS epilog |
| - radeonsi: recompute FS output IO bases to prevent an LLVM crash |
| |
| Martin Krastev (9): |
| |
| - svga/ci: enable vmware farm |
| - svga/ci: Increase vmware-vmx-piglit job parallelism to 4 |
| - svga/ci: return svga piglit job to automatic rules pool |
| - svga/ci: vmware farm is out; temporary relegate piglit job to manual |
| - svga/ci: vmware farm is back; restore piglit job to automatic |
| - svga/ci: enable vmware farm |
| - svga/ci: bump up piglit coverage to 100% |
| - svga/ci: bump up FDO_CI_CONCURRENT to 2 |
| - svga/ci: drop 'fraction' from deqp-vmware-vmx-piglit job config |
| |
| Martin Roukala (né Peres) (10): |
| |
| - radv/ci: move the vangogh timeouts to the jobs |
| - radv/ci: drop redundant renoir timeouts |
| - ci/ci-tron: switch to the upstream ci-tron template |
| - ci: rename all the .b2c- jobs into .ci-tron- |
| - panfrost/ci: document new t860 flakes |
| - nvk/ci: switch nouveau.ko jobs to common kernel (6.13.7) |
| - nvk/ci: split the common ci-tron parts to their own job |
| - nvk/ci: uprev the kernel to v6.16-rc2 |
| - nvk/ci: run on our jobs on ad106 |
| - zink/ci: run glcts and piglit on NVK's ad106 |
| |
| Mary Guillemard (87): |
| |
| - panfrost: Take tiler memory budget into account in pan_select_tiler_hierarchy_mask |
| - panvk: reset dyn_bufs map count to 0 in create_copy_table |
| - panvk: Take rasterization sample into account in indirect draw on v10+ |
| - panvk: Emit sample count and tile size when emitting framebuffer/tiler descriptors |
| - panfrost: Allow max effective tile size of 64x64 on v12+ |
| - panvk: Take resource index in valhall_lower_get_ssbo_size |
| - panvk: Advertise VK_EXT_shader_replicated_composites |
| - panvk: Advertise VK_EXT_depth_bias_control |
| - pan/bi: Properly lower add/sub with saturation on v11+ |
| - pan/genxml: Fix inverted logic on nr_regs |
| - panfrost: Ensure printf buffer size is lowered |
| - panfrost: Move genxml out of pan/lib |
| - pan/util: Move lcra to midgard |
| - pan/util: Move liveness logic to midgard |
| - panfrost/util: Move print_alu_type and PAN_IS_REG to midgard |
| - pan/util,midgard: Remove pan_block |
| - panfrost: Remove progress_increment from all CS builders |
| - panfrost: Remove PROGRESS_* helpers in cs_builder.h |
| - pan/bi: Flush subnormals to zero for FROUND on v11+ |
| - pan/bi: Lower ffract in bifrost_nir_algebraic on v11+ |
| - pan/genxml: Add Register File Format to common.xml |
| - pan/lib: Move pan_fixup_blend_type to pan_blend.c |
| - pan/lib: Make pan_shader_compile not GENX |
| - pan/lib: Make pan_shader_get_compiler_options not GENX |
| - pan/lib: Make pan_shader.c not GENX |
| - pan/lib: Add support for rotated 2x grid sample pattern |
| - panvk: Enable VK_SAMPLE_COUNT_2_BIT support on v12+ |
| - panfrost: Use pan_sample_pattern in pan_cmdstream |
| - panfrost: Enable 2 sample count support on v12+ |
| - panvk: Advertise support for VK_EXT_primitive_topology_list_restart |
| - panvk: Advertise support for VK_EXT_image_2d_view_of_3d |
| - panvk: Clean up some todo comments about already supported extensions |
| - pan/csf: Fix typo in cs_trace_run_idvs2 |
| - panfrost: Fix varying descriptors on v12+ |
| - panfrost: Allow up to 16 for scoreboards on CSF instructions on v11+ |
| - panvk: Depends on Panthor scoreboard information |
| - panfrost: Add cs_* helpers for v11 CSF instructions |
| - panfrost: Add indirect mode for async operations on v11+ |
| - panfrost: Fix codestyle issue with cs_nop |
| - panvk: Set proper upper limit for IDVS reg blacklist on v12+ |
| - panvk: Increase CSF scratch limits on v12+ |
| - pan/genxml: Fix typo for NEXT_SB_ENTRY |
| - panvk: Expose 2 queues on v10+ |
| - panforst: Update ForEachMacros for pan/bi |
| - pan/bi: Stop writing pan_shader_info::vs::idvs on non VS stages |
| - pan/bi: Lower VS atomics direct output store to only exec in one stage when possible |
| - panvk: Enable vertexPipelineStoresAndAtomics on ANGLE for v13+ |
| - panfrost: Make pan_encoder.h CL safe |
| - pan/genxml: Switch pan_merge to a macro |
| - libpan: Add indirect dispatch implementation |
| - panvk: Implement indirect dispatch on JM |
| - panfrost: Use panlib indirect dispatch precomp shader in Gallium |
| - panfrost: Enable indirect dispatch support on v6 and v9 |
| - panfrost: Remove pan_indirect_dispatch |
| - panvk: Rework iter_sb to be the scoreboard directly |
| - panvk: Select next iter scoreboard early |
| - panvk: Use NEXT_SB_ENTRY on v11+ and prepare for indirect wait |
| - panvk: Implement v11+ path for compute dispatch |
| - panvk: Implement v11+ path for flush_tiling |
| - panvk: Implement v11+ path for issue_fragment_jobs |
| - panvk: Only restrict iter scoreboards on v10 |
| - panvk: Remove iter_sb on v11+ |
| - pan/lib: Rewrite npot divisor algorithm |
| - pan/lib: Refactor pan_padded_vertex_count |
| - pan/lib: Inline pan_atributes.c in pan_encoder.h |
| - pan/lib: Rename pan_compute_magic_divisor to pan_compute_npot_divisor |
| - pan/lib: Add a test for pan_compute_npot_divisor |
| - pan/clc: Use hash_format_strings option with nir_lower_printf |
| - pan/clc: Lower IO as late as possible |
| - libcl: Add more UINT_MAX variants |
| - meson: make clz available in OpenCL code |
| - panvk: Fix dependency chain on indirect dispatch for JM |
| - panvk/ci: Update Mali-G52 status with full VKCTS |
| - pan/genxml: Fix wrong size for compute size workgroup |
| - panvk: Do not adjust job offset with base on CSF |
| - pan/bi: Do not allow passthrough for instructions disallowing temps |
| - pan/bi: Disallow FAU for CLPER in bi_check_fau_src |
| - panvk: Fix wrong reporting of subgroup size for executable properties |
| - nvk: Do not ignore contiguous in nvk_heap_init |
| - nir: Add NVIDIA-specific muladd intrinsics |
| - nak: Wire up coop matrix opcodes |
| - nvk: Advertise VK_KHR_cooperative_matrix |
| - panvk: Fix nullDescriptor for dynamic descriptors |
| - pan/genxml: Add missing parenthesis on pan_cast_and_pack macros |
| - pan/bi: Properly handle SWZ.v4i8 lowering on v11+ |
| - panvk: Always use varying_count in emit_varying_attrs |
| - panvk: track oq write jobs in JM |
| |
| Matt Jolly (5): |
| |
| - ci/alpine: Control Meson version |
| - ci/debian: arm64: only install Meson once |
| - ci/fedora: use \`build-bindgen.sh` to install [c]bindgen |
| - ci: add header with image tags to \`build-bindgen.sh` |
| - build: Rust: Bump minimum Meson and bindgen version |
| |
| Matt Turner (14): |
| |
| - gallivm: Use \`llvm.roundeven` in lp_build_round() |
| - meson: add wrap for libdrm |
| - intel/compiler: Align human-readable send message info |
| - compiler: Generate files with newline at end |
| - intel: Generate files with newline at end |
| - vulkan: Generate files with newline at end |
| - intel: Increase size of cooperative_matrix_configurations[] to 16 |
| - nir: Add convert_cmat_intel intrinsic |
| - brw: Implement convert_cmat_intel intrinsic |
| - brw: Use convert_cmat_intel intrinsic |
| - intel: Add support for float16 as cooperative matrix accumulator |
| - brw: Handle bfloat16 dest and src0 operands for DPAS |
| - intel: Add support for BFloat16 as cooperative matrix accumulator |
| - bin/people.csv: Prefer mattst88's \@gmail.com address |
| |
| Matthieu Oechslin (1): |
| |
| - r600: Take dual source blending in account when creating target mask with RATs |
| |
| Mauro Rossi (6): |
| |
| - intel/compiler: use ffsll instead of ffsl in brw_vue_map.c |
| - android: fix llvmpipe build rules |
| - llvmpipe: Define LP_NIR_SHADER_DUMP_DIR differently on Android |
| - llvmpipe: Use mkdir instead of std::filesystem::create_directory on Android |
| - Revert "android: Link with libc++fs up to SDK 35" |
| - radv: Fix gnu-empty-initializer error |
| |
| Maíra Canal (6): |
| |
| - ci: disable Igalia's farm |
| - v3d: make sure all jobs are done before destroying the context |
| - broadcom/ci: update expected results with kernel 6.12 |
| - bin: explicitly use \`python3` instead of \`python` |
| - vulkan: create a wrapper struct for vk_sync_timeline |
| - vulkan: don't destroy vk_sync_timeline if a point is still pending |
| |
| Mel Henning (83): |
| |
| - nak/spill_values: Spill constants across edges if needed |
| - nak: Handle idp4 ureg latencies |
| - nvk: SET_STATISTICS_COUNTER at start of meta_begin |
| - nvk: Override render enable for blits and resolves |
| - nvk: Remove dead function nvk_meta_init_render |
| - nak: Add nvdisasm_tests |
| - nak: Remove range parameter from set_atom_type |
| - nak: Fix sm90+ atomg/redg encoding |
| - nak: sm100+ texture encodings |
| - nak: Disable cbuf textures on blackwell |
| - nak: Add test for lea disasm. |
| - wsi/headless: Override finish_create |
| - nak: Remove hfma2 src 1 modifiers |
| - nak/sm70_encode: Encode fneg/fabs for hfma2 src 2 |
| - nak/sm70_encode: Remove unused has_mod parameter |
| - nak: Add an SSARef::from_iter() helper |
| - nak: Use NonZeroU32 for SSAValue and remove NONE |
| - nak: Split scalar/vec in SSABuilder::alloc_ssa |
| - nak: Return SSAValue from builder where possible |
| - nak: Use references to src/dst more places |
| - nak/from_nir: Make fault an Option<SSAValue> |
| - nak: Dst is no longer Copy |
| - nak/from_nir: Turn srcs into a closure |
| - nak: Src is no longer Copy |
| - nak: SrcRef is no longer Copy |
| - nak: CBuf and SSARef are no longer Copy |
| - nak: Support large SSARef |
| - nak: Mark Large SSARef paths as cold |
| - nak: Add Src::is_unmodified() helper |
| - nak: Check that swizzles are none |
| - nak: Call nir_opt_phi_to_bool |
| - nak: Remove #![allow(unstable_name_collisions)] |
| - nouveau/headers: Run rustfmt after file is closed |
| - nouveau/headers: Ignore PermissionError in rustfmt |
| - nak: rustfmt ir_proc.rs |
| - nak: s/HashMap::new()/Default::default()/ |
| - nak: s/HashSet::new()/Default::default()/ |
| - nak: Add a dependency on rustc-hash |
| - nak: Switch most Hash{Set,Map} uses to rustc-hash |
| - nak: Also use rustc-hash for UnionFind |
| - compiler/rust/cfg: Add a hash func type parameter |
| - nak: Use nir membar optimizations |
| - compiler/rust/bitset: find_aligned_unset_region() |
| - nak: Change parameters to try_find_unset_reg_range |
| - nak: Bias RA to find clear spaces for vector dests |
| - nak: Fix a perf regression in tex lowering |
| - nak: Call nir_lower_undef_to_zero |
| - nvk: Call ensure_slm for nvk_cmd_dispatch_shader |
| - nak/spill_values: Follow phis from src to dest |
| - nak: Forbid reordering labeled OpNop |
| - nak: Always run nak_nir_mark_lcssa_invariants |
| - nak: Change divergence analysis pass order |
| - nak: lower_scan_reduce after divergence analysis |
| - nak: Add OpRedux |
| - nak: Implement nir_intrinsic_reduce with REDUX |
| - nak: Don't swap f2fp sources in legalize |
| - nir/divergence_analysis: Update LCSSA comment |
| - nak,nir: Stop using std::mem::zeroed() |
| - compiler/rust/bitset: BitSetStream takes Key type |
| - zink: Return NULL on vkCreateInstance failure |
| - zink: Handle null instance in 2nd create_screen |
| - vulkan: Specify library_arch in ICD files |
| - nak: Add OpMatch |
| - nir: Split lower_vote_eq into int/float versions |
| - nak: Implement nir_intrinsic_vote_ieq with OpMatch |
| - nak: Call nir_opt_uniform_subgroup |
| - nir/opt_uniform_subgroup: Handle vote_ieq |
| - nir/opt_uniform_subgroup: Handle vote_feq |
| - nak: Factor dataflow logic out of SimpleLiveness |
| - nak: Use the dataflow framework for NextUseLiveness |
| - nak: Use a worklist approach for dataflow |
| - nak/nvdisasm_tests: Remove extra . for lod_mode |
| - meson: Allow unnecessary_transmutes for bindgen |
| - nir/divergence_analysis: Add NV_shader_sm_builtins |
| - zink: Assert ici.tiling != DRM_FORMAT_MODIFIER_EXT |
| - egl: Clear modifiers if we clear use_flags |
| - nouveau/headers: Stop running rustfmt |
| - nouveau/headers: Move use statements into template |
| - nouveau/headers: Factor out write_template |
| - nouveau/headers: Re-enable rustfmt |
| - nouveau/headers: Update g_nv_name_released.h |
| - nak/mark_lcssa_invariants: Invalidate divergence |
| - loader: Don't load nouveau GL on nvidia kmd |
| |
| Michael Cheng (3): |
| |
| - intel: Switch debug flags to enums to prep for bitset conversion |
| - intel: Switch uint64_t intel_debug to a bitset |
| - anv: Add new debug flag to show shader stage |
| |
| Michel Dänzer (3): |
| |
| - ci: Drop obsolete -Wno-error= stanzas |
| - amd/ci: Add cl profile on rusticl in radeonsi-raven-piglit job |
| - radeonsi: Don't assert src_va != 0 with CP_DMA_CLEAR |
| |
| Mike Blumenkrantz (208): |
| |
| - delete clover |
| - zink: verify that surface exists when adding implicit feedback loop |
| - util/dynarray: add util_dynarray_resize_zero() |
| - zink: use util_dynarray_resize_zero() for descriptor pool |
| - vk/cmd_queue: try to fix some indentation |
| - vk/cmd_queue: stop generating weird casts for free functions |
| - vk/cmd_queue: generate copies for struct-ptr members |
| - vk/cmd_queue: generate copies for string struct members |
| - egl: rename dri2_load_driver -> dri2_detect_swrast |
| - egl: fix sw fallback rejection in non-sw EGL_PLATFORM=device |
| - zink: fix broken comparison for dummy pipe surface sizing |
| - lavapipe: EXT_zero_initialize_device_memory |
| - egl: handle DRI_PRIME with surfaceless |
| - gallium: pipe_surface_release -> pipe_surface_unref |
| - util/inlines: constify pipe_surface_equal() |
| - ci: set -Wno-error=deprecated-declarations |
| - mesa: remove gl_renderbuffer::surface |
| - gallium: de-pointerize pipe_surface |
| - ci: disable panfrost-t720-gles2 |
| - llvmpipe: disable conditional rendering mem for blits |
| - lavapipe: handle counterOffset in vkCmdDrawIndirectByteCountEXT |
| - delete the XA frontend |
| - delete gallium-nine |
| - zink: delete legacy renderpasses and framebuffer objects |
| - zink: delete zink_surface_info |
| - aux/trace: handle sampler_view_destroy |
| - tc: rework resource usage tracking to be lighter |
| - gallium: delete pipe_surface::writable |
| - gallium: delete union pipe_surface_desc |
| - zink: also check for host-visible on staging uploads |
| - zink: fix queue transition check in check_for_layout_update() |
| - util/blitter: delete more pipe_context::create_surface usage |
| - util/debug: delete more pipe_context::create_surface usage |
| - util/surface: delete more pipe_context::create_surface usage |
| - gallium: kill off pipe_surface_unref_no_context() |
| - lavapipe: kill off pipe_surface objects |
| - util/tc: fix surface refcounting (by deleting it) |
| - d3d10umd: stop using pipe_surface::width/height |
| - d3d10umd: update to latest pipe_surface changes |
| - va: remove one more pipe_surface object |
| - vdpau: de-pointerize surfaces |
| - vl: delete a missed pipe_surface_reference usage |
| - util/tc: delete surface hooks |
| - aux/trace: delete surface object hooks |
| - noop: delete surface object hooks |
| - util/dd: delete surface object hooks |
| - zink: unlock instance mutex if creation fails |
| - zink: fix ZINK_RENDERDOC=all |
| - zink: remove a pipe_surface from blitting |
| - zink: slightly rework transient surface/image handling |
| - zink: remove some trivial transient surface references in rt init |
| - zink: delete another pipe_surface from msaa replicate |
| - zink: delete zink_ctx_surface::transient |
| - zink: delete zink_ctx_surface::needs_mutable |
| - zink: don't fixup depth buffer if it's already big enough |
| - zink: delete zink_ctx_surface |
| - lavapipe: undo fb remapping before poisoning memory at end of renderpass |
| - nir/lower_to_scalar: fix opt_varying with output reads |
| - zink: update renderdoc layer string for android |
| - zink: support vulkan 1.4 |
| - zink: hook up VK_KHR_unified_image_layouts |
| - zink: add a local screen variable in begin_rendering |
| - zink: add UIM feedback loop info for dynamic rendering |
| - zink: rename a sync function |
| - zink: hook up global GENERAL image layouts |
| - zink: emulated alpha formats do not require mutable |
| - zink: verify that mutable dmabufs are created with a format list |
| - util/box: make u_box_test_intersection_2d() consistent with other funcs |
| - zink: update docs for nopc |
| - zink: be slightly more precise about flagging loadop change on fb invalidate |
| - zink: extract resolve surface init to separate function |
| - zink: add format param to zink_surface_resolve_init |
| - zink: move tc inlined resolve code down a bit |
| - zink: create inline resolve surface in begin_rendering |
| - tc: fix zsbuf rp info persistence across fb states |
| - tc: explicitly terminate renderpasses on fb access |
| - tc: rework rp info incrementing |
| - tc: add resolve resource to rp info |
| - tc: add #define to disable strict resolve merging |
| - zink: add ZINK_DEBUG to skip msaa stores on renderpasses which have ended |
| - draw: silence some no-op draw debug prints |
| - ci: rename "lvp" jobs to lavapipe |
| - gallium: add compressed_surface_reinterpret_blocks_layered |
| - zink: put back the sampler barrier on fb unbind with GENERAL layouts |
| - zink: fill in sync flags for a couple barriers |
| - zink: delete barrier sync flag inferring |
| - zink: fix image transfer op batching with GENERAL layouts |
| - zink: local screen variable in zink_copy_image_buffer |
| - zink: use GENERAL layout in zink_copy_image_buffer when possible |
| - zink: fix GENERAL image copy box resets on non-transfer barrier |
| - zink: break out the fb unbind sampler barrier |
| - zink: add another aggressive sampler barrier after sequential image copies |
| - zink: skip sampler layout updates when using GENERAL layouts |
| - zink: don't use TRANSFER bit as default pipeline stage if no access is set |
| - zink: always pass through src access flags for buffer barriers |
| - zink: always unset unordered access flags when executing an ordered barrier |
| - zink: mimic buffer transfer dst noop barrier semantics with images |
| - zink: actually use GENERAL image barrier functions |
| - zink: enable image2DViewOf3DSparse with maint9 |
| - zink: require nullDescriptor feature |
| - zink: delete zink_surface::hash |
| - zink: remove resource param from apply_view_usage_for_format() |
| - zink: delete zink_surface::usage_info |
| - zink: add a pipe_resource ref to bindless descriptors |
| - zink: make pipe_resource refs for image views more consistent |
| - zink: remove pipe_resource ref from bufferviews |
| - zink: move zink_prep_fb_attachment up and make static |
| - zink: clean up null surface case in prep_fb_attachment() |
| - zink: simplify code surrounding prep_fb_attachment() |
| - zink: delete surface handling of blockTexelViewCompatibleMultipleLayers |
| - zink: unify zink_resource_object_init_mutable() calling |
| - zink: always directly create surfaces |
| - zink: delete transient attachments instead of leaking them |
| - zink: delete some redundant flagging of ctx->rp_changed |
| - zink: remove random format check in surface create |
| - zink: unify some z24 sampler clamping code |
| - zink: remove some usage of zink_surface::ivci |
| - zink: big resource view rework |
| - zink: break out fb ivci init |
| - zink: tweak zink_create_transient_surface to stop using zink_surface::ivci |
| - zink: store/use fb attachment formats directly |
| - zink: allocate keys for surface cache to stop using zink_surface::ivci |
| - zink: delete zink_surface::ivci |
| - zink: delete/simplify some bindless descriptor resource accesss |
| - zink: use a better check for current fb zsbuf in zink_clear_depth_stencil() |
| - zink: track whether fb has swapchain bound on context |
| - zink: directly use imageview format for image descriptors for depth clamping |
| - zink: directly return import2d resource |
| - zink: add layer info to bindless descriptor |
| - zink: use temp struct for implicit feedback loop detection |
| - zink: various trivial struct access changes |
| - zink: prune zink_surface down to the imageview and create/fetch on demand |
| - zink: enforce pipe_surface::texture matching the resource in zink_get_surface() |
| - zink: remove redundant pipe_resource param from zink surface funcs |
| - zink: unify bvci creation |
| - zink: use samplerview obj to check buffer rebinds in non-db path |
| - zink: use smaller keys for surface/bufferview caching and switch to sets |
| - tc: don't reuse first rp info on batch if there is work pending |
| - zink: enable ioopt by default |
| - aux/trace: always finish dumping draw/dispatch calls before triggering them |
| - gallium: add pipe_context::image_copy_buffer |
| - zink: rework zink_copy_image_buffer() |
| - zink: implement image_copy_buffer |
| - tc: use image_copy_buffer for subdata uploads instead of resource_copy_region |
| - tc: replace gpu util_copy_box with single image_copy_buffer call |
| - zink: loosen heuristic for buf2img texture upload barriers |
| - zink: don't modify the u_foreach_bit64 bit inside the loop in loop_io_var_mask() |
| - zink: make HIC image transitions public |
| - zink: use HIC image transitions to init swapchain images |
| - zink: more fixups for GENERAL image sync |
| - zink: fix another case of blocking image op reordering with GENERAL layouts |
| - zink: stop deleting src stage in image sync |
| - zink: fix acquire semaphore sync |
| - zink: fix submit_count disambiguation for bo usage checks |
| - mesa/st: handle renderbuffer with null zsbuf |
| - zink: move HIC resource usage check into hic transition function |
| - zink: always insert current batch sparse semaphore into sparse wait chain |
| - zink: use unsynchronized cmdbuf for unsynchronized image GENERAL barriers |
| - mesa: unify creation of real semaphore objects |
| - mesa: set semaphore type directly during creation |
| - mesa: PIPE_FD_TYPE_TIMELINE_SEMAPHORE -> PIPE_FD_TYPE_TIMELINE_SEMAPHORE_D3D12 |
| - zink: fix signaling multiple API semaphores |
| - mesa: support NV_timeline_semaphore |
| - zink: capture KHR_timeline_semaphore properties |
| - gallium: fix timeline semaphore value passing |
| - zink: support NV_timeline_semaphore |
| - st/clear: handle null color renderbuffer |
| - lavapipe: fix advertised depth resolve modes |
| - vulkan/cmd_queue: don't null deref when freeing pNext |
| - zink: use a rebind counter to manage descriptor rebinds |
| - zink: zero the unordered access for images upon executing an ordered barrier |
| - cso: use unbind_trailing for sampler view unbinds on context unbinds |
| - zink: only copy resource during add_bind if it is valid |
| - zink: enable img copy boxes for blit/copy ops |
| - zink: assert that batches never use a timeline value of 0 |
| - zink: slightly tighten the implicit feedback loop detection logic |
| - zink: fix UIL feedback loop usage |
| - zink: remove duplicate setting of last_write in image sync |
| - zink: unify setting general access flags in sync |
| - zink: tighten up same-batch sync |
| - zink: double-check descriptor layout creation before adding to cache |
| - zink: always create gfx shader objects with 5 descriptor layouts |
| - zink: fix qbo sync |
| - zink: always emit fb attachment sync |
| - zink: fix fb attachment usage setting |
| - tc: fix usage wrapping in busy test |
| - zink: document VK_EXT_primitive_topology_list_restart in profile |
| - zink: lock harder around memory mappings |
| - zink: reenable unsynchronized texture subdata without HIC |
| - aux/trace: sanitize draws |
| - zink: translate intensity formats |
| - gallium/hud: set the framebuffer texture when drawing |
| - zink: fix valid contents check for adding new bind |
| - lavapipe: call nir_lower_int64 |
| - zink: simplify sampler bufferview change for non-db path |
| - egl/x11: don't leak device_name when choosing zink |
| - zink: account for generated tcs when pruning programs |
| - zink: remove extra gfx prog unref during separable replacement |
| - anv: fix format compatibility check typo |
| - zink: create a dummy image for shaderdb runs |
| - kopper: fix initial swapinterval setting |
| - zink: also add access stage sync when rebinding buffers |
| - zink: fix tc buffer replacement rebind condition |
| - zink: trigger multi-context buffer invalidate on internal buffer invalidate |
| - zink: zero dynamic rendering resolve views on rp end |
| - tc: also inline depth resolves |
| - zink: fix assert for unsynchronized non-GENERAL image barriers |
| - tc: don't clobber CSO info when renderpass has ended |
| |
| Mohamed Ahmed (3): |
| |
| - nil/copy: Rename Copy16B to CopyBytes and add a copy_8b method |
| - nil/copy: Add Blackwell+ GOB layouts for host copy |
| - nil/copy.rs: Add host copy support for Fermi-Volta |
| |
| Myrrh Periwinkle (1): |
| |
| - gallium: Properly handle non-contiguous used sampler view indexes |
| |
| Nagulendran, Iswara (2): |
| |
| - amd/vpelib: Setup cost profiling support |
| - amd/vpelib: Fix VPELIB Build Warnings |
| |
| Nanley Chery (17): |
| |
| - intel: Add and use isl_surf_from_mem() |
| - intel: Add and use isl_surf_image_has_unique_tiles() |
| - intel/blorp: Disable repclear for gfx12 fast-clear |
| - intel/blorp: Drop clear color assignment prior to Xe2 |
| - intel/blorp: Redescribe gfx12.5 surfaces for CCS fast clears |
| - intel/blorp: Simplify get_fast_clear_rect() for gfx12.5 |
| - anv: Drop the slow clear heuristic |
| - intel/isl: Fix isl_surf_image_has_unique_tiles() |
| - intel/isl: Fix isl_get_sampler_clear_field_offset() |
| - intel/blorp: Use get_copy_format_for_bpb more for gfx12.5 |
| - iris: Update check for sampler field changes |
| - iris: Update clear color initializations for FCV |
| - anv: Fix an assert for ISL_FORMAT_RAW clear color update |
| - anv: Query sampler offset in set_image_clear_color() |
| - anv: Move the gfx9 sRGB sampling clear color code |
| - anv: Use genX(set_fast_clear_state)() in transition_depth_buffer() |
| - intel/blorp: Don't redescribe some Tile64 clears |
| |
| Natalie Vock (34): |
| |
| - aco: Add support for multiple ops fixed to defs |
| - aco: Add support for multiple definitions in emit_mimg |
| - aco/ra: Don't consider precolored ops/defs in get_reg_impl |
| - radv,driconf: Add radv_force_64k_sparse_alignment config |
| - driconf: Add workarounds for DOOM: The Dark Ages |
| - radv/rt: Avoid encoding infinities in box node coords |
| - radv/rt: Report 256 byte alignment for scratch |
| - aco/opt: Rename loop header phis |
| - aco: Add pseudo instr to calculate a function callee's stack pointer |
| - aco: Add scratch stack pointer |
| - aco/spill: Use scratch stack pointer |
| - aco: Add common utility to load scratch descriptor |
| - aco/isel: Use stack pointer parameter in load/store_scratch |
| - aco/isel: Don't add scratch offset as gfx8- soffset if no offsets exist |
| - aco: Introduce static_scratch_rsrc program member |
| - aco/spill: Add a null scratch offset if no scratch_offset arg exists |
| - aco/spill: Don't spill scratch_rsrc-related temps |
| - vulkan/runtime/bvh: Propagate opaqueness information through the BVH |
| - radv: Encode child opaqueness information in triangle nodes |
| - radv: Encode child opaqueness information in box nodes |
| - radv: Use common helper to set BLAS node pointer flags on gfx11+ |
| - radv/rt: Enable pointer flags on GFX11+ |
| - aco/isel: Improve vector splits for image_bvh8_intersect_ray |
| - aco: Support vector-aligned ops fixed to defs |
| - aco: Use vector-aligned operands for image_bvh8_intersect_ray |
| - radv/rt/gfx12: Always overwrite origin/dir |
| - nir,aco: Add ds_bvh_stack_rtn |
| - radv/rt: Use ds_bvh_stack_rtn |
| - aco,nir: Add support for GFX12 ds_bvh_stack_push8_pop1_rtn_b32 instruction |
| - radv/rt: Use ds_bvh_stack_push8_pop1_rtn_b32 |
| - aco/assembler: Support vector-aligned operands on DS instructions |
| - aco/ra: Add affinities for DS vector-aligned operands |
| - aco/isel: Use vector-aligned operands for ds_stack_push8_pop1_rtn_b32 |
| - radv/winsys: Support vm_always_valid in the NULL winsys |
| |
| Nicolas Dufresne (2): |
| |
| - build: Sort the tools 'all' list |
| - build: Include panfrost to the tools 'all' list |
| |
| Okenczyc, Andrzej (1): |
| |
| - amd/vpelib: Resolve query, predication and timestamp operations |
| |
| Olivia Lee (66): |
| |
| - panfrost: allow promoting sysval UBO to push constants |
| - mailmap: update my name and email |
| - panfrost: move some blend shader infrastructure into gallium driver |
| - panfrost: map sysval UBO to a fixed index |
| - panfrost: pass blend constants to blend shaders dynamically |
| - pan/bi: push blend constants to FAU |
| - panfrost: upload blend shaders to bin pool |
| - panvk: fix driconf memory leak |
| - util/u_printf: fix memory leak in u_printf_singleton_add_serialized |
| - panfrost: use cpp_args instead of c_args for C++ files |
| - pan/csf: make cs_builder.h usable from c++ |
| - pan/csf: add cs_builder unit test infrastructure |
| - pan/csf: add cs_maybe mechanism to retroactively patch cs contents |
| - pan/csf: rename cs_exception_handler to cs_function |
| - panvk/csf: set up shared register dump regions for cs functions |
| - panvk: track whether we are in a vk_meta command |
| - panvk: fix case where vk_meta is used after PROVOKING_VERTEX_MODE_LAST |
| - panvk/csf: fix case where vk_meta is used before PROVOKING_VERTEX_MODE_LAST |
| - panvk/csf: fix provoking vertex mode in partial secondary cmdbufs |
| - pan/va: allow using both FAU and small constants in the same instruction |
| - panvk: advertise VK_EXT_shader_subgroup_vote and VK_EXT_shader_subgroup_ballot |
| - panfrost: fix assertion failure compiling image conversion shaders |
| - panfrost: legalize afbc before zs and rt clears |
| - panfrost/ci: smoke test AFBC-P in CI |
| - panvk: advertise VK_KHR_depth_clamp_zero_one |
| - panvk: refactor depth range logic |
| - panvk: implement VK_EXT_depth_clip_control |
| - panvk: Add tests for ls tracking in cs_maybe |
| - ci: disable vmware farm |
| - panvk: add missing include to panvk_cmd_draw.h |
| - panvk: fix outdated comment |
| - panvk: make extensions, features, and properties per-arch |
| - panvk: use constants from other headers in device properties |
| - panvk: move descriptor set limits to constant macros |
| - panvk: advertise texel buffer dynamic indexing |
| - panvk: allow sysvals to be dynamically-indexed |
| - panvk: implement shaderInputAttachmentArrayDynamicIndexing |
| - panvk: implement runtimeDescriptorArray and descriptorBindingVariableDescriptorCount |
| - panvk: implement non uniform indexing except for input attachments |
| - panvk: implement shaderInputAttachmentArrayNonUniformIndexing |
| - panvk: raise descriptor limits on valhall |
| - panvk: advertise descriptor indexing on valhall |
| - ci: document process for disabling broken farms in more detail |
| - ci: update CI contact list |
| - panvk: return error when mmap fails in BindImageMemory2 and BindBufferMemory2 |
| - panvk: factor BindImageMemory2 loop into function |
| - panvk: implement VkBindMemoryStatus |
| - panvk: allow binding VK_NULL_HANDLE index buffers |
| - panvk: advertise VK_KHR_maintenance6 on v10+ |
| - panfrost: fix depth clip range with u_blitter |
| - bin/people.csv: update my name/email |
| - pan/shared: fix typo in pan_tiling doc comments |
| - panvk: document vulkan 1.3 in new_features.txt |
| - docs: document GL_EXT_shader_clock in new_features.txt |
| - panvk: add error checking for dump/trace mmap call |
| - pan/kmod: fix propagation of MAP_FAILED in pan_kmod_bo_mmap |
| - panvk: don't report features for image formats that are only usable as vertex buffers |
| - panvk: store BO offset in panvk_image_plane |
| - panvk: implement VK_EXT_host_image_copy for linear color images |
| - panvk: split out helper function for checking AFBC support |
| - pan/shared: add function to copy between two tiled images |
| - panvk: implement VK_EXT_host_image_copy for tiled images |
| - pan/shared: refactor pan_tiling |
| - panfrost: add support for (de)interleaving Z24S8 in pan_tiling |
| - panvk: implement VK_EXT_host_image_copy for depth/stencil images |
| - panvk: advertise vulkan 1.4 on v10+ |
| |
| Patrick Lerda (37): |
| |
| - mesa_interface: fix legacy dri2 compatibility |
| - r600: implement EXT_shader_image_load_store |
| - r600: add a constant representing gs vertex indirect total |
| - r600: implement gs indirect load_per_vertex_input |
| - radeon/evergreen: improve depth24_stencil8 mipmap behavior |
| - r600: add r600_lower_tess_io metadata compatibility |
| - r600: add r600_lower_shared_io metadata compatibility |
| - r600: add r600_nir_lower_int_tg4 metadata compatibility |
| - r600: add r600_nir_lower_txl_txf_array_or_cube metadata compatibility |
| - r600: fix pop-free clipping |
| - r600: index_bias should be forced to zero for all indirect draw calls |
| - r600: add an unlikely() to synchronize with the index_bias fix |
| - r600: fix index buffer with offset |
| - r600: restructure the atomic function for the next update |
| - r600: unlock cayman number of atomic counters |
| - r600: bump evergreen atomic counters to twelve |
| - r600: enable ARB_transform_feedback_overflow_query |
| - r600: enable trivial ARB_shader_group_vote |
| - r600: improve command stream constraints |
| - r600: implement ARB_shader_draw_parameters |
| - r600: implement ARB_indirect_parameters |
| - r600: bump evergreen and cayman glsl level to 460 |
| - r600: handle cayman border color sint formats |
| - r600: clean up and refactor texture_buffer_sampler_view() |
| - r600: clamp to max_texel_buffer_elements |
| - r600: update the documentation of the extensions |
| - r600: make vertex r10g10b10a2_snorm conformant on palm and beyond |
| - r600: fix emit_image_load_or_atomic() snorm formats |
| - r600: enable AMD_framebuffer_multisample_advanced |
| - r600: fix emit_ssbo_store() wrmask compatibility |
| - r600: set never as the depth compare function when depth compare is disabled |
| - r600: fix rv770 border color |
| - dri: fix image_loader_extensions array |
| - dri: complete the support for ARGB4444 |
| - r600: refactor r600_is_buffer_format_supported() for the next update |
| - r600: fix remaining pbo issues |
| - r600: fix arb_shader_image_load_store incomplete |
| |
| Paul Gofman (1): |
| |
| - radv/amdgpu: Fix hash key in radv_amdgpu_winsys_destroy(). |
| |
| Paulo Zanoni (3): |
| |
| - anv/trtt: don't avoid the TR-TT submission when there is stuff to signal |
| - intel/isl: don't clamp num_elements to (1 << 27) |
| - brw: properly decode TGL_PIPE_SCALAR |
| |
| Pavel Gribov (1): |
| |
| - radv: small fix for sam check |
| |
| Pavel Ondračka (2): |
| |
| - r300/ci: add two flakes |
| - r300: minor fix for backend writer/reader detection |
| |
| Petar G. Georgiev (1): |
| |
| - freedreno/fdl: Add support for RGB888/BGR888 pipe formats in render buffer creation |
| |
| Peyton Lee (5): |
| |
| - radeonsi/vpe: enhance scaling quality |
| - radeonsi/vpe: remove background color |
| - radeonsi/vpe: correct capability |
| - amd: add vpe_version |
| - amd/gmlib: remove the executable bit |
| |
| Philip Rebohle (1): |
| |
| - radv: Remove offset parameter from radv_make_texel_buffer_descriptor. |
| |
| Philipp Zabel (1): |
| |
| - teflon: Allow per-axis quantization |
| |
| Pierre-Eric Pelloux-Prayer (52): |
| |
| - winsys/amdgpu: disable VM_ALWAYS_VALID |
| - radeonsi/tests: use proper skip file |
| - radeonsi: fix potential use after free in si_set_debug_callback |
| - ac/nir: init blake3 for cs blit shader |
| - radeonsi: skip blit incompatible scenarios |
| - radeonsi: init use_aco properly when llvm is disabled |
| - winsys/radeon: add surface_offset_from_coord |
| - radeonsi: allow msaa sparse textures on gfx10+ |
| - radeonsi: skip draws using invalid setup |
| - amd/ci: remove references to tests that don't exist anymore |
| - radeonsi/tests: fix script handling of missing results |
| - radeonsi: allow sparse depth textures |
| - ac/llvm: rework component trimming in visit_tex |
| - aco/isel: fix visit_tex handling of is_sparse |
| - radeonsi/tests: update expected results |
| - radeonsi: fix attribute_pos_prim_ring handling |
| - radeonsi: check set_debug_callback before use |
| - winsys/amdgpu: remove return value from amdgpu_init_cs_context |
| - radeonsi: use mesa_loge instead of fprintf |
| - winsys/amdgpu: use mesa_loge instead of fprintf |
| - radeonsi: use si_set_buf_desc_address helper |
| - radeonsi: fix typo in sdma code |
| - ac/pm4: determine spi_shader_pgm_lo_reg when PKT3_SET_SH_REG_PAIRS is used |
| - radeonsi: enable SQTT for more chips |
| - radeonsi: transition to TC-compatible HTILE on full clears |
| - util: add util_sync_provider::clone |
| - ac/info: rename has_local_buffers into has_vm_always_valid |
| - ac/virtio: port to vdrm functions |
| - ac/drm: remove ac_drm_cs_create_syncobj |
| - ac/drm: store a util_sync_provider in ac_drm_device |
| - ac/drm: replace direct ioctl calls by util_sync_provider |
| - amd/virtio: add vpipe support |
| - radeonsi/virtio: support vpipe |
| - radv/virtio: support vpipe |
| - radv: disable extension depending on timeline syncobj when missing |
| - radv: rework VM_ALWAYS_VALID handling |
| - ac/info: use u_sync_provider instead of DRM_CAP_SYNCOBJ(_TIMELINE) |
| - ac/info: add ac_drm_query_pci_bus_info |
| - ac/virtio: support timeline syncobj |
| - amd/virtio: clear DRM_SYNCOBJ_WAIT_FLAGS_WAIT_FOR_SUBMIT |
| - ac/virtio: fix alignment of metadata command |
| - ac: fix potential overflows |
| - ac: fix invalid array size |
| - radeonsi: fix potential overflows |
| - tc: fix potential overflows |
| - mesa/program: remove useless assignment |
| - egl: cast printf argument to the expected type |
| - aux/indices: remove useless lines |
| - util/texcompress: fix comparison warning |
| - mesa: fix total_invocations computation |
| - frontends/va: fix potential overflows |
| - bufferobj: init the return value for GetParam functions |
| |
| Pohsiang (John) Hsu (24): |
| |
| - pipe: add PIPE_VIDEO_CAP_ENC_MAX_LONG_TERM_REFERENCES_PER_FRAME for H264/H265 encode |
| - d3d12: Add support for retrieving PIPE_VIDEO_CAP_ENC_MAX_LONG_TERM_REFERENCES_PER_FRAME for H264/H265 encode |
| - pipe: add PIPE_VIDEO_CAP_ENC_MAX_DPB_CAPACITY for H264/H265 encode |
| - d3d12: Add support for retreiving PIPE_VIDEO_CAP_ENC_MAX_DPB_CAPACITY for H264/H265 encode |
| - d3d12: enable D3D12_VIDEO_ENCODER_CODEC_CONFIGURATION_HEVC_FLAG_ENABLE_LONG_TERM_REFERENCES when max_num_ltr_frames > 0 |
| - d3d12: fix configuration flag for D3D12_VIDEO_ENCODER_CODEC_CONFIGURATION_HEVC_FLAG_DISABLE_LOOP_FILTER_ACROSS_SLICES |
| - util: fix msvc build warning 4146 (unary minus operator applied to unsigned type, result still unsigned) |
| - gallium/auxiliary: fix msvc build warning 4146 (unary minus operator applied to unsigned type, result still unsigned) |
| - mediafoundation: Add mediafoundation frontend |
| - mediafoundation: add ETW event for perf analysis |
| - mediafoundation: get device vendor id, device id, and driver version |
| - mediafoundation: on use LTR, synchronize the active ltr bitmap to the one passed in |
| - mediafoundation: add mechanism to disable async and h.264 unwrapped POC (commented out for now) according to gpu/version |
| - mediafoundation: move readme.md to docs folder |
| - mediafoundation: Allow to build multiple codec MFT DLLs in one build pass |
| - ci: build mediafoundation frontend in windows-msvc |
| - mediafoundation: Disable level check to unblock 8K x 8K HEVC encode for now. |
| - mediafoundation: periodic clang-format |
| - mediafoundation: fix slice capability check, and fix the slice mb mode, remove slice mode 2 |
| - d3d12: fix failure when building with v1.717.0-preview and running on Windows 11 without Agility Pack |
| - mediafoundation: make file version the same as product version, and increment version to 1.0.6 |
| - mediafoundation: fix cropping flag when using software sample |
| - mediafoundation: fix build after updating sdk to 26100.4188 |
| - mediafoundation: don't send METransformNeedInput when in Flush/Drain |
| |
| Qiang Yu (17): |
| |
| - nir/opt_varyings: fix mesh shader miss promote varying to flat |
| - gallium: remove ir arg from pipe_screen.get_compiler_options |
| - mesa: do not assume geometry shader when init |
| - Revert "gallium: add drawid_offset to draw_mesh_tasks interface" |
| - mesa: remove gl_program.Target |
| - radeonsi: check render feedback only for fragment shader |
| - radeonsi: use mask for uses_bindless_samplers/images |
| - radeonsi: fix gfx11 ngg shader emit |
| - ac,radv: move mesh_fast_launch_2 to ac |
| - radeonsi: support more than 64 options for AMD_DEBUG |
| - ac: parse ib for mesh shader dispatch packets |
| - radv: change mesh shader gs_vgpr_comp_cnt for gfx11 |
| - nir: fix PRIMITIVE_INDICES mistreated as varying |
| - nir/recompute_io_bases: fix for per primitive IO |
| - ac,radv: move nir_load_task_ring_entry_amd to ac |
| - ac,radv: move mesh scratch ring constants to ac |
| - ac,radv: move nir_load_ring_mesh_scratch_offset_amd to ac |
| |
| Renato Pereyra (2): |
| |
| - pps: Report available counters when gpu.counters* data source is registered |
| - pps: Generate libgpudataprofiling.so from pps-producer sources for Android CTS |
| |
| Rhys Perry (131): |
| |
| - aco/gfx12: don't use second VALU for VOPD's OPX if there is a WaR |
| - aco: rename is_opy_only to can_be_opx |
| - aco: add are_src_banks_compatible helper for VOPD creation |
| - aco: refactor can_use_vopd so that it returns flags |
| - aco/gfx12: assume VOPD with two v_mov_b32 are src bank compatible |
| - aco/gfx12: VOPD src0/1 are src bank compatible if they are the same vgpr |
| - aco: combine VALU lanemask hazard into VALUMaskWriteHazard |
| - aco: init vm_vsrc/sa_sdst from depctr_wait |
| - aco: remove va_vdst/vm_vsrc/sa_sdst variables |
| - aco: use v_perm_b32 for do_pack_2x16 on gfx10+ |
| - aco: use v_perm_b32 for byte swaps within a VGPR on gfx10 |
| - aco/gfx11: create waitcnt for workgroup vmem barriers |
| - aco/ra: update_renames() before add_subdword_definition() |
| - aco/ra: use a correct stride for subdword get_reg_impl |
| - aco: fix get_temp_reg_changes with clobbered operands |
| - aco: improve spilling of clobbered operands |
| - aco/ra: change sorting in compact_relocate_vars |
| - aco/ra: don't require alignment for NPOT SGPR temporaries |
| - aco: fix compact_relocate_vars fallback with scc/exec/m0 precolored regs |
| - aco/ra: cleanup compact_relocate_vars fallback path |
| - aco/ra: add ra_test_policy::use_compact_relocate |
| - aco/tests: add pseudo-scalar transcendental and fallback path RA tests |
| - aco/gfx12: use s_add_u64 |
| - aco/gfx12: use s_sub_u64 |
| - aco: increase max_const_offset_plus_one for SMEM load_global |
| - aco/gfx12: increase maximum smem offset |
| - aco/gfx12: increase maximum global/scratch offset |
| - aco/gfx12: increase maximum vbuffer offset |
| - aco: fix max_const_offset_plus_one overflow |
| - aco: replace max_const_offset_plus_one with max_const_offset |
| - vtn: use nir_const_value_for_raw_uint for bfloat SpecConstantOp/FConvert |
| - util: silence -Wstringop-overread in SHA1 |
| - nir/opt_shrink_vectors: add assume to silence warning |
| - nir/lower_gs_intrinsics: silence warning |
| - aco: swap the correct v_mov_b32 if there are two of them |
| - aco: add smem opcode helper |
| - aco/gfx12: use s_load_dwordx3 to load ray launch sizes |
| - aco: prepare for dwordx3 smem loads |
| - aco/gfx12: select dwordx3 smem loads |
| - radv/gfx12: use dword3 smem loads for push constants |
| - aco: support 8/16-bit loads in smem_combine() |
| - aco/gfx12: allow 8/16-bit smem loads |
| - nir/search: extend swizzle_y |
| - nir/algebraic: optimize ior(unpack_4x8, unpack_4x8<<8) to unpack_32_2x16 |
| - ac/nir: round components when lowering 8/16-bit loads to 32-bit |
| - ac/nir: allow less than one register of overfetch |
| - ac/nir: allow vectorization of unsupported 8/16-bit loads |
| - ac/nir: allow 8/16-bit smem loads |
| - aco: support more opcodes in apply_ds_extract |
| - aco: support sign extension in apply_load_extract |
| - aco: don't check dst_bitsize in apply_load_extract |
| - radv: perform nir_opt_access before the first radv_optimize_nir |
| - util: fix float to bfloat16 conversion with NaN |
| - ac/llvm: correctly split vector 8/16-bit stores |
| - ac/llvm: correctly set alignment of vector global load/store |
| - aco/gfx115: consider point sample acceleration |
| - aco: fix wait_entry::join() when changing vmem_types |
| - aco: rename ops_fixed_to_def to tied_defs |
| - aco/ra: replace skip_renaming with copy_kill |
| - aco/ra: move optimize_encoding earlier |
| - aco: set copy-kill for tied operands with the same temporary |
| - aco/ra: rewrite handling of tied definitions |
| - aco/tests: add tests for tied definitions |
| - aco/tests: Add tests for vector-aligned operands |
| - aco: consider image_msaa_load a sample operation before gfx12 |
| - aco/hard_clauses: simplify and complete get_type() |
| - nir: fix unpack_unorm_2x16/unpack_snorm_2x16 constant folding |
| - radv: fix too large shift exponent in radv_remove_color_exports |
| - nir: fix signed overflow in pack_half_2x16 constant folding |
| - nir: fix left shift of negative value in ibfe constant folding |
| - nir/load_store_vectorize: stabilize entry sort |
| - radv/meta: use unsigned min in copy/fill shaders |
| - aco: set vmem_types for args_pending_vmem |
| - aco: only join barrier_imm/barrier_events for logical edges |
| - aco: add wait_entry::vm_mask |
| - aco: add wait_entry::logical_events |
| - aco: skip waitcnt between two vmem writing different halves |
| - aco: skip waitcnt between two vmem writing different lanes |
| - aco: sometimes join linear wait entries on logical edges |
| - aco: make all wait entries linear |
| - aco: don't consider gfx1153 to have point sample acceleration |
| - aco/gfx12: fix VALUReadSGPRHazard with carry-out |
| - aco: update VALUReadSGPRHazard comment |
| - ac/nir: run nir_lower_vars_to_ssa after nir_lower_task_shader |
| - meson: add dep_llvm to radv_tests |
| - ac/nir: create lowered inverse_ballot |
| - ac/llvm: fix overloading of intrinsic names |
| - ac/llvm: fix mul24 intrinsic overloading |
| - ac/llvm: convert to integer after reductions |
| - ac/llvm: rewrite shuffle waterfall loop |
| - ac/llvm: use ds_bpermute_b32 for GFX12 wave64 |
| - nir/uub: improve ishl/imul with constant sources |
| - nir/uub: improve iand with constant sources |
| - nir/uub: improve ior/ixor with constant sources |
| - nir: simplify nir_addition_might_overflow |
| - aco/lower_branches: keep blocks with multiple logical successors |
| - aco: continue when try_remove_simple_block can't remove a predecessor |
| - aco: validate that preds/succs match |
| - aco/tests: add lower_branches tests |
| - util/stats: fix signness of c types |
| - util/stats: use PRIdx/PRIux format constants |
| - util/stats: fix XML type validation |
| - util/stats: add serialize helpers |
| - radv: use vk_outarray for pipeline statistics/executables |
| - radv,aco: switch to shader statistics framework |
| - nir/lower_bit_size: fix bitz/bitnz |
| - nir/search: fix dumping of conversions |
| - aco: update ctx.block when inserting discard block |
| - aco/ra: fix repeated compact_linear_vgprs() in get_reg() |
| - aco: handle FPAtomicToDenormModeHazard |
| - aco: count flat as vmem in statistics |
| - ac/nir: don't combine multiple non-constant offsets into a global access |
| - aco/gfx6: disallow global access with sgpr address and two offsets |
| - aco: use addition helper in emit_load |
| - aco: add too-large constant offset to the address instead of the offset |
| - aco: simplify the load callback |
| - aco/gfx6: allow vgpr offset for global access with sgpr address |
| - aco/gfx6: allow both constant and gpr offset for global with sgpr address |
| - aco: increase global constant offset limit slightly |
| - aco: return a format in lower_global_address |
| - aco: simplify waitcnt insertion for flat access |
| - aco: optimize lds-only or vmem-only flat access |
| - aco: use MUBUF for global access with SGPR address on GFX7/8 |
| - aco: remove RegClass::as_subdword |
| - nir/divergence: ignore boolean phis for ignore_undef_if_phi_srcs |
| - nir/cf: have nir_remove_after_cf_node remove phis at the start too |
| - nir/search: check variable requirements even if it's already seen |
| - nir/uub: fix 8/16-bit overflow |
| - nir/load_store_vectorize: check for interfering shared2 before vectorizing |
| - nir/load_store_vectorize: set is_store for shared append/consume |
| - aco/ra: set late-kill for operands of temporary p_create_vector |
| |
| Ricardo Garcia (1): |
| |
| - radv: Ignore image barrier queue families if equal |
| |
| Rob Clark (72): |
| |
| - ci: Re enable fd-farm |
| - tu: Add TU_DEBUG=comm |
| - tu/virtio: Use util_get_command_line() helper |
| - freedreno/virtio: Use util_get_command_line() helper |
| - freedreno/ir3: Fix tess/geom asan error |
| - freedreno: Fix shader-clock when kernel exposes UCHE_TRAP_BASE |
| - ci: Disable fd-farm |
| - freedreno: Avoid repeated device lookup |
| - freedreno: Don't autotune if we are already sysmem |
| - freedreno: Fix compute caps |
| - freedreno: Use non-draw batches for compute contexts |
| - freedreno: correct resource usage check |
| - freedreno: A couple features.txt updates |
| - freedreno: Fold X1-85 back into existing dev table entry |
| - freedreno: Enable the X1-45 |
| - freedreno/fdl: Add fd6_texture_format_supported() |
| - vtn: Handle non-32b tex dests |
| - freedreno: Remove obsolete comment |
| - freedreno/a6xx: Fix max_threads_per_block calc |
| - freedreno/a6xx: cl_khr_image2d_from_buffer support |
| - freedreno/cffdec: Skip printing null descriptors |
| - freedreno/a6xx: Fix buffer clears |
| - util+tu: Add util_format_is_float16() |
| - util: Add a util_blitter_blit_with_txf() helper |
| - freedreno: Add custom f16 blit shader |
| - freedreno/a6xx: Don't use 2d blit path for f16 |
| - nir: Fix lower_readonly_images_to_tex bitsize |
| - freedreno/a6xx: Fix max_threads to account for reg footprint |
| - freedreno/decode: Add a simple util to dump csv reg list |
| - freedreno/a6xx: Fix SP_CS_IBO address on a7xx |
| - freedreno/registers: Fix a couple missing variants |
| - freedreno/registers: Some reg64 conversion |
| - freedreno: Rename IBO -> UAV |
| - freedreno/a6xx: The great register renaming |
| - freedreno/a6xx: Fix thread calc for dummy kernels |
| - rusticl: Fix work group size validation |
| - dri: Remove unused prototype |
| - dri: Don't iterate the planes for non-YUV |
| - dri: Remove plane info for RGB formats |
| - freedreno: Add register renaming to .git-blame-ignore-revs |
| - freedreno: Add missing CP_INDIRECT_BUFFER |
| - freedreno/a6xx: Add single common source for a6xx.xml.h |
| - freedreno/a6xx: Split enums out of a6xx.xml |
| - freedreno/a6xx: Split descriptors out into their own file |
| - freedreno/a6xx: Don't try to tile NPoT formats |
| - freedreno/layout: Support for NPoT formats |
| - dri2+gallium: Support to import suboptimal formats |
| - dri: Correct handle-usage flags |
| - freedreno/a6xx: Allow suboptimal sampling formats when requested |
| - freedreno: Avoid unnecessarily aligning to gmem_align_w |
| - freedreno: Advertise external_only if we can't render |
| - freedreno: Misc whitespace cleanups |
| - freedreno/layout: Introduce fdl_image_params |
| - turnip: convert to fdl6_layout_image |
| - freedreno: Don't call setup_slices for buffers |
| - freedreno: Add helper to initialize layout params |
| - freedreno: Pass layout type to backend |
| - freedreno: Move layout init to backends |
| - freedreno/a5xx: Convert to fdl5_layout_image |
| - freedreno/a6xx: Convert to fdl6_layout_image |
| - freedreno/layout: Remove unneeded shims |
| - freedreno: Pass handle down to layout_resource_for_modifier |
| - freedreno/a6xx: Use handle for explicit layout |
| - freedreno/a6xx: Refactor handle import |
| - freedreno/a6xx: Use metadata to resolve INVALID modifier |
| - freedreno: Move layout to layout_resource_for_handle() |
| - freedreno: dump layout in import path as well |
| - drm-uapi: update drm_fourcc.h |
| - mesa/main: Add MESA_FORMAT_RGB_UNORM16 |
| - dri2: Allow allocating suboptimal for sampling |
| - dri: Add additional 16/32b float/int formats |
| - gbm: Add more formats |
| |
| Robert Mader (9): |
| |
| - llvmpipe: Fix dmabuf import paths for DRM_FORMAT_YUYV variants |
| - llvmpipe: Preparations for multiplanar formats |
| - llvmpipe: Enable support for multiplanar formats |
| - egl: Remove check for GL or GLES |
| - egl: Fixes for eglQueryContext and RESET_NOTIFICATION_STRATEGY |
| - drm-uapi: update drm_fourcc.h to latest version |
| - mesa/formats: Add support for 10 and 12 bit SW decoder YCbCr formats |
| - gles: Add support for 10/12/16 bit SW decoder YCbCr formats |
| - llvmpipe: Fix parameter queries for mulitplanar formats |
| |
| Rohan Garg (15): |
| |
| - anv: re enable compression for CPS surfaces on platforms other than Xe |
| - intel/compiler: use a immediate when doing the shift |
| - anv: use the common function for detecting a mesh shader stage |
| - compiler/types: add a bfloat16 type |
| - spirv: construct a bfloat16 from the given SPIR-V bitsize and encoding |
| - spirv: Enable bfloat16 capabilities |
| - anv: enable VK_KHR_shader_bfloat16 |
| - anv: use the float qualifier to denote the right value |
| - anv: Support fast-clears in vkCmdClearDepthStencilImage |
| - anv: drop anv_can_hiz_clear_ds_view in favor of anv_can_hiz_clear_image |
| - nir/lower_io: add io_offset support for more intrinsics |
| - brw: introduce MEMORY_LOGICAL_ADDRESS_OFFSET to encode address offsets |
| - brw: encode the offset into the message descriptor for Xe2 |
| - brw: store the buffer offset for load/store intrinsics |
| - brw: run the nir_opt_offsets pass and set the maximum offset size |
| |
| Roland Scheidegger (2): |
| |
| - llvmpipe: Fix crash when using nir_tex_src_min_lod |
| - llvmpipe: Improve persepctive correction with centroid/sample interpolation |
| |
| Romaric Jodin (7): |
| |
| - panvk: Advertise support for VK_KHR_shader_integer_dot_product |
| - panfrost: bi_builder.h.py: stop generating unneeded parentheses |
| - panfrost: Update bifrost_isa.py to support saturate for valhall |
| - panvk: add support for {s,u}dot_4x8_{sat} |
| - panvk: fix typo in printf in bi_validate |
| - panvk: fix bi_emit_alu calling bi_make_vec_to |
| - nir/lower_bit_size: Avoid round-trip conversion when possible |
| |
| Ruijing Dong (1): |
| |
| - radeonsi/vcn: vcn5 av1 decoding context buffer fix |
| |
| Russell Greene (1): |
| |
| - radeonsi: vpe: fix noisy false error |
| |
| Ryan Houdek (1): |
| |
| - tu: Fixes missing sse2 flags |
| |
| Ryan Mckeever (14): |
| |
| - panvk: Prepare cmd_prepare_push_uniforms() for multiDrawIndirect |
| - panvk: Prepare cmd_prepare_shader_res_table() for multiDrawIndirect |
| - panvk: Implement multiDrawIndirect for v10+ |
| - panvk: Implement CmdDraw[Indexed]IndirectCount for v10+ |
| - panvk: Advertise KHR_draw_indirect_count for v10+ |
| - panvk: Enable VK_EXT_load_store_op_none |
| - frontend/dri: Replace tabs with spaces |
| - st/mesa: Do not specify PIPE_FORMAT_R8G8B8X8_UNORM twice for RGB |
| - gallium: Support RGB888/BGR888 formats |
| - gbm/dri: Support RGB888/BGR888 formats |
| - egl/wayland: Set EGL_NATIVE_VISUAL_ID for EGLConfig |
| - egl/wayland: Support RGB888/BGR888 formats |
| - st/mesa: Support RGB888/BGR888 formats |
| - pan/format: Add render target support for R8G8B8/B8G8R8 formats |
| |
| Sagar Ghuge (17): |
| |
| - intel/genxml/xe3: Update 3STATE_BTD field |
| - anv: Update RT dispatch globals to use 64bit data structure |
| - intel/compiler: Pass around intel_device_info parameter in helper |
| - intel/compiler: Update MemHit data structure to 64-bit version |
| - intel/compiler: Update MemRay data structure to 64-bit |
| - intel/rt: Update BVH instance leaf load for Xe3+ |
| - anv: Enable 64bit memory structure mode for RT |
| - intel/compiler: Fix stackIDs on Xe2+ |
| - anv: Fix untyped data port cache pipe control dump output |
| - anv: Drop unused helper cmd_buffer_dispatch_kernel |
| - anv: Apply flush during WriteAccelerationStructures |
| - anv: Drop unused anv_rt_bvh_build_method enum |
| - intel/common: Add helper for compute thread group dispatch size |
| - iris: Set TG size based on number of threads |
| - blorp: Set TG size based on number of threads |
| - anv: Set TG size based on number of threads |
| - intel/compiler: Drop unused param from set_memory_address |
| |
| Samuel Pitoiset (189): |
| |
| - radv: use radv_fill_memory() in the accel struct path |
| - radv: add radv_fill_image() helper |
| - radv: simplify radv_fill_xxx() helpers |
| - radv: re-introduce the compute vs CP DMA heuristic for copy/fill operations |
| - ac/gpu_info,radv: add scratch_wavesize_granularity info |
| - ac,radeonsi: rework computing scratch wavesize and tmpring register |
| - radv: compute the optimal scratch wavesize |
| - radv: use common scratch tmpring size programming |
| - radv: use radeon_set_sh_reg() for only 1 DWORD |
| - radv: remove redundant assertion when emitting PS epilog state |
| - radv: simplify combining TES/VS+GS config registers |
| - radv: tidy up radv_emit_vertex_shader() |
| - radv: tidy up radv_emit_geometry_shader() |
| - radv: tidy up radv_emit_ps_epilog_state() |
| - radv: add a function to emit meshlet registers on GFX11+ |
| - radeonsi: fix configuring compute scratch |
| - radv/meta: simplify dealing with image layouts for blits/resolves |
| - radv/meta: remove unused functions/prototypes |
| - radv/meta: rename more buffer->memory for fill/copy/update operations |
| - radv/meta: remove unnecessary radv_meta_blit2d_buffer::bs |
| - radv/meta: remove redundant parameter to blit_surf_for_image_level_layer() |
| - radv: set radv_disable_dcc=true for WWE 2k23 |
| - radv: fix re-emitting VRS state when rendering begins |
| - radv: do not clear unwritten color attachments with dual-source blending |
| - radv: disable SINGLE clear codes to workaround a hw bug with DCC on GFX11 |
| - radv: fix GPU hangs with image copies for ASTC/ETC2 formats on transfer queue |
| - spirv: Update the JSON and headers |
| - vulkan: Update XML and headers to 1.4.314 |
| - radv: promote VK_EXT_robustness2 to VK_KHR_robustness2 |
| - radv: ignore radv_disable_dcc_stores on GFX12 |
| - radv: ignore conditional rendering with vkCmdTraceRays* |
| - radv: adjust computing the PC from the trap handler on GFX12 |
| - radv: fix save/restore SCC in the trap handler on GFX12 |
| - radv: fix configuring TRAP_PRESENT for compute shaders on GFX12 |
| - radv,aco: dump more SQ_WAVE registers from the trap handler on GFX12 |
| - aco: adjust an assertion in select_trap_handler_shader() |
| - vulkan: Update XML and headers to 1.4.315 |
| - vulkan: add support for VK_IMAGE_LAYOUT_ZERO_INITIALIZED_EXT |
| - ac/gpu_info: add has_zerovram_support |
| - radv: implement VK_EXT_zero_initialize_device_memory |
| - radv: advertise VK_EXT_zero_initialize_device_memory |
| - radv: ignore radv_zero_vram=true if zeroInitialDeviceMemory is enabled |
| - radv: fix SDMA copies for linear 96-bits formats |
| - radv: fix emitting dynamic viewports/scissors when the count is static |
| - radv: remove the optimization for equal immutable samplers |
| - radv: fix fetching conditional rendering state for DGC preprocess |
| - radv: fix conditional rendering with DGC and non native 32-bit predicate |
| - ac/gpu_info: allow 32-bit predicate on GFX11+ |
| - vulkan/runtime: fix a wrong assert that validates timeline semaphores |
| - radv: expose VK_EXT_zero_initialize_device_memory unconditionally |
| - ac/gpu_info: rename has_zerovram_support to has_default_zerovram_support |
| - radv: fix missing texel scale for unaligned linear SDMA copies |
| - radv: add vk_format_is_96bit() |
| - radv/meta: move and rename get_r32g32b32_format() |
| - radv: fix non-indexed draws with primitive restart enable |
| - radv: adjust comments describing GDS needs |
| - radv: rename a variable in gfx10_copy_shader_query_ace() |
| - radv: stop reserving NGG streamout counters |
| - radv: increase size of the buffer for emulated queries on GFX12 |
| - radv: declare and emit NGG_QUERY_BUF_VA on GFX11-GFX11.5 |
| - radv: stop using GDS for emulated prims gen/xfb queries on GFX11-GFX11.5 |
| - ac/gpu_info: add has_cp_dma_with_null_prt_bug |
| - radv: fix CP DMA with NULL PRT pages on GFX8-9 |
| - radv: eliminate useless mov(const) after lowering all IO to scalar |
| - amd/ci: hold back navi21/navi31 to kernel 6.6 |
| - radv/ci: remove RADV_PERFTEST=video_{decode,encode} when it's the default |
| - radv: fix capture/replay with sparse images and descriptor buffer |
| - radv: add radv_disable_hiz_his_gfx12 and enable for Mafia Definitive Edition |
| - aco: implement bitfield_reverse for types other than 32-bits |
| - nir/lower_int64: add bitfield_extract lowering |
| - ac/nir: enable 64-bit lowering for bitfield_extract |
| - aco: implement bitfield_extract for 8-bit/16-bit |
| - radv,radeonsi: emit UPDATE_DB_SUMMARIZER_TIMEOUT on GFX12 |
| - ci: uprev VKCTS main to f66e0ae866117f3d7375763bf5ec194404d88cdb |
| - radv/ci: bump the timeout for radv-polaris10-vkcts |
| - radv: set radv_zero_vram=true for GPUScore: Breaking limit |
| - radv: add a new dirty state for emitting tess user SGPRs |
| - radv: add macros for paired shader registers on GFX12 |
| - radv: use paired shader registers for graphics on GFX12 |
| - spirv: disable mediump for bitfield insert/extract |
| - nir: generalize bitfield insert/extract sizes |
| - radv,radeonsi: fix emitting UPDATE_DB_SUMMARIZER_TIMEOUT on GFX12 |
| - radv: stop using multiview with DGC |
| - radv: remove dead code in radv_CreateDescriptorSetLayout() |
| - radv: use 32 bytes descriptor for sampled/input attachment images on GFX11+ |
| - radv/meta: fix using the wrong pipeline layout for ASTC decoding |
| - radv: remove useless vk_common_entrypoints.h includes |
| - radv: stop using vk_common entrypoints when not necessary |
| - radv: stop setting the address for capture/replay and non-sparse buffers |
| - radv: cleanup creating sparse buffers with capture/replay |
| - radv: add capture/replay for sparse buffers and descriptor buffer |
| - radv: make sure to zero-initialize image view descriptors |
| - radv: fix 3-plane formats with descriptor buffers |
| - radv: unset radv_disable_dedicated_sparse_queue for Bethesda Games |
| - radv: remove radv_disable_dedicated_sparse_queue completely |
| - radv: unset radv_force_64k_sparse_alignment for DOOM: The Dark Ages |
| - Revert "radv,driconf: Add radv_force_64k_sparse_alignment config" |
| - radv: unset radv_lower_terminate_to_discard=true for Indiana Jones |
| - radv: add support for VK_QUERY_POOL_CREATE_RESET_BIT_KHR |
| - radv/nir: lower unassigned vertex attributes to (0,0,0,0) |
| - radv: advertise VK_KHR_maintenance9 |
| - radv/nir: adjust a comment about inlining immutable samplers |
| - radv: cleanup redundant switch cases in radv_GetDescriptorEXT() |
| - radv: remove unused 96-bit memcpy for image descriptors |
| - radv: use constants for different descriptor sizes |
| - radv: implement VK_KHR_unified_image_layouts |
| - radv: advertise VK_KHR_unified_image_layouts on GFX11+ |
| - radv: enable shaderBufferFloat32AtomicMinMax on GFX12 |
| - aco: add support for image f32 atomic add |
| - radv: enable shaderImageFloat32AtomicAdd on GFX12 with ACO |
| - radv: fix a compilation warning with Clang in radv_get_depth_clamp_mode() |
| - docs: mark VK_KHR_depth_clamp_zero_one as done for RADV |
| - ac/surface: fix aliasing DCC tilings with HiZ info on GFX12 |
| - radv: fix 1x user sample locations on GFX10+ |
| - radv/meta: use radv_get_copy_flags_from_bo() more |
| - docs: move nogttspill to the RADV_PERFTEST section |
| - docs: remove redundant nogttspill description |
| - ci: uprev VKCTS main to 9cc8e038994c32534b3d2c4ba88c1dc49ef53228 |
| - radv: replace radv_combined_image_descriptor_sampler_offset() by a constant |
| - radv: add a separate path for writing ycbcr combined image+sampler desc |
| - radv: reduce the number of bytes written for combined image+sampler desc |
| - radv: use one descriptor per plane for combined image+sampler with ycbcr |
| - ci: uprev vkd3d |
| - radv: switch to nir_intrinsic_load_input_attachment_coord |
| - ac/descriptors: configure {Z,STENCIL}_READ_ONLY for ds surfaces on GFX12+ |
| - radv/meta: add a helper to determine if clearing is a full rect |
| - meson: remove unused osmesa option |
| - mr-label-maker.yml: remove remaining osmesa occurrences |
| - radeonsi: add a way to override the disk cache key with radeonsi-build-id |
| - radv: remove an useless check about custom border color |
| - radv: rename border_color_slot to border_color_index |
| - radv: simplify add_descriptor_set() |
| - radv: move pipeline layout implementation to radv_pipeline_layout.c/h |
| - aco: consider that nir_tex_src_{coord,ddx} can be the first source |
| - radv: gather nir shader info at the end of radv_optimize_nir() |
| - radv: make sure to zero-initialize MSB of accel struct descriptor |
| - radv: move descriptor pool implementation to radv_descriptor_pool.c/h |
| - radv: move write descriptor helpers to radv_descriptors.h |
| - radv: move descriptor update implementation to radv_descriptor_update_template.c/h |
| - radv: move radv_push_descriptor_set to radv_cmd_buffer.h |
| - radv: split descriptor set and descriptor utils in separate files |
| - radv/ci: uprev kernel to 6.15.3 |
| - vulkan/runtime: add vk_sampler_{init,finish}() helpers |
| - radv: rework initializing/finishing samplers |
| - radv/meta: stop allocating sampler for blit operations |
| - radv: stop disabling the alpha optimization with E5B9G9R9 and RB+ |
| - radv: disable RB+ with E5B9G9R9 to workaround failures on GFX10.3-GFX11.5 |
| - ac/surface: use align with NPOT for estimating surface size |
| - ac/surface: select a different swizzle mode for ASTC formats on GFX12 |
| - zink/ci: update list of expected failures for RADV |
| - radv: add a helper to emit INDIRECT_BUFFER in DGC |
| - radv: add helpers to emit one DGC sequence |
| - radv: add a helper to emit padding with DGC cmdbuf |
| - radv: remove unused parameter in build_dgc_buffer_preamble() |
| - radv: pass a dgc_cmdbuf parameter to more functions |
| - radv: do not perform a per-pixel copy for BCn formats with mips on GFX12+ |
| - ac/descriptors,radv: move the nbc view param to the gfx10 union |
| - radv: fix indexing with variable descriptor count |
| - radv: fix the maximum variable descriptor count with inline uniform blocks |
| - radv: simplify creating descriptor sets with variable desciptor count |
| - radv/sdma: fix unaligned 96-bits copies on GFX9 |
| - ac/surface: constify bpe_to_format() |
| - ac/surface: add support for surface<->memory copy using addrlib |
| - ac/surface: add a flag to forbid some swizzles for surface<->memory copies |
| - radv: only expose host visible memory types for images with host-transfer |
| - radv: map images with host-transfer at bind time |
| - radv: disable compression for host-transfer images |
| - radv: set RADEON_SURF_HOST_TRANSFER for host-transfer images |
| - radv: add support for formats with host-transfer |
| - radv: add support for querying HIC performance info |
| - radv: add support for querying HIC memcpy size |
| - radv: do not use MRT counters for host-transfer images |
| - radv: implement vkCopyMemoryToImageEXT()/vkCopyImageToMemoryEXT() |
| - radv: implement vkCopyImageToImageEXT() |
| - radv: implement vkTransitionImageLayout() |
| - radv: advertise VK_EXT_host_image_copy on GFX10+ behind RADV_PERFTEST=hic |
| - radv/ci: enable RADV_PERFTEST=hic for GFX10+ jobs |
| - zink/ci: enable RADV_PERFTEST=hic for GFX10+ jobs |
| - radv: add a way to disable the HIZ/HiS events based workaround on GFX12 |
| - radv/bvh: remove redundant definition of DIV_ROUND_UP |
| - radv: re-run clang-format |
| - ci: uprev VKCTS main to 73db56e823f8bf6b9dcab57af43b4216c3ba19b5 |
| - radv: implement DGC IB chaining when the number of sequences is too high |
| - radv: fix SQTT shaders relocation on GFX12 |
| - radv: fix reporting instance/vertex_count for direct draws with RGP on GFX12 |
| - radv: reject 1D block-compresed formats with mips on GFX6 |
| - radv: fix a memleak with GS copy shader NIR |
| - radv: fix fbfetch output with compresed FMASK on <= GFX9 |
| - vulkan: fix missing presentId2/presentWait2 enable features |
| |
| Sasha Finkelstein (1): |
| |
| - hk: Implement VK_EXT_map_memory_placed |
| |
| Serdar Kocdemir (15): |
| |
| - gfxstream: Track sparse image related functions |
| - gfxstream: Use logging functions for call tracing |
| - gfxstream: Mark VK_KHR_ray_tracing_pipeline dispatch only |
| - gfxstream: Reduce log level on AHB allocation |
| - gfxstream: Track vkGetDeviceImageMemoryRequirements on host |
| - gfxstream: Adjust allowed device extensions |
| - gfxstream: Implement private data extension |
| - gfxstream: Fix codegen for the moved logging.h file |
| - gfxstream: Fix compiler warnings |
| - gfxstream: Allow all promoted device extensions |
| - gfxstream: Fix min max reduction samplers v2 |
| - gfxstream: Change clear colors only for images with ANB |
| - gfxstream: Check size for memcpy operations |
| - gfxstream: Remove unnecessary parentheses in codegen |
| - gfxstream: Small optimization on transformDescriptorSetList |
| |
| Sergi Blanch Torne (11): |
| |
| - bin/ci: crnm: bug while stressing a single job |
| - bin/ci: crnm: bug stress mode retry formula |
| - ci: angle: fix depot-tools dependency release |
| - ci: disable Collabora's farm due to maintenance |
| - Revert "ci: disable Collabora's farm due to maintenance" |
| - fluster: xfile update |
| - bin/ci: crnm: format in columns when listing |
| - bin/ci: crnm: columns list for cancelled jobs |
| - bin/ci: crnm: columns list when waiting jobs update |
| - bin/ci: crnm: adaptative field pads |
| - docs: crnm: document token usage |
| |
| Sergi Blanch-Torne (11): |
| |
| - ci: disable Collabora's farm due to maintenance |
| - ci: Re-enable the Collabora farm |
| - Uprev ANGLE to c39f4a5c553cbee39af8f866aa82a9ffa4f02f5b |
| - ci: crnm: fix hyperlink format |
| - ci: separate hidden jobs to -inc yml files |
| - ci: split long containers build yaml |
| - ci: split hidden job definition for bare-metal and ci-tron |
| - ci: reduce the root .gitlab-ci file |
| - ci: crnm: extend it to work with other instances |
| - ci: crnm: rev argument for tags and branches |
| - ci: crnm: review argument metavar and type |
| |
| Seán de Búrca (22): |
| |
| - rusticl/device: set maximum work dimension to match implicit bounds |
| - rusticl: align memory utilities with std |
| - rusticl: improve internal typing |
| - rusticl: verify validity of property names and values |
| - rusticl: improve use of Rust idioms |
| - rusticl: remove unnecessary check for device in kernel list |
| - rusticl: replace \`map_or(false, f)` with \`is_some_and(f)` |
| - rusticl: fix build with clippy driver |
| - nak: remove unnecessary casts and conversions |
| - nak: collapse extraneous conditional branches |
| - nak: use \`Option` propagation instead of explicit let-else clauses |
| - nak: use standard methods and macros to improve readability |
| - nak,nil: avoid explicit returns at the end of functions |
| - nak,nil: elide lifetimes where possible |
| - nak,nouveau: adjust function/method signatures to better match convention |
| - nak,nil: style cleanup |
| - nouveau: implement \`Default` for \`Push` |
| - rusticl: hand-map 3-element arrays for fallible conversion |
| - rusticl: use simple equality check for contexts instead of \`HashSet` |
| - rusticl: iterate subgroup sizes only as needed |
| - rusticl: write \`CLInfoValue`s from iterators instead of collecting |
| - rusticl: replace unnecessary \`Vec` references with slice refs |
| |
| Shih, Jude (3): |
| |
| - amd/vpelib: VPE descriptor added |
| - amd/vpelib: remove sensitive keywords |
| - amd/vpelib: Alpha Fill add mode |
| |
| Sid Pranjale (1): |
| |
| - vulkan/wsi: add cached bit to wsi host memory selection |
| |
| Sil Vilerino (58): |
| |
| - pipe: Add pipe_enc_cap_roi.log2_roi_min_block_pixel_size |
| - d3d12: Report pipe_enc_cap_roi.log2_roi_min_block_pixel_size |
| - d3d12: Fix reporting for PIPE_VIDEO_CAP_ENC_MAX_DPB_CAPACITY |
| - d3d12: Add #if guards for using new ID3D12VideoEncodeCommandList4 |
| - d3d12: Use D3D12_FEATURE_VIDEO_ENCODER_SUPPORT2 when D3D12_VIDEO_USE_NEW_ENCODECMDLIST4_INTERFACE is set |
| - pipe: Add PIPE_VIDEO_CAP_ENC_DIRTY_RECTS and pipe_enc_dirty_rects for H264/H265 encode |
| - d3d12: Add support for pipe_enc_dirty_rects for H264/H265 encode |
| - pipe: Add PIPE_VIDEO_CAP_ENC_MOVE_RECTS and pipe_enc_move_rects for H264/H265 encode |
| - d3d12: Add support for pipe_enc_move_rects for H264/H265 encode |
| - pipe: Add PIPE_VIDEO_CAP_ENC_GPU_STATS_* and pipe_resource textures in H264/H265 encode pic params |
| - d3d12: Add support for QP, SATD and RC bits output stats |
| - pipe: Add sliced encoding API and caps |
| - d3d12: Prepare d3d12_video_encoder_encode_bitstream for sliced encoding. Checked working with single slice buffer at this point |
| - d3d12: Implement multi-slice notifications |
| - d3d12: Support slice NAL prefixes on slice notifications mode |
| - d3d12: deduplicate code with helpers: get_dirty_rects_support, get_move_rects_support, get_gpu_output_stats_support, get_sliced_encode_support |
| - pipe: Add (GPU input) PIPE_VIDEO_CAP_ENC_DIRTY_MAPS and pipe_enc_move_rects GPU input support |
| - d3d12: Add GPU input dirty rect support |
| - pipe: Add (GPU input) PIPE_VIDEO_CAP_ENC_QP_MAPS and input_gpu_qpmap GPU input support |
| - d3d12: Implement PIPE_VIDEO_CAP_ENC_QP_MAPS and CPU/GPU input QPMaps |
| - pipe: Add support for (GPU Input) motion maps |
| - d3d12: Make d3d12_video_encode_support_caps in/out param capEncoderSupportData1 pointer members an external allocation to outlive the stack of the method call |
| - d3d12: Implement GPU Input Motion vectors |
| - pipe: Add pipe_enc_cap_dirty_info.supports_require_auto_slice_mode |
| - d3d12: Reports pipe_enc_cap_dirty_info.supports_require_auto_slice_mode |
| - pipe: Add PIPE_VIDEO_SLICE_MODE_AUTO for gallium driver auto-partition |
| - d3d12: Implement H264/HEVC PIPE_VIDEO_SLICE_MODE_AUTO for gallium driver auto-partition |
| - pipe: Add pipe_enc_cap_dirty_info.supports_require_sao/loop_filter_disabled |
| - d3d12: Report pipe_enc_cap_dirty_info.supports_require_sao/loop_filter_disabled |
| - d3d12: Do not build microsoft/compiler when graphics, gl or vk disabled |
| - d3d12: Remove more nir dependencies when with_gallium_d3d12_graphics disabled |
| - d3d12: Set HEVC slice mode state before calling d3d12_video_encoder_negotiate_requested_features_and_d3d12_driver_caps |
| - d3d12: Add missing D3D12_VIDEO_USE_NEW_ENCODECMDLIST4_INTERFACE guards |
| - d3d12: Add fallback to ID3D12VideoEncodeCommandList2 if ID3D12VideoEncodeCommandList4 not available in underlying OS |
| - d3d12: Add missing offset to encode slice metadata in the get_feedback function |
| - nir: Only build nir headers for mediafoundation/d3d12-no-graphics paired build |
| - d3d12/meson: Add USE_D3D12_PREVIEW_HEADERS compiler flag based on dep_dxheaders.version() |
| - d3d12: Add new video encode HEVC configuration CU size 16-32 range |
| - mr-label-maker.yml: Add mediafoundation tag and associated directories |
| - d3d12: Upgrade to use DirectX-Headers 717 SDK from previous 716 version |
| - d3d12: Add D3D12_VIDEO_ENCODER_CODEC_CONFIGURATION_HEVC_FLAG_DISABLE_LOOP_FILTER_ACROSS_SLICES when querying HEVC capabilities |
| - d3d12: Fix sliced encode ID3D12Resource/ID3D12Fence reuse. Note the fence value now is the frame fence value as the fence is reused |
| - d3d12: Use RefPicListL0/L1 sizes in pipe_h265_enc_picture_desc when supported |
| - d3d12: Use helper to properly initialize d3d12_fences in d3d12 video |
| - pipe: Two pass video encode definitions |
| - d3d12: Implement full/lower two pass video encode |
| - pipe: Add video encode PSNR output stats |
| - d3d12: Add video encode PSNR output stats |
| - mediafoundation: Upgrade to use DirectX-Headers 717 SDK from previous 716 version |
| - mediafoundation: Fill RefPicListL0/L1 sizes in pipe_h265_enc_picture_desc |
| - mediafoundation: Implement full/lower resolution two pass |
| - mediafoundation: Request PSNR frame stats |
| - ci: Add Windows SDK 26100 to msvc image |
| - mediafoundation: Remove WIL dependency, use std elements instead |
| - mediafoundation: Cleanup comments/dead code |
| - mediafoundation: Pass the frame stats buffers to the gallium driver |
| - d3d12: Remove requirement of surface creation/clearing functions for video |
| - mediafoundation: Fix interop without copy fallback from DX11 to DX12 |
| |
| Simon Perretta (47): |
| |
| - pco: drop shader binary finalizing |
| - pco: amend source validation tracking |
| - pco: amend z/w usage code |
| - pco, pygen: rework isa gen to support multi-instruction groups |
| - pco, pygen: amend translation of srcs/dests with no mods |
| - pco, pygen: support enum mappings for instances of two bitsets |
| - pco, pygen: add fdiv/frcp support |
| - pco, pygen: add support for tst, movc instructions and s{lt,ge,eq,ne} ops |
| - pco, pygen: add support for dma ld and add64_32 |
| - pco, pygen: add support for unpck and conversions |
| - pco, pygen: add support for bitwise logical ops |
| - pco, pygen: further abstract src/dest references |
| - pco, pygen: track which hw srcs map to op srcs/dests |
| - pco: initial legalize pass/validation to handle hw restrictions |
| - pco, pygen: track valid phases and io allocations for ops |
| - pco, pygen: iterators for igrps and the instrs they contain |
| - pco, pygen: validate phases and io allocations for ops |
| - pco: commonise pass macro, use on opt subpasses |
| - pvr: commonize limits |
| - pvr: set device features/props/extensions to Vulkan 1.0 minimums (unless implemented) |
| - pco: run dce pass until no more progress is made |
| - pco: add pco nir algebraic pass boilerplate and basic lowering/opts |
| - pco: additional helper functions for address refs |
| - pvr: clarify image/sampler state word packing |
| - pvr: drop pvr_lower_nir |
| - pvr, pco: initial descriptor rework |
| - pco: remove per-device specialization of SPIR-V/NIR options |
| - pco: add support for load_ubo |
| - pco: handle frag/point coords sysvals |
| - pco: skip vector coalescing if ssa srcs are repeatedly referenced |
| - pco, pygen: add f{min,max} support |
| - pvr, pco: point size handling |
| - pco: support flat interpolation varyings |
| - pco: enable translation of vs sysvals |
| - pvr: store device uuid and build sha |
| - pvr: setup vk pipeline cache |
| - pco: allow empty/nop vertex shaders |
| - pvr: reorder device setup to create the compiler context earlier |
| - pco: add support for more pack/unpack ops |
| - pvr: enable partial vertex input dmas |
| - nir/precompiled: add helper to emit an enum map for multiple targets |
| - nir/precompiled: add shader stage option to nir_precompiled_build_variant |
| - nir/serialize: increase the op limit to 10 bits/1024 |
| - nir/builder: add nir_bitfield_insert_imm helper |
| - nir/builder: add nir_ibitfield_extract_imm helper |
| - nir/lower_io_to_scalar: add case for lowering push constants |
| - glsl/list: add foreach_list_typed_from_reverse iterator |
| |
| Stéphane Cerveau (2): |
| |
| - radv/debug: use common path for dmesg and tail |
| - vulkan: radv: set quality level properties |
| |
| Sushma Venkatesh Reddy (9): |
| |
| - intel/compiler: Always run opt_algebraic after descriptor_lowering |
| - intel/compiler: Centralize type stomping logic for Gen12.5 restrictions |
| - intel/compiler: Call brw_try_override_assembly independent of debug flag |
| - intel/tools: Reflect SIMD lane count for EUs in device info output |
| - intel/perf: Add PTL OA support |
| - intel: Add env variable to add break point on/before compute dispatch |
| - anv: Add GPU breakpoint before/after specific compute dispatch call |
| - docs: Add INTEL_DEBUG_BKP_BEFORE/AFTER_DISPATCH_COUNT |
| - intel/executor: Add missing dependency to fix intermittent build failures |
| |
| Sviatoslav Peleshko (2): |
| |
| - intel/brw: Use correct instruction for value change check when coalescing |
| - brw/disasm: Fix Gfx11 3src-instructions dst register disassembly |
| |
| Tanner Van De Walle (1): |
| |
| - util: Make atomic_read use volatile reads with _Generic/decltype |
| |
| Tapani Pälli (11): |
| |
| - iris: force reallocate on eglCreateImage with GFX >= 20 |
| - iris: make sure to not mix compressed vs non-compressed |
| - intel/dev: add note about PAT entries and Wa_18038669374 |
| - anv: put parenthesis to the set_sampler_size equation |
| - intel/dev: update mesa_defs.json from internal database |
| - mesa: add missing stencil formats to _mesa_is_stencil_format |
| - intel/compiler: provide a helper for null any-hit shader |
| - anv: use internal rt-null-ahs when any_hit is null |
| - anv: small cleanup, remove unused helper function |
| - drirc: toggle on intel_storage_cache_policy_wt for tlou2 |
| - drirc: toggle filter addr rounding for Heroes Of Valor |
| |
| Thomas H.P. Andersen (3): |
| |
| - nak: make is_fneg_zero detect -rZ |
| - driconf: update X4 Foundations executable name |
| - zink: do not overwrite existing error for miptail on uncommit |
| |
| Timothy Arceri (22): |
| |
| - util/driconf: add force_gl_depth_component_type_int workaround |
| - mesa: fix color material tracking |
| - mesa: relax EXT_texture_integer validation |
| - st/mesa: fix _IntegerBuffers bitfield use |
| - mesa/st: fix _BlendForceAlphaToOneDraw bitfield use |
| - mesa/st: fix _IsRGBDraw bitfield use |
| - mesa: fix _FP32Buffers bitfield use |
| - mesa: update validation when draw buffer changes |
| - mesa: extend linear_as_nearest work around |
| - util: add workaround for the game Foundation |
| - mesa: get correct src address for transferOps |
| - nir: raise NIR_SEARCH_MAX_VARIABLES limit to 24 |
| - util/cpu: fix broken powerpc linux builds |
| - dril: fix format typo in lookup helper |
| - dril: use endian independent pipe formats |
| - dril: add BGR{X,A}8888 and RGB{X,A}8888 |
| - dri: use plane format directly |
| - dri: add missing field to dri2_dup_image() |
| - dri: fix __DRI_IMAGE_FORMAT* to PIPE_FORMAT* mappings |
| - util: add workaround for legacy OpenGL tf2 |
| - glsl: fix reuse of deref |
| - util: add workaround for Interstellar Rift |
| |
| Timur Kristóf (2): |
| |
| - radv: Clear dirty flag for MSAA state after emitting it. |
| - radv: Clear dirty flag for clip rects state after emitting it. |
| |
| Tomeu Vizoso (26): |
| |
| - teflon/tests: Use a single tolerance value |
| - teflon/tests: Test all models in /models |
| - teflon/tests: Take into account signedness when checking the output tensors |
| - teflon/tests: Print shorter negative INT8s |
| - teflon/tests: Divide the tolerance level by a constant |
| - teflon/tests: Dump output buffers to disk |
| - etnaviv/tests: Add comment on why the SSDLite MobileDet test fails |
| - etnaviv/ml: Use etna_buffer_resource instead of etna_resource |
| - etnaviv/ml: Reorder dimensions in convolutions |
| - etnaviv/ml: Fix zero point values for signedness |
| - etnaviv/ml: Support per-channel quantized weights |
| - teflon: Only mark integers as signed |
| - etnaviv/ml: Consolidate transpose/detranspose |
| - teflon: Actually accept concatenations with different number of channels |
| - teflon: Set unused dimensions to 1 |
| - etnaviv/ml: Track memory layout of tensors |
| - etnaviv/ml: All tensors are now 4D, adapt to it |
| - teflon: Skip unsupported FullyConvolution operations |
| - etnaviv/ml: Support FullyConnected with signed weights |
| - etnaviv/ml: Fix depthwise convolutions |
| - teflon: Support more Pad configurations |
| - etnaviv/ml: Support padding the channels dimension |
| - etnaviv/ml: Rework tensor addition on V8 |
| - teflon/tests: Read model data with mmap, for speed |
| - etnaviv: Release screen->dummy_desc_reloc.bo |
| - teflon: Release the arrays of tensors in operations |
| |
| Valentine Burley (163): |
| |
| - ci/lava: Collapse more log sections |
| - ci/lava: Don't include the timeout in the log sections |
| - ci/lava: Merge and deduplicate log sections |
| - intel/ci: Start using the new 6.14 kernel on JSL |
| - ci: Uncollapse yaml-toml-shell-py-test log sections |
| - ci: Fix Android container structured tagging checks |
| - ci: Update ci-fairy to use shared ref from ci-templates |
| - ci: Use hyphens in make-git-archive job name |
| - ci: Update kernel to fix GPU recovery issues on sm8350 |
| - ci/android: Use aapt from Debian packages |
| - ci/android: Collapse more build sections |
| - ci/angle: Collapse build section |
| - ci/deqp: Collapse build section |
| - ci/rust: Collapse build section |
| - ci/deqp-runner: Collapse build section |
| - ci/android: Remove redundant android-tools folder |
| - ci: Uprev ci-templates to get FDO_DISTRIBUTION_POST_EXEC and S3 fixes |
| - ci/lava: Ensure firmware directory exists before downloading a660_zap.mbn |
| - ci: Export the rootfs from the test-* containers |
| - ci/container: Include ANGLE in the x86_64 test-gl container |
| - ci/container: Include SkQP in the test-gl containers |
| - ci/lava: Add job definitions using the test-* containers |
| - ci/lava: Use rootfs exported from test-* containers |
| - ci/lava: Use the new container based rootfs for piglit traces |
| - ci/lava: Move rootfs handling for Fluster out of build script |
| - ci/va: Collapse build section for va-tools |
| - ci/va: Add /va/bin to PATH for test-video container |
| - ci/container: Add new container for libva and fluster testing |
| - ci: Include Fluster vectors in the rootfs |
| - ci/lava: Use the new test-video-based rootfs for VA-API jobs |
| - ci/amd: Convert to using the new container based rootfs |
| - ci/freedreno: Convert to using the new container based rootfs |
| - ci/panfrost: Convert to using the new container based rootfs |
| - ci/etnaviv: Convert to using the new container based rootfs |
| - ci/lima: Convert to using the new container based rootfs |
| - ci/svga: Convert to using the new container based rootfs |
| - ci/intel: Convert to using the new container based rootfs |
| - ci/lava: Remove job definitions using kernel+rootfs jobs |
| - ci: Add a workaround for finding deqp-runner on Broadcom |
| - ci/baremetal: Remove legacy support for unused devices |
| - ci/baremetal: Split baremetal_arm64_test to -gl and -vk |
| - ci/baremetal: Use container rootfs |
| - ci: Delete the kernel+rootfs jobs |
| - docs: Move the docs about caching downloads to LAVA from bare-metal |
| - docs: Remove the docs for setting up bare-metal devices |
| - ci/test: Move and rename .lava-fluster to .test-fluster |
| - amd/ci: Rename .radeonsi-raven-vaapi-fluster to .radeonsi-vaapi-fluster-rules |
| - ci/fluster: Move the fluster-runner.sh script |
| - ci/fluster: Add sections to job logs |
| - ci/fluster: Move Fluster to its own subfolder in the test-video container |
| - ci/fluster: Use structured tagging for Fluster |
| - ci/fluster: Use the structured tag as the Fluster vector version |
| - amd/ci: Remove increased timeout for radeonsi-raven-vaapi-fluster job |
| - ci/fluster: Fix and rename S3_PATH_FLUSTER to S3_FLUSTER_TAR |
| - iris/ci: Update trace checksums |
| - anv/ci: Update expectations |
| - freedreno/ci: Document regressions |
| - turnip/ci: Update expectations |
| - freedreno/ci: Disable a630 jobs |
| - zink/ci: Properly source setup-test-env.sh in zink-venus-lvp |
| - ci/crosvm: Fix image tags in build-crosvm.sh header |
| - ci/crosvm: Collapse build section |
| - ci/crosvm: Use structured tagging for crosvm |
| - ci: Don't include the kernel in test-base image |
| - ci: Replace HWCI_KVM with HWCI_ENABLE_X86_KVM |
| - ci: Refactor structured tagging component definitions |
| - ci/baremetal: Fix rootfs URL detection |
| - ci: Remove EXTERNAL_KERNEL_TAG variable |
| - ci/container: Include DEBIAN_BUILD_TAG in debian/arm64_build |
| - ci: Run rustfmt and python jobs on aarch64 |
| - zink/ci: Skip flaky trace on TGL |
| - freedreno/ci: Disable all a630 jobs |
| - freedreno/ci: Update piglit expectations on a618 |
| - freedreno/ci: Re-enable a618-piglit |
| - amd/ci: Update kernel for Raven |
| - ci: Remove firmware from test-base |
| - ci: Forward all environment variables to DUTs and crosvm |
| - ci: Fix date parsing with BusyBox on Alpine |
| - ci/lava: Use init-stage2 and setup-test-env.sh from Mesa install |
| - ci/lava: Forward environmental variables to DUT directly |
| - bin/ci: Split out LAVA-specific python requirements |
| - ci: Add a minimal Alpine container for running LAVA jobs |
| - ci/baremetal: Set LAVA_DISTRIBUTION_TAG directly |
| - ci/lava: Drop LAVA_DISTRIBUTION_TAG |
| - ci: Remove debian/x86_64_pyutils container |
| - ci: Delete python-artifacts job |
| - ci/lava: Rename LAVA_S3_ARTIFACT_NAME |
| - ci/lava: Move mesa build job dependency to variant definitions |
| - ci: Don't forward long environment variables |
| - ci: Rename debian-testing to debian-x86_64 |
| - ci/android: Switch to using eglinfo to check GLES implementation |
| - ci/android: Check Vulkan driver using vulkaninfo |
| - ci: Use zstd compressed kernel modules |
| - ci/lava: Only deploy kernel-modules when needed |
| - ci/android: Make android-runner scripts safe for set -u |
| - ci/android: Add a version and renderer check for android-angle-lavapipe |
| - ci/android: Only replace ANGLE if needed |
| - ci: Allow ANGLE_TAG to be unset in -runner scripts |
| - ci/android: Close the Android tools section |
| - khronos-update: Add missing Vulkan Video headers |
| - lima/ci: Remove redundant FDO_HTTP_CACHE_URI |
| - ci: Add and use find_s3_project_artifact (and curl-with-retry) helpers |
| - ci/deqp: Use curl-with-retry helper |
| - ci/fluster: Move build-time check into GitLab log section |
| - ci/vkd3d: Use structured tagging for vkd3d-proton |
| - ci/lava: Remove vkd3d-proton from the core rootfs |
| - ci/init-stage1: Add /dev/fd symlink |
| - radv/ci: Add a pre-merge vkd3d job on Raven |
| - ci: Uprev GL & GLES CTS |
| - ci/android: Use DEQP_ANDROID_EXE for GLES CTS |
| - ci/vkd3d: Fix build when not uploading vkd3d-proton archive |
| - ci/piglit: Collapse build section |
| - anv/ci: Add a pre-merge vkd3d job on ADL |
| - intel/ci: Reduce parallelism of intel-adl-cl job |
| - intel/ci: Drop deprecated fp16 environment variable |
| - ci/android: Build test AMD drivers |
| - ci/lava: Make Cuttlefish a LAVA overlay |
| - ci/lava: Remove Android CTS from the core rootfs |
| - ci/android: Print the detailed deqp versions |
| - ci/intel: Fix the runner tag for Flip-hatch |
| - ci/virtio: Generalize .venus-rules |
| - ci/lava: Fix .lava-x86_64-test-android dependencies |
| - panvk/ci: Increase DEQP_FRACTION for g610-vk |
| - radeonsi/ci: Move full radeonsi-stoney-glcts-piglit job to pre-merge |
| - radeonsi/ci: Rename radeonsi Collabora rule to make it unambiguous |
| - radeonsi/ci: Add glcts and piglit job on Mendocino |
| - zink/ci: Rename zink-radv Valve rule to make it unambiguous |
| - zink/ci: Add glcts and piglit job on Cezanne with RADV |
| - ci: Uprev kernel to enable Multi-Gen LRU |
| - virtio/ci: Increase timeout for AHB tests |
| - virtio/ci: Split android-angle-venus-anv into reusable template |
| - virtio/ci: Add nightly Android Venus jobs on Intel TGL and ADL |
| - ci: Fix artifact name for jobs with parallel indices |
| - ci/android: Move sourcing setup-test-env.sh before \`set -uex` |
| - ci/android: Add separate log section for Android CTS |
| - ci/android: Add link to Android CTS results |
| - ci: Use placeholder-job for mr-label-maker-test |
| - ci: Fix location of conditional build tags |
| - virtio/ci: Run additional VKCTS tests on Cuttlefish with Venus+ANV |
| - ci/android: Update Cuttlefish image |
| - freedreno/ci: Merge a618-piglit-cl into a618-piglit |
| - freedreno/ci: Merge a660-piglit-cl and a660-gl |
| - freedreno/ci: Increase parallelism of a660-gl-cl |
| - freedreno/ci: Increase the concurrency of a660-gl-cl |
| - freedreno/ci: Merge a750-piglit-cl and a750-gl |
| - panvk/ci: Run full g52-vk job pre-merge |
| - freedreno/ci: Increase the concurrency of sm8350-hdk |
| - turnip/ci: Skip slow tests in a660-vk |
| - turnip/ci: Reduce parallelism of a660-vk |
| - zink/ci: Update restricted trace checksums |
| - turnip/ci: Skip Vulkan Video tests |
| - ci/lava: Add arm64 ASan job templates |
| - freedreno/ci: Add ASan jobs on a618 |
| - ci: Fix trace update script following environment variable changes |
| - freedreno/ci: Fix configuration for a618-traces-performance |
| - iris/ci: Simplify performance trace template |
| - zink/ci: Add a performance traces job on ADL |
| - iris/ci: Add a performance traces job on ADL |
| - radv/ci: Lower concurrency of radv-raven-traces-restricted |
| - iris/ci: Lower concurrency of iris-cml-traces |
| - panfrost/ci: Update expectations |
| - freedreno/ci: Update expectations |
| - ci: Always save the artifacts for performance traces |
| |
| Vasily Khoruzhick (1): |
| |
| - lima: ppir: index SSA nodes the same way as we index registers |
| |
| Vignesh Raman (5): |
| |
| - ci: bump ci-templates |
| - ci: use MESA_TEMPLATES_COMMIT for ci-fairy install |
| - ci: replace s3_upload wrapper with ci-fairy s3cp |
| - ci: add libproc2-0 |
| - virgl/ci: update flakes |
| |
| Ville Syrjälä (1): |
| |
| - iris: Don't use CCS_CC modifiers with INTEL_DEBUG=nofc |
| |
| Vinson Lee (2): |
| |
| - freedreno: Sort MRTs so output is stable |
| - subprojects: Update linux-raw-sys to 0.7.0 |
| |
| Visan, Tiberiu (1): |
| |
| - amd/vpelib: feature augment |
| |
| Vitaliy Triang3l Kuzmin (3): |
| |
| - r600/sfn: Remove unused r600_lower_scratch_addresses |
| - r600/sfn: Fix scratch layout calculation for composite types |
| - r600: Fix rectangle coordinate limits on R6xx/R7xx |
| |
| Vladly (2): |
| |
| - freedreno/a3xx-a5xx: re-emit vertex state on VS change |
| - ir3/legalize: don't allow end instruction as jump target on a5xx |
| |
| Wei Zhao (1): |
| |
| - winsys/amdgpu: Remove assert about user fence in amdgpu_fence_wait |
| |
| Wenfeng Gao (5): |
| |
| - mediafoundation: Added new CODECAPIs for GetValue and SetValue. |
| - mediafoundation: Support new MFSampleExtension for PSNR, QPMap, and BitsUsedMap. |
| - mediafoundation: enable new CODECAPI properties for frame stats |
| - mediafoundation: Modify the newly added CODECAPI behavior CODECAPI_AVEncVideoOutputQPMapBlockSize, CODECAPI_AVEncVideoOutputBitsUsedMapBlockSize. |
| - mediafoundation: Modify the helper function MFAttachPipeResourceAsSampleExtension to tell MF that the attached IMFMediaBuffer is ready to use. With this change, application can use Lock() to get the buffer. |
| |
| Wolf480pl (1): |
| |
| - radv/virtio: don't leak drm FD when using vpipe |
| |
| Yao Zi (1): |
| |
| - radeonsi: Fix violation of aliasing rules in radeon_ws_bo_reference |
| |
| Yinjie Yao (3): |
| |
| - gallium/pipe: Increase hevc max slice to 600 |
| - frontends/va: Handle properly when decoding more slices than limit |
| - radeonsi: Disable av1 cdef_channel_strength for VCN4 |
| |
| Yiwei Zhang (127): |
| |
| - venus: fix missing renderer destructions |
| - venus: expose WSI on renderer without dma-buf support |
| - venus: fix to passively enable wsi required extensions |
| - vulkan: fix random tabs to spaces |
| - venus: clang-format |
| - venus: relax external memory ext requirement |
| - venus: misc cleanups post wsi requirement relax |
| - venus: filter out venus incapable physical devices |
| - kopper: Revert "kopper: Explicitly choose zink" |
| - venus: avoid false assert for MESA_VK_WSI_DEBUG=sw,linear |
| - venus: update docs and drop obsolete code for prime blit |
| - venus: refactor wsi acquire to use semaphore and fence SYNC_FD import |
| - venus: force different device on NV proprietary |
| - venus: track prime blit dst buffer memory in the wsi image |
| - venus: properly handle implicit fence from the compositor |
| - panvk: fix memory binding for wsi image alias |
| - panvk: drop bo tracking from panvk_buffer |
| - panvk: drop bo refcount from panvk_image |
| - vulkan/wsi: include missing barrier for transferring to blit dst image |
| - vulkan/wsi: split cmd record for img2buf blit and img2img blit |
| - pan/layout: add const to immutable local vars in pan_image_layout_init |
| - pan/layout: exclude plane offset from plane data size for import |
| - pan/layout: refactor wsi layout query |
| - pan/layout: document and prepare to fix planar plane offset |
| - panvk: fix imported external multi-planar image support |
| - venus: do not force prime blit on nv |
| - venus: force sw wsi path on nv proprietary |
| - lvp: fix wsi platform swapchain image bind |
| - venus: fix maint7 layered vk props query |
| - zink: leave can_do_invalid_linear_modifier open for venus |
| - v3dv: fix swapchain bind info look up |
| - anv: use narrow range to match up with mesa EGL |
| - hasvk: use narrow range to match up with mesa EGL |
| - vulkan/android: rename global u_gralloc ptr |
| - vulkan/android: update helper to initialize u_gralloc once |
| - anv: free up hwvulkan_device_t upon driver unloading |
| - anv: adopt common Android gralloc helpers |
| - hasvk: free up hwvulkan_device_t upon driver unloading |
| - lvp: free up hwvulkan_device_t upon driver unloading |
| - lvp: drop explicit u_gralloc init |
| - nvk: free up hwvulkan_device_t upon driver unloading |
| - nvk: drop explicit u_gralloc init |
| - panvk: free up hwvulkan_device_t upon driver unloading |
| - radv: free up hwvulkan_device_t upon driver unloading |
| - turnip: free up hwvulkan_device_t upon driver unloading |
| - v3dv: free up hwvulkan_device_t upon driver unloading |
| - v3dv: drop explicit u_gralloc init |
| - gfxstream: free up hwvulkan_device_t upon driver unloading |
| - venus: adopt common android u_gralloc helpers |
| - vulkan/android: clean up helpers no longer used |
| - vulkan/treewide: add and use common Android HMI hal implementation |
| - radv: relax the native_handle_t::numFds requirement |
| - u_gralloc/mapper5: fill u_gralloc_buffer_color_info properly |
| - u_gralloc/mapper4: fill u_gralloc_buffer_color_info properly |
| - u_gralloc: assign default u_gralloc_buffer_color_info |
| - egl/android: clean up u_gralloc_get_buffer_color_info handling |
| - vulkan/android: clean up u_gralloc_get_buffer_color_info handling |
| - vulkan/android: add a helper to easy front buffer usage query |
| - anv: use common vk_android_get_front_buffer_usage helper |
| - v3dv: use common vk_android_get_front_buffer_usage helper |
| - venus: use common vk_android_get_front_buffer_usage helper |
| - docs/venus: drop Virtio-WL section |
| - docs/venus: keep requirements up to date |
| - dozen: drop redundant dzn_swapchain_get_image api |
| - vulkan/wsi: drop wsi_common_get_images |
| - venus: relocate tls ring hint from wsi init to queue submission |
| - venus: use more common wsi codes |
| - venus: suppress Coverity Uninitialized scalar variable |
| - panvk: refactor panvk_image to track panvk_device_memory instead of bo |
| - panvk: refactor swapchain image alias bind |
| - venus: fix msaa state sample location info sanitization |
| - venus: use wsi_common_create_swapchain_image |
| - venus: drop drm_format_modifier tracking |
| - venus: drop tiling_override tracking |
| - venus: share code for AHB image subres query |
| - venus: wsi workaround for gamescope |
| - vulkan/wsi: amend barriers for blit dst buffer |
| - vulkan/wsi: amend barriers for blit dst buffer going to foreign queue |
| - venus: filter out vulkan video feature bits on common video formats |
| - venus: support AHB storage image format |
| - virtio/ci: update AHardwareBufferNativeTests expectations |
| - anv: avoid leaking private binding for aliased wsi image |
| - meson: drop vdrm from virgl and venus |
| - venus: fix a prime blit assert |
| - venus: back out implicit fencing handling for NV proprietary |
| - vulkan/android: fix to not append GRALLOC_USAGE_HW_COMPOSER bit |
| - vulkan/android: drop redundant image format query |
| - vulkan/android: rewrite swapchain gralloc usage query |
| - vulkan/android: fix and re-orgnize support before api level 26 |
| - radv: use common ANB swapchain gralloc usage query |
| - radv: use AHARDWAREBUFFER_USAGE_CAMERA_MASK |
| - anv: fix ANB gralloc usage query to not append display usage bits |
| - anv: use common ANB swapchain gralloc usage query |
| - anv: use AHARDWAREBUFFER_USAGE_CAMERA_MASK |
| - hasvk: use common ANB swapchain gralloc usage query |
| - hasvk: use AHARDWAREBUFFER_USAGE_CAMERA_MASK |
| - venus: use common ANB swapchain gralloc usage query |
| - venus: allow to build vtest-only on non-DRM/KMS systems |
| - vulkan/wsi: add wsi_common_get_memory |
| - venus: adopt wsi_common_get_memory |
| - hk: adopt wsi_common_get_memory |
| - radv: adopt wsi_common_get_memory |
| - dozen: adopt wsi_common_get_memory |
| - panvk: adopt wsi_common_get_memory |
| - nvk: adopt wsi_common_get_memory |
| - turnip: adopt wsi_common_get_memory |
| - lavapipe: adopt wsi_common_create_swapchain_image |
| - lavapipe: adopt wsi_common_get_memory |
| - v3dv: adopt wsi_common_create_swapchain_image |
| - v3dv: adopt wsi_common_get_memory |
| - anv: adopt wsi_common_get_memory |
| - hasvk: avoid leaking private binding for aliased wsi image |
| - hasvk: adopt wsi_common_get_memory |
| - vulkan/wsi: drop unused common wsi helpers |
| - venus/meson: amend api_version in the generated icd file |
| - lavapipe: allow AHB export allocation |
| - lavapipe: implement GetMemoryAndroidHardwareBufferANDROID |
| - lavapipe: do not close import fd on error and amend an error code |
| - lavapipe: properly handle AHB release |
| - lavapipe: populate AHB memory mapping |
| - lavapipe: do not short-circuit AHB export alloc (non-import) |
| - lavapipe: amend missing object finish on mem alloc failure |
| - u_gralloc/mapper4: properly expose ChromaSiting types based on api level |
| - lavapipe: ensure to use zero memoryOffset for wsi image alias binding |
| - lavapipe: fix a leak on a lvp_image_create exit path |
| - lavapipe: fix maint4 vkGetDeviceBufferMemoryRequirements |
| - lavapipe: fix maint4 vkGetDeviceImageMemoryRequirements |
| |
| Yogesh Mohan Marimuthu (17): |
| |
| - amd: update amdgpu_drm.h for userq info |
| - ac,winsys/amdgpu: get userq_ip_mask supported from kernel info ioctl |
| - ac,radeonsi: clear_state is not supported in user queue |
| - ac: fix getting mcbp info for userq |
| - ac: add AMD_USERQ env var to enable user queue |
| - winsys/amdgpu: apu fwm packet supports only 4 max fences |
| - ac: for userq do not set info->has_fw_based_shadowing |
| - winsys/amdgpu: pass r/w bo to w/r list in userq_wait ioctl |
| - radeonsi: replace fprintf stderr using mesa_log helper functions |
| - ac,radeonsi,radv: rename register_shadowing_required |
| - ac: enable kernelq reg shadowing only when userq is disabled |
| - radeonsi: has_kernelq_reg_shadowing failure means driver failed |
| - radeonsi: rename sctx->has_graphics to sctx->is_gfx_queue |
| - radeonsi: add uses_kernelq_reg_shadowing in struct si_context |
| - radeonsi: reorder if condition in si_init_cp_reg_shadowing() function |
| - ac, radeonsi: prepare cs_preamble_state for userq reg shadowing |
| - radeonsi: submit cs_preamble_state to as first job in userqueue |
| |
| Yurii Kolesnykov (3): |
| |
| - Get rid of 5 remaining references to glapitable.h |
| - Guard double include of libdrm.h by defining LIBDRM_H |
| - Guard call to free_zombie_glx_drawable with condition from its definition |
| |
| Zan Dobersek (4): |
| |
| - tu: allow bigger block sizes when copying between buffers |
| - ir3: fix display of dot-product instructions |
| - ir3: fix parsing of texture prefetch headers |
| - ir3: allow asm roundtrip testing of compiled shader variants |
| |
| Zhao, Jiali (1): |
| |
| - amd/vpelib: G2084 STUDIO output gamma fix |
| |
| Zoltán Böszörményi (4): |
| |
| - docs/features.txt: Move optional features of OpenCL 1.0 and 1.1 into a new section |
| - docs/features.txt: Move optional OpenCL 1.2 features to the new section |
| - docs/features.txt: Follow-up changes for rusticl |
| - docs/features.txt: Indicate SPIR-V version for rusticl |
| |
| jesse.zhang (1): |
| |
| - amd: Add user queue HQD count to hw_ip info |
| |
| jhananit (4): |
| |
| - iris: Update NIR_PASS_V to NIR_PASS |
| - intel: Update all NIR_PASS_V to NIR_PASS |
| - i915: Remove NIR_PASS_V and updated it to NIR_PASS |
| - anv: Remove NIR_PASS_V usage |
| |
| llyyr (3): |
| |
| - vulkan/wsi/wayland: make needs_color_surface_old check if surface exists |
| - brw: don't unconditionally print message on instance creation |
| - radv: don't set HOST_IMAGE_TRANSFER_BIT if host_image_copy not enabled |
| |
| mojyack (1): |
| |
| - mesa: reorder mesa format corresponding to GL_BGRA |
| |
| sarbes (2): |
| |
| - lima: add 'unorm8' format to genxml |
| - lima: genxml-ify PP frame registers |
| |
| sergiuferentz (2): |
| |
| - gfxstream: Remove MEMORY_HANDLE_OPAQUE_FD support from Guest to fix suballocation dEQP tests |
| - gfxstream: intercept vkCmdClearColor(...) and do linear->SRGB conversion |