| Mesa 25.1.0 Release Notes / 2025-05-07 |
| ====================================== |
| |
| Mesa 25.1.0 is a new development release. People who are concerned |
| with stability and reliability should stick with a previous release or |
| wait for Mesa 25.1.1. |
| |
| Mesa 25.1.0 implements the OpenGL 4.6 API, but the version reported by |
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / |
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. |
| Some drivers don't support all the features required in OpenGL 4.6. OpenGL |
| 4.6 is **only** available if requested at context creation. |
| Compatibility contexts may report a lower version depending on each driver. |
| |
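Because the reported version depends on the driver and on the kind of context, applications usually inspect the ``GL_VERSION`` string at run time rather than assuming 4.6. The sketch below shows one way to extract the major and minor numbers from such a string. It assumes the string has already been obtained from ``glGetString(GL_VERSION)``; the helper name ``parse_gl_version`` and the sample strings are hypothetical and only illustrate the general layout (version number first, vendor-specific text after).

```python
import re


def parse_gl_version(version_string):
    """Return (major, minor) parsed from a GL_VERSION string.

    Desktop GL strings begin with "<major>.<minor>"; OpenGL ES strings
    begin with "OpenGL ES <major>.<minor>" (or "OpenGL ES-XX" for ES 1.x).
    Everything after the number is vendor-specific and is ignored here.
    """
    match = re.match(r"(?:OpenGL ES(?:-\w+)? )?(\d+)\.(\d+)", version_string)
    if match is None:
        raise ValueError(f"unrecognized GL_VERSION string: {version_string!r}")
    return int(match.group(1)), int(match.group(2))


# Hypothetical sample strings, for illustration only.
core = parse_gl_version("4.6 (Core Profile) Mesa 25.1.0")
compat = parse_gl_version("4.5 (Compatibility Profile) Mesa 25.1.0")
es = parse_gl_version("OpenGL ES 3.2 Mesa 25.1.0")
```

Where a context is available, querying ``GL_MAJOR_VERSION`` and ``GL_MINOR_VERSION`` with ``glGetIntegerv`` avoids string parsing altogether.
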
| Mesa 25.1.0 implements the Vulkan 1.4 API, but the version reported by |
| the apiVersion property of the VkPhysicalDeviceProperties struct |
| depends on the particular driver being used. |
| |
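The ``apiVersion`` field is a single packed 32-bit integer, so it has to be decoded before it can be compared against 1.4. The sketch below mirrors the bit layout used by the ``VK_API_VERSION_*`` and ``VK_MAKE_API_VERSION`` macros in the Vulkan headers (3-bit variant, 7-bit major, 10-bit minor, 12-bit patch). The Python function names are hypothetical, and the value is assumed to have been read from ``VkPhysicalDeviceProperties`` elsewhere.

```python
def decode_vk_api_version(api_version):
    """Split a packed Vulkan version into (variant, major, minor, patch)."""
    variant = api_version >> 29
    major = (api_version >> 22) & 0x7F
    minor = (api_version >> 12) & 0x3FF
    patch = api_version & 0xFFF
    return variant, major, minor, patch


def make_vk_api_version(variant, major, minor, patch):
    """Pack version components the same way VK_MAKE_API_VERSION does."""
    return (variant << 29) | (major << 22) | (minor << 12) | patch


def supports_vulkan(api_version, major, minor):
    """Return True if the packed version is at least major.minor."""
    _, dev_major, dev_minor, _ = decode_vk_api_version(api_version)
    return (dev_major, dev_minor) >= (major, minor)


# Example: a driver reporting Vulkan 1.4 with patch level 0.
packed = make_vk_api_version(0, 1, 4, 0)
decoded = decode_vk_api_version(packed)
```

A driver that reports 1.2 would fail a ``supports_vulkan(value, 1, 4)`` check even though Mesa as a whole implements 1.4.
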
| SHA checksums |
| ------------- |
| |
| :: |
| |
| SHA256: b1c45888969ee5df997e2542654f735ab1b772924b442f3016d2293414c99c14 mesa-25.1.0.tar.xz |
| SHA512: 40298370727fa1ad6d59be8692dfef01c42e41780816e9b951a05f779c7acca127162d2d3cedfccb7bfe3834b9e917c2b9bc6cb76887488d919cb61741a1da1a mesa-25.1.0.tar.xz |
| |
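A downloaded tarball can be checked against the SHA256 value above before it is unpacked. The following is a minimal sketch using Python's standard ``hashlib`` module; the helper names are hypothetical, and the file is assumed to be in the current directory under the name given in the checksum listing.

```python
import hashlib

# SHA256 value copied from the checksum listing above.
EXPECTED_SHA256 = "b1c45888969ee5df997e2542654f735ab1b772924b442f3016d2293414c99c14"


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash a file in fixed-size chunks so large tarballs need little memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_sha256(path, expected_hex=EXPECTED_SHA256):
    """Return True if the file's SHA256 equals the published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

Calling ``matches_published_sha256("mesa-25.1.0.tar.xz")`` returns ``True`` only when the file's digest equals the published value; a ``False`` result means the download should be discarded.
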
| |
| New features |
| ------------ |
| |
| - cl_khr_spirv_linkonce_odr in rusticl |
| - storagePushConstant16 on panvk |
| - storageInputOutput16 on panvk |
| - VK_KHR_depth_stencil_resolve on panvk |
| - VK_KHR_separate_depth_stencil_layouts on panvk |
| - VK_EXT_separate_stencil_usage on panvk |
| - VK_KHR_sampler_ycbcr_conversion on panvk/v10+ |
| - VK_EXT_ycbcr_2plane_444_formats on panvk/v10+ |
| - VK_EXT_ycbcr_image_arrays on panvk/v10+ |
| - VK_KHR_imageless_framebuffer on panvk |
| - VK_KHR_uniform_buffer_standard_layout on panvk |
| - VK_EXT_border_color_swizzle on panvk |
| - VK_MESA_image_alignment_control on NVK |
| - shaderFloat16 on panvk |
| - VK_KHR_shader_subgroup_uniform_control_flow on panvk/v10+ |
| - VK_KHR_shader_maximal_reconvergence on panvk/v10+ |
| - VK_EXT_device_memory_report on RADV |
| - VK_KHR_shader_subgroup_extended_types on panvk/v10+ |
| - shaderStorageImageExtendedFormats on panvk |
| - VK_KHR_display on panvk |
| - VK_EXT_display_control on panvk |
| - EXT_shader_framebuffer_image_fetch on v3d |
| - EXT_shader_framebuffer_image_fetch_coherent on v3d |
| - KHR_blend_equation_advanced on v3d |
| - KHR_blend_equation_advanced_coherent on v3d |
| - KHR_partial_update on etnaviv |
| - VK_KHR_line_rasterization on panvk |
| - VK_EXT_line_rasterization on panvk |
| - shaderImageGatherExtended on panvk |
| - textureCompressionBC on panvk |
| - VK_EXT_sample_locations on RADV for GFX10+ |
| - GL_ARB_shader_clock on panvk |
| - VK_KHR_shader_float_controls on panvk |
| - VK_KHR_shader_float_controls2 on panvk/v10+ |
| - storageBuffer8BitAccess on panvk |
| - storagePushConstant8 on panvk |
| - uniformAndStorageBuffer8BitAccess on panvk |
| - MSAA with 8 and 16 sample counts on panvk |
| - VK_KHR_spirv_1_4 on panvk/v10+ |
| - Mali G720 and G725 on Panfrost and panvk |
| - dualSrcBlend on panvk |
| - VK_KHR_dynamic_rendering_local_read on panvk |
| - VK_EXT_subgroup_size_control on panvk/v10+ |
| - VK_KHR_format_feature_flags2 on panvk |
| - shaderStorageImageReadWithoutFormat on panvk |
| - shaderStorageImageWriteWithoutFormat on panvk |
| - VK_EXT_direct_mode_display on panvk |
| - Vulkan 1.2 on panvk/v10+ |
| |
| |
| Bug fixes |
| --------- |
| |
| - RADV:RX 9070:Mesa-25.0.5 GTA 5 Enhanced GPU HANG |
| - [ANV/DG2] nvpro-samples/vk_raytracing_tutorial_KHR/ray_tracing_reflections renders not all reflections |
| - radv: Flickering in Kingdom Come: Deliverance II |
| - RADV regression causes severe glitches in Hunt Showdown 1896 on Polaris |
| - Z-Fighting in Tomb Raider IV - VI Remastered Linux |
| - [anv] VK_ERROR_DEVICE_LOST on Linux 6.13.8 while playing Dota 2 on Intel Graphics |
| - Variable Rate Shading (VRS) produces very aliased results on RADV with an AMD gpu |
| - WWE 2k23 small "artifacts" |
| - RadeonSI - ACO does not spill enough SGPRs despite low VGPR pressure in some scenarios |
| - RADV: Performance Regression (~18%) on Vega 64 (GFX9) in Cyberpunk 2077 caused by commit 08918f08805f (MR 34361) |
| - radeonsi: CL conformance test \`vector_swizzle\` fails since 177427877bb50ad7ba24abfa13e55a2684d804df |
| - [macOS] glapi_gentable.c:46:10: fatal error: 'glapitable.h' file not found |
| - commit 90faadae regression - Warhammer 40,000: Space Marine 2 crashing |
| - RADV: GPU hangs always at certain places in the game "Sniper Elite: Resistance" |
| - Random mesa crashes in kwin_wayland on a 6600XT |
| - Patch to fix clinfo on rusticl |
| - radv/aco: Ghost of Tsushima hangs and causes gpu resets on RDNA 3 GPU |
| - mesa-vulkan-driver-git.x86_64 causes strange colored rectangle artifacts in Final Fantasy XIV |
| - NVK - Out of video memory error when starting Marvel Rivals |
| - RADV: Performance regression in Elden Ring on GFX8/Polaris |
| - Vulkan issues after sleeping on 9070 XT |
| - ring gfx_0.0.0 timeout after waking from sleep - RX 9070 |
| - Confidential issue #12324 |
| - Confidential issue #12946 |
| - svga: how do I test this driver? |
| - brw: new Xe2 CTS failures |
| - [NVK] NAK assert in The Last of Us Part 2 shader |
| - anv: S.T.A.L.K.E.R. 2: Heart of Chornobyl crashes after starting a new game |
| - [ANV][LNL] - Lost Records: Bloom & Rage (1902960) - Title hangs on launch and subsequently crashes to desktop. |
| - [BMG] Intel b580 battlemage: Fort Solis (Unreal Engine game) boots to menu, hangs while loading after hitting continue from the main menu |
| - [ANV][LNL] - NINJA GAIDEN 2 Black (3287520) - Environment assets are incorrectly rendered or missing. |
| - [ANV][LNL] - The Headliners (3059070) - Title hangs a few minutes after launch. |
| - anv, regression: Invisibly blinking cliffs & rocks in Satisfactory DX12 on BMG |
| - [Feature request] Add HK support for VK_EXT_queue_family_foreign vulkan extension |
| - vk/overlay: output_file option failing |
| - The Last of Us Part I GPU hang on gfx1201 |
| - glsl: IR validation fails on some shaders after lower_precision (breaks GLSL-to-NIR) |
| - [bisected, LNL] brw: 341e5117ecbc ("brw/nir: Treat load_const as convergent") regresses arb_gpu_shader5-interpolateAtOffset on LNL |
| - vulkan regression mesa 24.3.4 to 25.0.0.rc3 with broadcom |
| - AMD: broken Minetest trace on Vega iGPU |
| - All OpenGL applications segfault |
| - radv: nir_opt_varyings.c:2766: deduplicate_outputs: Assertion \`list_index == 0' failed. |
| - vulkan/wsi: memory leak from wsi_CreateSwapchainKHR |
| - [RADV][RDNA3][Phoenix3][APU] NARAKA: BLADEPOINT (1203220) gpu hang reproducible (ice/water regression mesa 24.1 bisected SAMPLE_MASK_TRACKER_WATERMARK=15) random (maybe other apps/games) |
| - GPU hangs running Octopath Traveler II with 780M |
| - GPU crash on Radeon 780M with Tales of Arise |
| - The Last of Us - shadows flickering on gfx1201 without nohiz flag |
| - brw: Hit unreachable nir_op_fsign case that brw_nir_lower_fsign missed |
| - Panvk:Add Support for BCn (BC1–BC7) Texture Compression |
| - anv: Dark pattern overlayed on objects in Eve Online DX11 mode on BMG |
| - radv/aco: Strobing artifacts in Pacific Drive |
| - aco: Distorted light halos, Star Citizen |
| - Mesa 25 removes VA-API encoding for R9 390 |
| - debian-android build is huge |
| - isl: CPCBCompressionEnable is now LosslessCompressionEnable |
| - Video stuttering / anv: extend implicit fencing support |
| - anv, bmg: Visual issues in AC Origins, Odyssey and Fenyx Rising when dxvk doesn't export PointSize |
| - [ANV][LNL] - A Game About Digging A Hole (3244220) - Title throws an assertion failure on launch. |
| - [ANV][BMG] - A Plague Tale: Requiem - Severe shadow flicker |
| - [ANV][LNL] - Hogwarts Legacy (990080) - Flickering artifacts visible on magical barrier. |
| - brw: regression on Gfx9 dEQP-VK |
| - HTML Docs fail to build from source with Sphinx 8.2.1 |
| - libglapi.dll ain't built anymore under Windows |
| - anv/video: Timestamps are exposed in video encode queue, but it crashes |
| - Getting a crash with manually built llvmpipe (OpenGL) |
| - [RadeonSI] Blender assetshelf icons are borken in mesa >= 25.0.0 |
| - radeonsi regression after 24.3.4 |
| - misc OpenGL CTS failures |
| - glBindVertexBuffer regression due to ID reuse |
| - RADV: logic used to avoid running on CDNA is faulty |
| - [ANV][LNL] - Company of Heroes 3 (1677280) - Circular banding is present on screen during gameplay. |
| - video hardware acceleration issue with "VA" |
| - [LNL/BMG] Assassin's Creed Valhalla trace replay hang |
| - Vulkan conformanceVersion is reported as 0.0.0.0 in Mesa 25.0.0 |
| - X11 + Zink on NVK flickers older frames in Firefox based browsers |
| - VRAM Abnormal use on mesa 25.0 |
| - [radv][regression] Multiple games detect the wrong amount of vram |
| - Resident Evil 2 Remake flickers |
| - OpConstantNull not supported for OpTypeCooperativeMatrixKHR |
| - v3dv: vkcube-wayland crashes on raspberry pi 5 kernel 6.12 and latest mesa |
| - GMSH Visualization Fails with radeonsi:can't compile a main shader part, Fedora 41 AMD 7900xt |
| - AMD VDPAU deinterlacing SIGSEGV |
| - radv: vkd3d-proton test_primitive_restart_list_topology_stream_output randomly fails on NAVI2X |
| - Mesa 24.1 introduced a Vulkan problem with DOOM 2016 on AMD 780M GPU |
| - radeonsi: Firefox fails assertion requiring binding vertex elements before vertex_buffers |
| - nouveau & zink+nvk: Flashing in Firefox and Thunderbird on Hyprland |
| - Zink: Kopper's present thread causes Wayland protocol races |
| - GLmatrix needs aligned malloc |
| - Lavapipe crashes if no Position is output in mesh shader |
| - radeonsi/video: Allocate video buffers with modifiers 7f7206f1a9d brake 'mplayer -vo vdpau...' on GFX8 (Polaris 20) |
| - [RADV/aco][regression][bisected] - Avowed (2457220) - GPU hangs near Watermill outside of Dawnshore |
| - radv/sqtt: assertion "layout transition marker should be only emitted inside a barrier marker" |
| - mesa_shader_cache directory is created even if mesa_shader_cache_db is used. |
| - nak: Fold i2b(b2i(x)) |
| - [radv] Glitchy ground geometry regression in Total War Warhammer III on RX 7600 |
| - NVK: Implement shaderSharedInt64Atomics |
| - nvk: Implement VK_MESA_image_alignment_control |
| - radeonsi: regression with running DaVinci Resolve under rusticl since 666a6eb871d5dec79362bdc5d16f15915eb52f96 |
| - [ANV][LNL] - Black Myth: Wukong (2358720) - Corruption is visible near the edge of water. |
| - [ANV][LNL] - Hogwarts Legacy (990080) - Pixelated corruption is visible when looking out at the water. |
| - radv/video/h265: pps.flags.transform_skip_enabled_flag = 1 randomly hangs GPU |
| - turnip falls with \`assertion "errno == ETIME" failed\` |
| - [ANV][LNL] - Steel Rats (619700) - Game crashes after opening logos play before reaching main menu |
| - nvk: Implement host-only descriptors |
| - Gnome-shell Wayland fails to start with segfault at modifier-less driver |
| - [ANV][LNL] - DYNASTY WARRIORS: ORIGINS (2384580) - Dithered transparency has vertical bands. |
| - AMD Radeon R9 270 randomly causes video playback applications to crash with "amdgpu: The CS has been rejected" |
| - ci: a306 jobs use a307 files |
| - Rendering issues on GravityMark with RadeonSI ACO |
| - Expose low latency encoding for radv vulkan video encode |
| - i915: multiple tests assert with tgsi_ureg.h:893: ureg_swizzle: Assertion \`reg.File != TGSI_FILE_NULL' failed. |
| - ci: debian-build-testing fails in nightly |
| - shaders/closed/steam/deus-ex-mankind-divided/260.shader_test fails NIR validation |
| - panvk : vk_pipeline_cache_object_deserialize: Assertion \`reader.current == reader.end && !reader.overrun' failed. |
| - 46a8d5e7ef61735416d0c54886a7a9930621ae2c causes a permission denied spam |
| - [BUILD] Build Failure: Implicit Function Declaration 'timespec_sub_saturate' (loader_wayland_helper.c) |
| - anv: \`MESA: warning: INTEL_HWCONFIG_MIN_GS_URB_ENTRIES (2) != devinfo->urb.min_entries[MESA_SHADER_GEOMETRY] (0)\` |
| - intel genX_acceleration_structure: missing dependency to bvh/header.spv.h |
| - ci: what's going on with zink-venus-lavapipe ? |
| |
| |
| Changes |
| ------- |
| |
| Aaron Ruby (17): |
| |
| - gfxstream: Replace pre-processor (LINUX && !ANDROID) checks with LINUX_GUEST_BUILD |
| - gfxstream: Make the virtgpu device discovery for LinuxVirtGpu more robust |
| - gfxstream: Change "mesaOnly" nomenclature to be "guestOnly" |
| - gfxstream: Add common interfaces in the VirtGpuDevice to query DrmInfo and PciBusInfo |
| - gfxstream: Clean up the gfxstream_vk device and instance init |
| - gfxstream: Fix precedence and ownership issues on Linux for imported FD and the VirtGpuResource |
| - gfxstream: Add reference counting for GEM handles in LinuxVirtGpuBlob |
| - gfxstream: Downgrade log severity when enabling params in LinuxVirtGpu |
| - gfxstream: Remove vkGetImageSubresourceLayout ResourceTracker entry |
| - gfxstream: Full emulation support for VK_EXT_image_drm_format_modifier |
| - gfxstream: Resolve/clean-up inconsistencies with advertising emulated extensions |
| - gfxstream: No VIRGL_BIND_LINEAR for ColorBuffers |
| - gfxstream: Move virtgpu_gfxstream_protocols.h to the common location for house protocols |
| - egl: Remove FallbackZink config item entirely |
| - gbm/dri: Match zink autoloading from egl |
| - drm-uapi: Sync virtgpu header |
| - virtio: Remove virglrenderer_hw.h entirely |
| |
| Adam Jackson (8): |
| |
| - zink: Enable KHR_shader_subgroup |
| - lvp: set subgroupQuadOperationsInAllStages to true |
| - glx: Make #undef GLX_INDIRECT_RENDERING do something |
| - mapi/glx: Remove xserver code generation |
| - glx: Remove (almost) all usage of _X_HIDDEN / _X_INTERNAL |
| - mapi/glx: Remove FASTCALL/PURE |
| - loader: Stop looking in ${libdir}/tls/ |
| - meson: Simplify the power8 optimization logic |
| |
| Aditya Kumar (1): |
| |
| - gfxstream: Fix compiling gfxstream for musl libs |
| |
| Alejandro Piñeiro (1): |
| |
| - nir: aliasing checks should be also done with index != 0 |
| |
| Aleksi Sapon (3): |
| |
| - draw: fix gl_PrimitiveID in tessellation |
| - llvmpipe: improve aniso filtering |
| - lp: fix gnu-empty-initializer warning |
| |
| Alyssa Rosenzweig (226): |
| |
| - meson: factor out with_driver_using_cl |
| - nir: add nir_function_intrinsics_pass |
| - nir/lower_scratch_to_var: handle KERNELs |
| - nir/lower_scratch_to_var: handle multi-function shaders |
| - nir/print: extract nir_print_function_body |
| - nir/serialize: add specialized function serialization |
| - nir: introduce bindgen_return |
| - nir: add nir_call_serialized helper |
| - vtn: add vtn_bindgen2 tool |
| - libcl: add MIN3/MAX3 macros like on the host |
| - libcl: define GLSL-style compute built-ins |
| - libagx: port to glsl-style compute builtins |
| - asahi: port to vtn_bindgen2 |
| - nir: include __LINE__ in NIR_PASS validation results |
| - libcl: add unreachable() macro |
| - nir: add image_min_lod_agx |
| - nir: add lod_bias_min_agx tex src |
| - agx: lower min LOD for txf |
| - agx: switch to nir_tex_src_lod_bias_min_agx |
| - hk: pack has_border with clamp_0_sampler_index |
| - hk: rearrange sampler image desc |
| - hk: emulate EXT_image_view_min_lod |
| - hk: advertise EXT_image_view_min_lod |
| - Revert "hk: Stop using strings or common key types for meta keys" |
| - intel: drop nir_lower_printf calls |
| - intel/nir_lower_printf: modernize nir |
| - intel: port to u_printf context + singleton |
| - nir/lower_printf: hash format strings in nir_printf_fmt |
| - nir: drop printf_base_identifier |
| - nir: add nir_intrinsic_has_semantic helper |
| - nir: switch intrinsic semantics to BIT |
| - nir: mark subgroup/quadgroup ops |
| - nir/gather_info: use subgroup/quadgroup flags |
| - nir/opt_move_discards_to_top: use semantic |
| - nir: mark a few more subgroup ops |
| - libcl/vk: add common query copy write routine |
| - nir: default-initialize next_stage |
| - asahi: clang-format |
| - asahi: fix libwrap.dylib |
| - asahi: fix cull distance with GS |
| - hk: implement calibrated timestamps |
| - hk: fix increment CS invs |
| - hk: do not incorrectly offset host-image-copy sources |
| - asahi: bind zero-page |
| - libagx: use zero page |
| - asahi: use zero sink for vbuf |
| - hk: use zero sink for null index buffer |
| - hk: don't allocate zero sink |
| - libagx: add missing null pointer check |
| - asahi: perf debug indirect tess |
| - asahi: use NIR_PASS to validate more |
| - asahi: switch tib lower to intrinsic pass |
| - libagx: fix subgroup id confusion |
| - libagx: fix wraparound issue with robust draw kernel |
| - libagx: use assert instead of 0xdeadbeef writes |
| - asahi: add more alignment asserts |
| - hk: reject non-2D modifiers |
| - hk: unstub UnmapMemory2KHR |
| - hk: unbind VAs |
| - hk: fix buffer binding |
| - hk: enable bufferDeviceAddressCaptureReplay |
| - docs/asahi: update varying info |
| - docs/asahi: add some section headers |
| - docs/asahi: describe sparse page tables |
| - asahi: rename Null layout |
| - asahi: identify image mode enum |
| - asahi: add sparse block XML |
| - ail: model sparse page tables |
| - ail: unit test sparse_table_size_B |
| - ail: report mip_tail_first_lod for sparse |
| - ail: report miptail stride |
| - ail: move helpers to layout.h for sharing |
| - ail: extract a blocksize helper |
| - ail: expose ail_get_max_tile_size |
| - ail: add ail_bytes_to_pages helper |
| - ail: add ail_get_twiddled_block_B helper |
| - ail: unit test ail_get_twiddled_block_B |
| - asahi: assert page alignment in vm_bind |
| - asahi: support unbinding VA in agx_va_free |
| - agx: implement sparse residency queries |
| - hk: ban sparse host-image-copy |
| - hk: implement sparse |
| - asahi: drop cargo culted disk cache disable |
| - agx: fix uniform atomic opts |
| - hk: ensure space with the dispatch |
| - hk: pass cmdbuf, not control stream, into precomp dispatch |
| - hk: assert more |
| - hk: accelerate buffer copies with CL |
| - hk: pass cmdbuf to perf_debug |
| - libagx: fix ia_primitives with tessellation |
| - libagx: vectorize tess level loads |
| - hk: perf debug sparse binding |
| - asahi,libagx,hk: don't set custom_target names |
| - asahi: drop trivial depends |
| - agx: assert shift bounds |
| - agx: fix botched address fuses |
| - hk: do not clamp txf for copy shaders |
| - libagx: use 64-bit multiply for image atomic calculation |
| - asahi: fix null deref in error path |
| - hk: fake min/max filtering for proton |
| - agx: handle rgb32 residency queries |
| - hk: ban sparse RGB32 |
| - agx: handle sparse buffer images |
| - asahi: support sparse in virtgpu |
| - nir/builder: add nir_shader_tex_pass |
| - treewide: use nir_shader_tex_pass |
| - radv/nir_lower_viewport_to_zero: intrinsic pass |
| - radv/nir_lower_view_index: intrinsic pass |
| - radv/nir_lower_fs_barycentric: intrinsic pass |
| - radv/nir_lower_intrinsics_early: intrinsic pass |
| - radv/nir_lower_fs_intrinsics: intrinsic pass |
| - nvk: rewrite query copy shader in CL C |
| - docs/asahi: clarify twiddled vs GPU-tiled |
| - ail: rename twiddled -> gpu tiled |
| - ail: split compression up |
| - asahi: rename wip modifier |
| - asahi: drop silly |
| - hk: drop silly |
| - nir: add nir_progress/nir_no_progress helpers |
| - asahi: clang-format |
| - nir: clang-format |
| - treewide: Switch to nir_progress |
| - treewide: port remaining nir_metadata_preserve users |
| - nir,nak: update comments referencing nir_metadata_preserve |
| - nir: eliminate nir_metadata_preserve |
| - nir: clean up progress |
| - ir3: clean up progress manually |
| - asahi: clang-format |
| - vulkan: add common VK_PRINT_STR/VK_COPY_STR macros |
| - hk: switch to common VK_COPY/PRINT_STR |
| - anv,hasvk: switch to common VK_COPY/PRINT_STR |
| - v3dv: switch to common VK_COPY/PRINT_STR |
| - radv: use VK_COPY_STR |
| - panfrost,panvk: fix clang warnings |
| - vulkan: add helpers to work with executable statistics |
| - util: add shader statistic framework |
| - asahi: port to common stats framework |
| - v3dv: fix clang warning |
| - meson: make CL args common |
| - meson: set NDEBUG appropriately for OpenCL |
| - intel: use common CL args |
| - meson,clc: set CL standard from meson |
| - panfrost: clean up CL meson rules |
| - panfrost: port to common stats framework |
| - agx: call nir_lower_is_helper_invocation |
| - pan/mdg: call nir_lower_is_helper_invocation |
| - nir/lower_helper_writes: fix stores after discard |
| - pan/lower_helper_invocation: clean up |
| - bin: add script for applying review trailers |
| - bin: add list of Mesa contributors |
| - libcl: add u_foreach_bit |
| - hk: fix cull distance confusion |
| - asahi: integrate printf/abort support |
| - libagx: do not use prefix sums for GS index buffer |
| - asahi/gs: drop non-XFB prefix sums |
| - asahi/gs: factor out output info |
| - asahi/gs: avoid recalculating |
| - asahi/gs: only prefix sum with XFB |
| - libagx,asahi: hoist GS draw generation |
| - asahi/gs: drop unused params |
| - asahi/gs: report whether xfb is needed |
| - asahi: do not dispatch count/pre-GS unless needed |
| - hk: do not dispatch count/pre-GS unless needed |
| - libagx: use indirect draw struct |
| - libagx: clean up |
| - tu: use the stats framework |
| - v3dv: use the stats framework |
| - drm-uapi: update drm_fourcc.h |
| - ail: drop fake modifiers |
| - libagx: reduce # of variants of unroll restart |
| - asahi: add assert for max variant count |
| - agx: fix ctz of zero with address calcs |
| - agx: fix tg4 offset residency |
| - hk: advertise semaphore extensions |
| - hk: advertise bufferDeviceAddressCaptureReplayEXT |
| - hk: fix unaligned copies |
| - libagx: clean up query copy; bug fix |
| - libagx: assert alignment for copies |
| - libagx: use common heap allocs |
| - asahi: fix printf without result buffer |
| - asahi: fix depth buffer feedback loops |
| - asahi: clang-format |
| - glsl_to_nir: upcast array indices |
| - nir/lower_blend: refactor logicop variables |
| - nir/lower_blend: disable logic ops for unsupported formats |
| - nir/lower_tex: use vector_insert_imm |
| - vk/meta: generalize 3D blit code |
| - asahi,hk: disentangle logicop_enable |
| - agx: optimize nonuniform offset |
| - hk: drop dead todo |
| - hk: fix last VGT output component limits |
| - hk: support colour <--> z/s copies |
| - hk: advertise VK_EXT_queue_family_foreign |
| - hk: advertise maintenance7 |
| - hk: advertise maintenance8 |
| - bin: add rebase mode |
| - drm-uapi: add asahi uapi |
| - asahi: disable virtio gpu for upstream |
| - asahi: port to stable uAPI |
| - asahi: remove unstable uapi header |
| - gallium: wire up asahi driver |
| - asahi: fix i/a queries with tess |
| - panfrost: invert and rename no_ubo_to_push flag |
| - panfrost: do not push "true" UBOs |
| - asahi: shrink VA space for sparse emulation |
| - asahi: add sparse emu helpers |
| - asahi: fix zero bo leak |
| - hk: bind for sparse emulation |
| - hk: use ro maps |
| - hk: advertise sparseResidencyBuffer |
| - hk: drop soft fault assumption in hk_buffer_addr_range |
| - util/simple_mtx: fix duplicate definition |
| - nir: unvendor lod_bias(_agx) |
| - nir: handle mismatched bias/lod bitsizes |
| - nir: add sampler LOD bias lowering |
| - hk: handle lod/min_lod size mismatch |
| - agx: switch to common LOD bias lowering |
| - hk: fix underbinding scratch |
| - hk: fix tessellation + clipper queries |
| - hk: fix null FS corner cases |
| - agx: model sources as late-kill in demand calcs |
| - agx: late-kill sources |
| - agx: early-kill sources only if it won't shuffle |
| - hk: fix patch count = 0 handling |
| - asahi: fix possible null deref |
| - asahi: do not use "Null" layout |
| |
| Andrew Wolfers (1): |
| |
| - vulkan: Add BGRA format support |
| |
| Antonino Maniscalco (2): |
| |
| - freedreno/crashdec: Avoid crashing on some traces |
| - freedreno/crashdec: Add check for \`CP_BV_SQE_STAT_ADDR\` |
| |
| Antonio Ospite (28): |
| |
| - ci/android: fix building deqp on Android |
| - ci/android: add an android-angle-lavapipe job |
| - ci/android: don't do unnecessary cleanup |
| - ci/android: don't set HOME globally in cuttlefish-runner.sh |
| - ci/android: disable audio in cuttlefish, it is not needed |
| - ci/android: disable modem simulator in cuttlefish, it is not needed |
| - ci/android: stop cuttlefish before copying the logs, to log everything |
| - ci/android: increase the waiting time to stop the cuttlefish launcher |
| - ci/android: bump CUTTLEFISH_BUILD_NUMBER |
| - ci/android: use an x86_64_only cuttlefish image |
| - ci/android: push /deqp-gles/mustpass/egl-main.txt.zst |
| - ci/android: pass -vsock_guest_cid to launch_cvd |
| - ci/android: move all dEQP handling in one place in cuttlefish-runner.sh |
| - ci/android: add an android-deqp-runner.sh script |
| - ci/android: add a job using android-cts instead of deqp-runner |
| - ci/android: don't set EGL_PLATFORM on the host before launching cuttlefish |
| - ci/android: increase the waiting time to stop the cuttlefish launcher |
| - ci/android: factor out a generic android-runner.sh from cuttlefish-runner.sh |
| - ci/android: remove some unnecessary adb commands from android-runner.sh |
| - ci/android: check for ANGLE_TAG in android-deqp-runner.sh |
| - ci/android: remove old mesa and ANGLE libraries before pushing new ones |
| - ci/android: always push ANGLE libraries |
| - ci/android: push also the intel vulkan driver |
| - ci/android: handle ANGLE being installed under /system on Android 15+ |
| - ci/android: factor out GLES runtime version retrieval to a function |
| - ci/android: only ship x86_64 artifacts in debian-android build job |
| - ci/android: strip the artifacts of the debian-android job |
| - meson: pass --no-pager to \`meson configure\` command |
| |
| Ashley Smith (2): |
| |
| - panfrost: Reset syncobj after use to avoid kernel warnings |
| - pan/bi: Enable ARB_shader_clock extension support |
| |
| Assadian, Navid (1): |
| |
| - amd/vpelib: More parameters to the segmentation process and introduce validation hook |
| |
| Autumn Ashton (5): |
| |
| - radv: Fix compute resolve rounding |
| - radv: Enable fragmentShadingRateWithCustomSampleLocations |
| - radv: Expose EXT_sample_locations everywhere |
| - radv: Expose VK_SAMPLE_COUNT_1_BIT for sample position on GFX10+ |
| - ci/zink+radv: Add sample locations fails |
| |
| Bas Nieuwenhuizen (1): |
| |
| - radv: Move support check out of winsys. |
| |
| Benjamin Cheng (1): |
| |
| - d3d12/video_enc: Fix AV1 tile_info() coding |
| |
| Benjamin Lee (41): |
| |
| - panfrost: minor refactors in preparation for panvk 16-bit IO |
| - panfrost: move handling for bifrost mediump lowering to pan_collect_varyings |
| - panfrost: support 16-bit vertex attributes |
| - panfrost: support 16-bit varyings |
| - panvk: advertise storagePushConstant16 and storageInputOutput16 |
| - panvk: enable VK_KHR_depth_stencil_resolve |
| - panvk: enable KHR_separate_depth_stencil_layouts |
| - panvk: document missed extensions in new_features |
| - panvk: implement VK_EXT_separate_stencil_usage |
| - panfrost: remove NIR_PASS_V usage for noperspective lowering |
| - meson: update wayland-protocols source_hash |
| - panfrost: fix large int32->float16 conversions |
| - panfrost: reorder lower_bit_size pass |
| - panfrost: fix condition in bi_nir_is_replicated |
| - panfrost/va: remove swizzle mod from LDEXP |
| - panfrost: implement 16-bit ldexp |
| - panvk: advertise shaderFloat16 |
| - panfrost: implement 16-bit pack/unpack intrinsics |
| - panvk/csf: fix uninitialized read in utrace_clone_init_builder |
| - panfrost/pps: fix omitting several counters |
| - panfrost: fix libpan_v4 header include |
| - panfrost/va: remove dead code for packing BRANCHZI.i16 lane mod |
| - driconf: add uint64 type |
| - panvk: add core mask driconf options |
| - panfrost: add core mask driconf options |
| - pan/va: add roundmode modifier to additional instructions |
| - panfrost: implement float controls rounding mode |
| - pan/va: preserve signed zero in f32->f16 conversions |
| - pan/bi: refactor bi_instr_ftz to allow dontcare FTZ states |
| - pan/bi: ignore ftz mode when scheduling int instructions |
| - pan/bi: implement denorm behavior float controls |
| - panvk/va: don't advertise independent denorm behavior |
| - panvk: advertise VK_KHR_shader_float_controls |
| - panvk: advertise VK_KHR_float_controls2 |
| - panfrost: define bi_swizzle alias values |
| - panfrost: add bi_swizzle values for unused swizzles |
| - panfrost/va: use 'lane' modifier for MKVEC.v2i8 |
| - panfrost: use bifrost instruction table for bi_lower_swizzle |
| - panfrost/va: valhall-specific swizzle lowering |
| - panvk: enable uniformAndStorageBuffer8BitAccess and storagePushConstant8 |
| - panvk: don't advertise VK_KHR_float_controls2 on bifrost |
| |
| Benjamin Otte (1): |
| |
| - lavapipe: Don't advertise support for multiplane drm formats |
| |
| Benjamin ROBIN (2): |
| |
| - util/disk_cache: Do not try to delete old cache if cache is disabled |
| - docs: Update MESA_SHADER_CACHE_DIR env variable behavior |
| |
| Bo Hu (2): |
| |
| - gfxstream: Adding support for VK_KHR_global_priority extension |
| - gfxstream: update code generator for simplified dep graph |
| |
| Boris Brezillon (29): |
| |
| - panvk: Don't clobber registers if the render pass was suspended |
| - pan/decode: Fix the blend_count mask |
| - panvk/csf: Don't free the resources twice when init_render_desc_ringbuf() fails |
| - panvk: Initialize device virtual address space after the VM creation |
| - vulkan/state: Fix input attachment map state initialization/copy |
| - vk/pass: Add input attachment location info |
| - vulkan/state: Fix default input attachment map values |
| - panvk/jm: Don't force a preload if the previous batch didn't have draws |
| - pan/bi: Allow depth/stencil tile buffer access using LD_TILE |
| - pan/bi: Pass load_converted_output_pan target through a source |
| - pan/bi: Pass an explicit sampleid to load_converted_output_pan |
| - panvk/csf: Set invalidate_inherited_ctx only if the render pass is inherited |
| - panvk: Re-order things in panvk_deserialize_shader() |
| - panvk: Isolate CS specific bits in panvk_shader |
| - panvk: Generate the earlyzs LUT at shader creation time |
| - pan/earlyzs: Support the shader ZS read-only case and its optimization on v10+ |
| - panvk/jm: Move cmd_prepare_draw_sysvals() out of the layer loop |
| - panvk: Support color attachment remapping |
| - pan/bi: Introduce two intrinsics to support input attachment remapping |
| - panvk: Optimize input attachment loads when we can |
| - panvk: Skip BY_REGION barriers if we're in a render pass |
| - panvk: Advertise KHR_dynamic_rendering_local_read support |
| - pan/va: Support relaxed waits on read-only render targets |
| - panvk/csf: Optimize read-only tile buffer access |
| - panfrost: also consider z/s for tile-size |
| - pan/earlyzs: Fix the read-only ZS optimization |
| - panvk/csf: Pass less arguments to emit_vs_attrib() |
| - panvk/csf: Fix instance attribute offseting |
| - pan/format: Disable image storage on A8_UNORM |
| |
| Caio Oliveira (112): |
| |
| - intel/brw: Move fs_inst implementation code together |
| - intel/brw: Rename fs_inst to brw_inst |
| - intel/brw: Rename fs_inst_box to brw_inst_box |
| - intel/brw: Move some larger functions from brw_inst.h to brw_inst.cpp |
| - intel/brw: Remove brw_gs_compile struct |
| - intel/brw: Rename file brw_fs_nir.cpp to brw_from_nir.cpp |
| - intel/brw: Remove 'fs' prefix from brw_from_nir functions |
| - anv: Add statistic for 'GRF registers' for Xe3+ |
| - intel/brw: Add brw_analysis.h |
| - intel/brw: Use brw_analysis prefix for liveness analysis files |
| - intel/brw: Use brw_analysis prefix for def analysis file |
| - intel/brw: Use brw_analysis prefix for performance analysis files |
| - intel/brw: Move idom_tree declaration to brw_analysis.h |
| - intel/brw: Move analysis passes without own file to brw_analysis.cpp |
| - intel/brw: Merge brw_ir_analysis.h into brw_analysis.h |
| - intel/brw: Use brw prefix instead of namespace for dep analysis enum |
| - intel/brw: Use brw prefix instead of namespace for analysis implementations |
| - intel/brw: Remove 'using namespace brw' directives |
| - intel/brw: Remove offsets and total_size from VGRF allocator |
| - intel/brw: Pass fs_visitor around instead of the simple_allocator |
| - intel/brw: Add functions to allocate VGRF space |
| - intel/brw: Fold simple_allocator into the shader |
| - intel/brw: Rename and move thread_payload types to own header |
| - intel/brw: Merge brw_fs_visitor.cpp into brw_fs.cpp |
| - intel/brw: Rename files brw_fs.cpp/h to brw_shader.cpp/h |
| - intel/brw: Rename fs_visitor to brw_shader |
| - intel/brw: Rename fs_copy_prop_dataflow to brw_copy_prop_dataflow |
| - intel/brw: Rename a few remaining functions to remove fs prefix |
| - intel/brw: Update outdated comments |
| - intel/compiler: Use -Werror=vla |
| - intel/brw: Use the builder DO() function in all places |
| - intel/brw: Always have a (non-DO) block after a DO in the CFG |
| - intel/brw: Don't need to repair CFG in brw_opt_combine_constants |
| - brw: Reformat brw_gram.y and brw_lex.l |
| - brw: Fix size in assembler when compacting |
| - brw: Always verify EU compaction in debug mode |
| - brw: Remove unused function |
| - brw: Add block pointer in brw_inst |
| - brw: Use brw_inst::block in Def analysis |
| - brw: Use brw_inst::block in Combine Constants |
| - brw: Use brw_inst::block in CSE |
| - brw: Remove block parameter from brw_inst::remove() |
| - brw: Simplify brw_builder "insert before inst" constructor |
| - brw: Add explicit way to get an empty brw_builder |
| - brw: Embed at_end() inside brw_builder(brw_shader \*) constructor |
| - brw: Remove bblock_t parameters from various passes |
| - brw: Make assembler strict about JIP and UIP order |
| - brw: Add support for GOTO/JOIN in the assembler |
| - brw: Rework label tracking in assembler |
| - brw: Remove extra interface in brw_cfg types |
| - brw: Remove dead code from control flow |
| - brw: Add missing dependency classes to various passes |
| - brw: Get the reference to brw_def_analysis only once in saturate propagation |
| - brw: Move defs analysis back to its place in saturate propagation |
| - brw: Simplify the test code for brw passes |
| - brw: Add assembler support for DPAS |
| - brw: Remove extra SHADER_OPCODE_FLOW emitted during NIR conversion |
| - spirv: Update headers and metadata from latest Khronos commit |
| - vulkan: Update XML and headers to 1.4.311 |
| - spirv: In SpecConstantOp handling don't adjust convert ops bit_size twice |
| - brw: Fix decoding of 3-src destination stride in EU validation |
| - brw: Allow generating destination with stride 2 in 3-src instructions |
| - brw: Remove prefix gfx10 from enum types |
| - brw: Make some integer check more explicit |
| - brw: Add BRW_TYPE_BF for bfloat16 |
| - brw: Add BRW_TYPE_BF validation |
| - brw: Consider bfloat16 in scoreboard |
| - brw: Add EU assembler support for bfloat16 |
| - intel/executor: Add small example with bfloat |
| - brw: Expand EU validation for DPAS |
| - brw: Fix disassembler trying to decode 3src_hstride in Gfx9 |
| - brw: Remove brw_cfg::dump() |
| - brw: Track num_instructions in a block |
| - brw: Use block's num_instructions in scoreboard tests |
| - brw: Track total_instructions in a shader |
| - brw: Add analysis for block IP ranges |
| - brw: Use brw_ip_ranges in passes |
| - brw: Use brw_ip_ranges in scheduling / regalloc |
| - brw: Use brw_ip_ranges in liveness analysis |
| - brw: Remove adjust_block_ips and brw_inst::remove() with defer |
| - intel/executor: Use getopt for command line arguments |
| - intel/executor: Allow selecting a device to use |
| - brw: Use control flow helpers in scoreboard tests |
| - brw: Use SIMD16 shaders in scoreboard tests for Xe2+ |
| - brw: Stop setting SFID in scoreboard tests |
| - brw: Return actual progress in brw_lower_scoreboard |
| - brw: Add parser of SWSB annotations to use in tests |
| - brw: Use new bld/exp style in scoreboard tests |
| - brw: Remove HSW specific code from brw_compile_cs.cpp |
| - brw: Add brw_builder::uniform() |
| - brw: Fix invalid memory access in scoreboard test |
| - brw: Fix memory leak in EU validation tests |
| - intel/executor: Fix check for open() failure |
| - brw: Properly handle cooperative matrices created with constants |
| - spirv: Take a separate data_type when creating a new vtn_variable |
| - brw: Add a few basic tests for register coalesce |
| - brw: Clean up saturate propagation after non-defs version removal |
| - brw: Add brw_range struct |
| - brw: Use brw_range in IP ranges analysis |
| - brw: Use brw_range when operating with live ranges |
| - brw: Use brw_range to store VGRF ranges |
| - brw: Use brw_range to store Vars ranges |
| - brw: Use brw_range::last() to explicitly get the last valid IP |
| - brw: Make brw_range use half-open ranges |
| - broadcom/ci: Skip test due to timeout |
| - nir/load_store_vectorize: Skip new bit-sizes that are unaligned with high_offset |
| - intel: Add intel_device_info::has_systolic |
| - intel: Disable has_bfloat16 for MTL |
| - brw: Allow DPAS with BF on Gfx125 |
| - brw: Update EU validation to allow packed BF mixed with packed F |
| - intel/executor: Update bfloat example |
| - intel/executor: Fix bfloat example for converting F to packed BF |
| |
| Caleb Callaway (3): |
| |
| - anv: add INTEL_DEBUG=rt_notrace |
| - intel/compiler: fix lingering i965 references |
| - intel/tools: fix 32b build for EU stall tool |
| |
| Casey Bowman (2): |
| |
| - vulkan/screenshot-layer: Add RGBA surface format support |
| - vulkan/screenshot-layer: Correct queueFamilyIndex source |
| |
| Caterina Shablia (11): |
| |
| - panvk: enable imagelessFramebuffer |
| - panvk: enable uniformBufferStandardLayout |
| - nir: teach nir_lower_bit_size to handle ballot and ballot_relaxed |
| - pan/bi: ensure src bit sizes of ballot{,_relaxed} and read_invocation |
| - pan/bi: lower non-32-bit ballot{,_relaxed} |
| - pan/bi: lower subgroups before lowering int64 |
| - panvk: enable subgroupExtendedTypes |
| - panfrost: don't overwrite push uniforms and sysvals UBO with user's UBO |
| - panfrost: update nr_uniform_buffers before dispatching XFB |
| - panfrost: require buffer_count and pushed_words to be passed to panfrost_emit_const_buf |
| - panfrost: move the comment closer to what it's about |
| |
| Charmaine Lee (1): |
| |
| - svga: remove tgsi semantic in shader compiler key |
| |
| Christian Gmeiner (20): |
| |
| - docs: Update perfetto with the latest status |
| - docs: Update the list of drivers with CPU tracepoints |
| - docs: Add perfetto driver specifics for V3D |
| - etnaviv: isa: Add swizzle instruction |
| - nir: Add bool return value to nir_lower_clip_halfz(..) |
| - nir: Add bool return value to nir_lower_texcoord_replace(..) |
| - etnaviv: nir: Return progress for etna_lower_alu(..) |
| - etnaviv: nir: Return progress for etna_lower_io(..) |
| - etnaviv: nir: Don't use deprecated NIR_PASS_V macro anymore. |
| - zink/ntv: Only emit GeometryStreams cap if multiple streams are used |
| - etnaviv/ci: Bring back GC7000 |
| - etnaviv/ci: Start using the revision number for GPU_VERSION |
| - etnaviv/ci: Update flakes for gc7000-r6214 |
| - etnaviv/ci: Add gles2 run for GC3000 |
| - etnaviv/ci: Add missing rev to GC3000 |
| - etnaviv: rs: Factor out box alignment function |
| - etnaviv: rs: Change param etna_get_rs_alignment_mask(..) |
| - etnaviv: Add support for KHR_partial_update |
| - mesa/formats: Add MESA_FORMAT_NV12 |
| - etnaviv: Add multi-planar YUV support |
| |
| Colin Marc (1): |
| |
| - vulkan/wsi: implement the Wayland color management protocol |
| |
| Collabora's Gfx CI Team (7): |
| |
| - Uprev Piglit to fc8179d319046f45346bcbcc5aaeabebdf151f03 |
| - Uprev Piglit to 04d901e49de6b650f9dceaf73220371273d87f73 |
| - Uprev Piglit to 708a9e365b18fdd881af989f75e1a6c1409cae8c |
| - Uprev ANGLE to 1b34d2a18af12cc55a3bc74dd679c2937d10cc5c |
| - Uprev ANGLE to 3818d37d5e94317f01810053b8f28c1f1e8b98e6 |
| - Uprev Piglit to 0ecdebb0f592 |
| - Uprev ANGLE to a3f2545f6bb3 |
| |
| Connor Abbott (52): |
| |
| - tu: Constify frag_areas argument of tu_fdm_bin_apply_t |
| - tu: Refactor fragment density map sampling |
| - freedreno: Add VSC mask parameter to CP_SET_BIN_DATA5 |
| - tu: Implement bin merging for fragment density map |
| - freedreno: Add a750+ "absolute" VSC bin mask |
| - tu: Implement "absolute" bin mask on a750 |
| - tu: Make sure tiles being merged are adjacent |
| - tu: Fix vertical tile merging check |
| - tu: Fix static blend bandwidth calculation |
| - tu: Remove useless prim_order state merging |
| - ir3: Fix const allocation when parsing assembly |
| - freedreno/decode: Push hostaddr->gpuaddr conversion into highlight_gpuaddr() |
| - freedreno/crashdec: Use register for RB rptr |
| - freedreno/crashdec: Handle hangs where the SQE is in RB |
| - freedreno/crashdec: Fix and extend control reg dumping on a750 |
| - freedreno/crashdec: Dump CP_BV_SQE_UCODE_DBG |
| - freedreno: Misc control registers updates |
| - tu: Implement VK_KHR_maintenance7 |
| - tu: Plumb through VkMemoryBarrierAccessFlags3KHR |
| - tu: Handle D32S8 -> R32 and R32 -> D32S8 copies |
| - tu: Handle R8->D24S8 and D24S8->R8 copies |
| - tu: Handle a pipelineStageCreationFeedbackCount of 0 |
| - ir3: Remove ir3_info::data |
| - tu: Expose VK_KHR_maintenance8 |
| - compiler/shader_info: Better document require_full_quads |
| - nir, compiler: Rename needs_quad_helper_invocations |
| - compiler,nir: Gather needs_full_quad_helper_invocations info |
| - ir3: Use needs_full_helper_invocations |
| - tu: Fix binning_possible detection with bin merging |
| - tu: Fix GMEM offset for multisample layered separate stencil |
| - tu: Enable GMEM with layered rendering |
| - tu: Fix size of frag_size_ir3 and frag_offset_ir3 driver params |
| - tu: Fix reported FDM fragment size with multiview |
| - tu: Fix layer_count with dynamic rendering + multiview |
| - ir3: Split mad with scalar ALU |
| - tu: Only allow power-of-two fragment areas |
| - tu: Split out part of tiling config to vsc config |
| - tu: Fix CmdClearAttachments with fragment density map |
| - tu/fdm: Skip some patchpoints when binning |
| - tu: Implement VK_QCOM_fragment_density_map_offset |
| - vk/runtime: Use vk_command_buffer in renderpass wrappers |
| - vk/runtime: Add common CmdEndRendering |
| - tu: Implement VK_EXT_fragment_density_map_offset |
| - ir3: Pass through access flags when lowering global accesses |
| - nir/opt_preamble: Handle load_global_ir3 |
| - ir3: Move load/store vectorization to finalize |
| - ir3: Vectorize shared memory loads/stores |
| - tu: Fix flushing when using a staging buffer for copies |
| - freedreno: Add compute_lb_size device info |
| - freedreno/a6xx: Define CONSTANTRAMMODE |
| - freedreno/a6xx, turnip: Set CONSTANTRAMMODE correctly |
| - ir3: Take LB restriction on constlen into account on a7xx |
| |
| Corentin Noël (12): |
| |
| - venus/ci: Skip flaky test due to intermittent timeouts |
| - ci: Update CrosVM and Virglrenderer |
| - lavapipe: Remove doubly initialized features |
| - lavapipe: Change lvp_cmd_type to anonymous enum |
| - ci: Update CrosVM and Virglrenderer |
| - virgl: nir: Don't use deprecated NIR_PASS_V macro anymore |
| - nir: Add bool return value to nir_legacy_trivialize(..) |
| - ntt: nir: Don't use deprecated NIR_PASS_V macro anymore |
| - ttn: nir: Don't use deprecated NIR_PASS_V macro anymore |
| - ci: Uprev virglrenderer to latest version on time |
| - virgl: Close handle on resource info failure |
| - virgl: Use drmCloseBufferHandle instead of calling drmIoctl directly |
| |
| Daniel Schürmann (43): |
| |
| - aco/isel: fix empty exec tracking for uniform branches |
| - aco/isel: move cf_info into separate struct cf_context |
| - aco/isel: rename cf_context::has_divergent_branch |
| - aco/isel: use cf_context in if_context to restore cf information |
| - aco/isel: use cf_context in loop_context to restore cf information |
| - aco/isel: add and use exec_info::empty() helper |
| - aco/isel: fix assumptions about potential empty exec mask in nested control flow |
| - aco/isel: remove loop nest information from exec_info |
| - aco/isel: track control flow divergence in loops more accurately |
| - nir: make divergence information metadata |
| - nir: require nir_metadata_divergence if needed |
| - amd: switch to nir_metadata_divergence |
| - intel: switch to nir_metadata_divergence |
| - nouveau: switch to nir_metadata_divergence |
| - nir: only print divergence information if metadata is valid |
| - nir/serialize: don't serialize divergence information |
| - nir/validate: validate divergence metadata |
| - nir/print: print phi sources sorted by predecessor blocks |
| - aco/scheduler: always respect min_waves on GFX10+ |
| - aco/scheduler: stop rounding down the target number of waves on GFX10+ |
| - aco: unify get_addr_sgpr_from_waves() and get_addr_vgpr_from_waves() into one function |
| - aco/scheduler: keep track of RegisterDemand at UpwardsCursor::insert_idx |
| - aco/scheduler: keep track of RegisterDemand at DownwardsCursor::insert_idx{_clause} |
| - aco/scheduler: remove unused include of unordered_set |
| - aco/insert_exec_mask: Don't immediately set exec to zero in break/continue blocks |
| - aco/insert_exec_mask: don't restore exec in continue_or_break blocks |
| - aco/lower_branches: allow other instructions after s_andn2 in break blocks |
| - aco/ssa_elimination: insert parallelcopies for p_phi immediately before branch |
| - aco/ssa_elimination: refactor scratch_sgpr handling |
| - aco/ssa_elimination: create a single parallelcopy instruction for linear and logical phis |
| - aco/assembler: Fix short jumps over chained branches |
| - aco/assembler: Don't insert chained branches into otherwise empty blocks |
| - aco/tests: add more tests for chained branches |
| - zink: lower {demote|terminate}_if to CF after lowering point smooth |
| - nir: remove nir_lower_discard_if_to_cf option |
| - nir: set SYSTEM_VALUE_HELPER_INVOCATION read for nir_intrinsic_is_helper_invocation |
| - zink: clean up HelperInvocation code emission |
| - zink: enable CapabilityDemoteToHelperInvocation and extension only if required |
| - aco/lower_branches: properly consider exec mask needs of branch targets |
| - aco: don't assume that demote doesn't cause an empty exec mask |
| - aco/insert_exec_mask: if applicable, use s_wqm to restore exec after divergent CF |
| - aco/insert_exec_mask: don't disable dead quads on demote in divergent CF |
| - aco: Remove empty exec skipping after demote |
| |
| Daniel Stone (29): |
| |
| - ci/vvl: Use appropriate level of parallelism |
| - ci: Move common testing packages to test-base |
| - ci: Move apitrace to test-base container |
| - ci: Add debian/arm32_test-* images |
| - ci: Build libdrm for Android host builds |
| - ci/angle: Use native toolchain for builds |
| - ci/zink: Expand flake definition on radv |
| - ci: Update kernel for Panthor scheduling fixes |
| - ci/panfrost: Move G610 testing to pre-merge |
| - ci: Re-enable Collabora CI |
| - docs/ci: Fix nginx caching instructions |
| - ci/lava: Fail faster when getting results |
| - etnaviv: Add missing build dependency on generated header |
| - ci/amd: Add new radeonsi fail seen in the wild |
| - ci/zink: Flake out sparse tests |
| - ci: Add daniels to restricted-trace users |
| - ci: Re-enable trace jobs with updated Piglit |
| - ci: Move softpipe issue from fail to flake |
| - ci/amd: Disable radv-fossils |
| - ci/windows: Don't copy non-existent libglapi.dll |
| - ci/panfrost: Split inherit definitions into -inc |
| - ci: Reduce build job timeouts |
| - ci/softpipe: Add timeouts to softpipe jobs |
| - ci/llvmpipe: Add llvmpipe and lavapipe timeouts |
| - ci/virgl: Add timeout to software jobs |
| - ci/docs: Add timeout to doc build |
| - ci/microsoft: Add job timeouts and pin Piglit to GSt |
| - ci: Make all job timeouts explicit |
| - doc/ci: Update nginx caching snippets |
| |
| Danylo Piliaiev (28): |
| |
| - u_trace: print tracepoint params in csv output |
| - util: Make debug_dump_flags thread safe |
| - util: Add dump_debug_control_string to dump debug_control |
| - tu: Add gmem disable reason to renderpass tracepoint |
| - tu: Add info about debug options to command buffer tracepoint |
| - tu: Get correct src view when storing gmem attachment |
| - freedreno/fdl: Log mutability when dumping layout |
| - tu: Handle mismatched mutability when resolving from GMEM |
| - tu/a6xx: Emit VSC addresses for each bin to restore after preemption |
| - nir/nir_lower_multiview: Don't assert if load_deref doesn't have var |
| - ir3: Set need_full_quad depending on info.fs.require_full_quads |
| - tu: Be more granular in calculating whether blend state reads color |
| - tu: Fix NULL deref in trace_end_render_pass |
| - freedreno/regs: Rename BINNING bit to FS_DISABLE in a few regs |
| - ir3: Detect empty fragment shaders |
| - tu/a7xx: Bypass invoking empty FS for D/S-only draw calls |
| - ir3: Detect FS that write only color without other side effects |
| - tu: Disable FS in certain cases even if FS is not empty |
| - tu/lrz: Improve LRZ around stencil tests and reads_dest cases |
| - tu: Use EARLY_Z also for stencil tests |
| - ir3: Fix shaders that write only color classified as empty |
| - freedreno: Bump kernel uapi (linux 6.14) |
| - freedreno,tu: Read and pass to compiler uche_trap_base |
| - tu: Implement VK_KHR_shader_clock |
| - freedreno/a6xx: Implement ARB_shader_clock |
| - tu,freedreno: Don't fallback to LINEAR with DRM_FORMAT_MOD_QCOM_COMPRESSED |
| - ir3: VARYING_SLOT_LAYER output is used for binning |
| - tu: Fix disable_fs state update condition |
| |
| Dave Airlie (24): |
| |
| - radv/video: move encoder to using a buffer instead of an image |
| - radv/video: calculate colloc buffer size for h264 B frames. |
| - radv/video: add h264 b frame encoding support. |
| - vulkan/wsi/x11: don't use update_region for damage if not created |
| - nak: adjust latencies on fp16/64 instructions on Turing |
| - nvk: enable float16 on turing. |
| - loader/nouveau: load zink as the GL driver for turing and above. |
| - radv/video: don't try and send events on UVD devices. |
| - vulkan/video: add simple parameter retrieval wrappers |
| - radv/video: convert to using common parameter wrappers. |
| - anv/video: convert to common parameters retrieval code |
| - anv/video: don't write to params if not set. |
| - vulkan/video: add support for inline session parameters. |
| - radv: expose VK_KHR_video_maintenance2 |
| - anv: expose VK_KHR_video_maintenance2 |
| - gallivm: check for avx512vbmi and tell LLVM the correct answer. |
| - nak: add reads after setting writes |
| - nak: Add an a_has_pred parameter to waw_latency |
| - nak: Add Turing latency information |
| - nak: Add Ampere and Ada latency information |
| - nvk: update nvidia class header files. |
| - nvk: add ADA compute class to nv_push_dump |
| - nvk: add hopper support to nv_push_dump |
| - nak: add F2FP to sm75 instr latencies |
| |
| David Rosca (64): |
| |
| - ac/vcn_dec: Fix AV1 film grain on VCN5 |
| - radeonsi/video: Avoid stream handle duplicates in PID namespace |
| - frontends/va: Don't try to switch to protected buffer in EndPicture |
| - frontends/va: Add CreateContext flag to enable protected context |
| - frontends/va: Require protected context for VAProtectedSliceDataBuffer |
| - frontends/va: Switch to graphics context when creating protected surface |
| - radeonsi/vcn: Use texture instead of video buffer for DPB buffers |
| - radeonsi/video: Fix creating video buffers with AMD_DEBUG=tmz |
| - frontends/vdpau: Set H264 chroma_format_idc |
| - radeonsi/vcn: Set correct chroma format for H264 decode |
| - radeonsi/uvd: Set correct chroma format for H264 decode |
| - ci/amd: Remove VAAPI skips |
| - frontends/va: Use transfer stride and offset in DeriveImage |
| - radv/video: Fix setting balanced preset for HEVC encode with SAO enabled |
| - radv/video: Move IB header from begin/end to encode_video |
| - radeonsi: Use gfx for TMZ buffer clears |
| - winsys/amdgpu: Add assert for secure submissions on compute ring |
| - frontends/vdpau: Use extra reference buffer for AV1 film grain |
| - ac/surface: Only allow linear modifier for subsampled 422 formats |
| - ac/surface: Allow DCC for multi-plane formats on GFX12 |
| - radeonsi/vcn: Fix chroma pitch for JPEG decode |
| - radeonsi/video: Allocate video buffers with modifiers |
| - radeonsi/vcn: Add UDT support for VCN5 |
| - radeonsi/vcn: Rework decode ref handling |
| - radeonsi/video: Fix crash when creating buffers without modifiers support |
| - frontends/va: Set AV1 max_width/height to surface size |
| - frontends/vdpau: Set AV1 max_width/height to surface size |
| - Revert "radeonsi/vcn: Limit size to target size in AV1 decode" |
| - pipe: Remove PIPE_AV1_ENC_FRAME_TYPE_SHOW_EXISTING |
| - radeonsi/vcn: Set all pic params for H264 encode references |
| - radeonsi/vcn: Add radeon_enc_av1_picture_type |
| - radeonsi/vcn: Support H264 encode weighted_bipred_idc |
| - radeonsi/video: Remove mpeg12 shader decoder support |
| - gallium/vl: Fix video buffer supported format check |
| - Revert "frontends/vdpau: Alloc interlaced surface for interlaced pics" |
| - frontends/vdpau: Fix creating deinterlace filter for interleaved buffers |
| - frontends/va: Support A8R8G8B8 format for processing |
| - frontends/va: Use ARGB as default fourcc for RGB32 RT format |
| - frontends/va: Don't filter supported formats according to config RT format |
| - gallium/vl: Return YUV plane order for single plane formats |
| - radeonsi/video: Only allow 64K_S swizzle mode for VCN < 2.2 |
| - radeonsi/vce: Support old VCE firmware |
| - radeonsi/video: Allow DCC 256B block size with drm minor >= 63 |
| - gallium/vl: Fix rotation with scaling for compute shaders |
| - gallium/vl: Fix mirror with rotation for compute shaders |
| - frontends/va: Don't ignore rotation and mirror for conversions to RGB |
| - ac/parse_ib: Fix parsing output format on VCN5 |
| - ac/parse_ib: Parse VCN DYNAMIC_REFLIST_BUFFER |
| - radv: Use radv_format_to_pipe_format instead of vk_format_to_pipe_format |
| - radv: Add radv_format_description to remap 10/12bit formats to 16bit |
| - radeonsi/vcn: Disable AV1 unidir compound with rate control |
| - egl/x11: Fix swap interval setup |
| - radv/video: Fix msg header total size |
| - radv/video: Fix encode session info for VCN3+ |
| - radv/video: Use ac_vcn_enc_init_cmds |
| - radv/video: Always enable B pictures for H264 encode |
| - radeonsi/vpe: Don't try to flush cs from buffer_map |
| - radeonsi/vcn: Don't try to flush cs from buffer_map |
| - radeonsi/uvd,vce: Don't try to flush cs from buffer_map |
| - radeonsi/video: Remove cs argument from si_video_resize_buffer |
| - radeonsi/vpe: Use float division to get scaling ratio |
| - radeonsi/vpe: Fix process_frame return value |
| - radeonsi/vpe: Use studio range for YUV and full for RGB by default |
| - radeonsi/vcn: Fix decode target index for H264 interlaced streams |
| |
| David Tobolik (1): |
| |
| - rusticl/feat: LinkOnce ODR |
| |
| Dmitry Baryshkov (4): |
| |
| - freedreno/registers: allow skipping the validation |
| - meson: add freedreno (turnip) Vulkan to arm64 defaults |
| - meson: disable SIMD blake optimisations on x32 host |
| - mesa-clc: add an option to force inclusion of OpenCL headers |
| |
| Dmitry Osipenko (1): |
| |
| - virtio/vpipe: Correct vdrm_vpipe_connect() definition |
| |
| Dudemanguy (1): |
| |
| - treewide: remove unneeded executable bit in non-scripting files |
| |
| Dylan Baker (6): |
| |
| - intel: output a depfile with mesa_clc |
| - iris: Correctly set NOS for geometry shader state changes |
| - iris: fix handling of GL_*_VERTEX_CONVENTION |
| - intel/tools: deduplicate zlib_inflate function |
| - intel/tools: move ascii85_decode to common code |
| - intel/decoder: free memory in error case |
| |
| Ella Stanforth (4): |
| |
| - v3d/compiler: Implement load_output |
| - v3d: enable framebuffer fetch |
| - v3d: Fix fbfetch with discards. |
| - v3d/compiler: Fixup output types for all 8 outputs |
| |
| Emma Anholt (6): |
| |
| - mesa/ffvs: Skip doing redundant stores of .xyz when doing lighting calculation. |
| - ci/anv: Enable testing with Vulkan video encode/decode. |
| - docs: Drop some weird unhelpful text about DRI2. |
| - egl: Apply autopep8. |
| - egl: Retire NOK_swap_region support. |
| - egl: Retire NV_post_sub_buffer support. |
| |
| Emmanuel Gil Peyrot (1): |
| |
| - panvk: Initialize out array with the correct length |
| |
| Eric Engestrom (159): |
| |
| - VERSION: bump to 25.1 |
| - docs: reset new_features.txt |
| - docs: update calendar for 25.0.0-rc1 |
| - docs/android: drop libglapi.so now that it's gone |
| - ci/cuttlefish: drop \`rm libglapi.so` now that it's no longer loaded |
| - gfxstream: drop unnecessary semi-colons |
| - gfxstream: fix signedness of shifts |
| - gfxstream: drop dead variables |
| - gfxstream: use \`range` variable for its intended purpose |
| - gfxstream: mark unused variables as such |
| - docs: update calendar for 25.0.0-rc2 |
| - llvmpipe/tests: include math.h for INFINITY |
| - ci: don't run on tag pipelines |
| - ci: drop redundant condition |
| - ci: only trigger the CI for release managers when pushing to staging branch |
| - ci: run containers builds on staging branches |
| - ci/yaml-toml-shell-py-test: don't run on post-merge pipelines |
| - ci/yaml-toml-shell-py-test: run on direct push pipelines |
| - ci: finish sorting vars |
| - ci: rename generate-env.sh to export-gitlab-job-env-for-dut.sh |
| - docs: update calendar for 25.0.0-rc3 |
| - ci: debian-testing-ubsan is used by tests |
| - llvmpipe/ci: drop fraction for asan tests that take 1.5 min without fraction |
| - docs: add release notes for 25.0.0 |
| - docs: add sha sum for 25.0.0 |
| - docs: update calendar for 25.0.0 |
| - docs: add release calendar for 25.0.x cycle |
| - docs/releasing: fix "release schedule" subsections nesting |
| - docs/release-calendar: add 25.1 branchpoint & rc dates |
| - ci/alpine: pin the release to avoid random unexpected changes |
| - ci/alpine: control wayland & wayland-protocols versions |
| - ci: move shader-db test job from build jobs yaml to test jobs yaml |
| - ci/build: remove a couple of unnecessary "override needs: to the same value" |
| - ci/build: move .use-debian/x86_64_build out of the generic .meson-build and into the debian/x86_64 jobs |
| - ci/build: split meson-build into build-for-tests and build-only |
| - ci/build: add explicit build-for-tests or build-only to all jobs |
| - ci/build: lower the delay to start build-only jobs |
| - docs: update gitlab docs urls |
| - vtn_bindgen2: keep the printf blob local to avoid LTO issues |
| - lavapipe/ci: add vkd3d job |
| - ci/build: build-test the dri2 code |
| - ci/b2c: fix comment location |
| - ci/b2c: explain better why we don't clone mesa |
| - ci/b2c: use B2C_JOB_TEMPLATE directly |
| - ci/b2c: set default value for B2C_MACHINE_REGISTRATION_IMAGE in the job |
| - ci/b2c: set default value for B2C_TELEGRAF_IMAGE in the job |
| - ci/b2c: set default value for B2C_KERNEL_CMDLINE_EXTRAS in the job |
| - ci/b2c: split B2C_JOB_VOLUME_EXCLUSIONS in the jinja template |
| - ci/b2c: rename IMAGE_UNDER_TEST to B2C_IMAGE_UNDER_TEST |
| - ci/b2c: pass through all the B2C_* variable without renaming them |
| - ci/b2c: pass through all the CI_* variables as well |
| - ci/b2c: set the registry proxy from the job |
| - ci/b2c: use more readable "long" argument names |
| - v3d/ci: mark traces humus/AmbientAperture and humus/DynamicBranching3 as flaky |
| - docs: update calendar for 25.0.1 |
| - docs: add release notes for 25.0.1 |
| - docs: add sha sum for 25.0.1 |
| - wsi/x11: drop misleading reference to anv in var names |
| - meson: simplify video-codecs option parsing |
| - ci/deqp: backport fix for dEQP-VK.binding_model.buffer_device_address.* |
| - meson: announce that clover is deprecated (slated for removal) |
| - lvp/ci: document fixed tests |
| - lvp/ci: skip tests that are timing out (>1 min) |
| - lvp/ci: fix sorting of flakes |
| - lvp/ci: remove duplicate flakes (noticed after sorting) |
| - lvp/ci: document flakes seen over the last week |
| - freedreno/ci: document fixed test |
| - freedreno/ci: document flakes seen over the last week |
| - nvk/ci: fix sorting of flakes |
| - nvk/ci: document flakes seen over the last week |
| - zink+nvk/ci: document new failures |
| - zink+nvk/ci: fix sorting of flakes |
| - zink+nvk/ci: document flakes seen over the last week |
| - radv/ci: document flakes seen over the last week |
| - zink+radv/ci: fix sorting of flakes |
| - zink+radv/ci: document flakes seen over the last week |
| - ci: document http proxy bug & disable farms relying on it |
| - ci/container: fix image tags comment in trampoline script |
| - ci/init-stage2: document that only lava jobs upload results to s3 |
| - ci/build: no need to list all the files that will go into the uploaded artifact |
| - ci/piglit: drop usage of s3cp for a simple download |
| - ci: always abort if the curl download fails |
| - ci/baremetal: make sure we can follow redirects on s3 downloads |
| - ci: do a regular GET request for /done files, instead of HEAD |
| - ci: replace broken s3cp command with a simple curl call |
| - ci: bump image tags |
| - ci: drop placeholder-job tags to allow jobs to run |
| - ci: document new llvmpipe & softpipe failures since the migration |
| - ci: run shader-db & zink-lvp on kvm runners |
| - ci: disable llvmpipe & virgl traces jobs |
| - meson: do not compile libisaspec unless used |
| - meson: do not compile libblake3 unless used |
| - meson: do not compile libmesa_util_sse41 unless used |
| - pick-ui: clean up formatting |
| - pick-ui: fix enum value in test expectation |
| - pick-ui: add missing field in test expectation |
| - pick-ui: rename s/out/commit_message/ variable to make its contents clearer |
| - pick-ui: fix parsing of multiple \`backport-to:` lines |
| - docs: update calendar for 25.0.2 |
| - docs: add release notes for 25.0.2 |
| - docs: add sha sum for 25.0.2 |
| - rpi/ci: disable traces jobs |
| - ci: re-enable igalia (rpi) farm |
| - ci: document who are the farm admins |
| - docs: fix last references to gallium-drivers=swrast |
| - ci: remove last uses of deprecated \`swrast` alias for softpipe+llvmpipe |
| - meson: drop deprecated \`swrast` alias for softpipe+llvmpipe |
| - meson: move special value \`all` out of the middle of the list |
| - rpi5/ci: sort flakes |
| - rpi5/ci: drop duplicate flakes |
| - ci: bump apitrace version |
| - ci: drop packet.net tag on git archive job |
| - ci: move aarch64 tag to .use-debian/arm64_build |
| - ci: move android's kvm requirement to .use-debian/x86_64_test-android |
| - ci: add FDO_RUNNER_JOB_PRIORITY_TAG_* to control priority of generic freedesktop runners |
| - docs: remove the last 24.3 releases |
| - virgl: fix typo inverting a condition |
| - docs: update calendar for 25.0.3 |
| - docs: add release notes for 25.0.3 |
| - docs: add sha sum for 25.0.3 |
| - ci/build: drop LTO from fedora build |
| - ci: rename ci-tron priority tag to avoid conflict with the generic fdo runners |
| - ci/fluster: use http proxy when checking for the vector files |
| - ci/fluster: don't overwrite FLUSTER_VECTORS_HOST_PATH to a different meaning |
| - radeonsi/ci: update expectations |
| - radv/ci: update expectations |
| - freedreno/ci: update expectations |
| - nvk/ci: update expectations |
| - lvp/ci: update expectations |
| - zink+nvk/ci: update expectations |
| - zink+radv/ci: update expectations |
| - zink+tu/ci: update expectations |
| - zink+anv/ci: document a bunch of flaky glx tests that have been preventing merges all day |
| - docs/ci: replace deprecated \`pages` job with \`pages: true` |
| - docs/ci: add link to the website preview for convenience |
| - docs/ci: add comment explaining what that long rule actually does |
| - docs/ci: split pre-merge & merge pipeline rules |
| - docs/ci: follow convention of only running jobs by default for Marge |
| - ci: fix image tags indentation |
| - ci/deqp: fix vulkan video build |
| - VERSION: bump for 25.1.0-rc1 |
| - .pick_status.json: Update to d5ad7981401c2393cac38fc3215e8bbb97de06b9 |
| - .pick_status.json: Update to 1bf8542490679fa2a244e27fb2b04bbc3d122f74 |
| - meson: remove duplicate \`deprecated` for \`gallium-xa` option |
| - meson: remove duplicate \`deprecated` for \`power8` option |
| - .pick_status.json: Update to 5f3a3740dcc6d243f2ef14138fb1c09bcbb9b5fd |
| - pick-ui: make \`Backport-to: 25.0` backport to 25.0 \*and more recent release branches* |
| - .pick_status.json: Update to dd3e1190a2bdbc6b996152510407adb9a8cb5618 |
| - pick-ui: add missing dependency |
| - [25.1 only] ci: don't treat misleading-indentation warnings as errors on alpine |
| - VERSION: bump for 25.1.0-rc2 |
| - .pick_status.json: Update to 3493500abb78a4dc22aba14840bba5c777fde745 |
| - .pick_status.json: Update to eeffb4e674d10db9aefebeca91c2d87c1676b81e |
| - VERSION: bump for 25.1.0-rc3 |
| - .pick_status.json: Update to 615d0c9669595adf114a705f5b8ee88277aa99f2 |
| - .pick_status.json: Update to 7f0de1a51212881c9a7614327bf3e1fbc9784ddb |
| - ci: drop tracking of removed folder |
| - .pick_status.json: Update to c434050a0088ec3f07d63fd1019aea541632faed |
| - .pick_status.json: Update to 84b9c281fe82dd66f2552687cecb61a8e22809d0 |
| |
| Eric R. Smith (7): |
| |
| - panfrost: avoid potential divide by 0 calculating timer_resolution |
| - panfrost: fix YUV center information for 422 |
| - panfrost: fix backward propagation of values in loops |
| - panfrost: use an accessor function to read from bi_opcode_props |
| - panfrost: consider xfb shader when calculating thread local storage size |
| - panfrost,lima: use index size in panfrost minmax_cache |
| - panfrost: fix transaction elimination crc valid calculation |
| |
| Erico Nunes (2): |
| |
| - ci: re-enable lima farm |
| - panvk: disable VK_EXT_image_drm_format_modifier for arch < 10 |
| |
| Erik Faye-Lund (57): |
| |
| - panvk: fix line-rasterization of bifrost |
| - panvk: report strictLines as true |
| - panvk/ci: add back incorrectly removed crash |
| - pan/ci: add flaky tests to the flake-list |
| - pan/ci: add fail from llvm 19 upgrade |
| - pan/ci: add a couple of common flakes |
| - panvk: correct number of read bytes for dynamic buffers |
| - meson: rename meson_options.txt |
| - panvk: report passing the VK CTS |
| - panvk: rename helper |
| - mesa/main: wire up glapi bits for EXT_multi_draw_indirect |
| - pan/bi: use unreachable instead of DBG + assert |
| - pan/bi: remove unused debug output |
| - pan/genxml: rename field |
| - panfrost: respect pipe_rasterizer_state::line_rectangular |
| - panvk: disable shaderFloat16 on bifrost |
| - docs/features: add missing panvk feature |
| - docs/features: add VK_EXT_hdr_metadata |
| - panvk: fix extension requirement |
| - panvk: rework how we deal with extension-reqs |
| - panvk: expose VK_KHR_display |
| - panvk: expose EXT_display_control |
| - panvk: correct VkPhysicalDeviceProperties::deviceName |
| - panvk: enable KHR_line_rasterization support |
| - panvk: add basic driconf infrastructure |
| - panfrost: avoid accidental aliasing |
| - panfrost: fix overflow-debugging |
| - panfrost: use real array for panfrost_emit_plane |
| - panvk: check for texture-compression support |
| - panvk: expose textureCompressionBC when supported |
| - mesa/main: fix regression in extension-checking |
| - panvk/ci: disable some more slow tests |
| - docs/features: update panvk support |
| - panfrost: fixup typo in 16x sample-pattern |
| - panfrost: correct tile-buffer size for some v7 GPUs |
| - panvk/ci: move timeouts to crash |
| - panfrost: properly align value |
| - panfrost: allocate tile-buffer for dummy render-targets |
| - panfrost: disable tile-pipelining when needed |
| - panfrost: add color-attachment and msaa helpers |
| - panvk: enable 8x and 16x msaa when supported |
| - panvk: enable sampledImageIntegerSampleCounts for all MSAA formats |
| - nir/lower_tex: use texture_mask instead of shifting on use |
| - nir/lower_tex: avoid undefined-behavior |
| - panvk: set shared_addr_format |
| - panvk: enable KHR_spirv_1_4 on v10+ |
| - panvk: claim official conformance on v10 |
| - docs/panfrost: use anonymous hyperlinks |
| - panvk: enable dualSrcBlend |
| - docs/panvk: fixup extension support |
| - docs/panvk: remove disabled extension |
| - docs/panvk: fixup docs around float controls |
| - docs/panvk: add VK\_-prefix for extension name |
| - docs/panvk: document ycbcr in terms of extensions |
| - docs/panvk: document EXT extension as well |
| - docs/panvk: add missing new features |
| - panvk: support vulkan 1.2 on v10+ |
| |
| Ernst Persson (1): |
| |
| - intel/vulkan: Add bvh build dependency |
| |
| Faith Ekstrand (165): |
| |
| - nvk,nak: Only use u64 texture handles with codegen |
| - nvk: Only pass sampler handles when needed |
| - nak: Add support for bound and cbuf textures |
| - nak: Optimize bindless to cbuf textures on Volta+ |
| - nak: Fix cbuf textures |
| - nak: Stop setting .EF on tex ops |
| - nak: Rename MemEvictionPriority::Unchanged to LastUse |
| - nak: Add more MemEvictionPriorities |
| - nak: Print .dc for OpTld4::z_cmpr |
| - nak: Add MemEvictionPriorities to tex ops |
| - nvk: Fix scissor bounds |
| - nvk: Fix a typo in a comment |
| - nak/repair_ssa: Use a worklist for get_ssa_or_phi() |
| - nvk: Rename nvk_descriptor_set::mapped_ptr |
| - nvk: Respect VK_DESCRIPTOR_POOL_CREATE_HOST_ONLY_BIT_EXT |
| - nvk: Implement descriptorBufferPushDescriptors |
| - nvk: Pull shaders from the state command buffer in nvk_cmd_process_cmds() |
| - nvk: Handle shader==NULL in nvk_cmd_upload_qmd() |
| - nvk: Allow sparse loads on EDB buffers |
| - nak: Handle sparse texops with unused color destinations |
| - nvk: Use suld for EDB uniform texel buffers |
| - nvk: Align UBO/SSBO addresses down rather than up |
| - nak: Use suld.constant when ACCESS_CAN_REORDER is set |
| - nvk: Use suld.constant for EDB uniform texel buffers |
| - nvk: Constify instance and pdev pointers |
| - ci: Remove some NVK vkd3d fails |
| - nak: Only use suld.constant on Ampere+ |
| - nak: Use MemScope::GPU instead of MemScope::System |
| - zink: Use the correct array size for signal_values[] |
| - zink: Use persistent semaphores for PIPE_FD_TYPE_SYNCOBJ |
| - nvk/nvkmd: Fix logging of VA bind addresses |
| - nvk: Don't bind a fragment shading rate image pre-Turing |
| - nvk: Do not set INVALIDATE_SKED_CACHES pre-MaxwellB |
| - nak: Handle tex ops with only one source |
| - nak/nir: Don't provide dummy backend2 tex srcs |
| - nvk: Fix indentation in begin_end_query() |
| - nouveau/class_parser: Make strided element functions const |
| - nak/qmd: Drop some unnecessary .try_into().unwrap() |
| - nak/qmd: Add a nak_get_qmd_cbuf_desc_layout() helper |
| - nvk: Handle pre-Turing dispatch indirect commands |
| - nvk: Only support deviceGeneratedCommandsMultiDrawIndirectCount on Turing+ |
| - nvk: Only support compute shader derivatives on Turing+ |
| - nak/nir: Re-materialize load_const instructions in use blocks |
| - nvk/image: Prefer vk_image values over pCreateInfo |
| - nvk/image: Drop some unneeded initializers |
| - nvk: Fix capitalization of statistics |
| - nak: Fix NAK_DEBUG=spill for large FS outputs |
| - nak: Handle any->Mem parallel copies |
| - nak: Add a new ConstTracker struct |
| - nak: Don't spill/fill const values |
| - compiler/rust: Add u_printf_info to the rust bindings |
| - nir: Add a get_io_index_src() helper |
| - nir: Add a nir_opt_tex_skip_helpers optimization |
| - nak: Set .NODEP on tex ops based on nir_opt_tex_skip_helpers() |
| - zink: Don't present to Wayland surfaces asynchronously |
| - zink: Revert "zink: enable single-plane modifiers for generic 2D exports" |
| - egl/dri2: Rework get_wl_surface_proxy() |
| - egl/wayland: Pass the original wl_surface to kopper |
| - util/box: Add an intersect_2d helper |
| - iris: Use pipe_box helpers for damage calculations |
| - zink: Use pipe_box helpers for damage calculations |
| - vulkan: Add device address helpers to vk_buffer |
| - nvk: Use the new buffer device address infrastructure |
| - panvk: Use the new buffer device address infrastructure |
| - hk: Use the new buffer device address infrastructure |
| - vulkan/meta: Use vk_buffer.device_address directly |
| - zink: Set needs_barrier after transitioning to QUEUE_FAMILY_FOREIGN |
| - zink: Check queue families when binding image resources |
| - spirv: OpAsmTargetINTEL is untyped |
| - spirv: Update the JSON and headers |
| - vulkan: Update XML and headers to 1.4.309 |
| - nouveau/winsys: Stop asserting that imported BOs are aligned |
| - nvk: Allow rendering to linear images with unaligned strides |
| - nil: Relax alignment requirements for linear images |
| - nil: Split linear and tiled image creation |
| - nvk,nil: Stop panicking in image creation |
| - vtn: Support cooperative matrices in OpConstantNull |
| - loader/nouveau: Fix the comment in nouveau_zink_predicate() |
| - egl/x11: Re-order an if statement |
| - egl/kopper: Update the EGLSurface size after kopperSwapBuffers() |
| - nak: Insert the annotation in the right spot in assign_regs |
| - nak: Don't insert empty OpParCopy in assign_regs |
| - nak: Always copy sources when handling vec/pack/mov ops |
| - nak: Fix a SM check for OpPCnt |
| - nak: Check num_regs(UGPR) instead of SM version |
| - nak: Turing starts at SM73 |
| - nouveau/headers: Refactor class_parser |
| - nouveau/headers: Drop unused Rust constants |
| - nouveau/headers: Drop double-underscore from Rust names |
| - nouveau/headers: Re-use Rust method types when possible |
| - nvk: Reduce the size of nvk_image_view_capture |
| - nvk: Free owned_gart_mem correctly |
| - nvk: Fix a Volta check |
| - nvk: Disable VK_EXT_post_depth_coverage on Maxwell A and earlier |
| - nvk: Allocate QMDs from a heap on Maxwell A and earlier |
| - nvk: Disable VK_EXT_device_generated_commands on Maxwell A and earlier |
| - nvk: Don't set filterMinmax properties prior to Maxwell B |
| - nvk: Disable sparse buffer binding prior to Maxwell B |
| - nouveau/mme/fermi: Don't allow STATE and EMIT on the same op |
| - nvk: Use the right sample mask for 8x/4pass on Maxwell A |
| - nvk/nvkmd: Add a concept of incomplete pushes |
| - nvk: Mark the push before an indirect push as incomplete |
| - nak: hsetp2 and dsetp are slower on Volta |
| - nvk: Bump the conformance version to 1.4.1.3 |
| - vulkan/wsi: Signal buffer memory object when blitting |
| - venus: Assume wsi_mem->base_bo != NULL |
| - venus: Don't report global priorities if globalPriorityQuery is unsupported |
| - venus: Only claim modifiers in WSI if the host driver supports it |
| - venus: Set wsi_device::supports_scanout = false |
| - compiler/rust: Add a nir_alu_type wrapper |
| - compiler/rust: Add more NIR intrinsic getters |
| - nak: Implement nir_intrinsic_convert_alu_types |
| - nak/nir: Use correct rounding for fp64 -> fp16 conversions |
| - nak,nir: Generalize nak_nir_split_64bit_conversions and move it to NIR |
| - nak: Move some calc_instr_deps items to a new file |
| - nak: Box our RegTrackers |
| - nak: Improve WS abstractions in hw_runner |
| - nak: Add a QMD heap to hw_runner |
| - nak: Disable lea64 and f2fp.pack_ab tests pre-Volta |
| - nvk: Disable vulkanMemoryModel on Kepler and earlier |
| - nvk: Use max_image_dimension for maxFramebufferWidth/Height |
| - nvk: Disable 32k images on Pascal A |
| - nak: Move has_fixed_latency to Op |
| - nak: Add and use a ShaderModel::needs_scoreboard() helper |
| - nak: Add latency helpers to ShaderModel and use them |
| - nak: Move SM70 encoding and legalize to a separate file |
| - nak: Move exec_latency into the per-SM files |
| - nak: Move latency information into the per-SM files |
| - nak: Move sched_common.rs to reg_tracker.rs |
| - nak: Add GPU generation helpers |
| - nak: Use is_volta() instead of sm == 70 |
| - nak: Put the cycle count assert behind a debug flag |
| - nak: Handle delays > 15 |
| - nak: Add an Op::no_scoreboard() helper |
| - intel/compiler: Use nir_split_conversions() |
| - nak: Add a ChannelMask type |
| - nak: Add support for suld/st.b |
| - nak,nir: Add an image_load_raw_nv intrinsic |
| - nak: Use suld.b on Kepler if we have a format |
| - nak: Allow predicates in nir_intrinsic_as_uniform |
| - nak: Add a NAK_DEBUG=panic option |
| - nvk: Call vk_device_finish() last in nvk_DestroyDevice() |
| - nvk/nvkmd: Check the correct flag for the Kepler GART workaround |
| - nil: Multiply by array_stride_B instead of adding |
| - nak/hw_tests: Feed predicate/carry sources with 0/1 data |
| - nak: Add a plop2 test |
| - nak: Add False and True to IntCmpOp |
| - nak: Lower texture inputs for Kepler B |
| - nak/legalize: Add a helper for lowering ineg |
| - nak: Add stubs for Kepler B |
| - nak: Add stubs for Fermi and Kepler A |
| - nak: Move some legalization helpers from sm50 to common code |
| - nak/sm50: Add zero_reg() and true_reg() helpers |
| - nak/sm70: Add zero_reg() and true_reg() helpers |
| - nak: Get rid of RegRef::zero |
| - nvk: Disable VK_EXT_descriptor_buffer pre-Maxwell |
| - nak/qmd: Rework cbuf size suffix handling |
| - nak/qmd: Add support for shifted cbuf addresses |
| - nak/qmd: Add QMD version 4.0 for Hopper |
| - nvk: Handle shifted QMD cbuf addrs in indirect command processing |
| - nak/legalize: Take a RegFile in copy_alu_src_and_lower_fmod |
| - nak/legalize: Take a RegFile in copy_alu_src_and_lower_ineg() |
| - nak/sm70: Fix the bit74_75_ar_mod assert |
| - nvk: Maxwell+ is now conformant |
| - nak: Set lower_pack_64_4x16 |
| |
| Felix DeGrood (9): |
| |
| - vk/overlay-layer: fix regression in non-control pathway |
| - intel/brw: support for dumping shader line numbers |
| - anv: add INTEL_DEBUG=shaders-lineno |
| - iris: add INTEL_DEBUG=shaders-lineno |
| - drm-uapi: add eu_stall uapi |
| - intel/perf: remove unused argument from xe_perf_stream_read_error |
| - intel/perf: add eu stall sampling support |
| - util: add hash functions for u64 data type |
| - intel/tools: create intel_monitor for sampling eu stalls |
| |
| GKraats (2): |
| |
| - x11: give error messages if Xorg only supports DRI2 and mesa only DRI3 |
| - EGL: legacy-x11=dri2 should support hardware driver |
| |
| Ganesh Belgur Ramachandra (1): |
| |
| - amd: use 128B compression for scanout images when drm.minor <63 |
| |
| Georg Lehmann (83): |
| |
| - nir/lower_poly_line_smooth: don't emit control flow |
| - nir/lower_poly_line_smooth: only smooth first color target |
| - nir/lower_poly_line_smooth: support partial store_output |
| - radv: remove radv_should_lower_poly_line_smooth |
| - radv: inline radv_nir_lower_poly_line_smooth |
| - nir/lower_poly_line_smooth: don't reject fp16 |
| - nir/lower_poly_line_smooth: use intrinsics_pass |
| - nir/opt_move: don't move into critical sections |
| - ac/nir/lower_ps: move exports after packing alu |
| - nir/print: print large floats as mantissa + exponent |
| - nir: range analysis for ffract |
| - nir: fix range analysis for frcp |
| - nir: fix frsq range analysis |
| - nir: improve fsqrt range analysis |
| - nir/opt_algebraic: optimize ffract(ffract(a)) |
| - nir/peephole_select: support demote for non CF HW |
| - nir/peephole_select: handle demote and terminate in nir_opt_collapse_if |
| - nir/peephole_select: don't special case nir_opt_collapse_if + limit = ~0 |
| - nir/peephole_select: don't include nir_search_helpers.h |
| - nir/peephole_select: add options struct |
| - nir/peephole_select: add option to allow discard without ~0 limit |
| - nir/peephole_select: don't completely ignore ifs with dont_flatten |
| - nir: replace nir_opt_conditional_discard with nir_opt_peephole_select |
| - radv: remove separate discard peephole select |
| - nir/opt_algebraic: optimize b2f(a != 0) * a |
| - nir/search_helpers: look through vecs in is_only_used_as_float |
| - nir/search_helpers: check tex source type in is_only_used_as_float |
| - nir/builder: add nir_shader_phi_pass |
| - nir/opt_phi_precision: use nir_shader_phi_pass |
| - nir/opt_remove_phis: use nir_shader_phi_pass |
| - aco/insert_exec: fix continue_or_break on gfx6-7 |
| - nir: add a pass to optimize phis to 1bit |
| - nir/opt_algebraic: optimize ineg(a) == ineg(b) |
| - nir/opt_algebraic: optimize ineg(a) == #b |
| - nir/opt_algebraic: 0 >= a -> 0 == a |
| - nir/opt_algebraic: optimize DXBC boolean bcsel |
| - nir/opt_algebraic: optimize more boolean bcsel with constants |
| - nir/opt_algebraic: optimize dxbc boolean not |
| - nir/opt_algebraic: optimize constant shift of DXBC booleans |
| - nir/opt_algebraic: optimize b2i(a) != -b2i(b) |
| - radv: use nir_opt_phi_to_bool |
| - nir/opt_varyings: clean up nir_progress usage |
| - radv/nir_apply_pipeline_layout: clean up progress handling |
| - radv/nir_lower_ray_queries: use nir_foreach_function_impl |
| - nir/opt_algebraic: optimize bit_count(a) != 0 |
| - nir/opt_algebraic: optimize bcsel of b2f and constants |
| - nir/opt_algebraic: optimize b2i/b2f comparison with non 0/1 constants |
| - nir/opt_algebraic: optimize ~a == ~b and ~a == #b |
| - nir/opt_algebraic: push comparisons with constants into bcsel with constant |
| - nir/opt_algebraic: optimize more ine/ieq(umin(b2i, ), 0) |
| - nir/opt_algebraic: optimize d3d a ? b : 0 |
| - aco/optimizer: delete combine_and_subbrev |
| - radv: remove outdated vectorize TODO |
| - ac/nir/mem_access_bit_sizes: split unaligned vec3 lds access to allow more read2/write2 |
| - aco/opt_postRA: split try_optimize_scc_nocompare in two functions |
| - aco/opt_postRA: allow try_optimize_scc_nocompare for all instructions |
| - aco/opt_postRA: remove scc != 0 with multiple uses |
| - aco/opt_postRA: remove scc == 0 for more opcodes |
| - aco/isel: use s_mul_i32 instead of s_cselect_b32 for a ? b : 0 |
| - radv: enable invariant geom for DOOM(2016) |
| - radv: add dcc_decompress_gfx11 in radv_graphics_state_key |
| - ac/nir/lower_ps_late: consider dcc decompression for null exports |
| - radv/gfx10+: remove null exports if discard isn't used |
| - aco: don't assume that v_interp_mov_f32 flushes denorms |
| - aco/gfx11.5: remove vinterp ddx/ddy path |
| - aco/validate: fix scalar source validation for DPP and gfx11+ VINTERP |
| - nir/opt_algebraic: create ubfe from (a & mask) >> c |
| - aco/ra: disallow vcc definitions for pseudo scalar trans instrs |
| - nir: add option to keep mul24_relaxed |
| - aco: implement mul24_relaxed |
| - ac/llvm: support mul24_relaxed |
| - ac/nir: set has_mul24_relaxed |
| - aco/insert_exec: reset exec temporary after combined p_demote + p_end_wqm |
| - spirv: clamp/sign-extend non 32bit ldexp exponents |
| - aco/gfx9+: use d16 global/scratch/buffer loads |
| - spirv: fix cooperative matrix by value function params |
| - aco/gfx10: simpler solution to avoid store instructions in clauses |
| - aco: form mixed MTBUF/MUBUF clauses |
| - nir/opt_algebraic: optimize open coded ffract |
| - nir/opt_algebraic: disable fsat(a + 1.0) opt if a can be NaN |
| - aco: set opsel_hi to 1 for WMMA |
| - aco/insert_exec: only restore wqm mask after control flow if necessary |
| - aco/insert_exec: reset temporary when recreating wqm mask from exact mask |
| |
| Gert Wollny (1): |
| |
| - r600/sfn: gather info and set lowering 64 bit after nir_lower_io |
| |
| Giovanni Mascellani (2): |
| |
| - llvmpipe: Remove an outdated comment about subclassing pipe_screen. |
| - lvp: Remove some dead code. |
| |
| Guilherme Gallo (30): |
| |
| - ci: Properly clean up rustup |
| - ci: Remove cargo symlink workaround |
| - ci/android: add missing pre/post build scripts |
| - ci: setup-test-env: Prefer functions over aliases |
| - ci: add _error_msg for internal messaging |
| - ci: add support for structural tagging |
| - ci: copy structural tag files to rootfs |
| - ci/angle: add structured tag check to ANGLE build time |
| - ci/angle: test-time structured tag checks |
| - ci/angle: condense angle variables in one job |
| - ci/angle: remove USE_ANGLE variable |
| - ci/docs: add structured tagging documentation |
| - ci/lava: Drop the repeating quotes on lava-test-case |
| - ci/lava: Propagate errors in SSH tests |
| - ci/lava: xtrace the lava_job_submitter call |
| - ci/lava: Add U-Boot action timeout for rockchip DUTs |
| - ci/lava: Properly detect VMWARE farm |
| - ci: Specify the FARM variable for DUT jobs |
| - ci: Simplify LAVA farm detection |
| - ci/lava: Remove depthcharge-start timeout |
| - ci/lava: Split boot action into deploy and boot |
| - ci/lava: Tweak timeouts |
| - ci/lava: Don't print empty lines when changing sections |
| - Revert "ci: setup-test-env: Prefer functions over aliases" |
| - ci/bin: update_tag: improve tag load |
| - ci/update_tag: fix linter errors |
| - ci/lava: Fix LAVA lima jobs |
| - ci/lava: Fix LAVA lima jobs |
| - ci/lava: Fetch kernel modules from overlay |
| - ci: Add some unit tests for the duration field |
| |
| Gurchetan Singh (5): |
| |
| - gallium: drop const qualifier on return type |
| - lavapipe: use quotes instead of angle bracket |
| - gfxstream: check device exists before using it |
| - gfxstream: refactor device initialization |
| - gfxstream: follow the semantics desired by distro VK loader |
| |
| Hans-Kristian Arntzen (3): |
| |
| - radv: Always allow sparse on normal GFX/COMPUTE/DMA queues. |
| - radv: Repurpose radv_legacy_sparse_binding drirc |
| - radv: Always set 0 dispatch offset for indirect CS. |
| |
| Hyunjun Ko (12): |
| |
| - anv: Fix to set CDEF filter flag correctly for AV1 decoding |
| - anv/video: clean-up duplicated code. |
| - dri: fix a build error |
| - kopper: implement to get sync values. |
| - anv: fix maxDpbSlots and maxActiveReferencePictures for AV1 decoding. |
| - anv: Add one more flag of VideoCapability for encoding. |
| - anv: Do not support the tiling of DRM modifier if DECODE_DST |
| - anv/ci: remove some expected failures of dEQP-VK.video.formats.* |
| - vulkan/video: Do byte-alignment when building a h264 slice header |
| - anv: Add stdSyntaxFlag values for h264/5 encoders |
| - anv: Move rateControlMode to the video session. |
| - anv: Use vk_video_derive_h265_scaling_list |
| |
| Iago Toral Quiroga (9): |
| |
| - v3dv: implement sync debug option |
| - v3dv: serialize jobs after any barrier when debug sync is set |
| - v3dv: fix missing access bit flag when checking for texel buffer reads |
| - mesa: fix RGBA_SIGNED_COMPONENTS for lowered signed luminance |
| - v3dv: fix crash on 32-bit builds |
| - v3dv: rename v3dv_cmd_buffer_merge_barrier_state |
| - v3dv: make cmd_buffer_serialize_job_if_needed take a barrier state |
| - v3dv: improve handling of trailing barriers |
| - pan/va: fix FAU validation |
| |
| Ian Romanick (38): |
| |
| - iris: Add missing nir_metadata_preserve in iris_lower_storage_image_derefs |
| - crocus: Add missing nir_metadata_preserve in crocus_lower_storage_image_derefs |
| - iris: Use nir_shader_intrinsics_pass in iris_lower_storage_image_derefs |
| - crocus: Use nir_shader_intrinsics_pass in crocus_lower_storage_image_derefs |
| - brw/copy: Fix handling of offset in extract_imm |
| - brw/copy: Use extract_imm in try_constant_propagate_value |
| - brw/copy: Allow constant propagation of some 64-bit integers |
| - nir/algebraic: More (a == 0 || a == 1 || ...) patterns |
| - nir/algebraic: Optimize zero comparisons of umax or umin |
| - nir/algebraic: Simplify equality comparisons of b2T with 1 or 0 |
| - nir/algebraic: Undistribute b2i from logic-ops |
| - brw/print: Don't let SHADER_OPCODE_FLOW affect indentation |
| - brw: Fix typo in comment |
| - brw/nir: Lower fsign again after last call to brw_nir_optimize |
| - brw/opt: Move non-SSA register accounting after first brw_opt_split_virtual_grfs |
| - brw: Add basic infrastructure for load_reg pseudo op |
| - brw/copy: Prepare copy_propagation for load_reg |
| - brw/coalesce: Prepare brw_opt_register_coalesce for load_reg |
| - brw/algebraic: Constant folding for BROADCAST and SHUFFLE |
| - brw: Add passes to generate and lower load_reg |
| - brw/sat: Convert tests to use load_reg |
| - brw/sat: Eliminate non-defs saturate propagation |
| - brw/opt: Don't call brw_opt_copy_propagation before brw_lower_load_reg |
| - brw/nir: Fix source handling of nir_intrinsic_load_barycentric_at_offset |
| - brw/nir: Eliminate default parameter to get_nir_src |
| - brw/algebraic: Optimize derivative of convergent value |
| - brw/copy: Refactor source modifier type checking |
| - brw/copy: Copy prop -X into Y&1 |
| - brw/nir: Optimize b2f(not(X)) using logical operations instead of arithmetic |
| - brw/nir: Allow b2f(not(X)) optimization on Gfx12.5+ |
| - brw/nir: Use offset() for all uses of offs in emit_pixel_interpolater_alu_at_offset |
| - nir/algebraic: Allow fmin(a,a) optimization when flush denorm to zero is not set |
| - brw/algebraic: Clear condition modifier on optimized SEL instruction |
| - brw/algebraic: Don't optimize float SEL.CMOD to MOV |
| - elk/algebraic: Clear condition modifier on optimized SEL instruction |
| - elk/algebraic: Don't optimize float SEL.CMOD to MOV |
| - brw/cmod: Fix some errors when propagating from CMP to ADD.SAT |
| - brw/cmod: Don't propagate from CMP to possible Inf + (-Inf) |
| |
| Ivan A. Melnikov (1): |
| |
| - gallium/radeon: Make sure radeonsi PCI IDs are also included |
| |
| Ivan Avdeev (2): |
| |
| - radv: add experimental support for AMD BC-250 board |
| - radv,radeonsi: disable compute queue for BC250 |
| |
| Iván Briano (2): |
| |
| - anv: handle REMAINING_LAYERS in host image copy cases |
| - brw: make HALT instruction act as barrier in new CSE pass |
| |
| James Hogan (8): |
| |
| - glsl: Expose gl_ViewID_OVR back to GLSL 1.30 |
| - mesa: Fix multiview attachment completeness check |
| - mesa: Fix FramebufferTextureMultiviewOVR num_views check |
| - mesa: Consider NumViews to reuse FBO attachments |
| - mesa: Handle GL_FRAMEBUFFER_INCOMPLETE_VIEW_TARGETS_OVR |
| - mesa: Check views don't exceed GL_MAX_ARRAY_TEXTURE_LAYERS |
| - mesa: OVR_multiview framebuffer attachment parameters |
| - mesa: Handle getting GL_MAX_VIEWS_OVR |
| |
| Jan Alexander Steffens (heftig) (1): |
| |
| - gfxstream: Use proper log format for 32-bit Vulkan |
| |
| Janne Grunau (8): |
| |
| - hk: Replace alloca with malloc in queue_submit |
| - hk: Use rowPitch from VkImageDrmFormatModifierExplicitCreateInfoEXT |
| - venus: Do not use instance pointer before NULL check |
| - venus: virtgpu: Require stable wire format |
| - asahi: build asahi_clc for -Dtools=asahi |
| - asahi: Drop unnecessary idep_mesaclc dependency |
| - panfrost: build panfrost_compile for -Dtools=panfrost |
| - ci: Switch cross-builds to '-D tools=panfrost' |
| |
| Jason Macnak (6): |
| |
| - gfxstream: Move snapshot decoder replay into VkDecoderGlobalState |
| - gfxstream: Remove unused handling mappers |
| - gfxstream: Move the handle replay buffer into BoxedHandleManager |
| - gfxstream: Remove duplicated boxed handle func declarations |
| - gfxstream: Update variable names to avoid -Wshadow error |
| - gfxstream: Remove extra dispatch variable |
| |
| Jeongik Cha (1): |
| |
| - gfxstream: Add AHARDWAREBUFFER_FORMAT_B8G8R8A8_UNORM in android_format_is_yuv |
| |
| Jesse Natalie (2): |
| |
| - meson: Enable /Zc:preprocessor for MSVC |
| - CI/Windows: Update container deps |
| |
| Job Noorman (46): |
| |
| - freedreno/drm-shim: enable raytracing |
| - ir3: fix emitting descriptor prefetches at end of preamble |
| - ir3: add braces around complex if/else block |
| - ir3/ra: handle phis with preferred regs first |
| - ir3/parser: add helper to generate syntax errors based on gen |
| - ir3/isa: fix (dis)asm of ldg.a/stg.a on a6xx |
| - ir3: don't create SRC2 for isam without .v |
| - ir3/legalize: use (sy) for ray_intersection WAR hazards |
| - ir3/lower_tess: make all NIR passes report progress |
| - ir3: don't use deprecated NIR_PASS_V anymore |
| - ir3: reformat after previous commit |
| - ir3/opt_prefetch_descriptors: fix crash after nir_progress rewrite |
| - ir3: add reformatting commits to .git-blame-ignore-revs |
| - nir/lower_phis_to_scalar: remove unused mem_ctx |
| - nir/lower_phis_to_scalar: use nir_builder API where possible |
| - nir/lower_phis_to_scalar: don't create moves for undef sources |
| - nir/lower_subgroups: use build_cluster_mask for quad mask |
| - ir3/ra: prevent reusing parent interval of reloaded sources |
| - ir3: clear instruction uses when cloned |
| - ir3/sched: unblock a0.x/a1.x after last use |
| - ir3: add ir3_cursor_current_block helper |
| - ir3/cse: add support for mov a0.x/a1.x |
| - ir3: remove hash table for a1.x |
| - ir3: add helper to create STC |
| - ir3: fix false dependencies of rpt instructions |
| - ir3/sched: handle dependencies between stc and const reads |
| - ir3: split immediate state from rest of const state |
| - ir3: make const_imm_index_to_reg helper public |
| - ir3: fix max const size calculation for the binning pass |
| - ir3: lower immediates to const regs in preamble on a7xx |
| - ir3: keep inputs at start block when creating empty preamble |
| - ir3/legalize: fix off-by-one error in kill_sched |
| - ir3/legalize: take wrmask into account for delay updates |
| - ir3: don't sync every TCS/GEOM block |
| - ir3: run opt_if after opt_vectorize |
| - ir3: make shpe a terminator |
| - ir3/ra: assign interval offsets to new defs after shared RA |
| - ir3: add ir3_aggressive_coalesce helper |
| - ir3/ra: create merge sets for splits/collects inserted for shared RA |
| - ir3/opt_preamble: take alias.rt into account for rewrite cost |
| - ir3: remove spaces in shader stats |
| - ir3/cp: add option to disable immediate to const lowering |
| - ir3/cp: ignore alias sources for sam.s2en |
| - ir3: run cp after ir3_imm_const_to_preamble |
| - ir3/ra: add helper for getting a dst interval |
| - ir3/ra: ignore phis handled by shared RA |
| |
| John Anthony (2): |
| |
| - panvk: Avoid division by zero for vkCmdCopyQueryPoolResults |
| - panvk: Enable VK_EXT_direct_mode_display |
| |
| Jon Hunter (1): |
| |
| - freedreno/registers: Fix gen_header.py for older python3 versions |
| |
| Jordan Justen (5): |
| |
| - intel/dev: Add BMG PCI IDs (0xe210, 0xe215, 0xe216) |
| - intel/dev: Stop checking hwconfig values at driver runtime |
| - tools/intel_dev_info: Print hwconfig discrepancies |
| - intel/dev: Ignore hwconfig difference due to WA 18040209780 |
| - intel/dev: Add BMG 0xe211 PCI ID |
| |
| Jose Fonseca (1): |
| |
| - glapi: Make _GLAPI_EXPORT a no-op on Windows. |
| |
| Jose Maria Casanova Crespo (3): |
| |
| - v3dv/ci: add new flakes |
| - glapi: import noop_array and public stubs earlier. |
| - v3dv: avoid TFU reading unmapped pages beyond the end of the buffers |
| |
| José Roberto de Souza (20): |
| |
| - intel: Initialize upper 32bits of drm_xe_sync.handle |
| - intel/dev: Improve max_cs_threads documentation |
| - intel/dev: Call intel_device_info_update_after_hwconfig() from common code |
| - intel/common: Retry GEM_CONTEXT_CREATE when PXP has not finished initialization |
| - anv: Remove protected memory support from compute queue |
| - intel: Sync xe_drm.h |
| - anv: Move code adding protected memory type to common code |
| - anv: Add support to create protected bo and protected exec_queue in Xe KMD |
| - iris: Add support to create protected bo and protected exec_queue in Xe KMD |
| - intel: Add function to check if PXP is supported in Xe KMD |
| - iris: Replace BO_ALLOC_* macros by an enum |
| - intel/hwconfig: Sync hwconfig with IGT |
| - intel/hwconfig: Remove ignored intel_hwconfigs from apply_hwconfig_item() |
| - intel/dev/xe3: Set max_slices and max_subslices_per_slice using hwconfig |
| - intel/perf: Update intel_perf to match xe_drm.h |
| - drm-uapi: Sync xe_drm.h |
| - intel: Program XY_FAST_COLOR_BLT::Destination Mocs for gfx12 |
| - intel: Fix the MOCS values in XY_FAST_COLOR_BLT for Xe2+ |
| - intel: Fix the MOCS values in XY_BLOCK_COPY_BLT for Xe2+ |
| - intel/tools: Fix batch buffer decoder |
| |
| Juan A. Suarez Romero (31): |
| |
| - broadcom/compiler: move stores to the end of shader |
| - Revert "st/mesa: move VS & TES output stores to the end before unlowering IO" |
| - broadcom/ci: add new failures/flakes |
| - v3dv: take into account GS when enabling line smooth |
| - v3dv/ci: disable rpi5 job |
| - vulkan: don't leak debug utils label name |
| - v3dv: duplicate key for texel_buffer cache |
| - vc4/ci: update expected results |
| - broadcom/simulator: use string copy instead of memcpy |
| - vc4/ci: update expected results |
| - v3dv/ci: Skip tests causing OOM |
| - Revert "v3dv/ci: disable rpi5 job" |
| - v3d/v3dv/vc4: review all expected timeouts |
| - v3dv: remove src_format from blit render pass creation |
| - v3dv: don't batch regions with different depth offsets |
| - v3dv: include depth offset on image view creation |
| - vc4: check instruction before setting flags |
| - v3dv: asserts struct is always non null |
| - v3dv: check dynamic offset output |
| - v3dv: asserts push constants data is valid |
| - vc4: initialize register |
| - vc4: add assertion on constant_fold |
| - vc4: assert there are sources when emitting texture |
| - broadcom/cle: assert attribute has a value |
| - vc4: use safe iterator to remove instructions |
| - broadcom/compiler: use safe iterator to remove instructions |
| - broadcom/compiler: don't use VLA on emit alu |
| - broadcom/compiler: initialize register |
| - v3dv: don't check if DRM device is master |
| - v3d(v)/ci: update expected results |
| - ci: include duration in the CustomLogger |
| |
| Julia Zhang (5): |
| |
| - vulkan: handle device memory report requests |
| - radv: add import and export handle_type in radv_alloc_memory |
| - radv: add obj_id to radeon_winsys_bo |
| - radv: emit device memory report for device memory events |
| - radv: advertise VK_EXT_device_memory_report |
| |
| Jung-uk Kim (1): |
| |
| - FreeBSD: Disable support for "-mtls-dialect" for FreeBSD |
| |
| Juston Li (3): |
| |
| - anv: xe: fully initialize drm_xe_sync addr/handle union |
| - iris: xe: fully initialize drm_xe_sync addr/handle union |
| - wsi/common: android: disable KHR_present_[wait/id] |
| |
| K900 (1): |
| |
| - meson: support building with system libgbm |
| |
| Karmjit Mahil (3): |
| |
| - loader/wayland: Fix missing timespec.h include |
| - tu: Fix Perfetto build error with vk_buffer |
| - tu: Fix segfault in fail_submit KGSL path |
| |
| Karol Herbst (49): |
| |
| - ci/windows: Bump Vulkan SDK for SPIRV-Tools |
| - clc: use SetUseHighestVersion when linking spirvs |
| - mesa_clc: drop spirv version workaround |
| - rusticl/mem: set bind flags for gl imports |
| - rusticl/mesa: add PipeContext::device_reset_status |
| - rusticl/queue: check device error status |
| - clc: bump SPIR-V target to 1.6 |
| - rusticl/kernel: call nir_lower_variable_initializers earlier |
| - rusticl: support SPIR-V 1.5 and 1.6 |
| - rusticl/mem: do not apply offset with in copy_image_to_buffer |
| - rusticl/mesa: add buffer and texture variant for resource_copy_region |
| - rusticl/mem: Buffer::copy_to_image layering |
| - rusticl/mem: Image::copy_to_buffer layering |
| - rusticl/mem: Image::copy_to_image layering |
| - rusticl/mem: Image::write layering |
| - rusticl/mem: accelerate Buffer::copy_rect |
| - rusticl/mem: accelerate Buffer::write_rect |
| - rusticl/mem: set num_samples and num_mip_levels to 0 when importing from GL |
| - rusticl/platform: advertise all extensions supported by all devices |
| - rusticl/util: add missing comment and assert to char_arr_to_cstr |
| - intel/brw, lp: enable lower_pack_64_4x16 |
| - nir: Do not eliminate dead writes to shared memory in called functions. |
| - rusticl/program: implement CL_INVALID_PROGRAM_EXECUTABLE check in clGetProgramInfo |
| - rusticl/program: pass options by reference |
| - rusticl/program: loop over all devices inside Program::build |
| - rusticl/program: rework build_nirs so it only touches devices we care about |
| - rusticl/program: fix building kernels |
| - rusticl/program: simplify active_kernels check |
| - rusticl/kernel: rename CSOWrapper to SharedCSOWrapper |
| - rusticl/queue: make it unnecessary to keep QueueContext Send
| - rusticl/queue: cache bound CSO |
| - rusticl/mesa: remove Sync from PipeContext |
| - nir/serialize: fix decoding of is_return and is_uniform |
| - vtn: Support the UniformDecoration capability. |
| - zink: don't apply the map_offset when mapping a staging resource in zink_buffer_map |
| - iris: remove all clover support code |
| - freedreno: remove all clover support code |
| - llvmpipe: remove all clover support code |
| - gallium: stop filling ir_target in various drivers |
| - gallium: stop using PIPE_BIND_COMPUTE_RESOURCE in drivers |
| - gallium: stop implementing set_compute_resources in various drivers |
| - nouveau: ignore req_input_mem |
| - rusticl/device: fix panic when disabling 3D image write support |
| - nir_lower_mem_access_bit_sizes: fix negative chunk offsets |
| - nak: fix handling of delays > 15 |
| - r600: fix r600_buffer_from_user_memory for rusticl |
| - iris: parse global bindings for every gen |
| - iris/xe: fix compute shader start address |
| - iris/xe: take the grids variable_shared_mem into account |
| |
| Kenneth Graunke (37): |
| |
| - brw: Drop unused defines |
| - brw: Eliminate fs_inst::shadow_compare |
| - brw: Replace fs_inst::pi_noperspective with a logical control source |
| - brw: Drop FB_WRITE_LOGICAL_SRC_DST_DEPTH source |
| - brw: Replace fs_inst::last_rt with a logical control source |
| - brw: Replace fs_inst::target field with logical FB read/write sources |
| - brw: Use correct builder size for MEMORY_FENCE/INTERLOCK virtual opcodes |
| - brw: Change destination of memory fences to UD type |
| - brw: Eliminate the BTI source from MEMORY_FENCE/INTERLOCK opcodes |
| - brw: Add latencies for HDC/RC memory fences |
| - brw: Lower MEMORY_FENCE and INTERLOCK in lower_logical_sends |
| - brw: Drop INTERPOLATE_AT mlen handling from size_read() |
| - brw: Drop unnecessary mlen/header_size on virtual GET_BUFFER_SIZE op |
| - nir: Eliminate dead writes to shared memory at the end of the program |
| - brw: Rename shared function enums for clarity |
| - isl: Delete redundant "use separate stencil?" check |
| - isl: Drop compile time "use separate stencil" checks. |
| - intel: Delete devinfo->must_use_separate_stencil |
| - intel: Delete devinfo->has_surface_tile_offset |
| - intel: Move devinfo->has_negative_rhw_bug into the elk compiler |
| - intel: Move devinfo->has_compr4 into the elk compiler |
| - intel/dev: Set minimum HS URB entries to 0. |
| - intel/dev: Set max_wm_threads to 0 in the Gfx9+ devinfo structs |
| - intel/dev: Rework device info macros for Gfx8+ |
| - intel/dev: Set a higher minimum number of URB entries for GS |
| - intel: Use devinfo->urb.min_entries[GS and TCS] for setting URB configs |
| - intel: Move unlit centroid workaround into the elk compiler |
| - intel/decoder: Decode compute shaders in EXECUTE_INDIRECT_DISPATCH |
| - brw: Make a helper to emit UNDEF for temporaries containing small types |
| - brw: Emit UNDEF as needed in SSA-style builder helpers |
| - brw: Skip unnecessary UNDEFs for comparisons |
| - brw: Use a smaller type for masked sub-32-bit shift values |
| - brw: Avoid regioning restrictions for u2u16/i2i16 narrowing conversions |
| - brw: Track the largest VGRF size in liveness analysis |
| - brw: Use live->max_vgrf_size in register coalescing |
| - brw: Use live->max_vgrf_size in pre-RA scheduling |
| - brw: Don't assert about MAX_VGRF_SIZE in brw_opt_split_virtual_grfs() |
| |
| Kenny Levinsen (1): |
| |
| - device-select: Support linux-dmabuf feedback |
| |
| Kevin Chuang (2): |
| |
| - anv/bvh: Fix encoder handling sparse buffer |
| - anv/bvh: Fix copy shader handling sparse buffer |
| |
| Konstantin (1): |
| |
| - nir/tests: Do not rely on __LINE__ |
| |
| Konstantin Seurer (68): |
| |
| - nir: Stop using instructions for debug info |
| - spirv: Handle NonSemantic.Shader.DebugInfo.100 |
| - nir: Add variable debug info to instructions |
| - nir/lower_vars_to_ssa: Annotate defs with variable names |
| - vulkan: Stop using strings for BVH build pipeline keys |
| - vulkan/meta: Remove object types from vk_meta_object_key_type |
| - vulkan/meta: Stop using strings for meta keys |
| - hk: Stop using strings or common key types for meta keys |
| - radv/meta: Stop using strings for meta keys |
| - lavapipe: Fix maintenance7 descriptor set limits
| - vulkan/cmd_queue: Simplify freeing cmd_queue entries |
| - vulkan/cmd_queue: Add VK_CMD_TYPE_COUNT |
| - vulkan/radix_sort: Stop force-unrolling loops |
| - gallivm: Remove loop limiting |
| - lavapipe: Implement some functions required by the common BVH framework |
| - lavapipe: Use the common BVH framework |
| - radv: Optimize fs builtins using static gfx state |
| - gallivm: Split nir prepasses into aos/soa |
| - gallivm/nir/aos: Remove the dependency on lp_bld_nir.c |
| - gallivm/nir/soa: Remove the dependency on lp_bld_nir.c |
| - gallivm/nir/soa: Lower bools to i1 |
| - gallivm/nir/soa: Implement robust uniform loads without controlflow
| - gallivm/nir/soa: Select more IO to gather/scatter intrinsics |
| - lavapipe: Move nir passes to a new directory |
| - lavapipe: Lower descriptor sets in NIR |
| - lavapipe: Initialize the compiler options of the noop fs |
| - llvmpipe: Do not use coroutines when they are unnecessary |
| - nir: Rename in-bounds-agx to in-bounds |
| - nir: Do not emit amul if it is unsupported |
| - lavapipe: Optimize buffer robustness |
| - gallivm/nir/soa: Do not lower vectors to llvm arrays |
| - nir/divergence_analysis: Handle load_const_buf_base_addr_lvp |
| - gallivm/nir/soa: Use divergence analysis |
| - lavapipe: Lower push constants in NIR |
| - gallivm: Only guard tex/image ops if the exec mask can be zero |
| - gallivm/nir/soa: Skip bounds checking for in-bounds access |
| - gallivm/nir/soa: Properly skip empty else branches |
| - lavapipe: Remove uniform inlining |
| - vulkan: Add utilities for triggering renderdoc captures |
| - radv: Lower ray query vars to structs |
| - radv: Implement multidimensional ray query arrays |
| - llvmpipe: Skip draw_mesh if the ms did not write gl_Position |
| - nir: Test nir_minimize_call_live_states |
| - nir/sweep: Fix handling instructions with debug info |
| - nir/print: Do not print debug information when gathering it |
| - gallivm: Create a debug builder and add GALLIVM_DEBUG=symbols |
| - llvmpipe: Annotate functions with debug information |
| - gallivm: Handle nir_instr_debug_info |
| - gallivm: Emit debug info for definitions |
| - gallivm: Add a debug variable for the exec mask |
| - gallivm: Run nir_lower_load_const_to_scalar |
| - lavapipe: Enable debug information if GALLIVM_DEBUG=symbols is set |
| - radv/meta: Change the return type of get_r32g32b32_format to VkFormat |
| - ci: Do not build hk on alpine |
| - asahi: Only require IOKit for tooling |
| - hk: Fix building without the gallium driver |
| - spirv: Emit code for NonSemantic.DebugPrintf if supported |
| - nir: Turn the format string index into a const index |
| - radv: Handle nir_intrinsic_printf |
| - clc: Print errors when initializing clang fails |
| - clc: Allow bitfields |
| - clc,libcl: Clean up CL includes |
| - radv: Fix rayTracingPositionFetch with multiple geometries
| - lavapipe: Prefetch 56 bytes of node data during ray traversal |
| - lavapipe: Run nir optimizations on ray tracing pipelines |
| - lavapipe: pre-load tmax |
| - lavapipe: Do not emit aabb handling if no isec shader is used |
| - radv: Return VK_ERROR_INCOMPATIBLE_DRIVER for unsupported devices |
| |
| Lakshman Chandu Kondreddy (1): |
| |
| - freedreno: Add support for Adreno623 GPU |
| |
| Lars-Ivar Hesselberg Simonsen (9): |
| |
| - panfrost: Do not evaluate_per_sample for non-MSAA |
| - Revert "panfrost: remove is_blit flag" |
| - Revert "panfrost: fix hang by using MALI_PIXEL_KILL_WEAK_EARLY in color preload" |
| - panvk: Set missing shader_modifies_coverage flag |
| - panfrost: Use RUN_COMPUTE over RUN_COMPUTE_INDIRECT |
| - panvk: Use RUN_COMPUTE over RUN_COMPUTE_INDIRECT |
| - vulkan/wsi/wayland: Avoid duplicate colorspace entry |
| - panvk: Add barrier for interleaved ZS copy cmds |
| - vk/sync: Fix execution only barriers |
| |
| Leder, Brendan Steve (1): |
| |
| - amd/vpelib: Reformat index variables and update enum |
| |
| Leonard Göhrs (2): |
| |
| - etnaviv/ci: add pengutronix LAVA lab with one i.MX8MP device |
| - ci: re-enable pengutronix farm |
| |
| LingMan (2): |
| |
| - meson: Update pest subproject family |
| - meson: Sync subproject version numbers in \`packagefiles` with their \`.wrap` equivalents |
| |
| Lionel Landwerlin (137): |
| |
| - anv: fixup missing compiler dependency on tests |
| - intel: move internal shader compile to vtn_bindgen2 |
| - compiler: drop vtn_bindgen |
| - intel_clc: remove NIR output support |
| - spirv: remove spirv_library_to_nir_builder |
| - brw: fixup scoreboarding for find_live_channels |
| - anv: reuse helper for compute push constants |
| - anv: increase general state pool |
| - anv: track the first 2MB of unused VA |
| - anv: use heap size to program generate state heap |
| - anv: add a helper for getting gfx push constant addresses |
| - brw/anv: rework push constants for mesh/task shaders |
| - brw: enable A64 pulling of push constants |
| - anv: use A64 messages for push constants loads on Gfx12.5+ |
| - nir: add a high precision conversion unorm->float |
| - anv: add source hashes for BVH building shaders |
| - vulkan/wsi: propagate protected swapchain to images |
| - vulkan: allow support for protected surfaces |
| - anv: support protected surfaces with display platform |
| - anv,driconf: Add sampler coordinate precision workaround for Dynasty Warriors |
| - genxml: make component packing an array |
| - genxml: add convenience dwords for packing components |
| - brw: fix indentation |
| - brw: remove nr_attribute_slots from vs_prog_data |
| - brw: port vs input to lower_64bit_to_32_new |
| - brw: update vulkan max attribute limit |
| - brw: add a max HW vertices attribute limit |
| - brw: enable vertex fetching component packing |
| - blorp: emit 3DSTATE_VF |
| - anv: disable VF statistics for memcpy |
| - anv: enable vertex fetching component packing |
| - anv: ensure Wa_16012775297 interacts correctly with Wa_18020335297 |
| - brw: use meaningful io locations for system values |
| - brw: add support for no VF input slot compaction |
| - brw: add documentation about slot compaction & component packing |
| - brw: fix component packing starting index |
| - anv: move RT stage bits to main header |
| - anv: move reg_mask push constant field to gfx |
| - anv: hold a prepacked COMPUTE_WALKER instruction on CS pipelines |
| - anv: make gfx state flushing available externally |
| - anv: make compute state flush helper visible |
| - runtime: sort push constant layouts |
| - anv: avoid memory type changes with INTEL_DEBUG=noccs |
| - anv/ci/adl: update fail expectation for video |
| - anv: fixup compute walker storage length |
| - nir: add options to lower only some image atomics to global |
| - brw: factor out base prog_data setting |
| - brw: store source_hash in prog_data |
| - anv: switch to use brw's prog_data source_hash |
| - isl: report tiling address swizzles |
| - isl: add usage for software detiling |
| - isl: centralize supported tilings in a single function |
| - isl: select a tiling for shader detiling |
| - isl: add support for R64 storage image lowering |
| - isl: add a helper to report what dimensions a tiling supports |
| - nir: track lowered image intrinsics to globals |
| - brw: include UGM fence when TGM + lowered image->global |
| - brw: add support for 64bit storage images load/store |
| - brw: add support for texel address lowering |
| - anv: rename compressed format emulation helpers |
| - anv: add mapping for VBO formats in format mapping |
| - anv: add a is_sparse for image format support checks |
| - u_trace: pass tracepoint flags to the read_timestamp callback |
| - intel/ds: rework RT tracepoints |
| - anv: fix missing 3DSTATE_PS:Kernel0MaximumPolysperThread programming |
| - brw: optimize load payload with immediate headers |
| - brw: avoid setting up the sampler header bits when unused |
| - spirv: fix racy build |
| - vulkan/runtime: ensure robustness state is fully initialized |
| - vulkan/runtime: pass robustness state to preprocess vfunc |
| - vulkan/runtime: add a multialloc vk_shader allocator |
| - vulkan/runtime: store flags on descriptor set layouts |
| - anv/iris: centralize TBIMR drirc |
| - iris: remove duplicate TessellationDistributionMode programming |
| - anv/iris: add drirc keys to disable VF/TE distribution |
| - anv/apply_layout: split binding table building into its own function |
| - intel/genxml: add a genX RT include header |
| - genxml: simplify genX_rt_pack.h |
| - brw: make intel_shader_enums.h opencl importable |
| - anv/brw: move INTEL_MSAA_* flag computation to the compiler |
| - anv: break down Wa_16014912113 in need/apply parts |
| - anv: fixup indentation around Wa_16014912113 |
| - brw: avoid calling lower_indirect_derefs multiple times |
| - anv: fill runtime buffer device_address field |
| - anv: move index buffer entry point out of genX code |
| - anv: move vertex buffer storage to 64bit address + mocs |
| - anv: move xfb buffer storage to 64bit address + mocs |
| - anv: track protection on anv_address |
| - anv: use addresses for buffer<->image copies |
| - anv: simplify internal blorp helper |
| - anv: fix non page aligned descriptor bindings on <Gfx12.0 |
| - lavapipe: fill buffer address |
| - vulkan/runtime: rely on vk_buffer::device_address |
| - brw: fix spilling for Xe2+ |
| - brw: ensure VUE header writes in HS/DS/GS stages |
| - anv: Set limit_trig_input_range option for Company of Heroes 3 |
| - anv: avoid early lower of the fp64 code |
| - blorp: assert that shaders don't spill |
| - blorp: relax depth/stencil<->color copy restriction |
| - vulkan: add helper for color/depth-stencil capable formats |
| - anv: relax depth/stencil<->color copy restrictions |
| - anv: fix end of pipe timestamp query writes |
| - anv: disable replication when we don't have both VS/FS stages |
| - brw: always write the VUE header |
| - anv: limit implicit write with drirc
| - nir: add support for lowering non uniform texture offsets |
| - elk: stop using intel_nir_lower_texture |
| - brw: don't lower tg4 offsets without LOD |
| - intel: move lower_texture to brw |
| - brw: move texture offset packing to NIR |
| - anv: enable non uniform texture offset lowering |
| - anv: wire VkAccessFlagBits3KHR flags in internal helpers |
| - anv: expose VK_KHR_maintenance8 support |
| - intel/genxml: remove ISA fields |
| - intel/genxml: add more engine tagging on instructions |
| - intel/genxml: fixup engine filtering |
| - intel/genxml: define post-sync operations for MI_FLUSH_DW |
| - intel/genxml: add MI_FLUSH_DW to blitter engine |
| - anv/genxml: use special genX video pack files |
| - anv/hasvk: consider timeline semaphore support stable |
| - docs: remove unused env variable |
| - anv/hasvk: sort out debug options |
| - anv: consolidate environment variables |
| - anv: add shader-hash debug option |
| - brw: fix shuffle with scalar/uniform index |
| - anv: relax restriction on variable count descriptors |
| - anv: fix self dependency computation |
| - brw: fix Wa_22013689345 emission |
| - iris: update Wa_1607156449 to use WA infrastructure |
| - anv: update Wa_1607156449 to use WA infrastructure |
| - intel/dev: remove ADLN references |
| - anv: remove ALWAYS_INLINE from globally visible functions |
| - anv/iris: implement Wa_18040903259 |
| - anv: use companion batch for operations with HIZ/STC_CCS destination |
| - anv: force fragment shader execution when occlusion queries are active |
| - intel: fix null render target setup logic |
| - brw: add pre ray trace intrinsic moves |
| |
| Lorenzo Rossi (8): |
| |
| - nvk: fix preprocess buffer alignment |
| - nvk: Fix MSAA sparse residency lowering crash |
| - nir: support shared atomics in nir_lower_atomics |
| - nvk, nak: Implement shaderSharedInt64Atomics |
| - nak: Fold bool-int-bool conversions |
| - nak: Flatten AttrAccess into instructions |
| - nak: Fix SM50 rounding-mode encoding edge-case |
| - nak: Simplify shl64 lowering on Maxwell |
| |
| Loïc Minier (1): |
| |
| - freedreno: check if GPU supported in fd_pipe_new2 |
| |
| Loïc Molinari (5): |
| |
| - perfetto: Let MESA_TRACE_FUNC() take printf-like format arguments |
| - panfrost: Add CPU traces |
| - pan/kmod: Add drmIoctl() wrapper pan_kmod_ioctl() with CPU trace |
| - docs: Add Panfrost to the list of drivers with CPU traces |
| - mesa: Add CPU traces |
| |
| Lucas Fryzek (4): |
| |
| - anv: Release correct bo for RT scratch |
| - vulkan/runtime: Add object type to DMR API |
| - anv: Implement VK_EXT_device_memory_report |
| - anv: Expose VK_EXT_device_memory_report |
| |
| Lucas Stach (19): |
| |
| - etnaviv: drm: fix instruction limit for cores with instruction cache |
| - etnaviv: drop double masking in etna_emit_load_state |
| - etnaviv: split large multi-state updates into multiple batches |
| - etnaviv: emit PS start and end PC states only on shader changes |
| - etnaviv: correct and rename shader range register check |
| - etnaviv: replace open-coded shifts in shader range registers with macros |
| - etnaviv: place FS right behind VS in unified instruction memory |
| - etnaviv: allow larger shaders with unified instruction memory |
| - kmsro: look for graphics capable screen as renderonly device |
| - etnaviv: rs: fix slow/fast clear transitions |
| - etnaviv/ci: remove dEQP-GLES2.functional.polygon_offset.default_enable fail |
| - etnaviv: fix ETNA_MESA_DEBUG=no_early_z |
| - etnaviv: Update headers from rnndb |
| - etnaviv: enable forwarding ZW fragcoord components from RA to SH |
| - etnaviv/ci: remove fragcoord related fails on GC7000 |
| - etnaviv: use pipe_resource in etna_resource_status |
| - etnaviv: split PIPE_BUFFER resources from other types of resources |
| - include: update GL headers from the registry |
| - etnaviv: add context flush sw query |
| |
| Ludvig Lindau (1): |
| |
| - panvk: Get flush_id once per submit |
| |
| Maaz Mombasawala (3): |
| |
| - svga: Add all tgsi double instructions for shader codegen checks |
| - nir: Add option to preserve double immediates in tgsi shader. |
| - svga: Check float type when emitting atomic instructions |
| |
| MaciejDziuban (3): |
| |
| - vulkan: handle use_default_scaling_matrix_mask in h264 decoder |
| - vulkan: Add default scaling lists for H265 |
| - radv: Use vk_video_derive_h265_scaling_list |
| |
| Marek Olšák (128): |
| |
| - amd: drop support for LLVM 15, 16, 17 |
| - ci/debian-ppc64el: don't build AMD drivers due to having only LLVM 15 |
| - gallium,st/mesa: allow reporting compile failures from create_vs/fs/.._state |
| - nine: change the vtxbuf dirty mask to bool |
| - nine: always update vertex buffers after updating vertex elements |
| - nine: remove unused last_vtxbuf_count |
| - nine: bind exactly the number of vertex buffers as vertex elements need |
| - cso_context: add cso_get_vertex_elements_for_bind, letting the caller bind it |
| - gallium/u_threaded,st/mesa: add a merged set_vertex_elements_and_buffers call |
| - radeonsi: add assertion requiring binding vertex elements before vertex_buffers |
| - radeonsi: require that count in set_vertex_buffers matches vertex elements state |
| - radeonsi: don't set num_vertex_buffers and don't unbind in set_vertex_buffers |
| - radeonsi: simplify bind_vertex_elements due to being before set_vertex_buffers |
| - radeonsi: print why draws are rejected |
| - util: remove glthread enablement from app profiles |
| - mesa: don't build st_format_test on Windows |
| - glapi: remove dead code |
| - glapi: inline functions or use equivalent alternatives |
| - glapi: don't export always-private functions |
| - mesa: remove a glapi workaround for Mesa 10.5 and older |
| - glapi: fix build dependencies by putting all xml/py files into a single list |
| - glapi: remove support for dynamically-registered functions |
| - glapi: use an assertion in SET_by_offset instead of doing nothing |
| - glapi: clarify the meaning of static_data.functions |
| - glapi: remove check_table tests |
| - glapi: generate static offsets from the list of GL functions automatically |
| - glapi: remove unused functions from dispatch tables |
| - glapi: move legacy imaging functions to the end of dispatch tables |
| - glapi: verify that aliased functions don't have entries in dispatch tables |
| - glapi: just use _gloffset_COUNT_ everywhere, which is always the function count |
| - glthread: handle glFlush with HasExternallySharedImages more efficiently |
| - glthread: rename scripts to match the names of generated files |
| - glapi: disable python escape sequences in strings that use invalid ones |
| - glapi: use static_data.libgl_public_functions directly |
| - glapi: remove is_static_entry_point wrapper |
| - glapi: inline entry_current_get() |
| - glx: don't call GL functions directly, use the current dispatch instead |
| - glx: fix hardcoded use of dispatch table index in glAreTexturesResidentEXT |
| - glx: stop exporting GL functions from libGLX_mesa.so |
| - glx: make it more obvious what libglapi_bridge is |
| - glapi: remove unused python code |
| - mesa: allocate GLmatrix aligned to 16 bytes |
| - Remove osmesa |
| - gallium/u_blitter: remove UTIL_BLITTER_ATTRIB_COLOR, use a constant buffer |
| - gallium/u_blitter: change blitter_attrib from union to struct |
| - meson: never use static glapi because shared glapi is also static and better |
| - glx/xlib: switch glapi from static to shared (which is also static) |
| - meson: remove with_shared_glapi variable (it's always true) |
| - glapi: remove static glapi (it's dead code now) |
| - glapi: remove duplicated generated header glapitable.h |
| - glx: fix build with -Dlegacy-x11=dri2 |
| - egl: use the current dispatch to execute glFlush instead of get_proc_address |
| - glapi: remove extension definitions that will likely never be implemented |
| - ac/gpu_info: use max_good_cu_per_sa for computation of max_scratch_waves |
| - ac,radeonsi: don't set num_slots/src/dest_type/write_mask when they're set automatically |
| - Revert "ac/nir: clamp vertex color outputs in the right place" |
| - ac/nir/cull: extract a helper calling accept_func |
| - ac/nir/ngg: add an option to skip viewport-based culling |
| - ac/cmdbuf: split meta_*_policy to dcc and htile variables |
| - ac/cmdbuf: rework CB/DB cache controls for better perf |
| - winsys/amdgpu: don't use 32-bit address space for IBs |
| - radeonsi: use si_is_buffer_idle everywhere |
| - radeonsi: move buffer reallocation to a separate function |
| - radeonsi: enable NGG culling when the shader writes the viewport index |
| - radeonsi: enable Z/S caching in GL2 by default except FurMark |
| - radeonsi: use the restrict keyword for draw parameters |
| - radeonsi: lower IO only if io_lowered == false |
| - radeonsi: reflect blitter VS in si_context::num_vertex_elements |
| - radeonsi: don't cull front/back faces in the hw if the shader culls them |
| - ac,radeonsi: define all SDMA DCC fields & use them, enable compressed writes |
| - radeonsi: remove clover support |
| - glx: don't generate indirect GLX dispatch for ARB_imaging functions |
| - glapi: make a separate copy of entry.c for each lib |
| - glapi: replace mapi_func type with identical _glapi_proc |
| - glapi: simplify codegen macros |
| - glapi: merge all shared-glapi source files into one .c file |
| - glapi: remove unused _mesa_glapi_Dispatch |
| - glx: add a test to verify exported symbols |
| - glapi: get the list of public GL functions from libgl-symbols.txt |
| - glx: remove the hack that forced exporting GL functions from libGL |
| - glapi: remove the GLAPI_PREFIX macro, just use gl as the function prefix |
| - glapi: simplify headers of generated files |
| - glapi: simplify mapi_abi.py |
| - glapi: rename dispatch stubs to use function names instead of numbers |
| - glapi: remove noop_generic callbacks |
| - glapi: remove the option to set the nop handler |
| - mesa: move dispatch.h and marshal_generated.h generation to glapi/gen |
| - mesa: inline main/meson.build |
| - glx: stop exporting EXT_texture_object functions from libGLX_mesa.so |
| - glapi: fix x86 32-bit asm dispatch regression |
| - amd: update addrlib |
| - amd/addrlib: remove the DCC page fault workaround |
| - ac/nir/cull: cull small prims using a point-triangle intersection test |
| - glsl: return failure from gl_nir_validate_first_and_last_interface_explicit_locations |
| - glsl: return failure from varying_matches_assign_locations |
| - glsl: return failure from remove_unused_io_vars |
| - glsl: stop calling nir_opt_combine_stores (redundant with nir_opt_vectorize_io) |
| - glsl: remove a deprecated comment about nir_compact_varyings |
| - glsl: always return true at the end of link_varyings |
| - glsl: move code after link_varyings into link_varyings |
| - st/mesa: don't do nir_remove_dead_variables on in/out twice |
| - nir/opt_algebraic: lower 16-bit imul_high & umul_high |
| - ac: define physical VGPRs for fake hw overrides |
| - ac/nir/cull: rename skip_viewport_culling -> skip_viewport_state_culling |
| - ac/nir/cull: always do frustum culling, skip only small prim culling |
| - ac/surface: remove 64K_2D modifier with 64B max compressed blocks for gfx12 |
| - radeonsi: work around a primitive restart bug on gfx10-10.3 |
| - radeonsi: always use ACO callbacks to scalarize/vectorize 16-bit ALU |
| - radeonsi: expose 16-bit NIR types for ALU, MEM, and LDS (no inputs/outputs) |
| - radeonsi/ci: update gfx11 failures |
| - radeonsi/ci: don't run GTF tests (they have been removed from glcts) |
| - radeonsi: lower load/store bit sizes before load/store vectorization |
| - radeonsi: add a VOP3P swizzle requirement for 16-bit packed math |
| - radeonsi/ci: add gfx12 failures and flakes |
| - radeonsi: make si_shader_selector::main_shader_part_* an iterable union |
| - radeonsi: add ACO-specific main shader parts |
| - winsys/amdgpu: don't add VM_ALWAYS_VALID buffers into the BO list |
| - radeonsi: determine VM_ALWAYS_VALID accurately |
| - ac/gpu_info: increase the attribute ring size for gfx12 |
| - ac/gpu_info: remove has_tmz_support function |
| - ac/gpu_info: add payload_entry_size into ac_task_info |
| - ac/gpu_info: add 256 to payload_entry_size to increase future task shader perf |
| - ac/nir: set X=0 for task->mesh shader dispatch when Y or Z is 0 |
| - ac/surface: make gfx12_estimate_size reusable by gfx6 |
| - ac/surface: select 3D tile mode without overallocating too much for gfx6-8 |
| - radeonsi: always scalarize shared memory instructions |
| - radeonsi: use nir_opt_shrink_vectors |
| - radv: fix incorrect patch_outputs_read for TCS with dynamic state |
| |
| Mark Collins (4): |
| |
| - tu/kgsl: Remove zero CB queue submission fast path |
| - tu/kgsl: Revert "Remove zero CB queue submission fast path" |
| - tu/kgsl: Fix KGSL syncobj lifetime in no CB submit |
| - ir3/a7xx: Add post-RA pass to track liveness and insert (last) |
| |
| Martin Krastev (7): |
| |
| - svga/ci: disable vmware farm |
| - svga/ci: enable vmware farm |
| - svga/ci: enable vmware farm |
| - svga/ci: enable vmware farm |
| - svga/ci: update EXTERNAL_KERNEL_TAG to new kernel |
| - svga/ci: disable vmware farm |
| - docs/svga: Add steps how to get VMware Workstation Pro on Linux |
| |
| Martin Roukala (né Peres) (42): |
| |
| - ci: fix the artifact name |
| - ci: be explicit about the fact HW jobs do not need linter artifacts |
| - ci/test/b2c: do not download the debian/\*_test-(gl|vk) artifacts |
| - ci-templates: update to the latest upstream version |
| - ci/build-deqp: delegate the deqp main commit check to github |
| - ci/test: add arm32 b2c jobs |
| - ci/debian/test-vk: remove software-properties-common |
| - ci/debian/test-vk: trim down the container on a per-arch basis |
| - ci/image-tags: bump the debian base tag |
| - ci/b2c: add support for diskless DUTs |
| - ci/b2c: de-duplicate the download of install.tar |
| - etnaviv/ci: convert from baremetal to CI-Tron |
| - ci: re-enable austriancoder's farm |
| - zink/ci: use the debian-built-testing for nvk |
| - zink/ci: update the radv expectations |
| - radv/ci: bump tahiti's cpu cores |
| - radv/ci: update expectations |
| - freedreno/ci: update expectations |
| - ci/b2c: fix the S3 artifact for amd64 manual vk/gl |
| - radv/ci: document more Tahiti VKCTS flakes |
| - turnip/ci: re-introduce the \`multiviewport` flakes |
| - zink/ci: mark query-rgba-signed-components as fixed on more platforms |
| - zink/ci: document more RADV flakes |
| - radv/ci: add testing on RAPHAEL |
| - radv/ci: reduce Renoir concurrency in vkcts |
| - radv/ci: set a tight timeout on vkcts-navi31 |
| - radv/ci: document more flakes |
| - zink/ci: document more radv flakes |
| - zink/ci: document more GA106 flakes |
| - zink/ci: increase the a750 job's timeout to 18 minutes |
| - radv/ci: mark a whole subset of tests as flaky on tahiti |
| - zink/ci: document more RADV flakes |
| - zink/ci: document more NVK GA106 flakes |
| - ci/b2c: allow jobs to opt-in into the new mars setup command |
| - radv/ci: opt-in the new mars setup command |
| - etnaviv/ci: opt-in the new mars setup command |
| - nvk/ci: opt-in the new mars setup command |
| - freedreno/ci: opt-in the new mars setup command |
| - i915g/ci: opt-in the new mars setup command |
| - r300g/ci: opt-in the new mars setup command |
| - radv/ci: reduce the timeout of vkcts-tahiti to a more sensible time |
| - radv/ci: add hawaii to CI |
| |
| Mary Guillemard (95): |
| |
| - panfrost: Fix group priorities in drm-shim |
| - panfrost: Fix PROGRESS_LOAD destination register |
| - pan/bi: Properly encode LEA_BUF_IMM |
| - pan/bi: Remove shift lanes invalid encodings |
| - pan/bi: Fix invalid CLPER encoding |
| - pan/bi: Remove b3210 from valid swizzle |
| - pan/bi: Use 2D dimension with TEX_FETCH with CUBE on Valhall |
| - pan/decode: Fix indirect branch calculation for 64-bit |
| - panfrost: Properly name CSF instruction UMIN32 source 0 |
| - panvk: Disallow unknown GPU models early in physical device init |
| - pan/genxml: Stop using "constant" for struct fields in xml defs |
| - pan/genxml: Rework gen_pack.py to support OpenCL |
| - panfrost: Add base of OpenCL C infrastructure |
| - bi: Optimize scratch access |
| - panvk: Integrate libpan |
| - panfrost: Integrate libpan in gallium driver |
| - panvk: Add create_shader_from_binary |
| - panvk: Move TLS preparation logic to cmd_dispatch_prepare_tls |
| - panvk: Expose calculate_task_axis_and_increment on CSF |
| - panvk: Implement precomp dispatch |
| - panfrost: Implement precomp dispatch on Gallium |
| - panfrost,panvk: Wire printf and abort support |
| - ci: Transition to precomp-compiler |
| - ci: Enable mesa-clc and precomp-compiler on debian-arm32 |
| - pan/bi: Fix scratch access optimization |
| - panvk: Switch JM copy queries to CLC |
| - panvk: Switch JM clear queries to CLC |
| - pan/bi: Document removed instructions on v11 |
| - pan/bi: Do not run bi_fuse_small_int_to_f32 on v11+ |
| - pan/bi: Disallow dst swizzle optimization in case of conversion |
| - pan/bi: Add round mode modifier to FADD |
| - pan/bi: Stop using V2F32_TO_V2F16 on Valhall |
| - pan/bi: Lower SWZ.v4i8 to multiple MKVEC.v2i8 on v11+ |
| - pan/bi: Stop using S16_TO_F32 in nir_texop_lod computation on v11+ |
| - pan/bi: Stop using V2F16_TO_V2S16 for barycentric_at_offset on v11+ |
| - pan/bi: Lower removed instructions in algebraic on v11+ |
| - pan/bi: Lower FROUND.v2s16 in nir_lower_bit_size on v11+ |
| - pan/bi: Lower hadd on v11+ |
| - pan/bi: Handle LD_VAR_BUF_IMM encoding changes on v11+ |
| - pan/bi: Properly extract primitive facing on v11+ |
| - pan/genxml: Define RUN_IDVS staging registers in an enum |
| - panfrost: Remove write to TSD_3 in Gallium driver |
| - pan/genxml: Define RUN_COMPUTE staging registers in an enum |
| - pan/genxml: Define RUN_FRAGMENT staging registers in an enum |
| - panfrost: Rework cs_sr_regXX to be a macro |
| - panfrost: Switch Gallium driver to use cs_sr_regXX |
| - panfrost: Use CSIF info for CSF registers count |
| - panfrost: Avoid hard crash when major arch is unknown |
| - panfrost: Rename CSF MOVE into MOVE48 |
| - panfrost: Fix FLUSH_CACHE2 other definition |
| - panfrost: Rename CS ADD_IMMEDIATEXX to ADD_IMMXX |
| - pan/genxml: Use DCD Flags in Draw struct on v9+ |
| - pan/bi: Disallow FAU special page 3 and WARP_ID on message instructions |
| - pan/bi: Add unit tests for FAU special page 3 and WARP_ID |
| - pan/bi: Run nir_lower_bit_size after algebraic |
| - pan/bi: Run bifrost_nir_lower_algebraic_late while there is progress |
| - pan/bi: Lower FREXPE.v2f16 and FREXPM.v2f16 on v11+ |
| - pan/bi: Fix out of range access in bi_instr_replicates |
| - pan/bi: Ensure we select b0 with halfswizzle in va_lower_constants |
| - nir: Add Panfrost specific shader_output intrinsic |
| - pan/bi: Use shader_output intrinsic for IDVS |
| - pan/bi: Add support for IDVS2 on Avalon |
| - pan/bi: Lower IADD.v4s8 in algebraic on v11+ |
| - pan/bi: Allow no_psiz variant with IDVS2 |
| - pan/genxml: Rename UMIN32 opcode to COMPARE_SELECT32 |
| - pan/genxml: Add v12 definition |
| - pan/genxml: Add v13 definition |
| - pan/genxml: Build libpanfrost_decode for v12 |
| - pan/genxml: Build libpanfrost_decode for v13 |
| - pan/lib: Build for v12 |
| - pan/lib: Build for v13 |
| - pan/clc: Build for v12 |
| - pan/clc: Build for v13 |
| - panvk: Support 64x64 meta tile size for v12 in cmd_preload_render_area_border |
| - pan/lib: Adapt CRC calculation to align to 64x64 on v12+ |
| - panfrost: Disable hierarchy levels based on multiple of tile_size |
| - panvk: Use spd variant instead of pos_points when checking for vs shader presence |
| - pan/earlyzs: Default to FORCE_EARLY for ZS update on v11+ |
| - panvk: Implement Z/S dependency tracking on v11+ |
| - panvk: Add v12 support |
| - panvk: Add v13 support |
| - panfrost: Add v12 support to the Gallium driver |
| - panfrost: Add v13 support to the Gallium driver |
| - panfrost/ci: Add Mali-G720 current expectations |
| - panfrost/ci: Add Mali-G725 current expectations |
| - panfrost: Advertize Mali-G720 support |
| - panfrost: Advertize Mali-G725 support |
| - panfrost: Update the release note to mention G720/G725 addition |
| - panvk: Fix inverted ZS read flags in DCD2 on v11+ |
| - panfrost: Take tiler memory budget into account in pan_select_tiler_hierarchy_mask |
| - panvk: reset dyn_bufs map count to 0 in create_copy_table |
| - panvk: Take rasterization sample into account in indirect draw on v10+ |
| - panvk: Take resource index in valhall_lower_get_ssbo_size |
| - pan/bi: Properly lower add/sub with saturation on v11+ |
| - pan/genxml: Fix inverted logic on nr_regs |
| |
| Matt Turner (22): |
| |
| - gallium: Fix typos |
| - glsl: Fix typos |
| - glx: Fix typos |
| - intel: Fix typos |
| - mapi: Fix typos |
| - vulkan: Fix typos |
| - gallium: Return NULL, not false, from functions returning pointers |
| - nir: Return NULL, not false, from functions returning pointers |
| - iris: Initialize pointer with NULL, not false |
| - intel/compiler: Use FALLTHROUGH |
| - glsl: Use FALLTHROUGH |
| - glsl: Add missing break |
| - intel/compiler: Use unreachable instead of assert(!"...") |
| - intel/isl: Use unreachable instead of assert(!"...") |
| - anv: Use unreachable instead of assert(!"...") |
| - hasvk: Use unreachable instead of assert(!"...") |
| - intel/decoder: Remove assert(!"...") in recoverable condition |
| - intel/compiler: Add missing breaks |
| - intel/compiler: Use correct enum type |
| - iris: Remove ignored qualifier |
| - intel/isl: Remove ignored qualifier |
| - anv: Remove ignored qualifier |
| |
| Maíra Canal (4): |
| |
| - v3dv: VK_EXT_acquire_drm_display doesn't require a DRM master fd |
| - v3dv: don't overwrite the primary fd if it's already set |
| - broadcom/simulator: Expose V3D revision number in the simulator interface |
| - broadcom/simulator: Fix Indirect CSD jobs for V3D 7.1.6+ |
| |
| Mel Henning (31): |
| |
| - nvk: Use hw support for instancing on PASCAL_B+ |
| - nak/opt_copy_prop: Fix IAdd3 overflow check |
| - nak/opt_copy_prop: Add force_alu_src_type |
| - nak/opt_copy_prop: Force alu src for IAdd2X/IAdd3X |
| - driconf: force_vk_vendor on Deep Rock Galactic+NVK |
| - nak: Add OpLea |
| - nak,nir: Add 32-bit nir_op_lea_nv and use it |
| - nak: Add OpLeaX |
| - nak,nir: Add 64-bit lea_nv |
| - nouveau/mme: Print ip in mme_tu104_dump |
| - nak: Add static cycle count statistic |
| - nak: Add an occupancy statistic |
| - nak/spill_values: Make Spill take &mut self |
| - nak: Add spill/fill statistics |
| - nvk: Don't zero imported memory |
| - nvk: Add NVK_DEBUG=trash_memory |
| - vulkan: Relax bda assert for zero-size buffers |
| - nak: Add a simple postpass instruction scheduler |
| - nak: Calc static cycle count in instr_sched |
| - nak: Assert instr_sched matches calc_instr_deps |
| - nak/instr_sched: Barriers activate after 2 cycles |
| - nvk: SET_PS_{REGISTER,WARP}_WATERMARKS |
| - nvk: Support blackwell in max_warps_per_mp_for_sm |
| - nak/spill_values: Spill constants across edges if needed |
| - nak: Handle idp4 ureg latencies |
| - nvk: SET_STATISTICS_COUNTER at start of meta_begin |
| - nvk: Override render enable for blits and resolves |
| - wsi/headless: Override finish_create |
| - nak: Remove hfma2 src 1 modifiers |
| - nak: Add Src::is_unmodified() helper |
| - nak: Check that swizzles are none |
| |
| Mi, Yanfeng (2): |
| |
| - anv: Support putting image base address and image params in surface state |
| - anv: add emulated 64bit integer storage support |
| |
| Michael Cheng (5): |
| |
| - anv: Fix missing Perfetto trace for as build |
| - Revert "anv: Fix missing Perfetto trace for as build" |
| - vulkan: Pass in number of tlas/blas being built
| - anv: Change as_build to show num tlas/blas |
| - vulkan: add leaves and ir_leaves sizes to encode for utrace. |
| |
| Michel Dänzer (1): |
| |
| - egl/glx/sw: Check xcb_query_extension_reply return value for MIT-SHM |
| |
| Mike Blumenkrantz (97): |
| |
| - zink: emit SpvCapabilityDemoteToHelperInvocation for IsHelperInvocation |
| - zink: implement ops for KHR_shader_subgroup |
| - aux/trace: trace tex2d_from_buf for samplers/images |
| - gallium: add a pipe_tex2d_from_buf struct |
| - rusticl: stop clearing shader images after every dispatch |
| - zink: also refcount needs_present from frontbuffer flush |
| - zink: delete some old DGC remnants |
| - zink: guard rebar check against fallback heap detection |
| - ac/surface: always allow LINEAR modifier for color formats |
| - radv: fix error reporting for VkExternalMemoryTypeFlagBitsKHR |
| - radv: stop blocking non-2D import/export ops |
| - radv: print stringname for VkExternalMemoryHandleTypeFlagBits error |
| - driconf: move a glthread viewperf option to global |
| - driconf: add GL_EXT_shader_image_load_store for viewperf to fix crashing |
| - zink: only enable unsynchronized_texture_subdata with HIC |
| - zink: never try to oom flush during unsync texture upload |
| - zink: apply layer/depth to clear handling |
| - zink: wait on tc fence before checking for fd semaphore |
| - zink: handle buffer import/export |
| - zink: verify that adding a dmabuf bind actually chooses a modifier |
| - zink: support cl_gl_sharing if dmabuf is supported |
| - egl/x11: delete some dri2 remnants |
| - zink: force cached mem for streaming uploads |
| - zink: always fully unwrap contexts |
| - zink: use a separate ralloc ctx for zink_program objects |
| - zink: put zink_program::reference on separate cacheline |
| - dri: delete INVALIDATE extension |
| - zink: enable single-plane modifiers for generic 2D exports |
| - zink: clamp UBO sizes instead of asserting |
| - anv: support all dimensions of image for LINEAR dmabufs |
| - aux/trace: truncate descriptor unbinds |
| - zink: always check submit_count to disambiguate when checking/waiting |
| - zink: don't reset all batch states when stalling |
| - zink: alloc bo ids for non-slab bos too |
| - zink: explicitly check usage in buffer barriers |
| - svga/ci: disable vmware farm |
| - llvmpipe: pass layer count to rast clear |
| - gallium: fix pipe_framebuffer_state::view_mask |
| - mesa: add error handling for OVR_multiview |
| - mesa: avoid creating incomplete surfaces when multiview goes out of range |
| - zink: improve precision on changes to depth bias between draws |
| - zink: delete zink_batch_state::ref_lock |
| - zink: split set_vertex_buffers to avoid conditionals |
| - zink: start using ctx->vertex_state_changed again |
| - zink: use VKCTX for vertex buffer binds and delete unused screen local |
| - zink: only add fb surf refs on unbind |
| - radv: rewrite radv_get_line_mode() conditional |
| - radv: store num_attributes to shader info |
| - radv: store vertex prolog simple input check to cmdbuf on vs bind |
| - radv: roll line topology dynamic state changes into existing rast samples flag |
| - radv: eliminate a memset in radv_get_vbo_info() |
| - radv: inline some vertex descriptor functions |
| - radv: split out dynamic vertex input descriptor writing |
| - radv: move non_trivial_format calc to dynamic VI bind |
| - radv: get vbo info directly into dgc upload |
| - radv: don't unnecessarily flag prolog recalc when binding VBOs |
| - zink: disable reordering on compute contexts |
| - zink: extract some shader image resource binding code |
| - zink: extract some samplerview bind/unbind code |
| - zink: support crazy CL buffer-to-texture extension |
| - st/drawpixels: move sv unref out to callers |
| - gallium: eliminate frontend refcounting from samplerviews |
| - gallium: make pipe_sampler_view::reference non-atomic |
| - mesa: remove st_sampler_view::private_refcount |
| - aux/trace: set sampler_view_release pointer |
| - gallium: delete tests |
| - util/tests: move u_half_test into half_float_test |
| - gallium: delete u_cache |
| - meson: add i915 to 'all' build for gallium-drivers |
| - egl/x11: always expose EXT_swap_buffers_with_damage |
| - egl: delete invalidate_available flag |
| - egl: move _EGLDisplay DriverData association into dri2_display_create |
| - egl: hoist dri2 display creation up a level |
| - egl/x11: split out dri2 init entirely |
| - egl: move kopper detection to display creation |
| - egl/x11: hoist up dri2_get_xcb_connection call |
| - egl/x11: simplify a kopper check on init |
| - egl/x11: hoist up swrast/zink driver_name setting |
| - egl/x11: hoist and document dri3_x11_connect() during init |
| - egl: delete dri2_egl_driver_fail |
| - egl/x11: consolidate LIBGL_DRI3_DISABLE use on init |
| - egl/x11: unify swrast/kopper/dri3 paths a bit |
| - gallium: delete pipe_surface::width and pipe_surface::height |
| - zink: fix refcounting of zink_surface objects |
| - zink: block streaming cached uploads on unsynchronized/persistent maps |
| - zink: implement unsynchronized staging uploads for buffers |
| - zink: block inferred loading for swrast |
| - gallium/util: check nr_samples in pipe_surface_equal() |
| - tu: check for valid descriptor set when binding descriptors |
| - meson: deprecate gallium-nine |
| - meson: deprecate gallium-xa |
| - zink: don't set shared block stride without KHR_workgroup_memory_explicit_layout |
| - zink: stop setting ArrayStride on image arrays |
| - zink: use implicit stride in ntv for temp vars |
| - ci: update VVL to current week |
| - zink: verify that surface exists when adding implicit feedback loop |
| - egl: fix sw fallback rejection in non-sw EGL_PLATFORM=device |
| |
| Mohamed Ahmed (7): |
| |
| - nvk: Add NVK_MAX_IMAGE_PLANES for images |
| - nvk: Add NVK_MAX_SAMPLER_PLANES for samplers |
| - nil: Use multiplanar init_info during image creation |
| - nil: Force smallest block size for images meant for Vulkan Video |
| - nvk: Set NIL_IMAGE_USAGE_VIDEO_BIT for images meant for video usage |
| - nil: Add an API to clamp max image alignment |
| - nvk: Implement VK_MESA_image_alignment_control |
| |
| Mykhailo Skorokhodov (1): |
| |
| - drirc/anv: force_vk_vendor=-1 for Drive Beyond Horizons |
| |
| Natalie Vock (30): |
| |
| - vulkan/bvh: Move leaf builder code to header |
| - vulkan/bvh: Add option to override leaf builder SPIR-Vs |
| - vulkan/bvh: Add optional fine-grained instance node bounds calculation |
| - radv/bvh, vulkan/bvh: Move AccelerationStructureInstance to vk_build_helpers |
| - radv/bvh: Remove unused build_instance helper |
| - radv/bvh: Prefix RADV-specific node functions with radv\_ |
| - radv/bvh: Add custom leaf node builder |
| - mailmap: Update my name |
| - radv/rt: Don't allocate the traversal shader in a capture/replay range |
| - aco/ra: Use iterators for linear VGPR copy extraction |
| - aco/ra: Use struct for parallelcopies |
| - aco/ra: Add option to skip renaming for parallelcopies |
| - aco/ra: Handle temps fixed to different regs in different operands |
| - aco/tests: Add tests for precolored operands in different regs |
| - aco/ra: Assert operands only clear their own id |
| - radv/rt: Limit monolithic pipelines to 50 stages |
| - radv/rt: Guard leaf encoding by leaf node count |
| - radv/rt: Flush L2 after writing internal node offset on GFX12 |
| - ac/llvm: Don't use getTriple() on LLVM21+ |
| - aco: Add get_temp_reg_changes helper |
| - aco/spill: Invert reloads map |
| - aco: Add RegisterDemand(Temp) constructor |
| - aco/spill: Allow spilling live-through operands |
| - radv/rt: Flush CP writes from the common BVH framework with INV_L2 on GFX12 |
| - vulkan/bvh: Move first PLOC task_count fetch inside PHASE |
| - aco: Fix RT VGPR limit on Navi31/32, GFX11.5, GFX12 |
| - aco: Make private_segment_buffer/scratch_offset per-resume |
| - radv: Add radv_emulate_rt drirc and enable for Indiana Jones TGC |
| - radv: Add radv_enable_float16_gfx8 drirc and enable for Indiana Jones TGC |
| - radv/ci: Test FP16 for GFX8 |
| |
| Nikita Popov (2): |
| |
| - clover: Don't include libclc headers |
| - clover: Fix MSVC build |
| |
| Nikolas Zimmermann (1): |
| |
| - etnaviv: drm: Add DRM_RDWR permissions to drmPrimeHandleToFD() call. |
| |
| Olivia Lee (1): |
| |
| - panfrost: allow promoting sysval UBO to push constants |
| |
| Patrick Lerda (27): |
| |
| - r600: fix r600_init_shader_caps() has_atomics issue |
| - r600: fix evergreen_emit_vertex_buffers() related cl regression |
| - r600: fix cayman sfn_nir_legalize_image_load_store ssa dominance |
| - r600: prepare the lds constant buffer to be shared |
| - r600: implement a conformant gl_VertexID with base offset |
| - r600: fix the indirect draw 8-bits path |
| - r600: fix emit_image_size() range base compatibility |
| - r600: update evergreen_convert_border_color() |
| - r600: update cayman_convert_border_color() |
| - r600: fix cayman main non-deterministic behavior problem |
| - r600: update the software fp64 support |
| - radeonsi: fix clear_depth_stencil refcnt imbalance |
| - r600: move stores to the end of shader when required |
| - r600: fix clear_depth_stencil refcnt imbalance |
| - r600: fix textures with swizzles limited to zero and one |
| - r600: implement EXT_window_rectangles |
| - r600: remove deprecated NIR_PASS_V |
| - r600: fallback to util_blitter_draw_rectangle when required |
| - r600: fix pa_su_vtx_cntl rounding mode |
| - r600: fix points clipping |
| - r600: enable ARB_compute_variable_group_size |
| - r600: clean up not used fields detected by clang |
| - i915: fix i915_set_vertex_buffers() related refcnt imbalance and remove redundancies |
| - i915: fix slab_create() related memory leaks |
| - i915: fix nir_to_tgsi() related memory leak |
| - i915: fix draw_create_fragment_shader() related memory leak |
| - mesa_interface: fix legacy dri2 compatibility |
| |
| Patrick Nicolas (1): |
| |
| - radv/video: Add low latency encoding |
| |
| Paul Gofman (1): |
| |
| - radv/amdgpu: Fix hash key in radv_amdgpu_winsys_destroy(). |
| |
| Paulo Zanoni (10): |
| |
| - brw: increase brw_reg::subnr size to 6 bits |
| - brw: don't always set cond_modifier on parsed assembly instructions |
| - brw: don't mark instructions read from text assembly as compacted |
| - brw: add instructions missing from is_control_flow() |
| - brw: extend the NOP+WHILE workaround |
| - drirc/anv: DiggingGame.exe needs force_vk_vendor=-1 |
| - intel/i915: restrict the RAM size restrictions to Anv |
| - anv/xe: detect the newer xe.ko memory reporting model and act accordingly |
| - anv: restore the old behavior of up to 75% of RAM for the system heap |
| - anv: add ANV_SYS_MEM_LIMIT for debugging system memory restrictions |
| |
| Pavel Ondračka (22): |
| |
| - i915/ci: use debian-build-testing instead of debian-testing |
| - r300: remove some dead code in radeon_program_alu
| - r300: do not limit maximum TEX group for R300/R400 |
| - r300: copy propagate constant swizzles |
| - ci: fix debian-build-testing BUILDTYPE |
| - ci: disable LTO for nightly debian-build-testing |
| - i915/ci: update expectations |
| - i915: rework shader compile failures reporting |
| - mesa: properly signal report_compile_error to drivers |
| - nine: set pipe_shader_state.report_compile_errors |
| - r300: stop reporting compile failures in finalize_nir |
| - r300: forward all compile failures to state tracker |
| - r300: do not include newline in the error messages |
| - r300,i915: update CI expectations |
| - r300: fix temps counting for shader-db stats |
| - r300/ci: add dEQP job with RADEON_DEBUG=notcl |
| - r300: remove usage of NIR_PASS_V |
| - r300: fix INV and BIAS presubtract on R300/R400 |
| - r300: remove finalize_nir |
| - r300: remove support for tgsi_texcoord |
| - r300: fix crash when creating surfaces |
| - r300/ci: update expectations |
| |
| Peyton Lee (6): |
| |
| - radeonsi/vpe: check reduction ratio |
| - radeonsi/vpe: support geometric scaling |
| - amd/gmlib: add gmlib for radeonsi |
| - radeonsi/vpe: vpe support tonemapping |
| - radeonsi/vpe: vpe support hdr input |
| - radeonsi/vpe: fix background issue |
| |
| Philipp Zabel (3): |
| |
| - etnaviv/ml: Drop duplicated function reorder_for_hw_depthwise() |
| - etnaviv/ml: Fix padding input/output tensor zero points |
| - teflon: Log (un)supported operations |
| |
| Pierre-Eric Pelloux-Prayer (16): |
| |
| - radeonsi: update si_need_gfx_cs_space upper bound |
| - winsys/amdgpu: treat cs overflow as context lost |
| - radeonsi: disable dcc when external shader stores are used |
| - tc: add missing TC_SENTINEL for TC_END_BATCH |
| - mesa/st: call _mesa_glthread_finish before _mesa_make_current |
| - tc: flag closed batches |
| - st/mesa: fix nir_load_per_vertex_input parameter |
| - radeonsi: cache u_trace_perfetto_active value |
| - radeonsi: tag perfetto conditions as unlikely |
| - radeonsi: add pid/tid to the vk_queue_submit event |
| - radeonsi: guard perfetto support with ifdef in si_draw |
| - ac/nir: fix nir_metadata value of ac_nir_lower_image_opcodes |
| - radeonsi/video: require has_image_opcodes for full modifier support |
| - radeonsi: use composed swizzle in cdna_emu_make_image_descriptor |
| - winsys/amdgpu: disable VM_ALWAYS_VALID |
| - radeonsi: fix potential use after free in si_set_debug_callback |
| |
| Pohsiang (John) Hsu (7): |
| |
| - d3d12: fix incorrect IsRefUsedByCurrentPic marking for P Frame |
| - d3d12: log all the field of dpb buffer for better diagnostic, cosmetic space removal |
| - d3d12: initialize ReconstructedPicTexture |
| - d3d12: fix start code prevention in write_sei_nalu() |
| - d3d12: fix d3d12_video_nalu_writer_h264::write_slice_svc_prefix |
| - d3d12: use log2_max_pic_order_cnt_lsb_minus from upper layer |
| - d3d12: use log2_max_pic_order_cnt_lsb_minus4 from upper layer for h264 |
| |
| Qiang Yu (78): |
| |
| - lavapipe: fix min_vertex_pipeline_param |
| - gallium: fix ddebug and noop screen caps init |
| - gallium,clover: add compute caps used by clover only |
| - r600: init compute caps without ir_type param |
| - radeonsi: init compute caps without ir_type param |
| - gallium,mesa: remove ir_type param when get_compute_param |
| - gallium: add pipe_shader_caps and pipe_compute_caps |
| - gallium: copy shader and compute caps for ddebug/noop/trace |
| - tgsi: add tgsi_exec_init_shader_caps |
| - gallivm: add gallivm_init_shader_caps |
| - draw: add draw_init_shader_caps |
| - agx: init shader and compute caps |
| - crocus: init shader and compute caps |
| - d3d12: init shader and compute caps |
| - etnaviv: init shader caps |
| - freedreno: init shader and compute caps |
| - i915: init shader caps |
| - iris: init shader and compute caps |
| - lima: init shader caps |
| - llvmpipe: init shader and compute caps |
| - nouveau/nv30: init shader caps |
| - nouveau/nv50: init shader and compute caps |
| - nouveau/nvc0: init shader and compute caps |
| - panfrost: init shader and compute caps |
| - r300: init shader caps |
| - r600: init shader and compute caps |
| - radeonsi: init shader and compute caps |
| - softpipe: init shader and compute caps |
| - svga: init shader and compute caps |
| - tegra: init shader and compute caps |
| - v3d: init shader and compute caps |
| - vc4: init shader caps |
| - virgl: init shader and compute caps |
| - zink: init shader and compute caps |
| - nine: GET_SHADER_CAP use pipe_shader_caps |
| - lavapipe: change min shader param to use pipe_shader_caps |
| - gallium,mesa: replace get_shader_param with pipe_shader_caps access |
| - clover: replace get_compute_param with pipe_compute_caps |
| - gallium,mesa: replace get_compute_param with pipe_compute_caps |
| - rusticl: replace get_shader_param with pipe_shader_caps |
| - rusticl: replace get_compute_param with pipe_compute_caps |
| - asahi: remove shader and compute get param |
| - crocus: remove shader and compute get param |
| - d3d12: remove shader and compute get param |
| - etnaviv: remove shader get param |
| - freedreno: remove shader and compute get param |
| - i915: remove shader get param |
| - iris: remove shader and compute get param |
| - lima: remove shader get param |
| - llvmpipe: remove shader and compute get param |
| - nouveau/nv30: remove shader get param |
| - nouveau/nv50: remove shader and compute get param |
| - nouveau/nvc0: remove shader and compute get param |
| - panfrost: remove shader and compute get param |
| - r300: remove shader get param |
| - r600: remove shader and compute get param |
| - radeonsi: remove shader and compute get param |
| - softpipe: remove shader and compute get param |
| - svga: remove shader and compute get param |
| - tegra: remove shader and compute get param |
| - v3d: remove shader and compute get param |
| - vc4: remove shader get param |
| - virgl: remove shader and compute get param |
| - zink: remove shader and compute get param |
| - gallium/aux: remove aux shader param get function |
| - gallium: remove screen shader and compute get param callback |
| - doc,src: replace doc and comments for shader and compute cap |
| - ac/surface: fix radv import dmabuf from radeonsi |
| - gallium/ddebug: add missing modifier callback |
| - radeonsi: fix has_non_uniform_tex_access info |
| - radeonsi,util: add more usage for AMD_FORCE_SHADER_USE_ACO |
| - radeonsi: fix GravityMark corruption when use aco |
| - egl: decouple dmabuf import/export cap from xserver support |
| - dri: dmabuf cap does not rely on winsys multibuffer support |
| - glx/egl/x11: fix x11_dri3_check_multibuffer |
| - kopper: remove unused function definition |
| - dri,egl,glx: remove redundant usage of HAVE_X11_DRM |
| - llvmpipe/ci: change for oes_egl_image_external_essl3 |
| |
| Rebecca Mckeever (27): |
| |
| - pan/format: Use HW version to determine siting for YUV 422 formats |
| - pan/texture: Only use plane_chroma_2p for chroma planes |
| - vk/meta: Extend copy/fill/update helpers to support YCbCr |
| - util/hash_table: Add _mesa_hash_table_u64_replace() |
| - panvk: Allow a 32-bit binding value in desc id key and use 64-bit keys |
| - panvk: Move single-plane views of multiplane formats to pview.planes[0] |
| - panvk: Change immutable_samplers to panvk_sampler ** |
| - panvk: Create helper function for sampler descriptor emission |
| - panvk: Update panvk_get_desc_stride prototype |
| - panvk: Move mali_texture_packed structs in panvk_image_view to a union |
| - panvk: Use multiple sampler planes and one texture descriptor per plane |
| - panvk: Fix assertion in is_disjoint() |
| - panvk: Add YCbCr sampler NIR lowering pass |
| - panvk: Split get_format_properties into format features helper functions |
| - panvk: Report formats not supported by HW as unsupported |
| - panvk: Enable YCbCr support for v10+ |
| - vk/image: Add vk_image_can_be_aliased_to_yuv_plane() helper |
| - panvk: Use vk_image_can_be_aliased_to_yuv_plane() helper |
| - panvk: Allow 3-byte formats |
| - panfrost: Add BI_OPCODE_WMASK to bi_instr_uses_helpers |
| - panvk: Enable VK_KHR_shader_subgroup_uniform_control_flow for v10+ |
| - panvk: Enable VK_KHR_shader_maximal_reconvergence for v10+ |
| - panvk: Enable VK_KHR_shader_quad_control for v10+ |
| - panvk: Add STORAGE_IMAGE_BIT feature for formats supporting sampled images |
| - panvk: Enable shaderStorageImageExtendedFormats |
| - panvk: Remove lower_tg4_broadcom_swizzle from panvk_preprocess_nir() |
| - panvk: Support shaderImageGatherExtended |
| |
| Renato Pereyra (1): |
| |
| - perfetto/android: align datasource names with tooling expectations |
| |
| Rhys Perry (41): |
| |
| - nir,aco,radv: add align_mul/offset to buffer_amd intrinsics |
| - nir/opt_offsets: don't check NUW for unswizzled buffer_amd |
| - ac/nir/ngg: update bit_sizes_int |
| - nir/load_store_vectorize: handle load_buffer_amd/store_buffer_amd |
| - ac/nir: set memory_modes for lowered TES input loads |
| - radv: don't use bit_sizes_int to skip nir_lower_bit_size |
| - radv: move nir_opt_shrink_vectors later |
| - radv: DCE before nir_opt_shrink_vectors |
| - nir/load_store_vectorize: fix sorting of vectors in add_to_entry_key |
| - nir/algebraic: optimize ishl(iadd(iadd(a, #b), c), #d) |
| - radv: vectorize descriptor loads |
| - radv: vectorize lowered shader IO |
| - nir/use_dominance: invalidate nir_metadata_instr_index |
| - nir/opt_move: invalidate nir_metadata_instr_index |
| - nir/lower_io_arrays_to_elements: invalidate metadata |
| - nir/find_array_copies: invalidate nir_metadata_instr_index |
| - nir/linking_helpers: invalidate metadata in nir_link_opt_varyings |
| - radv/rt: correctly preserve metadata in move_rt_instructions |
| - nir: add NIR_DEBUG=extended_validation |
| - nir: add NIR_DEBUG=invalidate_metadata |
| - nir/validate: assert that if condition dominates use |
| - aco: validate operands fixed to definitions |
| - aco/ra: reverse renaming of operands outside update_renames |
| - aco/ra: unconditionally call undo_renames |
| - radeonsi: fix invalidation of metadata in si_nir_lower_abi |
| - ac/nir: don't cross swizzle elements when vectorizing buffer_amd intrinsic |
| - ac/nir: set higher alignment for some swizzled store_buffer_amd |
| - ac/nir: fix tess factor optimization when workgroup barriers are reduced |
| - aco: insert dependency waits in certain situations |
| - radv: improve radv_get_max_waves for multi-wave workgroups with LDS |
| - radv: don't assume WGP mode in radv_get_max_waves |
| - amd/drm-shim: add gfx1201 |
| - aco/ra: fix free register counting when moving variables |
| - radv/winsys: increase gfx12 vgprs for null winsys |
| - radv/winsys: set gart_page_size for null winsys |
| - radv/winsys: set has_distributed_tess for null winsys |
| - aco/gfx12: don't use second VALU for VOPD's OPX if there is a WaR |
| - aco: combine VALU lanemask hazard into VALUMaskWriteHazard |
| - aco/gfx11: create waitcnt for workgroup vmem barriers |
| - aco: fix get_temp_reg_changes with clobbered operands |
| - aco: swap the correct v_mov_b32 if there are two of them |
| |
| Rob Clark (17): |
| |
| - freedreno+tu: Add new virtgpu caps |
| - tu: Suballoc VkEvent BOs |
| - tu: Don't emit SP_PS_2D_WINDOW_OFFSET on a6xx |
| - tc: Add missing tc_set_driver_thread() |
| - freedreno: Wait for imported syncobj fences to be available |
| - ci: Re-enable fd-farm |
| - freedreno/ci: Disable traces job until piglit is fixed |
| - ci: Re-enable fd-farm |
| - ir3: Comment re-indent |
| - tu/vdrm: Fix userspace fence cmds |
| - tu: Avoid extraneous set_iova |
| - tu: Add some func traces |
| - util: Add drmSyncobj shim |
| - panvk: Remove dependency on vk_device::drm_fd |
| - vulkan: Use syncobj shim |
| - virtio/vdrm: Add vtest backend |
| - tu: vdrm vtest support |
| |
| Robert Mader (4): |
| |
| - llvmpipe: Take offset into account when importing dmabufs |
| - llvmpipe: Free dummy_dmabuf on shutdown |
| - gallivm: Re-add check for passmgr before disposing it |
| - meson: Bump minimum version to 1.3.0 |
| |
| Rohan Garg (7): |
| |
| - anv: refactor add_aux_surface_if_supported to use a common variable |
| - anv,blorp,isl: handle compressed CPS surfaces through the depth stencil hw |
| - anv: CPB surfaces that are used as color attachments or for stores cannot be compressed |
| - anv: no need to handle transitions for CPB surfaces |
| - anv: separate fast clear handling for compressed CPS |
| - isl: enable CPB compression |
| - anv: re-enable compression for CPS surfaces on platforms other than Xe |
| |
| Roland Scheidegger (3): |
| |
| - llvmpipe: don't assert on exceeding if_stack size |
| - llvmpipe: Fix overflow issues calculating loop iterations for aniso |
| - llvmpipe: Fix alpha-to-coverage without dithering |
| |
| Ruijing Dong (1): |
| |
| - radeonsi/vcn: vcn5 roi fix |
| |
| Ryan Mckeever (5): |
| |
| - pan/format: Update format flags to follow HW spec |
| - pan/format: Add PAN_BIND_STORAGE_IMAGE flag |
| - panvk: Enable KHR_format_feature_flags2 and use them |
| - panvk: shaderStorageImageReadWithoutFormat support |
| - panvk: shaderStorageImageWriteWithoutFormat support |
| |
| Sagar Ghuge (12): |
| |
| - intel/genxml: Update BLEND_STATE_ENTRY structure |
| - anv: Enable simpleFloatBlendEnable on Xe3+ |
| - iris: Enable simpleFloatBlendEnable on Xe3+ |
| - blorp: Enable SimpleFloatBlendEnable on Xe3+ |
| - intel/compiler: Drop primitive leaf desc load code |
| - anv: Exclude non-standard block shapes on Xe2+ |
| - intel/compiler: Zero out the header for texel fetch |
| - anv: CPS LOD Compensation Enable is deprecated on Xe2+ |
| - intel/compiler: Add support for MSAA typed load/store messages |
| - intel/compiler: Lower sample index into coord for MSRT messages |
| - intel/compiler: Don't lower 64bit data memory access on LSC |
| - intel/compiler: Fix stackIDs on Xe2+ |
| |
| Samuel Pitoiset (227): |
| |
| - ac,radv,radeonsi: add new GFX12_DCC_WRITE_COMPRESS_DISABLE tiling flag |
| - radv/meta: add missing pipeline lookups |
| - radv/nir: update radv_nir_opt_tid for derivative group quads |
| - radv: implement derivative group quads on GFX12 |
| - radv: advertise computeDerivativeGroupQuads on GFX12 |
| - radv/meta: stop using string keys also for DGC and query objects |
| - util/disk_cache: add a new helper to create a disk cache |
| - vulkan/runtime: allow to use a different disk cache |
| - radv: fix caching on-demand meta shaders |
| - radv: fix adding the BO to cmdbuf list when starting conditional rendering |
| - radv: fix fetching draw vertex data from counter buffers with transform feedback |
| - radv: remove redundant drawCount == 0 for indirect mesh/task draws |
| - radv: use radv_indirect_dispatch() more |
| - radv: rework passing dispatch info via radv_dispatch_info |
| - radv: rework passing draw info via radv_draw_info |
| - radv: do not keep track of the streamout binding buffer |
| - nir: adjust number of components for cmat_muladd_amd |
| - radv/nir: add a struct for parameters to cooperative matrix lowering |
| - radv/nir: add cooperative matrix lowering for GFX12 |
| - radv: advertise VK_KHR_cooperative_matrix on GFX12 |
| - radv/meta: use BDA for query resolves |
| - radv/meta: compute the destination addr earlier for query resolves |
| - radv/meta: simplify creating buffers for R32G32B32 operations |
| - radv: remove unused device memory init/finish helpers |
| - radv/video: pass addr to send_cmd() |
| - radv/rmv: pass addr to log_resource_bind_locked() |
| - radv: pass addr to radv_copy_buffer() |
| - radv: rename radv_buffer::bo_va to addr |
| - radv: compute radv_buffer::addr at bind time |
| - radv: use radv_buffer::addr more |
| - radv/video: fix missing image offset when computing VA |
| - radv/meta: use BDA for clear HTILE mask |
| - radv/meta: use BDA for copying VRS rates to HTILE |
| - radv: use BDA for the uploaded parameters with DGC |
| - radv: simplify determining VBO size |
| - radv/meta: remove the heuristic that prefers CP DMA for GTT BOs |
| - radv/meta: disable conditional rendering for fill/update buffer operations |
| - radv/meta: inline copy_buffer() |
| - radv: rename fill/copy memory helpers |
| - radv/meta: add radv_{copy,fill,update}_memory() helpers |
| - radv: compute VBO addr at bind time |
| - radv: stop relying on VkBuffer for VBO |
| - ac/gpu_info: add gfx12_supports_dcc_write_compress_disable |
| - radv: add initial DCC support on GFX12 |
| - vulkan: add descriptor buffer support to compute astc decoder |
| - radv/meta: switch to descriptor buffers |
| - radv: remove radv_buffer_{init,finish}() helpers |
| - radv: remove radv_buffer_view_{init,finish}() helpers |
| - radv: fix adding the VRS image BO to the cmdbuf list on GFX11 |
| - radv/meta: add BOs to cmdbuf list earlier for image copy operations |
| - radv/meta: pass the buffer addr to SDMA image buffer copy operations |
| - radv/meta: use radv_copy_memory() instead of radv_copy_buffer() |
| - radv/meta: remove the buffer dependency for image copy operations |
| - radv: stop using image binding offset when exporting BO metadata |
| - radv: remove redundant zero initialization when creating images |
| - radv: rename radv_image::bindings::bo_va to addr |
| - radv: compute radv_image::bindings::addr at bind time |
| - radv: use radv_image::bindings::addr more |
| - radv: fix adding the BO for unaligned SDMA copies to the cmdbuf list |
| - vulkan: constify vk_acceleration_structure_get_va() |
| - radv: use radv_CmdDispatchIndirect() in the accel struct path |
| - radv/meta: use radv_copy_memory() for the FMASK copy |
| - radv: use radv_copy_memory() for accel structure updates |
| - radv/meta: inline radv_copy_buffer() |
| - radv/meta: rename image<->buffer copies helpers |
| - radv/video: fix adding the query pool BO to the cmdbuf list |
| - radv: stop computing the UUID using the physical device cache key |
| - radv: fix missing SQTT barriers for fbfetch color/depth decompressions |
| - radv: reserve bits explicitly for cache key structs |
| - vulkan: filter duplicate pNext struct at device creation |
| - radv: replace radv_image::shareable by vk_image::external_handle_types |
| - radv: remove meaningless TODOs in radv_GetDeviceImageMemoryRequirements() |
| - radv: handle OOM error properly when selecting image modifier |
| - radv/meta: inline radv_meta_get_view_layer() |
| - radv/meta: remove useless assertion when copying VRS to HTILE |
| - radv: check HTILE compression for depth/stencil images per level |
| - radv: enable DCC fast clears for 8bpp/16bpp on GFX11 |
| - radv: fix re-emitting fragment output state when resetting gfx pipeline state |
| - docs: add missing RADV_PERFTEST=video_encode description |
| - radv: fix trap handler exception options |
| - radv: use radv_emulate_rt() more |
| - radv: remove redundant radv_instance::drirc::rt_wave64 |
| - docs: adjust NGG culling options description |
| - spirv: move workarounds to an inner struct in spirv_to_nir_options |
| - spirv/tests: initialize compiler options in constructor |
| - spirv/tests: add a test for NonSemantic.DebugBreak |
| - spirv/tests: add a test for force_ssbo_non_uniform |
| - spirv/tests: add a test for force_tex_non_uniform |
| - spirv/tests: add a test for lower_terminate_to_discard |
| - ci: update VKCTS main to ba86fb95004331f2cf571dd9adefe2458290ee11 |
| - radv: switch to device address from vk_buffer |
| - ci/b2c: fix passing B2C_* variables |
| - radv/ci: re-enable ET2C emulation testing on non-native GPUs |
| - radv: add RADV_DEBUG=pso_history |
| - ac,radv: add a workaround for a hw bug with primitive restart on GFX10-GFX10.3 |
| - radv/ci: enable RADV_PERFTEST=video_{decode,encode} on few GFX9+ GPUs |
| - aco/tests: use GFX1201 instead of GFX1200 |
| - radv/winsys: use real info for GFX12 in the null winsys |
| - radv: fix a GPU hang with inherited rendering and HiZ/HiS on GFX1201 |
| - radv/amdgpu: fix device deduplication |
| - meson: add build-radv-tests option |
| - ci: enable build-radv-tests for debian-clang and debian-vulkan |
| - radv/winsys: enable has_timeline_syncobj for the null winsys |
| - radv: add a small framework for RADV specific tests |
| - radv/tests: add a test to verify that pipelineCacheUUID is invariant |
| - radv/tests: add a test to verify that pipeline hash matches RGP<->Fossilize |
| - radv/tests: add few tests that verify drirc options |
| - radv: update conformance version |
| - aco: do not apply OMOD/CLAMP for pseudo scalar trans instrs |
| - radv/ci: enable RADV_PERFTEST=video_decode,video_encode on TAHITI,HAWAII and POLARIS10 |
| - ac/rgp: few fixes for GFX11.5 |
| - ac/rgp: add GFX12 definitions |
| - ac/rgp: bump version to 1.6 |
| - ac/rgp: bump instrumentation API version to 1.5 |
| - ac/sqtt: fix registers programming for GFX12 |
| - radv: enable RGP on GFX12 |
| - radv: emit a dummy PS state for noop FS on GFX12 |
| - radv: track redundant register writes for PA_SC_HISZ_CONTROL on GFX12 |
| - radv: fix creating pipeline binary from the traversal shader |
| - radv: use radv_sdma_emit_nop() more |
| - radv: add more SDMA emit helpers |
| - radv: add a helper to know whether compute queue is enabled |
| - radv: remove meaningless comment when resetting SQTT trace |
| - radv: add helpers to know whether video decode/encode queues are enabled |
| - docs: update documentation for RADV_PERFTEST=video_decode,video_encode |
| - radv: fix bpe for the stencil aspect of depth/stencil copies on transfer queue |
| - radv: fix compressed depth/stencil copies on transfer queue |
| - radv: remove useless parameter to radv_sdma_get_buf_surf() |
| - radv: cleanup passing the aspect mask for SDMA operations |
| - radv: remove radv_force_pstate_peak_gfx11_dgpu=true for Helldivers 2 |
| - radv: use PM4 for setting specific graphics registers in the preamble |
| - radv/meta: fix color<->depth/stencil image copies |
| - radv/video: use a pointer to write the total task size |
| - radv/video: rework command buffer emission |
| - ci: uprev vkd3d-proton to 078f07f588c849c52fa21c8cfdd1c201465b1932 |
| - radv/ci: remove vkcts-navi21-llvm-valve completely |
| - radv/ci: remove radv-stoney-flakes.txt |
| - radv/ci: drop aco suffix for CI files |
| - radv/ci: rename radv-gfx1200 to radv-gfx1201 |
| - radv/ci: delete empty radv-hawaii-skips.txt |
| - radv: replace radeon_set_reg_seq by a macro |
| - radv: switch all emit helpers to macros |
| - radv: move the optimized context reg macros with other similar ones |
| - radv: rework radeon_set_uconfig_perfctr_reg_seq to use amd_ip_type |
| - radv: add new helper to emit PKT3_EVENT_WRITE for sampling queries |
| - radv: add radeon_event_write() macros |
| - radv/ci: fix renaming the VKCTS job for tahiti |
| - radv: rework the shader pointer emit as macros |
| - radv: apply some cosmetic changes for future begin/end CS sequences |
| - radv: add a helper to emit indirect buffer for draws/dispatches |
| - radv: add a helper to emit SPM muxsel |
| - radv: slightly change the COND_EXEC for sampling performance counters |
| - radv: use radv_cs_write_data_imm() more |
| - radv: add more helpers to start/stop perfcounters |
| - radv/video: slightly change radv_vcn_sq_header() |
| - radv: add a helper to emit a PKT3_COPY_DATA with an immediate |
| - radv: add a helper to emit PM4 commands to a CS |
| - radv: skip FCE for comp-to-single fast clears with DCC MSAA |
| - radv: rework radv_fast_clear_flush_image_inplace() |
| - radv: add radv_fmask_color_expand() |
| - radv: rework radv_handle_color_image_transition() |
| - radv: inline radv_fast_clear_flush_image_inplace() |
| - radv: make sure to always decompress FMASK before expanding it |
| - radv: disable TC-compatible CMASK with {FMASK,DCC}_DECOMPRESS |
| - ac,radv,radeonsi: use PM4 for shadowed registers |
| - radv: do not trigger FCE or FMASK decompress on compute queue |
| - radv: add queue family assertions when doing decompression passes |
| - radv: remove useless use of radv_image_use_comp_to_single() |
| - radv: only enable HTILE for depth/stencil attachment images |
| - radv: rework command buffer emission with begin/end sequences |
| - radv: switch back radeon_cmdbuf to use 32-bit counters |
| - radv/ci: stop skipping one memory test due to timeouts |
| - radv/ci: remove all skips for STONEY |
| - ac/surface: fix selecting preferred alignments for HiZ/HiS on GFX12 |
| - Revert "radv: program SAMPLE_MASK_TRACKER_WATERMARK optimally for GFX11 APUs" |
| - Revert "radeonsi/gfx11: program SAMPLE_MASK_TRACKER_WATERMARK optimally for APUs" |
| - radv: fix ignoring conditional rendering with vkCmdResolveImage() |
| - radv: add new helper to suspend/resume user conditional rendering |
| - radv: rework suspend/resume user conditional rendering |
| - ac,radv: remove has_scheduled_fence_dependency |
| - ac/gpu_info: bump required DRM minor version to 3.42.0 (kernel 5.15+) |
| - radv: remove useless FDCC_ENABLE bitfield clear on GFX12 |
| - radv: tidy up radv_emit_raster_state() |
| - radv: restore DB_DFSM_CONTROL properly when POPS isn't used |
| - radv: tidy up radv_emit_db_shader_control() |
| - radv: split framebuffer color state emission for GFX12 |
| - radv: split framebuffer depth/stencil state emission for GFX12 |
| - radv: split null framebuffer state emission for GFX12 |
| - radv: stop emitting CB_FDCC_CONTROL to zero on GFX11-GFX11.5 |
| - radv: do not emit the VRS surface VA when it's not enabled |
| - radv: add a function to emit the VRS surface on GFX11 |
| - radv: determine if HiZ/HiS is enabled earlier on GFX12 |
| - radv: add a workaround for buggy HiZ/HiS on GFX12 |
| - radv: apply the workaround for buggy HiZ/HiS on GFX12 for DGC |
| - radv: add before/after draw functions for DGC |
| - radv: move emitting raster and depth/stencil state slightly earlier |
| - radv: emit conservative raster mode as part of the MSAA state |
| - radv: configure COVERAGE_TO_SHADER_SELECT only if conservative rast is enabled |
| - radv: track more MSAA related register writes |
| - radv: regroup emitting all MSAA states in one function |
| - radv: add clip rects state bit for emitting discard rectangles |
| - radv: remove an old workaround for D3D9 with DXVK 2.3.0 and older |
| - radv: move the disable_trunc_coord drirc at instance/pdev level |
| - radv: move emitting more fb registers when rendering begins |
| - radv: use consecutive registers for PA_SC_WINDOW_SCISSOR_{TL,BR} |
| - radv: track redundant DB_RENDER_OVERRIDE register writes on GFX12 |
| - radv: reduce the number of emitted DWORDS for MSAA 8x user sample locs |
| - radv: tidy up radv_emit_raster_state() |
| - radv: tidy up radv_emit_hw_ngg() |
| - radv: add macros for paired context registers on GFX12 |
| - radv: use paired context regs when optimal on GFX12 |
| - radv: only enable DCC for invisible VRAM on GFX12 |
| - ci: update VKCTS main to 76c1572eaba42d7ddd9bb8eb5788e52dd932068e |
| - radv: allocate the SPM BO in GTT for faster readback |
| - radv: print more error messages during SPM initialization |
| - ac/perfcounter: add support for GFX12 |
| - radv/sdma: simplify configuring the number of uncompressed DCC blocks |
| - radv/sdma: use SDMA5_DCC_xxx bitfields |
| - radv/sdma: remove redundant check for compression when getting metadata |
| - radv/sdma: use the correct helper to get the number type field |
| - radv/sdma: add a new flag to know if the surface is compressed |
| - radv/sdma: add support for compression on GFX12 |
| - radv: set radv_disable_dcc=true for WWE 2k23 |
| - radv: fix re-emitting VRS state when rendering begins |
| - radv: do not clear unwritten color attachments with dual-source blending |
| - radv: disable SINGLE clear codes to workaround a hw bug with DCC on GFX11 |
| - radv: fix GPU hangs with image copies for ASTC/ETC2 formats on transfer queue |
| |
| Saroj Kumar (2): |
| |
| - mesa: Add GL_EXT_protected_textures support |
| - radeonsi: Move buffer descriptor slot to the beginning |
| |
| Sasha Finkelstein (1): |
| |
| - vtn_bindgen2: Fix memory corruption |
| |
| Serdar Kocdemir (6): |
| |
| - gfxstream: Track more fence functions on host |
| - gfxstream: Add VK_KHR_multiview support |
| - gfxstream: track pipeline layouts on decoder |
| - gfxstream: wrap semaphore functions on the host |
| - gfxstream: Add dispatcher validity checks |
| - gfxstream: Wrap vkEnumerateInstanceExtensionProperties for host |
| |
| Sergi Blanch Torne (8): |
| |
| - ci: disable Collabora's farm due to maintenance |
| - Revert "ci: disable Collabora's farm due to maintenance" |
| - ci: typo in debian-android in .build-for-tests-jobs |
| - ci: disable Collabora's farm due to maintenance |
| - Revert "ci: disable Collabora's farm due to maintenance" |
| - ci: disable Collabora's farm due to maintenance |
| - Revert "ci: disable Collabora's farm due to maintenance" |
| - Uprev Piglit to ebdf60e0d4b0dc23e79373cb923fdee023eb3b2b |
| |
| Sergii Ushakov (1): |
| |
| - gfxstream: Emulate DMABUF with OPAQUE_FD |
| |
| Seán de Búrca (8): |
| |
| - rusticl/mem: don't write more supported image formats than requested |
| - rusticl/mem: don't create svm_pointers slice from null raw pointer |
| - rusticl/mem: use cl_slice::from_raw_parts in place of std::slice |
| - rusticl: rename CheckedPtr::copy_checked to match primitive method |
| - rusticl: mark CheckedPtr::write_checked as unsafe |
| - rusticl: add debug assertions to avoid truncating casts |
| - rusticl: correct calculation of maximum allocation size |
| - rusticl: cap max alloc size to i32 to avoid overflowing gallium |
| |
| Sil Vilerino (13): |
| |
| - d3d12: Fix HEVC range extension pic params validation |
| - pipe: Add profiles for HEVC 422 8/10b and 444 10b |
| - d3d12: Add support for Y210, Y410, YUY2 and HEVC 422 8/10b, HEVC 444 10b profiles |
| - d3d12: Add NULL initialization for d3d12_video_enc::m_pVideoTexArrayDPBPool |
| - d3d12: Add some missing members initialization for d3d12_video_buffer |
| - d3d12: Increase DPB video texture array pool size for async queue depth |
| - d3d12: Fix array of texture DPB cap detection |
| - d3d12: Fix warning 4305 truncation from type1 to type2 |
| - nir.h: Fix warning C4800 forcing value to bool 'true' or 'false' |
| - u_thread.h: Fix warning C4800 forcing value to bool 'true' or 'false' |
| - d3d12: Fix warning C4800 forcing value to bool 'true' or 'false' |
| - d3d12: Enable warnings C4056, C4305, C4351, C4756, C4800, C4291, C4020, C4624, C4309, C5105, C4024, C4189 |
| - d3d12: Cache the texture array cap requirement in encoder creation for calls to d3d12_video_create_dpb_buffer |
| |
| Simon Ser (9): |
| |
| - gbm: fix get_back_bo() failure with gbm_surface and implicit modifiers |
| - pvr: replace dup() with os_dupfd_cloexec() |
| - freedreno: replace dup() with os_dupfd_cloexec() |
| - iris: replace dup() with os_dupfd_cloexec() |
| - lavapipe: replace dup() with os_dupfd_cloexec() |
| - venus: replace dup() with os_dupfd_cloexec() |
| - panvk: replace dup() with os_dupfd_cloexec() |
| - libsync: replace dup() with os_dupfd_cloexec() |
| - vulkan/wsi/x11: replace dup() with os_dupfd_cloexec() |
| |
| Stéphane Cerveau (2): |
| |
| - anv: fix error code in GetPhysicalDeviceVideoFormatProperties |
| - radv: video: rework maxActiveReferenceSlot/MaxDpbSlots |
| |
| Sushma Venkatesh Reddy (1): |
| |
| - intel/tools: Improve memory allocation failure handling in aubinator_error_decode_xe |
| |
| Sviatoslav Peleshko (3): |
| |
| - anv: Add full subgroups workaround for the shaders that use shared memory |
| - drirc: Apply assume_full_subgroups_with_shared_memory to Resident Evil 2 |
| - vulkan/wsi/headless: Remove unnecessary wsi_configure_image() |
| |
| Tapani Pälli (25): |
| |
| - intel/genxml/anv: fix the layout of call stack handler struct |
| - intel/dev: reduce warning noise from urb settings |
| - intel/common: fix mi_builder_test issue |
| - anv: handle non-wsi images in anv_layout_to_aux_state |
| - anv: tighten condition for changing barrier layouts |
| - anv: apply cache flushes on pipeline select with gfx20 |
| - iris: wait for imported fences to be available in iris_fence_await |
| - intel/compiler: add a spec note about L1WT types being uncached |
| - iris: remove dead code that cannot get hit anymore |
| - intel/dev: update mesa_defs.json from internal database |
| - anv: restrict TessellationDistributionLevel for Wa_16025857284 |
| - iris: restrict TessellationDistributionLevel for Wa_16025857284 |
| - intel/dev: reduce warning noise from urb settings II |
| - isl: add usage field to isl_buffer_fill_state_info |
| - iris: pass down buffer usage for isl_buffer_fill_state |
| - anv: pass down buffer usage for isl_buffer_fill_state |
| - isl/iris/anv: setup L1CacheControl based on surface and buffer usage |
| - compiler/glsl: check that bias is not used outside fragment stage |
| - intel/dev: update mesa_defs.json from internal database |
| - mesa: clamp texbuf query size to MAX_TEXTURE_BUFFER_SIZE |
| - mesa: various fixes for ClearTexImage/ClearTexSubImage |
| - iris: force reallocate on eglCreateImage with GFX >= 20 |
| - iris: make sure to not mix compressed vs non-compressed |
| - anv: put parenthesis to the set_sampler_size equation |
| - intel/dev: update mesa_defs.json from internal database |
| |
| Taras Pisetskyi (1): |
| |
| - anv,driconf: Add sampler coordinate precision workaround for EVE Online |
| |
| Thomas H.P. Andersen (1): |
| |
| - nvk: use a valid allocation scope |
| |
| Tim Keller (1): |
| |
| - dril: Check for null config in dril_target.c |
| |
| Timothy Arceri (14): |
| |
| - util/disk_cache: don't create multidisk cache dir if unused |
| - util/u_idalloc: fix util_idalloc_sparse_alloc_range() |
| - ci: move llvmpipe fails to flakes |
| - mesa: fix reuse of deleted buffer object |
| - mesa: fix reuse of deleted texture object |
| - mesa: fix potential race condition with TexObjects |
| - mesa: fix reuse of deleted sampler object |
| - mesa: fix potential race conditions with FrameBuffers |
| - mesa: fix potential race condition with RenderBuffers |
| - mesa: fix potential race condition with ATIShaders |
| - mesa: fix potential race condition with Programs |
| - nir: fix uniform cloning helper |
| - glsl: fix regression in ubo cloning |
| - util/driconf: add force_gl_depth_component_type_int workaround |
| |
| Timur Kristóf (94): |
| |
| - radv: Move buffer related NIR meta shaders to radv_meta_nir.c |
| - radv: Move blit NIR shaders to radv_meta_nir.c |
| - radv: Move blit2d NIR shaders to radv_meta_nir.c |
| - radv: Move buffer-image copy and clear NIR shaders to radv_meta_nir.c |
| - radv: Move clear NIR shaders to radv_meta_nir.c |
| - radv: Move VRS HTILE copy NIR shader to radv_meta_nir.c |
| - radv: Move DCC retile NIR shader to radv_meta_nir.c |
| - radv: Move expand depth stencil NIR shader to radv_meta_nir.c |
| - radv: Move DCC decompress NIR shader to radv_meta_nir.c |
| - radv: Move FMASK copy NIR shader to radv_meta_nir.c |
| - radv: Move FMASK expand NIR shader to radv_nir_meta.c |
| - radv: Move resolve NIR compute shaders to radv_meta_nir.c |
| - radv: Move resolve NIR fragment shaders to radv_meta_nir.c |
| - radv: Move resolve NIR fs to radv_meta_nir.c |
| - radv: Move NIR helpers from radv_meta.c to radv_meta_nir.c |
| - radv: Move NIR specific function declarations to radv_meta_nir.h |
| - radv: Rename get_global_ids to radv_meta_nir_get_global_ids. |
| - radv: Add missing copyright info to radv_meta_buffer.c |
| - util/enum_operators: Don't define anything for OpenCL |
| - nir: Add struct names where they were missing. |
| - nir: Add missing extern "C" to nir_defines.h |
| - nir: Add forward declarations of relevant structs to nir_defines.h |
| - nir: Move some enums and structs to nir_defines.h |
| - nir: Move nir_tcs_info to separate file. |
| - nir: Move nir_shader_compiler_options to separate file. |
| - nir: Don't include full nir.h in nir_xfb_info.h |
| - nir: Don't include the full nir.h when not necessary. |
| - spirv: Don't include full nir.h in nir_spirv.h |
| - vk: Don't include full nir.h in headers. |
| - glsl: Don't include full nir.h where not necessary. |
| - ac/nir: Don't include nir.h in headers anymore. |
| - ac/nir: Move surface related NIR functions to separate file. |
| - aco: Don't include nir.h in aco_interface.h anymore. |
| - ac: Don't include full nir.h anymore. |
| - radv: Add missing includes and remove unnecessary NIR includes. |
| - hk: Don't include full nir.h in hk_shader.h |
| - zink: Don't include full nir.h where not necessary. |
| - ttn: Don't include full nir.h where not necessary. |
| - nak: Don't include full nir.h in nak.h |
| - nvk: Don't include full nir.h in nvk_shader.h |
| - nir: Don't include xxhash.h in nir.h, only where it is used. |
| - nir: Don't include bitscan.h in nir.h, it's not actually used. |
| - nir: Don't include u_printf.h in nir.h, only where necessary. |
| - nir: Don't include u_format.h in nir.h, it's not actually used. |
| - nir: Remove struct keyword from nir.h where possible. |
| - nir: Add comment to indicate that NIR_PASS_V is deprecated. |
| - nir: Add bool return value to nir_fixup_deref_modes. |
| - nir: Add bool return value to nir_fixup_deref_types. |
| - nir: Don't use deprecated NIR_PASS_V macro anymore. |
| - radv: Stop using deprecated NIR_PASS_V with core NIR passes. |
| - radv: Add bool return value to radv_nir_lower_abi. |
| - radv: Add bool return value to radv_nir_apply_pipeline_layout. |
| - radv: Add bool return value to ray tracing NIR lowerings. |
| - ac/nir: Add bool return value to ac_nir_lower_ls_outputs_to_mem. |
| - ac/nir: Add bool return value to ac_nir_lower_hs_inputs_to_mem. |
| - ac/nir: Add bool return value to ac_nir_lower_hs_outputs_to_mem. |
| - ac/nir: Add bool return value to ac_nir_lower_tes_inputs_to_mem. |
| - ac/nir: Add bool return value to ac_nir_lower_es_outputs_to_mem. |
| - ac/nir: Add bool return value to ac_nir_lower_gs_inputs_to_mem. |
| - ac/nir: Add bool return value to ac_nir_lower_task_outputs_to_mem. |
| - ac/nir: Add bool return value to ac_nir_lower_mesh_inputs_to_mem. |
| - ac/nir: Add bool return value to ac_nir_lower_legacy_vs. |
| - ac/nir: Add bool return value to ac_nir_lower_legacy_gs. |
| - ac/nir/ngg: Add bool return value to ac_nir_lower_ngg_nogs. |
| - ac/nir/ngg: Add bool return value to ac_nir_lower_ngg_gs. |
| - ac/nir/ngg: Add bool return value to ac_nir_lower_ngg_mesh. |
| - radv: Don't use deprecated NIR_PASS_V macro for AMD common NIR passes. |
| - compiler/clc: Stop using deprecated NIR_PASS_V macro. |
| - vulkan/runtime: Don't use deprecated NIR_PASS_V macro. |
| - hk: Don't use deprecated NIR_PASS_V macro anymore. |
| - glsl: Don't use deprecated NIR_PASS_V macro anymore. |
| - nak: Don't use deprecated NIR_PASS_V macro anymore. |
| - nvk: Don't use deprecated NIR_PASS_V macro anymore. |
| - radv: Use flush postamble on GFX7 with different flags. |
| - ac/nir/ngg: Run copy propagation. |
| - ac/nir/ngg: Improve reuse of position value. |
| - ac/nir/ngg: Remove inputs_needed_by_* |
| - ac/nir/ngg: Prepare deferred shader part before adding culling code. |
| - ac/nir/ngg: Gather info about what the deferred shader part uses. |
| - ac/nir/ngg: Use deferred info for compacted arguments. |
| - ac/nir/ngg: Remove cleanup_culling_shader_after_dce. |
| - nir/xfb: Preserve some xfb information when gathering from intrinsics. |
| - nir/opt_varyings: Fix assertion when deduplicating TCS outputs. |
| - radv: Use buffers_written mask when gathering XFB info. |
| - radv: Call nir_opt_undef too after nir_opt_varyings. |
| - radv: Remove radv_streamout_info::num_outputs. |
| - nir/print: Fix variable mode for arrayed output load intrinsics. |
| - radv: Add radv_foreach_stage to ForEachMacros again. |
| - radv: Inline radv_graphics_shaders_link_varyings_{first/second}. |
| - radv: Refactor loops in radv_graphics_shaders_link_varyings. |
| - radv: Move preparation and fixup to separate loops in varying optimization. |
| - radv: Don't call nir_opt_varyings a second time when unnecessary. |
| - radv: Clear dirty flag for MSAA state after emitting it. |
| - radv: Clear dirty flag for clip rects state after emitting it. |
| |
| Tomeu Vizoso (4): |
| |
| - egl/surfaceless: Only choose drivers that expose the graphics capability |
| - kopper: Explicitly choose zink |
| - etnaviv/ml: Use etna_buffer_resource instead of etna_resource |
| - etnaviv: Release screen->dummy_desc_reloc.bo |
| |
| Trigger Huang (2): |
| |
| - radeonsi: Fix perfcounter start event in si_pc_emit_start |
| - radeonsi: Change program sequence for perf counters |
| |
| Valentine Burley (84): |
| |
| - amd/ci: Revert to 6.6 kernel on Raven |
| - zink/ci: Add a fraction for zink-venus-lvp |
| - khronos-update: Update ANDROID guards in vk_android_native_buffer.h |
| - zink/ci: Make zink-venus-lvp-full a nightly job |
| - ci/intel-gpu-freq: Add Xe support |
| - ci: Use new kernel with Intel Xe driver |
| - ci/lava: Allow passing extra cmdline arguments |
| - intel/ci: Drop redundant BOOT_METHOD variables |
| - intel/ci: Add newer i915/ADL-P firmware to rootfs |
| - intel/ci: Load Xe instead of i915 on ADL |
| - intel/dev: Provide a toggle to avoid warnings about unsupported devices |
| - intel/ci: Use INTEL_XE_IGNORE_EXPERIMENTAL_WARNING to reduce warnings |
| - intel/ci: Update expectations for Xe |
| - amd/ci: Fix fraction for radv-stoney-angle-full |
| - ci/android: Don't build desktop GL CTS |
| - ci/lava: Don't build VK-main for arm64 |
| - ci: Allow building ANGLE for multiple platforms |
| - ci/android: Build and use ANGLE |
| - ci/android: Check ANGLE version |
| - ci: Move debian-android up to the build-for-tests stage |
| - freedreno/ci: Unify naming for a306 jobs |
| - zink/ci: Rename a618 suite and expectation files |
| - ci/angle: Rework building ANGLE (again) |
| - ci/lava: Build ANGLE for arm64 |
| - ci/android: Add build section for Cuttlefish |
| - ci/angle: Use lld-19 for linking ANGLE |
| - intel/ci: Fix manual rules for ANGLE jobs |
| - ci/angle: Uprev ANGLE |
| - ci: Update expectations from latest nightly |
| - turnip/ci: Add nightly ANGLE jobs on a618 and a660 |
| - ci/container: Include ANGLE in the arm64 test-gl container |
| - turnip/ci: Rename valve-freedreno-turnip-manual-rules |
| - turnip/ci: Add a nightly ANGLE job on a750 |
| - radv/ci: Don't start X11 for ANGLE |
| - anv/ci: Don't start X11 for ANGLE |
| - ci: Simplify downloading kernel for crosvm |
| - ci: Don't download the kernel image in lava_build.sh |
| - intel/ci: Update GuC firmware for ADL-S and ADL-N |
| - ci: Use new kernel that supports more Mediatek devices |
| - anv/ci: Update expectations from latest nightly |
| - intel/ci: Honor device-specific FDO_CI_CONCURRENT variables |
| - intel/ci: Add brask and nissa |
| - anv/ci: Migrate anv-adl-angle job to brask |
| - intel/ci: Migrate intel-adl-cl and intel-adl-skqp to nissa |
| - anv/ci: Increase parallelism of zink-anv-adl |
| - lavapipe: Update driverVersion |
| - anv/ci: Remove fixed test from xfails |
| - zink/ci: Run more traces on Tiger Lake |
| - zink/ci: Add trace testing on Alder Lake |
| - anv/ci: Append -vk suffix to VKCTS job names |
| - ci: Update kernel to include i.MX8MP dtb |
| - panfrost/ci: Migrate the G57 GL job to MT8195 |
| - panfrost/ci: Add a Piglit job on G57 |
| - panfrost/ci: Shorten Piglit job names |
| - panfrost/ci: Pin g610-gl job to 4GB DUTs |
| - tu: Switch to device address from vk_buffer |
| - ci: Enable Perfetto in debian-no-libdrm for Turnip build testing |
| - zink/ci: Work around recent OOM issues in zink-anv-adl |
| - ci/deqp: Delete more uncompressed caselist files |
| - ci: Add missing kvm runner tags |
| - ci/android: Don't delete ninja after building LLVM |
| - ci/android: Keep the LLVM install when rebuilding |
| - ci/android: Update to LLVM 19 for Android |
| - ci/android: Temporarily disable building llvmpipe |
| - radv/ci: Delete obsolete vkcts-stoney-valve job |
| - amd/ci: Rename AMD jobs to follow unified naming convention |
| - ci: Disable the Google freedreno farm |
| - panvk/ci: Migrate the G52 VK job to MT8186 |
| - ci: Make it possible to use ANGLE traces on other architectures |
| - radv/ci: Update ANGLE version used for traces |
| - ci/android: Remove platform-tools from test-android container |
| - ci/android: Add section for downloading Android CTS |
| - ci/container: Disable debian/arm32_test-vk container |
| - ci/container: Remove double build sections in test-* containers |
| - ci/container: Move calling strip-rootfs.sh to common scripts |
| - ci/android: Add build section for uninstalling build software |
| - ci/container: Drop unnecessary variables for image paths |
| - intel/ci: Convert iris-kbl-piglit to deqp-runner suite |
| - ci/piglit: Drop redundant PIGLIT_PROFILES variable |
| - ci/piglit: Consolidate HWCI_TEST_SCRIPT for piglit traces |
| - ci/piglit: Remove piglit-runner.sh script |
| - ci/piglit: Consolidate identical skip lists for X11 and gbm |
| - ci/lava: Consolidate piglit trace job definitions |
| - zink/ci: Work around recent OOM issues in zink-anv-tgl |
| |
| Vasily Khoruzhick (20): |
| |
| - lima: ppir: handle ffma in the backend |
| - lima: ppir: improve readability of ppir representation dump |
| - lima: ppir: fixup src node when replacing src for select and load_reg |
| - lima: ppir: print index of the node that breaks node_to_instr |
| - lima: ppir: fix disassembling atan and combiner codegen definition |
| - lima: ppir: assert on unexpected pipeline dest for fmul and vmul |
| - lima: ppir: add codegen for mov and mul on combiner unit |
| - lima: ppir: use combiner unit for mul |
| - lima: ppir: duplicate fneg and fabs if its source is an intrinsic |
| - lima: ppir: fix regalloc bugs |
| - lima: ppir: assign an index for discard block |
| - lima: ppir: introduce an optimizer |
| - lima: ppir: try scheduling root nodes into the same instruction |
| - lima: ci: update deqp CI expectations |
| - lima: ppir: add compactification pass |
| - lima: ppir: reuse uniform load in instruction if possible |
| - lima: ppir: reuse load_temp/store_temp nodes if possible |
| - lima: ppir: assign actual index to discard block |
| - lima: ppir: optimize branches |
| - lima: ppir: try inserting nodes into successor instr for uncond branch |
| |
| Vignesh Raman (2): |
| |
| - s3_upload: improve url validation and error message |
| - ci: Uprev kernel to 6.14 |
| |
| Visan, Tiberiu (2): |
| |
| - amd/vpelib: Fix studio range |
| - amd/vpelib: Apply normalization for full range |
| |
| Vlad Zahorodnii (3): |
| |
| - egl/wayland: Damage whole surface using wl_surface_damage_buffer() |
| - vulkan/wsi/wayland: Damage whole surface using wl_surface_damage_buffer() |
| - vulkan/wsi/wayland: Document why wl_surface_damage() code path ignores provided damage |
| |
| Xaver Hugl (6): |
| |
| - increase required wayland-protocols version to 1.41 |
| - vulkan/wsi: implement support for VK_EXT_hdr_metadata on Wayland |
| - vulkan/wsi: handle the compositor not supporting extended target volume better |
| - vulkan/wsi: don't use sRGB if the compositor doesn't support it |
| - vulkan/wsi: validate HDR metadata to not cause protocol errors |
| - vulkan/wsi: warn once when HDR metadata is skipped because of protocol errors |
| |
| Yinjie Yao (2): |
| |
| - gallium/pipe: Increase hevc max slice to 600 |
| - frontends/va: Handle properly when decoding more slices than limit |
| |
| Yiwei Zhang (124): |
| |
| - venus: scrub disallowed ycbcr features for rgba10x6 |
| - venus: further sanitize image props for rgba10x6 |
| - venus: refactor more to image format props sanitization |
| - venus: enable VK_EXT_external_memory_acquire_unmodified if needed |
| - venus: use dedicated allocation for ANB image memory import |
| - venus: emulate a second graphics queue on Android |
| - venus: sync venus protocol for below extensions |
| - venus: group private data together with other 1.3 exts |
| - venus: fix to handle pipeline flags2 from maint5 |
| - venus: support VK_EXT_blend_operation_advanced |
| - venus: refactor to share more codes between pipeline state fillings |
| - venus: support VK_EXT_sample_locations |
| - venus: fix sampler locations feats and props scrub |
| - venus: sync protocol to v1.3.302 |
| - Support 5 more promoted extensions |
| - venus: fix maintenance5 props init and create flags2 |
| - venus: drop unused codes |
| - venus: sync protocol for v1.4.307 release and update promoted entries |
| - venus: updated to use core types for promoted extensions |
| - venus: support VK_KHR_shader_subgroup_rotate |
| - venus: support VK_KHR_shader_float_controls2 |
| - venus: support VK_EXT_pipeline_protected_access |
| - venus: support VK_EXT_pipeline_robustness |
| - venus: support VK_KHR_map_memory2 |
| - venus: support VK_KHR_global_priority |
| - venus: support VK_KHR_dynamic_rendering_local_read |
| - venus: a few tiny naming fixes |
| - venus: support VK_KHR_maintenance6 |
| - venus: update second queue emulation for 1.4 requirement |
| - venus: deprecate a few useless micros |
| - venus: advertise 1.4 support |
| - venus: fix 2 entry points from maint6 |
| - venus: fix sample location info validity |
| - venus: limit second queue emulation to android framework |
| - venus: temporarily disable 1.4 support |
| - venus: suppress a few -Wmaybe-uninitialized |
| - venus: drop unnecessary struct |
| - venus: use sharing_mode from common vk_image |
| - venus: fix image format cache miss with AHB usage query |
| - venus: sync latest protocol v3 support for host copy |
| - venus: extend image format cache for host copy props |
| - venus: implement host image copy commands |
| - vulkan/util: clean up copy_property and fix for setter |
| - venus: enable VK_EXT_host_image_copy support |
| - venus: re-enable 1.4 support |
| - venus: align on wsi frontends support |
| - venus: support wsi maintenance1 extensions |
| - venus: sync protocol for the passthrough extensions |
| - venus: added passthrough extension support - Part I |
| - venus: added passthrough extension support - Part II |
| - venus: added passthrough extension support - Part III |
| - venus: added passthrough extension support - Part IV |
| - venus: added passthrough extension support - Part V |
| - venus: relax the requirement for sync2 |
| - venus: fix an obsolete protocol sync earlier |
| - venus: fix a memory corruption in query records recycle |
| - venus: sync protocol for ray tracing support |
| - venus: add a debug option for ray tracing support |
| - venus: support VK_KHR_deferred_host_operations |
| - venus: add stubs for accel struct host commands |
| - venus: implement VK_KHR_acceleration_structure - Part I |
| - venus: implement VK_KHR_acceleration_structure - Part II |
| - venus: implement VK_KHR_acceleration_structure - Part III |
| - venus: enable VK_KHR_acceleration_structure |
| - venus: support VK_KHR_ray_query |
| - venus: prepare push template for ray tracing pipeline |
| - venus: implement VK_KHR_ray_tracing_pipeline commands |
| - venus: enable VK_KHR_ray_tracing_pipeline |
| - venus: support VK_KHR_ray_tracing_position_fetch |
| - venus: support VK_KHR_ray_tracing_maintenance1 |
| - lavapipe: set availability bit for accel struct host queries |
| - lavapipe: fix accel struct device query copy |
| - venus: use common memory report implementation |
| - venus: fix to ignore dstSet for push descriptor |
| - venus: extend async descriptor set alloc coverage |
| - venus: relax 2 assertions for prime blit path |
| - venus: sync protocol support for maint7 |
| - venus: sync protocol for accel struct indirect build encoding fix |
| - pan/kmod: set DRM_RDWR for exported dma-bufs |
| - venus: drop vn_call usage on apis without any returns |
| - venus: sync latest protocol for more extension support |
| - venus: add a few more trivial extensions |
| - venus: support VK_EXT_filter_cubic |
| - venus: support VK_EXT_legacy_dithering |
| - venus: support VK_EXT_depth_bias_control |
| - venus: support VK_EXT_depth_clamp_control |
| - venus: support VK_EXT_attachment_feedback_loop_dynamic_state |
| - venus: support VK_EXT_nested_command_buffer |
| - venus: default to passthrough ray tracing support |
| - venus: add a new debug option to revive memory budget support |
| - venus: properly enable display platform extensions |
| - venus: back out display control |
| - venus: support VK_EXT_debug_report |
| - venus: sync latest protocol for 2 more extensions |
| - venus: support VK_EXT_pipeline_library_group_handles |
| - venus: support VK_EXT_image_sliced_view_of_3d |
| - panvk: disable VK_KHR_shader_quad_control |
| - panvk: fix dependency for EXT_display_control |
| - vulkan: update ALLOWED_ANDROID_VERSION for api level 34 |
| - vulkan: update ALLOWED_ANDROID_VERSION for api level 35 |
| - venus: rename common vk object base member to vk |
| - venus: explicitly get vn_device from vk_device |
| - venus: use common vk_command_pool |
| - venus: use vk_command_pool internals |
| - venus: use common vk_command_buffer |
| - venus: use vk_command_buffer internals |
| - venus: use common cmd pool_link |
| - docs: demote VK_KHR_shader_relaxed_extended_instruction |
| - venus: fix unexpected ring alive status expire upon owner thread switch |
| - venus: fix ahb usage caching |
| - venus: fix maint4 multi-planar memory requirements |
| - venus: improve image memory requirement cache for image aliasing |
| - venus: enable VK_EXT_debug_utils |
| - virgl/venus/vtest: align capset and protocol with virglrenderer |
| - panvk/csf: rework cache flush reduction |
| - panvk: fix memory requirement query for aliased disjoint image |
| - venus: support VK_KHR_maintenance7 |
| - venus: sync latest protocol |
| - venus: support VK_EXT_buffer_device_address |
| - venus: support VK_KHR_shader_relaxed_extended_instruction |
| - venus: refactor format properties cache to be extensible |
| - venus: support VK_EXT_multisampled_render_to_single_sampled |
| - docs: update venus driver page |
| - venus: fix missing renderer destructions |
| |
| Yogesh Mohan Marimuthu (7): |
| |
| - winsys/amdgpu: do not use rcs->csc |
| - winsys/amdgpu: make csc context as array |
| - winsys/amdgpu: amdgpu_cs_context is csc, amdgpu_cs is acs |
| - winsys/radeon: struct radeon_cmdbuf is rcs instead of cs for consistency |
| - winsys/amdgpu: same_queue variable should be set if there is only one queue |
| - winsys/amdgpu: userqueue multi ctx jobs are guaranteed to be in sequence |
| - winsys/amdgpu: userq non imported fence can be ignored for same ip_type |
| |
| Yurii Kolesnykov (1): |
| |
| - Get rid of 5 remaining references to glapitable.h |
| |
| Zan Dobersek (11): |
| |
| - tu/a750: invalidate vertex state before CP_DRAW_INDIRECT_MULTI |
| - tu: make tu_debug_flags enum 64-bit |
| - tu: use query index when retrieving performance query iovas |
| - tu: performance query result writes must use dedicated union type |
| - tu/a7xx: disable preemption during performance query measurement |
| - freedreno: add common implementation of perfcntr-based derived counters |
| - tu: support exposing derived counters through VK_KHR_performance_query |
| - tu: fix zero-index perfcntr collection for derived counters |
| - tu: disable logic operations for float and sRGB formats |
| - freedreno/registers: add useful A6XX_SP_TP_MODE_CNTL bitfields |
| - tu: allow D3D-compatible texture coordinate rounding |
| |
| Zhao, Jiali (1): |
| |
| - amd/vpelib: Fix studio output CSC |
| |
| forbiddenlake (2): |
| |
| - docs: Fix HTML build with Sphinx 8.2 |
| - ci/alpine: upgrade sphinx and hawkmoth to the latest versions |
| |
| irql-notlessorequal (7): |
| |
| - hasvk: Fix non-functioning version override. |
| - elk: ensure VUE header writes in HS/DS/GS stages |
| - elk: always write the VUE header |
| - hasvk: Pre-plumbing needed for VK_KHR_maintenance5 |
| - hasvk: Implement VkPipelineCreateFlags2KHR support |
| - hasvk: Implement VK calls and formats. |
| - hasvk: Advertise VK_KHR_maintenance5 |
| |
| lcagustini (1): |
| |
| - panvk: Advertise support for VK_EXT_border_color_swizzle |
| |
| liuqiang (1): |
| |
| - intel/brw: Remove redundant condition in components_read() |
| |
| llyyr (1): |
| |
| - vulkan/wsi/wayland: initialize surface colorspace with PASS_THROUGH_EXT |
| |
| sarbes (1): |
| |
| - lima: add genxml for texture descriptor |