| Mesa 23.2.1 Release Notes / 2023-09-28 |
| ====================================== |
| |
| Mesa 23.2.1 is a new development release. People who are concerned |
| with stability and reliability should stick with a previous release or |
| wait for Mesa 23.2.2. |
| |
| Mesa 23.2.1 is an unusual first stable release due to the accidentl tagging of |
| 23.2.0 durring the rc cycle. |
| |
| Mesa 23.2.1 implements the OpenGL 4.6 API, but the version reported by |
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / |
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. |
| Some drivers don't support all the features required in OpenGL 4.6. OpenGL |
| 4.6 is **only** available if requested at context creation. |
| Compatibility contexts may report a lower version depending on each driver. |
| |
| Mesa 23.2.1 implements the Vulkan 1.3 API, but the version reported by |
| the apiVersion property of the VkPhysicalDeviceProperties struct |
| depends on the particular driver being used. |
| |
| SHA256 checksum |
| --------------- |
| |
| :: |
| |
| 64de0616fc2d801f929ab1ac2a4f16b3e2783c4309a724c8a259b20df8bbc1cc mesa-23.2.1.tar.xz |
| |
| |
| |
| New features |
| ------------ |
| |
| - VK_EXT_attachment_feedback_loop_dynamic_state on RADV |
| |
| - extendedDynamicState3SampleLocationsEnable on RADV |
| |
| - VK_EXT_dynamic_rendering_unused_attachments on RADV |
| |
| - VK_EXT_mesh_shader on lavapipe |
| |
| - OpenGL 3.1 on Asahi |
| |
| - OpenGL ES 3.0 on Asahi |
| |
| - VK_KHR_fragment_shader_barycentric on RADV/GFX10.3+ |
| |
| - VK_KHR_ray_tracing_pipeline on RADV/GFX10.3+ |
| |
| - VK_EXT_depth_bias_control on RADV |
| |
| - VK_EXT_fragment_shader_interlock on RADV/GFX9+ |
| |
| - VK_EXT_pipeline_robustness on RADV |
| |
| |
| |
| Bug fixes |
| --------- |
| |
| - intel: State cache invalidation after BLORP binding table setup ought to be unnecessary on ICL. |
| - RadeonSI: glClear() causes clear texture for some frames on RX580 |
| - shader_test causing a crash in compiler |
| - Crash in st_ReadPixels |
| - [ANV] [DG2/A770] The Spirit and The Mouse, miscellaneous issues with Mesa Git |
| - Penumbra: Overture hangs on new game loading screen |
| - radv: Regression from 266b2cfe5bf3feda16747c50c1638fb5a0426958 |
| - h264 encoding picture showed randomly repeated frames. |
| - [Google][Rex][anv] GLES dEQP test fails in anv when run via ANGLE-on-Venus on ChromeOS ARCVM. |
| - VAAPI on VCN: bad stream may crash whole gfx system |
| - aco: Assertion when compiling CP2077 shader |
| - [RADV] Dead by Daylight memory leak (shader-related?) on 23.1.6 |
| - gpu hang on DG2 when running KHR-GLES31.core.texture_cube_map_array.image_op_tess* |
| - KHR-GLES31.core.texture_cube_map_array.image_op_tessellation_evaluation_sh fail on GFX12+ |
| - wsi: deadlocks when DISPLAY is changed |
| - VAAPI: AMDGPU crash on RX 6900 XT on corrupted video |
| - [RADV] red and pink tinted shadows in Overwatch 2 on 7900 XTX |
| - blorp regression on dg2 |
| - radv: commit 81641b01555faa4dd1dfc7de2513ad8d63e77ab7 leaded to artifacts in Quake II RTX |
| - [radv] Colors are distorted in Cyberpunk 2077 with ray tracing enabled |
| - Forza Horizon 5 stuttering since mesa 23.1.4 / 9b008673 revert as a FIX |
| - glCopyTexSubImage2D is very slow on Intel |
| - NVE4 (GeForce 710) fails to get vdpau in mesa git |
| - nouveau prevents hardware acceleration with Chromium (Wayland) |
| - Corrupt text rendering in Blender |
| - DRI2 gallium frontend is using bad format type |
| - Incorrect vlVaCreateBuffer/vlVaMapBuffer behavior for buffer type VAEncCodedBufferType in Gallium |
| - ci: do not download perfetto on-fly in build jobs |
| - Shared Memory Leak With Qt OpenGL Applications |
| - OpenGL, SIGSEGV when program pipeline objects has separated vertex shader progam and separated fragment shader progam with in/out |
| - 975a8ecc881873744d851ab0ef45ad7698eaa0ef "frontends/va: use resources instead of views" cause radeonsi can't play video. |
| - Rusticl,radeonsi: ac_rtld error(2): too much LDS |
| - aco, radv Rage 2 menu corruption - bisected |
| - radv, aco: World War Z character texture regression on 7900xtx |
| - lavapipe/llvmpipe: regressions since descriptor rewrite |
| - Building llvmpipe with LP_USE_TEXTURE_CACHE set fails since 23.2.0-rc1: error C2039: dynamic_state is not member of lp_build_sampler_soa in lp_tex_sample.c |
| - [anv] Death Stranding crashes |
| - Can no longer build Clover without llvmspirvlib |
| - Baldurs Gate 3 (DX11) - Graphical corruption on RDNA3 (ACO regression) |
| - intel: Deathloop and other DX12 games fail assert(validated) with invalid SEL instruction |
| - gpu hangs on dg2 with mesh shading enabled on vkcts |
| - GTF-GL46.gtf21.GL.build.CorrectFull_vert regressed on intel platforms |
| - radeonsi: Deadlock when creating a new GL context in parallel with linking a shader on another GL context |
| - robustness2 raygen tests intermittently fail in Intel Mesa CI |
| - glthread: huge performance regression |
| - DirectX games do not launch on Intel HD Graphics 4000 (IVB GT2) [bisected] |
| - [Vega 64] Newer Mesa-git revisions past 283be8ac3b8610a77b28ebe9e44b946b979f0381 crash the system when accessing hardware accelerated apps |
| - Docs: Imagination driver not have documentation in https://docs.mesa3d.org/ |
| - Unigine Heaven broken on Navi 21 since https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22846 |
| - [bisected] amdgpu graphics acceleration causing system crashes on 22f3bcfb5a33 or later |
| - anv: GPU hangs on MSAA tests with Angle |
| - AMD OpenGL texture corruption and crashing regression in java app |
| - The filenumber part of the #line preprocessor directive is ignored for multiline error messages |
| - r300: channel merging missed case for mad + mov |
| - radv: incorrect RTE rounding in corner cases |
| - Confidential issue #4103 |
| - r600 regression |
| - clc: compiler_test gets built even if unit tests are disabled via -Dbuild-tests=false |
| - x11 swapchain fails to check for DRI3 PixmapFromBuffers error |
| - msys2: build fails with error: conflicting types for '_glapi_add_dispatch' |
| - [bisected][anv] newly enabled test (dEQP-VK.api.info.format_properties.g8_b8r8_2plane_420_unorm) failing |
| - deqp gles3.1 gpu hangs on DG2 A380 when running zink |
| - [amd/drm-shim] build issue on ppc64 |
| - radeonsi: bogus advertisement for encode/decode support for 10 bit h264 video |
| - radeon: Basphemous graphical glitch |
| - aztec ruins gl benchmark slow to compile shaders on intel |
| - anv: false cacheline flushing or insufficient buffer alignment on at least ADL |
| - macOS build error timespec |
| - intel: Borderlands 2 misrendering with ZINK with OpTerminateInvocation instruction |
| - gallium: Error path in st_create_context_priv leads to segfault |
| - [Vulkan][TGL] vkCmdCopyQueryPoolResults failed to write buffer after compute shader write with overlap |
| - r600: GPU hang on The Long Dark on R600/R700 |
| - Add iris and crocus to features.txt |
| - r600: Segfault on glxgears and almost every OpenGL applications on RV770 (regression) |
| - R9 280 - Broken font rendering in Godot Engine (GLES2) - Radeonsi |
| - radeonsi gcn1 regression |
| - ANV crashes on init on 32 bit builds |
| - eglCreateImageKHR should throw a error when called with anything but EGL_NO_CONTEXT |
| - virgl: Stack overflow in virgl_bind_sampler_states on hosts with more than 32 samplers |
| - [TGL] regression with r11_g11_b10 formats when running through virgl |
| - anv: incorrect vkGetPhysicalDeviceImageFormatProperties2KHR success |
| - r300: reconstruct ARR in shaders from wined3d |
| - ci: enable pre-merge testing for Zink/RADV |
| - rusticl: segmentation fault when enabling for llvmpipe and zink with the driver set to zink |
| - unify load_ubo_dxil and load_ubo_vec4 |
| - Move \`lower_loads_and_stores_to_dxil` bit-size lowering logic to \`nir_lower_mem_access_bit_sizes` |
| - Intel drivers fail to link with -Dxmlconfig=disabled |
| - mesa: Remove dynamic dispatch stubs |
| - radv: regression UE5.2 nanite d3d12 vertex explosion |
| - [ANV/DG2] nvpro-samples/vk_raytracing_tutorial_KHR/ray_tracing_reflections crash |
| - validate_intrinsic_instr: Assertion \`dst' failed. |
| - anv: fails to build on aarch64 |
| - radv: crash/freeze/assert with raytracing and Elden Ring 1.09 |
| - Elden Ring freeze when summoning cooperator with Raytracing |
| - gc_alloc_size returns unaligned allocations |
| - Rusticl OpenCL: panicked at 'called \`Option::unwrap()` on a \`None` value' && void SPIRVKernelParser::applyDecoration(uint32_t, const spv_parsed_instruction_t*): Assertion \`c.first == id' failed. |
| - Using a \`NULL` pointer as \`bitmap` in \`glBitmap` leads to \`GL_OUT_OF_MEMORY` while creating display-list ("glNewList -> glBitmap") |
| - Bitwise and with constant 31 removed on width argument to BitFieldSExtract, causing incorrect result on RADV ACO |
| - Rusticl OpenCL: Simple SYCL / DPC++ program hangs indefinitely at rusticl::api::event::wait_for_events() |
| - radeonsi: Metro Last Light Redux graphical glitches |
| - radv: VK_KHR_fragment_shader_barycentric support |
| - freedreno/a6xx: assert(state->view_rsc_seqno[i] == seqno) failed with sway |
| - radv: Trackmania 2020 crashing on mesa-git |
| - radv crashes when using vertex format VK_FORMAT_B10G11R11_UFLOAT_PACK32 |
| - changes in commit e4b6a0a82457b3ef40c5857412e20bc344ff302c leads to GPU hang |
| - radv,aco: In the game Rise of the Tomb Raider on RDNA 3 GPUs appeared artifacts after commit 290c3d360e5a6f5226c062d6a9267629adb1060e |
| - CI: Linux CI jobs naming |
| - docs.mesa3d.org contrast is bad in dark mode |
| - iris now requires Linux v5.3 |
| - Wolfenstein II: The New Colossus vsync off crash |
| - Surfaceless mode ES2.0 number of vertices limitation |
| - freedreno: firefox crashes on video playback |
| - radv: unaligned vertex input regression |
| - rusticl build error: error[E0308]: mismatched types on some archs |
| - GetInternalformativ with GL_TEXTURE_2D and GL_CLEAR_TEXTURE incrrectly returns GL_INVALID_ENUM |
| - radeonsi: texturing is broken on R9.270x since eaf98b14220d8cbc186d67a929254acc3e7de41a |
| - aco: KHR-GL46.shader_image_load_store.basic-allTargets-atomic asserts on Bonaire |
| - Firefox / VA-API / H.264 decoding artifacts on AMD RX 6600 / Fedora 37 |
| - AMD/RX 6600 - VA-API video output is corrupted if decoded surfaces are exported by vaExportSurfaceHandle and then quickly returned to ffmpeg/va-api decoder and reused |
| - Pixel Game Maker MV - Elfin Force |
| - Anvil - Vulkan CTS tests fail if has_context_isolation set to false |
| - anv: binding table pool leak or overly cached |
| - [regression/bisected] Bone wireframes are no longer rendered correctly in Blender with RadeonSi/Vega |
| - [nine, radeonsi] Texture missing in Heroes of Might and Magic 5 |
| - [REGRESSION] Crash in \`loader_dri3_wait_gl()` due to \`dri3_front_buffer(draw) == NULL` |
| - r600: Visual glitches on The Long Dark with the NIR backend |
| - some piglit tests seg-fault if -Dgles1=disabled is set |
| - anv: Tom Clancy's Rainbow Six Siege [DX11: Image Corruption(FIXED)/Vulkan: crash on lauch] |
| - glSpecializeShaderARB works on SPIR-V compiled with shader compilers ca. 2021, but fails for SPIR-V generated with current compilers |
| - ANV: Vulkan driver regression in clearing Depth/Stencil |
| - radv: Sample rate shading broken in AC:Valhalla |
| - FTBFS: gallivm: src/gallium/auxiliary/gallivm/lp_bld_init.c:45:10: fatal error: llvm-c/Transforms/Scalar.h: No such file or directory (Legacy Pass Manager removed in LLVM 17) |
| - intel: workaround mechanism initialized before device revision (stepping) is available |
| - radv/rt: crash compiling Unity Enemies Demo RT pipelines |
| - gfxbench4/5 crashing on android |
| - mesa 23.0.3 build failure |
| - gallium-aux msan failure on Debian bookworm |
| - radv: Battlefield {1,5} hangs on RX 7900 XTX |
| - radv: graphical artifacts in MSFS running via DXVK on RX7900XT |
| - vulkancts regressions on bdw |
| - [BDW] intel/blorp: MCS partial resolve produces unexpected result |
| - Compile failure v23.0.0 - error: implicit declaration of function |
| - ci/radv: Stop setting MESA_SPIRV_LOG_LEVEL |
| - "frontends/va: report min width and min height values if available" broke VA-API tests on amd-raven |
| - [radeonsi] flickering debug chunk border lines in Minecraft |
| - nine: Lower alpha test in shader key? or require PIPE_CAP_ALPHA_TEST? |
| - radv, radeonsi: Rogue Legacy 2 alpha-to-coverage rendering issues |
| - [r600, TURKS] R600: Unsupported instruction: vec1 32 ssa_1 = intrinsic image_samples (ssa_0) on spec@arb_shader_texture_image_samples@compiler@fs-image-samples.frag (23.1.0-rc4) |
| - anv: Overwatch 2 hangs GPU with GPL enabled |
| - Penumbra: Overture ingame enviroment not displaying on Proton version |
| - Penumbra: Overture ingame enviroment not displaying on Proton version |
| - nir: 'base' may be used uninitialized |
| - vulkan/device_select: no way to select between GPUs of the same model due to bugs |
| - radv: 7900 XTX hair flickering/rendering issues in VaM |
| - radv: cache crashing |
| - nouveau: Regression in arb_transform_feedback_overflow_query-basic from multithreading patches |
| - regression in aco,ac/llvm,radv,radeonsi: handle ps bc optimization in nir for radv |
| - radeonsi: vaapi: \`width >2880 && width % 64 != 0` results in wrong width in h265 stream |
| - [regression] iris: unable to use driver as secondary GPU (primary AMD GPU) |
| - iris: steam doesn't render on dg2 |
| - [llvm 16+] [microsoft-clc] opencl-c-base.h does not exist |
| - Vulkancts clipping / tesselation tests trigger gpu hang on DG2 |
| - Swaped fields in picture in vlc and mythtv if hw accel is on |
| - radeonsi: glGetGraphicsResetStatusEXT keeps returning GL_INNOCENT_CONTEXT_RESET after a GPU reset |
| - WGL: Assert assigns dwThreadId variable |
| - Intel/anv: Modifier problems running gamescope embedded |
| - R600: drop TGSI code path |
| - r600_shader.c:193 r600_pipe_shader_create - translation from TGSI failed ! |
| - nine regression with r600 (bisected) |
| - [ACO] [RADV] Flickering squares in some areas in The Last of Us Part 1 (with workaround) |
| - radv: Jedi Fallen Order flickering & blocky plants |
| - qemu 7.2.0-rc4 with sdl output crashes with assert |
| - nouveau: NV50 (NVAC) broken in latest master |
| - [dozen]: [vkcube] force closing on WSL2 |
| - rusticl failed to build with rust-bindgen 0.65.0 |
| - nine: strange color or transparency of trees when called DrawIndexedPrimitive? |
| - Regression, Bisected: glsl: Delete the lower_tess_level pass breaks r600 tesselation |
| - vkcts-navi21-valve failing often with GCVM_L2_PROTECTION_FAULT_STATUS:0x00X00830 |
| - ci/radv: Remove vkctx-navi21-llvm-valve job? |
| - Deep Rock Galactic GPU freeze (AMD, DX11 DXVK Proton) |
| - radv: Resident Evil 4 Chainsaw Demo GPU hang with Navi 24 |
| - radv: Gotham Knights GPU hang with Navi 24 |
| - SPIR-V error "Invalid back or cross-edge in the CFG" |
| - SPIR-V parsing FAILED: Loop breaks can only break out of the inner most nested loop level |
| - ci: a618 traces performance broken |
| - aco: s_load_dword with negative soffset cause GPU hang |
| - d3d12: Attempting to display a framebuffer through GDI with low bpc produces on-screen corruption |
| - piglit.spec.ext_image_dma_buf_import.ext_image_dma_buf_import crash shutting down |
| - overlay layer: unable to launch titles on steam |
| - radv/zink: spec@ext_texture_integer@multisample-fast-clear gl_ext_texture_integer |
| - ci: a530-gl with 6.3 kernel |
| - a530: hangs with newer firmware version on db820c (apq8096) |
| - tu: debug marker support |
| - VAAPI: Wrong H.264 playback on RX 6900 XT and RX 6700 XT (all Sienna?) |
| - radv: possibly not setting state dirty bits correctly |
| - RADV: VRS attachment not working in specific scenario |
| - VAAPI/AMD: videos less than 64 pixels in width or height are decoded to black |
| - d3d12: DirectX doesn't support seperate stencil functions for front and back face |
| |
| |
| Changes |
| ------- |
| |
| Adam Jackson (1): |
| |
| - egl: Clear EGL_WINDOW_BIT for non-double-buffered EGLConfigs |
| |
| Alan Previn (2): |
| |
| - drm-uapi: bump headers (except AMD) |
| - iris: Add GET_PARAM for protected context capability support |
| |
| Alejandro Piñeiro (12): |
| |
| - v3dv/pipeline: don't prepack up early-z configuration |
| - v3d: use more an auxiliar devinfo |
| - v3d: remove v3d_create_texture_shader_state_bo |
| - v3d: remove v3d_tfu_supports_tex_format |
| - v3d: remove v3d_get_internal_type_bpp_for_output_format |
| - broadcom/compiler: return NULL if we fail to register allocate |
| - v3d: assert if v3d_compile returns NULL |
| - broadcom/compiler: disable tmu pipelining when needed |
| - broadcom/compiler: clarify use of QFILE_VPM |
| - v3dv: refactor copy_image_to_buffer_blit |
| - v3dv: add a linear images to buffer copy codepath |
| - v3dv/device: update conformanceVersion |
| |
| Aleksey Komarov (2): |
| |
| - pan/va: Fix MUX.v2i16 and MUX.v4i8 description |
| - pan/va: fix typo in IADD_IMM.i32 description |
| |
| Alex Denes (1): |
| |
| - virgl: link VA driver with build-id |
| |
| Alexander von Gluck IV (1): |
| |
| - egl/haiku: Fix potential crash if double buffering is disabled |
| |
| Alyssa Rosenzweig (289): |
| |
| - gallium: Add u_default_get_sample_position |
| - zink: Use u_default_get_sample_position |
| - panfrost: Use u_default_get_sample_position |
| - freedreno: Use u_default_get_sample_position |
| - d3d12: Use u_default_get_sample_position |
| - nir: Add more system values for lowering XFB |
| - pan/bi: Don't set has_fsub |
| - asahi: Fix disk cache disable with AGX_MESA_DEBUG |
| - asahi: Minify width/height in create_surface |
| - asahi: Don't use depth/stencil staging blits |
| - asahi: Identify XML for barycentric coordinates |
| - asahi: Track write to separate stencil |
| - agx: Handle splits of uniforms |
| - agx: Fix abs/neg propagation into fcmpsel |
| - agx/lower_zs_emit: Fix progress returning |
| - agx: Handle linear 2D array textureSize() |
| - asahi: Explicitly ban MSAA, compression with linear |
| - asahi: Use 2D array staging resources for cube/3D |
| - asahi: Compress more texture targets |
| - agx: Remove bogus assert |
| - asahi: Use u_default_get_sample_position |
| - agx: Defeature fsub |
| - asahi: Use device_load shift for VBO loads |
| - agx: Fix packing for iadd with shift |
| - asahi: Rename no colour output to tag write disable |
| - asahi: Copy resources if needed to shadow |
| - agx: Don't wait at the end of the shader |
| - asahi: Bind staging resources as RENDER_TARGET |
| - agx/lower_address: Add helper to match multiplies |
| - agx/lower_address: Match multiplies, not only shifts |
| - agx: Ensure load_frag_coord has the right sizes |
| - agx: Rework z/s emit |
| - agx: Validate that collect sources are the same size |
| - agx: Lower I/O to scalar later |
| - asahi: Shrink disk cache size of push ranges |
| - asahi: Bump MAX_PUSH_RANGES to the worst-case |
| - asahi: Implement transform feedback |
| - asahi: Fix depth load/store flags |
| - nir: Add nir_alu_src_as_uint helper |
| - pan/bi: Use nir_alu_src_as_uint |
| - agx: Use nir_alu_src_as_uint |
| - nir: Model AGX-specific multiply-shift-add |
| - agx: Handle imadshl_agx, imsubshl_agx |
| - agx: Fix packing of imsub instructions |
| - agx: Optimize multiplies |
| - zink: Always set a blend state for shader-db |
| - ail: Handle larger block sizes |
| - nir: Allow adding descriptions to ALU opcodes |
| - nir: Make ALU descriptions machine-readable |
| - docs: Include ALU opcode descriptions |
| - nir: Add nir_foreach_phi(_safe) macro |
| - nir: Use nir_foreach_phi(_safe) |
| - dxil: Use nir_foreach_phi_safe |
| - ac/llvm: Use nir_foreach_phi |
| - nir: Use nir_block_last_phi_instr more |
| - nir: Add unified atomics |
| - nir: Add pass to lower atomics to unified |
| - agx: Use unified atomics |
| - pan/bi: Use unified atomics |
| - pan/mdg: Fix icky formatting |
| - pan/mdg: Use unified atomics |
| - gallivm: Use unified atomics |
| - ntt: Use unified atomics |
| - ac/llvm: Don't handle atomic derefs |
| - ac/llvm: Use unified atomics |
| - aco,radv: Use unified atomics |
| - zink: Use unified atomics |
| - ir3: Use unified atomics |
| - nir: Handle unified atomics in simple cases |
| - nir/lower_task_shader: Handle unified atomics |
| - nir/lower_io: Handle unified atomics |
| - nir/lower_ssbo: Handle unified atomics |
| - nir/opt_uniform_atomics: Handle unified atomics |
| - nir/validate: Handle unified atomics |
| - radv: Constify radv_device_supports_etc |
| - radv: Use common GetPhysicalDeviceFeatures2 |
| - r600: Use unified atomics |
| - lvp: Use common GetPhysicalDeviceFeatures2 |
| - tu: Use common GetPhysicalDeviceFeatures2 |
| - agx: Lower legacy atomics sooner |
| - pan/mdg: Lower legacy atomics earlier |
| - panvk: Lower legacy atomics earlier |
| - tu: Lower legacy atomics earlier |
| - v3dv: Lower legacy atomics earlier |
| - lavapipe: Lower legacy atomics sooner |
| - glsl/nir: Produce unified atomics |
| - nir/lower_atomics_to_ssbo: Produce unified atomics |
| - nir/lower_printf: Produce unified atomic |
| - mesa/st: Produce unified atomics |
| - vtn: Produce unified atomics |
| - intel: Produce unified atomics |
| - ac: Produce unified atomic |
| - treewide: Stop lowering legacy atomics |
| - nir: Drop nir_lower_legacy_atomics |
| - ntt: Stop handling legacy atomics |
| - nir: Drop legacy atomics in simple cases |
| - nir/lower_io: Drop legacy atomics |
| - nir/lower_task_shader: Drop legacy atomics |
| - nir/validate: Drop legacy atomics |
| - nir/opt_load_store_vectorize: Reclaim ATOMIC |
| - nir/opt_uniform_atomics: Drop legacy atomics |
| - nir: Remove legacy atomics |
| - nir: Drop unused name from nir_ssa_dest_init |
| - nir: Drop unused argument from nir_ssa_dest_init_for_type |
| - nir: Remove stale TODOs |
| - nir: Fix incorrect comment |
| - util: Add common hex dump utility |
| - asahi: Use common hexdump utility |
| - pan/decode: Use common hexdump |
| - CODEOWNERS: Update panfrost |
| - gallium: Drop Asahi-as-a-swrast hack |
| - asahi: Drop Asahi-as-a-swrast hack |
| - nir: Document extra image source |
| - nir: Add image_texel_address intrinsics |
| - nir: Add pass to lower image atomics |
| - pan/bi: Fix atomic exchange on Valhall |
| - pan/bi: Use nir_lower_image_atomics_to_global |
| - pan/mdg: Use nir_lower_image_atomics_to_global |
| - gallium: Add pipe_image_view::single_layer_view |
| - mesa/st: Set pipe_shader_image::single_layer_view |
| - dxil: Rely on scoped_barrier |
| - treewide: Avoid nir_lower_regs_to_ssa calls |
| - nir/opt_barriers: Add a default callback |
| - agx: Use common combine_all_barriers callback |
| - nir: Drop stale comments |
| - zink: Switch to scoped barriers |
| - panfrost/ci: Skip Piglit tests known to crash |
| - panfrost/ci: Skip hanging test |
| - nir: Add intrinsics for multisampling on AGX |
| - nir/builder: Add nir_replicate helper |
| - treewide: Use nir_replicate |
| - pan/lower_framebuffer: Use nir_replicate |
| - radv/query: Use nir_trim_vector |
| - intel/blorp: Use nir_trim_vector |
| - nir/print: Print locations for geometry shader inputs |
| - gallium: Add util_image_to_sampler_view helper |
| - panfrost: Use util_pipe_image_to_sampler_view |
| - nir: Add and use nir_tex_src_ssa |
| - treewide: Use nir_tex_src_for_ssa |
| - treewide: Use nir_trim_vector more |
| - agx: Set support_16bit_alu |
| - agx: Constant fold when optimizing int64 |
| - agx: Use textures_used, not num_textures |
| - asahi: Add passes to lower MSAA |
| - asahi: Add passes to lower sample intrinsics |
| - asahi: Add alpha-to-coverage (and alpha-to-one) lowering |
| - agx: Assert that sample shading is lowered |
| - asahi: Set uses_sample_shading for background program |
| - asahi: Plumb API sample mask into shaders |
| - asahi: Plumb ppp_multisamplectl into shaders |
| - agx: Model both sources of sample_mask |
| - agx: Plumb in nir_intrinsic_load_sample_mask_in |
| - agx: Handle sample_mask_agx |
| - agx: Enable tag writes when sample mask written |
| - agx: Lower discard in NIR |
| - asahi,agx: Call lower_discard_zs_emit in the driver |
| - agx: Split iter and iterproj instructions |
| - agx: Model interpolation for iter instructions |
| - agx: Handle centroid and sample interpolation |
| - asahi: Lower MSAA |
| - asahi: Use nonempty tib for MSAA |
| - agx: Emit shader info late |
| - asahi: Advertise GL 3.1 |
| - agx: Stop bit-inexact conversion propagation |
| - asahi: Add ASAHI_MESA_DEBUG=nowc flag |
| - asahi: Extract transition_resource helper |
| - asahi: Decompress writable images |
| - asahi: Decompress with format reinterpretation |
| - asahi: Remove stale comments |
| - pan/mdg: Drop lower_locals_to_regs call |
| - lima: Drop lower_locals_to_regs call |
| - ir2: Drop lower_locals_to_regs call |
| - nir: Add AGX atomic intrinsics |
| - agx: Refactor expressions in agx_nir_lower_address |
| - agx: Fold addressing math into atomics |
| - nir/builder: Add steal_tex_src helper |
| - nir/lower_tex: Use nir_steal_tex_src |
| - agx: Use common nir_steal_tex_src |
| - nir: Add interleave_agx instruction |
| - vtn: Handle atomic counter semantics |
| - ir3: Drop reference to unsupported intrinsic |
| - ttn: Emit scoped barriers when needed |
| - ntt: Use scoped barriers |
| - ac/llvm: Drop memory_barrier_buffer impl |
| - glsl: Assume use_scoped_barrier |
| - vtn: Assume use_scoped_barrier |
| - nir: Assume use_scoped_barrier |
| - ttn: Assume use_scoped_barrier |
| - treewide: Remove use_scoped_barrier |
| - nir/tests: Use scoped barriers internally |
| - nir: Remove handling for non-scoped barriers |
| - radeonsi: Scan for scoped barriers |
| - nir: Remove non-scoped barriers |
| - iris: Don't use STREAMING_LOAD without SSE |
| - nir/builder: Add ubitfield_extract_imm helper |
| - agx: Implement bitfieldExtract natively |
| - asahi: Use bitfield_extract for texture lowering |
| - nir: Remove integer and 64-bit modifiers |
| - aco: Drop NIR parallel copy handling |
| - nir: Add discard_agx intrinsic |
| - agx: Update explanation of sample_mask behaviour |
| - agx: Fix discards |
| - agx: Extract coordinate register size calculation |
| - agx: Recollect stored vectors at their use |
| - agx: Add loop header? flag |
| - agx: Validate predecessor information |
| - agx/lower_parallel_copy: Lower 64-bit copies |
| - agx: Implement vector live range splitting |
| - nir/lower_bool_to_int32: Fix progress reporting |
| - nir/lower_locals_to_regs: Add bool bitsize knob |
| - gallivm: Use NIR_PASS macros |
| - nir: Add pixel_coord, frag_coord_zw intrinsics |
| - nir: Add lower_frag_coord_to_pixel_coord pass |
| - pan/bi: Use lower_frag_coord_to_pixel_coord |
| - agx: Use nir_lower_frag_coord_to_pixel_coord |
| - asahi: Use txf for background program |
| - nir/lower_blend: Optimize masked out RTs |
| - nir: Add nir_builder_create returning nir_builder |
| - nir: Use nir_builder_create |
| - treewide: Use nir_builder_create more |
| - treewide: Remove unused builders |
| - nir: Add nir_foreach_function_impl helper |
| - nir: Convert to nir_foreach_function_impl |
| - nir/validate: Assert txf(_ms) matches dimension |
| - nir: Add nir_lower_robust_access pass |
| - broadcom/compiler: Use nir_lower_robust_access |
| - broadcom/compiler: Remove v3d_nir_lower_robust_access |
| - broadcom/compiler: Remove unused #define |
| - broadcom/compiler: Use nir_steal_tex_src |
| - nir: Add b32fcsel_mdg opcode for Midgard |
| - pan/mdg: Optimize b32csel(inot) in NIR |
| - pan/mdg: Type CSEL with a NIR pass |
| - pan/mdg: Lower isub in common code |
| - pan/mdg: Constant fold after algebraic_late |
| - pan/mdg: Add is_ssa helper |
| - pan/mdg: Fix IR from scheduling conditions |
| - pan/mdg: Fix 2-const CSEL at block beginning |
| - pan/mdg: Fix temp count calculation |
| - pan/mdg: Lower special reads better |
| - pan/mdg: Reset predicate.exclude while scheduling |
| - pan/mdg: Copy-prop even with swizzle restrictions |
| - pan/mdg: Propagate modifiers in the backend |
| - nir: Rename load/store_reg -> load/store_register |
| - nir: Rename nir_reg_{src,dest} -> nir_register_{src,dest} |
| - agx: Add algebraic opt to help with discard lowering |
| - agx: Smarten discard_agx -> sample_mask lowering |
| - asahi: Strip ? in GenXML |
| - asahi: Rename 'Render Target' to 'PBE' |
| - asahi: Identify PBE::sRGB flag |
| - asahi: Remove ; in perf_debug_ctx |
| - agx: Use nir_opt_shrink_stores |
| - agx: Use nir_opt_shrink_vectors |
| - agx: Assert that barriers are not used in the preamble |
| - asahi: Assert we don't transition shared resources |
| - asahi: Fix scissor_culls_everything check |
| - asahi: Use ralloc harder |
| - asahi: Take ownership of compute shader NIR |
| - agx: Don't leak ssa_to_reg_out |
| - asahi: Use txf_ms for MSAA background programs |
| - nir: Fix breaking in nir_foreach_phi(_safe) |
| - vulkan: Add vk_index_type_to_bytes helper |
| - lavapipe: Use vk_index_type_to_bytes |
| - v3dv: Use vk_index_type_to_bytes |
| - rogue: Remove commented convert_from_ssa call |
| - nir: Add intrinsics for register access |
| - nir: Add helpers for walking register uses |
| - nir: Add pass for trivializing register access |
| - nir: Add legacy data structures & helpers |
| - nir: Add new version of lower_regs_to_ssa |
| - nir: Produce intrinsics in lower_{phis,ssa_defs}_to_regs |
| - nir: Add intrinsics version of locals_to_regs |
| - nir: Add lower_vec_to_regs pass |
| - gallium: Return SSA values from TTN ALU helpers |
| - gallium: Convert TTN to register intrinsics |
| - mesa: Simplify ptn_log() a bit |
| - mesa: Return SSA defs from PTN ALU helpers |
| - mesa: Convert PTN to register intrinsics |
| - nir/lower_shader_calls: Convert to register intrinsics |
| - nir: Remove nir_lower_regs_to_ssa |
| - nir: Remove nir_register-based unit tests |
| - gallivm: Switch to reg intrinsics |
| - pan/mdg: Ingest new-style registers |
| - panfrost: Fix transform feedback on v9 |
| - panfrost: Lower vertex_id for XFB |
| - panfrost: Fix transform feedback on v9 harder |
| - nir/trivialize: Handle more RaW hazards |
| - nir/lower_blend: Fix 32-bit logicops |
| - nir/lower_helper_writes: Consider bindless images |
| - nir/passthrough_gs: Fix array size |
| |
| Amber (3): |
| |
| - turnip: fix buffer markers using wrong addresses |
| - ir3, freedreno: implement GL_ARB_shader_draw_parameters |
| - freedreno: implement GL_ARB_indirect_parameters |
| |
| Andres Calderon Jaramillo (1): |
| |
| - r600: Report multi-plane formats as unsupported |
| |
| Andres Gomez (3): |
| |
| - .mailmap: add an alias for Miguel Casas-Sanchez |
| - .mailmap: add an alias for Clayton Craft |
| - .mailmap: add an alias for Christian Gmeiner |
| |
| André Almeida (2): |
| |
| - radv: debug: Update decode ring umr command |
| - radv: Search for guilty contexts at radv_check_status |
| |
| Antonio Gomes (3): |
| |
| - rusticl: Move nir compilation to Program |
| - rusticl: Drop some Kernel data and have a NirKernelBuild ref instead |
| - rusticl: Drop Program::kernel_count |
| |
| Asahi Lina (33): |
| |
| - asahi: Identify ZS resolve bits (tentative) |
| - asahi: Broadcast Z for all components on texture fetch |
| - asahi: Enable 2xMSAA (for deqp) |
| - asahi: Add batch state debugging |
| - asahi: Fix batch writer tracking for null batches |
| - asahi: Clear batch->resolve on agx_batch_init |
| - asahi: Assert that freed BOs have no pending writers |
| - asahi: Fix batch writer_syncobj cleanup |
| - asahi: Implement memory_barrier |
| - asahi: Implement create_fence_fd and fence_server_sync |
| - asahi: Make framebuffer texture barriers a no-op |
| - asahi: Disable tilebuffer write masking optimization |
| - asahi: Add missing stdbool include to lib/hexdump.h |
| - asahi: Fix check for sprite coord mode in agx_bind_rasterizer_state |
| - asahi: Add some more system registers |
| - asahi: Partially identify some missing index list stuff |
| - asahi: Lazily initialize batch state on first draw |
| - asahi: Make bo->writer_syncobj atomic |
| - ail: Implement multisampling for compression meta calculation |
| - asahi: Use ail_can_compress() in agx_compression_allowed() |
| - ail: Add MSAA tests |
| - asahi: Use os_dupfd_cloexec() instead of dup() |
| - asahi: Fix memory leak in agx_nir_lower_sysvals() |
| - asahi: Do not leak meta shader NIR |
| - asahi: Revert "Advertise ARB_texture_barrier" |
| - asahi: Disable PIPE_CAP_SURFACE_SAMPLE_COUNT |
| - asahi: Pass through surface sample count |
| - asahi: match_soa: Treat offsets as signed |
| - asahi: Identify the separate varying count fields |
| - asahi: Gather flat/linear shaded input info from uncompiled FS |
| - asahi: Fix type confusion for fragment shader keys |
| - asahi: Add flat/linear shaded varyings mask to the VS shader key |
| - asahi: Arrange VS varyings in the correct order |
| |
| Axel Davy (17): |
| |
| - frontend/nine: Fix missing clamping of pointsize for ff |
| - frontend/nine: Apply writemask to pointsize |
| - frontend/nine: fix fog key overflow |
| - frontend/nine: fix wfog |
| - frontend/nine: Fix num_textures count |
| - frontend/nine: Drop max_ps_const_f |
| - frontend/nine: Implement alpha test backup support |
| - frontend/nine: Implement backup support for pointsize |
| - frontend/nine: Improve VS_WINDOW_SPACE_POSITION fallback |
| - frontend/nine: Print warning incomplete position_t support |
| - frontend/nine: Enforce legacy pow behaviour |
| - frontend/nine: Get rid of INTERPOLATE_COLOR |
| - frontend/nine: initialize force_color_in_centroid |
| - docs/gallium: Clarify PIPE_CAP_CLIP_PLANES |
| - frontend/nine: Implement backup support for clip planes |
| - frontend/nine: Fix shader cap test for POSITIONT |
| - frontend/nine: Add debug driconf var force_features_emulation |
| |
| Bas Nieuwenhuizen (11): |
| |
| - radv: Reserve space for indirect descriptor set address writes. |
| - radv: Reserve space in the ACE pre/postambles. |
| - radv: Add stricter space checks. |
| - radv: Add asserts in radeon_emit{,_array}. |
| - radv: Move all the dirty flags from TES binding to TCS binding. |
| - amd/drm-shim: Add vangogh entry. |
| - amd/drm-shim: Add raphael&mendocino, polaris12 and gfx1100. |
| - amd/drm-shim: Update docs for more devices. |
| - aco: fix nir_op_vec8/16 with 16-bit elements. |
| - aco: Fix some constant patterns in 16-bit vec4 construction with s_pack. |
| - nir: Fix 16-component nir_replicate. |
| |
| Benjamin Cheng (1): |
| |
| - radv/video: use app provided hevc scaling list order |
| |
| Benjamin Lee (1): |
| |
| - intel: Fix stack overflow in intel_dump_gpu |
| |
| Billy Laws (1): |
| |
| - wgl: Fix depth/stencil image support when using zink kopper |
| |
| Blisto (1): |
| |
| - driconf: set vk_x11_strict_image_count for Wolfenstein II |
| |
| Boris Brezillon (4): |
| |
| - panfrost: Check blend enabled state in pan_allow_forward_pixel_to_kill() |
| - renderonly: Fix potential NULL deref in the error path |
| - renderonly: Make sure we reset scanout on error in create_kms_dumb_buffer_for_resource() |
| - winsys/panfrost: Make sure we reset scanout on error in create_kms_dumb_buffer_for_resource() |
| |
| Boyuan Zhang (2): |
| |
| - frontends/va: add default intra idr period |
| - radeonsi: disable H264HIGH10 profile |
| |
| Brian Paul (5): |
| |
| - llvmpipe: remove lp_setup_alloc_triangle()'s unneeded tri_size param |
| - llvmpipe: code clean-ups in llvmpipe_get_query_result_resource() |
| - lavapipe: clean-ups in lvp_GetQueryPoolResults() |
| - lavapipe: clean-ups in lvp_physical_device_get_format_properties() |
| - lavapipe: asst. clean-ups in lvp_execute.c |
| |
| Caio Oliveira (56): |
| |
| - spirv/tests: Add test for single-block loop |
| - spirv: Output spirv2nir tool result to stdout |
| - spirv: Add --optimize flag to spirv2nir tool |
| - spirv: Rework structured control flow handling |
| - spirv: Do more on spirv2nir --optimize |
| - spirv: Use NIR_PASS for spirv2nir --optimize |
| - spirv: Extract vtn_handle_debug_text() helper |
| - spirv: Fix gl_spirv_validation when OpLine with strings is present |
| - spirv: Improve the 'ID is the wrong kind of value' error messages |
| - mesa/spirv: Provide more specific error message for glSpecializeShader() |
| - spirv: Validate Dim of OpTypeSampledImage and OpSampledImage |
| - spirv: Assert sampler_dim is valid when building nir_tex_instr |
| - nir/print: Print 0 when mem_modes or resource_intel have no values |
| - nir/print: Do not print raw values |
| - spirv: Add workaround for OpImageQueryLevels with Multi-sampled images |
| - compiler/types: Make key in subroutine_name more effective |
| - r600/sfn: Fix warning about overloads hiding virtual functions |
| - spirv: Refactor and rename scope translation helper |
| - spirv: Use vtn_translate_scope for OpReadClockKHR |
| - intel/compiler: Refactor dump_instruction(s) |
| - intel/compiler: Remove unused functions and declarations |
| - compiler/types: Be consistent when naming array element/size |
| - compiler/types: Tidy up the asserts in get_*_instance functions |
| - compiler/types: Use hash table pre-hashed functions for type caching |
| - microsoft/clc: Add unreachable() to fix 'may be unitialized' warning |
| - compiler: Move from nir_scope to mesa_scope |
| - compiler: Add mesa_scope_name() function |
| - nir/print: Use mesa_scope_name() function to print scopes |
| - intel/compiler: Move brw_kernel.c to the intel_clc target |
| - compiler/clc: Rename the internal library from libclc to libmesaclc |
| - compiler/clc: Move related NIR passes to the common mesa clc |
| - compiler: Move spirv into a module of its own |
| - nir/print: Print whether the shader is internal or not |
| - intel/compiler: Respect NIR_DEBUG_PRINT_INTERNAL flag |
| - meson: Explicitly add "check : false" to a couple instances of run_command |
| - vulkan: Add NV suffix to VK_NV_cooperative_matrix feature names |
| - vulkan: Update XML and headers to 1.3.255 |
| - nir: Allow nir_gather_ssa_types() to ignore regs instead of assert |
| - nir/print: Improve NIR_PRINT=print_consts by using nir_gather_ssa_types() |
| - nir/print: Make NIR_DEBUG=print_consts behavior the default |
| - nir: Make a const-friendly way to get the offset_src and arrayed_io_src from intrinsic |
| - nir: Extract logic to get dest and srcs types from intrinsic |
| - nir/print: Use src_type when printing consts in SSA uses |
| - nir/print: Print more representations in load_const |
| - nir/print: Use symbols % for SSA and @ for intrinsic |
| - nir/print: Use \`bN` instead of \`block_N` for identifying basic blocks |
| - nir/print: Use BITSIZExELEMENTS for SSA sizes |
| - nir/print: Align instructions around \`=` |
| - nir/print: Rename print_tabs() to print_indentation() and use it more |
| - nir/print: Don't use comment syntax for deref_cast properties |
| - nir/print: Use \`//` for comments |
| - nir/print: Use 4-space indentation |
| - nir/print: Print div/con annotation first |
| - nir/print: Reformat the preds/succs block information |
| - meson: Ensure that LLVMSPIRVLib is not required for Clover |
| - compiler/types: Use right hash for function types |
| |
| Caleb Cornett (3): |
| |
| - d3d12: Fix Xbox GDK build errors |
| - wgl: Add BITMAPV5HEADER to stw_gdishim.h |
| - d3d12: Fix Xbox frame scheduling for interval != 1 |
| |
| Charmaine Lee (7): |
| |
| - translate: do not clamp element index in generic_run |
| - svga: set PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY for VGPU10 device |
| - mesa/main: fix distance attenuation calculation in ffvertex |
| - svga: fix shader type after ntt |
| - svga: fix compute shader type after ntt |
| - svga: lower images before ntt |
| - svga: set clear_texture to NULL for vgpu9 |
| |
| Chia-I Wu (24): |
| |
| - drm-shim: apply file overrides for open |
| - amd/drm-shim: add amdgpu drm-shim |
| - hasvk: Refactor Android externalFormat handling in CreateYcbcrConversion |
| - hasvk/android: Use VkFormat for externalFormat |
| - hasvk: Use the common vk_ycbcr_conversion object |
| - vulkan: make sure vk_image::format is never UNDEFINED |
| - vulkan: make sure vk_image_view::format is never UNDEFINED |
| - vulkan: rename vk_image::ahardware_buffer_format |
| - vulkan: define inline stubs when android api level < 26 |
| - vulkan: add vk_ahb_format_to_image_format |
| - anv,hasvk,radv: do not fall back to AHARDWAREBUFFER_FORMAT_BLOB |
| - vulkan: add vk_image_format_to_ahb_format |
| - anv,hasvk: android ahb is not always exportable |
| - radv: improve externalMemoryFeatures for android ahb |
| - amd/drm-shim: add raven2 |
| - ac/surface: print tile_swizzle as well |
| - radv: do not use a pipe offset for aliased images |
| - aco: fix alignment check in emit_load |
| - ac, radeonsi: add and use ac_get_ps_iter_mask |
| - radv: fix gl_SampleMaskIn for sample shading |
| - radv: fix msaa feedback loop without tc-compat cmask |
| - radv: fix non-square compressed image copy on gfx9 |
| - radv: disable calibrated timestamps on raven/raven2 |
| - ac/surface: limit RADEON_SURF_NO_TEXTURE to color surfaces |
| |
| Christian Gmeiner (31): |
| |
| - etnaviv: Add util_blitter_save_so_targets(..) call |
| - etnaviv: nir: improve uniform usage for ALU opc |
| - etnaviv: correct number of instructions in dump_shader_info(..) |
| - etnaviv: move printing of final shader out of etna_link_shaders(..) |
| - etnaviv: nir: do not call nir_lower_idiv(..) unconditionally |
| - etnaviv: make wider use of DBG_ENABLED(..) |
| - ci: add debian-arm32-asan |
| - ci/etnaviv: add asan run |
| - etnaviv: Add support for conditional rendering |
| - etnaviv: add support for performance warnings |
| - mesa/arbprog: fix compile errors |
| - etnaviv: remove tgsi remains |
| - etnaviv: drop usage of tgsi_swizzle_names |
| - etnaviv: remove not used tgsi includes |
| - ci/etnaviv: update ci expectation |
| - ir3/analyze_ubo_ranges: Move IR3_DBG_NOUBOOPT check |
| - etnaviv: nir: call nir_remove_dead_variables(..) before linking setup |
| - etnaviv: linker: add fallback lookup to VARYING_SLOT_BFC[n] |
| - nir: add helper to clear all pass_flags |
| - nir/lower_amul: make use nir_shader_clear_pass_flags(..) |
| - etnaviv: make use nir_shader_clear_pass_flags(..) |
| - etnaviv: nir: do a late nir_opt_cse run |
| - docs: mark OES_texture_half_float done on etnaviv |
| - etnaviv: support OES_texture_half_float_linear |
| - ci/etnaviv: update ci expectation |
| - docs: update etnaviv extensions |
| - etnaviv: linker: handle scenario where there are FS inputs without matching VS output |
| - etnaviv: linker: clean up etna_link_shader(..) |
| - nir: rename intrinsic to have a more generic nameing |
| - nir: rename has_txs to has_texture_scaling |
| - nir/lower_tex: optimize offset lowering for has_texture_scaling |
| |
| Christopher Snowhill (2): |
| |
| - Corrects log print to produce hexadecimal base output |
| - intel: Sync xe_drm.h |
| |
| Collabora's Gfx CI Team (4): |
| |
| - Uprev Piglit to 79a084c56b6dd79f7c3a97b57a72963121ebb1e6 |
| - Uprev Piglit to 536975d94a40cf76a69fcfa786c2513eccd0c989 https://gitlab.freedesktop.org/mesa/piglit/-/compare/79a084c56b6dd79f7c3a97b57a72963121ebb1e6...536975d94a40cf76a69fcfa786c2513eccd0c989 |
| - Uprev Piglit to d8c08d123fadb986e9a8a7887b922ff63fcff52e https://gitlab.freedesktop.org/mesa/piglit/-/compare/536975d94a40cf76a69fcfa786c2513eccd0c989...d8c08d123fadb986e9a8a7887b922ff63fcff52e |
| - Uprev Piglit to 5036601c43fff63f7be5cd8ad7b319a5c1f6652c |
| |
| Connor Abbott (42): |
| |
| - tu: Don't override depth for GMEM |
| - tu: Don't pre-shift depth and stencil pitch |
| - freedreno/fdl: Don't pre-shift image view pitch |
| - freedreno/fdl: Expose view offset |
| - tu: Add 3D GMEM load path |
| - tu: Use dirty bit for scissor state |
| - tu: Precompute maximum views across all subpasses |
| - tu: Merge RB_DEPTH_CNTL and RB_STENCIL_CONTROL drawstates |
| - tu: Make dynamic viewport and scissor count more accurate |
| - freedreno/a6xx: Document per-view viewport in GRAS_SU_CNTL |
| - tu: Parse fragment density map attachment info |
| - tu: Implement sampling the fragment density map |
| - tu/cs: Add support for CS patching |
| - tu: Add core FDM patchpoint infrastructure |
| - ir3: Record whether a shader writes gl_ViewportIndex |
| - tu: Implement FDM viewport patching |
| - tu: Implement FDM scaled loads/stores |
| - nir, ir3: Add option to use unscaled FragCoord for input attachments |
| - tu, ir3: Handle FDM shader builtins |
| - tu/autotune: Always prefer GMEM with fragment density maps |
| - tu: Don't allow importing/exporting subsampled images with modifiers |
| - tu: Expose VK_EXT_fragment_density_map |
| - util/bitset: Add some extra functions |
| - vk/graphics_state: Remove vk_subpass_info |
| - vk/graphics_state: Add feedback_loop_input_only |
| - vk/graphics_state: Add VI_BINDINGS_VALID state |
| - vk/graphics_state: Fix some assertions when copying state |
| - vk/graphics_state: Add helpers for pre-baking state |
| - radv: Fix radv_pipeline_is_blend_enabled |
| - vk/graphics_state: Track attachment count as state |
| - vulkan: Fix renderpass flags with driver-specific renderpass |
| - vk/graphics_state: Don't track each vertex input field |
| - tu: Don't use A6XX_PC_PRIMITIVE_CNTL_0::TESS_UPPER_LEFT_DOMAIN_ORIGIN |
| - freedreno/a6xx: Fix name of A6XX_PC_PRIMITIVE_CNTL_0::TESS_UPPER_LEFT_DOMAIN_ORIGIN |
| - tu: Split pipeline struct into different types |
| - tu: Rewrite to use common Vulkan dynamic state |
| - tu: Use common dirty tracking for PC_PRIMITIVE_CNTL_0 |
| - freedreno/regs: Document a7xx CP_FIXED_STRIDE_DRAW_TABLE |
| - tu: Fix vk2tu_*_stage flag type |
| - vk/graphics_state: Fix copying MS locations pipeline state |
| - tu: Fix per-view viewport state propagation |
| - tu: Fix assert in FDM state emission |
| |
| Constantine Shablia (3): |
| |
| - anv: move get_features after get_device_extensions (ugly diff) |
| - panvk: use common vkGetPhysicalDeviceFeatures2 |
| - v3dv: use common vkGetPhysicalDeviceFeatures2 |
| |
| Constantine Shablya (7): |
| |
| - vulkan: add common implementation of vkGetPhysicalDeviceFeatures2 |
| - vulkan: introduce supported_features parameter to vk_physical_device_init |
| - anv: switch to using the common vkGetPhysicalDeviceFeatures2 |
| - vulkan: inline vk_get_physical_device_features into vk_common_GetPhysicalDeviceFeatures2 |
| - vulkan: put interesting code before boring code |
| - vulkan: put TEMPLATE_H before TEMPLATE_C |
| - vulkan: rename vk_physical_device_features.py to vk_physical_device_features_gen.py |
| |
| Corentin Noël (18): |
| |
| - ci: Uprev crosvm and virglrenderer |
| - nir: Propagate the type sampler type change to the used variable. |
| - build-crosvm: Use the pkg-config crate 0.3.27 |
| - util: Use the gcc_struct attribute for packed structures in mingw |
| - ci: Bump base tag to rebuild piglit |
| - ci: uprev virglrenderer and crosvm |
| - gallium: Incorporate the device release in dri_destroy_screen_helper |
| - gallium: Rename dri_destroy_screen_helper into dri_release_screen |
| - pipe-loader: Document the behavior regarding screen creating failures |
| - pipe-loader: Do not destroy the winsys on screen creation failure |
| - gallium: Only call dri_init_options when the screen is actually created |
| - gallium: Use the common destroy function on screen initialization failure |
| - gallium: Rename dri_init_screen_helper into dri_init_screen |
| - compiler: Allow the explicit_stride of aoa types to be zero |
| - nir/split_64bit_vec3_and_vec4: Use the right number of components |
| - ci: Uprev virglrenderer |
| - ci: Add locked flag to bindgen-cli installation |
| - virgl: Do not expose EXT_texture_mirror_clamp when using a GLES host |
| |
| Daniel Schürmann (60): |
| |
| - radv/rt: fix total stack size computation |
| - radv/rt: properly destroy radv_ray_tracing_lib_pipeline on error |
| - radv/rt: rename radv_ray_tracing_module -> radv_ray_tracing_group |
| - radv/rt: add shader stage indices to radv_ray_tracing_group |
| - radv/rt: replace uses of pGroups with radv_ray_tracing_group |
| - radv/rt: remove merged VkRayTracingShaderGroupCreateInfoKHR |
| - vulkan/pipeline_cache: replace raw data objects on cache insertion of real objects |
| - vulkan/pipeline_cache: use vk_pipeline_cache_insert_object() to replace raw data objects |
| - radv: add padding to radv_shader_binary_legacy |
| - vulkan/pipeline_cache: expose vk_raw_data_cache_object |
| - radv/pipeline_cache: add NIR caching capabilities |
| - radv/rt: expose radv_parse_rt_stage() |
| - radv/rt: introduce struct radv_ray_tracing_stage |
| - radv/rt: retain parsed NIR shaders in radv_ray_tracing_lib_pipeline |
| - radv/rt: use precompiled stages to create RT shader |
| - radv/rt: refactor compute_rt_stack_size() to use radv_ray_tracing_stage information |
| - radv/rt: remove merged VkPipelineShaderStageCreateInfo |
| - radv/rt: Fix and improve VkPipelineCreationFeedback |
| - radv/rt: change base of radv_ray_tracing_lib_pipeline to radv_compute_pipeline |
| - radv/rt: unify radv_ray_tracing_lib_pipeline and radv_ray_tracing_pipeline |
| - radv/rt: unify radv_rt_pipeline_create() and radv_rt_pipeline_library_create() |
| - radv/rt: refactor radv_rt_pipeline_compile() |
| - radv/rt: use vk_multialloc for radv_ray_tracing_pipeline |
| - radv/rt: store stack_sizes per stage instead of per group |
| - vulkan/pipeline_cache: don't log warnings for internal caches |
| - vulkan/pipeline_cache: don't log warnings for client-invisible caches |
| - radv: add remaining RT shader args for separate compilation |
| - nir,amd: add nir_intrinsic_store_[scalar|vector]_arg_amd to overwrite inputs |
| - nir: add nir_intrinsic_resume_shader_address_amd |
| - aco: implement nir_intrinsic_load_resume_shader_address_amd |
| - aco: implement select_program_rt() |
| - radv/rt: adjust shared_size when lowering hit_attribs |
| - radv/rt: extend radv_pipeline_group_handle with shader VAs |
| - radv/shader_info: add RT stages to radv_get_user_data_0() |
| - radv/rt: implement radv_nir_lower_rt_abi to lower RT shaders for separate compilation |
| - radv/rt: implement radv_rt_nir_to_asm() |
| - radv/rt: change RT main shader to MESA_SHADER_INTERSECTION |
| - radv/rt: replace pCreateInfo with VkPipelineCreateFlags in rt_variables |
| - radv/rt: pass radv_ray_tracing_pipeline to RT shader creation |
| - radv/rt: add and use specialized cache search/insert functions |
| - radv/rt: reference library shaders during radv_rt_fill_stage_info() |
| - radv/rt: don't write cache hit feedback per stage. |
| - radv/rt: create compile_rt_prolog() function |
| - radv/rt: set up RT shader args for separate compilation |
| - radv/rt: adjust lower_rt_instructions() for shader functions [disables RT] |
| - aco: adjust RT prolog for shader functions [disables RT] |
| - radv/rt: separate shader compilation |
| - radv/debug: dump ray tracing shaders in case of a hang |
| - radv/rt: use priorities to select the next shader |
| - radv/rt: remove now dead code |
| - radv: reference pipeline cache object in radv_pipeline |
| - aco/assembler: align resume shaders with cache lines |
| - aco/assembler: align loops if it reduces the number of cache lines |
| - aco/assembler: change prefetch mode on GFX10.3+ during loops if beneficial |
| - vulkan/pipeline_cache: add 'skip_disk_cache' option |
| - radv/meta: disable disk cache for meta shaders |
| - radv: migrate radv_shader hash to BLAKE3 |
| - amd: move end-of-code marker padding to ACO. |
| - amd: Do shader binary alignment for prefetch at memory allocation time. |
| - aco/insert_exec_mask: set Exact mode after p_discard_if when necessary |
| |
| Daniel Stone (10): |
| |
| - wsi/wayland: Support VK_KHR_present_wait |
| - ci/zink: Disable Freedoom trace on ANV |
| - ci: Respect $HTTP_PROXY for ci_run_n_monitor |
| - ci: Elaborate causes for job retries |
| - ci: Don't retry manual or scheduled jobs |
| - ci: Extend a618_vk_full runtime |
| - CI: Re-enable freedreno CI |
| - ci/fdno: Pause a660 testing |
| - Revert "ci/fdno: Pause a660 testing" |
| - egl/wayland: Always initialise fd_display_gpu |
| |
| Danylo Piliaiev (42): |
| |
| - freedreno: Early exit in device matching if id doesn't have chip_id |
| - ir3/a7xx: NOPs may have some no-op bits set |
| - ir3/a7xx: Add new lock/unlock CS instructions |
| - ir3/a7xx: Add new form of stg.a/ldg.a addressing |
| - ir3/a7xx: Add STSC definition |
| - ir3: Document that stc has higher DST upper bound than we defined |
| - ir3/a7xx: Document "alias" instruction |
| - ir3: documents (ss) flag for cat7 instructions |
| - tu: Create drm fd per logical device |
| - tu: Move VMA heap to the logical device |
| - tu: Re-enable bufferDeviceAddressCaptureReplay |
| - freedreno/perfcntrs: Link with libfreedreno_common |
| - freedreno: Decouple GPU gen from gpu_id/chip_id |
| - freedreno,ir3: Don't call fd_dev_64b more than necessary |
| - freedreno/decode: Correctly handle chip_id |
| - tu: Add missing dbg reg stomping to tu_CmdBeginRendering |
| - tu: Fix zombie VMAs array not initialized when first BOs may be freed |
| - freedreno/regs: Print xml validation error if validation fails |
| - freedreno/rnn: Fix addvariant being set effectively once |
| - freedreno/rnn: Make addvariant work for fields in the same reg |
| - freedreno/rnn: Take into account array's variant for regs |
| - freedreno/regs: Change a7xx regs to have open range for generation |
| - freedreno/regs: More CP commands are the same on a7xx as on a6xx |
| - freedreno/regs: Document CP_MEM_TO_SCRATCH_MEM |
| - freedreno/regs: Document a7xx CP_MODIFY_TIMESTAMP |
| - freedreno/regs: Clarify polling on a7xx for CP_WAIT_REG_MEM/CP_COND_WRITE5 |
| - freedreno/regs: Add a7xx pseudo-regs to CP_SET_PSEUDO_REG |
| - freedreno/regs: a7xx has a new source type CP_REG_TEST |
| - freedreno/regs: Add 2 new a7xx modes to CP_COND_REG_EXEC |
| - freedreno/regs: Add some new a7xx events |
| - freedreno/regs: Add more a7xx regs and reg fields |
| - freedreno/regs: Fix a7xx SP_FS_PREFETCH definition |
| - freedreno/regs: Generate per-gen reg usage tables |
| - freedreno/regs: Define usage for all a6xx/a7xx regs |
| - tu: Allow reg stomping of compute related registers |
| - tu: Use reg usage tables for stale reg dbg option |
| - freedreno/regs: Properly document a7xx CP_EVENT_WRITE, CP_WAIT_TIMESTAMP |
| - freedreno/regs: Document a7xx CP_BV_BR_COUNT_OPS |
| - freedreno/regs: Rename SP_FS_CTRL_REG0.DIFF_FINE into LODPIXMASK |
| - ir3: Fix FS quad ops returning wrong values from helper invocations |
| - tu,freedreno: Forbid blit event for R8G8_SRGB due to gpu faults |
| - radv: fix unused non-xfb shader outputs not being removed |
| |
| Dave Airlie (134): |
| |
| - radeonsi/ac: move some vcn defines to common |
| - radv/video: add missing gfx family |
| - radv: set a video decode ip block in physical device. |
| - radv/winsys: handle encoder queue padding/submits. |
| - radv/video: add a video addr gfx mode |
| - radv/video: fix dpb surface programming |
| - radv/video: start adding gfx11 vcn decoder |
| - lp_jit: use pipe max for the lp_jit texture levels. |
| - gallivm: consolidate draw/lp texture type. |
| - gallivm: consolidate llvmpipe/draw sampler types. |
| - gallium: consolidate jit image types between draw/llvmpipe |
| - gallivm: reorder some texture/image members. |
| - vulkan/cmd_queue: handle beta extensions. |
| - vulkan: write beta extensions into generator scripts. |
| - draw: align common members in jit context structs. |
| - llvmpipe: refactor fs/cs jit structure members. |
| - gallivm: refactor common resources out of contexts |
| - gallivm/draw/llvmpipe: consolidate the sampler/image dynamic state fns |
| - gallivm: add common code for sample/image tracking. |
| - llvmpipe: move to common sampler/image binding code |
| - draw: move to use common sampler/image binding code |
| - llvmpipe/cs: refactor cs generator args to use an enum |
| - gallivm/draw: refactor vertex header jit type out |
| - llvmpipe: convert a bunch of shader_type ifs to switches. |
| - llvmpipe/cs: start making variant generator less compute specific |
| - llvmpipe/cs: support passing a csctx instead of using implicit one |
| - lavapipe: add lavapipe specific shader stages define. |
| - lvp: explictly skip compute shader stage. |
| - gallivm: fix whitespace in get_deref_offset |
| - gallivm/nir: refactor the local invocation index calc. |
| - lvp: use stage mask |
| - lvp: use stage iterator macros instead of explicit loops |
| - ci: reenable lavapipe |
| - radv/video: add missing space checks for video. |
| - radv/video: use correct h264 levels |
| - radv/video: fix h264/265 dpb usage. |
| - radv/video: add missing offset to the dpb binding. |
| - radv/video: rework stream handle generation. |
| - radv/video: fix some whitespace. |
| - radv/video: add debug flag to enable dpb image array on newer GPUs. |
| - radv/video: fix physical device format property count. |
| - vk/video: add a common function to get block alignments for profiles |
| - radv: align video images internal width/height inside the driver. |
| - anv/video: move format properties to outarray. |
| - radv/meta: fix uninitialised stack memory usage. |
| - gallium: add task/mesh shader query types to stats interface. |
| - gallium: expand pipe_grid_info to handle task/mesh. |
| - gallium: add a new PIPE_SHADER_MESH_TYPES |
| - freedreno: don't report task/mesh. |
| - gallium: add task/mesh shader entrypoints in context |
| - iris: don't return shader params for task/mesh. |
| - crocus: don't report mesh/task limits |
| - radeonsi: don't report shader params for task/mesh |
| - svga: don't report mesh/task shader limits |
| - d3d12: don't report mesh/task limits |
| - gallium/cso: add task/mesh shaders to the cso cache |
| - gallium/nir/tgsi: add various support for task/mesh bits |
| - lavapipe: when in doubt, swizzle the swizzle |
| - lavapipe: fix pipeline sanitizing. |
| - lavapipe: fix indentation whitespace |
| - draw: add mesh shader infrastructure |
| - draw: move draw_vertex_info and draw_prim_info to public header. |
| - draw: add a mesh primitive assembler. |
| - draw: add mesh pipeline middle end. |
| - draw: add support for per primitive aos emission |
| - gallivm: add support for payload access |
| - gallivm/nir: add launch mesh workgroups |
| - gallivm/nir: add a mesh interface and vert/prim count setting. |
| - gallivm/nir: call task shader lowering. |
| - gallivm/nir: add support for mesh shader outputs. |
| - llvmpipe: resize arrays to handle mesh shaders. |
| - llvmpipe: start adding task/mesh support. |
| - llvmpipe: bump dirty tracker to 64-bits. |
| - llvmpipe: add dirty bits for mesh and task shaders. |
| - llvmpipe: add debug bit for mesh shaders |
| - llvmpipe: add query support for task/mesh shaders |
| - llvmpipe: bind task/mesh resources and dirty bits |
| - gallivm/cs: add payload ptr to the cs thread data. |
| - llvmpipe/cs: add task/mesh shader support to compute shader builder. |
| - llvmpipe/cs: add multiple stride indirect to fill_grid_info. |
| - llvmpipe: add mesh shader drawing. |
| - llvmpipe: enable task/mesh shader support. |
| - lavapipe: handle some mesh shader stage differences. |
| - lavapipe: add mesh query support |
| - lavapipe: add support for task/mesh shader stages in various places |
| - lavapipe: add execution backends for mesh shader draw apis |
| - lavapipe: enable task/mesh shaders. |
| - docs: update docs for lavapipe mesh shading |
| - llvmpipe: emit fences for barrier. |
| - lavapipe: don't remove queue family barriers. |
| - gallivm/nir: fix shuffleup tests. |
| - draw: rename jit to vs_jit in lots of places. |
| - draw/tess: drop unused tgsi bits. |
| - gallium/tgsi/draw/softpipe: remodel shader const/buffer bindings. |
| - draw: refactor resources to use arrays instead of explicit structs. |
| - draw: add a max stage define and use it in a few places |
| - draw: repack some members of context. |
| - radv/video: convert video format properties to an outarray |
| - radv/video: convert session memory requirements to outarray. |
| - radv/video: don't supply an 8-bit format for a 10-bit dpb. |
| - radv/video: rework h265 reference frame bindings. |
| - radv/video: fix hevc st rps programming |
| - radv/video: fix hevc scaling lists. |
| - lavapipe: ignore another yuv format. |
| - radv/video: report bad profile operation if h264 profile isn't supported. |
| - radv/video: fix hevc scaling list order. |
| - radv/video: program frame number correctly. |
| - radv/video: program hevc max dec pic buffering correctly |
| - radv/video: restrict the number of IBs on video related queues. |
| - ac/radeonsi: add av1 defaults header file from radeonsi |
| - radv/video: drop incorrect defines for uapi ones. |
| - lavapipe: check sampler pointer before deref |
| - draw/gs: handle extra shader outputs in geometry. |
| - lavapipe: expose subgroups in mesh/task shaders. |
| - gallivm: store thread id in separate values. |
| - gallivm: convert block_id to discrete values. |
| - gallivm: convert grid_size to discrete values. |
| - gallivm: make block_size use discrete values. |
| - clc: llvm 17 requires opaque pointers. |
| - gallium/va: fix superres av1 decoding. |
| - llvmpipe/linear: don't allow linear path for shader output with location frac |
| - llvmpipe/linear: refactor linear samplers into templated code. |
| - llvmpipe/linear/tgsi: calculate num_texs properly for nir. |
| - llvmpipe/linear: add sample routines for swapping r/b channels |
| - llvmpipe/linear: add support for sampling when cbuf order is different. |
| - llvmpipe/linear: add support for rgba color buffers. |
| - ci: update fails for fixed tests due to llvmpipe linear changes. |
| - gallivm: fix atomic global temporary storage. |
| - llvmpipe: fix fragdata/lastfragdata heuristic a bit more. |
| - zink: turn off threaded cpu access if not visible. |
| - llvmpipe: enable f16 paths on aarch64. |
| - radv: don't emit event code on video queues. |
| - spirv: use a pointer sized int type for opencl event_t |
| - radv/video: take db alignment into account when allocating images. |
| |
| David (Ming Qiang) Wu (1): |
| |
| - radeonsi/vcn: add an exception of field case for h264 decoding |
| |
| David Heidelberg (129): |
| |
| - ci/amd: 4/5 runners TPad-C13 runners are online, restore most of the tests |
| - ci/dxvk: uprev to 2.1 |
| - ci/amd: update checksums after DXVK 2.1 update |
| - ci: bump kernel to the 6.3, support HDK 888 based on sm8350 |
| - ci/freedreno: do not restrict to 2 cpus on a530 |
| - ci: drop overriding new a530 firmware due to preemption issues with older kernel |
| - ci/freedreno: a530 behaves stable in 6.3 |
| - ci/freedreno: update a530 flakes, fails and skips |
| - ci/freedreno: fix the a530_piglit job and switch to Weston |
| - ci: polish deqp-runner a bit |
| - ci: uninstall libdrm from the GL and VK containers |
| - ci: do not retry on forks to get the upstream kernel and rootfs |
| - ci/mold: bump to 1.11.0 |
| - ci: add Adreno 660 on sm8350 chipset (HDK 888) |
| - ci/lava: implement fastboot support |
| - ci/lava: add support for HDK 888 firmware |
| - ci: add a660 firmware into rootfs |
| - pvr: drop unused variable |
| - ci/dzn: add flaking test |
| - ci/skqp: replace license with SPDX and extract the used branch |
| - ci/skqp: update to the Android CTS 12.1_r5 version |
| - mesa/main: drop unused variable |
| - nir/lower_io_to_vector: initialize base |
| - panvk: clear dangling pointers |
| - ci: uprev kernel to 6.3.1 with fixed patch for Adreno SMMU |
| - util/tests: adjust for new gtest |
| - gtest: Update to 1.13.0 |
| - ci/skqp: handle all warnings printed with clang >= 14 |
| - panvk: drop path from panvk_physical_device struct |
| - venus: drop unused sem_feedback_count from vn_queue_bind_sparse_submit_batch |
| - ci/broadcom: skip timeouting ssbo.layout.3_level_array.std430.mat4 on RPi4 |
| - ci/venus: add recent flakes |
| - ci/freedreno: add recent a630 flake |
| - ci/v3d: add flaking opengl 1.1@depthstencil-default_fb-drawpixels-float-and-ushort |
| - ci/amd: re-enable VA-API testing |
| - ci/rules: radeonsi VAAPI rules should include also VA-API targets |
| - ci: update libva to 2.18.1 |
| - ci/gtest: improve the runner script |
| - ci/amd: update VA-API expectations |
| - ci/amd: add radeonsi-raven-va-full job to cover all VA-API tests |
| - ci/gtest-runner: fix results reporting |
| - ci/venus: add missing flakes |
| - ci/crosvm: update cmdline options |
| - docs: update crosvm networking options |
| - ci/radv: add another raven flake dEQP-VK.draw.dynamic_rendering.primary_cmd_buff.linear_interpolation |
| - ci/v3dv: add often timeouting ssbo.layout.3_level_array.std140.column_major_mat4 |
| - r300: workaround GCC 12+ warning, declare NULL value as unreachable |
| - docs: use meson instead invoking ninja directly |
| - ci/freedreno: disable 3 jobs to match our farm 3 devices down |
| - ci/freedreno: rename piglit job to respresent the real testing it does |
| - ci: move from pkg-config to pkgconf |
| - ci: use meson setup and meson install instead of meson and invoking ninja directly |
| - ci: bump libdrm from 2.4.110 to 2.4.114 present in Debian 12 |
| - ci: install stock android-libext4-utils (available in 12, bookworm) |
| - ci: bump gfxreconstruct revision up to compatible version with Debian 12 |
| - ci: libwayland from 1.18 to 1.21 and wayland protocols from 1.24 to 1.31 |
| - ci: VVL uprev (temporary until new release will be published) |
| - ci: bump from Debian 11 (bullseye) to 12 (bookworm) |
| - ci/apitrace: install win64 apitrace only on x86_64 |
| - ci/crosvm: install libelogind0 and sysvinit-core for poweroff functionality |
| - ci: add clang-15 and clang++-15 wrapper script |
| - ci/skqp: skqp can't live with compiler named clang-15, provide symlink |
| - ci: drop gallium-aux test on msan builds, renable freedreno |
| - ci/mingw: disable as it's broken |
| - ci/venus: add fail after CI uprev to the Debian 12 |
| - ci/virpipe: add flakes introduced with CI uprev to Debian 12 |
| - ci/zink: disable flaking anv traces |
| - ci: enable shellcheck on whole .gitlab-ci |
| - ci: disable bogus GCC warning with -Warray-bounds |
| - ci: do not fail when SHA1 impl. produce stringop-overreads warning |
| - ci/lavapipe: document subgroups.shuffle.compute.subgroupshuffleup_double_constant crash |
| - ci/lavapipe: zink failures |
| - ci/llvmpipe: document intel_shader_atomic_float_minmax@execution@ssbo-atomic* |
| - bin/ci: mention requirements.txt |
| - gitlab: add template for merge requests |
| - ci/zink: add KHR-GL46.limits.max_fragment_interpolation_offset flake |
| - ci/amd: previously missed raven flake |
| - ci/panfrost: add largest possible eglcreatepbuffersurface and then glclear flake |
| - gitlab: prefill MR template with first multiline commit message |
| - ci: bump Alpine to 3.18 |
| - ci/ccache: recent ccache changed a output a bit, adapt script |
| - ci: rename x86 and amd64 to x86_64, armhf to arm32, and i386 to x86_32 |
| - ci: use bash arrays in Fedora script + shebang change |
| - ci/fedora: re-enable ccache |
| - traces: update sir-f720 trace expectations for zink on anv and freedreno |
| - ci: missed variable inside the big rename and split ARCH and DEBIAN_ARCH |
| - ci: fix KVM module modprobe code |
| - ci: explicitely state BUILDTYPE |
| - ci: rename S3 artifacts according to scheme mesa-$arch-$config-$buildtype |
| - ci: rename MINIO to S3 |
| - ci: rename MINIO_HOST variable to S3_HOST |
| - ci: replace MINIO_RESULTS_UPLOAD with S3_RESULTS_UPLOAD |
| - ci: remove BUILD_PATH, always use S3_ARTIFACT_NAME |
| - ci/lava: rename rest local MINIO\_ variables to S3\_ |
| - ci/android: remove the artifact file just as we unpack it |
| - ci: valve and freedreno farm is down |
| - ci/windows: move microsoft farm rules |
| - ci/etnaviv: if farm is down, we expect no manual jobs can be triggered |
| - ci/amd: hide vaapi job dependent on Collabora farm when it's down |
| - ci/crocus: depend on state of the Anholt farm |
| - ci: implement farms handling trough files inside .ci-farms |
| - ci/docs: fixup incorrect spacing around console block |
| - ci/panfrost: switch panfrost-g52-piglit-gles2 from X to XWayland |
| - ci/fastboot: use gzipped Image to avoid compressing on the runner |
| - ci/microsoft: uploading artifacts gets stuck currently (retried) |
| - ci/microsoft: rename manual rules according to rest introduced rules |
| - ci: create manual farm rules |
| - ci/traces: guard DXVK and VK behind VK_DRIVER |
| - ci/apitrace: include version with LTO enabled |
| - ci/traces: print version of apps used for replaying traces |
| - ci: when touching farms, never run manual jobs |
| - ci/microsoft: partly revert rename from container-rules to manual-rules |
| - ci/x86: Build ANGLE for testing layering on VK drivers. |
| - ci/amd: switch all possible jobs from X11 to Wayland |
| - ci/freedreno: switch a630_{piglit,skqp} and a618_gl to Weston |
| - ci/freedreno: re-enable a530 as it's now stable with multiple skips |
| - ci/freedreno: document number of a630 devices available |
| - ci/freedreno: add KHR-GL46.buffer_storage flakes |
| - ci/freedreno: add execution@varying-struct-copy-return-vs flake |
| - ci/container: add weston into Vulkan container |
| - ci/container: we need to keep the wine inside |
| - ci/traces: switch from xvfb to Weston XWayland |
| - ci/freedreno: another batch of a530 flakes |
| - ci: add quirk for GitLab assuming changes is always true for scheduled runs |
| - ci/microsoft: when re-enabling Windows Farm, always run the container |
| - ci: disable Material Testers.x86_64_2020.04.08_13.38_frame799.rdc trace |
| - ci/amd: fix timeouting radeonsi-raven-va-full job |
| - ci: add perfetto into mesa git-cache |
| - ci/deqp: really remove the uncompressed results.csv file |
| |
| David Redondo (1): |
| |
| - egl/wayland: fix oob buffer access during buffer_fds clean up |
| |
| David Rosca (7): |
| |
| - radeonsi: Use DIV_ROUND_UP instead of ALIGN_POT |
| - frontends/va: Init view_resources array in vlVaPut/GetImage |
| - frontends/va: Ignore requested size when creating VAEncCodedBufferType |
| - Revert "radeonsi/vcn: add an exception of field case for h264 decoding" |
| - frontends/va: Flush after unmapping VAImageBufferType |
| - frontends/va: Process VAEncSequenceParameterBufferType first in vaRenderPicture |
| - frontends/va: Set default rate control values once when creating encoder |
| |
| Derek Foreman (1): |
| |
| - vulkan/wsi: Allow binding presentation_timing when software rendering |
| |
| Diederik de Haas (1): |
| |
| - treewide: spelling fixes |
| |
| Dmitry Baryshkov (3): |
| |
| - freedreno/registers: updte HDMI registers to include CEC details |
| - freedreno/registers: add bitfield for DSI wide bus enablement |
| - tu: Pass real size of prime buffers to allocator |
| |
| Dmitry Osipenko (4): |
| |
| - iris/bufmgr: Use intel_ioctl() helper for GEM_SET_TILING |
| - intel/dev: Use intel_ioctl() helper for GEM_SET_TILING |
| - anv: Use intel_ioctl() helper for GEM_SET_TILING |
| - hasvk: Use intel_ioctl() helper for GEM_SET_TILING |
| |
| Dmitry Rogozhkin (1): |
| |
| - meson/vaon12: fix driver file name for mingw build |
| |
| Donald Robson (2): |
| |
| - pvr: Move heap initialisation out of pvr_winsys_helper. |
| - pvr: Rename rogue_fw.xml -> rogue_kmd_stream.xml. |
| |
| Dor Askayo (3): |
| |
| - meson: add feature option for use of system Clang headers at runtime |
| - ci: Disable "opencl-external-clang-headers" when "microsoft-clc" is enabled |
| - nouveau: add exported GEM handles to the global list |
| |
| Dr. David Alan Gilbert (4): |
| |
| - rusticl/screen: Wrap get_timestamp |
| - rusticl/device: Stash timestamp availability |
| - rusticl/api: Implement get_{device_and\_}host_timer |
| - rusticl/api: Wire up CL_DEVICE_PROFILING_TIMER_RESOLUTION |
| |
| Dylan Baker (57): |
| |
| - docs: add release notes for 23.0.1 |
| - docs: Add sha256 sum for 23.0.1 |
| - docs: add release notes for 23.0.2 |
| - docs: Add sha256 sum for 23.0.2 |
| - docs: add release notes for 23.0.3 |
| - docs: Add sha256 sum for 23.0.3 |
| - docs: update calendar for 23.0.1 |
| - docs: update calendar for 23.0.2 |
| - docs: update calendar for 23.0.3 |
| - docs: add release notes for 23.0.4 |
| - docs: Add sha256 sum for 23.0.4 |
| - docs: update calendar for 23.0.4 |
| - intel/tools/error2aub: Fix potential out of bounds read |
| - meson: Key whether to build batch decoder on expat |
| - bin/pick: fix issue where None for nomination_type could fail |
| - bin/pick: use lineboxes to make the UI clearer |
| - bin/pick: Add support for adding notes on patches |
| - bin/pick-ui: use asyncio.new_event_loop |
| - meson: Add back execmem option as a deprecated option |
| - VERSION: update to 23.2.0-rc1 |
| - docs: Update release calendar for 23.2.0-rc1 |
| - .pick_status.json: Update to 6e87b277bde71e30c98ab9dda7bd2f2017b77ed5 |
| - .pick_status.json: Update to 27d30fe3c0e71efd90fcfe209d8515b195b0075f |
| - .pick_status.json: Update to 3a8aae9e6aa526367523c58dfe5046909776be74 |
| - .pick_status.json: Update to 59087003c4b7a4f5a6bf207f214a4c3443b9759f |
| - ci: mark passing zink and lima tests as expected |
| - docs: truncate new_features.txt |
| - docs: add release notes for 23.2.0 |
| - VERSION: update to 23.2.0 |
| - docs: Update release calendar for 23.2.0 |
| - docs: Add sha256 sum for 23.2.0 |
| - Revert incorrect 23.2.0 release |
| - VERSION: update to 23.2.0-rc2 |
| - docs: Update release calendar for 23.2.0-rc2 |
| - .pick_status.json: Update to e88c0770969f6ae0bfa5bea0f9d99687d257fea1 |
| - .pick_status.json: Mark d3f26cbbe1a957b76804da44bbf5e30de2bac941 as denominated |
| - .pick_status.json: Update to c5a6e88c4e816ded6105b74f101528eb004e0581 |
| - .pick_status.json: Update to 088c2bbd51a48eb0de1e9fd23c529759585bad59 |
| - .pick_status.json: Update to 088c2bbd51a48eb0de1e9fd23c529759585bad59 |
| - VERSION: update to 23.2.0-rc3 |
| - docs: Extend calendar entries for 23.2 by 2 releases. |
| - docs: update calendar for 23.2.0-rc3 |
| - .pick_status.json: Update to 10e75aae1bddee9795b1ff04ffd656b0da79b5b5 |
| - .pick_status.json: Updates notes for aebe58458611e0bb585a5bce8e16c1175783f3cc |
| - .pick_status.json: Updates notes for f8cb0d8a44afb9c70f38e359ffe0ad57416e66a4 |
| - Revert "Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR"" |
| - .pick_status.json: Updates notes for 93b4f200dead198e680991a1e95bf3d3b58f87bd |
| - .pick_status.json: Updates notes for 7e246f7f2bde0c859269c4b81505bd0887045e7b |
| - .pick_status.json: Updates notes for 9865e5dff49395543da4331a943ba5a03ce6a413 |
| - .pick_status.json: Update to 1cdc4be14b66108ae0e8069686ac3efe52bef3cb |
| - .pick_status.json: Updates notes for b8ea9724fa5ca38620bc0cdc01b7addd05574954 |
| - .pick_status.json: Updates notes for 68027bd38e134f45d1fe8612c0c31e5379ed7435 |
| - VERSION: update to 23.2.0-rc4 |
| - docs: Update release calendar for 23.2.0-rc4 |
| - .pick_status.json: Mark fa6562b239f00f9f72c988459e252bdee072fd73 as denominated |
| - .pick_status.json: Update to f4fecdad724edf8187d22928ed844af7fd84654d |
| - zink/ci: mark unexpcted pass as expected |
| |
| Emma Anholt (124): |
| |
| - zink: Avoid infinite loop finding no var in update_so_info. |
| - ci/crocus: Update checksum for STK. |
| - symbol_table: Store the symbol name in the same allocation as the symbol entry. |
| - symbol_table: Don't maintain the HT as we're destroying the table. |
| - symbol_table: Don't bother resetting the key on popping scope. |
| - symbol_table: Prehash the key on insert, and reuse the entry on shadowing. |
| - tu/perfetto: Refactor code out of the macro, to stage_end. |
| - tu/perfetto: Clean up an extra token paste to just use the arg being passed. |
| - tu/perfetto: Use tu_CmdBeginDebugUtilsLabelEXT as a stage event in perfetto. |
| - tu/perfetto: Drop unused arg to send_descriptors(). |
| - tu/perfetto: s/MRTs/attachment_count/ in traces. |
| - anv: Only enable GPL if ANV_GPL=true, or if zink or DXVK are the engine. |
| - anv: Refactor repeated pipeline creation feedback output code. |
| - ci/lvp: Update sanctuary trace hash. |
| - ci/radv: Demote navi21 to manual until recent flakiness resolves. |
| - ci/zink+tu: Drop some intermittently failing a630 traces. |
| - ci/freedreno: Drop portal-2-v2 trace. |
| - ci/radv: Add known flakes for #8817 |
| - ci: Crank up the yamllint line length limit. |
| - ci/freedreno: Demote a530 to manual again. |
| - ci: Make a variable for the repeated rootfs directory name. |
| - ci: Add the Vulkan validation layer to amd64 rootfs builds. |
| - ci/zink: Re-enable traces now that !20319 has landed. |
| - ci: Move zink's validation layer setup to deqp-runner.sh. |
| - ci/zink: Enable the validation layer on the TGL GL46 run. |
| - blob: Don't valgrind assert for defined memory if we aren't writing. |
| - util/log: Fix log messages over 1024 characters. |
| - ci: Move some timeout xfails to skips. |
| - ci/deqp: Update to 1.3.5.1 and pull in additional bugfixes from main. |
| - ci/zink: Drop anv/lvp validation exceptions that should be fixed in the CTS. |
| - ci/valve: Add a workaround for finding libdrm on navi21s. |
| - ci/panfrost: Drop tex3d-maxsize on g52. |
| - ci/lima: Skip ppgtt_memory_alignment that flaked a job with the oomkiller. |
| - ci/crocus: Note a recent regression. |
| - ci/zink: Try to update TGL results for new MSAA behavior. |
| - vulkan: Handle alignment failure in the pipeline cache. |
| - vulkan: Actually increment the count of objects in GetPipelineCacheData. |
| - Revert "ci/zink: Try to update TGL results for new MSAA behavior." |
| - ci/zink: Update more xfails for tgl piglit. |
| - ci/zink+anv: Test piglit quick_gl pre-merge, dropping a few KHR-GL46 tests. |
| - ci/radeonsi: Mark glx-make-current as flaky. |
| - ci/radv: Disable flaky heaven d3d9 trace. |
| - ci/turnip: Drop an xfail from the full run for a recent fix. |
| - ci/turnip: Drop the IUB bug fallout flakes. |
| - mesa: Fix debug logging of fp compile compare func. |
| - mesa: Fix precompile of GLSL programs with shadow samplers. |
| - zink: Explain some of the current pathway for shadow sampling. |
| - zink: Fix silly void * type in rewrite_tex_dest. |
| - zink: Don't flag legacy_shadow_mask for RED-only reads in the shader. |
| - ci: Re-enable some piglit tests that should be fast enough post-uprev. |
| - ci/zink+anv: Skip a couple more long tests pre-merge. |
| - compiler: Update reference to name_for_stage func. |
| - nir: Add helpers for lazy var creation. |
| - drm-shim: Avoid assertion fail if someone does close(-1). |
| - glsl: Allow invariant flags on sysvals, such as gl_PointCoord. |
| - nir/lower_texcoord_replace: Flag SYSTEM_VALUE_POINT_COORD read when we load it. |
| - zink: Use PIPE_CAP_FS_POINT_IS_SYSVAL. |
| - mesa: Use find_state_var in lower_builtin. |
| - nir: Use find_state_var in lower_atomics_to_ssbo. |
| - nir,mesa: Add helpers for creating uniform state variables. |
| - mesa: Move ATI_fragment_shader fog code emit to a NIR lowering pass. |
| - mesa/ARB_fp: Drop an extra enum for fog mode. |
| - mesa/ARB_fp: Use the NIR pass for adding fog code instead of ARB instrs. |
| - mesa: Move ARB_vp position invariant option handling to NIR. |
| - mesa: Drop ARB program helper functions that are no longer used. |
| - mesa: Drop unused control flow instructions for ARB programs. |
| - mesa: Drop remaining unused ARB program instructions. |
| - mesa: Move st_prog_to_nir_postprocess out of prog_to_nir. |
| - mesa/ati_fs: Move sampler dim adjustment to a separate NIR pass. |
| - mesa/ati_fs: Move NIR translation to ATI_fs compile time. |
| - mesa/ati_fs: Move prog->SamplersUsed/TexturesUsed setup to EndFragmentShader. |
| - mesa: Use the NIR pass for fixed function fog. |
| - mesa/ffvs: Fix mvp_with_dp4 position transformation. |
| - mesa: Use shared NIR code for ARB_vp and FF VS position transformation. |
| - ci/freedreno: Update minetest hash. |
| - Revert "ci: disable anholt's farm" |
| - crocus: Fix regression from !20153 |
| - ci/crocus: Add a missing xfail. |
| - ci/turnip: Update full-run xfails. |
| - tu: Ignore unused shader stages in pipeline library creation. |
| - anv: Drop unused ALL_GRAPHICS_LIB_FLAGS. |
| - ci/crocus: Update trace hash for the neverball regression. |
| - ci/etnaviv: Update some xfails common between the last 3 nightly runs. |
| - v3d: Respect nir_intrinsic_store_output's write_mask. |
| - mesa: Emit full output write in st_pbo_create_vs(). |
| - mesa: Port the pbo.use_gs path to NIR and let it get used on NIR drivers. |
| - softpipe: Drop the use_tgsi debug flag. |
| - llvmpipe: Drop the LP_DEBUG=tgsi_ir debug option. |
| - virgl: Drop the VIRGL_DEBUG=use_tgsi debug var. |
| - r600: Drop docs for use_tgsi debug var. |
| - r300: Drop RADEON_DEBUG=use_tgsi. |
| - nouveau: Delete the NV50_PROG_USE_TGSI env var. |
| - svga: Switch to preferring NIR by default. |
| - nine: Drop the nir_vs/nir_ps env vars. |
| - gallium: Drop PIPE_SHADER_CAP_PREFERRED_IR. |
| - mesa/drawtex: Cut out the TGSI semantic translation. |
| - svga: Stop asserting that compute params are queried against TGSI. |
| - mesa: Always query our compute params against IR_NIR. |
| - mesa: Drop TGSI token handling |
| - mesa: Simplify st_get_nir_compiler_options(). |
| - mesa: Drop dead TGSI serialization prototypes. |
| - mesa/atifs: Rename the header guard. |
| - mapi: clang-format _glapi_add_dispatch(). |
| - mapi: Delete dynamic stub generation. |
| - mesa: Drop the function parameter spec from the remap table. |
| - mapi: Clean up mapi_stub struct. |
| - mesa: Drop the aliases from the remap table. |
| - mapi: Drop the unused_functions table. |
| - mapi: Delete execmem support code. |
| - intel: Count reads_remaining across all blocks. |
| - intel: Allocate the last_grf_write once per scheduler. |
| - intel: Reduce cost of resetting last_grf_write. |
| - ci/zink: Update current xfails on tgl. |
| - ci: Update to vulkan-cts-1.3.5.2 (and pull in some more fixes). |
| - ci: Drop skips for some previously-invalid CTS tests. |
| - ci: Drop some skips of GL CTS ArraysOfArrays tests. |
| - ci/anv: Make anv-manual-rules actually manual on anv-only changes. |
| - ci: Clean up .intel-rules definition. |
| - ci/amd: Report flakes to #amd-ci on OFTC. |
| - ci/anv: Add testing of the GLES CTS using ANGLE on TGL. |
| - ci/radv+radeonsi: Fix the combo rules to include core vulkan changes. |
| - ci/radv: Add testing of the GLES CTS using ANGLE on stoney. |
| - ci/tu: Drop some xfails for !24086 |
| - disk_cache: Disable the "List" test for RO disk cache. |
| |
| Eric Engestrom (134): |
| |
| - VERSION: bump to 23.2 |
| - docs: reset new_features.txt |
| - v3d: add flake spec@ext_framebuffer_blit@fbo-sys-sub-blit |
| - ci: stop removing -x11 suffix for x11 build of deqp-egl |
| - ci: add -android suffix for android build of deqp-egl |
| - ci: move deqp-egl instead of copying it |
| - ci: start documenting which image tags need to be bumped |
| - ci: bump tags |
| - ci: update shebang to make it more portable |
| - broadcom/ci: deduplicate script definition |
| - v3dv/ci: drop fixed failure from fails.txt |
| - amd: fix buggy usage of unreachable() |
| - compiler: fix buggy usage of unreachable() |
| - pvr: fix buggy usage of unreachable() |
| - vk/util: fix buggy usage of unreachable() |
| - util: enforce unreachable()'s argument being a literal string |
| - egl: inline driver.GetProcAddress() as it's always _glapi_get_proc_address() |
| - ci: rework vulkan validation layer build script |
| - v3d: document that \`V3D_DEBUG=shaderdb` is \*not* for shader-db |
| - v3d: fix tfu_supports_tex_format() param type, and document why |
| - v3d: fix various minor issues in gen_pack_header.py |
| - dzn: fix pointer type mismatch |
| - ci: bump bin/ci/ deps to support python 3.11 |
| - ci: drop GENERATE_ENV_SCRIPT |
| - ci: stop marking environment variable list as executable |
| - ci: replace write + cat with tee |
| - ci: disable anholt's farm |
| - ci: only execute capture-devcoredump.sh when it's present |
| - util/bitset: ensure the sets compared have the same size at compile time |
| - docs: add release notes for 23.1.0 |
| - docs: update calendar for 23.1.0 |
| - ci/b2c: increase timeout to 5 minutes |
| - ci/amd: don't override the b2c timeout in the steamdeck config |
| - ci/zink: add new zink-radv-navi10-valve flakes |
| - mailmap: update @mupuf's name |
| - docs: fix release date of 23.1.0 |
| - ci/zink: document new zink-radv-navi10-valve failures |
| - v3dv: fix align() computation for pixel formats with non-POT block sizes |
| - docs: update calendar for 23.1.1 |
| - docs: add release notes for 23.1.1 |
| - docs/relnotes: add sha256sum for 23.1.1 |
| - ci_run_n_monitor: add ability to specify the pipeline to use, instead of auto-detecting it |
| - ci/amd: move AMD-specific LD_PRELOAD to AMD config |
| - ci/amd: only define AMDGPU_GPU_ID for the duration of the call |
| - bin/ci: fix mistakenly hardcoded repo name in get_gitlab_project() |
| - ci/intel: reuse iris_file_list instead of copying its definition |
| - meson: simplify another "any of" check |
| - wsi/display: drop unused parameters from local functions |
| - ci: split clang-format list of folders for easier maintenance |
| - ci: show diff when clang-format check fails |
| - panfrost: fix formatting of a couple of files that were missed |
| - panfrost: rename \*.cc files to \*.cpp |
| - ci/zink+radv: fix flakes definition |
| - ci/zink+radv: mark all spec@arb_copy_image@arb_copy_image-targets* as flaky after getting a bunch more of them |
| - ci/zink+radv: document recent regressions |
| - ci: color the diff for clang-format |
| - meson: enable the clang-format target |
| - ci: use meson to run clang-format |
| - docs: document clang-format and how to use it |
| - docs/calendar: add 23.2 branchpoint and release candidates |
| - ci/zink+radv: mark flakes as such |
| - ci/radv: fix flakes definition |
| - ci/crocus: fix flakes definition |
| - ci/zink+anv: fix flakes definition |
| - ci/b2c: also detect non-soft GPU hangs with AMDGPU |
| - amd/ci: run gl(es) cts & piglit on radeonsi on vangogh |
| - ci/radv: update expectations |
| - ci/zink+radv: update expectations |
| - docs/relnotes/23.1.1: clear "new features" |
| - docs: add release notes for 23.1.2 |
| - docs/relnotes: add sha256sum for 23.1.2 |
| - docs: update calendar for 23.1.2 |
| - egl: return correct error for EGL_KHR_image_pixmap |
| - clang-format: add explanation for anyone reading .clang-format-include |
| - radv,aco: tweaks to get clang-format to print nicer code |
| - radv: reformat according to its .clang-format |
| - aco: reformat according to its .clang-format |
| - ci: enforce formatting for RADV & ACO |
| - radv: fix formatting |
| - Revert "ci: remove clang-format testing" |
| - asahi: drop unnecessary DRM_FORMAT_MOD_{LINEAR,INVALID} fallbacks |
| - ci: mark the valve farm as down |
| - docs/ci: fix command to disable/re-enable farms |
| - docs: add release notes for 23.1.3 |
| - docs/relnotes: add sha256sum for 23.1.3 |
| - docs: update calendar for 23.1.3 |
| - docs/coding-style: add example vim config for clang-format |
| - docs/coding-style: add example emacs config for clang-format |
| - docs/coding-style: add pre-commit hook fallback for clang-format |
| - v3dv: replace boolean and uint with bool and size_t |
| - amd/ci: add another dEQP-VK.multiview.renderpass2.multisample.* flake |
| - amd/ci: add another dEQP-VK.dynamic_rendering.primary_cmd_buff.basic.* flake |
| - ci: split valve farm in two |
| - util/disk_cache: fix ~/.cache/ permissions |
| - panfrost/ci: drop invalid skips that are already marked as known flakes |
| - intel/ci: fix skips definitions |
| - etnaviv/ci: fix skips definition |
| - zink/ci: fix skips & flakes for zink+radv on vangogh & navi10 |
| - docs/codingstyle: fix clang-format command |
| - vc4/ci: fix skipping of gles3 piglit tests |
| - v3dv/ci: fix skipping of vk tests |
| - v3dv/ci: skip more tests that are timing out |
| - virgl/ci: fix skips definition |
| - clang-format: add egl foreach macro |
| - clang-format: add wayland foreach macros |
| - egl: change a couple of clang-format settings |
| - egl: add a few trailing commas |
| - egl: protect the formatting in a couple of places |
| - egl: prevent clang-format from reordering some headers |
| - egl: re-format using clang-format |
| - clang-format: enforce formatting of egl |
| - add initial .git-blame-ignore-revs |
| - ci/zink+radv: document another flake |
| - ci/zink+radv: fix flake definition |
| - ci: document workflow rules |
| - ci: set priority:low tag only on non-Marge pipelines |
| - ci: fix .valve-farm-manual-rules |
| - ci: split farm rules out of test-source-dep.yml |
| - etnaviv/ci: drop duplicate line in etnaviv files list |
| - broadcom/ci: add the renderonly folder to things that can affect v3d & vc4 |
| - meson: clarify description of \`opengl` option |
| - meson: clarify what "off-screen rendering" means |
| - ci: avoid running hardware jobs if there are already trivial issues |
| - ci: avoid running hardware jobs if lint fails - now on LAVA too! |
| - ci: avoid running hardware jobs if lint fails - now on Windows too! |
| - bin/ci_run_n_monitor: get git sha from pipeline if specified, instead of requiring --rev to match |
| - panfrost: upcast uint8/uint16 before shifting them beyond their range |
| - vc4: drop duplicate .lower_ldexp |
| - zink: fix format in zink_make_{image,texture}_handle_resident() |
| - v3dv: fix VK_PIPELINE_ROBUSTNESS_{BUFFER,IMAGE}_BEHAVIOR_DEVICE_DEFAULT_EXT copy/paste typo |
| - v3dv: fix copy/pasted type of \`sample` |
| - v3dv: fix shader stage name in error message |
| - v3d/qpu: fix type of function argument |
| - ci/farm-rules: fix missing valve-infra jobs in scheduled pipelines |
| |
| Erico Nunes (6): |
| |
| - Revert "ci: disable lima farm, currently out-of-space, needs to be fixed" |
| - lima: fix stringop-overflow warning |
| - lima/ci: temporarily disable deqp-egl tests due to timeouts |
| - ci: temporarily disable lima farm |
| - ci: restore lima farm |
| - lima: fix plbu block stride calculation |
| |
| Erik Faye-Lund (144): |
| |
| - nir: remove nir_state_slot::swizzle |
| - glsl: remove ir_state_slot::swizzle |
| - docs: renderpass -> render pass |
| - docs: statechanges -> state changes |
| - docs: backfacing -> back-facing |
| - docs: codepath -> code-path |
| - docs: did't -> didn't |
| - docs: cma -> CMA |
| - docs: Anv -> ANV |
| - docs: perfetto -> Perfetto |
| - docs: use correct tick for "doesn't" |
| - docs: vlan -> VLAN |
| - docs: toplevel -> top-level |
| - docs: correct spelling of "source" |
| - docs: correct spelling of "tagged" |
| - docs: correct spelling of "frame" |
| - docs: sort extensions |
| - docs: add custom html theme |
| - docs: add bootstrap extension |
| - docs: translate admonitions into bootstrap alerts |
| - docs: remove support for old sphinx-versions |
| - docs: use custom html theme |
| - nir: clean up white-space in deref-printing |
| - mesa/main: clean up white-space in ffvertex_prog.c |
| - mesa/main: drop disasm-code from ffvertex_prog.c |
| - mesa/main: allow passing nir-shaders to st_program_string_notify |
| - mesa/main: make ffvertex output nir |
| - nir: fix constant-folding of 64-bit fpow |
| - docs: fix edit-links |
| - mesa/main: drop use_legacy_math_rules |
| - llvmpipe: fixup refactor copypasta |
| - docs: fixup About Mesa3D.org link |
| - docs/tgsi: fix up indent |
| - docs/tgsi: fix bad latex |
| - docs/tgsi: fixup bad latex |
| - docs/tgsi: wrap overly long lines |
| - docs/tgsi: use math-notations for conditionals |
| - docs/tgsi: do not use math-block for non-latex |
| - docs/tgsi: fixup latex for TEX and TEX2 |
| - docs/tgsi: use \\ll and \\gg for left and right shift |
| - aux/draw: check for lines when setting clipping-mode |
| - zink: fix bad indent |
| - zink: clean up tcs_vertices_out_word handling |
| - zink: do not open-code memcpy |
| - aco: use c++17 |
| - meson: remove needless c++17-overrides |
| - mesa/main: clean up white-space in ff_fragment_shader.cpp |
| - mesa/st: refactor st_translate_fragment_program |
| - mesa/st: allow using nir for ff-fragment shaders |
| - compiler/nir: move find_state_var to common code |
| - mesa/main: ff-fragshader to nir |
| - mesa/main: compile ff_fragment_shader as c-code |
| - mesa/program_cache: remove unused shader-cache functions |
| - panfrost: expose PIPE_CAP_POLYGON_OFFSET_CLAMP |
| - util: mark externally-unused functions as static |
| - nir: use more nir_fmul_imm |
| - nir: use more nir_fadd_imm |
| - nir: fsub -> fadd_imm |
| - nir: use more nir_ffma_imm variants |
| - nir: add nir_fsub_imm |
| - nir: use nir_fsub_imm |
| - radeonsi,radv: use nir_format_linear_to_srgb |
| - docs: explicitly mark extensions as obsolete |
| - docs: mark MESA_multithread_makecurrent as obsolete |
| - docs: mark MESA_shader_debug as obsolete |
| - docs: mark MESA_swap_frame_usage as obsolete |
| - docs: mark MESA_texture_array as obsolete |
| - docs: move obsolete extensions to their own list |
| - zink: update profiles schema |
| - zink: keep gl46_optimal extensions/features sorted |
| - zink: compute correct location for line-smooth gs |
| - zink: do not lower line-smooth for non-lines |
| - docs: increase contrast in dark-theme |
| - zink: update profiles schema |
| - d3d12, dozen: make sure we pass float to fge |
| - nir: use nir_i{ne,eq}_imm helpers |
| - nir: generate nir_{cmp}_imm variants |
| - nir: use generated immediate comparison helpers |
| - nir: add nir_[fui]gt_imm and nir_[fui]le_imm helpers |
| - nir: use new immediate comparison helpers |
| - mesa/st: use nir_imm_vec4 |
| - nir: use more imm-helpers |
| - nir: isub -> iadd_imm |
| - nir: use nir_imm_{true,false} |
| - nir: add and use nir_fdiv_imm |
| - nir: add and use nir_imod_imm |
| - nir: add missed nir_cmp_imm-helpers |
| - docs: upgrade bootstrap to 5.3.0 |
| - cso: use enum for render-conditions |
| - draw: use enum for tgsi-semantic |
| - draw: use uint32_t instead of uint |
| - draw: use enum for primitive-type |
| - draw: track vertices and vertex_ptr as byte-pointers |
| - draw: use stdint.h types |
| - cso: use unsigned instead of uint |
| - draw: match type of pipe_draw_start_count_bias::count |
| - draw: use unsigned instead of uint |
| - aux/indices: use stdint.h types |
| - draw/i915: move hwfmt array to i915 specific struct |
| - microsoft/compiler: use nir_imm_zero |
| - mesa/st: use nir_ineg |
| - vulkan: avoid needless constant-folding |
| - broadcom/compiler: use imm-helpers |
| - v3dv: use imm-helpers |
| - pan: use imm-helpers |
| - freedreno: use imm-helpers |
| - r600/sfn: use imm-helpers |
| - d3d12: use imm-helpers |
| - radeonsi: use imm-helpers |
| - vc4: use imm-helpers |
| - intel: use imm-helpers |
| - anv: use imm-helpers |
| - hasvk: use imm-helpers |
| - mesa/st: use imm-helpers |
| - amd: use imm-helpers |
| - etnaviv: use imm-helpers |
| - gallium: use imm-helpers |
| - nir: use imm-helpers |
| - math: fix indentation in m_matrix.[ch] |
| - math: remove unused defines |
| - math: drop MAT_[ST][XYZ] defines |
| - aux/trace: use stdint.h types |
| - pipebuffer: use unsigned instead of uint |
| - gallivm: use unsigned instead of uint |
| - aux/pp: use unsigned instead of uint |
| - aux/util: use enum for render-condition |
| - aux/util: match type of pipe_draw_start_count_bias::start/count |
| - aux/util: use enum for primitive-type |
| - aux/util: use unsigned instead of uint |
| - aux/util: use stdint.h types |
| - aux/util: uint -> unsigned |
| - tgsi: use enum instead of defines |
| - tgsi: use stdint.h types |
| - tgsi: use enum for tgsi-file type |
| - tgsi: use enum for property-name |
| - tgsi: use enum for shader-type |
| - tgsi: use enum for interpolate-mode |
| - tgsi: uint -> uint32_t |
| - tgsi: uint -> unsigned |
| - nir: constify intrin |
| - nir: use nir_intrinsic_get_var |
| - radv: do not rely on constant-folding |
| - nir: do not needlessly rely on optimizations |
| - panfrost: delete stale editorconfig file |
| |
| Faith Ekstrand (16): |
| |
| - nouveau/nir: image_samples/size don't have coordinates |
| - vulkan: Document vk_physical_device::supported_features |
| - nir/opt_if: Use block_ends_in_jump |
| - nir: Add a reg_intrinsics flag to nir_convert_from_ssa |
| - nir/from_ssa: Make additional assumptions in coalescing |
| - nir/from_ssa: Support register intrinsics |
| - freedreno/ci: Update pixmark piano checksums |
| - nv50/ir: Support vector movs |
| - nir: Properly handle divergence for load_reg |
| - nir/trivialize: Maintain divergence information |
| - nir/trivialize: Trivialize cross-block loads |
| - Revert "mesa, compiler: Move gl_texture_index to glsl_types.h" |
| - Revert "compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h" |
| - nir: Handle nir_op_mov properly in opt_shrink_vectors |
| - nir: Don't handle nir_op_mov in get_undef_mask in opt_undef |
| - nir: Fix metadata in nir_lower_is_helper_invocation |
| |
| Felix DeGrood (19): |
| |
| - anv: disable reset query pools using blorp opt on MTL |
| - anv: Add END_OF_PIPE_SYNC reporting to INTEL_DEBUG=pc |
| - anv: Add flush reasons to raytracing flushes |
| - anv: Add flush reason to NEEDS_END_OF_PIPE_SYNC |
| - anv: split INTEL_MEASURE multi events |
| - intel: INTEL_MEASURE cpu mode |
| - anv: Enable INTEL_MEASURE=cpu |
| - iris: Enable INTEL_MEASURE=cpu |
| - docs: add INTEL_MEASURE=cpu |
| - intel/debug: Control start/stop frame of batch debug |
| - anv: Enable INTEL_DEBUG_BATCH_FRAME_START/_STOP |
| - iris: Enable INTEL_DEBUG_BATCH_FRAME_START/_STOP |
| - docs: Add INTEL_DEBUG_BATCH_FRAME_START/_STOP |
| - anv: fix INTEL_MEASURE on MTL |
| - anv: re-enable RT data in INTEL_MEASURE |
| - intel: refactor INTEL_MEASURE pointer dumping |
| - intel: batch consecutive dispatches into implicit renderpasses |
| - intel: Secondary CB print primary CB's renderpass |
| - anv: override vendorID for Cyberpunk 2077 |
| |
| Feng Jiang (3): |
| |
| - frontends/va: Fix memory leak of decrypt_key |
| - radeonsi/vcn: Remove unnecessary type conversion |
| - virgl/video: Fix out-of-bounds access in fill_mpeg4_picture_desc() |
| |
| Filip Gawin (5): |
| |
| - nine: add fallback for D3DFMT_D16 in d3d9_to_pipe_format_checked |
| - glx: fix build with APPLEGL |
| - ac/nir: fix slots in clamping legacy colors |
| - anv: allow intel_clflush_range only on igpu |
| - crocus: Avoid fast-clear with incompatible view |
| |
| Francisco Jerez (3): |
| |
| - anv: Fix calculation of guardband clipping region. |
| - intel/gfx12.5: Enable L3 partial write merging for compressible surfaces among other cases. |
| - anv: Swap ordering of memory types on non-LLC platforms to work around application bugs. |
| |
| Frank Binns (7): |
| |
| - pvr: add missing explicit check against VK_SUCCESS |
| - pvr: use util_dynarray_begin() in more places |
| - pvr: replace transfer EOT binary shaders with run-time compiled shaders |
| - pvr: fix typo in pvr_rt_get_region_headers_stride_size() |
| - pvr: fix array overflow in pvr_device_tile_buffer_ensure_cap() |
| - pvr: fix invalid read reported by valgrind |
| - pvr: skip setting up SPM consts buffer when no const shared regs are used |
| |
| Friedrich Vock (41): |
| |
| - radv/rmv: Fix creating RT pipelines |
| - radv/rmv: Fix import memory |
| - radv/rt: Plug some memory leaks during shader creation |
| - radv: Don't leak the RT prolog binary |
| - radv: Hash pipeline libraries separately |
| - radv: Always call si_emit_cache_flush before writing timestamps |
| - radv: Add driconf to always drain waves before writing timestamps |
| - nir: Rematerialize derefs in use blocks before repairing SSA |
| - nir: Remove unnecessary assert in nir_before_src |
| - radv: Disable capture/replay handles |
| - aco: Lower divergent bool phis iteratively |
| - radv: Always flush before writing acceleration structure properties |
| - aco: Reset scratch_rsrc on blocks without predecessors |
| - aco: Fix live_var_analysis assert |
| - aco: Fix assert in insert_exec_mask |
| - radv: Add driconf to force wave64 for RT |
| - radv: Add RADV_DEBUG=nort |
| - radv: Enable ray tracing pipelines by default |
| - radv: Add the BOs of all shaders in a RT pipeline |
| - radv: Add radv_shader_free_list |
| - radv: Move shader arena allocation to a separate function |
| - radv: Add option to allocate shaders in replayable VA range |
| - radv: Add utilities to serialize and deserialize shader allocation info |
| - radv: Add radv_shader_reupload |
| - radv: Break up radv_shader_nir_to_asm |
| - radv: Split up implementation of radv_shader_create |
| - radv: Add support for creating capture/replay shaders |
| - radv: Add radv_rt_capture_replay_handle |
| - radv/rt: Only compare the non-recursive capture/replay handle |
| - radv/rt: Associate capture/replay handles with stages |
| - radv/rt: Replay shader allocations according to capture/replay handle |
| - radv/rt: Rework radv_GetRayTracingCaptureReplayShaderGroupHandlesKHR |
| - radv: Re-enable RT pipeline capture/replay handles |
| - meson: Prefix Vulkan "Ray Tracing" summary with "Intel" |
| - radv/ci: Skip ray tracing tests on vangogh |
| - Revert "radv/rt: Enable RT pipelines on GFX10_3+ excluding vangogh" |
| - Revert "Revert "radv: Enable ray tracing pipelines by default"" |
| - radv/rt: Enable exact on software intersection functions |
| - radv/rt: Miss rays that hit the triangle's v edge |
| - radv: Handle VK_SUBOPTIMAL_KHR in trace layers |
| - nir/load_store_vectorize: Handle intrinsics with constant base |
| |
| Ganesh Belgur Ramachandra (5): |
| |
| - gallium/pipe: Add get_resources() to pipe_video_buffer |
| - gallium/vl: implementation for get_resources() |
| - nouveau: implementation for get_resources() |
| - d3d12: implementation for get_resources() |
| - frontends/va: use resources instead of views |
| |
| Georg Lehmann (51): |
| |
| - nir: lower ballot_bit_count_exclusive/inclusive to mbcnt_amd |
| - radv: use lower_ballot_bit_count_to_mbcnt_amd |
| - aco: Assert that operands have the same byte offset when reassigning split vectors |
| - aco: also reassign p_extract_vector post ra |
| - aco/vn: compare all valu modifers |
| - aco/optimizer: don't use pass_flags for mad idx |
| - aco/optimizer: copy pass flags for newly created valu instructions |
| - aco/assembler: support VOP3P with DPP |
| - aco/builder: support VOP3(P) with dpp |
| - aco: add assembler tests for VOP3(P) with DPP |
| - aco/ra: convert VOPC_DPP instructions without vcc to VOP3 |
| - aco: use VOP3+DPP |
| - aco: don't apply dpp if the alu instr uses the operand twice |
| - aco: emit_wqm on MIMG dst, not operands |
| - aco: introduce helper to swap valu operands with modifiers |
| - aco/gfx11: use fmamk/fmaak with opsel |
| - aco: add withoutVOP3 helper |
| - aco/ra: use smaller operand stride for VOP3P with DPP |
| - aco/ra: use fmac with DPP/opsel on GFX11 |
| - aco: add helper function for can_use_input_modifiers |
| - aco: use get_operand_size for dpp opt |
| - aco: use can_use_input_modifiers helper |
| - aco/optimizer: allow DPP to use VOP3 on GFX11 |
| - util: fix stack dynarray used by multiple tus |
| - nir/opt_if: use nir_alu_instr_is_comparison directly |
| - aco: cleanup v_cmp_class usage |
| - aco: p_start_linear_vgpr doesn't always need exec mask |
| - aco/ir: return true in hasRegClass for Operand(reg, rc) |
| - aco/statistics: improve v_fma_mix dual issuing detection |
| - aco: use v_add_f{16,32} with clamp for fsat |
| - aco: use v_fma_mix for f2f32 and f2f16 on gfx11 if wave64 |
| - aco: make validation work without SSA temps |
| - aco: move cfg validation to its own function |
| - aco: don't validate p_constaddr_addlo/p_resumeaddr_addlo operands |
| - aco: validate ir for prologs and after lower_to_hw_instr |
| - aco/opcodes: move v_cndmask_b32 back to the VOP2 list |
| - aco: remove v_cvt_pkrtz_f16_f32_e64 when it's actually VOP2 |
| - aco/opcodes: delete wrong comment copy pasted from NIR |
| - aco: use uses helpers for pk_fma opt |
| - aco: combine scalar mul+pk_add to pk_fma |
| - aco/gfx10+: use v_cndmask with literal for reduction identity |
| - nir: add single bit test opcodes |
| - nir/lower_bit_size: mask bitz/bitnz src1 like shifts |
| - aco: implement nir_op_bitz/bitnz |
| - nir/opt_algebraic: combine bitz/bitnz |
| - radv: set has_bit_test for aco |
| - aco/optimizer: delete s_bitcmp optimization |
| - aco/gfx11: fix get_gfx11_true16_mask with v_cmp_class_f16 |
| - aco: fix non constant 16bit bitnz/bitz |
| - aco: fix u2f16 with 32bit input |
| - nir/opt_algebraic: remove broken fddx/fddy patterns |
| |
| George Ouzounoudis (1): |
| |
| - radv: small fix for VkDescriptorSetVariableDescriptorCountLayoutSupport |
| |
| Gert Wollny (98): |
| |
| - r600/sfn: Lower tess levels to vectors in TCS |
| - r600/sfn: make sure f2u32 is lowered late and correctly for 64 bit floats |
| - r600: remove TGSI code path |
| - r600/sfn: Add a type for address registers |
| - r600/sfn: don't track address registers in live ranges |
| - r600/sfn: Handle MOVA_INT in sfn assembler |
| - r600/sfn/tests: Cleanup and move some code around |
| - r600/sfn: Add address and index registers creation to ValueFactory |
| - r600/sfn: Rework query for indirect access in alu instr and opt |
| - r600/sfn: don't allow more than one AR per instruction |
| - r600: Allow both index registers for all CF types |
| - r600/sfn: Prepare uniforms and local arrays for better address handling |
| - r600/sfn: handle AR and IDX register in shader from string |
| - r600/sfn: add method to update indirect address to all instrution types |
| - r600/sfn: Add function to insert op in block |
| - r600/sfn: Update resource based instruction index mode check |
| - r600/sfn: Be able to track expected AR uses |
| - r600/sfn: AR and IDX don't need the write flag, but haev a parent |
| - r600/sfn: Add a RW get function of IF predicate access |
| - r600/sfn: Add interface to count AR uses in ALU op |
| - r600/sfn: Add pass to split addess and index register loads |
| - r600/sfn: Add function to check whether a group loads a index register |
| - r600/sfn: take address loads into account when scheduling |
| - r600/sfn: Add more tests and update to use address splits |
| - r600/sfn: Don't copy-propagate indirect access into LDS instr |
| - r600/sfn: Add test for multiple index load |
| - r600/sfn: set CF force flag always when starting a new block |
| - r600/sfn: Start a new ALU CF on index use, not on index emission |
| - r600/sfn: Add chip family to shader class |
| - r600/sfn: Add handling for R600 indirect access alias handling |
| - r600/sfn: Override Array access handling in backend assembler |
| - r600/sfn: Fix copy-prop with array access |
| - r600/sfn: scheduled instructions are always ready |
| - r600/sfn: Add more tests and update to use address splits |
| - r600/sfn: print failing block when scheduling fails |
| - r600/sfn: Can't use an indirect array access as source to AR load |
| - r600/sfn: factor out index loading for non-alu instructions |
| - r600/sfn: prepare for emitting AR loads |
| - r600/sfn: Tie in address load splitting |
| - r600+sfn: Assign ps_conservative_z and switch to NIR defines |
| - r600/sfn: assign window_space_position in shader state |
| - r600/sfn: Ass support for image_samples |
| - r600/sfn: fix cube to array lowering for LOD |
| - r600/sfn: Fix iterator use |
| - r600/sfn: move kill instruction test to alu instruction |
| - r600/sfn: add dependencies for kill instructions |
| - r600/sfn: move kill handling fully to scheduling |
| - r600/sfn: use correct FS output location if not all outputs are used |
| - virgl: Make query result resource as dirty before requesting result |
| - virgl: Add support for ARB_pipeline_statistics |
| - virgl/ci: uprev virglrenderer |
| - docs/features: fix empty line error |
| - virgl: Fix IB upload when a start >0 is given |
| - virgl: Submit drawid_offset if is not zero |
| - virgl: signal support for group vote and draw parameters |
| - virgl: enable ARB_gl_spirv |
| - features: Update virgl features |
| - ci: uprev virglrenderer to include changes needed for GL 4.6 support |
| - r600/sfn: assert that group barrier is not emitted in divergent code flow |
| - r600/sfn: Switch to scoped barriers |
| - util/driconf: pin minImageCount to three for "Path of Exile" |
| - r600/sfn: add read instruction for unused but required LDS op results |
| - r600/sfn: Don't rewrite TESS_EVAL inner tess level outputs |
| - r600/sfn: Add experimental support for load/store_global |
| - r600/sfn: Handle store_global when lowering 64 bit ops to vec2 |
| - r600/sfn: Handle load_global in 64 to vec2 lowering |
| - rusticl: compile r600 driver |
| - r600: fix handling of use_sb flag |
| - r600/sfn: move kill handling to fully scheduling |
| - 600/sfn: Trigger use of ACK for some barriers |
| - r600: Disable SB if we use the ariable length DOT |
| - r600/sfn: Silence warnings "overloaded-virtual" |
| - r600/sfn: Downgrade some error message to warning |
| - r600: Split tex CF only if written component is read |
| - r600/sfn: Don't deref unused group slots |
| - r600/sfn: on R600/R700 write a dummy pixel output if there is a gap |
| - r600/sfn: Clean up FS member initialization |
| - virgl: don't allow vertex input arrays on GLES hosts |
| - r600/sfn: Fix typo |
| - r600/sfn: drop use of nir source mods |
| - r600/sfn: allow source mods for per source with multi-slot ops |
| - r600/sfn: add source and dest mod info to opcode table |
| - r600/sfn: Implement source mod optimization in backend |
| - r600/sfn: Implement fsat for 64 bit ops |
| - r600/sfn: Add source mod propagation also to fp64 ops |
| - r600/sfn: Don't clear clear group flag on vec4 that comes from TEX or FETCH |
| - virgl/ci: Drop duplicate runs |
| - ci: Upref virglrenderer |
| - r600/sfn: Fix filling FS output gaps |
| - r600: Pre-EG - Set wrap texture modes to repeat when seemless cube is used |
| - r600/sfn: Be more conservative with AR re-use |
| - r600/sfn: Shorten array elements live range |
| - r600/sfn: remove debug output leftovers |
| - r600/sfn: Fix use of multiple IDX with kcache |
| - r600/sfn: Don't try to propagate to vec4 with more than one use |
| - r600/sfn: Only switch to other CF if no AR uses are pending |
| - r600/sfn: AR loads should depend on all previous non ALU instructions |
| - r600/sfn: Take source uses into account when switching channels |
| |
| Giancarlo Devich (5): |
| |
| - d3d12: Update and require DirectX-Headers 1.610.0 |
| - d3d12: Query device for D3D12_FEATURE_D3D12_OPTIONS14 |
| - d3d12: Update PSO creation to use CreatePipelineState |
| - d3d12: Add ID3D12GraphicsCommandList8 to the context |
| - d3d12: Support separate front/back stencils |
| |
| Gregory Mitrano (2): |
| |
| - ac/sqtt: Add RGP Definitions for Mesh Shaders |
| - radv/sqtt: Add RGP Markers for Mesh Shaders |
| |
| Guilherme Gallo (29): |
| |
| - ci/lava: Move job definition stuff to another file |
| - ci/lava: Extract LAVA proxy and LAVAJob abstractions |
| - ci/lava: Use python-fire in job submitter |
| - ci/lava: Update LogFollower for better section handling and history |
| - ci/lava: Add a simple Structural Logger into submitter |
| - bin/ci: Add StructuredLogger to improve log handling |
| - ci/lava: Integrate StructuralLogger with AutoSaveDict |
| - ci/lava: Force use of UTC timezones |
| - ci/lava: Refactor LAVAJobSubmitter and add tests |
| - ci/lava: Use f-strings in job definition |
| - ci/lava: Skip regression test if LAVA log file is not present |
| - ci/freedreno: Fix a618-traces-performance rules |
| - ci/lava: Bypass arg list to print_log function |
| - ci/lava: Fix last section in job submitter |
| - ci: Use absolute paths in init-stage2.sh |
| - ci/lava: Add SSH support in rootfs |
| - ci/lava: Add SSH job definition |
| - ci/lava: Add bridge function for job definition |
| - ci/lava: Distinguish test suites in DUT vs Docker |
| - ci/lava: Only check for the first section marker |
| - ci/lava: Hide JWT block during YAML dump |
| - ci/lava: Tweak http-download timeout in SSH based jobs |
| - ci/lava: Raise the post test metadata gathering retry count |
| - ci/lava: Force LAVA panfrost jobs to use UART |
| - dzn: Skip a few deqp tests which are prone to timeout |
| - ci/lava: Renable SSH sessions for panfrost jobs |
| - ci/lava: Increase Docker action failure_retry counter |
| - ci/lava: Add LAVA SSH client container |
| - ci/lava: Use an alpine image for SSH client container |
| |
| Hans-Kristian Arntzen (6): |
| |
| - wsi/x11: Fix present ID signal when IDLE comes before COMPLETE. |
| - wsi/wayland: Simplify wait logic for present wait. |
| - wsi/wayland: Do not assert that all present IDs have been waited on. |
| - radv/amdgpu: Report 48-bit VAs in bo logs. |
| - Fix DGC bug where indirect count > maxSequencesCount. |
| - wsi/x11: Fix potential deadlock in present ID. |
| |
| Harri Nieminen (11): |
| |
| - amd: fix typos |
| - amd: fix typos in code |
| - r300: fix typos |
| - radeonsi: fix typos |
| - r600: fix typos |
| - r600/sb: fix typo |
| - r600/sfn: fix typos |
| - r600/sfn: fix typos in code |
| - broadcom: fix typos |
| - egl: fix typos |
| - glx: fix typos |
| |
| Helen Koike (3): |
| |
| - ci: move .microsoft-farm-container-rules to test-source-dep.yml |
| - ci: remove unused tag DEBIAN_X86_64_TEST_IMAGE_PATH |
| - ci/android: remove strace output from cuttlefish-runner.sh |
| |
| Hyunjun Ko (27): |
| |
| - intel/genxml: fix num bits of some MOCS fields |
| - intel/genxml: conform some fields to each other gen. |
| - intel/genxml: align some fields on gen9/11/12/125 with media driver. |
| - intel/genxml: add a command VD_CONTROL_STATE to gen12/125 |
| - util/vl: initialize data/end pointers. |
| - vulkan/video: add to parse h265 slice. |
| - vulkan/video: add h265 reference structures and relevant util functions. |
| - anv/image: Add a surface usage bit for video decoding |
| - anv/image: allocate mv storage buffers for h265 |
| - anv/image: allow VK_IMAGE_CREATE_ALIAS_BIT with a private binding. |
| - anv: add initial video decode support for h265 |
| - anv: support P010 format for video 10-bit hevc decoding |
| - anv/image: get width/height for each plane of a surface for video decoding. |
| - anv: support HEVC 10-bit decoding |
| - anv: enable the video h265 decode extension. |
| - anv/ci: Add tests for video formats to the failing tests. |
| - anv/video: move video requirements to outarray. |
| - vulkan/video: adds more conditions for setting loop_filter_across_slices_enable in h265 slice parsing. |
| - vulkan/video: move parsing longterm rps in h265 slice parsing. |
| - util/rbsp: keep track of removed bits for the emulation prevention three bytes. |
| - vulkan/video: consider removed bits when calculating the size of comsumed data. |
| - anv/video: fix to set U/V offset correctly. |
| - vulkan/video: keep delta weight and offsets of predicted weight tables in h265 slice parsing |
| - intel/genxml: changes the type for predicted weight to unsigned. |
| - anv: fix to set predicted weight tables correctly. |
| - anv/video: fix to support HEVC 10bit on some of 9th gens. |
| - anv: Adds a workaround for HEVC decoding on some old platforms. |
| |
| Iago Toral Quiroga (34): |
| |
| - broadcom/compiler: fix v3d_qpu_uses_sfu |
| - broadcom/compiler: add a v3d_qpu_instr_is_legacy_sfu helper |
| - broadcom/compiler: fix incorrect check for SFU op |
| - broadcom/compiler: fix incorrect ALU checks |
| - broadcom/compiler: return early for SFU op latency calculation |
| - broadcom/compiler: try harder to merge thread switch earlier |
| - broadcom/compiler: don't allocate undef to rf0 |
| - broadcom/compiler: move buffer loads to lower register pressure |
| - broadcom/compiler: increase peephole limit to 24 instructions |
| - broadcom/compiler: use unified atomics |
| - broadcom/compiler: skip jumps in non-uniform if/then when block cost is small |
| - v3dv: simplify too small Z viewport scale workaround |
| - v3dv: store slice dimensions in pixels |
| - v3dv: allow TFU transfers for mip levels other than 0 |
| - v3dv: align compressed image regions to block size |
| - broadcom/compiler: flag use of control barriers |
| - broadcom/compiler: use scoped barriers |
| - v3d: only warn about bining sync for indirect draw once |
| - v3dv: remove bogus viewport code |
| - v3dv: simplify scissor setup for negative viewport height |
| - broadcom/cle: fix up viewport offset packet definition for V3D 4.1+ |
| - v3d,v3dv: fix viewport offset for negative viewport center |
| - broadcom/compiler: only use last thread switch flag to detect final section |
| - nir/lower_tex: copy missing fields when creating copy of tex instruction |
| - nir/lower_tex: handle lower_tg4_offsets with lower_tg4_broadcom_swizzle |
| - broadcom/compiler: handle textureGatherOffsets |
| - v3dv: expose shaderImageGatherExtended |
| - v3dv: fix slice size for miplevels >= 2 |
| - v3dv: don't use the TLB path if we might be copying partial tiles |
| - v3dv: use div_round_up for division by block size |
| - v3dv: fix blit path for compressed image to buffer copies |
| - broadcom: use nir info to keep track of implicit sample shading |
| - broadcom/compiler: free defin and defout arrays if they already exist |
| - broadcom/compiler: don't leak v3d_compile when finding a new best strategy |
| |
| Ian Romanick (20): |
| |
| - intel/fs: Don't munge source order of 3-src instructions in opt_algebraic |
| - intel/fs: Fix handling of W, UW, and HF constants in combine_constants |
| - intel/fs: Allow HF const in MAD on Gfx12.5 if all sources are HF |
| - nir/algebraic: Fixup iadd3 related patterns |
| - intel/fs: Add constant propagation for ADD3 |
| - intel/eu/validate: Use a single macro define half_float_conversion cases |
| - intel/eu/validate: Add Gfx12.5 |
| - intel/eu/validate: Add some validation of ADD3 |
| - nir: Add optimization pass to reassociate some bfi instructions |
| - intel/fs: Use nir_opt_reassociate_bfi |
| - nir/algebraic: Lower some bfi with two constant sources |
| - intel/fs: Emit better code for bfi(..., 0) |
| - nir/algebraic: Optimize some u2f of bfi |
| - nir/algebraic: Simplify various trivial bfi |
| - intel/stub_gpu: Don't run program again after using GDB |
| - intel/fs: Constant propagate into SHADER_OPCODE_SHUFFLE |
| - intel/fs: Add missing newline |
| - intel/fs: Always do opt_algebraic after opt_copy_propagation makes progress |
| - intel/fs: Constant fold SHL |
| - intel/fs: Constant fold OR and AND |
| |
| Ikshwaku Chauhan (2): |
| |
| - radeonsi/gfx11: updated si_is_format_supported |
| - radeonsi/gfx11: updated vertex format changes |
| |
| Illia Abernikhin (3): |
| |
| - docs: add iris features to docs/features.txt |
| - docs: add crocus features to docs/features.txt |
| - docs: remove i965 features from docs/features.txt |
| |
| Illia Polishchuk (6): |
| |
| - glx: add fail check for current context in another thread |
| - drirc: add allow_sampled_tex_copy option |
| - nir: switch to a normal sampler for ARB program with not depth textures |
| - zink, drirc: Add Borderlands 2 workaround to fix spir-v 1.6 translated discard |
| - zink: move find_sampler_var from zink to nir core |
| - nir: fix invalid sampler search by texture id |
| |
| Italo Nicola (22): |
| |
| - egl: disable partial redraw when gallium hud is active |
| - egl: fix comments alignment |
| - freedreno: implement clear_render_target and clear_depth_stencil |
| - v3d: implement clear_render_target and clear_depth_stencil |
| - vc4: implement clear_render_target and clear_depth_stencil |
| - d3d12: fix clear_depth_stencil texture deref |
| - gallium: implement u_default_clear_texture |
| - gallium: use u_default_clear_texture where applicable |
| - gallium: rename util_clear_texture to util_clear_texture_sw |
| - mesa/st: use fallback path when pipe->clear_texture is not available |
| - rusticl: use fallback path when pipe->clear_texture is not available |
| - clover: use fallback path when pipe->clear_texture is not available |
| - gallium: cleanup util_blitter_clear_render_target |
| - gallium: remove PIPE_CAP_CLEAR_TEXTURE |
| - lima/ci: add some ARB_clear_texture piglit tests to lima-fails.txt |
| - d3d12/ci: add piglit arb_clear_texture-integer fail to CI expectations |
| - nir: add options to lower y_vu, yv_yu, yx_xvxu and xy_vxux |
| - gallium/st: add support for PIPE_FORMAT_NV21 and PIPE_FORMAT_G8_B8R8_420 |
| - mesa/main: add PIPE_FORMAT_YVYU and PIPE_FORMAT_R8B8_R8G8 |
| - mesa/main: add PIPE_FORMAT_VYUY and PIPE_FORMAT_B8R8_G8R8 |
| - freedreno/ci: add KHR-GL46.buffer_storage.map_persistent_flush to flakes |
| - egl: reenable partial redraw with a warning when using gallium hud |
| |
| Iván Briano (24): |
| |
| - anv: Remove dead parameters from copy_fast_clear_dwords |
| - anv: make anv_can_fast_clear_color_view more generally available |
| - anv: factor out code for ccs_op and mcs_op |
| - anv: expose some helper functions |
| - anv: support fast color clears on vkCmdClearAttachments |
| - anv: put EXT_mesh_shader behind an environment variable |
| - anv: enable graphics pipeline libraries by default |
| - hasvk: avoid assert due to unsupported format |
| - anv: enable the GPL feature based on whether the extension is supported |
| - vulkan/wsi: fix double free on error condition |
| - anv: do not explode on 32 bit builds |
| - anv: update conformanceVersion |
| - anv: flush data cache before emitting availability |
| - anv: ensure CFE_STATE is emitted for ray tracing pipelines |
| - iris: ensure mesh is disabled on context init |
| - anv: ensure mesh is disabled on context init |
| - anv: implement Wa_14019750404 |
| - blorp: fix hangs with mesh enabled |
| - anv: use a simpler MUE layout for fast linked libraries |
| - anv: track what kind of pipeline a fragment shader may be used with |
| - intel/fs: read viewport and layer from the FS payload |
| - intel/fs: handle URB setup for fast linked mesh pipelines |
| - anv: enable VK_EXT_mesh_shader where supported |
| - intel/fs: use ffsll so we don't explode on 32 bits |
| |
| James Glanville (7): |
| |
| - pvr: Improve support for image clears |
| - pvr: Fix vtxin special var allocation count |
| - pvr: Fix image to buffer copies |
| - pvr: Fix incorrect PBE packmode for S8_UINT |
| - pvr: Adjust clear's region clip words |
| - pvr: Fix seg fault on unused ds attachment |
| - pvr: Fix deferred_control_stream_flags |
| |
| James Knight (1): |
| |
| - meson: ensure i915 Gallium driver includes Intel sources |
| |
| Janne Grunau (4): |
| |
| - asahi: Fix typo in debug/error message helper macro |
| - asahi: Free low VA BOs correctly |
| - st/mesa: Set gl_config.floatMode based on color_format |
| - asahi,agx: Fix stack buffer overflow in agx_link_varyings_vs_fs |
| |
| Jarred Davies (3): |
| |
| - pvr: Don't ralloc build context from compiler |
| - pvr: Use vk_device's enabled features struct |
| - pvr: Reduce free list initial size when multiple devices are created |
| |
| Jesse Natalie (133): |
| |
| - d3d12: Remove #if D3D12_SDK_VERSION blocks now that 610 is required |
| - microsoft/clc: Remove #if D3D12_SDK_VERSION blocks now that 610 is required |
| - dzn: Remove #if D3D12_SDK_VERSION blocks now that 610 is required |
| - util: Delete Offset() macro from u_memory.h |
| - d3d12: Respect buffer offsets for sampler views |
| - d3d12: Support blit texture uploads |
| - spirv2dxil: Lower quad ops in non-fragment/compute stages |
| - dzn: Remove driconf for quad ops in vertex stages |
| - dzn: Add physical device arg to format lookup |
| - dzn: Support dynamic depth bias via command list instead of PSO |
| - dzn: Use narrow quadrilateral lines when supported |
| - dzn: Support aniso-with-point-mip samplers |
| - dzn: Align-up heap sizes when allocating memory |
| - ci/windows: Update Agility SDK to 1.610.2 |
| - dzn: Use unrestricted copy alignments when available |
| - dzn: Handle opaque BC1 |
| - dzn: Handle depth bias for point fill mode emulation |
| - dzn: Re-design custom buffer descriptors |
| - ci/dzn: Run almost the full CTS |
| - dzn: Expose core VK1.1 extensions that aren't optional |
| - dzn: Expose core VK1.2 extensions that aren't optional |
| - meson: Don't use masm with VS backend |
| - spirv2dxil: Mark SSBO reads for bindless as CAN_REORDER |
| - microsoft/compiler: Unroll loops in opt passes |
| - dzn: Fix UBO descriptors pointing to the end of the buffer |
| - dzn: Hook up subgroup size to compute shader compilation |
| - dzn: Ensure sample-rate shading is factored into nir hash |
| - dzn: Use the nir hash as an input to the dxil hash |
| - dzn: Ensure subgroup size control is factored into pipeline hash |
| - dzn: Ensure bindless is factored into pipeline/nir hash |
| - dzn: Augment blit resolve to support min/max/sample-zero modes |
| - dzn: Support all available depth/stencil resolve modes |
| - dzn: Support separate depth/stencil resolves via blits |
| - dzn: Delete queue-level event waits |
| - ci/windows: Pick up WARP 1.0.6 NuGet with lots of dzn fixes |
| - dzn: Use A4B4G4R4 instead of B4G4R4A4 when available |
| - spirv2dxil: Lower large temps to scratch |
| - microsoft/compiler: Avoid integer divides by 0 |
| - dzn: Run nir_opt_remove_phis before nir_lower_returns |
| - dzn/ci: Remove 'exclude' for graphicsfuzz cases |
| - microsoft/compiler: Allocate space for I/O and viewID dependency tables before instruction processing |
| - microsoft/compiler: Do basic I/O analysis for dependency tables |
| - spirv2dxil: Support int64 and doubles |
| - d3d12: Convert from D3D shader model to Mesa shader model earlier |
| - dzn: Enable 64-bit ints and floats |
| - microsoft/compiler: Take inputs from callers before providing nir options |
| - microsoft/compiler: Enable packed dot product intrinsics for SM6.4+ |
| - dzn: Enable KHR_shader_integer_dot_product |
| - nir_lower_system_values: Add ASSERTED to assert-only variable |
| - nir: Load/store atomic op indices when lowering image intrinsics |
| - microsoft/compiler: Remove alu type info from store_dest() |
| - microsoft/compiler: Duplicate some SSA values to simplify SSA typing |
| - microsoft/compiler: Back-propagate type requirement information |
| - dxil: Use unified atomics |
| - vulkan: Win32 sync import/export support |
| - dzn: Don't zero an output struct that can have pNext |
| - dzn: Finish implementing KHR_synchronization2 |
| - dzn: Dedicated resource cleanup |
| - dzn: External Win32 memory extension |
| - dzn: External Fd memory extension |
| - dzn: Hook up win32 semaphore import/export |
| - dzn: Hook up fd semaphore import/export |
| - docs: Update list of extensions implemented by dzn |
| - glsl: Delete dead intrinsics |
| - microsoft/compiler: Better and simpler bitcast reduction |
| - dzn: Add a no-bindless debug flag |
| - dzn: Fix inverted assert |
| - dzn: Partial revert of 8887852d |
| - dzn: Don't expose copy queues |
| - dzn: Fix src/dest confusion for some non-bindless descriptor copies |
| - wsi/win32: Handle acquiring an image while one is already acquired |
| - nir_lower_returns: Optimize phis before beginning the pass |
| - nir: Add undef phi srcs when adding successors |
| - radv: Don't run opt_remove_phis before lower_returns |
| - dxil: Don't run opt_remove_phis before lower_returns |
| - ci/windows: Update WARP to 1.0.7 |
| - microsoft/compiler: Enable emitting type info for textures with <4 comps |
| - microsoft/compiler: Add a pass to assign image formats based on number of components |
| - spirv2dxil: Assign formats to image vars before lowering to bindless |
| - microsoft/compiler: Use image formats to determine texture types |
| - ci/windows: Update WARP to 1.0.7.1 |
| - nir_opt_algebraic: Don't shrink 64-bit bitwise ops if pack_split is going to be lowered |
| - nir: Add preserve_mediump as a shader compiler option |
| - microsoft/compiler: Always set support_16bit_alu |
| - microsoft/compiler: Handle mediump |
| - spirv2dxil: Enable mediump |
| - dzn: Don't lower away mediump |
| - microsoft/compiler: Fix the int->uint pass for arrayed I/O |
| - microsoft/compiler: Fix usage of type var in semantic asserts |
| - microsoft/compiler: Viewport/layer as input to GS/HS needs to set feature bit |
| - d3d12: Support PIPE_CAP_VS_LAYER_VIEWPORT |
| - dzn: Don't create D3D objects for secondary command buffers |
| - dzn: Fix incremental binding of VBs |
| - d3d12: Fully initialize UAV desc for null SSBOs |
| - dzn: Don't support VK R4G4B4A4_UNORM_PACK16 unless we have B4G4R4A4 |
| - nir_opt_constant_folding: Fix nir_deref_path leak |
| - nir: Add is_null_constant to nir_constant |
| - vtn: Set is_null_constant |
| - nir_split_struct_vars: Support more modes and constant initializers |
| - nir: Allow atomics as non-complex uses for var-splitting passes |
| - nir_lower_ubo_vec4: Delete an invalid assert |
| - nir_lower_mem_access_bit_sizes: Add a bit_size input to the callback |
| - nir_lower_mem_access_bit_sizes: Move options into a struct |
| - nir_lower_mem_access_bit_sizes: Support unaligned stores via a pair of atomics |
| - nir: Fix constant expression for unpack_64_4x16 |
| - nir: Optimize unpacking 16 bit values that were originally packed |
| - microsoft/clc: Try harder to optimize memcpys before lowering them |
| - microsoft/clc: Fix progress reporting for some lowering |
| - microsoft/compiler: Support vec/struct const vals |
| - microsoft/compiler: Improvements to constant -> shader_temp pass used for CL |
| - microsoft/compiler: Add some more lowering passes for derefs |
| - microsoft/compiler: Emit const accesses as load_deref |
| - microsoft/compiler: Use mem_constant instead of shader_temp for consts |
| - microsoft/compiler: Un-lower shared/scratch to derefs |
| - spirv2dxil: Don't lower shared/temp to explicit I/O |
| - microsoft/compiler: Support load_ubo_vec4 |
| - dxil: Don't generate load_ubo_dxil directly |
| - dxil: Delete load_ubo_dxil intrinsic |
| - microsoft/compiler: Don't lower bit sizes for movs |
| - microsoft/compiler: Don't over-align raw buffer load/store intrinsics |
| - dxil: Remove custom SSBO lowering |
| - nir_lower_returns: Mark assert-only var as ASSERTED |
| - dzn: Ignore export access parameters |
| - dzn: Inline D3D12 device creation in physical device creation |
| - dzn: Use common GetPhysicalDeviceFeatures2 |
| - dzn: Remove dynamic check for block-compressed support |
| - dzn: Fix multisample counts in device limits |
| - dzn: Align placed footprints used when copying linear <-> optimal for BC formats |
| - dzn: VK_EXT_external_memory_host |
| - radv: Fix label name |
| - microsoft/clc: Fix usage of nir_builder_at |
| - ci/windows: Re-enable Windows builds |
| - d3d12: Fix indexing of local_reference_state |
| |
| Jiadong Zhu (1): |
| |
| - ac: enable SHADOW_GLOBAL_CONFIG for preemptible ib |
| |
| Jianxun Zhang (8): |
| |
| - iris: Fix memory alignment when importing dmabuf (GFX12.5) |
| - include/uapi: Update drm_fourcc.h from drm kernel |
| - intel/isl: Add MTL RC CCS modifier into modifier info |
| - iris: Support I915_FORMAT_MOD_4_TILED_MTL_RC_CCS modifier |
| - intel/isl: Add MTL RC CCS CC modifier into modifier info |
| - iris: Support I915_FORMAT_MOD_4_TILED_MTL_RC_CCS_CC modifier |
| - intel/isl: Add MTL MC CCS modifier into modifier info |
| - iris: Support MTL modifier MC_CCS |
| |
| Jonathan Gray (1): |
| |
| - intel/dev: remove dg2 0x5698 pci id |
| |
| Jordan Justen (26): |
| |
| - intel/compiler/gfx12.5+: Lower 64-bit cluster_broadcast with 32-bit ops |
| - mesa/main: Exit early when trying to create an unsupported context API |
| - iris: Flush untyped dataport cache when HDC flush is requested on compute |
| - iris: Flush untyped dataport cache DC flush is requested on compute |
| - anv: Clear untyped dataport cache flush bit if not in GPGPU mode |
| - anv: Flush untyped dataport cache when HDC flush is requested on compute |
| - anv: Flush untyped dataport cache DC flush is requested on compute |
| - intel/devinfo: Add has_set_pat_uapi |
| - intel/devinfo: Define PAT indices used on MTL |
| - iris/bufmgr: Add iris_pat_index_for_bo_flags() |
| - iris/bufmgr: Skip bucket allocation if not using writeback cache PAT index |
| - iris: Map aux-map with WC on MTL+ (has_set_pat_uapi) |
| - drm-uapi/i915_drm.h: Update from drm-next (2023-06-09) |
| - iris: Use set PAT extension on BO creation for MTL |
| - anv: Use set PAT extension on BO creation for MTL |
| - intel/devinfo/i915: Set has_set_pat_uapi for MTL+ |
| - intel/genxml: Add COMPCS0 aux-table registers |
| - anv: Program compute aux-map base address during queue init |
| - anv: Use correct CCS0 aux-map register offset in pipe flush |
| - isl: Add ISL_SURF_USAGE_STREAM_OUT_BIT |
| - anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS |
| - isl/dev: Add uncached MOCS value |
| - isl: Set MOCS to uncached for MTL stream-out |
| - intel/dev: Use RPL-U name on RPL-U devices |
| - intel/dev: Add more RPL PCI IDs |
| - intel/dev: Update device string for MTL PCI ID 0x7d55 |
| |
| Joshua Ashton (8): |
| |
| - radv: Do not enable robustness for push constants with robustBufferAccess2 |
| - radv: Refactor buffer robustness to an enum |
| - radv: Rename radv_nir_compiler_options::robust_buffer_access to robust_buffer_access_llvm |
| - radv: Split and move buffer robustness to shader key |
| - radv: Rename radv_required_subgroup_info to radv_shader_stage_key |
| - radv: Implement VK_EXT_pipeline_robustness |
| - radv: Advertise VK_EXT_pipeline_robustness |
| - radv: Remove unused pipeline param from radv_generate_pipeline_key |
| |
| Joshua Watt (2): |
| |
| - drm-shim: Set file type in readdir() |
| - drm-shim: Use anonymous file for file override |
| |
| José Fonseca (2): |
| |
| - wgl: Fix unintentional assignment on assert. |
| - wgl: Remove needless \`if (1) { ... }`. |
| |
| José Roberto de Souza (29): |
| |
| - iris: Move i915 batch destroy logic to iris_i915_destroy_batch() |
| - iris: Initialize batch screen in iris_init_batch() |
| - iris: Move iris_batch i915 specific variables to union |
| - iris: Create, destroy and replace Xe engines |
| - iris: Implement batch_check_for_reset() in Xe kmd backend |
| - iris: Set priority to Xe engines |
| - iris: Fix close of exported bos |
| - intel/common: Add gt_id to intel_engine_class |
| - iris: Implement batch_submit() in Xe kmd backend |
| - iris: Fix vm bind of imported bos from other GPUs |
| - build: Add Iris and ANV to ARM's auto-generated drivers |
| - anv: Take into consideration physical device max heap size to set maxStorageBufferRange |
| - iris: Allow shared scanout buffer to be placed in smem as well |
| - iris: Add a function to return allocated bo mmap mode |
| - iris: Add function to return mmap mode for userptr bos |
| - iris: Add function to return mmap mode for aux map |
| - anv: Set memory types supported by Xe KMD |
| - anv: Fix ANV_BO_ALLOC_NO_LOCAL_MEM flag |
| - anv: Nuke ANV_BO_ALLOC_WRITE_COMBINE |
| - iris: Fix return of xe_batch_submit() when exec fails |
| - iris: Replace aperture_bytes by sram size in iris_resource_create_for_image() for PIPE_USAGE_STAGING |
| - intel: Fix support of kernel versions without DRM_I915_QUERY_ENGINE_INFO |
| - iris: Attach a dma-buf to bo flink |
| - iris: Implement external object implicit syncronization for Xe kmd |
| - anv: Fix compute maximum number of threads value |
| - anv: Fix some mismatches of canonical and regular addresses around anv_bo_vma_alloc_or_close() |
| - anv: Drop unnecessary intel_canonical_address() call around anv_address_physical() |
| - anv: Drop unnecessary intel_canonical_address() calls around bo->offset |
| - iris: Convert slab address to canonical |
| |
| Juan A. Suarez Romero (34): |
| |
| - v3d: set depth compare function correctly |
| - v3d: use primitive type to get stream output offset |
| - v3d/ci: annotate failure |
| - v3dv/ci: rename waiver test |
| - v3d: add support for ARB_texture_cube_map_array |
| - v3d/ci: enable glsl 1.30 and 1.40 piglit tests |
| - v3d: apply 1D texture miplevel alignment in arrays |
| - v3d/ci: update neverball-v2 trace reference |
| - vc4/ci: skip unsupported test versions |
| - vc4/ci: disable VC4 jobs |
| - v3d: add per hw-version caller macro |
| - v3d: upgrade V3D 4.1 to 4.2 version |
| - v3d: apply proper clamping when setting up RT |
| - v3d/ci: annotate failures |
| - vc4/ci: re-enable VC4 testing |
| - v3d: delay offset/counter values with primitive restart |
| - v3d/ci: run GPU piglit profile |
| - v3d/ci: make traces test mandatory |
| - v3d: enable NIR compact arrays |
| - vc4: set blit mask correctly |
| - vc4: call blit paths in chain |
| - vc4: allow tile-based blit for Z/S |
| - vc4: add specific stencil blit path |
| - v3d/v3dv/ci: adjust job fractions |
| - v3dv/vc4/ci: update expected results |
| - v3d/ci: update traces |
| - v3d: Z/S blit require Z/S formats |
| - broadcom/ci: update expected results |
| - v3d: handle samplerExternalOES |
| - broadcom/ci: update expected results |
| - gallium/util: fix color clamp for alpha-only formats |
| - v3d: clear alpha-only as red-only |
| - vc4/v3d/ci: update expected results |
| - v3d/ci: add new flake |
| |
| Julia Tatz (7): |
| |
| - zink: Implement PIPE_CAP_OPENCL_INTEGER_FUNCTIONS and PIPE_CAP_INTEGER_MULTIPLY_32X16. |
| - zink: Implement PIPE_CAP_RESOURCE_FROM_USER_MEMORY |
| - zink: fix layout(local_size_variable) for vk1.3+ |
| - zink/ci: update expected results |
| - aux/trace: fix (u)int dump |
| - gallium/dri: fix dri2_from_names |
| - aux/trace: fix set_hw_atomic_buffers method name |
| |
| Julia Zhang (1): |
| |
| - virgl: remove check of VIRGL_CAP_V2_UNTYPED_RESOURCE |
| |
| Julian Hagemeister (1): |
| |
| - Gallium: Fix shared memory segment leak |
| |
| Juston Li (24): |
| |
| - venus: use pipelineCacheUUID for shader cache id |
| - venus: filter out queue familes with exclusive sparse binding support |
| - venus: add helper function support for VkBindSparseInfo |
| - venus: add back sparse binding support |
| - venus: enable sparse binding features |
| - venus: enable sparse binding properties |
| - venus: sync to latest protocol header from v1.3.252 |
| - venus: sync protocol for multiple extensions for zink |
| - venus: enable VK_EXT_non_seamless_cube_map |
| - venus: enable VK_EXT_dynamic_rendering_unused_attachments |
| - venus: enable VK_KHR_shader_clock |
| - venus: enable VK_EXT_border_color_swizzle |
| - venus: enable VK_EXT_fragment_shader_interlock |
| - venus: enable VK_EXT_shader_subgroup_ballot |
| - venus: enable VK_EXT_color_write_enable |
| - docs: venus: update extension support |
| - radv: fix incorrect size for primitives generated query |
| - venus: factor out flush barrier cmd |
| - venus: expose vn_feedback_buffer_create() |
| - venus: add query pool feedback cmds |
| - venus: track viewMask |
| - venus: track render pass |
| - venus: batch query feedback and defer until after render pass |
| - venus: use feedback for vkGetQueryPoolResults |
| |
| Karmjit Mahil (44): |
| |
| - pvr: Add missing includes in pvr_common.h |
| - pvr: Implement vkCmdUpdateBuffer(). |
| - pvr: Implement simple internal format v2 transfer paths. |
| - pvr: Add deferred RTA clears for cores without gs_rta_support. |
| - pvr: Finish pvr_perform_start_of_render_attachment_clear(). |
| - pvr: Collect vertex input data and fill info struct. |
| - pvr: Fix a comment in the PDS code |
| - pvr: Fix typo in PDS function name |
| - pvr: Add handling for missing entries in pvr_setup_vertex_buffers() |
| - pvr: Handle special built-in variable loading in vertex shader |
| - pvr: Add PVR_DW_TO_BYTES() |
| - pvr: Fix pvr_csb_bake() list return. |
| - pvr: Change push_constants_shader_stages to type pvr_stage_allocation |
| - pvr: Fix static assert check |
| - pvr: Fix unaligned VDMCTRL_PDS_STATE1 data address |
| - pvr: Don't advertise S8_UINT support |
| - pvr: Fix cs corruption in pvr_pack_clear_vdm_state() |
| - pvr: Add missing NULL checks in some vkDestroy...() functions |
| - pvr: Use original binding numbers instead of reassigning |
| - pvr: Remove custom status in command buffer |
| - pvr: Fix missing invalidation of the command buffer |
| - pvr: Fix possible allocation of 0 size |
| - pvr: Fix vk_free() in vkCreateRenderPass2() error path |
| - pvr: Use the suballocator for queries |
| - pvr: Add pvrsrvkm sync prim set bridge call |
| - pvr: Move pvrsrv sync prim code into new pvr_srv_sync_prim.{c,h} |
| - pvr: Use idalloc as the allocator for sync prims |
| - pvr: Handle barrier load and store flags. |
| - pvr: Fix typo causing seg faults copying immutable samplers |
| - pvr: Fix draw indirect page faults due to missing index list buffer |
| - pvr: Rename temps_count to pds_temps_count |
| - pvr: Fix PDS temps allocation on fragment stage |
| - pvr: HWRT creation simplifications. |
| - pvr: Dedup a check with pvr_is_render_area_tile_aligned() |
| - pvr: Remove outdated finishme |
| - pvr: Fix seg fault on empty descriptor set |
| - pvr: Fix dynamic offset patching |
| - pvr: Fix csb control stream extension |
| - pvr: Fix missing BITFIELD_BIT for winsys frag job flag |
| - pvr: Change winsys flag defines to bitfields |
| - pvr: Setup ZLS depth and stencil load/store separately |
| - docs: Add inital PowerVR driver documentation |
| - pvr: Fix \`for` loop itarator usage |
| - pvr: Fix dynamic desc offset storage |
| |
| Karol Herbst (140): |
| |
| - rusticl: rework CLVec helper function to calculate bounds |
| - rusticl/mem: fix Mem::copy_rect |
| - rusticl/mem: replace buffer_offset_size with CLVec::calc_offset_size |
| - gallium: correctly name the flags of svm_migrate |
| - rusticl/context: add helper to get the max mem alloc size for all devices |
| - rusticl/memory: Rework mapping of memory located in system RAM |
| - rusticl/mem: add get_parent helper |
| - rusticl: add support for fine-grained system SVM |
| - nv50/ir: ignore CL system values |
| - nouveau: allow to enable SVM without having to enable CL |
| - nouveau: nouveau_copy_buffer can deal with user_ptrs just fine |
| - rusticl/event: drop work item before updating status |
| - rusticl: add create_pipe_box to better deal with pipe_box restrictions |
| - rusticl/mem: more region and origin validation |
| - radeonsi: lower mul_high |
| - ac/llvm: support shifts on 16 bit vec2 |
| - rusticl: don't set size_t-is-usize for >=bindgen-0.65 |
| - rusticl/device: improve advertisement of fp64 support |
| - rusticl/platform: make the initialization more explicit |
| - rusticl/platform: extract env variable parsing from Platform::init |
| - rusticl/platform: add RUSTICL_FEATURES boilerplate |
| - rusticl/device: allow enablement of fp64 via RUSTICL_FEATURES |
| - rusticl/program: rework dynamic Program state |
| - rusticl/program: use if let to get rid of an unwrap in build |
| - clc: free kernel args in clc_free_kernels_info |
| - rusticl/nir: finish blob after serializing |
| - nvc0: do not randomly emit fences. |
| - nv50/ir: Use unified atomics |
| - Reviewed-by: Nora Allen <blackcatgames@protonmail.com>rusticl/platform: make the extension array a static |
| - rusticl/device: use PLATFORM_EXTENSIONS as a template for filling extensions |
| - rusticl/platform: advertise byte_addressable_store |
| - rusticl/device: split add_ext in fill_extensions |
| - rusticl: explicitly state supported SPIR-V extensions |
| - rusticl/platform: generate extension constants via macro |
| - rusticl/spirv: skip printing info messages |
| - rusticl/device: limit MAX_PARAMETER_SIZE to 32k |
| - rusticl/device: set preferred vector size of doubles if fp64 is enabled |
| - nv50/ir: convert to scoped_barrier |
| - doc/rusticl: add Rust Update Policy |
| - rusticl: bump rust req to 1.60 |
| - rusticl/event: flush queues from dependencies |
| - ci: add and use clippy for rusticl |
| - rusticl: fix clippy errors on image_slice_pitch change to usize |
| - clc: relax spec constant validation |
| - rusticl: add proc macro module for generating API stubs |
| - rusticl/icd: make release return nothing |
| - rusticl/icd: use new proc macros |
| - ac/llvm: support vec2 on b2i16 |
| - ac/llvm: replace MESA_SHADER_COMPUTE checks with gl_shader_stage_is_compute |
| - ac/llvm: set +cumode for radeonsi |
| - lp: align memory for long16 CL types |
| - rusticl/icd: fix ReferenceCountedAPIPointer::from_ptr for NULL pointers |
| - rusticl/api: remove some repr(C) |
| - rusticl/event: ensure even status is updated in order |
| - docs: improve OpenCL features |
| - rusticl/queue: overhaul of the queue+event handling |
| - rusticl: enforce using unsafe blocks in unsafe functions |
| - nv50/ir: use override |
| - nv50/ir: resolve -Woverloaded-virtual=1 warnings |
| - clc: add commment to clc_optional_features to ensure no padding exists |
| - rusticl/spirv: Key optional clc features when caching. |
| - clc: static assert that clc_optional_features has no padding |
| - nouveau: eliminate busy waiting on fences |
| - rusticl/device: add intel usm queries DPCPP cares about |
| - rusticl/device: sort cl_device_info queries |
| - rusticl/version: use cl_version instead of cl_uint and provide a From impl |
| - rusticl: advertize cl_khr_extended_versioning |
| - docs/cl: fix whitespace issues and add missing entries |
| - rusticl: advertize cl_khr_spirv_no_integer_wrap_decoration |
| - docs/cl: improve reporting of image features |
| - rusticl/mem: cache the pipe_format |
| - rusticl/mem: fix validation of packed image formats |
| - rusticl/format: pass order and type to rusticl_image_format directly |
| - rusticl/format: extract CL format to pipe format mapping into const function |
| - rusticl/format: extract required format checks into const functions |
| - rusticl/format: drop req_for_3d_image_write_ext |
| - rusticl/format: add required format table for CL2.0 |
| - rusticl/format: document cl to pipe format mapping |
| - rusticl/format: move format table generation into a macro |
| - rusticl/format: enable all trivial to support optional image formats |
| - clc: fix SPIRVMessageConsumer for NULL src |
| - clc: allow passing custom validator options |
| - rusticl/program: pass our max param size along to the spirv validator |
| - compiler/types: fix size of padded OpenCL Structs |
| - rusticl/device: rename doubles to fp64 and long to int64 |
| - rusticl: experimental support for cl_khr_fp16 |
| - rusticl: add ld_args_gc_sections |
| - rusticl: specify which symbols to export |
| - rusticl: stop linking with libgalliumvl |
| - rusticl/device: create helper context before loading libclc |
| - nir/load_libclc: run some opt passes for everybody |
| - docs: document CLC_DEBUG |
| - rusticl/program: add debugging for OpenCL C compilation |
| - rusticl/program: add debugging option to disable SPIR-V validation |
| - nvc0: fix printing shaders |
| - nv50/ir/nir: set numBarriers if we emit an OP_BAR |
| - rusticl: structurize and reorder mesa binding args |
| - rusticl: generate bindings for build-id stuff |
| - rusticl/meson: extract common bindgen rust args |
| - rusticl/mesa: create proper build-id hash for the disk cache |
| - rusticl: bump bindgen requirement |
| - rusticl/program: skip linking compiled binaries |
| - docs/rusticl: mark building section as such |
| - docs/rusticl: add Enabling section |
| - docs/cl: remove cl_khr_byte_addressable_store from extension list. |
| - docs/cl: move vec3 support under OpenCL C 1.1 |
| - docs/cl: timer sync is implemented |
| - docs: add missing get_compute_state_info documentation |
| - vtn: more CL subgroups |
| - clc: rework optional subgroup feature |
| - llvmpipe: report the proper subgroup size |
| - gallium: add simd_sizes to pipe_compute_state_object_info |
| - gallium: add get_compute_state_subgroup_size |
| - gallium: add PIPE_COMPUTE_CAP_MAX_SUBGROUPS |
| - iris: implement get_compute_state_subgroup_size |
| - rusticl/util: add an Iterator to iterate over set bits in an integer |
| - rusticl/util: add div_round_up |
| - rusticl/device: rework subgroups to subgroup_sizes |
| - gallium: change PIPE_COMPUTE_CAP_SUBGROUP_SIZE to a bitfield of sizes |
| - rusticl: deal with compute_param returning 0 |
| - rusticl: support subgroups |
| - nvc0: backport fp helper invocation fix to 2nd gen Maxwell+ |
| - rusticl/kernel: silence newer clippy warning |
| - rusticl: Replace &Arc<Device> with &Device |
| - rusticl/device: make it &'static |
| - api/icd: drop static lifetime from \`get_ref` return type |
| - nvc0: initial Ada enablement |
| - rusticl: fix warnings with newer rustc |
| - nv50/ir/nir: fix txq emission on MS textures |
| - nv50/ir/nir: Fix zero source handling of tex instructions. |
| - rusticl/kernel: only handle function_temp memory before lowering printf |
| - n50/compute: submit initial compute state in nv50_screen_create |
| - nv50: fix code uploads bigger than 0x10000 bytes |
| - nouveau: take glsl_type ref unconditionally |
| - nv50: limit max code uploads to 0x8000 |
| - clc: use CLANG_RESOURCE_DIR for clang's resource path |
| - zink: fix source type in load/store scratch |
| - zink: fix global stores |
| - rusticl/disk_cache: fix stack corruption |
| - rusticl/memory: do not verify pitch for IMAGE1D_BUFFER |
| |
| Kenneth Graunke (17): |
| |
| - intel/compiler: UNDEF comparisons with smaller than 32-bit |
| - intel/compiler: UNDEF SubgroupInvocation's register |
| - intel/compiler: Fold constants after distributing source modifiers |
| - nir: Add a variant of nir_lower_int64 for float conversions only |
| - intel/compiler: Postpone most int64 lowering to brw_postprocess_nir |
| - nir: Add find_lsb lowering to nir_lower_int64. |
| - intel/compiler: Fix 64-bit ufind_msb, find_lsb, and bit_count |
| - nir: Assert that we don't shrink bit-sizes in nir_lower_bit_size() |
| - intel/compiler: Fix a fallthrough in components_read() for atomics |
| - intel/genxml: Drop Tiled Resource Mode fields |
| - intel: Initialize FF_MODE2 on all Gfx12 platforms |
| - iris: Allocate coherent buffers for resources flagged as persistent/coherent |
| - isl: Don't set "Enable Unorm Path in Color Pipe" on Alchemist |
| - intel/genxml: Fix gen_sort_tags.py to handle mbz/mbo |
| - intel/genxml: Update RENDER_SURFACE_STATE Fields |
| - iris: Re-emit 3DSTATE_DS for each primitive (workaround 14019750404) |
| - iris: Check prog[] instead of uncompiled[] for BLORP state skipping |
| |
| Kiskae (1): |
| |
| - vulkan/wsi: check for dri3 buffer initialization failure |
| |
| Konrad Dybcio (2): |
| |
| - freedreno: Add some A6/7xx registers |
| - freedreno: Partially decode CP_PROTECT_CNTL |
| |
| Konstantin Kharlamov (1): |
| |
| - loader/dri3: temporarily work around a crash when front is NULL |
| |
| Konstantin Seurer (133): |
| |
| - nir/lower_fp16_casts: Fix SSA dominance |
| - nir/lower_io: Emit less iadd(x, 0) |
| - nir: Make rq_load committed src an index |
| - radv: Stop running constant folding during ray query lowering |
| - radv/ci: Test ray tracing pipelines |
| - gallium/nir: Handle unified atomics in nir_to_tgsi_info |
| - nir/inline_uniforms: Handle num_components > 1 |
| - nir/lower_shader_calls: Remat derefs earlier |
| - radv: Stop using radv_get_int_debug_option |
| - treewide: Add a .clang-format file |
| - amd: Use the Mesa base style |
| - asahi: Use the Mesa base style |
| - freedreno: Use the Mesa base style |
| - d3d12: Use the Mesa base style |
| - i915: Use the Mesa base style |
| - r600/sfn: Use the Mesa base style |
| - panfrost: Use the Mesa base style |
| - util/perf: Use the Mesa base style |
| - venus: Use the Mesa base style |
| - asahi: Reformat using the new style |
| - panfrost: Reformat using the new style |
| - gallivm: Fix gather/scatter types for newer llvm |
| - radv/rt: Fix pipeline libraries |
| - gallivm: Fix anisotropic sampling with num_mips=1 |
| - gallivm: Cast read_first_invocation source to an int |
| - llvmpipe: refactor out the pipe->lp_jit structure fillers. |
| - llvmpipe: Add lp_storage_image_format_supported |
| - llvmpipe: Add lp_storage_render_image_format_supported |
| - gallivm: Add lp_build_nir_sample_key |
| - gallivm: Add lp_img_op_from_intrinsic |
| - gallivm: Handle invalid image format/op combinations |
| - gallivm: Zero initialize param structs |
| - radv/rt: Do not guard the raygen shader |
| - radv/rt: Clear NIR metadata after lowering the ABI |
| - aco/rt: Do not initialize the next shader addr |
| - radv/ci: Test ray tracing on vkd3d-proton |
| - radv/rt: Stop forcing wave32 by setting compute_subgroup_size |
| - Revert "radv: Enable ray tracing pipelines by default" |
| - radv/rt: Enable RT pipelines on GFX10_3+ excluding vangogh |
| - radv: Move the shader type to radv_shader_info |
| - radv: Adjust the traversal shader description |
| - radv: Use get_shader_from_executable_index for executable properties |
| - radv: Implement executable properties for ray tracing stages |
| - radv: Use _mesa_shader_stage_to_string for executable name |
| - radv/rt: Store the prolog outside the shaders array |
| - radv: Call radv_pipeline_init_scratch per shader |
| - meson: Add a xcb-keysyms dependency |
| - vulkan: Common trace capturing infrastructure |
| - radv: Add radv_trace_mode |
| - vulkan/wsi/x11: Capture traces using a hotkey |
| - radv/rra: Use common trace trigger |
| - radv/rgp: Use common trace trigger |
| - vulkan/rmv,radv: Use common trace trigger |
| - docs: Update envvars used for tracing |
| - amd: Use nir\_ instead of nir_build\_ helpers |
| - microsoft: Use nir\_ instead of nir_build\_ helpers |
| - intel: Use nir\_ instead of nir_build\_ helpers |
| - freedreno: Use nir\_ instead of nir_build\_ helpers |
| - vtn: Use nir\_ instead of nir_build\_ helpers |
| - nir: Use nir\_ instead of nir_build\_ helpers |
| - nir/builder_opcodes: Remove nir_build\_ prefixed helpers |
| - util: Do not include immintrin.h in half_float.h |
| - radv/rt: Fix caching non-recursive stages |
| - radv/rt: Hash stages using radv_hash_shaders |
| - llvmpipe: Add BDA jit type helpers |
| - gallivm: Add missing includes |
| - gallivm: Add lp_descriptor struct |
| - gallivm: Expose lp_build_sample_soa_code |
| - llvmpipe: Add lp_build_sampler_soa_dynamic_state |
| - llvmpipe: Add lp_build_image_soa_dynamic_state |
| - gallivm: Add LP_IMG_OP_COUNT |
| - gallivm: Expose LP_MAX_TEX_FUNC_ARGS |
| - llvmpipe: Add LP_TOTAL_IMAGE_OP_COUNT |
| - gallivm: Expose lp_build_texel_type |
| - gallivm: Propagate vulkan resources |
| - gallivm: Clamp the texel buffer size |
| - llvmpipe: Pre compile sample functions |
| - gallivm: Add a function for loading vulkan descriptors |
| - gallivm: Implement vulkan UBOs |
| - gallivm: Implement vulkan SSBOs |
| - gallivm: Implement vulkan textures |
| - gallivm: Implement vulkan images |
| - llvmpipe: Disable the linear path when running vulkan |
| - lavapipe: Include llvmpipe |
| - lavapipe: Lower more texture OPs |
| - lavapipe: Make pipeline_lock generic for accessing the queue |
| - lavapipe: Rework descriptor handling |
| - lavapipe: Lower non uniform access |
| - lavapipe: EXT_descriptor_indexing |
| - llvmpipe: Use lp_jit_buffer_from_pipe_const in setup |
| - lavapipe: Make shader compilation thread safe |
| - zink: Increase ZINK_FBFETCH_DESCRIPTOR_SIZE to 280 |
| - zink/ci: Update lavapipe expectations |
| - venus/ci: Update fails |
| - lavapipe/ci: Update CI expectations for new extensions |
| - llvmpipe/ci: Update expectations |
| - nir: Add nir_builder_at |
| - radv: Use nir_builder_at |
| - asahi: Use nir_builder_at |
| - v3d: Use nir_builder_at |
| - glsl: Use nir_builder_at |
| - nir: Use nir_builder_at |
| - spirv: Use nir_builder_at |
| - freedreno: Use nir_builder_at |
| - gallium,st: Use nir_builder_at |
| - crocus: Use nir_builder_at |
| - etnaviv: Use nir_builder_at |
| - r600: Use nir_builder_at |
| - radeonsi: Use nir_builder_at |
| - vc4: Use nir_builder_at |
| - zink: Use nir_builder_at |
| - lavapipe: Use nir_builder_at |
| - microsoft: Use nir_builder_at |
| - panfrost: Use nir_builder_at |
| - intel: Use nir_builder_at |
| - nir/opt_dead_cf: Handle if statements ending in a jump correctly |
| - nir/builder_opcodes: Do not generate empty intrinsic indices |
| - amd: Move ac_hw_stage to its own file |
| - gallivm: Fix atomic_global types |
| - lavapipe: Set the descriptor count to what vkd3d-proton requires |
| - llvmpipe: Allow comparison sampling for float formats |
| - llvmpipe: Allocate more dummy sample functions for FORMAT_NONE |
| - llvmpipe,lavapipe: Relayout lp_descriptor |
| - lavapipe: Always advertise formatless storage image OPs |
| - nir/lower_shader_calls: Remat derefs after shader calls |
| - nir/opt_dead_cf: Run dead_cf_block while it makes progress |
| - nir/opt_dead_cf: Clarify comment |
| - draw: Do not restart the primitive_id at 0 |
| - llvmpipe: Fix compiling with LP_USE_TEXTURE_CACHE |
| - llvmpipe: Zero extend vectors in widen_to_simd_width |
| - vulkan/wsi/x11: Implement capture hotkey using the keymap |
| - radv: Don't use the depth image view for depth bias emission |
| - aco/spill: Make sure that offset stays in bounds |
| |
| Kurt Kartaltepe (1): |
| |
| - drirc: Set limit_trig_input_range option for Nier games |
| |
| Leo Liu (6): |
| |
| - radeonsi: create a new context for transcode with multiple video engines |
| - radeonsi/vcn: AV1 skip the redundant bs resize |
| - radeonsi: Remove redundant vcn_decode from info |
| - amd: Add vcn ip version info |
| - radeonsi: Use vcn version instead of CHIP family for VCNs |
| - radeonsi/vcn: fix the incorrect dt_size |
| |
| Lina Versace (2): |
| |
| - venus: Advertise 1.3 in ICD file |
| - venus: Fix detection of push descriptor set |
| |
| LingMan (4): |
| |
| - rusticl: core: stop using cl_prop from the api module |
| - rusticl: drop CLProp implementation for String |
| - rusticl: drop cl_prop_for_type macro |
| - rusticl: fix UB in CLProp machinery |
| |
| Lionel Landwerlin (185): |
| |
| - docs: add missing MESA_VK_WSI_HEADLESS_SWAPCHAIN variable |
| - vulkan/runtime: discard unused graphics stages in libraries |
| - intel/vec4: force exec_all on float control instruction |
| - anv: enable blorp query reset for performance queries |
| - vulkan/overlay: deal with unknown pNext structures |
| - isl: don't set inconsistent fields for depth when using stencil only |
| - anv: introduce a base graphics pipeline object |
| - anv: move force shading rate writes checks |
| - anv: make input attachments available through bindless |
| - anv: move preprocessing of NIR right before compilation |
| - anv: add dynamic buffer offsets support with independent sets |
| - anv: implement VK_EXT_graphics_pipeline_library |
| - anv: Work around the spec question about pipeline feedback vs GPL. |
| - isl: fix a number of errors on storage format support on Gfx9/12.5 |
| - intel/nir: add options to storage image lowering |
| - anv: drop lowered storage images code |
| - anv: enable shaderStorageImageReadWithoutFormat on Gfx12.5+ |
| - anv: rework Wa_14017076903 to only apply with occlusion queries |
| - intel/tools: add ability to dump out raw kernels data |
| - nir/divergence: add missing load_global_constant_* intrinsics |
| - anv: fix anv_nir_lower_ubo_loads pass |
| - anv: enable shaderUniformBufferArrayNonUniformIndexing |
| - intel/fs: fix per vertex input clamping |
| - nir/lower_non_uniform_access: add get_ssbo_size handling |
| - intel/compiler: make uses_pos_offset a tri-state |
| - vulkan: bump headers to 1.3.249 |
| - spirv: update to latest headers |
| - spirv/nir: wire ray interection triangle position fetch |
| - intel/nir/rt: use a single load for instance leaf loading |
| - intel/nir/rt: wire position fetch intrinsic |
| - anv: implement VK_KHR_ray_tracing_position_fetch |
| - intel/fs: fix scheduling of HALT instructions |
| - anv: remove 48bit address space checks |
| - anv: avoid hardcoding instruction VA constant in shaders |
| - anv: link anv_bo to its VMA heap |
| - anv: make internal address space allocation more dynamic |
| - anv: increase instruction heap to 2Gb |
| - intel/fs: reduce register usage for relocated constants |
| - intel: enable protected context creation along with engines |
| - Revert "intel/compiler: make uses_pos_offset a tri-state" |
| - anv: fixup workaround 16011411144 |
| - intel/mi_builder: fixup tests for newer kernel uAPI |
| - intel: switch over to unified atomics |
| - spirv: fix argument to ray query intrinsic |
| - intel/devinfo: printout on stdout |
| - intel/devinfo: allow -p to take a pci-id in hexa |
| - intel/devinfo: call intel_device_info_init_was only once |
| - anv: put private binding BOs into execlists |
| - anv: mark images compressed for untracked layout/access |
| - gitlab-ci: add capture for i915 error state |
| - anv: defer binding table block allocation to when necessary |
| - anv: assume context isolation support |
| - anv: fix push descriptor deferred surface state packing |
| - intel/fs: fix size_read() for LOAD_PAYLOAD |
| - anv: move timestamp vfunc initialization to genX code |
| - anv: use COMPUTE_WALKER post sync field to track compute work |
| - iris: use COMPUTE_WALKER post sync field to track compute work |
| - intel/fs: make tcs input_vertices dynamic |
| - anv: implement EDS2.extendedDynamicState2PatchControlPoints |
| - iris: rework Wa_14017076903 to only apply with occlusion queries |
| - intel: add alignment helper for aux map |
| - iris: add a comment about aux-tt alignment requirements |
| - anv: update aux-tt alignment requirements for MTL |
| - intel: reduce minimum memory alignment on Gfx12.5 |
| - anv: further reduce pool alignments |
| - anv: opportunistically align VMA to 2Mb |
| - anv: update internal address space to have 4Gb of dynamic state |
| - anv: fix push range for descriptor offsets |
| - intel/fs: reuse descriptor helper |
| - intel/fs: lower get_buffer_size like other logical sends |
| - nir/lower_shader_calls: add ability to force remat of instructions |
| - nir: add a new intrinsic to describe resources accessed on intel |
| - nir: teach nir_chase_binding about resource_intel |
| - nir/opt_gcm: allow resource_intel to be moved anywhere |
| - intel/fs: add a pass to move resource_intel closer to user |
| - intel/fs: teach ubo range analysis pass about resource_intel |
| - intel/fs: keep track of new resource_intel information |
| - intel/fs: enable SSBO accesses through the bindless heap |
| - intel/fs: enable UBO accesses through bindless heap |
| - intel/fs: enable get_buffer_size on bindless heap |
| - intel/fs: enable extended bindless surface offset |
| - intel/fs: enable bindless sampler state offsets |
| - intel/fs: enable uniform block accesses through bindless heap |
| - intel/fs: try to rematerialize surface computation code |
| - anv: remove unused define |
| - anv: fix null descriptor handling with A64 messages |
| - anv: remove incorrect ifdef |
| - anv: bail flush_gfx_state when not gfx push constant is dirty |
| - anv: track pipeline in anv_cmd_pipeline_state |
| - anv: move pipeline active_stages to common structure |
| - anv: increase workaround BO so that we can hold a full 4Kb page of 0s |
| - anv: toggle extended bindless surface state on Gfx12.5+ |
| - docs/anv: some binding table explanations |
| - anv: add an option for using indirect descriptors |
| - anv: introduce a new descriptor set layout type |
| - anv: create a pool for indirect descriptors |
| - anv: reduce push constant size for descriptor sets |
| - anv: new structure to hold surface states |
| - anv: add a pass to partially lower resource_intel |
| - nir: expose a couple of address format add helpers |
| - anv: bound load descriptor mem better |
| - anv: prepare image/buffer views for non indirect descriptors |
| - anv: add support for direct descriptor in allocation/writes |
| - anv: add helpers to build pipeline bindings |
| - anv: handle null surface in the binding table with direct descriptors |
| - anv: factor out dynamic buffer bti emission |
| - anv: implement binding table emission for direct descriptors |
| - anv: simplify ycbcr bti computations |
| - anv: track descriptor data size |
| - anv: add direct descriptor support to apply_layout |
| - anv: bring back the max number of sets to 8 |
| - anv: descriptor binding for direct descriptors |
| - anv: ensure descriptor addresses are used with bindless stages |
| - anv: enable direct descriptors on platforms with extended bindless offset |
| - anv: add support for VK_EXT_dynamic_rendering_unused_attachments |
| - anv: remove unused functions |
| - intel/fs: fix a couple of descriptor mistakes |
| - intel/stub_gpu: add an option to launch valgrind |
| - intel/fs: fix pull-constant-load prior to gfx7 |
| - anv: allow binding tables allocations on compute only queues |
| - intel/nir: switch ray query state tracking to local variables uint16_t |
| - anv: add query tracepoints |
| - anv: deal with unsupported VkImageFormatListCreateInfo::pViewFormats |
| - anv: report max simd width only once for fragment shaders |
| - anv: always report all pipeline stats regardless of stages |
| - anv: only disable mesh when enabled at the VkDevice level |
| - anv: disable mesh/task for generated draws |
| - anv: fix incorrect batch for 3DSTATE_CONSTANT_ALL emission |
| - anv: limit ANV_PIPE_RENDER_TARGET_BUFFER_WRITES to blorp operations using 3D |
| - anv: factor out generation kernel dispatch into helper |
| - anv: add support for simple internal compute shaders |
| - anv: generalize internal kernel concept |
| - anv: add shaders for copying query results |
| - intel/ds: add query count in query tracepoints |
| - anv: enable CmdCopyQueryPoolResults to use shader for copies |
| - intel/fs: fix bindless/shared surface mistake |
| - intel/fs: print identation for control flow |
| - intel/fs: avoid reusing the VGRF for uniform load_ubo |
| - nir: add a new ubo uniform loading intrinsic for intel |
| - intel/fs: make use of load_ubo_uniform_block_intel |
| - nir: add a load_global_constant uniform intel variant |
| - intel/fs: handle load_global_constant_uniform_block_intel |
| - anv: avoid private buffer allocations in vkGetDeviceImageMemoryRequirementsKHR |
| - anv: add missing query clear flush for acceleration structure queries |
| - anv: track buffer writes from shaders for query results writes |
| - anv: change the way we clear pending query bits |
| - anv: fix pending query bits for compute only command buffers |
| - anv: tracking query buffer writes & query clears separately |
| - anv: switch copy query results with shaders from semaphore waits to flushes |
| - vulkan: registry/headers bump to 1.3.254 |
| - vulkan/runtime: add support for EXT_depth_bias_control |
| - anv: add VK_EXT_depth_bias_control support |
| - isl: assert on gfx6 condition that should not be met |
| - isl: assert on gfx7 condition that should not be met |
| - isl: assert on gfx8 condition that should not be met |
| - isl: add surface creation reporting mechanism |
| - anv: align buffers to a cache line |
| - anv: fix utrace batch allocation |
| - genxml: enable decoding on compute engine |
| - intel/aubinator_error_decode: add ccs support |
| - anv: look into batch bo reloc list looking for BOs to decode |
| - anv: implement storage image depth query using descriptor buffer read |
| - Revert "isl: Set Depth to array len for 3D storage images" |
| - docs/features: update anv entries |
| - intel/fs: disable coarse pixel shader with interpolater messages at sample |
| - nir/opt_shrink_vectors: enable sparse intrinsics shrinking |
| - docs/features: add more missing extensions |
| - docs/features: add hasvk entries |
| - zink: update profile vulkan version requirements |
| - zink: drop linear D32_SFLOAT_S8_UINT requirement |
| - anv: fix utrace signaling with Xe |
| - intel/fs: fix missing predicate on SEL instruction |
| - intel/fs: don't try to rebuild sequences of non ssa values |
| - anv: fix 3DSTATE_RASTER::APIMode field setting |
| - hasvk: fix null descriptor handling with A64 messages |
| - anv: don't try to access dynamic buffers from surface states |
| - intel/compiler: disable per-sample interpolation modes with non-per-sample dispatch |
| - anv: add missing ISL storage usage |
| - intel/nir: rerun lower_tex if it lowers something |
| - hasvk: add state cache invalidation back before fast clears |
| - anv: fix utrace timestamp buffer copies |
| - intel: don't assume Linux minor dev node |
| - blorp: switch blorp_update_clear_color to early return |
| - blorp: update and move fast clear PIPE_CONTROLs to drivers |
| - iris: ensure stalling pipe control before fast clear |
| |
| Liviu Prodea (1): |
| |
| - microsoft/clc: Don't build compiler test if build-tests is false |
| |
| Lone_Wolf (3): |
| |
| - compiler/clc: Fix embedded clang headers (microsoft-clc) for LLVM 16+ |
| - clc: Add clangASTMatchers to fix static llvm build of microsoft-clc with LLVM 16+ |
| - clc: Add clang frontendhlsl module to fix build of microsoft-clc with llvm 16+ |
| |
| Luc Ma (1): |
| |
| - meson: keep Mako version checking in accord with build msg |
| |
| Luca Bacci (1): |
| |
| - Add checks for NULL dxil_validator |
| |
| Luca Weiss (1): |
| |
| - freedreno: Enable A506 |
| |
| Lucas Fryzek (6): |
| |
| - broadcom: Add support for VK_FORMAT_A2R10G10B10_UNORM_PACK32 |
| - broadcom: Fix slice memory allocation logic for compressed textures |
| - v3d: Add support for ASTC texture compression |
| - v3dv: Update texture padding logic to match v3d changes |
| - mailmap: Add Lucas Fryzek to mailmap |
| - gallium: Remove \`PIPE_CAP_RGB_OVERRIDE_DST_ALPHA_BLEND` |
| |
| Lucas Stach (16): |
| |
| - etnaviv: update derived state after forced commandstream flush |
| - etnaviv: don't flush implicit flush resources when forced |
| - etnaviv: rs: flush TS cache before making configuration changes |
| - etnaviv: rs: unconditionally flush color and depth cache before using RS |
| - etnaviv: optimize transfer flushes |
| - etnaviv: query: move sample counter manipulation into query providers |
| - etnaviv: query: reset sample count on begin_query |
| - etnaviv: query: remove incorrect comment |
| - etnaviv: query: correct max number of perfmon samples |
| - etnaviv: query: correct max number of occlusion query samples |
| - etnaviv: query: optimize context flushes |
| - mesa/st: discard whole resource when mapping drawpixels texture |
| - etnaviv: only emit sampler config for changed samplers |
| - etnaviv: move resource level dimension members to make comments line up |
| - etnaviv: rs: fix multisampled blits |
| - etnaviv: blt: fix multisampled blits |
| |
| Luigi Santivetti (13): |
| |
| - pvr: use PVR_DW_TO_BYTES for stream_link_space calculation |
| - pvr: add GUARD_SIZE_DEFAULT for CDM and VDM control stream links 1 and 2 |
| - pvr: fixup stack overflow in {start,end}_sub_cmd |
| - pvr: introduce suballocator for internal allocations |
| - pvr: switch pvr_gpu_upload_* to use pvr_bo_suballoc |
| - pvr: switch pvr_cmd_buffer_alloc_mem to use pvr_bo_suballoc |
| - pvr: switch pvr_descriptor_set_create to use pvr_bo_suballoc |
| - pvr: switch pvr_clear to use pvr_bo_suballoc |
| - pvr: switch pvr_spm to use pvr_bo_suballoc |
| - pvr: fixup assert in pvr_cmd_buffer_alloc_mem |
| - pvr: fix division by block size in blit |
| - pvr: fixup transfer primary sub-command list |
| - pvr: do not claim support for ASTC texture compression |
| |
| Luna Nova (5): |
| |
| - device_select_layer: fix inverted strcmp in device_select_find_dri_prime_tag_default (v1) |
| - device_select_layer: apply DRI_PRIME even if default device is > 1 to match opengl behavior |
| - device_select_layer: pick a default device before applying DRI_PRIME |
| - device_select_layer: add MESA_VK_DEVICE_SELECT_DEBUG which logs why default selection was made |
| - device_select_layer: log selectable devices if MESA_VK_DEVICE_SELECT_DEBUG or DRI_PRIME_DEBUG are set |
| |
| Lynne (4): |
| |
| - radv/video: reject general unsupported video formats |
| - radv/video: reject non-8bit H264 |
| - radv/video: reject unsupported hevc profiles and bit depths |
| - anv_video: reject decoding of unsupported profiles and formats |
| |
| M Henning (12): |
| |
| - nvc0: Use nir in nvc0_program_init_tcp_empty |
| - nvc0: Use nir in nvc0_blitter_make_vp |
| - nv50,nvc0: Use nir in nv50_blitter_make_fp |
| - nv50,nvc0: Stop advertising TGSI by default |
| - nv50,nvc0: Use ttn for tgsi shaders by default |
| - gallium: Add pipe_shader_state_from_nir |
| - nouveau/codegen: Check nir_dest_num_components |
| - nv50/codegen: Set lower_uniforms_to_ubo |
| - nouveau/nir: Set isSigned on all atomic_imax/imin |
| - nv50,nvc0: Free nir from blitter fp shader |
| - nvc0: Free blitter->vp |
| - nv50: Fix return type of nv50_blit_is_array |
| |
| Marcin Ślusarz (17): |
| |
| - intel: split URB space between task and mesh proportionally to entry sizes |
| - anv: move nir_shader_gather_info to anv_pipeline_nir_preprocess |
| - intel/tools: decode ACTHD printed by newer kernels |
| - nir: extract try_lower_id_to_index_1d |
| - nir: use wg id to wg idx shortcut if two dims of num_workgroups are 1 |
| - nir: use constant components of num_workgroups in wg id to wg idx lowering |
| - nir: lower num_workgroups to constants |
| - intel/compiler: pass num_workgroups from task to mesh shaders |
| - nir: add cheap shortcut for wg id to wg idx lowering |
| - anv,intel/compiler: enable shortcut in wg id to wg idx lowering on >= gfx12.5 |
| - intel/compiler: simplify reading of gl_NumWorkGroups in task/mesh |
| - anv: fix how NULL buffer_view is handled in anv_descriptor_set_write_buffer_view |
| - anv: pass anv_surface_state using a pointer |
| - anv: limit stack usage for anv_surface_state |
| - intel/compiler/mesh: compactify MUE layout |
| - intel/compiler,anv: put some vertex and primitive data in headers |
| - intel/compiler: load debug mesh compaction options once |
| |
| Marek Olšák (169): |
| |
| - nir: fix 2 bugs in nir_create_passthrough_tcs |
| - nir: lower load_barycentric_at_offset in lower_wpos_ytransform |
| - nir: assign IO bases in nir_lower_io_passes |
| - nir: skip nir_lower_io_passes for compute shaders |
| - nir: extend nir_opt_fragdepth to handle lowered IO |
| - nir: handle more opcodes in nir_lower_io_to_scalar |
| - nir: handle all varying slots in gl_varying_slot_name_for_stage |
| - nir: don't remove dead IO variables in nir_lower_io_passes for st_link_nir |
| - nir: rework nir_lower_color_inputs to work with lowered IO intrinsics |
| - nir: return a status from nir_remove_varying whether it removed the instruction |
| - nir: remove an obsolete comment from nir_gather_xfb_info_from_intrinsics |
| - nir: add next_stage parameter to nir_slot_is_sysval_output to return better info |
| - nir: add next_stage parameter to nir_remove_varying |
| - nir: set uses_wide_subgroup_intrinsics for all shader stages |
| - venus: fix the RHEL8 build by using syscall for gettid |
| - nir: rename ACCESS_STREAM_CACHE_POLICY -> ACCESS_NON_TEMPORAL and document |
| - nir: add/update comments for gl_access_qualifier |
| - ac/surface: don't expose modifiers with DCC retiling if radeon_info forbids it |
| - ac/gpu_info: disable display DCC on Raphael and Mendocino to improve power usage |
| - radeon: add radeon_info parameter into radeon_winsys::surface_init |
| - radeonsi: do AMD_DEBUG=nodisplaydcc differently to also remove modifiers |
| - aco: don't treat ACCESS_NON_READABLE as ACCESS_COHERENT |
| - ac/llvm: don't treat ACCESS_NON_READABLE as ACCESS_COHERENT |
| - ac/llvm: rewrite and unify how GLC, DLC, SLC are set |
| - nir/lower_io: don't renumber VS inputs when not called from a linker |
| - ac/surface: fix address calculation for large images by using uint64_t |
| - radv: fix sparse image address calculation for large images by using uint64_t |
| - radv: fix SDMA image address calculation for large images by using uint64_t |
| - radeonsi: fix SDMA image address calculation for large images by using uint64_t |
| - radeonsi: fix image address calculation for large images by using uint64_t |
| - radeonsi: fix sparse image address calculation for large images by using uint64_t |
| - radeonsi: fix image size calculation in fast clear |
| - ac/surface: clean up and move the PIPE_CONFIG helper to ac_surface.c |
| - ac/surface: define LINEAR_PITCH_ALIGNMENT |
| - ac/surface: validate overridden pitch for all chips |
| - ac/surface: fix overridden linear pitch for CPU access |
| - ac/surface: add ac_surf_config::is_array |
| - amd/registers: update pitch definitions in descriptors |
| - mesa: fix a VBO buffer reference leak in _mesa_bind_vertex_buffer |
| - ac,radeonsi,winsyses: switch to SPDX-License-Identifier: MIT |
| - winsys/radeon: set has_image_opcodes to unbreak gfx6-7 |
| - winsys/radeon: fix the scratch buffer on gfx6-7 |
| - winsys/radeon: set more radeon_info fields |
| - ac/gpu_info: give has_msaa_sample_loc_bug a more accurate name |
| - ac/surface: move CB format translation helpers here |
| - ac/surface: move determing ADDR_FMT_* into a helper function |
| - ac/llvm: clean up translation of nir_intrinsic_load_invocation_id |
| - ac/llvm: clean up visit_load_local_invocation_index and visit_load_subgroup_id |
| - ac/llvm: use LLVM 0/1 constants from ac_llvm_context instead of LLVMConstInt |
| - radeonsi/gfx11: fix alpha-to-coverage with blending |
| - radeonsi: reorder code in si_texture_create_object as preparation for the future |
| - radeonsi: cosmetic changes in si_shader.h |
| - radeonsi: remove the gl_SampleMask FS output if MSAA is disabled |
| - radeonsi: don't enable WGP_MODE because of high cost of workgroup mem coherency |
| - radeonsi: move emitting draws states out of si_emit_all_states |
| - radeonsi/gfx11: use DISABLE_FOR_AUTO_INDEX to disable non-indexed prim restart |
| - radeonsi: reduce the supported compute grid size |
| - radeonsi: update test results and flakes |
| - radeonsi: re-enable fp16_rtz for compute blits to fix PBO tests on gfx11 |
| - amd/addrlib: switch the license to the SPDX identifier MIT |
| - amd/addrlib: add ADDR_FMT_BG_RG_16_16_16_16 |
| - ac/surface: fix is_linear for stencil-only surfaces |
| - ac/nir: handle DEPTH as PITCH in ac_nir_lower_resinfo |
| - radeonsi: implement setting a custom pitch to any multiple of 256B on gfx10.3+ |
| - radv: implement setting a custom pitch to any multiple of 256B on gfx10.3+ |
| - ac/surface: relax custom pitch requirements to any multiple of 256B on gfx10.3+ |
| - ac/surface: fix R32G3B32 image format regression for gfx6-8 |
| - ac/nir/ngg: always use load_initial_edgeflags_amd, choose the value in drivers |
| - amd: add radeon_info* into ac_llvm_context and radv_nir_compiler_options |
| - radeonsi: define si_shader_io_get_unique_index() values as SI_UNIQUE_SLOT_* |
| - radeonsi: remove gl_BackColor VS outputs on demand if color_two_side is disabled |
| - radeonsi: export non-zero edgeflags for GS and tess |
| - radeonsi/gfx11: extend DB_Z_INFO.NUM_SAMPLES programming to > GFX11 |
| - radeonsi: print shader-db stats with AMD_DEBUG=vs,ps,stats |
| - radeonsi: use nir_lower_alu_to_scalar correctly |
| - radeonsi: remove a useless depth texture function call in a fast color clear |
| - radeonsi: add a gfx11 version of si_decompress_textures, add assertions < GFX11 |
| - radeonsi: remove RADEON_FLAG_MALL_NOALLOC due to no use |
| - radeonsi: completely rewrite how VGT_SHADER_STAGES_EN is set |
| - radeonsi: unduplicate si_translate_format_to_hw |
| - radeonsi: decompress DCC for SDMA if we're really going to use SDMA |
| - radeonsi: increase SDMA gfx9+ limits |
| - radeonsi: split tracked_regs masks into context registers and other registers |
| - radeonsi: reorder and comment tracked registers |
| - radeonsi: move PA_CL_NGG_CNTL emission into rasterizer state |
| - radeonsi: always set sample locations even for 1x MSAA for simplicity |
| - radeonsi: adjust 16x EQAA sample locs to make PA_SU_PRIM_FILTER_CNTL immutable |
| - radeonsi: move PA_SU_SMALL_PRIM_FILTER_CNTL to the preamble when possible |
| - radeonsi: merge si_emit_msaa_sample_locs with si_emit_sample_locations |
| - radeonsi: rename the msaa_sample_locs state to sample locations |
| - radeonsi: optimize no-op primitive restart index changes thanks to index masking |
| - radeonsi: don't program COMPUTE_MAX_WAVE_ID (GDS register) on gfx6 |
| - radeonsi: add helpers to create and clone a sized pm4 state |
| - radeonsi: add a separate gfx10_init_gfx_preamble_state function |
| - radeonsi: don't set registers set by CLEAR_STATE in the preamble for gfx10-11 |
| - radeonsi: add a separate cdna_init_compute_preamble_state function |
| - radeonsi/ci: add gfx6 failures |
| - radeonsi: re-indent gfx10_create_sh_query_result_cs |
| - radeonsi: don't use SET_SH_REG_INDEX on gfx7-9 |
| - radeonsi: don't use SET_SH_REG_INDEX if the kernel doesn't use CU reservation |
| - amd: remove unused PKT0 definitions |
| - treewide: use uint64_t / (u)intptr_t in image address calculations |
| - amd: drop support for LLVM 11 |
| - amd: drop support for LLVM 12 |
| - amd: drop support for LLVM 13 |
| - amd: drop support for LLVM 14 |
| - mesa: fix glBitmap in display lists when width <= 0 || height <= 0 |
| - gallium/hud: append results to files instead of overwriting them |
| - radeonsi: don't convert L8A8 to R8A8 when blitting via compute to fix gfx7 |
| - amd: update SET_*_REG_PAIRS* documentation and remove radeon_info options |
| - amd: improve the IB parser, parse more packets |
| - amd: rename mid_command_buffer_preemption_enabled -> register_shadowing_required |
| - amd: increase the attribute ring size on gfx1103_r1 |
| - amd: don't set PA_RATE_CNTL because it has no effect |
| - amd: fix GPU cache sizes retrieved from the kernel |
| - amd: remove non-shadowed register tables |
| - amd: remove ac_check_shadowed_regs |
| - amd: add a new helper that prints all non-shadowed regs |
| - amd: update shadowed register tables for gfx11 |
| - amd: skip redundant PKT3_NUM_INSTANCES even with register shadowing |
| - amd: skip redundant INDEX_TYPE even with register shadowing |
| - radeonsi: set register_shadowing_enabled if AMD_DEBUG=shadowregs is set |
| - radeonsi/ci: add glx@glx-visuals-stencil to skips because it gets stuck often |
| - radeonsi: fix RB+ and gfx11 issues with framebuffer state |
| - radeonsi: change si_emit_derived_tess_state into a state atom |
| - radeonsi: shrink the last field of tcs_offchip_layout due to LDS limit |
| - radeonsi: don't do PFP_SYNC_ME before CP DMA and compute blits |
| - radeonsi: don't needlessly invalidate L0/L1 caches at the beginning of IBs |
| - radeonsi: add more variables into si_pm4_state and rework how it's created |
| - radeonsi: remove sscreen parameter from si_pm4_set_reg_idx3 |
| - radeonsi: set non-graphics uconfig registers first in the preamble |
| - radeonsi: handle demoted si_pm4_set_reg_idx3 as si_pm4_set_reg |
| - radeonsi: eliminate redundant compute SH register changes |
| - radeonsi: handle VGT_GS_OUT_PRIM_TYPE like a tracked register |
| - radeonsi: handle VGT_LS_HS_CONFIG like a tracker register |
| - radeonsi: handle GE_CNTL and IA_MULTI_VGT_PARAM as a tracked register |
| - radeonsi: remove gfx10 NGG streamout |
| - ci: remove clang-format testing |
| - intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR |
| - radeonsi: move TCS.gl_PatchVerticesIn into the tcs_offchip_layout SGPR |
| - radeonsi: replace tcs_out_lds_layout with nearly identical tes_offchip_addr |
| - radeonsi: move the only tcs_out_lds_offsets field to vs_state_bits |
| - radeonsi: eliminate redundant TCS user data and RSRC2 register changes |
| - radeonsi/gfx11: use SET_*_REG_PAIRS_PACKED packets for pm4 states |
| - radeonsi: determine si_pm4_state::reg_va_low_idx automatically |
| - radeonsi: keep pipeline statistics disabled when they are not used |
| - radeonsi: don't do BREAK_BATCH for context regs with only 1 context per batch |
| - radeonsi: use si_pm4_create_sized for the shadowing preamble |
| - radeonsi: remove radeon_winsys::cs_set_preamble |
| - radeonsi: remove uses_reg_shadowing parameter from si_init_gfx_preamble_state |
| - radeonsi/gfx11: fix GLCTS with register shadowing by keeping the CS preamble |
| - radeonsi/gfx11: enable register shadowing by default |
| - radeonsi: reorder compute code to prepare for packed SET_SH_REG packets |
| - radeonsi/gfx11: use SET_SH_REG_PAIRS_PACKED for gfx by buffering reg writes |
| - radeonsi/gfx11: use SET_SH_REG_PAIRS_PACKED for compute by buffering reg writes |
| - radeonsi: clean up query functions, make them static, remove forward decls |
| - radeonsi: declare compiler[] and nir_options as pointers to reduce #includes |
| - radeonsi: clean up #includes |
| - Revert "egl: return correct error for EGL_KHR_image_pixmap" |
| - vbo: correctly restore _VaryingInputs for display list fast path |
| - radeonsi/gfx11: only use SET_*_PAIRS* packets on dGPUs |
| - radeonsi: fix gfx9 regression causing GPU hangs |
| - radeonsi/gfx11: fix a regression with PAIRS packets due to shader changes |
| - Revert "ac/nir/ngg: Follow intrinsic sources when analyzing before culling." |
| - glthread: determine global locking once every 64 batches to fix get_time perf |
| - mesa: fix 38% decrease in display list performance of Viewperf2020/NX8_StudioAA |
| - util/u_queue: fix util_queue_finish deadlock by merging lock and finish_lock |
| - radeonsi: fix a CDNA regression breaking compute |
| - Revert "ac: don't call ac_query_pci_bus_info from ac_query_gpu_info" |
| |
| Mark Collins (1): |
| |
| - ir3/a7xx: Add definitions for (last) src GPR attribute |
| |
| Mark Janes (8): |
| |
| - intel/dev: update mesa_defs.json from defect database |
| - intel/dev: report stepping for TGL systems |
| - intel/dev: switch defect identifiers to use lineage numbers |
| - isl: use generated workaround helpers for Wa_1806565034 |
| - iris: convert Wa_14010455700 to use workaround mechanism |
| - anv: convert Wa_14010455700 to use workaround mechanism |
| - intel: use generated helpers for Wa_1508744258 |
| - intel/dev: update mesa_defs.json from defect database |
| |
| Martin Roukala (né Peres) (26): |
| |
| - radv/ci: disable the vkcts-navi21-llvm-valve job |
| - radv/ci: document all the flakes we hit while I was away |
| - ci/b2c: allow not specifying a reboot condition |
| - radv/ci: only reboot on hangs for vkcts-navi10-valve |
| - zink/ci: document that some tests no longer fail |
| - zink/ci: mark 77 multisample-related tests as fixed |
| - radv/ci: document another vkcts flake on vega10 |
| - radv/ci: document a series of recent regressions |
| - zink/ci: document recent fixes on RADV |
| - zink/ci: document new flakes on RADV |
| - radv/ci: document more flakes for navi21 |
| - radv/ci: switch to b2c v0.9.10 |
| - ci/b2c: update to mesa-trigger:2023-03-08.1 |
| - zink/ci: add more QBO-related fails on RADV |
| - amd/ci: add another test to the vkcts-vega10 flake list |
| - zink/ci: remove spec@nv_shader_atomic_int64@* from the fail lists |
| - ci: bring back the valve farm |
| - ci/b2c: select the DUT to run on by name |
| - radv/ci: use the low-priority runners for vangogh jobs |
| - ci/b2c: change the default first-console-activity timeout to 2 minutes |
| - zink/ci: add more tests to the flake list of vangogh |
| - zink/ci: enable zink-radv-vangogh-valve for pre-merge testing |
| - Revert "ci: mark the valve farm as down" |
| - amd/ci: temporarily disable some manual jobs that take a long time to run |
| - zink/ci: remove 3 tests from the fails list |
| - Revert "amd/ci: temporarily disable some manual jobs that take a long time to run" |
| |
| Martin Stransky (1): |
| |
| - llvmpipe: fix UAF in lp_scene_is_resource_referenced. |
| |
| Matt Coster (57): |
| |
| - pvr: Complete pvr_isp_ctrl_stream() |
| - pvr: Fully declare support for VK_EXT_private_data |
| - pvr: Remove false assumption from pvr_write_draw_indirect_vdm_stream() |
| - pvr: Fixup format features |
| - pvr: Unmap mapped memory on free |
| - pvr: Correctly validate PBE accum format |
| - pvr: Actually check for depth load when setting up load op constants |
| - pvr: Initialize aspect_mask when creating buffer views |
| - pvr: Correctly compile graphics pipelines without a fragment shader |
| - pvr: Fix off-by-one in pvr_cmd_buffer_upload_desc_set_table() assert |
| - pvr: Remove unneeded assert in pvr_get_hw_clear_color() |
| - pvr: Set output_offset correctly in pvr_clear_color_attachment_static() |
| - pvr: Return correct pbe_accum_format size for A2B10G10R10_UINT_PACK32 |
| - pvr: Remove bad assert in pvr_clear_attachments() |
| - pvr: Add PVR_DEBUG=vk_desc option to dump descriptor set layouts |
| - pvr: Simplify descriptor set layout dump separators |
| - pvr: Return VkResult from pvr_winsys_create() |
| - pvr: Propagate errors as VkResults from ioctls through winsys |
| - pvr: Fix incorrect error return in pvr_ctx_sr_programs_setup() |
| - pvr: Fix incorrect error handling in pvr_render_ctx_switch_init() |
| - pvr: Squeeze fd handling into winsys layer |
| - pvr: Drop pdevice from pvr_physical_device_get_supported_extensions() |
| - pvr: Rename primary_{device,fd,path} to display_* |
| - pvr: Use common physical device enumeration |
| - pvr: Assorted cleanup |
| - pvr: Return VkResult from winsys buffer_map operation |
| - pvr: Fix allocation scopes in vkCreateRenderPass2() code path |
| - pvr: Fix memory leaks on realloc failure in pvr_pipeline.c |
| - pvr: Correct error flow in pvr_graphics_pipeline_compile() |
| - pvr: Correct error flow in pvr_compute_pipeline_compile() |
| - pvr: Use correct surface for deferred RTA clear |
| - pvr: Rename shadowing loop variable in pvr_add_deferred_rta_clear() |
| - pvr: Do not free deferred pvr_transfer_cmd instances |
| - pvr: Fix out of range stream errors for geometry-only jobs on pvrsrvkm |
| - pvr: Reorder execution in pvr_cmd_buffer_end_sub_cmd() |
| - pvr: Fix page faults in occlusion query tests |
| - pvr: Fix rect splitting logic in pvr_unwind_rects() |
| - pvr: Use correct pbe format for VK_FORMAT_A8B8G8R8_UNORM_PACK32 |
| - pvr: Use common vkGetPhysicalDeviceFeatures2() implementation |
| - pvr: Fix segfault in pvr_physical_device_init() |
| - pvr: Move pvr_get_isp_num_tiles_xy() to rogue_hw_utils.h |
| - pvr: Use pvr_sub_cmd_event union members directly |
| - pvr: Add wait_on_previous_transfer flag to graphics subcommand |
| - pvr: Cleanup in pvr_process_cmd_buffer() |
| - pvr: Add pvr_image_view_get_image() |
| - pvr: Publicise some static functions from pvr_blit.c |
| - pvr: Rename ds_{image,iview} in pvr_gfx_sub_cmd_job_init() |
| - pvr: Implement ZLS subtile alignment |
| - pvr: Correct calculations in pvr_unwind_rects() |
| - pvr: Refactor pvr_unwind_rects() |
| - pvr: Allow S8_UINT to be used as a stencil attachment format |
| - pvr: Don't overwrite PDS vertex input flags |
| - pvr: Declare dependency on idep_mesautil |
| - pvr: Add support for sampler border colors |
| - pvr: Correctly read dynamic state setup during blend constant setup |
| - pvr: Advance entry pointer in pvr_setup_vertex_buffers() |
| - pvr: Rename transfer 3D heap to transfer frag heap |
| |
| Matt Turner (13): |
| |
| - intel: Disable shader cache when executing intel_clc during the build |
| - u_format: Use memcpy to avoid unaligned accesses |
| - meson: Remove reference to removed SWR driver |
| - anv: Pipe anv_physical_device to anv_get_image_format_features2 |
| - anv: Only expose video decode bits with KHR_video_decode_queue |
| - intel: Rearrange for next commit |
| - intel: Consider with_intel_clc in with_any_intel |
| - intel: Only build blorp if drivers are enabled |
| - intel: Only build ds if drivers are enabled |
| - intel: Only build perf if drivers or tools are enabled |
| - intel: Allow using intel_clc from the system |
| - intel: Limit Intel Vulkan RT to x86_64 |
| - Revert "intel/fs: only avoid SIMD32 if strictly inferior in throughput" |
| |
| Matthieu Bouron (1): |
| |
| - lavapipe: honor dst base array layer when resolving color attachments |
| |
| Michael Tretter (2): |
| |
| - panfrost: remove BO from cache before closing GEM |
| - kmsro: assert that scanout refcount is larger than 0 |
| |
| Michel Dänzer (17): |
| |
| - ci: Explicitly test for meson feature checks in compiler wrapper |
| - ci: Use set -e in frontend compiler wrapper scripts. |
| - ci: Remove shebang from backend compiler wrapper script |
| - ci: Drop executable permissions from backend compiler wrapper script |
| - tgsi: Make ureg_DECL_output_masked definition match its declaration |
| - llvmpipe: Make lp_build_interp_soa declaration match its definition |
| - mesa/st: Make st_convert_image(_from_unit) declaration match definition |
| - vulkan: Fix GetPhysicalDeviceSparseImageFormatProperties definition |
| - anv/format: Fix GetPhysicalDeviceSparseImageFormatProperties definition |
| - vulkan: Fix GetPhysicalDeviceSparseImageFormatProperties definitions |
| - svga: Make vmw_svga_winsys_buffer_map definition match declaration |
| - svga: Make declaration of emit_input_declaration match definition |
| - clover/llvm: Use llvm::DataLayout::getABITypeAlign with LLVM >= 16 |
| - clover/llvm: Use std::nullopt already with LLVM 16 |
| - ci: Drop -Wno-error=array-bounds from fedora-release job |
| - ci: Upgrade fedora-release job to Fedora 38 |
| - ci: Enable rusticl in the fedora-release job |
| |
| Michel Zou (4): |
| |
| - vulkan/wsi: fix -Wnarrowing warning |
| - vk/entry_points:: fix mingw build |
| - mesa/draw: fix -Wformat warning |
| - util: reinstate ENUM_PACKED |
| |
| Mihai Preda (1): |
| |
| - nir: update nir->num_inputs, num_outputs in nir_recompute_io_bases() |
| |
| Mike Blumenkrantz (364): |
| |
| - mesa/st/program: don't init xfb info if there are no outputs |
| - zink: remove atomics from zink_query |
| - zink: pass ctx through query destroy paths |
| - zink: always defer query pool deletion |
| - zink: remove screen param from zink_prune_query() |
| - util/cpu: add big.LITTLE cpu detection |
| - driconf: rework glthread enablement |
| - glthread: disable by default with fewer than 4 (big) CPUs |
| - zink: move memoryTypeIndex selection down in general bo allocation |
| - zink: slightly rework memoryTypeIndex selection to pre-determine heap |
| - zink: restore BAR allocation failure demotion |
| - zink: make general bo allocation more robust by iterating |
| - zink: avoid zero-sized memcmp for descriptor layouts |
| - iris: use util_framebuffer_get_num_samples when setting ps dispatch samples |
| - nir/lower_alpha_test: rzalloc state slots |
| - zink: fix non-db bindless texture buffers |
| - util/blitter: fix line wrapping on error to avoid giving wrong line number |
| - glthread: add newline to env override |
| - zink: emit demote cap when using demote |
| - zink: only print copy box warning once per resource |
| - zink: hook up debug callback |
| - zink: use a perf_debug() macro for debug message logging of copy box warning |
| - util/debug: move null checks out of debug message macro |
| - zink: manually re-set framebuffer after msrtss replicate blit |
| - zink: handle 'blitting' flag better in msrtss replication |
| - zink: skip msrtss replicate if the attachment will be full-cleared |
| - zink: avoid recursion during msrtss blits from flushing clears |
| - zink: don't bitcast bool deref loads/stores |
| - zink: zink_shader_free -> zink_gfx_shader_free |
| - zink: split out generic shader destruction for reuse |
| - zink: always wait on precompile fence at start of zink_gfx_shader_free() |
| - zink: call zink_shader_free for compute shaders |
| - zink: add a util function for printing shaders |
| - zink: don't create separate shader dsls if there are no bindings |
| - drisw: don't leak the winsys |
| - zink: check for extendedDynamicState3DepthClipNegativeOneToOne for ds3 support |
| - mesa/st: try to block multisampled texsubimage from doing cpu writes |
| - mesa: fix ms fallback texture creation |
| - draw: fix viewmask iterating |
| - zink: use tes to generate tcs |
| - zink: hook up EXT_shader_object |
| - zink: wrap zink_shader_compile_separate() return |
| - zink: wrap return of compile_module() |
| - zink: make zink_shader_spirv_compile static |
| - zink: more zink_shader_object conversion |
| - zink: use zink_shader_object for precompiled separate shaders |
| - zink: minor whitespace cleanup |
| - zink: move separate shader dsl creation to compiler function |
| - zink: add a 'separate' flag to shader module compile to indicate separate shaders |
| - zink: run bo lowering passes for separate shader compile with uniform inlining |
| - zink: remove redundant compute program batch ref |
| - zink: use EXT_shader_object to (re)implement separate shaders |
| - zink: add validation exceptions for shader object extension enable |
| - zink: don't pin flush queue threads if no threads exist |
| - zink: add z32s8 as mandatory GL3.0 profile attachment format |
| - zink: add a driver workaround to disable background compiles |
| - nir/gs: fix array type copying for passthrough gs |
| - zink: fix array copying in pv lowering |
| - gallivm: break out native vector width calc for reuse |
| - llvmpipe: do late init for llvm builder |
| - zink: print the type of shader when dumping |
| - zink: use intermediate variable for separate shader descriptor update loop |
| - zink: use intermediate variable for separate shader db resize check |
| - zink: simplify separate shader prog init a little |
| - zink: streamline separate shader descriptor update |
| - zink: switch to a regular loop to wait on precompile shader fences |
| - zink: move some shader CSO functions around |
| - zink: assign separate shader prog stages from ctx->shader_stages |
| - zink: use a more standardized loop for initing separate shader program descriptors |
| - zink: move separate shader creation to shader CSO creation |
| - zink: handle all stages in fixup_io_locations() |
| - zink: fix longstanding TODO for generated tcs |
| - zink: use EXT_shader_object to implement generic separate shader precompile |
| - bump VVL to 1.3.248 |
| - zink: prune some validation errors from ci |
| - zink: break out VkImageViewUsageCreateInfo applying for reuse |
| - zink: reapply VkImageViewUsageCreateInfo when rebinding a surface |
| - zink: add a workaround for a nir_assign_io_var_locations bug |
| - zink: don't run update_so_info if shader has no outputs |
| - zink: add ZINK_DEBUG=noshobj to disable EXT_shader_object |
| - zink: rename 'separate' param in shader compilation to 'can_shobj' |
| - zink: explicitly block sample shading in the GPL precompile path |
| - zink: add zink_program::uses_shobj for managing shader object binds |
| - zink: use local screen var in zink_gfx_program_update_optimal() |
| - zink: deduplicate separable program replacement handling |
| - zink: delete redundant conditional |
| - zink: use zink_shader_object for zink_shader_module |
| - zink: use zink_destroy_shader_module() for compute to deduplicate code |
| - zink: store spirv onto zink_shader_object structs |
| - zink: allow zink_shader_module to be either a shobj or a mod using a bool |
| - zink: avoid accessing zink_gfx_program::modules during pipeline compile |
| - zink: add a union to zink_gfx_pipeline_cache_entry for gpl |
| - zink: use zink_shader_object for pipeline compiles from zink_gfx_program |
| - zink: make zink_shader_spirv_compile public |
| - zink: enable EXT_shader_object for generic precompiles |
| - draw: fix robust ubo size calc |
| - ci: disable all a306/a530/a630 jobs |
| - llvmpipe: fix native vector width init |
| - zink: update amdpro fails |
| - zink: add extendedDynamicState3DepthClipNegativeOneToOne to profile |
| - zink: only unset a generated tcs if the bound tcs is the generated one |
| - Revert "zink: don't create separate shader dsls if there are no bindings" |
| - zink: disable a630 traces |
| - zink: set depth dynamic state values unconditionally |
| - zink: null some descriptor buffer pointers during destruction |
| - zink: sync queries at the end of cmdbufs |
| - cso: unbind fb state when unbinding the context |
| - i915: use util_copy_framebuffer_state to set fb state |
| - i915: use util_unreference_framebuffer_state to unref fb state |
| - iris: use util_unreference_framebuffer_state to unref fb state |
| - softpipe: use util_unreference_framebuffer_state to unref fb state |
| - v3d: use util_unreference_framebuffer_state to unref fb state |
| - vc4: use util_unreference_framebuffer_state to unref fb state |
| - llvmpipe: use util_unreference_framebuffer_state to unref fb state |
| - svga: use util_unreference_framebuffer_state to unref fb state |
| - zink: move EXT_shader_object check to another place |
| - zink: break out optimal key handling into separate function |
| - zink: disable EXT_shader_object if !optimal_keys |
| - zink: add ZINK_DEBUG=optimal_keys |
| - gallium: pipe_rasterizer_state::point_tri_clip -> point_line_tri_clip |
| - aux/draw: guard_band_points_xy -> guard_band_points_lines_xy |
| - aux/draw: add guardband clipping for lines |
| - zink: don't init mutable resource bit for swapchain images |
| - zink: don't init mutable for swapchain src during blit |
| - tgsi_to_nir: handle PIPE_CAP_NIR_COMPACT_ARRAYS for clipdistance |
| - zink: allow vk 1.2 timelineSemaphore feature if extension isn't supported |
| - zink: stringify unsupported prim restart log error |
| - zink: delete persistent map tracking |
| - zink: add PERSISTENT for db buffer maps |
| - zink: delete unnecessary pipeline stage flags from inference |
| - zink: use an intermediate variable for binding ssbo slots |
| - zink: unbind the ssbo slot being iterated, not the index of the buffer |
| - zink: flush INDIRECT_BUFFER mem barrier for compute |
| - zink: disable batched unordered barries with ZINK_DEBUG=noreorder |
| - zink: block batching of unordered barriers if previous usage was write |
| - zink: fix uncached memory readback |
| - glsl/lower_samplers_as_deref: apply bindings for unused samplers |
| - vulkan/runtime: add VK_DYNAMIC_STATE_ATTACHMENT_FEEDBACK_LOOP_ENABLE_EXT |
| - zink: add ZINK_DEBUG=noopt |
| - zink: add ZINK_DEBUG=nobgc |
| - zink: make mesa_logw separate from perf_debug |
| - zink: add perf_debug for "interesting" shader compiles |
| - zink: set debug callback on context |
| - zink: bind bindless db set when updating separate shader db sets |
| - zink: compare desc set to detect bindless vars in separate shaders |
| - zink: adjust bindless texel buffer handle before indexing |
| - zink: block more flushes during unordered blits |
| - zink: also cache swapchain semaphores |
| - zink: disable always zs feedback loop on radv |
| - zink: add back some anv qbo flakes |
| - zink: disable have_EXT_vertex_input_dynamic_state without EDS2 |
| - zink: disable dynamic state exts if the previous ones aren't present |
| - zink: add some ci flakes |
| - zink: don't leak swapchain readback semaphores |
| - zink: destroy current batch state after all other batch states |
| - zink: reorder some native blit code |
| - zink: reject blits where src/dst is 3D and dst/src z!=0 |
| - zink: reorder some image copy code |
| - zink: ignore no-op image copies |
| - zink: only add feedback loop usage bit if extension is supported |
| - lavapipe: EXT_attachment_feedback_loop_layout_dynamic_state |
| - zink: slightly simplify bda allocation chaining |
| - zink: hook up some memory extensions |
| - zink: set higher prio on dedicated memory allocations |
| - zink: flag batch usage on swapchain images |
| - vulkan/wsi: add feedback loop usage to swapchain caps if supported |
| - zink: add feedback loop usage for swapchains |
| - vtn: add spirv index to type mismatch error for debugging |
| - vtn: print spirv id for type mismatch error |
| - vtn: print spirv ids for type mismatch in bcsel |
| - vtn: add more info to bitcast bit size error message |
| - zink: try update fb resource refs when starting new renderpass |
| - zink: add special-casing for (not) reordering certain image barriers |
| - zink: use batch usage function for a simple case |
| - zink: move zink_batch_state::submit_count to zink_batch_usage |
| - zink: move batch usage to substruct on zink_bo objects |
| - zink: track/check submit info on resource batch usage |
| - zink: disable unordered blits when swapchain images need aqcuire |
| - zink: explicitly disable reordering after restricted swapchain readback blits |
| - zink: explicitly disable promotion on images that are both unflushed and non-reorderable |
| - zink: flag 'has_work' on batch when promoting a cmd |
| - lavapipe: more correctly handle null pipeline states |
| - anv: more correctly handle null pipeline states |
| - vk/graphics_state: handle null pipeline state structs in creation |
| - zink: promote flushed clears to unordered cmdbuf when possible |
| - zink: also declare int size caps inline with signed int type usage |
| - zink: delete unnecessary bitcast in load_shared/scratch |
| - zink: use void return for store_dest |
| - zink: move get_alu_type() up in file |
| - zink: manually memcpy the spirv instruction buffer |
| - zink: write out register variables to a separate spirv buffer |
| - zink: dynamically emit non-bool register values using local_vars spirv buffer |
| - zink: store and use alu types for ntv defs |
| - zink: infer types from load_const instrs to avoid more bitcasts |
| - lavapipe: bump memory allocation heap to 3GiB |
| - lavapipe: report full memory in heap for 64bit processes |
| - lavapipe: EXT_memory_budget |
| - lavapipe: EXT_memory_priority |
| - lavapipe: store memory allocation size onto lvp_device_memory |
| - lavapipe: VK_EXT_pageable_device_local_memory |
| - zink: don't wait on queue thread if disabled |
| - zink: use the per-context track_renderpasses flag in more places |
| - zink: don't remove psiz from linked shaders if the consumer reads it |
| - zink: don't propagate psiz in quads emulation gs |
| - lavapipe: VK_EXT_dynamic_rendering_unused_attachments |
| - zink: require EXT_dynamic_rendering_unused_attachments for dynamic rendering |
| - zink: explicitly avoid ci errors due to unrecognized extensions in VVL |
| - vulkan: reorder vk_cmd_queue_entry |
| - vulkan/cmd_queue: allocate cmds based on the size of the cmd |
| - vulkan/cmd_queue: expose cmd sizes |
| - vulkan: use cmd size array for queued cmd allocations |
| - ci: uprev VVL to 1.3.251 |
| - lavapipe: fix DS3 min sample setting |
| - lavapipe: bump max push constant size |
| - lavapipe: stop setting patch vertices constantly |
| - lavapipe: don't pass indirect info in streamout draws |
| - draw: add (disabled) vertex dumping for non-linear emit |
| - lavapipe: fix memory budget reporting |
| - zink: also disable bg compile for compute with nobgc |
| - zink: hook up VK_EXT_attachment_feedback_loop_dynamic_state |
| - zink: use dynamic state for feedback loops when available |
| - zink: enable EXT_shader_object globally with have_EXT_attachment_feedback_loop_dynamic_state |
| - zink: add a ci flake |
| - lavapipe: pass list to cmdbuf exec, not cmdbuf |
| - lavapipe: add a mapping for BDA |
| - lavapipe: add a zeroed buffer that can be bound in place of an index buffer |
| - lavapipe: handle index buffers with offsets for indirect draws |
| - lavapipe: NV_device_generated_commands |
| - zink: combine some rast state draw conditionals |
| - zink: don't check prog->shaders when creating gfx pipeline |
| - zink: check for cached mem correctly when mapping buffer |
| - zink: remove assert for dt in zink_kopper_update |
| - zink: stop swizzling conditional render during batch flush |
| - zink: update some radv qbo fails |
| - radv: tweak gfx pipeline stage binding |
| - zink: only try to create srgb mutable images if the vk format is supported |
| - vk: make vk_format_map[] public |
| - radv: directly use vk_format_map for vertex input |
| - lavapipe: use PACKAGE_VERSION for cache uuid in release builds |
| - zink: massively shrink qbo size for timestamp queries |
| - zink: assert that ntv image creation isn't clobbering existing images |
| - zink: add some ntv asserts for ms txf |
| - zink: add a dgc debug mode for testing |
| - lavapipe: add version uuid to shader binary validation |
| - egl/dri2: trigger drawable invalidation from surface queries for zink |
| - zink: add some ci flakes |
| - zink: break out vk flag unrolling into util function |
| - zink: add mem debugging |
| - zink: remove redundant conditional in set_sampler_views |
| - zink: wrap format mismatch checks for blit/surface |
| - zink: add srgb mutable for all resources by default |
| - zink: drop dt checks for mutable format init |
| - zink: strip format list when disabling mutable during image creation |
| - dri3: only invalidate drawables on geometry change if geometry has changed |
| - zink: more anv ci flakes |
| - aux/trace: add methods for mesh shaders |
| - lavapipe: more fixes for sample shading |
| - lavapipe: fix shader binary binding with mesh shaders |
| - lavapipe: correctly update shader object per-stage push constant sizes |
| - zink: add COHERENT requirement for CACHED memory |
| - zink: ZINK_HEAP_HOST_VISIBLE_CACHED -> ZINK_HEAP_HOST_VISIBLE_COHERENT_CACHED |
| - zink: fix anv ci flake wildcarding |
| - aux/pipebuffer: add a return to pb_slabs_reclaim() |
| - aux/pipebuffer: add a return to pb_cache_release_all_buffers() |
| - zink: only retry bo allocation after reclaim if reclaims actually happened |
| - zink: fix ubo array sizing in ntv |
| - zink: acquire persistently bound swapchain descriptors before setting usage |
| - zink: recache present semaphores |
| - zink: always clamp NUM_QUERIES to 500 |
| - zink: radv vangogh ci updates |
| - radv: remove redundant intermediate variable in radv_is_mrt0_dual_src() |
| - radv: inline radv_can_enable_dual_src() |
| - zink: no-op redundant samplemask changes |
| - zink: force inlining for a bunch of functions |
| - zink: make invalidate_descriptor_state a ctx hook |
| - zink: specialize invalidate_descriptor_state hook for compact mode |
| - zink: clean up rp update tracking on dsa bind |
| - zink: use local screen var in blend state bind |
| - zink: track and apply ds3 states only on change |
| - zink: don't update tc info directly from cso binds |
| - zink: check sampler views pointer before loop |
| - zink: add fastpaths for no-op sampler/view rebinds |
| - nir/lower_tex: ignore saturate for txf ops |
| - radv: pre-init surface info |
| - ci: add a test-dozen-deqp flake |
| - lavapipe: handle multiview queries |
| - zink: fix assert for inline uniform invalidation with generated gs bound |
| - zink: fix unbinding generated gs on real gs bind |
| - zink: get new bda when rebinding invalidated buffers |
| - lavapipe: create a desc set for immutable sampler layouts |
| - lavapipe: split out descriptor stage setting |
| - lavapipe: EXT_descriptor_buffer |
| - lavapipe: VK_EXT_mutable_descriptor_type |
| - llvmpipe: flush/reference fs ubos on bind |
| - zink: do initial program unref during program creation |
| - zink: fix separate shader program refcounting |
| - docs: update lavapipe extensions |
| - zink: don't destroy swapchain on initial CreateSwapchainKHR fail |
| - aux/trace: fix bindless texture dumping |
| - vk/wsi/x11: move surface alpha check from get_caps to creation |
| - vk/wsi/x11: handle geometry updating more asynchronously |
| - vk/wsi/x11: stop roundtripping on presentation |
| - vk/wsi: unify dmabuf exporting |
| - vk/wsi: add error logging for syncfile import/export failures |
| - zink: fix anv ci flakes (for real this time) |
| - zink: fix batch disambiguation on first submit |
| - zink: set pipeline dynamic state count after all dynamic states are set |
| - zink: be even dumber about buffer refs when replacing storage |
| - zink: emit SpvCapabilitySampleMaskPostDepthCoverage with SpvExecutionModePostDepthCoverage |
| - zink: fix the fix for separate shader program refcounting |
| - kopper: handle pixmap creation failure more gracefully |
| - glxsw: check geometry of drawables on creation |
| - zink: don't clobber descriptor mode on multiple screen creation |
| - nir: fix slot calculations for compact variables with location_frac |
| - lavapipe: use the component offset directly for xfb |
| - glsl: only explicitly check GS components in PSIZ injection with output variables |
| - lavapipe: don't check geometry for fb attachments |
| - zink: better handle separate shader dsl creation when no bindings exist |
| - zink: force image barriers after dmabuf import |
| - zink: use VK_WHOLE_SIZE when binding null db buffer descriptors |
| - zink: unset line stipple ds3 state flags when stipple not available |
| - nir/lower_io_to_scalar: fix 64bit io splitting |
| - nir/linking_helpers: force type matching in does_varying_match |
| - zink: add batch refs for transient images |
| - zink: fix zs resolve attachment indexing |
| - zink: don't add VK_IMAGE_USAGE_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT for transient images |
| - zink: don't append msrtss to dynamic render if not supported |
| - zink: set msrtss depth resolve mode when enabled |
| - zink: add more locking for pipeline cache |
| - aux/trace: fix winsys handle dumping |
| - zink: generated tcs is on the tes, not the vs |
| - llvmpipe: block weird uses of subsampled formats in buffers |
| - llvmpipe: fix early depth + alpha2coverage + occlusion query interaction |
| - lavapipe: fix resolves where src image has a layer offset |
| - lavapipe: block yuv formats from getting blit feature flags |
| - zink: explicitly set non-optimal last_vertex_stage shader key on ctx create |
| - zink: fix big tcs output io |
| - zink: fix crash in lower_pv_mode_gs_store |
| - u/draw: skip zero-sized indirect draws |
| - nir/zink: fix gs emulation xfb_info sizing |
| - vk/graphics: fix CWE handling with DS3 |
| - Revert "vk/wsi/x11: handle geometry updating more asynchronously" |
| - zink: wait on async fence during ctx program removal |
| - zink: don't start multiple cache jobs for the same program |
| - zink: disable validation |
| - zink: be more precise about flagging rp changes around unordered u_blitter |
| - zink: fix linear modifier dmabuf imports |
| - aux/tc: handle stride mismatch during rp-optimized subdata |
| - zink: always add a per-prog ref for gpl libs |
| - zink: set is_xfb=false for all i/o variables |
| - nir/inline_uniforms: fix oob access with nir_find_inlinable_uniforms |
| - aux/tc: fix staging buffer sizing for texture_subdata |
| - aux/tc: fix address calc for segmented texture subdata |
| - glsl: check for xfb setting xfb info |
| - aux/tc: fix renderpass tracking fb state clobber scenario |
| - aux/tc: fix rp info handling around tc_sync calls |
| - aux/tc: don't use pipe_buffer_create_with_data() for rp-optimized subdata |
| - zink: flag db maps as unsynchronized |
| - lavapipe: clamp cache uuid size |
| - tu: handle unused color attachments without crashing |
| - zink: propagate rp_tc_info_updated across unordered blits |
| - zink: move swapchain fence to swapchain object |
| - zink: avoid UAF on wayland async present with to-be-retired swapchain |
| - zink: always trace_screen_unwrap in acquire |
| |
| MouriNaruto (1): |
| |
| - dzn: Fix segmentation fault when Direct3D 12 user mode driver from at least one of GPUs is not available. |
| |
| MrRobbin (1): |
| |
| - zink: Move the workaround before the EDS setting. |
| |
| Mykhailo Skorokhodov (4): |
| |
| - mesa: Implement GL_CLEAR_TEXTURE flag |
| - mesa: Fallthrough GL_SRB_DECODE_ARB pname |
| - iris: Fix memory size with disabled resizable bar |
| - nir: Rematerialize derefs after opt_dead_cf |
| |
| Mykola Piatykop (1): |
| |
| - mesa: Fix use after free. |
| |
| Nanley Chery (28): |
| |
| - iris: Allocate ZEROED BOs for shared resources |
| - iris/bufmgr: Add and use zero_bo |
| - iris/bufmgr: Handle flat_ccs for BO_ALLOC_ZEROED |
| - intel/isl: Bump the MCS halign value for BDW+ |
| - iris: Add a barrier to iris_mcs_partial_resolve |
| - intel: Implement ISL_AUX_OP_AMBIGUATE for MCS |
| - iris: Enable MCS init with ISL_AUX_OP_AMBIGUATE |
| - anv: Drop the MCS initialization performance warning |
| - anv: Enable MCS init with ISL_AUX_OP_AMBIGUATE |
| - intel/blorp: Assert an 8bpp fast clear restriction |
| - iris: Init CCS_E to COMPRESSED_NO_CLEAR for XeHP |
| - intel/blorp: Use the depth copy format more on BDW+ |
| - intel/blorp: Add depth usage check for copy format |
| - intel/blorp: Change condition for CCS_E copy formats |
| - intel/blorp: Add and use blorp_copy_get_formats |
| - iris: Use known formats for tex_cache_flush_hack |
| - iris: Drop a GFX12_CCS_E check in can_fast_clear_color |
| - intel: Rename the GFX12_CCS_E aux-usage to FCV_CCS_E |
| - iris: Avoid extra CCS_E flushes for aux mode changes |
| - iris: Avoid FCV_CCS_E for shader image accesses |
| - iris: Assert against FCV_CCS_E for blitter writes |
| - intel/blorp: Avoid 32bpc fast clear sampling issue |
| - Revert "iris: Add missed tile flush flag" |
| - iris: Drop the RT flush for PIPE_BARRIER_TEXTURE |
| - iris: Drop GPGPU Tex Invalidate restriction for TGL+ |
| - isl: Add and use size and alignment calculators |
| - anv: Don't support ASTC images with modifiers |
| - intel/blorp: Ambiguate after CCS resolves on gfx7-8 |
| |
| Oskar Rundgren (20): |
| |
| - pvr: Allow block compressed source blit |
| - pvr: Transfer PBE source snorm format should be signed |
| - pvr: Transfer PBE gamma is unset |
| - pvr: Transfer fix blit with multiple emits |
| - pvr: Transfer multiple emits clip rectangle |
| - pvr: Add back S8_UINT support |
| - pvr: Add PBE packmode for depth stencil formats |
| - pvr: Transfer add depth merge support for X8_D24 |
| - pvr: Transfer add s8_uint support |
| - pvr: PBE fix mesa pipe swizzle conversion |
| - pvr: Transfer ignore non zero stride for twiddled surface |
| - pvr: Transfer block compressed with 3d twiddled layout |
| - pvr: Transfer support flipped rectangle mapping |
| - pvr: Transfer remove byte unwind workaround |
| - pvr: fix texel unwind workaround mappings |
| - pvr: Transfer check valid source address mask |
| - pvr: Transfer optimisation remove unused features from API |
| - pvr: Transfer image to buffer dest rect |
| - pvr: Fix transfer image clearing PBE packmodes |
| - pvr: add block compressed formats blit support |
| |
| Patrick Lerda (22): |
| |
| - r600: fix refcnt imbalance related to r600_set_vertex_buffers() |
| - r600: fix refcnt imbalance related to evergreen_set_shader_images() |
| - lima: fix refcnt imbalance related to framebuffer |
| - r600/sfn: fix memory leak related to sh_info->arrays |
| - aux/draw: fix memory leak related to ureg_get_tokens() |
| - crocus: fix refcnt imbalance related to framebuffer |
| - crocus: fix refcnt imbalance related to crocus_create_surface() |
| - r600: fix refcnt imbalance related to atomic_buffer_state |
| - radeonsi: set proper drm_amdgpu_cs_chunk_fence alignment |
| - crocus: fix scratch_bos memory leak |
| - mesa: fix refcnt imbalance related to egl_image_target_texture() |
| - glthread: fix typo related to upload_vertices() |
| - mesa: fix refcnt imbalance related to _mesa_delete_semaphore_object() |
| - mesa/st: fix refcnt imbalance related to st_feedback_draw_vbo() |
| - mesa/st: fix buffer overflow related to set_program_string() |
| - r600: fix r600_draw_vbo() buffer overflow |
| - nouveau: fix nouveau_heap_destroy() memory leak |
| - r600: fix cayman_convert_border_color() swizzle behavior |
| - util/blitter: fix util_blitter_clear_buffer() refcnt imbalance |
| - util/blitter: revert util_blitter_clear_buffer() |
| - radeonsi: fix refcnt imbalance related to util_blitter_save_fragment_constant_buffer_slot() |
| - panfrost: fix refcnt imbalance related to blitter |
| |
| Paul Gofman (2): |
| |
| - driconf: add a workaround for Captain Lycop: Invasion of the Heters |
| - driconf: add a workaround for Rainbow Six Extraction |
| |
| Paulo Zanoni (9): |
| |
| - iris: Store prime fd of external bos for Xe KMD |
| - iris: Add functions to import and export implicit sync state |
| - iris: Extend iris_bo_wait_syncobj() to wait on external implicit syncobj |
| - iris: Add iris_implicit_sync struct and functions to do implicit synchronization for Xe kmd |
| - iris: also avoid isl_memcpy_linear_to_tiled for Tile64 |
| - intel/isl: tile 64 calculations work with 1D surfaces |
| - iris: assert bufmgr->bo_deps_lock is held |
| - iris: avoid stack overflow in iris_bo_wait_syncobj() |
| - iris: assert(bo->deps) after realloc() |
| |
| Pavel Ondračka (33): |
| |
| - r300: fix unconditional KIL on R300/R400 |
| - r300: add CI list of known rv370 dEQP failures |
| - r300: remove simple duplicate ARL instructions |
| - r300: fuse ROUND and ARL to ARR |
| - r300: remove nir round lowering |
| - r300: enable PIPE_CAP_TGSI_TEXCOORD |
| - r300: fail linking instead of using dummy shaders |
| - CODEOWNERS: add r300 driver |
| - r300: move nir stuff to r300_nir file |
| - r300: move the ARL merging pass up in the opt loop |
| - r300: move the ROUND+ARL->ARR fusing to main optimization loop |
| - r300: optimize the load A0 pattern from wined3d |
| - r300: remove duplicate ARRs |
| - r300: be more agressive when merging A0 loads |
| - r300: remove unused SIN/COS lowering |
| - r300: remove unused SSG lowering |
| - r300: move CEIL lowering to NIR |
| - r300: remove unused FLR lowering |
| - r300: remove unused POW lowering |
| - r300: remove unused DST lowering |
| - r300: remove unused ROUND lowering |
| - r300: remove unused LIT lowering |
| - r300: remove unused opcodes from r300_tgsi_to_rc |
| - nir_opt_algebraic: don't use i32csel without native integer support |
| - r300: add partial CMP support on R5xx |
| - r300: properly count maximum used register index |
| - r300: lower undefs to zero |
| - r300: add some early safe bool lowering |
| - r300: remove most of backend contant folding |
| - r300: disable ntt regalloc for vertex shaders |
| - r300: assert that every writer has a reader |
| - r300: update RV370 failures |
| - r300: don't abort on flow control when using draw for vs |
| |
| Philipp Zabel (1): |
| |
| - etnaviv: fix segfault after compile failure |
| |
| Pierre-Eric Pelloux-Prayer (13): |
| |
| - amd: update amdgpu_drm.h |
| - amd: determine info->has_fw_based_shadowing |
| - radeonsi: implement fw based mcbp |
| - amd: update amdgpu_drm.h |
| - radeonsi: stop reporting reset to app once gpu recovery is done |
| - winsys/amdgpu: add a helper function to submit a no-op job |
| - winsys/amdgpu: use the no-op helper to detect if reset completion |
| - mesa: don't share reset status across contexts |
| - mesa: remove unused bools |
| - llvmpipe: only include old Transform includes when needed |
| - Revert "gallium/u_threaded: buffer sharedness tracking" |
| - st/mesa: check renderbuffer before using it |
| - radeonsi: emit framebuffer state after allocating cmask |
| |
| Qiang Yu (119): |
| |
| - nir: add nir_load_barycentric_optimize_amd intrinsic |
| - radeonsi: implement nir_load_barycentric_optimize_amd |
| - ac/nir/ps: lower barycentric load when bc_optimize |
| - ac/nir/ps: add force lower barycentric load options |
| - ac/nir/ps: lower sample mask input when needed |
| - ac/llvm,radeonsi: lower ps color load in nir |
| - radeonsi: add si_nir_lower_ps_color_input |
| - radeonsi: add si_nir_emit_polygon_stipple |
| - radeonsi: handle lowered ps in scan_io_usage |
| - radeonsi: monolithic ps emit prolog in nir directly |
| - radeonsi: restructure mono merged shader build |
| - radeonsi: remove separate_prolog parameter |
| - radeonsi: add si_mark_divergent_texture_non_uniform |
| - ac/llvm,radeonsi: use texture non-uniform flag as waterfall switch |
| - nir,ac/llvm,radeonsi: replace nir_load_smem_buffer_amd with nir_load_ubo |
| - ac/llvm,radeonsi: lower nir_load_point_coord_maybe_flipped in nir |
| - ac,radv: move ps arg compation to common place |
| - aco: support 32bit address in nir_load_smem_amd |
| - nir: add missing image atomic_inc/dec_wrap intrinsic |
| - aco: implement nir_bindless_image_atomic_inc/dec_wrap |
| - aco: skip scratch buffer init when its arg is not used |
| - aco: fix nir_f2u64 translation |
| - nir: add nir_export_dual_src_blend_amd intrinsic |
| - aco: move create_fs_dual_src_export_gfx11 above |
| - aco: implement nir_export_dual_src_blend_amd |
| - ac/nir/ps: use nir_export_dual_src_blend_amd when aco |
| - ac/nir/ps: add no_color_export option |
| - aco: support nir_export_amd with ps targets |
| - aco,radv: lower outputs to exports when nir for monolithic ps |
| - ac/llvm: remove output variable declaration for radv ps |
| - radv: implement nir_load_barycentric_optimize_amd |
| - ac/nir/ps: remove used nir_variable if created |
| - aco,ac/llvm,radv,radeonsi: handle ps bc optimization in nir for radv |
| - aco,radv: remove unused aco compile options |
| - aco,radv: support symbol relocation in aco |
| - aco: get scratch addr from symbol for radeonsi |
| - aco: allow no export instruction for gfx10+ fs |
| - ac/nir/cull: fix line position w culling |
| - meson: build radeonsi with aco |
| - radeonsi: add aco debug option |
| - radeonsi: add use_aco field for struct si_shader |
| - radeonsi: add shader info for frag coord and sample pos read |
| - radeonsi: add shader info uses_sampleid |
| - radeonsi: pack spi ps input fixup to a function |
| - radeonsi: init spi ps input shader config when aco |
| - radeonsi: add a raw shader binary type |
| - ac/binary: pack prefech align code to a function |
| - radeonsi: support raw shader binary upload |
| - radeonsi: support print raw shader binary |
| - radeonsi: remove ps vgpr index save when args init |
| - tgsi_to_nir: call nir_lower_int64 when required |
| - ac/llvm,radeonsi: lower idiv in nir |
| - ac/llvm,radeonsi: lower fsin/fcos in nir |
| - ac/llvm,radeonsi: lower txf offset in nir |
| - ac/llvm,radeonsi: lower ineg in nir |
| - ac/llvm,radeonsi: lower some pack/unpack ops not supported by aco |
| - ac/llvm,radeonsi: lower nir_fpow for aco and llvm |
| - radeonsi: lower some 64bit ops aco does not support |
| - radeonsi: lower vector const to scalar at last for aco |
| - radeonsi: add has_non_uniform_tex_access shader info |
| - radeonsi: lower non uniform texture access when aco |
| - radeonsi: add initial aco compile code |
| - radeonsi: add symbols to si_shader_binary |
| - radeonsi: resolve aco scratch addr symbols |
| - radeonsi: adjust ps args for aco |
| - radeonsi: pass use_aco to ac_nir_lower_ps |
| - radeonsi: clamp shadow texture reference in nir for aco |
| - ac/llvm,radeonsi: enable lower_array_layer_round_even |
| - radeonsi: fixup sampler desc for tg4 in nir |
| - radeonsi: be able to use aco compiler for mono ps |
| - ac/llvm: remove the double frcp special handling |
| - radeonsi: fix aco compile for atomic ops |
| - ac/llvm: remove redundant nir_lower_legacy_atomics |
| - radeonsi: fix uses_instanceid for merged mono shader stage |
| - aco: implement two load lds ngg intrininsic for radeonsi |
| - aco,radv: remove unused aco_shader_info fields |
| - ac/nir/ngg: don't use 8bit alu ops |
| - aco: implement load buffer with ACCESS_USES_FORMAT_AMD |
| - aco/assembler: handle ds_(add|sub)_gs_reg_rtn encoding |
| - aco: use gds reg when ordered xfb counter add |
| - aco: implement nir_xfb_counter_sub_amd |
| - aco: implement nir_bindless_image_fragment_mask_load_amd |
| - aco: use ac_get_image_dim for array check when image intrinsic |
| - radeonsi: resolve lds ngg aco symbols |
| - radeonsi: add scratch offset vs args explicitly for aco |
| - ac/llvm,radeonsi: lower nir_load_gs_vertex_offset_amd in abi |
| - ac/llvm,radeonsi: lower nir_load_merged_wave_info_amd in abi |
| - ac/llvm,radeonsi: lower load_workgroup_num_input_(vertices|primitives) in abi |
| - ac/llvm,radeonsi: lower nir_load_initial_edgeflags_amd in abi |
| - ac/llvm,radeonsi: lower nir_load_packed_passthrough_primitive_amd in abi |
| - ac/llvm,radeonsi: lower nir_load_ordered_id_amd in abi |
| - ac/llvm,radeonsi: lower nir_load_ring_esgs_amd in abi |
| - nir,ac/llvm,radeonsi: replace nir_buffer_atomic_add_amd with ssbo atomic |
| - radeonsi: fill aco shader info for mono standalone vs |
| - radeonsi: calculate needed lds size when upload raw binary for vs |
| - radeonsi: use nir_umul_high for fast udiv |
| - radeonsi: always use scoped barrier |
| - ac/llvm: remove unused barrier implementation |
| - radeonsi: enable aco for mono standalone vs |
| - aco,radv: remove unused gs aco shader info |
| - ac/nir,radv: add 1 dword to LS/HS vertex stride |
| - ac/nir,radv: add 1 dword to ES/GS item size |
| - radeonsi: add scratch_offset arg for aco tcs |
| - radeonsi: lower nir_load_tess_rel_patch_id_amd in abi for aco |
| - ac/llvm,radeonsi: lower nir_load_ring_tess_offchip_amd in abi |
| - radeonsi: enable aco support for mono standalone tcs |
| - radeonsi: add scratch_offset arg for aco tes |
| - radeonsi: init tes aco shader info fields |
| - radeonsi: update lds size for tes |
| - radeonsi: enable aco support for standalone tes |
| - radeonsi: add scratch_offset arg for aco gs |
| - ac/llvm,radeonsi: lower nir_load_ring_gsvs_amd in abi |
| - radeonsi: enable aco for standalone gs |
| - radeonsi: enable aco support for gs copy shader |
| - radeonsi: add scratch_offset arg for aco cs |
| - ac/llvm,radeonsi: lower nir_load_user_data_amd in abi |
| - radeonsi: fix crash when AMD_DEBUG=cs,initnir |
| - radeonsi: enable aco support for compute shader |
| - ac/nir/ngg: fix ngg_gs_clear_primflags crash |
| |
| QwertyChouskie (1): |
| |
| - docs/features.txt(fix): mark VK_EXT_pipeline_robustness as supported on radv |
| |
| Rajnesh Kanwal (9): |
| |
| - pvr: Add support to process transfer and blit cmds |
| - pvr: Implement vkCmdCopyBufferToImage API. |
| - pvr: Implement vkCmdCopyImage2KHR API. |
| - pvr: Implement vkCmdBlitImage API. |
| - pvr: Implement vkCmdClearColorImage API. |
| - pvr: Implement vkCmdCopyImageToBuffer2 API. |
| - pvr: Implement vkCmdFillBuffer API. |
| - pvr: Implement vkCmdResolveImage2KHR API. |
| - pvr: Implement vkCmdClearDepthStencilImage API. |
| |
| Rhys Perry (92): |
| |
| - nir: add is_gather_implicit_lod |
| - vtn: set is_gather_implicit_lod |
| - aco: support implicit LOD for nir_texop_tg4 |
| - ac/llvm: support implicit LOD for nir_texop_tg4 |
| - aco: remove SMEM_instruction::prevent_overflow |
| - aco: use apply_nuw_to_ssa() with load_smem_amd |
| - ac/nir/ps: fix null export write mask miss set to 0xf |
| - aco: don't move exec reads around exec writes |
| - aco: don't move exec writes around exec writes |
| - radv: fix bc optimization with POS_W_FLOAT_ENA(1) |
| - aco/ra: create M0-affinities for s_sendmsg |
| - aco/gfx11: fix VMEM/DS->VALU WaW/RaW hazard |
| - amd/drm-shim: move device list to external file |
| - amd/drm-shim: add polaris10 |
| - amd/drm-shim: add vega10 |
| - amd/drm-shim: add navi10 |
| - aco: add get_op_fixed_to_def() helper |
| - aco: consider how definitions fixed to operands can change register demand |
| - nir/fold_16bit_tex_image: skip tex instructions with backend1 |
| - nir,vtn,aco,ac/llvm: make cube_face_coord_amd more direct |
| - ac/nir: add pass for lowering 1d/cube coordinates |
| - ac/nir: round layer in ac_nir_lower_tex |
| - radv,radeonsi: use ac_nir_lower_tex |
| - nir/lower_tex: remove lower_array_layer_round_even |
| - ac/nir: add fix_derivs_in_divergent_cf |
| - aco: remove unused RegType |
| - aco: let p_start_linear_vgpr take an operand |
| - aco: add MIMG_instruction::strict_wqm |
| - aco: implement strict_wqm_coord_amd |
| - aco: implement texture samples with strict WQM coordinates |
| - radv: use fix_derivs_in_divergent_cf |
| - aco/tests: improve performance of declaration parsing |
| - aco/tests: add fix_derivs_in_divergent_cf tests |
| - aco: fix update_alu(clear=true) for exports |
| - aco: use pass_flags to recover s_delay_alu cycles |
| - aco: insert s_delay_alu on the linear CFG |
| - aco: improve printing of s_delay_alu |
| - radv: allow wave32 for geometry shaders |
| - aco: fix has_color_exports=true for mrtz exports |
| - aco/tests: add discard export target tests |
| - aco: fix ds_sub_gs_reg_rtn validation |
| - radv: initialize aco_compiler_options::is_opengl |
| - radv: correctly skip vertex loads with packed formats |
| - aco: consider position/primitive exports around memory barriers |
| - ac/nir: use scoped barriers to finish stores before exports |
| - aco: remove memory_barrier_buffer implementation |
| - aco: mask bits source of s_bfe |
| - aco/tests: test that s_bfe bits is masked |
| - util: fix gc_alloc_size alignment |
| - util/tests: add gc_alloc_size alignment tests |
| - aco: run nir_lower_int64 after nir_opt_uniform_atomics |
| - ac: fix PIPE_FORMAT_R11G11B10_FLOAT DST_SEL_W |
| - radv: refactor CS subgroup size determination |
| - radv: use wave32 for small workgroups |
| - aco: don't try to form load+store clauses |
| - aco/gfx11: use s_clause with stores |
| - aco/gfx11: schedule for VMEM store clauses |
| - aco: don't set exec_hi for wave32 scan reductions |
| - amd/drm-shim: use fixed-width types |
| - nir/peephole_select: allow some invocation broadcast intrinsics |
| - aco: include helpers in emit_uniform_{reduce,scan} |
| - nir,aco: add INCLUDE_HELPERS index to reduce intrinsic |
| - nir/opt_intrinsic: optimize quad vote |
| - radv: use nir_opt_intrinsics |
| - aco,ac/llvm,ac/nir,vtn: unify cube opcodes |
| - nir: split nir_lower_mov64 |
| - radv: use nir_lower_conv64 |
| - radv: call nir_lower_int64 later |
| - radeonsi: use nir_lower_conv64 |
| - aco: remove 64-bit integer conversion opcodes |
| - ac/llvm: fix AC_TM_CHECK_IR |
| - radv: fix radv_get_ballot_bit_size with CS |
| - ac/llvm: fix wave32 ac_build_mbcnt_add with 64-bit mask |
| - ac/llvm: skip ballot zext for 32-bit dest with wave32-as-wave64 |
| - radv: add conformant_trunc_coord to cache UUID |
| - ac/nir: always round cube array layers |
| - nir/unsigned_upper_bound: fix phi(bcsel) |
| - nir/opt_dead_cf: remove nodes after a jump earlier |
| - aco: insert s_nop before VGPR deallocation |
| - radv: workaround WWZ exporting index=1 through location=1 |
| - radv: correctly skip MRT output NaN fixup for meta shaders |
| - aco: summarize register demand after handling branches |
| - aco: don't create sendmsg(dealloc_vgprs) if scratch is used |
| - radv: disable 64-bit color attachments |
| - aco: fix p_bpermute_gfx6 with input at non-zero byte |
| - radv: fix 128bpp comp-to-single clears |
| - aco/spill: skip p_branch in process_block |
| - aco/spill: add all live-in to merge block spill candidates |
| - aco/optimizer_postRA: check overwritten_subdword in is_overwritten_since() |
| - aco: check logical_phi_info at p_logical_end when eliminating exec writes |
| - aco: remove unused p_logical_end check when optimizing branching sequence |
| - aco: reset prefetch in the correct block after removing the exit |
| |
| Rob Clark (58): |
| |
| - freedreno/a6xx: Fix valid_format_cast logic for newer a6xx |
| - freedreno: Remove unused fd_batch_reset() |
| - freedreno: Inline single-caller helpers |
| - freedreno: Extra casting to make C++ happy |
| - freedreno/registers: C++ struct casting |
| - util/log: Add missing "const" |
| - freedreno/ir3: More perfetto tracing |
| - mesa/nir: Add some perfetto traces |
| - freedreno/perfetto: Add shader_id for compute stages |
| - freedreno: Add dirty state logging |
| - freedreno/a6xx: Pass ring to __ONE_REG() |
| - freedreno: Add more tracepoint fields |
| - freedreno: Fix resource tracking vs rebind/invalidate |
| - freedreno/a6xx: Change a618 tile_align_h back to 32 |
| - dri/android: Fix MSAA resolve |
| - Revert "ci: disable all a306/a530/a630 jobs" |
| - freedreno/a6xx: Rework set_bin_size() |
| - freedreno/a6xx+: Use template to handle a6xx vs a7xx differences |
| - freedreno/batch: Add helper to set fb state |
| - freedreno/a6xx: Move LRZ clear to blitter |
| - freedreno/a6xx: Add ctx->emit_sysmem() |
| - freedreno/a6xx: Simplify per-tile conditional IBs |
| - freedreno/a6xx: Switch to batch->cleared |
| - freedreno/a6xx: Split tile loads and clears |
| - freedreno/a6xx: Introduce batch subpasses |
| - freedreno/a6xx: Per-subpass LRZ |
| - freedreno/a6xx: New subpass on mid-frame clears |
| - freedreno/a6xx: Move LRZ clears to gmem |
| - freedreno/a6xx: Actually use LRZ for ms |
| - freedreno/a5xx+a6xx: Don't allocate LRZ for z32 |
| - tu: Move queue deletion to last |
| - mesa: Skip update_gl_clamp() if samplers need clamp |
| - freedreno/a6xx: Template specialization for draw type |
| - freedreno/a6xx: Template specialization for pipeline type |
| - freedreno/a6xx: Optimize max_indices calculation |
| - freedreno/batch: Move submit bo tracking to batch |
| - freedreno/drm: Don't try to export suballoc bo |
| - freedreno: Handle export error handling |
| - freedreno: Add aux-context support |
| - freedreno: Reallocate on unshared export |
| - freedreno/a6xx: Clean up open coded flushes |
| - freedreno/a6xx: Stop using fd_wfi() |
| - freedreno/a6xx: Add missing cap |
| - freedreno/a6xx: Fix xfb stream configuration |
| - freedreno/a6xx: Remove primitives_relocw() |
| - freedreno/a6xx: GL_ARB_transform_feedback_overflow_query |
| - freedreno/a6xx: Split primitives and pipeline-stats queries |
| - freedreno/a6xx: Handle nested pipeline stats queries |
| - freedreno: Handle compute queries |
| - freedreno/a6xx: GL_ARB_pipeline_statistics_query |
| - freedreno/a6xx: Enable gl46 |
| - freedreno: Add extra assert |
| - freedreno/batch: Add driver-thread assert |
| - freedreno/a6xx: Directly invalidate on samp view update |
| - freedreno/a6xx: Use idalloc for samp/view seqno's |
| - freedreno/fdperf: Use common device info helpers |
| - freedreno/drm/virtio: Trigger host side wait boost |
| - tu/drm: Add missing error path cleanup |
| |
| Robert Beckett (1): |
| |
| - winsys/panfrost: Fix a scanout resource leak |
| |
| Robert Mader (1): |
| |
| - egl/wayland: wait for compositor to release shm buffers |
| |
| Rohan Garg (41): |
| |
| - anv: use the workaround framework for WA 14013111325 |
| - hasvk: drop dead code |
| - iris: use the workaround framework for WA 14013111325 |
| - anv: use the common vulkan runtime to do the heavy lifting |
| - anv: drop duplicated nir_opt_dce passes |
| - intel: infer scalar'ness locally for brw_postprocess_nir |
| - intel: drop unused is_scalar function parameter in brw_nir_apply_key |
| - intel: update comments about non-existent function parameter |
| - intel: infer scalar'ness locally for brw_vectorize_lower_mem_access |
| - anv: drop duplicate checks when setting the compressed bit |
| - iris: correctly set alignment to next power of two for struct size |
| - ac/surface: make sure alignment is a POT |
| - freedreno: set alignment to next POT |
| - util: fix ROUND_DOWN_TO alignment type |
| - util: migrate alignment functions and macros to use ALIGN_POT |
| - util: revert back to ALIGN since it moved to util |
| - util: move pot functions to use existing macros |
| - anv: enable single texel alignment |
| - isl: add helper to check if aux usage is CCS_E |
| - anv: set aux usage to GFX12_CCS_E if a platform needs WA 14010672564 |
| - anv: limit non zero fast clear check to GFX12_CCS_E |
| - anv: fix incorrect asserts when combining CPS and per sample interpolation |
| - hasvk: enable single texel alignment |
| - anv: split ANV_PIPE_RENDER_TARGET_BUFFER_WRITES for finer grained flushing |
| - anv: move WA 1607854226 to use the WA infrastructure |
| - intel/compiler: construct masks instead of using magic values |
| - intel/compiler: reuse previously computed bitsize |
| - anv: retry batchbuffer submission with i915 |
| - iris: migrate WA 14013910100 to use the WA framework |
| - iris: migrate WA 14016118574 to use the WA framework |
| - iris: fix iris for WA 16013000631 |
| - intel/perf: add perf query support for Intel Raptorlake |
| - anv: use the correct GFX_VERx10 macro for WA |
| - anv,iris: program the maximum number of threads on compute queue init |
| - anv: partially revert 2e8b1f6d |
| - anv: drop dead ifdef |
| - iris: use the correct WA macros and lineage numbers |
| - anv: use the lineage number for WA |
| - crocus: fix GFX_VERx10 macro |
| - blorp: drop undefined macro |
| - iris: migrate preemption streamwout wa to WA infra |
| |
| Roland Scheidegger (2): |
| |
| - llvmpipe: minor cleanups in line rendering code |
| - llvmpipe: fix some corner cases with line rendering |
| |
| Romain Failliot (1): |
| |
| - docs(fix): remove last ref to i965 in features.txt |
| |
| Ruijing Dong (19): |
| |
| - radeonsi/vcn: add macros used in av1 encoding |
| - radeonsi/vcn: enable 2 pass search center map |
| - radeonsi/vcn: enable swizzle mode in encoding ref frames. |
| - radeonsi/vcn: merge get_output_format_param function |
| - radeonsi/vcn: remove extra zero bytes from bitstream |
| - radeonsi/vcn: add av1 dpb variables and cdf table |
| - gallium/pipe: add av1 encoding data structure in pipe |
| - radeonsi/vcn: add av1 enc data structure |
| - radeonsi/vcn: add some av1 encoding function |
| - radeonsi/vcn: add av1 encoding ib packages and get_info |
| - frontends/va: adding va av1 encoding functions |
| - radeonsi/vcn: use PIPE_ENC_FEATURE enum |
| - frontends/va: define va av1 encoding caps |
| - radeonsi/vcn: correct cropping for hevc case |
| - radeonsi/vcn: fix decoding bs buffer alignement issue. |
| - gallium/pipe: add interface update_decoder_target |
| - radeonsi/vcn: apply update_decoder_target logic |
| - frontends/va: remove private member and update target buffer |
| - radeonsi/vcn: change max_poc to fixed value for hevc encoder. |
| |
| Ryan Houdek (1): |
| |
| - util: move check for AVX512 |
| |
| Ryan Neph (2): |
| |
| - virgl: add debug flag to force synchronous GL shader compilation |
| - virgl: check a debug option again at context creation |
| |
| Sagar Ghuge (20): |
| |
| - anv: Factor out code from anv_image_hiz_clear |
| - anv: Move and make anv_can_hiz_clear_ds_view non-static |
| - anv: Fast clear depth/stencil surface in vkCmdClearAttachments |
| - anv: Set CS stall bit during HIZ_CCS_WT surface fast clear |
| - iris: Set CS stall bit during HIZ_CCS_WT surface fast clear |
| - intel/genxml: Add CCS cache flush field to PIPE_CONTROL |
| - intel/genxml: Add Compute/Blitter CCS aux invalidation register |
| - anv: Add CCS cache flush bits to anv_pipe_bits |
| - anv: Fix AUX-TT invalidation |
| - anv: implement recommended flush/wait of AUX-TT invalidation on compute |
| - iris: Add CCS cache flush bits |
| - iris: Fix AUX-TT invalidation |
| - iris: implement recommended flush/wait of AUX-TT invalidation |
| - intel/ds: Track CCS cache flush bit |
| - iris: Use correct CCS0 aux-map register offset |
| - intel/genxml: Fix typo in CCS cache flush enable |
| - intel/genxml: Drop incorrect compute aux-inv register entry |
| - anv: Drop depth cache flush requirement after depth clear/resolve |
| - iris: Drop depth cache flush requirement after depth clear/resolve |
| - blorp: Drop unnecessary assertions in blorp_can_hiz_clear_depth |
| |
| Samuel Holland (3): |
| |
| - Android.mk: Allow building only Vulkan drivers |
| - Android.mk: Explicitly enable/disable LLVM support |
| - Android.mk: Only link LLVM for radeonsi, not amd_vk |
| |
| Samuel Pitoiset (203): |
| |
| - radv: fix detecting FMASK_DECOMPRESS/DCC_DECOMPRESS meta pipelines |
| - vulkan: ignore rasterizationSamples when the state is dynamic |
| - radv: try to keep HTILE compressed for READ_ONLY_OPTIMAL layout |
| - radv: re-emit the guardband state when related PSO are bound |
| - radv: tidy up dirtying RBPLUS state in radv_bind_dynamic_state() |
| - radv: disable fast-clears with CMASK for 128-bit formats |
| - radv: require DRM 3.27 |
| - radv/amdgpu: remove legacy code path for creating the BO list |
| - radv/amdgpu: remove legacy code for querying context status |
| - radv: do not allow 1D block-compressed images with (extended) storage on GFX6 |
| - radv: fix usage flag for 3D compressed 128 bpp images on GFX9 |
| - radv: wait for occlusion queries in the resolve query shader |
| - radv: delay enabling/disabling occlusion queries at draw time |
| - radv: track DB_COUNT_CONTROL changes to avoid context rolls |
| - radv: emit PIXEL_PIPE_STAT_CONTROL in the gfx preamble for GFX11 |
| - radv: use gfx_level in radv_flush_occlusion_query_state() |
| - radv: update binning settings to work around GPU hangs |
| - radv/ci: remove one expected test failure on PITCAIRN |
| - radv/amdgpu: fix adding continue preambles and postambles BOs to the list |
| - Revert "ci/radv: Demote navi21 to manual until recent flakiness resolves." |
| - radv: add the perf counters BO to the preambles BO list |
| - radv: do not overallocate the CS array during submissions |
| - ac/sqtt: add rgp_sqtt_marker_cb_id definition |
| - ac/sqtt: add a helper to get cmdbuf IDs per queue |
| - radv: reserve command buffer index for SQTT |
| - docs: rename ACO_DEBUG=noscheduling to ACO_DEBUG=nosched |
| - docs: add missing ACO_DEBUG=force-waitdeps |
| - radv: only enable extendedDynamicState3ConservativeRasterizationMode on GFX9+ |
| - ac/spm: introduce ac_spm_trace and ac_spm_get_trace() |
| - ac/spm: rename ac_spm_trace_data to ac_spm |
| - ac/sqtt: add a helper for adding clock calibration records |
| - ac/sqtt: add helpers for initializing ac_thread_trace_data |
| - ac/sqtt: initialize clock calibration/queue info/queue event records |
| - radv/sqtt: sample CPU/GPU clocks before starting the trace |
| - radv/sqtt: add support for queue info |
| - ac/sqtt: add new bits to rgp_sqtt_marker_barrier_end |
| - ac/sqtt: add missing EventUnknown to rgp_sqtt_marker_event_type |
| - ac/rgp: update SQTT_FILE_CHUNK_TYPE_API_INFO to minor version 2 |
| - ac/rgp: update SQTT_FILE_CHUNK_TYPE_ASIC_INFO to minor version 5 |
| - ac/sqtt: add ac_sqtt_se_is_disabled() helper |
| - ac/sqtt: add ac_sqtt_get_trace() helper |
| - radv: do not abort when the SQTT buffer resize failed |
| - ac/rgp: remove ac_thread_trace_data from ac_thread_trace |
| - ac,radv,radeonsi: rename thread_trace to sqtt everywhere |
| - ac/nir: fix 8-bit/10-bit PS exports clamping |
| - radv: enable RADV_THREAD_TRACE_CACHE_COUNTERS by default |
| - radv: fix dynamic depth clamp enable support |
| - radv: fix invalid type for usage in radv_get_buffer_memory_requirements() |
| - radv: fix fast-clearing images with VK_REMAINING_{ARRAY_LAYERS,MIP_LEVELS} |
| - radv: replace radv_get_layerCount by vk_image_subresource_layer_count() |
| - radv: replace radv_get_levelCount() by vk_image_subresource_level_count() |
| - radv/meta: rename dest to dst |
| - radv: disable RB+ blend optimizations on GFX11 when a2c is enabled |
| - radv: use vk_image::mip_levels instead of radv_image::info::levels |
| - radv: use vk_image::array_layers instead of radv_image::info::array_size |
| - radv: use vk_image::samples instead of radv_image::info::storage_samples |
| - radv: use vk_image::samples instead of radv_image::info::samples |
| - radv: use vk_image::extent instead of radv_image::info::{width,height,depth} |
| - radv: remove ac_surf_info from radv_image |
| - ac/spm: switch to SPM version 2.0 |
| - vulkan: Update XML and headers to 1.3.250 |
| - radv: implement VK_EXT_attachment_feedback_loop_dynamic_state |
| - radv: advertise VK_EXT_attachment_feedback_loop_dynamic_state |
| - spirv: ignore SpvDecorationInvariant warning on struct members |
| - radv/ci: stop setting MESA_SPIRV_LOG_LEVEL |
| - radv: reset the emitted VS prolog when a new vertex shader is bound |
| - radv: dirty the dynamic vertex input state only when needed |
| - radv: re-emit fragment shading rate state when PA_CL_VRS_CNTL changes |
| - radv: configure PA_CL_VRS_CNTL entirely from the cmd buffer |
| - radv: implement dynamic sample locations enable |
| - radv: handle NULL fragment shaders when recording cmdbuf |
| - radv: handle NULL fragment shaders when creating graphics pipelines |
| - radv: rework the checks for implicit exports with GPL |
| - radv: allow to determine NGG settings with a NULL fragment shader |
| - radv: stop compiling a noop FS when the application doesn't provide a FS |
| - radv: advertise VK_EXT_tooling_info |
| - radv: reset the emitted PS epilog when a new fragment shader is bound |
| - radv: remove unused pipeline param in radv_generate_ps_epilog_key() |
| - radv: stop using the pipeline for determining the null export workaround |
| - radv: fix emitting VRS state with a null fragment shader |
| - radv: fix resetting VRS if the graphics pipeline doesn't enable it |
| - radv: fix a sync issue with primitives generated query and NGG/legacy |
| - amd/drm-shim: add navi21 |
| - amd/drm-shim: add pitcairn |
| - amd/drm-shim: add bonaire |
| - amd/drm-shim: update README about which file to modify |
| - ci: build drm-shim in debian-testing |
| - ci,radv: use drm-shim instead of the null winsys for radv-fossils |
| - ci: stop using the hang-detection tool for vkd3d-proton |
| - ci: rework vkd3d-proton runner and fix detecting failures |
| - radv: reserve cmdbuf space in radv_flush_gfx2ace_semaphore() |
| - radv: bump the global VRS image size to maximum supported FB dimensions |
| - radv: disable IMAGE_USAGE_STORAGE with depth-only and stencil-only formats |
| - radv: remove useless check about USAGE_STORAGE for TC-compat HTILE |
| - nir: add nir_intrinsic_load_poly_line_smooth_enabled |
| - radeonsi: lower nir_intrinsic_load_poly_line_smooth_enabled_amd |
| - nir: lower smooth lines conditionally using the new intrinsic |
| - radv: track if the smoothLines features is enabled in the device |
| - radv: determine if smooth lines can be used in the pipeline key |
| - radv: declare a new user SGPR for the dynamic line rasterization mode |
| - radv: lower nir_intrinsic_load_poly_line_smooth_enabled_amd |
| - radv: add support for smooth lines |
| - radv: enable smoothLines |
| - radv: apply a bug workaround for smoothing on GFX6 |
| - radv: do not enable VRS flat shading if the VRS builtin is read |
| - zink/ci: update VANGOGH expected list of failures |
| - vulkan/pipeline_cache: remove a bogus assert when inserting objects |
| - zink/ci: skip arb_texture_buffer_object@texture-buffer-size-clamp* with RADV |
| - radv: fix copying 2D to 3D images |
| - ci: uprev vkd3d-proton to 2.9 |
| - amd: fix 64-bit integer color image clears |
| - radv: rework configuring VGT_SHADER_STAGES_EN |
| - radv/ci: update list of expected failures since Vulkan loader 1.3 |
| - radv/ci: skip tests that timeout since Vulkan loader 1.3 |
| - vulkan: Update XML and headers to 1.3.251 |
| - radv: advertise VK_EXT_dynamic_rendering_unused_attachments |
| - aco: remove nir_intrinsic_load_barycentric_at_sample occurences |
| - radv/ci: removed expected failures that are skipped now |
| - radv/nir: use ac_nir_unpack_arg() for packed shader input user SGPRS |
| - radv: introduce SHIFT/MASK for unpacking shader input args |
| - radv: regroup fragment shader user SGPRs emission |
| - radv: merge all FS user SGPRs into one using packed arguments |
| - spirv: add support for SpvCapabilityFragmentBarycentricKHR |
| - spirv,nir: add support for BaryCoord{NoPersp}KHR builtins |
| - spirv,nir: add support for SpvDecorationPerVertexKHR |
| - nir/lower_io: add nir_intrinsic_load_input_vertex to is_input() |
| - nir: print locations for per-vertex fragment shader inputs |
| - zink/ci: remove useless RADV_PERFTEST=gpl |
| - radv: initialize the device cache UUID even if on-disk cache is disabled |
| - nir: add nir_intrinsic_load_provoking_vtx_amd |
| - radv: add support for nir_intrinsic_load_provoking_vtx_amd |
| - radv: track if the rasterization primitive is known at compile time |
| - nir: add nir_intrinsic_load_rasterization_primitive_amd |
| - radv: add support for nir_intrinsic_load_rasterization_primitive_amd |
| - radv: handle per_vertex variables when gathering FS inputs |
| - radv: set ROTATE_PC_PTR for custom interpolations |
| - radv: configure RSRC1.LOAD_PROVOKING_VTX for the fragment shader |
| - radv: add a NIR pass that lower fragment shader barycentric intrinsics |
| - radv: gather info about nir_intrinsic_load_sample_positions_amd |
| - radv: advertise VK_KHR_fragment_shader_barycentric on GFX10.3+ |
| - radv: add a helper for emitting a null depth/stencil target |
| - radv: reset more DB registers when emitting a null ds target |
| - radv: emit DB_RENDER_CONTROL as part of the framebuffer |
| - radv: disable HTILE compression only when layouts are compressed |
| - radv/ci: update the list of expected failures on STONEY |
| - radv: gather info about load_poly_line_smooth_enabled |
| - radv: add a helper for forcing VRS 1x1 in some situations |
| - radv: do not force VRS 1x1 when smooth lines are enabled |
| - radv: fix smooth lines with graphics pipeline library |
| - radv: fix re-emitting some dynamic states when the previous FS is NULL |
| - radv: fix re-emitting early_z/late_z when the bound PS changes |
| - radv: reset some dynamic states when the fragment shader stage is unbound |
| - radv: remove unused radv_dgc_token struct |
| - radv: add dgc_emit_state() helper |
| - radv: add dgc_emit_push_constant() helper |
| - radv: add dgc_emit_vertex_buffer() helper |
| - radv: add dgc_emit_draw() helper |
| - radv: add dgc_emit_draw_indexed() helper |
| - radv: add dgc_emit_index_buffer() |
| - radv: do not use IB for the GFX preamble with RADV_DEBUG=noibs |
| - radv: use IB for the GFX preamble on GFX6 |
| - radv: reserve space for shadowed regs |
| - radv/amdgpu: fix a buffer overflow for submissions with RADV_DEBUG=noibs |
| - radv/amdgpu: remove useless assert in radv_amdgpu_winsys_cs_submit_internal() |
| - radv/amdgpu: add cs_execute_ib() for executing IBs |
| - radv: use cs_execute_ib() for GFX, MBCP and DGC IBs |
| - vulkan/runtime: call CmdSetDepthBias2EXT() from CmdSetDepthBias() |
| - radv: implement VK_EXT_depth_bias_control |
| - radv: advertise VK_EXT_depth_bias_control |
| - radv: implement padding cmdbuffer for DGC on GFX6 |
| - radv: enable NV_device_generated_commands on GFX6 |
| - radv: reserve more space in CS for SQTT |
| - radv/amdgpu: fix dumping cs with RADV_DEBUG=noibs |
| - radv/amdgpu: dump all cs with RADV_DEBUG=noibs |
| - radv: only dirty the index type when necessary with DGC |
| - radv: only dirty the active push constant stages with DGC |
| - radv: adjust alignment of the preprocess buffer with DGC |
| - radv/amdgpu: use the correct IB size when growing a CS with RADV_DEBUG=noibs |
| - radv/amdgpu: rework growing a CS with the chained IB path slightly |
| - radv/amdgpu: do not set the IB size when ending a CS with RADV_DEBUG=noibs |
| - radv/amdgpu: use the array of IB buffers for the chained IB path |
| - radv/amdgpu: use cs_finalize() when growing a CS |
| - radv/amdgpu: rename old_ib_buffers to ib_buffers |
| - radv/amdgpu: add a helper to get a new IB |
| - radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build |
| - radv/amdgpu: workaround a kernel bug when replacing sparse mappings |
| - radv/amdgpu: add more small helpers for managing CS |
| - radv/amdgpu: add support for executing DGC cmdbuf with RADV_DEBUG=noibs |
| - radv: allow NV_device_generated_commands with RADV_DEBUG=noibs |
| - radv: stop emitting TILE_SURFACE_ENABLE for the ZRANGE_PRECISION workaround |
| - radv: inline more values in radv_emit_fb_ds_state() |
| - radv: emit PA_SC_SCREEN_SCISSOR_BR with the actual fb extent |
| - zink/ci: update list of expected failures for NAVI10 |
| - zink: fix setting VkShaderCreateInfoEXT::nextStage |
| - radv/rt: fix capture/replay support |
| - vulkan: ignore VkPipelineColorWriteCreateInfoEXT if the state is dynamic |
| - Revert "radv/amdgpu: workaround a kernel bug when replacing sparse mappings" |
| - Revert "radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build" |
| - radv/amdgpu: fix executing secondaries without IB2 |
| - radv/amdgpu: do not copy the original chain link for IBs |
| - radv: fix emitting SQTT userdata when CAM is needed |
| - radv: fix capturing RGP on RDNA3 with more than one Shader Engine |
| - radv: set THREAD_TRACE_MARKER_ENABLE for mesh/task draws |
| |
| Sarah Walker (15): |
| |
| - pvr: Support single core transfer queue commands on multicore GPUs |
| - pvr: Implement pvr_pbe_setup_modify_defaults() |
| - pvr: Complete pvr_modify_command() |
| - pvr: Complete pvr_unwind_rects() |
| - pvr: Complete pvr_double_stride() |
| - pvr: Implement pvr_isp_scan_direction() |
| - pvr: Implement pvr_reroute_to_clip() |
| - pvr: Support ipf_creq_pf in pvr_isp_ctrl_stream() |
| - pvr: Complete pvr_3d_validate_addr() |
| - pvr: Support multiple sources per pass in TQ job submission |
| - pvr: Complete pvr_generate_custom_mapping() |
| - pvr: Fragment register fb_cdc_zls is feature dependent |
| - pvr: use pvr_csb_pack() to setup CR_FB_CDC_ZLS |
| - pvr: Rename heap reserved area to static data carveout |
| - pvr: Merge main and extension command streams |
| |
| Sathishkumar S (5): |
| |
| - util/format: add planar3 r8_g8_b8_unorm pipe format |
| - frontends/va: add support for RGBP rt_format |
| - radeonsi/vcn: enable RGBP format on gfx940 jpeg |
| - radeonsi/vcn: engage all jpeg engines on gfx940 for mjpeg decode |
| - frontends/va: return matching drm format for yuyv pipe format |
| |
| Semjon Kravtsenko (1): |
| |
| - glx: Assign unique serial number to GLXBadFBConfig error |
| |
| Sergi Blanch Torne (8): |
| |
| - ci: Move Vulkan CTS patches to their own directory |
| - ci: disable Collabora's LAVA lab for maintance |
| - Revert "ci: disable Collabora's LAVA lab for maintance" |
| - ci: Allow zink-radv jobs to be manual when uprev piglit |
| - ci: disable Collabora's LAVA lab for maintance |
| - Revert "ci: disable Collabora's LAVA lab for maintance" |
| - ci: disable Collabora's LAVA lab for maintance |
| - Revert "ci: disable Collabora's LAVA lab for maintance" |
| |
| Shan-Min Chao (1): |
| |
| - tu/kgsl: Fix memory overwrite with vkFlushMappedMemoryRanges when more than 1 range |
| |
| Sil Vilerino (30): |
| |
| - d3d12: Do not fail d3d12_screen creation if D3D12_FEATURE_D3D12_OPTIONS14 not available |
| - frontend/va: Support QVBR rate control mode |
| - frontend/va: Allow distinction for HRD params sent from app and frontend defaults |
| - frontend/va: Allow distinction for Min/MaxQP params sent from app and frontend defaults |
| - d3d12: Support QVBR rate control mode |
| - d3d12: Support rate control HRD and MaxFrameSize app params |
| - d3d12: Support QPMin/QPMax app params |
| - d3d12: Support PIPE_VIDEO_CAP_MIN_WIDTH/HEIGHT caps |
| - d3d12: Support PIPE_VIDEO_CAP_ENC_QUALITY_LEVEL |
| - frontend/va: Add VAProfileH264High10 |
| - frontend/va: Add H264 decode slice data |
| - d3d12: Use frontend H264 decode slice offsets and sizes instead of parsing buffer |
| - d3d12: Clean unused code for parsing slices |
| - frontends/va: Extend AV1 Encode params |
| - d3d12: AV1 Encode |
| - CI/windows: Update headers and Agility redist to 1.711.3-preview |
| - d3d12: Correct tx_mode_support reporting as specified in libva spec |
| - d3d12: Only set reduced_tx_set when supported by D3D12 caps (no libva caps for reduced_tx_set to map to) |
| - d3d12: Fix usage of D3D12_VIDEO_ENCODER_RATE_CONTROL_FLAG, was using D3D12_VIDEO_ENCODER_SUPPORT_FLAG wrongly instead |
| - frontend/va: Pass surf->fence in PIPE_VIDEO_ENTRYPOINT_ENCODE contexts for driver to wait on input surface pending work |
| - frontend/va: Add video processing async fence support |
| - d3d12: Video Decode - Implement get_decoder_fence and async queing |
| - d3d12: Apply style format to d3d12_video_dec.cpp |
| - d3d12: Video Decode - Sync 3D context copy with decode work for texture array case |
| - d3d12: Video Encode - GPU wait on input surface fence |
| - d3d12: Video Process - Implement get_processor_fence and async queing |
| - d3d12: Video Decode - Refactor and style fixes |
| - frontend/va: Fix vaSyncSurface and vaQuerySurface status for drivers not implementing get_processor_fence |
| - frontend/va: Remove fence_server_sync for surface in vlVaHandleVAProcPipelineParameterBufferType |
| - aux/tc: Add ASSERTED to unreferenced release build variable |
| |
| Simon Perretta (5): |
| |
| - pvr: Amend validation when checking multiple supported types |
| - pvr: Use movc for reading special registers |
| - pvr: Add support for generating transfer fragment programs |
| - pvr: Add support for generating transfer EOT programs |
| - pvr: Use driver vertex input data in the compiler |
| |
| Simon Ser (4): |
| |
| - wayland: generalize wayland-protocols code generation |
| - radv: advertise LINEAR filter support for multiplanar/subsampled |
| - vulkan/wsi/wayland: add 16-bit formats |
| - Update OpenGL headers |
| |
| SoroushIMG (8): |
| |
| - zink: do not emit line stipple dynamic state when emulating |
| - zink: take location_frac into account in lower_line_smooth_gs |
| - zink: fix incorrect line mode check for bresenham |
| - zink: refcount the correct query pool |
| - pvr: fix sync waiting while using pvrsrvkm |
| - pvr: fix infinite recursion in pvr_cmd_buffer_{start,end}_sub_cmd |
| - pvr: add missing frag to geom dependency for jobs targetting same render target |
| - pvr: Fix barrier insertion on merged subpasses |
| |
| SureshGuttula (2): |
| |
| - va/surface : Add Nv12 support for PRIME_2 imports |
| - radeonsi/vcn: update luma and chroma size |
| |
| Sviatoslav Peleshko (7): |
| |
| - isl: Check all channels in isl_formats_have_same_bits_per_channel |
| - anv: Handle UNDEFINED format in image format list |
| - anv: Improve image/view usage bits verification |
| - nir/lower_shader_calls: Fix cursor if broken after nir_cf_extract() call |
| - glsl: Fix yylloc.source propagation in YYLLOC_DEFAULT |
| - dri: Use RGB internal formats for RGBX formats |
| - intel/fs: Check if the whole ubo load range is in the push const range |
| |
| Sylvain Munaut (1): |
| |
| - egl/dri2: Add a couple of missing mutex release in error path |
| |
| Tapani Pälli (33): |
| |
| - isl: disable mcs (and mcs+ccs) for color msaa on gfxver 125 |
| - iris: implement state cache invalidate for Wa_16013063087 |
| - anv: cleanup bitmask construction for PIPELINE_SELECT |
| - anv: implement state cache invalidate for Wa_16013063087 |
| - isl: fix layout for comparing surf and view properties |
| - egl/loader: move crtc resource infrastructure as common helper |
| - anv: handle missing astc for gfx125 in CreateImageView |
| - mesa: set a type for depth fallback texture |
| - intel/dev: provide helper to check if devinfo is ATS-M |
| - anv: add required invalidate/flush for Wa_14014427904 |
| - iris: add required invalidate/flush for Wa_14014427904 |
| - mesa: validate shader binary format in _mesa_spirv_shader_binary |
| - iris: make Wa_16013994831 to use intel_needs_workaround |
| - anv: make Wa_16013994831 to use intel_needs_workaround |
| - anv: remove BDW specific WA for CS stall enable |
| - intel/dev: add parentheses around intel_needs_workaround macro |
| - iris: use workaround framework for 1408224581, 14014097488 |
| - anv: use workaround framework for 1408224581, 14014097488 |
| - anv: wrap pipe control emission to a set of helper functions |
| - anv: implement flush part of emit_apply_pipe_flushes with helper |
| - anv: implement invalidate part of emit_apply_pipe_flushes with helper |
| - anv: convert genX_query pipe controls to use pc helper |
| - anv: change pipe controls in genX_state to use pc helper |
| - anv: change pipe control in genX_pipeline to use pc helper |
| - anv: change pipe controls in genX_gpu_memcpy to use pc helper |
| - anv: change pipe control in indirect draw gen to use pc helper |
| - anv: change most pipe controls in gfx8_cmd_buffer to use pc helper |
| - anv: convert most pc in genX_cmd_buffer to use pc helper |
| - isl: handle DRM_FORMAT_MOD_INVALID in isl_drm_modifier_has_aux |
| - intel/compiler: add more validation for acc register usage |
| - mesa: fix some TexParameter and SamplerParameter cases |
| - iris: avoid issues with undefined clip distance |
| - crocus: avoid issues with undefined clip distance |
| |
| Tatsuyuki Ishi (18): |
| |
| - util: Add dedicated hex conversion functions and use it. |
| - util: Call mesa_bytes_to_hex directly instead of disk_cache_format_hex_id. |
| - util: Add a copy of BLAKE3 hash library. |
| - util/blake3: Patch with hidden visibility for asm symbols. |
| - util: Add mesa_blake3 wrappers. |
| - nir: Fix serializing pointer initializers. |
| - radv: Make shader related destruction happen before hw_ctx. |
| - radv: Add RGP barrier markers for render pass transition and copy. |
| - radv: Guard against misplaced RGP barrier markers. |
| - util/blake3: Add blake3_hash typedef. |
| - vulkan: Migrate shader module hash to BLAKE3. |
| - vulkan/pipeline_cache: Do not consume object passed into remove_object. |
| - vulkan/pipeline_cache: Move locking outside of remove_object. |
| - vulkan/pipeline_cache: Move cache_object_unref out of header. |
| - vulkan/pipeline_cache: Introduce weak reference mode. |
| - radv: Enable weak reference cache for device->mem_cache. |
| - zink/ci: Add ext_transform_feedback@api-errors to fail list. |
| - radv/amdgpu: Do not pass in a BO handle when clearing PRT VA region. |
| |
| Teng, Jin Chung (1): |
| |
| - d3d12: HEVC Encode - Fix num_subregions_per_scanline rounding |
| |
| Thomas H.P. Andersen (30): |
| |
| - nir/nir_lower_wpos_center: Use the nir_shader_instructions_pass() helper |
| - nir/nir_lower_wpos_ytransform: Use the nir_shader_instructions_pass() helper |
| - nir/nir_lower_viewport_transform: Use the nir_shader_instructions_pass() helper |
| - nir/nir_lower_var_copies: Use the nir_shader_instructions_pass() helper |
| - nir/nir_lower_uniforms_to_ubo: Use the nir_shader_instructions_pass() helper |
| - nir/nir_lower_two_sided_color: Use the nir_shader_instructions_pass() helper |
| - nir/nir_lower_to_source_mods: Use the nir_shader_instructions_pass() helper |
| - nir/nir_lower_vec3_to_vec4: Use the nir_shader_instructions_pass() helper |
| - r600: remove unused code |
| - tgsi: delete unused functions |
| - aux: remove unused tgsi includes |
| - d3d12: remove unused tgsi includes |
| - etnaviv: remove unused tgsi includes |
| - freedreno: remove unused tgsi includes |
| - i915: remove unused tgsi includes |
| - llvmpipe: remove unused tgsi includes |
| - nouveau: remove unused tgsi includes |
| - r300: remove unused tgsi includes |
| - r600: remove unused tgsi includes |
| - radeonsi: remove unused tgsi includes |
| - softpipe: remove unused tgsi includes |
| - svga: remove unused tgsi includes |
| - v3d: remove unused tgsi includes |
| - vc4: remove unused tgsi includes |
| - virgl: remove unused tgsi includes |
| - zink: remove unused tgsi includes |
| - lavapipe: remove unused tgsi includes |
| - st: remove unused tgsi includes |
| - r600: tgsi cleanup |
| - tgsi: remove unused functions and structs |
| |
| Thong Thai (11): |
| |
| - gallium/pipe: add min width and min height video cap enums |
| - radeonsi: return min width and min height video cap values |
| - frontends/va: report min width and min height values if available |
| - mesa/main: rework locale setup/teardown |
| - util: check and initialize locale before using it |
| - tgsi: use locale independent float and double parsing |
| - frontends/va/config: add disable packed headers as valid config |
| - frontends/va/context: check min supported resolution when creating |
| - frontends/va/config: check for QVBR support when creating |
| - frontends/va/context: return error if context_id == 0 |
| - frontends/va: fix some coverity scan reported issues |
| |
| Tim Pambor (1): |
| |
| - virgl: Fix stack overflow in virgl_bind_sampler_states |
| |
| Timothy Arceri (32): |
| |
| - util: add Pixel Game Maker MV workaround |
| - util: add Jamestown+ workaround |
| - st/glsl: move linking code to the same st file |
| - glsl: call nir_opt_find_array_copies() when linking |
| - glsl: port lower_blend_equation_advanced() to nir |
| - glsl: call nir version of lower_blend_equation_advanced() |
| - glsl: remove old lower_blend_equation_advanced() code |
| - glsl: add some more c wrappers for string_to_uint_map |
| - mesa: add some new constants |
| - glsl: move some compiler code out of st |
| - glsl: move lowering linker code out of st |
| - glsl: port assign location code for VS inputs or FS outputs |
| - glsl: call assign_attribute_or_color_locations() in NIR linker |
| - glsl: remove unused buffer objects with packed layout |
| - glsl: remove unused system vars |
| - glsl: drop the dce of global vars from GLSL IR linker |
| - nir/glsl: add nir_var_declared_implicitly enum |
| - glsl: move disable_varying_optimizations_for_sso() to NIR linker |
| - glsl: remove the always_active_io flag from GLSL IR |
| - glsl: inline link_varyings() |
| - glsl: set last_vert_prog in the nir linker |
| - glsl: drop link_invalidate_variable_locations() |
| - glsl: move store_fragdepth_layout() to nir linker |
| - glsl: remove glsl ir optimisation loop from linker |
| - st/glsl: merge link_shader() into st_link_nir() |
| - st/glsl: merge st_link_glsl_to_nir() into st_link_nir() |
| - st/glsl: merge st_glsl_to_ir.cpp with st_glsl_to_nir.cpp |
| - glsl: remove dead varyings before assigning attr locations |
| - glsl: do vs attribute validation in NIR linker |
| - glsl: fix validation of ES vertex attribs |
| - glsl: fix spirv sso validation |
| - util: add radeonsi workaround for Nowhere Patrol |
| |
| Timur Kristóf (61): |
| |
| - radv/amdgpu: Remove unnecessary assertions from chaining. |
| - radv: Disallow IB2 on GFX6 when using draw_indirect_multi. |
| - radv: Use IB BOs (chaining) by default on GFX6. |
| - radv: Chain command buffers on GFX6 in radv_queue. |
| - amd: Rename INDIRECT_BUFFER_CIK to just INDIRECT_BUFFER. |
| - radv: Simplify IB2 workaround. |
| - radv: Remove IB2 workaround from mesh shader draws. |
| - radv: Enable IB2 workaround on all indirect draws. |
| - radv: Fix dword alignment in SDMA buffer copy. |
| - aco: Disallow constant propagation on SOPP and fixed operands. |
| - amd: Add and implement sendmsg_amd intrinsic. |
| - amd: Add and implement gs_wave_id sysval. |
| - amd: Move sendmsg defines to ac_shader_util. |
| - ac/llvm: Clarify arguments of ac_build_sendmsg. |
| - ac/nir: Use sendmsg in legacy GS lowering. |
| - ac/nir: Emit legacy GS DONE signal in NIR. |
| - ac/nir/ngg: Use sendmsg in NGG lowering. |
| - amd: Cleanup old GS intrinsics code. |
| - aco: Don't allow any VALU instruction to write m0. |
| - aco: Initialize vcmpx field in get_cmp_info. |
| - radv/amdgpu: Remove unused extra BO array. |
| - radv/amdgpu: Split radv_amdgpu_get_bo_list to smaller functions. |
| - radv/amdgpu: Pass preambles to get_bo_list. |
| - radv/amdgpu: Use STACK_ARRAY for IB array to reduce stack usage. |
| - radv: Move perf counter CS creation to where it's used. |
| - ac: Use const keyword for some function arguments. |
| - radv: Use const keyword more. |
| - radv: Emit primitive reset index with primitive restart enable. |
| - radv: Compute tess info when emitting patch control points. |
| - radv: Move ignore forced VRS code to more optimal place. |
| - radv: Set last_index_type in radv_before_draw. |
| - radv: Slight refactor to late_scissor_emission. |
| - radv: Move indirect check from index buffer emission to caller. |
| - radv: Move empty dynamic states check to caller. |
| - radv: Clear query dirty flags when flushing them. |
| - radv: Clarify gang submit terminology. |
| - radv: Use RESET_FILTER_CAM for some mesh shading draws. |
| - aco: Mark exec write used when it writes other registers. |
| - radv: Remove primitive reset index from late scissor workaround. |
| - radv: Leave primitive reset index at max on GFX8+. |
| - ac: Add ac_hw_stage enum. |
| - aco: Use ac_hw_stage instead of aco-specific HWStage. |
| - aco: Add hw_stage field to aco_shader_info. |
| - radeonsi: Set aco_shader_info::hw_stage |
| - radv: Set aco_shader_info::hw_stage |
| - aco: Use aco_shader_info::hw_stage instead of guessing. |
| - aco: Remove unneeded stage related info fields. |
| - ac/nir/ngg: Call nir_convert_to_lcssa before divergence analysis. |
| - ac/nir/ngg: Add upper limit to reusable uniforms. |
| - ac/nir/ngg: Follow intrinsic sources when analyzing before culling. |
| - ac/nir/ngg: Follow tex sources when analyzing before culling. |
| - radv: Refactor required subgroup size in pipeline key. |
| - radv: Use required subgroup info for graphics shaders. |
| - radv: Enable required subgroup size on mesh/task. |
| - aco: Add MESA_SHADER_KERNEL to instruction selection setup. |
| - aco: Fix subgroup_id intrinsic on GFX10.3+. |
| - ac/nir: Add done arg to ac_nir_export_position. |
| - ac/nir: Slightly refactor how pos0 exports are added when missing. |
| - ac/nir/ngg: Wait for attribute stores before VS/TES/GS pos0 export. |
| - ac/nir/ngg: Refactor mesh shader primitive export. |
| - ac/nir/ngg: Wait for attribute ring stores in mesh shaders. |
| |
| Tony Wasserka (2): |
| |
| - aco/spill: Use arena allocator for next use distances |
| - aco/spill: Use arena allocator for spills |
| |
| Veerabadhran Gopalakrishnan (2): |
| |
| - radeonsi: return kernel queried video capability for HEVC and JPEG |
| - radeonsi: return kernel queried video capability for HEVC and JPEG |
| |
| Viktoriia Palianytsia (1): |
| |
| - iris,crocus: Add proper way of assigning num_levels value |
| |
| Vinson Lee (10): |
| |
| - r600/sfn: Initialize BlockScheduler member m_chip_family. |
| - freedreno/a6xx: Fix memory leak on error path. |
| - nv50: Fix memory leak in error path |
| - pvr: Fix signed comparison |
| - dzn: Fix qpool->queries_lock double lock |
| - tu: Fix missing unlock |
| - vulkan/wsi: Remove duplicate NULL check |
| - frontends/va: Fix missing unlock |
| - r600/sfn: Remove duplicate assignment |
| - vk/wsi/x11: Remove dead code |
| |
| Vitaliy Triang3l Kuzmin (27): |
| |
| - lavapipe: Fix vk_instance_init vk_error instance use-after-free |
| - radv: Fix vk_instance_init vk_error instance use-after-free |
| - radv: Move most of DB_SHADER_CONTROL to PS, more precise GFX11 blend WA |
| - docs/amd: Document Primitive Ordered Pixel Shading |
| - ac/nir: Support Primitive Ordered Pixel Shading in lower_ps |
| - aco: Support pops_exiting_wave_id PhysReg usage |
| - ac: Define POPS collision wave ID argument SGPR |
| - aco: Add s_wait_event argument bit definitions |
| - aco: Add Primitive Ordered Pixel Shading pseudo-instructions |
| - aco: Skip waitcnt insertion in the discard early exit block |
| - aco: Add Primitive Ordered Pixel Shading scheduling rules |
| - aco: Send MSG_ORDERED_PS_DONE where necessary |
| - aco: Add Primitive Ordered Pixel Shading waitcnt rules |
| - aco: Implement fragment shader interlock intrinsics |
| - radeonsi: Remove unconditional POPS_DRAIN_PS_ON_OVERLAP setting |
| - radv: Remove unconditional POPS_DRAIN_PS_ON_OVERLAP setting |
| - radv: Detect the use of Primitive Ordered Pixel Shading |
| - radv: Ensure 1x1 shading rate on GFX10.3 with interlock execution mode |
| - radv: Declare POPS collision wave ID shader argument |
| - radv: Enable POPS collision wave ID shader argument |
| - radv: Enable the null export workaround with POPS |
| - radv: Handle Primitive Ordered Pixel Shading in DB_SHADER_CONTROL |
| - ac/gpu_info: Check whether the device has the POPS missed overlap bug |
| - radv: Apply the POPS missed overlap hardware bug workaround |
| - radv: Disable VRS forcing with Primitive Ordered Pixel Shading |
| - zink/ci: Add broken fragment shader interlock test to RADV flakes |
| - radv: Enable VK_EXT_fragment_shader_interlock |
| |
| Víctor Manuel Jáquez Leal (1): |
| |
| - vulkan: complete the usage flags for video layouts |
| |
| Weibin Wu (1): |
| |
| - winsys/gdi: GDI B5G6R5 display target support |
| |
| Xaver Hugl (1): |
| |
| - vulkan wsi: add support for PresentOptionAsyncMayTear |
| |
| Xi Ruoyao (1): |
| |
| - Revert "glx: Remove pointless GLX_INTEL_swap_event paranoia" |
| |
| Yiwei Zhang (46): |
| |
| - radv: respect VK_QUERY_RESULT_WAIT_BIT in GetQueryPoolResults |
| - venus: stop query experimental features |
| - venus: adopt venus protocol release |
| - meson/ci: promote virtio-experimental to virtio |
| - docs: update Virtio-GPU Venus driver page |
| - ci: carry venus-protocol 1.0 release patches in virglrenderer |
| - ci: uprev virglrenderer to drop venus release patches |
| - anv: apply ANV_BO_ALLOC_IMPLICIT_SYNC for external memory |
| - pipe-loader: avoid undefined memcpy behavior |
| - lvp: avoid accessing member of NULL ptr for global entries |
| - venus: bump ring space to 128K |
| - docs/venus: update vtest instructions |
| - radv: fix radv_emit_userdata_vertex for vertex offset -1 |
| - venus: silence -Wuninitialized |
| - venus: sync to latest protocol from header v1.3.248 |
| - venus: sync protocol for VK_EXT_image_2d_view_of_3d |
| - venus: enable VK_EXT_image_2d_view_of_3d |
| - docs/venus: advertise VK_EXT_image_2d_view_of_3d |
| - venus: temporarily disable VK_EXT_memory_budget |
| - venus: refactor vn_device_memory to track VkMemoryType |
| - venus: handle device memory report requests |
| - venus: emit device memory report for device memory events |
| - venus: enable VK_EXT_device_memory_report |
| - docs: update venus VK_EXT_device_memory_report support |
| - anv: avoid requiring ordered memory planes for explicit import |
| - venus: suballocate feedback slot with feedback buffer alignment |
| - venus: refactor ahb buffer mem type bits cache to be lazy |
| - venus: refactor buffer cache related bits |
| - venus: extend VkBuffer cache to cover concurrent sharing |
| - venus: fix a cmd tmp storage leak |
| - venus: fix leaks from tracked present src images |
| - venus: track pool in cmd and track device in pool |
| - venus: cmd to reuse alloc copy from cmd pool |
| - venus: refactor vn_cmd_add_query_feedback and miscs |
| - venus: cache query batches at cmd pool |
| - venus: refactor query batch handling |
| - venus: recheck valid bit after acquiring lock to init ahb mem type bits |
| - venus: handle query feedback creation failure |
| - venus: ensure consistency of query overflow behavior |
| - venus: add a missing barrier before copying query feedback |
| - turnip: flush cache for dstBuffer in vkCmdCopyQueryPoolResults |
| - lvp: avoid reading immutable sampler from desc write info |
| - venus: fix a cmd builder render_pass state leak across reset |
| - venus: fix cmd state leak across implicit reset |
| - venus: fix a device memory report leak |
| - vulkan/android: add missing AHARDWAREBUFFER_USAGE_GPU_DATA_BUFFER usage |
| |
| Yogesh Mohan Marimuthu (2): |
| |
| - ac/gpu_info: num_cu = 4 and gfx11 enable dcc with retile |
| - ac/gpu_info: rearrange if checks for dcc config |
| |
| Yonggang Luo (121): |
| |
| - loader: Replace usage of mtx_t with simple_mtx_t in loader/loader_dri3_helper.c |
| - v3d: Replace usage of mtx_t with simple_mtx_t in v3d_simulator.c |
| - vc4: Replace usage of mtx_t with simple_mtx_t in vc4/vc4_simulator.c |
| - drm-shim: Replace usage of mtx_t with simple_mtx_t in drm_shim.c |
| - drm: Replace usage of mtx_t with simple_mtx_t in virgl/drm/virgl_drm_winsys.c |
| - drm: Replace usage of mtx_t with simple_mtx_t in drm/radeon_drm_winsys.c |
| - drm: Replace usage of mtx_t with simple_mtx_t in nouveau_drm_winsys.c |
| - hud: Replace usage of mtx_t with simple_mtx_t in hud_cpufreq.c |
| - hud: Replace usage of mtx_t with simple_mtx_t in hud_diskstat.c |
| - hud: Replace usage of mtx_t with simple_mtx_t in hud_nic.c |
| - hud: Replace usage of mtx_t with simple_mtx_t in hud_sensors_temp.c |
| - xlib: Replace usage of mtx_t with simple_mtx_t in xm_api.c |
| - rtasm: Trim trailing spaces and replace tab with 3 space |
| - rtasm: Replace usage of mtx_t with simple_mtx_t in rtasm_execmem.c |
| - nine: Replace usage of mtx_t with simple_mtx_t in nine_lock.c |
| - omx: Replace usage of mtx_t with simple_mtx_t in vid_omx_common.c |
| - vdpau: Replace usage of mtx_t with simple_mtx_t in htab.c |
| - c11: Remove _MTX_INITIALIZER_NP as it's not used anymore |
| - microsoft/compiler: Getting function impl to be consistence with decl in dxil_enums.* |
| - compiler: Getting shader_prim to be PACKED that consistence with pipe_prim_type |
| - compiler: Add SHADER_PRIM_COUNT to be SHADER_PRIM_MAX + 1 |
| - compiler: Rename shader_prim to mesa_prim and replace all usage of pipe_prim_type with mesa_prim |
| - docs: Update document about pipe_prim_type with mesa_prim |
| - util: Replace all usage of PIPE_TIMEOUT_INFINITE with OS_TIMEOUT_INFINITE |
| - r300: Replace usage of os_get_process_name with util_get_process_name in r300_chipset.c |
| - virgl: Array cmdline on stack should initialized to 0 |
| - virgl: Replace the usage of os_get_process_name with util_get_process_name |
| - compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h |
| - compiler: Move can_implicitly_convert_to helper to glsl module from glsl_types.h |
| - mesa, compiler: Move gl_texture_index to glsl_types.h |
| - compiler: Remove the need include "util/glheader.h" and "util/ralloc.h" in glsl_types.h |
| - compiler: Remove redundant struct glsl_type in nir_types.h |
| - vulkan: move nir_convert_ycbcr into vulkan runtime |
| - util: Remove redundant type cast in function align64 |
| - util: use uint32_t as the parameter of align function |
| - util: Do not use align as variable name |
| - compiler: use align instead glsl_align and remove glsl_align |
| - panfrost: Replace the usage of PIPE_BIND_* with PAN_BIND_* |
| - ac: Replace the usage of pipe_compare_func with compare_func |
| - dri: Replace usage of boolean/TRUE/FALSE with bool/true/false |
| - freedreno: Fixes error: passing argument 1 of ‘pthread_mutex_unlock’ from incompatible pointer type in tu_pipeline.c |
| - wsi: Fixes passing argument 1 of ‘mtx_unlock’ from incompatible pointer type |
| - c11: Improve timespec_get to support TIME_MONOTONIC TIME_ACTIVE TIME_THREAD_ACTIVE TIME_MONOTONIC_RAW |
| - c11: Improve mtx_timedlock to use timespec_get instead of time(NULL) |
| - c11: Implement os_time_get_nano with timespec_get(&ts, TIME_MONOTONIC) |
| - zink: Replace the usage of os_get_process_name with util_get_process_name |
| - dd: Replace the usage of os_get_process_name with util_get_process_name in dd_draw.c |
| - gallium: Remove unused os_process.h in gallium/auxiliary |
| - util: Fixes prototype of threads_timespec_compare |
| - mapi: Fixes check_table.cpp for DrawArraysInstancedARB and DrawElementsInstancedARB |
| - meson: Use consistence disabled/enabled comment for shared-glapi option |
| - mapi: Fixes non-constant-expression cannot be narrowed from type 'unsigned long' to 'unsigned int' in initializer list with clang |
| - meson: Guard the glsl tests that only working when OpenGL ES2 is enabled |
| - draw: Replace usage of boolean/TRUE/FALSE with bool/true/false in draw_pt_vsplit* |
| - draw: Replace usage of ubyte/ushort/uint with uint8_t/uint16_t/uint32_t in draw_pt_vsplit.c |
| - draw: Update the comment and function name to match the type |
| - vtn: Do not assign main_entry_point->impl twice |
| - nir: Add function nir_function_set_impl |
| - hud: Use bool/true/false to replace boolean/TRUE/FALSE in hud/hud_context.c |
| - gallium/draw: Replace the usage of ushort to uint16_t in files that can not found by tools |
| - llvmpipe: altivec.h inclusion in -std=c++98..11 causes bool to be redefined |
| - treewide: replace usage of boolean to bool |
| - treewide: style fixes after replace usage of boolean to bool |
| - treewide: Replace the usage of TRUE/FALSE with true/false |
| - treewide: Replace the usage of ubyte/ushort with uint8_t/uint16_t |
| - treewide: style fixes after replace the usage of ubyte/ushort with uint8_t/uint16_t |
| - util: Merge p_compiler.h into src/util/compiler.h |
| - util: include "util/compiler.h" instead of "pipe/p_compiler.h" |
| - mapi: Fixes compile error with build option "-D shared-glapi=disabled" |
| - mapi: Now _glapi_get_dispatch_table_size always equal to sizeof(struct _glapi_table) / sizeof(void \*) |
| - mapi: Hide OpenGL functions to be exported when shared-glapi is disabled |
| - ci: Testing -D shared-glapi=disabled with debian-clang-release |
| - d3d12: Fixes unused-variable compile error |
| - compiler: set alignment=1 by default for handling empty struct/interface in glsl_types.cpp |
| - util: Add function util_is_power_of_two_nonzero64 in bitscan.h |
| - util: use uint32_t instead of unsigned in bitscan.h |
| - util: Getting align and align64 consistence with ALIGN |
| - util: Replace the usage of redundant u_align_u32 with align and remove u_align_u32 |
| - util: Do not use align64 over unsigned int in register_allocate.c |
| - util: sizeof bucket are always 32bit width, use align instead align64 |
| - mapi: Style fixes in glapi/glapi_getproc.c |
| - mapi: Merge get_static_proc_address into _glapi_get_proc_address |
| - mapi: Remove dead struct _glapi_function in glapi/glapi_getproc.c |
| - nir: Split macro nir_foreach_function_with_impl out of nir_foreach_function_impl |
| - clang-format: Add nir_foreach_function_with_impl into src/.clang-format |
| - treewide: Switch to use nir_foreach_function_with_impl when possible |
| - clang-format: Add nir_foreach_function_impl into src/.clang-format |
| - gallium/auxiliary: Switch to use nir_foreach_function_impl |
| - asahi: Use nir_foreach_function_impl instead nir_foreach_function in function agx_nir_lower_zs_emit |
| - d3d12: Switch to use nir_foreach_function_impl |
| - glsl: Switch to use nir_foreach_function_impl from nir_foreach_function |
| - glsl: Remove the extra scope in gl_nir_link_uniforms.c |
| - crocus: Switch to use nir_foreach_function_impl |
| - intel/compiler: Switch to use nir_foreach_function_impl |
| - broadcom: replace redefined ALIGN() macro with common util functions |
| - util: Remove redundant defined(_WIN32) in u_string.h |
| - util: Remove redundant #if !defined(XF86_LIBC_H) in u_string.h |
| - nir: Strip the const modifier on nir_function * in nir_foreach_function_with_impl |
| - panfrost: Convert to use nir_foreach_function_with_impl in function midgard_compile_shader_nir |
| - panfrost: Convert to use nir_foreach_function_impl when possible |
| - mesa: Convert to use nir_foreach_function_impl |
| - llvmpipe: Convert to use nir_foreach_function_impl |
| - sfn: Convert to use nir_foreach_function_impl |
| - sfn: indent fixes after switch to use nir_foreach_function_impl |
| - compiler/clc: Switch to use nir_foreach_function_impl in function nir_lower_libclc |
| - dxil: Use nir_remove_non_entrypoints |
| - nir: Update the comment to call nir_remove_non_entrypoints directly |
| - glsl: Use nir_remove_non_entrypoints to simplify the code |
| - radv: Use nir_remove_non_entrypoints in radv_shader.c |
| - nir: Add nir_foreach_function_safe and use it |
| - pvr: Use alignas instead of ALIGN_ATTR and remove ALIGN_ATTR |
| - vc4: Convert to use nir_foreach_function_impl when possible |
| - v3d: Switch to use nir_foreach_function_impl |
| - broadcom: Switch to use nir_foreach_function_impl |
| - radeonsi: Use ALIGN_POT instead ALIGN_TO |
| - etnaviv: Convert to use nir_foreach_function_impl |
| - intel/vulkan: Convert to use nir_foreach_function_impl when possible |
| - iris: Convert to use nir_foreach_function_impl |
| - treewide: Remove all usage of nir_builder_init with nir_builder_create and nir_builder_at |
| - treewide: remove unused nir_builder |
| - nir: Remove nir_builder_init, it's not used anymore |
| |
| Zhang Ning (2): |
| |
| - lima: use u_pipe_screen_lookup_or_create in the renderonly path too |
| - Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR" |
| |
| Zhang, Jianxun (3): |
| |
| - intel/isl: Fix map between sRGB and linear formats |
| - anv: Support 1MB AUX mapping (MTL) |
| - anv: Remove alignment to aux ratio on size of main surface |
| |
| antonino (29): |
| |
| - zink: don't emulate edgeflags for patches |
| - zink: use correct primitives for passthrough gs with tess |
| - zink: add \`single_sample` to fs key |
| - zink: add to multisample field to \`zink_gfx_pipeline_state` |
| - zink: don't render with multisampling when it is disabled |
| - zink/ci: remove xt_framebuffer_multisample-interpolation fail |
| - zink: fix pv mode lowring index calculation |
| - zink: use ring buffer to preserve last element |
| - zink: fix exit condition on pv emulation loop |
| - zink: fix line strip offsets in pv mode emulation |
| - nir/zink: use sysvals in \`nir_create_passthrough_gs` |
| - zink: fix store subsitution in \`lower_pv_mode_gs_store` |
| - zink: set when pipeline dirty flag when multisample changes |
| - Revert "zink: set when pipeline dirty flag when multisample changes" |
| - Revert "zink/ci: remove xt_framebuffer_multisample-interpolation fail" |
| - Revert "zink: don't render with multisampling when it is disabled" |
| - Revert "zink: add to multisample field to \`zink_gfx_pipeline_state`" |
| - Revert "zink: add \`single_sample` to fs key" |
| - zink: take location_frac into account in pv emulation |
| - nir: use \`nir_variable_clone` in \`nir_create_passthrough_gs` |
| - nir: don't create invalid inputs in \`nir_create_passthrough_gs` |
| - zink: don't replace non generated gs |
| - nir: handle interface blocks in \`copy_vars` |
| - zink: handle interface blocks in \`copy_vars` |
| - nir: make var arrays large enough in \`nir_create_passthrough_gs` |
| - zink: don't create invalid inputs in \`zink_create_quads_emulation_gs` |
| - vulkan/wsi: add \`vk_wsi_force_swapchain_to_current_extent` driconf |
| - drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "The Talos Principle" |
| - drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "Serious Sam Fusion" |
| |
| i509VCB (1): |
| |
| - docs/asahi: Add hardware glossary |
| |
| lorn10 (1): |
| |
| - docs: Update Clover's env variable documentation |
| |
| nihui (1): |
| |
| - panvk: port panvk_logi to vk_logi |
| |
| norablackcat (24): |
| |
| - rusticl: implement cl_khr_pci_bus_info |
| - docs/rusticl: add Contributing section |
| - rusticl/types add ::new for cl_dev_idp_accel_props |
| - rusticl/api: add integer_dot_product api |
| - rusticl/clc add integer_dot_prod feature macros |
| - rusticl/kernel: remove nir_lower_pack pass |
| - rusticl/device: add cl_khr_integer_dot_product ext |
| - rusticl/program: fix clippy cast to the same type |
| - rusticl/types: fix clippy new() not returning Self |
| - rusticl/screen: implement uuid wrapper funcs |
| - rusticl/device: implement cl_khr_device_uuid |
| - rusticl/screen: fix driver_uuid on non x86 |
| - rusticl: add cl_khr_create_command_queue |
| - docs/features update opencl extensions add rusticl |
| - docs: rusticl envvars list supported drivers |
| - rusticl/memory: fix clippy errors |
| - gallium: add PIPE_CAP_TIMER_RESOLUTION |
| - llvmpipe/screen: add PIPE_CAP_TIMER_RESOLUTION |
| - sofpipe/screen: add PIPE_CAP_TIMER_RESOLUTION |
| - crocus/screen: add PIPE_CAP_TIMER_RESOLUTION |
| - iris/screen: add PIPE_CAP_TIMER_RESOLUTION |
| - r600/pipe: add PIPE_CAP_TIMER_RESOLUTION |
| - radeonsi/get: add PIPE_CAP_TIMER_RESOLUTION |
| - zink/screen: add PIPE_CAP_TIMER_RESOLUTION |
| |
| timmac-qmc (1): |
| |
| - glsl: fix potential crash with DisableUniformArrayResize |
| |
| xurui (6): |
| |
| - zink: Some return values of malloc should be checked |
| - zink: Use malloc instead of ralloc |
| - zink: Use malloc to allocate libs |
| - zink: Add some printfs when initialization fails |
| - zink: Free the cdt when an error occurs |
| - zink: The result should be assigned a value when returned |