| Mesa 22.2.0 Release Notes / 2022-09-21 |
| ====================================== |
| |
| Mesa 22.2.0 is a new development release. People who are concerned |
| with stability and reliability should stick with a previous release or |
| wait for Mesa 22.2.1. |
| |
| Mesa 22.2.0 implements the OpenGL 4.6 API, but the version reported by |
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / |
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. |
| Some drivers don't support all the features required in OpenGL 4.6. OpenGL |
| 4.6 is **only** available if requested at context creation. |
| Compatibility contexts may report a lower version depending on each driver. |
| |
| Mesa 22.2.0 implements the Vulkan 1.3 API, but the version reported by |
| the apiVersion property of the VkPhysicalDeviceProperties struct |
| depends on the particular driver being used. |
| |
| SHA256 checksum |
| --------------- |
| |
| :: |
| |
| b1f9c8fd08f2cae3adf83355bef4d2398e8025f44947332880f2d0066bdafa8c mesa-22.2.0.tar.xz |
| |
| |
| |
| New features |
| ------------ |
| |
| - WGL_ARB_create_context_robustness |
| |
| - d3d12 ARB_robust_buffer_access_behavior |
| |
| - VK_EXT_robustness2 for lavapipe |
| |
| - VK_EXT_image_2d_view_of_3d on RADV |
| |
| - zink and d3d12 GL_EXT_memory_object_win32 and GL_EXT_semaphore_win32 support |
| |
| - vertexAttributeInstanceRateZeroDivisor support for lavapipe |
| |
| - panfrost Valhall support (conformant OpenGL ES 3.1 on Mali-G57) |
| |
| - VK_EXT_primitives_generated_query on RADV |
| |
| - VK_EXT_non_seamless_cube_map on RADV, ANV, lavapipe |
| |
| - VK_EXT_border_color_swizzle on lavapipe, ANV, turnip, RADV |
| |
| - VK_EXT_shader_module_identifier on RADV |
| |
| - variablePointers on lavapipe |
| |
| - VK_EXT_multisampled_render_to_single_sampled on lavapipe |
| |
| - VK_EXT_shader_subgroup_vote/ballot on lavapipe |
| |
| - EGL_KHR_context_flush_control for all drivers |
| |
| - GLX_ARB_context_flush_control for all drivers |
| |
| - VK_EXT_attachment_feedback_loop_layout on RADV |
| |
| |
| Bug fixes |
| --------- |
| |
| - turnip/a660: dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.subgroup_ballot failure due to "movmsk" reordered with "kill" |
| - [radeonsi] Graphical corruption in Valve Source-based games |
| - Updates to 22.2 RCs cause artifacts on nouveau and blank screen on VirtIO |
| - nouveau: corruption on NVAC after switching to the NIR backend |
| - zink: GL_TEXTURE_LOD_BIAS vs mipLodBias, and GTF-GL46.gtf21.GL3Tests.texture_lod_bias.texture_lod_bias_all failure on Turnip |
| - Gallium OpenGL megadriver fails to build if LLVM is enabled but not required by any component built |
| - Abort due to "Unknown intrinsic: intrinsic store_deref" (firefox, radeonsi) |
| - turnip: dEQP-VK.subgroups.vote.graphics.subgroupallequal_*int regression on a635 |
| - regression from !17949 |
| - r600/sfn: xonotic shows a big performance regression when using the NIR backend with sb disabled |
| - r600/sfn: register allocation fails for a number of piglits and deqp tests |
| - DG2 corruption in gnome desktop from F36 |
| - radeonsi: error allocating sparse buffers.. bug? |
| - [BISECTED/radeonsi] Flickering triangles with wayfire cube |
| - ../src/util/u_queue.c:218: util_queue_fence_destroy: Assertion \`fence->signalled' failed for iris when disabled UTIL_FUTEX_SUPPORTED |
| - turnip: gfxbench aztec ruins crashes on zink |
| - glsl: textureGatherOffset param broken |
| - v3dv: android: VTS big regression after 316728a55bc8fe398be1ac2796a22f8c91fb886c |
| - radeonsi: driver crashes with kwin-effects-cube |
| - [Turnip] Final Fantasy X: Incorrect hair rendering |
| - turnip: gfxbench car chase crash on zink |
| - [r300] - osu! - Shader optimizations cause black and glitchy rendering |
| - [Bisected] Microsoft CLC: build failure with MSVC and Dshared-llvm=disabled with AMDGPU target available |
| - anv: invalid returns from vkCreateDescriptorPool |
| - [Bisected][RadeonSI] Blender crashes on startup |
| - turnip: depth/stencil regressions |
| - llvmpipe: assert in KHR-Single-GL45.enhanced_layouts.xfb_all_stages |
| - Poor performance/utilization with Splitgate |
| - freedreno / mesa 22.1.3 crashes Firefox Nightly |
| - [Bisected][RADV] Rendering issues and performance drop with GFX10 on RPCS3 |
| - dzn: Build failure when it's the only driver built - 22.2-devel regression |
| - d3d12: Unresolved external symbols from Version.lib |
| - turnip: Geometry shader never invoked if there is no EmitVertex and gl_Position is not written |
| - Trine 4 causing GPU HANG: ecode 12:1:849f7804, in vkr-ring-7 |
| - turnip: GPU fault on clear |
| - intel_nullhw.c:179:28: error: passing 'const void \*const' to parameter of type 'void \*' discards qualifiers [-Werror,-Wincompatible-pointer-types-discards-qualifiers] |
| - macOS undefined symbol _glapi_tls_Context build error |
| - Build on macOS regressed |
| - Regression: virgl: The switch to NTT breaks "Total War: Warhammer" |
| - macOS's LIST_ENTRY macro conflicts with ours |
| - src/util/disk_cache_os.c:234:21: error: too many arguments provided to function-like macro invocation |
| - [RadeonSI] Teamfortress 2 crashing frequently with mesa 22.1.1 |
| - Unity game segfaults often |
| - Alien Isolation segfaulting in radeonsi_dri.so starting with Mesa 22.1.0 |
| - turnip: dEQP-VK.subgroups.ballot_broadcast.framebuffer.* has failures on older deqp-vk |
| - !17062 broke XTS xts5/Xlib9/XDrawImageString(16) with Xephyr using glamor on llvmpipe |
| - DG2 full BAR broken by low BAR? |
| - crocus: SSBOs regressed |
| - Homerun Clash game triggers assert in compiler |
| - radv: line rasterization mode affects multisampling |
| - turnip: caselist fail |
| - vtn: mediump_16bit_alu and compatibility |
| - radv: dynamic vertex input failure |
| - navi21\\radv\\amdgpu Halo Infinite screen corruption and then hang |
| - ci/freedreno: a618_vk 1/7 timeouts |
| - Windows OpenGL32.dll without shared-glapi currently broken |
| - anv: KHR-GL46.tessellation_shader.single.xfb_captures_data_from_correct_stage fails on TGL |
| - perfetto: crash on context teardown / driver unload |
| - panfrost shader classification is wrong in some cases |
| - kopper: the latest awfulness exposed by piglit glx-multi-window-single-context |
| - android: r600 building error after commit 79ca456b |
| - Error compiling gallium-nine on i686 using musl libc |
| - [anv][regression][bisected][bsw,gen9atom] dEQP-VK.memory.mapping.dedicated_alloc failing on bsw and gen9atom |
| - anv: GTF-GL46.gtf32.GL3Tests.packed_pixels.packed_pixels_pbo failure |
| - r600: Tesseract does not render properly with the new NIR backend |
| - radv: some zink cts fails |
| - anv: ICL hiz issue |
| - [RADV, bisected] addrlib changes cause Navi CTS regressions |
| - r300: Segfault in shaders/closed/steam/kerbal-space-program/1017.shader_test |
| - Get rid of debug_assert() |
| - nine: docs? |
| - d3d12: Gallium d3d12 backend performance is slow |
| - [RADV][bisected] after commit 124848bf9efa1cc5d8a3cdfc654e0c03d57540e3 the games which use vulkan API not starting |
| - freedreno: skqp instability from GPU hangs |
| - ci/a618: intermittent network failure |
| - turnip: EXT_primitives_generated_query broken |
| - Godot crashes with RX5500m |
| - intel: Some Alan Wake shader fails validation due to bad register usage in EOT message |
| - r600/sfn: Add support for fmulz |
| - radv: failure in image.sample_texture.*_compressed_format* |
| - radeonsi: Hang on RX 5500 XT on latest mesa git |
| - blob_align assert triggers |
| - radv: Enabling support for instance extensions inside vkCreateInstance does not work |
| - zink: rendering artifacts in Portal 2 |
| - a630: AmnesiaTDD trace gpu faults |
| - freedreno build issue - FAILED: src/freedreno/decode/crashdec.txt |
| - segfaults under X11 since the Ubuntu update to 22 |
| - anv: renderdoc explosion on icl |
| - turnip: EXT_depth_clip_enable broken |
| - lava: test result parsing fail |
| - turnip: LRZ fast clear |
| - [ANV] GTA IV flickering |
| - [TGL] 25-100% perf impact on SynMark2 with 2GB dynamic pool size |
| - [Crocus] - Minecraft with Forge and JourneyMap - Graphical glitches |
| - intel - Minecraft char bar is broken after Mesa upgrade |
| - [wine] RAGE: texture problems |
| - "Unsupported modifier, resource creation failed." on stderr |
| - [RADV/ACO] Star Citizen Flickeriing Lights |
| - anv: query regressions |
| - ir3: Infinite loop in scheduler when splitting |
| - ci/lava: "Listened to connection" spam |
| - ci: docs build fail |
| - turnip: 3d copy/clear/... ops should not be counted by occlusion query |
| - anv: sampleLocationsEnable ignored in VkPipelineSampleLocationsStateCreateInfoEXT |
| - turnip: wrong border color for Z24S8 |
| - aco_tests fail with llvm-15 |
| - Clang 14 issues compiling src/util/u_cpu_detect.c |
| - eglExportDMABUFImageQueryMESA crashes on amdgpu gallium/drivers/radeonsi |
| - crocus: GL_SELECT regressed on hsw |
| - Random texture flickering in Minecraft |
| - panfrost: Broken interaction between CRC and tile size |
| - panfrost: allow_forward_to_be_killed is too conservative |
| - APU downgraded to OpenGL 3.0 since UBO size change in radeonsi |
| - [anv, bisected, regression] Pipeline statistics query segfaults |
| - radv/NIR: Suboptimal codegen for AccessChain on buffer device addresses |
| - Regression - vah264dec + glimagesink causes green taint on entire video with mesa>=22.0.0 |
| - va: AMD: Wrong buffer size of P010 Luma plane |
| - 625b352f14b broke glcts on Navi1x |
| - Use marketing_name for the Vulkan device string in RADV |
| - Gallium driver crashes due to fd double close in iris driver |
| - radv: Khronos ray tracing sample issues |
| - r300: copy propagate does not correctly account for control flow |
| - r300: Amnesia the Dark Descent slowdown since NIR loop unrolling |
| - r300: some unused uniforms are not removed with the NIR varying linker (compared to old glsl varying linker) |
| - Mesa currently FTBFS on x32 architecture |
| - Runescape randomly crashes. malloc_consolidate(): unaligned fastbin chunk |
| - tu: viewport assert |
| - d3d12: point coord regression due to NIR IO changes |
| - nir_lower_phis_to_regs_block: Assertion \`src->src.is_ssa' failed. |
| - Redream emulator showing artifacts |
| - Build failure on sparc |
| - r600 video playback broken |
| - r300: hundreds of tests fail since "clear color buffers using color from a constant buffer" |
| - turnip: shader viewport handling |
| - Signal 6 (Aborted) when opening GTK3 applications |
| - radv: Conditional rendering on compute queue |
| - anv: line failure |
| - 32 bit build failure |
| - d3d12 build regression: d3d12video.h: error: expected ‘;’ at end of member declaration |
| - panfrost midgard - major issues with MelonDS emulator - not able to trace properly too |
| - Intel/anv: The cloak is flickering in Batman:Arkham Knight |
| - regression: "NIR based varying linker" breaks some xwayland/glamor applications with etnaviv |
| - pvr: build failure on archs with unsigned long != uint64_t |
| - turnip: dEQP-GLES31.functional.copy_image.non_compressed.viewclass_32_bits.* fails with angle |
| - Nheko misrendering on RK3399 (Mali T860) |
| - intel_pps_driver.cc:308:74: error: too many arguments to function ‘void intel_perf_query_result_accumulate_fields(intel_perf_query_result*, const intel_perf_query_info*, const void*, const void*, bool) |
| - radv: no nir validation in radv_link_shaders |
| - r300: deqp regressions from "glsl: enable the use of the nir based varying linker" |
| - [crocus] glmark2 -b jellyfish rendering issues |
| - radeonsi-stoney: Failing jobs during a CI bug |
| - iris: skqp gles_atlastext flake |
| - turnip: use-after-free in shader compilation |
| - anv: EXT_primitives_generated_query fails |
| - r300: vs predicate logic fails in vs-loop-complex-unroll-with-else-break.shader_test |
| - [Intel/Vulkan] White blinking artifacts in Wolfenstein Youngblood |
| - radv: Indirect unaligned dispatch |
| - [radv] DCC causes artifacts in Senran Kagura Shinovi Versus |
| - Radeon RX 5700XT crash when using prusa-slicer |
| - glcts proxy_textures_invalid_samples tests assert on iris |
| - turnip: gfxbench mh3.1 offscreen crash with ANGLE |
| - dEQP-GLES31.functional.shaders.builtin_functions.integer.[i|u]mulextended.* fail |
| - Radv leaks a descriptor set layout |
| - radeonsi: KHR_debug thread safety |
| - [radeon, rv370] Running piglit shaders@glsl-vs-raytrace-bug26691 test causes hard lockup & reboot |
| - clover: Assertion \`NumContainedTys && "Attempting to get element type of opaque pointer"' failed |
| - wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes causes memory leak in Apex Legends |
| - v3d: port v3dv workaround for GFXH-1918 |
| - freedreno: fragile location packing can break cull/clip distance |
| - turnip: dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.generated_args.* failures |
| - Panfrost: wrong textures and lighting in "Thief" games in wine (not happening when using Software render) |
| - [Bisected] r600: Big performance regression on some games |
| - turnip: report corresponding drmFormatModifierPlaneCount for planar format |
| - piglit ext_image_dma_buf_import.ext_image_dma_buf_import* regression |
| - Huge amount of anon_inode:sync_file file descriptor created |
| - enable warnings for GNU extensions we don't want |
| - Grid Autosport via proton/vulkan: glitches on the grass on amdgpu, fine on amdgpu-pro |
| - NIR validation failed after nir_opt_if |
| - Etnaviv: The triangle is lost during linking together triangle strips with degenerate triangles |
| - intel: integer_mad_hi / integer_mad_sat / integer_mul_hi produce invalid results |
| - Aperture Desk Job rendering artifacts Intel TGL |
| - intel: integer_sub_sat produces stack corruption / OpISubBorrow compilation issue |
| - r300/rv530: "Black rendering" of Baldur's Gate Dark Alliance in Xbox emulator CXBX-R under wine with Gallium Nine and RADEON_DEBUG=use_tgsi |
| - radv/aco: xfb bug |
| - '../src/amd/common/ac_surface.c:1575: void ac_copy_dcc_equation(const struct radeon_info \*, ADDR2_COMPUTE_DCCINFO_OUTPUT \*, struct gfx9_meta_equation \*): assertion "dcc->equation.gfx9.num_bits <= ARRAY_SIZE(equation->u.gfx9.bit)" failed' |
| - Occasional flicker corruption in Rage 2, e.g. after loading, with ACO on RX 5700 XT |
| - Vulkan::Calling vkWaitForFences Timeout |
| - turnip: bad gmem rendering with PUBG mobile on angle |
| - Intel (CHT) - Uplink text rendering bugged out in Mesa 22.0 |
| - gen9atom gpu hang on dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_1 |
| - bad memory managment on panfrost RK3399 - cannot alocate more ram - fury unleashed |
| - radv,aco: Regression with global atomics with negative offsets |
| |
| |
| Changes |
| ------- |
| |
| Adam Jackson (32): |
| |
| - docs: Note EGL enum allocation for EGL_EXT_present_opaque |
| - docs: Update supported drivers for 22.x |
| - glx/kopper: Wire up a way for SwapBuffers to fail |
| - wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes |
| - kopper: Grow a swap interval API |
| - glx/kopper: Enable GLX_EXT_swap_control etc. |
| - egl/kopper: Hook up eglSwapInterval |
| - wsi/x11: Hook up KHR_incremental_present |
| - dri: Inline dri2InvalidateDrawable into its one caller |
| - dri: Move driCopySubBufferExtension into drisw |
| - dri: move driQueryRendererIntegerCommon near its one caller |
| - dri: Move driC{reate,oncat}Configs near their only caller |
| - dri: Move dri{Get,Index}ConfigAttrib near their vtable |
| - dri: Remove what's left of utils.[ch] |
| - glx/dri2: Require DRI2 >= 1.3 for working buffer invalidation |
| - dri2: Require a loader with working buffer invalidation |
| - zink: Print the VkResult if vkCreateInstance fails |
| - glx/dri: Fix DRI drawable release at MakeCurrent time |
| - mesa: Stub out GL_APPLE_object_purgeable |
| - glx: Remove the remains of GLX_MESA_multithread_makecurrent |
| - gallium/dri: Remove the memory of GLX_MESA_multithread_makecurrent |
| - glx: Remove some can't-happen checks from protocolDestroyDrawable |
| - glx: Remove unused __glXSetupForCommand from __glXIsDirect |
| - glx: Remove excess screen parameter from CreateContext |
| - glx/dri2: Stop tracking the protocol version in the display state |
| - glx: Fix return values from __glXGetDrawableAttribute |
| - glx: Fix drawable refcounting for naked Windows |
| - kopper: Implement {EGL,GLX}_EXT_buffer_age |
| - wsi/x11: Synchronously check for error from xcb_present_pixmap |
| - mesa: Remove Purgeable fields from texture/buffer/renderbuffer objects |
| - llvmpipe: Indentation fix |
| - gallium/dri2: Enable {GLX_ARB,EGL_KHR}_context_flush_control |
| |
| Alejandro Piñeiro (22): |
| |
| - v3dv/pipeline: fix small comment typo |
| - v3dv/pipeline_cache: adds check to skip searching for a entry |
| - v3dv/pipeline_cache: add on disk cache hit stats |
| - v3dv/pipeline: include pipeline layout on the pipeline sha1 |
| - v3dv/bo: unmap any bo that goes to the bo cache |
| - v3dv/format: no need for GetPhysicalDeviceFormatProperties |
| - v3dv/format: Add support for VK_KHR_format_feature_flags2 |
| - nir: get res binding using component 0, instead of asssumig an uint |
| - v3dv/pipeline: call nir_lower_explicit_io after first nir optimization loop |
| - v3dv/pipeline: expand nir_optimize, drop st_nir_opts |
| - v3dv/device: add comment on depthClamp feature |
| - broadcom: add no_merge_jobs debug option |
| - broadcom: document which debug options are specific for a given driver |
| - broadcom/compiler: wrap nir_convert_to_lcssa with NIR_PASS_V |
| - broadcom/compiler: use NIR_PASS for nir_lower_vars_to_ssa at v3d_optimize_nir |
| - nir: call nir_metadata_preserve at nir_remove_unused_io_vars |
| - v3dv/pipeline: use nir_shader_instructions_pass when lowering layout |
| - broadcom/compiler: removed unused function |
| - broadcom/compiler: make several passes to return a progress |
| - v3d/v3dv: use NIR_PASS(_ |
| - vc4/v3d: stop adding NORAST when SHADERDB debug option is used |
| - vc4/v3d: restore calling debug_get_option_vc4/v3d_debug |
| |
| Alessandro Astone (5): |
| |
| - kopper: Only include XCB headers on supported platforms |
| - Android.mk: Cleanup mesa3d-lib |
| - Android.mk: Make mesa3d-lib work with absolute path meson outputs |
| - Android.mk: Generate the dummy source in local-generated-sources-dir |
| - Android.mk: Intermediate output paths may already be absolute |
| |
| Alexander Kanavin (1): |
| |
| - swrast_kms: use swkmsDRI2Extension instead of driDRI2Extension |
| |
| Alexey Bozhenko (2): |
| |
| - intel/compiler: fix singleton pointer coverity warning |
| - spirv: fix OpBranchConditional when both branches are the same |
| |
| Alyssa Rosenzweig (454): |
| |
| - panfrost: Add 48-bit and 96-bit tiling routines |
| - panfrost: Remove bpp check for tiling |
| - panfrost: Only handle 2D in should_linear_convert |
| - panfrost: Implement tiled 3D resource access |
| - panfrost: Only tile for 2D dimensions |
| - panfrost: Allow tiling all texture targets |
| - panfrost: Assert alignment in tiling routine |
| - panfrost: Document arguments to tiling routines |
| - panfrost: Unit test u-interleaved tiling routines |
| - docs/panfrost: Document u-interleaved tiling |
| - panfrost: Rework dirty tracking for Valhall |
| - panfrost: Dirty track batch masks |
| - panfrost: Rework PAN_GPU_ID mechanism |
| - docs/panfrost: Document drm-shim usage |
| - meson: Ignore unused variables in release builds |
| - pan/bi: Simplify bi_propagate_pass_flags |
| - panvk: Respect line width |
| - panvk: Consider primitive restart in index buffer walks |
| - pan/bi: Implement fquantize2f16 |
| - panvk: Handle empty shaders gracefully |
| - panvk: Lower shared memory |
| - util: Add a generic worklist implementation |
| - nir: Use u_worklist to back nir_block_worklist |
| - pan/bi: Implement pack_uvec[24]_to_uint |
| - panfrost: Add texture features enum to v9.xml |
| - pan/blit: Compile blit shaders without IDVS |
| - pan/blit: Generalize texture alignment |
| - pan/blit: Prepare for Valhall port |
| - pan/blit: Support v9 data structures |
| - panfrost: Don't allocate storage for PSIZ on Valhall |
| - panfrost: Compile libpanfrost for v9 |
| - panfrost: Add helper to emit UBOs |
| - panfrost: Port uniform/UBO logic to Valhall |
| - panfrost: Don't fix up alpha test on Bifrost |
| - panfrost: Use common state emit for compute jobs |
| - panfrost: Adapt compute job emit for Valhall |
| - panfrost: Specialize rasterizer state for Valhall |
| - panfrost: Simplify attribute format expression |
| - panfrost: Specialize vertex elements for Valhall |
| - panfrost: Specialize ZSA state for Valhall |
| - panfrost: Specialize shader descriptors for Valhall |
| - panfrost: Use dirty flags to emit v9 descriptors |
| - panfrost: Generate Valhall Malloc IDVS jobs |
| - panfrost: Compile for v9 |
| - panfrost: Kick off v9 support in the driver |
| - panfrost: Advertise all textures in drm-shim |
| - panfrost: Split pan_layout.c from pan_texture.c |
| - panfrost: Always use 64-bit SD with strides |
| - panfrost: Add afbc_superblock_{size, width, height} helpers |
| - panfrost: Use panfrost_afbc_superblock_width |
| - panfrost: Don't pretend to support multiplane AFBC |
| - panfrost: Extract panfrost_afbc_is_wide helper |
| - panfrost: Use pan_block_size in layout calculation |
| - panfrost: Unify panfrost_block_size paths |
| - panfrost: Unify paths through image_layout_init |
| - panfrost: Remove unused dev argument |
| - panfrost: Unit test block size queries |
| - panfrost: Simplify how image_layout_init is called |
| - panfrost: Add a test for pan_image_layout_init |
| - asahi: Workaround broken GLSL compiler |
| - agx: Implement some shader-db stats |
| - agx: Note that RA proceeds in dominance-order |
| - agx: Track write registers more accurately |
| - agx: Match order for designated initializers |
| - agx: Use correct types for some IR enums |
| - agx: Wrap compiler header in extern "C" |
| - agx: Add unit test infrastructure |
| - agx: Use pseudo ops for mov/not/and/xor/or |
| - agx: Implement simple copyprop |
| - agx: Dynamically allocate agx_instr->src |
| - agx: Add p_split pseudoinstruction |
| - agx: Add phi pseudo instruction |
| - agx: Add a hash table for vector extracts |
| - agx: Add helper to emit combines |
| - agx: Add helper to emit splits |
| - agx: Use a transfer graph for parallel copies |
| - agx: Lower p_split after RA |
| - agx: Optimize p_split(kill) specially |
| - agx: Emit splits for intrinsics |
| - agx: Use extract helper for tex internally |
| - agx: Remove else optimization |
| - agx: Use a dynarray for predecessors |
| - agx: Add predecessor index helper |
| - agx: Mark the logical ends of blocks |
| - agx: Add agx_after_block_logical helper |
| - agx: Introduce worklist infrastructure |
| - agx: Adapt liveness analysis for SSA |
| - agx: Make DCE dumber |
| - agx: Translate phi nodes |
| - agx: Copy prop into phi nodes |
| - agx: Don't lower phis in NIR |
| - agx: Lower phi nodes to parallel copies |
| - agx: Split up RA from post-RA lowering |
| - agx: Always use hash table for extracts |
| - agx: Unit test parallel copy lowering |
| - agx: Remove identity moves |
| - agx: Add validation pass |
| - agx: Update RA comment |
| - agx: Remove has_liveness |
| - agx: Remove nir_register support |
| - agx: Lower mediump flat shading |
| - panvk: Ignore point size for !points |
| - panvk: Use correct point size limits |
| - panfrost: Only flip point sprites for GL |
| - panvk: Streamline no shader RSD case |
| - panvk: Gate rasterization on !discard, not fs req |
| - panvk: Emit fragment RSDs even with no shader |
| - panfrost: Remove ancient unused code |
| - panfrost: Call tgsi_to_nir earlier |
| - panfrost: Don't subclass pipe_compute_state |
| - panfrost: Use s->info.stage instead of tgsi stage |
| - panfrost: Simplify shader key architecture |
| - panfrost: Refactor variant selection code |
| - panfrost: Refactor variant rebind code |
| - panfrost: Lower point sprites on Bifrost |
| - panfrost: Lower user clip planes |
| - panfrost: Flip point coords in hardware |
| - pan/va: Generalize message? check for asm |
| - pan/va: Add missing TABLE (SFU) instructions |
| - pan/va: Mark NOT instructions as deprecated |
| - pan/va: Add VAR_TEX_BUF_GRADIENT instruction |
| - pan/va: Add non-IDVS VAR_TEX instructions |
| - panvk: Lower global vars to local |
| - panvk: Allocate stack for each thread |
| - pan/bi: Handle texture offset + index |
| - panvk: Lower image deref atomics |
| - panvk: Only write depth when depth test enabled |
| - pan/decode: Print row strides, not line strides |
| - panvk: Remove unused layout structs |
| - panvk: Report row_stride in GetImageSubresourceLayout |
| - panfrost: Add helpers to work with legacy strides |
| - panfrost: Use row stride for explicit layouts |
| - panfrost: Rename away from bytes_per_pixel |
| - panfrost: Use row stride to calculate slice size |
| - panfrost: Adapt get_param for row strides |
| - panfrost: Use row_stride even for linear resources |
| - lima,panfrost: Use row stride for tiling routines |
| - panfrost: Remove line_stride |
| - panfrost: Unify row stride and AFBC row stride |
| - panfrost: Unit test stride calculations |
| - panfrost: Unit test "from legacy" helper |
| - pan/bi: Rename bi_block->name to bi_block->index |
| - pan/bi: Count blocks |
| - pan/bi: Add u_worklist wrapper macros |
| - pan/bi: Use bi_worklist in analyze_helper_requirements |
| - pan/bi: Use bi_worklist for liveness |
| - pan/bi: Use bi_worklist for post-RA liveness |
| - pan/bi: Use worklist for scoreboard analysis |
| - pan/bi: Use a dynarray for predecessors |
| - nir: Don't set writes_memory for reading XFB |
| - panfrost: Relax image check |
| - pan/va: Serialize memory stores |
| - pan/bi: Align spilled registers on Valhall |
| - pan/bi: Add option to test spilling |
| - pan/bi: Assert that blend shaders may not spill |
| - panfrost: Correct XML for TLS |
| - panfrost: Use emit_tls |
| - panfrost: Use packed TLS on Valhall |
| - panfrost/ci: Smoke test spilling |
| - panfrost: Don't use VARYING_SLOT_TEX0 internally |
| - panvk: Don't use VARYING_SLOT_TEX0 internally |
| - pan/bi: Rework varying linking on Valhall |
| - panfrost: Increase the maximum tiler heap size |
| - mesa/st: Fix building tests on macOS |
| - panvk: Stub pipeline cache using the common code |
| - panvk: Conform viewport code to Vulkan spec |
| - panvk: Call nir_opt_trivial_continues |
| - pan/va: Add some whitespace to Valhall disassembly |
| - pan/va: Add whitespace after disassembled branches |
| - agx: Restore Valve copyright header |
| - pan/bi: Don't analyze td on Valhall |
| - pan/bi: Emit LEA_TEX on Valhall |
| - pan/bi: Support image loads on Valhall |
| - pan/bi: Preserve AXCHG.i32 destination |
| - pan/bi: Handle shared atomic exchange on Valhall |
| - pan/bi: Handle shared/scratch on Valhall |
| - pan/bi: Support atomics on Valhall |
| - pan/bi: Ensure the end NOP isn't eliminated |
| - asahi: Fix hangs waiting on the notification queue |
| - pan/bi: Update bi_count_write_registers for Valhall |
| - pan/va: Mark more source sizes |
| - pan/va: Lower split 64-bit sources |
| - pan/va: Use 64-bit lowering for texturing |
| - pan/bi: +JUMP can't read same-cycle temp |
| - pan/bi: Lower phis to scalar |
| - pan/bi: Use value-based interference with LCRA |
| - pan/bi: Add source/destination counts |
| - pan/bi: Add collect and split instructions |
| - pan/bi: Add bi_before_block cursor |
| - pan/bi: Lower split/collect before RA |
| - pan/bi: Fix mov and pack_32_2x16 |
| - pan/bi: Expand MAX_DESTS to 4 |
| - pan/bi: Create COLLECT during isel |
| - pan/bi: Lift split/collect cache from AGX |
| - pan/bi: Simplfy BLEND emit |
| - pan/bi: Emit collect and split |
| - pan/va: Use split for 64-bit lowering |
| - pan/va: Don't use bi_word in FAU unit test |
| - pan/bi: Scalarize bi_lower_swizzle |
| - pan/bi: Scalarize bi_opt_cse |
| - pan/bi: Scalarize modifier propagation |
| - pan/bi: Scalarize copyprop |
| - pan/bi: Remove bi_word and bi_word_node |
| - pan/bi: Simplify register precolouring in the IR |
| - pan/bi: Remove liveness metadata tracking |
| - pan/bi: Don't propagate discard |
| - pan/bi: Optimize split of collect |
| - pan/bi: See through splits for var_tex fusion |
| - pan/bi: Validate preload constraints are satisfied |
| - pan/bi: Validate vector widths |
| - pan/va: Handle 2-src blend in lower_split_src |
| - pan/va: Remove DISCARD.f32 destination |
| - pan/va: Use ^ instead of ` to indicate last-use |
| - asahi: Run lower_blend for all fragment shaders |
| - asahi: Use z24_in_z32f lowering |
| - asahi: Validate depth/stencil formats |
| - asahi: Don't store to unbound render targets |
| - asahi: Handle uncompressed Z32F depth buffers |
| - asahi: Pass depth/stencil clear values to firmware |
| - asahi: Flush on partial clears |
| - asahi: Handle 3rd deflake address |
| - asahi: Account for every byte of the cmdbuf |
| - asahi: Identify more depth pipeline fields |
| - asahi: Dump "unknown 4a" structure |
| - asahi: Add XML used for visibility tests |
| - asahi: Relax Draw "command" check |
| - asahi: Note unknown field |
| - asahi: Identify depth bias array |
| - asahi: Identify "set depth bias index" field |
| - asahi: Identify depth bias enable bit |
| - asahi: Add a depth bias structure |
| - asahi: Implement polygon offset |
| - asahi: Identify partial render pipeline |
| - asahi: Sync some names with Project Zero |
| - asahi: Add segment list pretty printer |
| - asahi: Split unknown field in segment list header |
| - asahi: Validate segment list length |
| - asahi: Compute segment list length |
| - asahi: Align segment list unknowns with Metal |
| - asahi: Decode resource allocation responses |
| - asahi: Decode resource allocation requests |
| - asahi: Identify suballocation mechanism |
| - asahi: Identify suballocated size |
| - asahi: Drop some unknowns |
| - asahi: Sync some more unknown fields |
| - asahi: Improve render target dumping |
| - asahi: Fix pipeline decoding |
| - asahi: Handle RECT textures as 2D |
| - asahi: Allocate a larger segment list |
| - asahi: Guard against encoder overflows |
| - asahi: Add wrap library |
| - docs/asahi: Add initial driver docs |
| - panfrost: Simplify the kmsro create path |
| - panfrost: Don't allocate SHARED as scanout |
| - pan/bi: Recoalesce tied operands after spilling |
| - pan/bi: Schedule for pressure pre-RA |
| - pan/bi: Allow CSEing LEA_BUF_IMM |
| - panfrost: Fix crash with GALLIUM_HUD |
| - panfrost: Extract afbc_stride_blocks helper |
| - panfrost: Extract pan_afbc_row_stride helper |
| - panfrost: Fix is_wide return type |
| - panfrost: Add pan_afbc_tile_size helper |
| - panfrost: Support tiled AFBC in stride helpers |
| - panfrost: Align layouts to tiles of superblocks |
| - panfrost: Add AFBC stride unit tests |
| - panfrost: Add 3D texture layout unit test |
| - panfrost: Calculate header_size based on row_stride |
| - panfrost: Add a tiled 16x16 layout unit test |
| - panfrost: Add 1x1 layout unit tests |
| - panfrost: Fix AFBC flags on v6 |
| - panfrost: Put comment in correct #ifdef |
| - panfrost: Handle AFBC Tiled |
| - panfrost: Add helper checking tiled AFBC support |
| - panfrost: Gate tiled AFBC on GPUs that support it |
| - panfrost: Advertise 16x16 tiled AFBC |
| - panfrost: Use tiled AFBC textures |
| - pan/bi: Fuse result types |
| - pan/bi: Test CMP result_type optimization |
| - panfrost: Allow FPK on Valhall |
| - panfrost: Set overdraw_alpha[01] flags |
| - panfrost: Set allow_rotating_primitives |
| - pan/bi: Don't allow spilling coverage mask writes |
| - pan/bi: Add VAR_TEX fusing unit test |
| - pan/va: Fill in missing src_flat16 enum |
| - pan/bi: Make LD_VAR w=format instead of w=vecsize |
| - pan/bi: Model Valhall source formats |
| - pan/bi: Interpolate varyings at 16-bit |
| - pan/bi: Extend bi_scoreboard_state for finer tracking |
| - pan/bi: Add slot to bi_instr |
| - pan/bi: Preserve flow control for non-psiz variant |
| - pan/bi: Add ASSERT_SHADER_EQUAL macro |
| - pan/bi: Add a trivial ctx->inputs for unit tests |
| - pan/bi: Extract bit_block helper |
| - pan/bi: Export bi_block_add_successor |
| - pan/bi: Export helper termination analysis |
| - pan/bi: Print flow control on instructions |
| - pan/va: Add pass to insert flow control |
| - pan/va: Do scoreboard analysis |
| - pan/va: Terminate helper threads |
| - pan/va: Add flow control merging pass |
| - pan/va: Assign slots roundrobin |
| - pan/va: Respect assigned slots |
| - pan/va: Unit test flow control insertion |
| - pan/va: Unit test flow control merging |
| - pan/bi: Use flow control lowering on Valhall |
| - pan/mdg: Scalarize with 64-bit sources |
| - pan/mdg: Remove uppercase write masks |
| - pan/mdg: Use "<<" instead of "lsl" |
| - pan/mdg: Only print masked components of swizzle |
| - pan/mdg: Only print 2 sources for ALU |
| - pan/mdg: Only print 1 source for moves |
| - pan/mdg: Distinguish SSA vs reg when printing IR |
| - pan/mdg: Print outmods when printing IR |
| - panfrost: Fix XML for AFBC header on v9 |
| - panfrost: Pass arch instead of dev into afbc_format |
| - panfrost: Add pan_afbc_compression_mode on Valhall |
| - panfrost: Support rendering to AFBC on Valhall |
| - panfrost: Handle texturing from AFBC on Valhall |
| - panfrost: Allow 2D AFBC on Valhall |
| - panvk: Use vk_image_subresource_*_count for clears |
| - pan/bi: Respect swizzles in nir_op_pack_64_2x32_split |
| - pan/bi: Allow vec6 for collects |
| - pan/bi: Align 64-bit register sources |
| - pan/bi: Rework Valhall register alignment |
| - pan/bi: Fix clper_xor on Mali-G31 |
| - panfrost/ci: Mark draw_buffers_indexed.* as flakes |
| - panfrost: Use C11 static_assert for enums |
| - nir: Add transform feedback system values |
| - nir: Export nir_io_add_intrinsic_xfb_info |
| - pan/bi: Add transform feedback lowering pass |
| - pan/bi: Handle transform feedback intrinsics |
| - panfrost: Don't allow vertex shaders to have side effects |
| - panfrost: Wire up transfrom feedback sysvals |
| - panfrost: Create transform feedback shaders |
| - panfrost: Launch transform feedback shaders |
| - pan/va: Do not insert NOPs into empty shaders |
| - panfrost: Extract panfrost_get_index_buffer helper |
| - panfrost: Don't calculate min/max indices on v9 |
| - pan/bi: Extract MUX to CSEL optimization |
| - pan/va: Implement more lanes |
| - pan/va: Lower MUX to CSEL where possible |
| - pan/va: Add MUX lowering tests |
| - pan/bi: Implement b2i with MUX |
| - pan/va: Record which instructions are signed |
| - pan/va: Try negating small constants when lowering |
| - pan/va: Try widening small constants |
| - pan/va: Unit test constant lowering pass |
| - panfrost: Enable Mali-G57 |
| - docs: Document Mali-G57 conformance |
| - panfrost: Fix inverted indirect dispatch condition |
| - pan/bi: Implement load_push_constant |
| - pan/bi: Don't reorder push with no_ubo_to_push |
| - panfrost: Use push constants for indirect dispatch |
| - panfrost: Use push constants for indirect draws |
| - panvk: Use push constants for clear |
| - panvk: Use push constants for copy shaders |
| - panfrost: Deduplicate indirect dispatch structs |
| - panfrost: Remove redundant first_tag access |
| - panfrost: Remove is_64b assignments |
| - panfrost: Add panfrost_query_l2_slices helper |
| - pan/perf: Fix performance counters on G57 |
| - mesa,gallium: Make point coord origin a CAP |
| - asahi,agx: Don't set PIPE_CAP_POINT_COORD_ORIGIN_UPPER_LEFT |
| - panvk: Drop support for Midgard |
| - panfrost: Allow pixels using discard to be killed |
| - panfrost: Inline pan_fbd_has_zs_crc_ext |
| - panfrost: Disable CRC at <16x16 tile sizes |
| - panvk: Don't specialize clear shaders for RT |
| - panvk: Remove unused pushmaps |
| - panvk: Clear Z/S attachments without a shader |
| - panvk: Fix stencil clears of combined ZS images |
| - panvk: Simplify depth clear preload condition |
| - panvk: Use common CmdBeginRenderPass |
| - panfrost: Disable QUADS on Valhall |
| - panfrost: Fix batch state changes on Valhall |
| - panfrost: Add tristate_is_defined helper |
| - panfrost: Implement provoking vertices on Valhall |
| - pan/va: Move VA_NUM_GENERAL_SLOTS to common |
| - pan/va: Simplify insert flow tests |
| - pan/va: Workaround quirk of barrier handling |
| - pan/va: Unit test barrier handling |
| - pan/bi: Mark bi_postra_liveness_ins as MUST_CHECK |
| - pan/bi: Constify bi_is_staging_src argument |
| - pan/va: Add header guards to valhall_enums.h |
| - pan/va: Move va_flow_is_wait_or_none to common |
| - pan/va: Include BLEND for va_swap_12 |
| - pan/va: Use validate_register_pair for BLEND pack |
| - pan/va: Mark last register reads |
| - pan/va: Unit test va_mark_last |
| - pan/bi: Fix LD_BUFFER.i16 definition |
| - pan/bi: Align accesses with packed TLS |
| - pan/va: Lower FADD_RSCALE.f32 to FMA_RSCALE.f32 |
| - pan/va: Implement sample positions FAU packing |
| - pan/va: Rename RSCALE to LDEXP |
| - pan/bi: Remove FRSCALE from IR |
| - pan/bi: Model MKVEC.v2i8 |
| - pan/bi: Constant fold MKVEC.v2i8 |
| - pan/va: Pack MKVEC.v2i8 byte lanes |
| - pan/va: Replace MKVEC.v4i8 with MKVEC.v2i8 |
| - pan/bi: Tune lower_vars_to_scratch |
| - panfrost: Enable more FP16 caps on Valhall |
| - panfrost: Enable CAP_INDIRECT_TEMP_ADDR on Valhall |
| - panfrost: Bump ESSL_FEATURE_LEVEL on Valhall |
| - v3d: Drop workaround for u_blitter bug |
| - agx: Handle loop { if { loop { .. } } } |
| - r600g: Remove streamout-based buffer copy path |
| - u_blitter: Remove util_blitter_copy_buffer |
| - lima,panfrost: Do not ralloc contexts against the screen |
| - pan/va: Handle terminal barriers |
| - ttn: Set nir->info.separate_shader |
| - pan/bi: Fix unpack_32_2x16 definition |
| - pan/va: Add missing <roundmode/> to V2F32_TO_V2F16 |
| - pan/bi: Implement f2f16{_rtz, _rtne} |
| - panfrost: Separate core ID range from core count |
| - panfrost: Move bifrost_lanes_per_warp to common |
| - panfrost: Fix PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS |
| - panfrost: Fix PIPE_COMPUTE_CAP_SUBGROUP_SIZE |
| - panfrost/ci: Disable 0ad trace on T860 |
| - panfrost: Clear with a quad to avoid flushing |
| - panfrost/ci: Disable T760 jobs |
| - nir/opt_shrink_vectors: Round to supported vec size |
| - panfrost: Revert provoking vertex assertion |
| - panfrost: Only key points to point coord origin |
| - pan/va: Dump unencodable instructions |
| - pan/va: Use invalid_instruction in more places |
| - pan/va: Print instructions with pack assert fails |
| - pan/va: Handle BIFROST_MESA_DEBUG=nosb |
| - panfrost: Make pan_select_max_tile_size O(1) |
| - panfrost: Use implementation-specific tile size |
| - panfrost: Protect pandecode by a mutex |
| - pan/bi: Require ATEST coverage mask input in R60 |
| - pan/va: Stall after ATEST |
| - panfrost: Fix shader_modifies_coverage on Valhall |
| - panfrost: Add decoupled early-ZS helpers |
| - panfrost: Unit test early-ZS helpers |
| - panfrost: Add zsa->zs_always_passes flag |
| - panfrost: Use early-ZS helpers |
| - util: Enable list_assert in debugoptimized builds |
| - util: Gate simple_mtx_assert_locked on !NDEBUG |
| - agx: Port minifloat tests to GTest |
| - pan/bi: Assert that we use the correct CLPER |
| - pan/bi: Rename CLPER_V6.i32 to CLPER_OLD.i32 |
| - pan/decode: Fix overrun decoding planes |
| - nir/lower_idiv: Be less creative about signs |
| - agx: Fix packing of samplers in texture instrs |
| - panfrost: Don't segfault on unknown models |
| - pan/bi: Don't reorder image loads across stores |
| - pan/bi: Don't allow ATEST to take a temporary |
| - pan/mdg: Print 3 sources for CSEL |
| - pan/bi: Fix dual texturing with uniforms |
| - pan/bi: Fix out-of-bounds write in va_lower_split_64bit |
| - pan/bi: Consider all dests in helper_block_update |
| - agx: Fix float copyprop of neg(neg) case |
| - panfrost: Respect buffer offset for OpenCL |
| |
| Andres Calderon Jaramillo (2): |
| |
| - nir: Account for YUV range. |
| - mesa/st: Account for YUV color space and range. |
| |
| Andres Gomez (5): |
| |
| - radv/ci: update vkd3d-proton results for AMD's Kabini |
| - ci: install a more recent version of Wine in the test-vk image |
| - ci: install gfxinfo-mupuf in the test-vk image |
| - ci: move b2c artifacts to its own directory |
| - ci: add wrapper script for Valve's traces runner |
| |
| Anuj Phogat (2): |
| |
| - include/uapi: Update drm_fourcc.h from kernel |
| - isl,iris: Add I915_FORMAT_MOD_4_TILED support for XeHP |
| |
| Arvind Yadav (7): |
| |
| - nir: Add a lowering pass for polygon and line smoothing |
| - radeonsi: Add nir_lower_poly_line_smooth pass for polygon and line smoothing |
| - nir: add load_point_coord_maybe_flipped intrinsics for point smoothing |
| - nir: Add a lowering pass for point smoothing |
| - ac/llvm : Adding Number of all interpolated inputs in ac_shader_abi |
| - ac/llvm: Implement nir_intrinsic_load_point_coord_maybe_flipped opcodes |
| - radeonsi: Enable nir_lower_point_smooth lowering pass for point smoothing |
| |
| Autumn on Tape (5): |
| |
| - gallivm: add subgroup shuffle support |
| - lavapipe: enable subgroup shuffle operations |
| - gallivm: use shufflevector for shuffles when index is constant data |
| - gallivm: use VPERMPS (x86/AVX2) for 32-bit 8-element shuffles |
| - lavapipe: enable subgroup quad operations |
| |
| Axel Davy (6): |
| |
| - frontend/nine: Skip invalid swvp calls |
| - frontend/nine: Fix buffer tracking out of bounds |
| - frontend/nine: Fix ATOC handling |
| - frontend/nine: Fix cso restore bug |
| - frontend/nine: Fix shader multi-use crash |
| - frontend/nine: Fix ff position_t fallback when w = 0 |
| |
| Bas Nieuwenhuizen (39): |
| |
| - vulkan/wsi/x11: Ensure we have the required number of images for acquire. |
| - radv: always enable NGG on GFX11 |
| - radv: Mark all formats as DCC compatible with each other on gfx11. |
| - radv: gfx11 register changes. |
| - radv: Adjust for new SWIZZLE_ENABLE. |
| - radv: Add gfx11 DCC fast clear support. |
| - radv: Disable transform feedback for gfx11. |
| - radv: update the initialization of SGPR0/1 registers for HS and GS on GFX11 |
| - radv: Deal with derefs from opaque types in function parameters. |
| - radv: Add performance counter reg write. |
| - radv: move ac_perfcounters to physical_device. |
| - radv: Add pstate locking for perfcounters. |
| - radv: Rename SPM functions. |
| - radv: Make inhibiting clockgating not static. |
| - radv: Add helper BO for perf counters. |
| - radv: Add perf counter lock/unlock commandbuffers. |
| - radv: Add a list of performance counters. |
| - radv: Implement support for querying performance counters. |
| - radv: Implement listing performance counters. |
| - radv: Expose VK_KHR_performance_query. |
| - radv: Add ability to override the build id for the cache. |
| - radv: Add a 32bit memory type. |
| - radv: Skip setting empty index buffers to avoid hang |
| - radv: Expose function to write vertex descriptors for dgc. |
| - radv: Always store stride in the vbo descriptor. |
| - radv: Require 32bit memory for indirect buffers. |
| - radv: Expose helper for base pa_su_sc_mode_cntl. |
| - radv: Make radv_get_vgt_index_size non-static. |
| - radv: Add helper to write scissors. |
| - radv: Add flushing for DGC. |
| - radv: Add DGC meta shader. |
| - radv: Implement DGC generated command layout structure. |
| - radv: Implement DGC cmdbuffer generation. |
| - radv: Implement CmdExecuteGeneratedCommandsNV. |
| - radv: Add stub for vkCmdBindPipelineShaderGroupNV. |
| - radv: Expose VK_NV_device_generated_commands. |
| - vulkan/wsi: Take max extent into consideration for modifier selection. |
| - amd/common: Don't rely on DCN support checks with modifiers. |
| - amd/common: Disable DCC retile modifiers on RDNA1 |
| |
| Ben Skeggs (3): |
| |
| - nv50/ir: fix OUT.FINAL on ampere |
| - nv50/ir: fix scope of memory ops on ampere |
| - nv50/ir: fix ALD/AST where address comes from AL2P |
| |
| Benjamin Cheng (5): |
| |
| - radv: fix memory leak of descriptor set layout |
| - radv: refactor image binding into struct |
| - radv: consolidate descriptor set buffer counts |
| - radv: implement disjoint memory for multiplane images |
| - radv: expose disjoint image support |
| |
| Boris Brezillon (159): |
| |
| - dzn: Skip binding entries with zero descriptors |
| - dzn: Make sure sampler heaps don't contain more than 2048 samplers |
| - microsoft/compiler: Add a dunmmy SV_SampleIndex when needed |
| - microsoft/spirv_to_dxil: Allow forcing per-sample shading |
| - dzn: Add support for sampleRateShading |
| - dzn: Pass the right type to CreateCommandList() in the reset path |
| - dzn: refactor error-handling |
| - ci/windows: Add a variable to globally disable jobs using windows runners |
| - dzn: Use core helpers to fill physical device features/properties |
| - dzn: Report actual device limits |
| - d3d12: Move the D3D12ResourceState code to the gallium driver |
| - ci/windows: Split the windows-related tests |
| - dzn: Add support for EXT_vertex_attribute_divisor |
| - dzn: Rename a bunch of dzn_descriptor_set helpers |
| - dzn: Pass layout objects to various dzn_descriptor_set_ptr_xxx helpers |
| - dzn: Split the write desc helpers in two halves |
| - dzn: Add support for KHR_descriptor_update_template |
| - microsoft/spirv_to_dxil: Add a draw_id field to vertex_runtime_data |
| - dzn: Fix exec_buf_stride calculation |
| - dzn: Simplify the triangle_fan_rewrite_index shader |
| - dzn: Update the draw_id sysval when issuing indirect draws |
| - dzn: Advertise KHR_shader_draw_parameters |
| - dzn: Pass an image view to dzn_cmd_buffer_clear_attachment() |
| - dzn: Set MAX_RTS to D3D12_SIMULTANEOUS_RENDER_TARGET_COUNT |
| - dzn: Move all render-related states to a substruct |
| - dzn: Inline dzn_cmd_buffer_clear_attachments() |
| - dzn: Add support for dynamic rendering |
| - dzn: Get rid of the render pass logic |
| - dzn: Allow multiDrawIndirect and drawIndirectFirstInstance |
| - dzn: Add KHR_draw_indirect_count support |
| - dzn: Add missing VKAPI_{ATTR,CALL} specifiers to BeginCommandBuffer() |
| - dzn: Pass the dzn_event pointer to _mesa_hash_table_insert() |
| - dzn: Fix the STATIC_ASSERT() in dzn_meta_blits_get_context() |
| - vulkan/wsi: Make wsi_win32_surface_get_capabilities() return the current extent |
| - dzn: Fix dzn_cmd_buffer_collect_queries() |
| - dzn: Use the vk_sync_binary abstraction |
| - spirv: Fix windows build |
| - dzn: Use the correct ICD path on Win32 |
| - microsoft/spirv_to_dxil: Fix push_constants type declaration |
| - microsoft/spirv_to_dxil: Fix the push_constant UBO size calculation |
| - dzn: Emit missing transition barriers for rendering attachments |
| - ci/windows: Bump spirv-samples commit-ID |
| - microsoft/spirv2dxil: Add a debug option |
| - microsoft/compiler: Make sure all SV_Position components are written |
| - microsoft/spirv_to_dxil: Extract NIR passes out of spirv_to_dxil() |
| - microsoft/spirv_to_dxil: Constify the descriptor binding mapping fields |
| - microsoft/compiler: Make input_mappings immune to driver_location holes |
| - microsoft/spirv_to_dxil: Remove dead IO vars at the end of dxil_spirv_nir_passes() |
| - microsoft/spirv_to_dxil: Add a linking helper |
| - microsoft/compiler: Allow vulkan users to handle varyings linking |
| - microsoft/spirv_to_dxil: vectorize IOs |
| - dzn: Fix maxVertexInputAttributes limit |
| - dzn: Handle varying linking explicitly |
| - dzn: Fix unitialized var access in dzn_meta_compile_shader() |
| - dzn: Fix vertex input handling |
| - microsoft/spirv_to_dxil: Let spirv_to_dxil() handle the linking |
| - microsoft/compiler: Drop the auto_link parameter passed to nir_to_dxil() |
| - dzn: Switch to ID3D12Device2 |
| - dzn: Drop an unused argument passed to dzn_graphics_pipeline_translate_vi() |
| - dzn: Use CreatePipelineState() |
| - dzn: Enable depth-bounds testing |
| - microsoft/compiler: Lower fquantize2f16 |
| - nir: Fix flat new_var assignment in create_new_io_vars() |
| - microsoft/compiler: Pick a type that matches interpolation mode for structs |
| - microsoft/compiler: Truncate function names when needed |
| - vulkan/util: Fill VkPhysicalDeviceIDProperties::deviceNodeMask |
| - spirv: Add a dst/src type to image deref loads/stores coming from OpAtomic{Load,Store} |
| - microsoft/compiler: Images are no longer reprensented by uniform vars |
| - microsoft/compiler: Set typed_uav_load_additional_formats when appropriate |
| - microsoft/compiler: s/assert()/unreachable()/ in emit_intrinsic() |
| - microsoft/compiler: Support load_sample_id_no_per_sample |
| - dzn: Decorrelate external image info and external image properties |
| - dzn: Return empty external properties |
| - dzn: Add a dummy GetDeviceGroupPeerMemoryFeatures() |
| - dzn: Add a dummy GetImageSparseMemoryRequirements2() |
| - dzn: Add dummy {Create,Destroy}SamplerYcbcrConversion() implementations |
| - dzn: Check that no export is requested in the memory allocation path |
| - dzn: Lower alignment requirements when allocating buffers or single-sample images |
| - dzn: Add a dzn_desc_type_has_sampler() helper |
| - dzn: Implement GetDescriptorSetLayoutSupport() |
| - dzn: Vulkan 1.1 wants maxPerSetDescriptors >= 1024 |
| - dzn: Prepare support for pipeline variants |
| - dzn: Fix primitiveRestart support |
| - dzn: Fix triangle-fan emulation |
| - dzn: Support dynamic depth testing parameters properly |
| - dzn: Support dynamic line width |
| - ci/windows: Allow skipping deqp tests if we have to |
| - ci/windows: Add dEQP-VK.dynamic_state.* to the test list |
| - ci/windows: Skip dEQP-VK.api.command_buffers.record_many_draws* |
| - d3d12: Keep access masks up-to-date when removing vars in d3d12_disable_multisampling() |
| - microsoft/compiler: Move d3d12_fix_io_uint_type() to dxil_nir.c |
| - microsoft/compiler: Fix Layer type |
| - microsoft/compiler: Support gl_Layer |
| - dzn: Disable rasterization if the last geometry stage doesn't write the position |
| - dzn: Enable support for geometry shaders |
| - ci/dzn: Move to a .toml description |
| - dzn: Fix the 2DMSArray case in dzn_image_get_dsv_desc() |
| - dzn: Fix CmdPushConstants() |
| - dzn: Clamp depthBiasConstantFactor when doing the float -> int conversion |
| - dzn: Transition resource to RENDER_TARGET/DEPTH_WRITE before clears |
| - microsoft/compiler: Fix emit_ubo_var() |
| - ci/dzn: Copy testlog.{css,xsl} to the result dir |
| - nir/serialize: Put dest last in packed_instr::tex |
| - dzn: Advertise anisotropic filtering support |
| - dzn: Advertise shaderImageGatherExtended support |
| - dzn: Enable dynamic indexing on all kind of descriptors |
| - dzn: Enable shader{Clip,Cull}Distance |
| - dzn: Enable the depthClamp feature |
| - nir/serialize: Support texop >= 16 |
| - nir/serialize: Silence integer-overflow false positive |
| - vulkan: Add a vk_pipeline_hash_shader_stage() helper |
| - anv: Use vk_pipeline_hash_shader_stage() |
| - v3dv: Fix nir_shader leaks in v3dv_meta_{clear,copy}() |
| - v3dv: Use vk_pipeline_hash_shader_stage() |
| - tu: Use vk_pipeline_hash_shader_stage() |
| - radv: Kill unused fs_m local var in radv_create_shaders() |
| - radv: Use vk_pipeline_hash_shader_stage() |
| - dzn: Hook-up device-lost detection |
| - dzn: Fix indentation |
| - dzn: Initialize UUIDs |
| - dzn: Drop dzn_pipeline_cache.c |
| - dzn: Add missing D3D12_CACHED_PIPELINE_STATE to MAX_COMPUTE_PIPELINE_STATE_STREAM_SIZE |
| - dzn: Fix assertion in d3d12_pipeline_state_stream_new_desc() |
| - dzn: Use d3d12_compute_pipeline_state_stream_new_desc() in the compute path |
| - dzn: Pass the maximum stream size to d3d12_pipeline_state_stream_new_desc() |
| - dzn: Drop useless while(link_mask != 0) |
| - dzn: Drop unused allocator passed to dzn_graphics_pipeline_create() |
| - dzn: Let dzn_pipeline_init() initialize the root signature in the stream |
| - dzn: Fix potential nir_shader leak |
| - dzn: Use vk_to_mesa_shader_stage() |
| - dzn: Save a few indentation levels in graphics_pipeline_compile_shaders() |
| - dzn: Don't delegate binding translation to dxil_spirv_nir_passes() |
| - dzn: Hash pipeline layout stages |
| - dzn: Add a generic cached blob wrapper |
| - dzn: Drop unneeded goto statement in dzn_compute_pipeline_create() |
| - dzn: Move the compute shader compilation logic to a sub-function |
| - dzn: Cache NIR shaders |
| - dzn: Caculate a binding translation hash |
| - dzn: Cache DXIL shaders |
| - dzn: Cache pipeline info |
| - ci/dzn: Add pipeline cache tests to the include list |
| - dzn: Cache and pack transition barriers |
| - dzn: Don't call dzn_cmd_buffer_queue_transition_barriers() prematurely |
| - dzn: Rename dzn_cmd_buffer_queue_image_subres_range_transition() |
| - dzn: Introduce dzn_cmd_buffer_queue_image_range_state_transition() |
| - dzn: Allow shaders to read from depth/stencil attachments |
| - dzn: Fix copies of block-based formats |
| - dzn: Fix unused var warnings |
| - dzn: Fix 'statement with no effect' warning |
| - dzn: Fix printf() format in dxgi_get_factory() error message |
| - dzn: Fix {back,front}_wr_uses_ref assignment |
| - dzn: Fix -Wswitch warnings |
| - dzn: Use the vk_descriptor_set_layout base struct |
| - dzn: Use the vk_pipeline_layout base struct |
| - dzn: Mark transition barriers as executed when we execute barriers |
| - dzn: Query D3D12_FEATURE_D3D12_OPTIONS3 features |
| - dzn: Provide a helper to check if 2 formats are compatible |
| - dzn: Support native image copies when formats are compatible |
| - dzn: Take VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT into account |
| |
| Boyuan Zhang (9): |
| |
| - radeonsi/vcn: add vcn 4.0 encode support |
| - radeonsi/vcn: add vcn 4.0 encode fw interface version |
| - radeonsi/vcn: add encode context ib for vcn 4.0 |
| - radeonsi/vcn: add session init ib for vcn 4.0 |
| - radeonsi/vcn: enable vcn 4.0 encode for gfx11 asic |
| - radeonsi/vcn: use calculated max hierarchy depth for hevc enc |
| - vl: add cabac init idc for h264 encode |
| - frontends/va: get entropy mode info for h264 encode |
| - radeonsi/vcn/enc: enable cabac entropy mode |
| |
| Brian Paul (101): |
| |
| - gallivm: assorted clean-ups in lp_bld_nir_aos.c |
| - gallivm: whitespace fixes in lp_bld_tgsi.h |
| - gallivm: asst. clean-ups in lp_bld_sample.h |
| - llvmpipe: asst. clean-ups in lp_linear_fastpath.c |
| - llvmpipe: asst. clean-ups in lp_setup.c |
| - llvmpipe: asst. clean-ups in lp_state_setup.c |
| - gallivm: asst. clean-ups in lp_bld_type.[ch] |
| - gallivm: cleanup lp_bld_arit.c |
| - gallivm: clean-up in lp_bld_nir.h |
| - gallivm: add const qualifier in lp_bld_nir_soa.c |
| - llvmpipe: asst. clean-up in lp_state_derived.c |
| - llvmpipe: asst. clean-ups in lp_rast_linear.c |
| - llvmpipe: fix comment typo in lp_linear_fastpath.c |
| - llvmpipe: asst. clean-ups in lp_rast.c |
| - llvmpipe: asst. clean-ups in lp_linear.c |
| - llvmpipe: asst. clean-ups in lp_scene.h |
| - llvmpipe: asst. clean-up and add comments in lp_state_fs.h |
| - gallivm: clean-up in lp_bld_sample.c |
| - llvmpipe: asst. clean-up in lp_setup_context.h |
| - gallivm: asst. clean-ups in lp_bld_sample_soa.c |
| - gallivm: clean-ups in lp_bld_tgsi.[ch] |
| - llvmpipe: asst. clean-ups in lp_state_fs.c |
| - llvmpipe: minor clean-ups in lp_jit.h |
| - llvmpipe: asst. clean-ups in lp_state_fs_linear_llvm.c |
| - llvmpipe: asst. clean-ups in lp_state_fs_linear.c |
| - llvmpipe: asst. clean-ups in lp_setup_tri.c, lp_setup_rect.c |
| - llvmpipe: whitespace clean-ups in lp_rast.h |
| - llvmpipe: minor whitespace/formatting cleanups in lp_state_setup.h |
| - llvmpipe: minor clean-ups in lp_state_fs_analysis.c |
| - llvmpipe: misc clean-up in lp_rast_tri.c |
| - llvmpipe: asst. clean-ups in lp_bld_blend_aos.c |
| - llvmpipe: asst. clean-ups in lp_rast_tri_tmp.h |
| - llvmpipe: asst. clean-ups in lp_rast_priv.h |
| - gallivm: minor clean-ups in lp_bld_sample_soa.c |
| - llvmpipe: misc clean-ups in lp_bld_interp.c |
| - gallivm: asst. clean-ups in lp_bld_nir.c |
| - llvmpipe: fix comment typo in lp_bld_interp.h |
| - llvmpipe: asst. clean-ups in lp_linear_sampler.c |
| - llvmpipe: asst. clean-ups in lp_tex_sample.c |
| - llvmpipe: asst. clean-ups in lp_scene.c |
| - llvmpipe: minor clean-up in lp_state_blend.c |
| - llvmpipe: use enum blend types instead of unsigned |
| - llvmpipe: s/unsigned/enum pipe_compare_func/ |
| - gallivm: asst. clean-ups in lp_bld_logic. |
| - llvmpipe: asst. clean-ups in lp_state_fs.c |
| - draw: s/unsigned/enum pipe_prim_type/ |
| - draw: asst. clean-up in draw_context.h |
| - draw: asst. clean-up in draw_gs.h |
| - draw: asst. clean-up in draw_pipe.c |
| - draw: asst. clean-up in draw_pipe_validate.c |
| - draw: asst. clean-up in draw_private.h |
| - draw: asst. clean-up in draw_pt.h |
| - draw: asst. clean-up in draw_pt.c |
| - draw: asst. clean-up in draw_pt_fetch_shader_emit.c |
| - draw: asst. clean-up in draw_pt_fetch_shader_pipeline.c |
| - draw: asst. clean-up in draw_pt_util.c |
| - draw: asst. clean-up in draw_pt_vsplit.c |
| - draw: asst. clean-up in draw_vbuf.h |
| - gallivm: use enum types instead of unsigned in lp_bld_sample.[ch] |
| - llvmpipe: add minor comments in lp_rast.h, lp_setup-rect.c |
| - llvmpipe: remove lp_rast_cmd_arg::state field |
| - gallivm: s/unsigned/enum pipe_swizzle/ |
| - llvmpipe: add simple assertion in generate_fragment() |
| - util/bitset: add BITSET_SIZE() |
| - lavapipe: rework code to compute textures_used, samplers_used |
| - llvmpipe: minor code re-org in lp_state_fs_analysis.c |
| - nir: add const qualifiers, move some decls in nir_to_tgsi_info.c |
| - llvmpipe: replace if/then with switch in llvmpipe_nir_fn_is_linear_compat() |
| - gallivm: s/0/LP_BLD_TEX_MODIFIER_NONE/ |
| - llvmpipe: replace GET_A0() macro w/ inline function |
| - llvmpipe: add missing tex_info->texture_unit assignment |
| - llvmpipe: fix invalid memory used in lp_fs_linear_run |
| - llvmpipe: fix texcoord analysis in llvmpipe_nir_fn_is_linear_compat() |
| - llvmpipe: replace LP_RAST_OP\_ #defines with enum type |
| - gallivm: increase LP_MAX_TGSI_SHADER_IMAGES from 16 to 32 |
| - lavapipe: fix logicop, independent blend enable/disable |
| - llvmpipe: don't allow texture/resource swizzles on linear path |
| - llvmpipe: initialize a local var to fix compiler warning in release build |
| - lavapipe: fix incorrect sv[] array size |
| - llvmpipe: fix texcoord analysis regression |
| - llvmpipe: asst. clean-up in lp_bld_depth.c |
| - llvmpipe: minor white-space clean-up in lp_setup.h |
| - llvmpipe: asst. clean-ups in lp_setup_rect.c |
| - gallivm: asst. clean-ups in lp_bld_const.c |
| - llvmpipe: asst. clean-ups in lp_linear_interp.c |
| - llvmpipe: minor optimization in lp_linear_init_interp() |
| - llvmpipe: asst. clean-ups in lp_state_setup.c |
| - llvmpipe: remove unneeded casts |
| - llvmpipe: replace 'unsigned' with enum types |
| - gallivm: asst. clean-ups in lp_bld_swizzle.c |
| - gallivm: simplify shuffling in emit_store_reg() |
| - gallivm: fix incorrect memset() in SOA emit_load_const() |
| - gallivm: rename 'var' param in emit_store_reg(), emit_store_var(). |
| - llvmpipe: make opaque field a 1-bit field |
| - llvmpipe: minor whitespace clean-up in lp_state_fs.h |
| - llvmpipe: pass frontfacing param to jit_function() call |
| - llvmpipe: asst. clean-up in lp_rast_linear_fallback.c |
| - llvmpipe: remove unneeded casts in lp_rast_linear_fallback.c |
| - llvmpipe: consolidate code in lp_rast_linear_fallback.c |
| - llvmpipe: tighten up some code in lp_rast_linear_fallback.c |
| - llvmpipe: add some const qualifiers in lp_rast_linear_fallback.c |
| |
| Caio Oliveira (1): |
| |
| - intel/compiler: In XeHP prefer <1;1,0> regions before compacting |
| |
| Chad Versace (2): |
| |
| - venus: Refactor vn_fix_graphics_pipeline_create_info |
| - venus: Don't encode ignored pTessellationState |
| |
| Charlie Turner (4): |
| |
| - ci: fix shellcheck violations in the test-vk building script |
| - ci: add Linux build of apitrace |
| - ci: upgrade gfxreconstruct build to v0.9.10 |
| - ci: upgrade to DXVK v1.10.1 |
| |
| Charmaine Lee (20): |
| |
| - gallium/util: Increase the debug_flush map depth to 64 |
| - draw: fix double free of NIR IR |
| - svga: fix shader IR type passed to draw create shader function |
| - svga/nir: enable PIPE_CAP_TGSI_TEXCOORD |
| - svga: enable PIPE_CAP_TGSI_TEXCOORD for vgpu10 and up |
| - svga/tgsi: add translation for DTRUNC |
| - svga/nir: enable lower double option |
| - svga: workaround for unexpected double swizzle |
| - svga: fix clip distance shadow copy writemask |
| - svga: add a helper function for common shader creation |
| - svga: create svga_shader_info |
| - svga: add svga_compile_shader helper function |
| - svga: add need_texcoord_semantic to tgsi_add_point_sprite & tgsi_add_aa_point |
| - svga: fix aa point |
| - svga: fix reference to NULL offset for atomic counter |
| - svga: fix shared memory index |
| - svga: fix mksstats build |
| - mesa/st: fix reference to nir->info after nir_to_tgsi |
| - svga: support TGSI_SEMANTIC_TEXCOORD in swtnl draw context |
| - svga: fix invalid component access of domain location |
| |
| Chia-I Wu (61): |
| |
| - turnip: fix drm modifier support with planar formats |
| - util: Keep quiet NaNs quiet when converting to half float. |
| - anv: advertise rectangularLines only for Gen10+ |
| - turnip: let modifier takes precedence over TU_DEBUG=noubwc |
| - freedreno/fdperf: make refresh rate configurable |
| - freedreno/fdperf: support dumping counters |
| - turnip: fix sampledImageIntegerSampleCounts |
| - turnip: emit VPC_SO_DISABLE in xfb begin/end |
| - ir3: fix mem_ctx for ir3_disasm_info::nir |
| - turnip: fix off-by-one in border color bitset |
| - turnip: fix tu6_pack_border_color for z24 |
| - turnip: disable UBWC for SNORM formats |
| - turnip: if-checks autotune debug macros |
| - turnip: base pipeline draw costs on colorWriteMask |
| - turnip: include stencil test for drawcall costs |
| - turnip: estimate render pass costs |
| - turnip: consider render pass costs in autotune |
| - ci/turnip: add a link to VkPhysicalDeviceMemoryProperties failure |
| - turnip: force sample interpolations for sample shading |
| - turnip: advertise VK_EXT_border_color_swizzle |
| - freedreno/fdl: use XYZW swap for PIPE_FORMAT_X24S8_UINT |
| - vulkan/device_select: remove get_pdevice_proc_addr |
| - freedreno/drm-shim: add a660 |
| - turnip: fix an assertion with drm-shim |
| - ir3: update ir3_const_state comment |
| - ir3: copy req_local_mem for MESA_SHADER_KERNEL |
| - ir3: fix output_loc size |
| - ir3: remove unused patch_vertices_in |
| - ir3: increment constoff right after it is assigned |
| - ir3: fix tess param allocation |
| - ir3: set UL flag before ir3_lower_subgroups |
| - turnip: remove shared_consts from tu_compiled_shaders |
| - turnip: shared_consts and push_consts are mutually exclusive |
| - turnip: add tu_common.h as the common header |
| - turnip: remove includes that are already in tu_common.h |
| - turnip: add tu_drm.h |
| - turnip: add tu_suballoc.h |
| - turnip: update tu_cs.h |
| - turnip: add tu_query.h |
| - turnip: add tu_image.h |
| - turnip: add tu_formats.h |
| - turnip: update tu_descriptor_set.h |
| - turnip: add tu_shader.h |
| - turnip: add tu_pipeline.h |
| - turnip: add tu_clear_blit.h |
| - turnip: add tu_dynamic_rendering.h |
| - turnip: add tu_lrz.h |
| - turnip: add tu_pass.h |
| - turnip: add tu_wsi.h |
| - turnip: update tu_autotune.h |
| - turnip: add tu_device.h |
| - turnip: add tu_cmd_buffer.h |
| - turnip: add tu_android.h |
| - turnip: update tu_util.h |
| - turnip: move away from tu_private.h |
| - turnip: remove tu_private.h |
| - turnip: remove headers from libtu_files |
| - turnip: use SPDX-License-Identifier |
| - turnip: fix a use-after-free in autotune |
| - turnip: lower the queue priority to 1 |
| - ir3: fix predicate splitting in scheduler |
| |
| Christian Gmeiner (12): |
| |
| - isaspec: Extend split_bits(..) to accept a bitsize |
| - isaspec: Handle patterns bigger then 64 bit |
| - nir: Fix unused-variable compile warnings |
| - freedreno: Wait for async compile jobs to finish |
| - ir3: Create at least one thread for async shader compile |
| - gtest: Fix maybe-uninitialized compiler warning |
| - r600: Switch to unreachable(..) |
| - svga: Clear query before usage |
| - ci: Remove -Wno-error=maybe-uninitialized |
| - etnaviv: factor out shader screen related init/deint |
| - etnaviv: async shader compile |
| - etnaviv: Move nir_shader_compiler_options to compiler |
| |
| Chuansheng Liu (1): |
| |
| - iris,anv: correct the max thread number for DG2+ |
| |
| Clément Guérin (1): |
| |
| - venus: implement VK_EXT_physical_device_drm |
| |
| Connor Abbott (69): |
| |
| - tu: Delete unused tu_clear_blit GS handling |
| - ir3, freedreno, tu: Make ir3_shader_variant store stream_output |
| - ir3, fd, tu: Copy misc. info from ir3_shader to ir3_shader_variant |
| - ir3: Use ir3_shader_variant::type more often |
| - ir3: Allocate disasm_info under variant |
| - ir3: Add ir3_shader_variant::compiler |
| - tu: Keep original blit shaders separately |
| - ir3: Remove ir3_shader_variant::shader |
| - ir3: Add functions to serialize variants |
| - ir3: Support disabling the pipeline cache |
| - ir3: Add ir3_shader_create_variant() |
| - tu: Hash pipeline layout contents |
| - tu: Rewrite cache UUID based on radv |
| - tu: Include turnip debug flags in pipeline cache UUID |
| - tu: Implement pipeline caching with shared Vulkan cache |
| - tu: Zero-initialize compute driver key |
| - tu: Implement VK_EXT_pipeline_creation_feedback |
| - tu: Implement VK_EXT_pipeline_creation_cache_control |
| - tu: Convert to sync2 entrypoints |
| - tu: Expose VK_KHR_synchronization2 |
| - tu: Implement VK_EXT_image_2d_view_of_3d |
| - tu: Enable shaderTessellationAndGeometryPointSize |
| - tu: Enable VK_KHR_swapchain_mutable_format |
| - tu: Fix partial-write LRZ determination |
| - tu: Implement extendedDynamicState2LogicOp |
| - tu: Implement VK_EXT_color_write_enable |
| - tu: Enable uniform texture buffers with NPOT formats |
| - freedreno/a6xx: Support more 3-component formats |
| - tu: Fix 3d GMEM store swizzle in texture descriptor |
| - tu: Add missing WFI to the 3d GMEM store path |
| - tu: Save/restore \*_BIN_CONTROL in 3d GMEM store path |
| - ir3/sched: Fix could_sched() determination |
| - tu: Fix linemode for tessellation with isolines |
| - ir3: Fix vectorizer condition for SSBOs |
| - freedreno: Document a650-specific CP_REG_WRITE flag |
| - freedreno/a6xx: Document border color/swizzle interaction |
| - freedreno/a6xx: Fix border color swizzling |
| - freedreno/a6xx: Use fdl format swizzle |
| - freedreno/fdl: Note border color constraints |
| - freedreno/a6xx: Fix border color with Z24UINT_S8UINT |
| - tu: Fix stencil border color with has_z24uint_s8uint |
| - mesa/st: Fix border color type for stencil sampling |
| - nv50/ir/ra: Fix copying compound for moves |
| - freedreno/a6xx: VPC_SO_NCOMP is actually VPC_SO_BUFFER_STRIDE |
| - tu: Use incoherent CCU write for buffer accesses |
| - tu: Support resolving D24S8 -> S8 |
| - tu: Fix resolving d32s8 into s8 on fast path |
| - tu/lrz: Do not use framebuffer when inheriting LRZ |
| - tu: Treat CP_WAIT_FOR_ME as a cache invalidate |
| - tu: Rewrite tess factor emission |
| - tu: Move TU_DONT_CARE_AS_LOAD into attachment_set_ops() |
| - tu: Disable GMEM for multiview inside tu_render_pass_gmem_config |
| - tu: Move tu_render_pass definition up |
| - tu: Remove usage of RenderPassBeginInfo |
| - tu: Initial support for dynamic rendering |
| - tu: Remove has_subpass_predication |
| - tu: Update more state with secondaries |
| - tu: Split out some state into a separate struct |
| - util/trace: Fix u_trace_*_iterator with no tracepoints |
| - tu: Add support for suspending and resuming renderpasses |
| - tu: Use common vk_image struct |
| - tu: Use common vk_image_view base struct |
| - tu: Add debug option to use emulated renderpass support |
| - tu: Restore formatting of tu_clear_blit.c |
| - tu: Fix sysmem depth attachment clear flushing |
| - ir3/spill: Fix extracting from a vector at the end of a block |
| - tu/lrz: Fix multiple subpass case with secondaries |
| - tu/lrz: Fix multiple depth attachment case with secondaries |
| - tu: Don't preload variable-count descriptors |
| |
| Constantine Shablya (1): |
| |
| - anv: set image_read_without_format NIR option on Vulkan 1.3 |
| |
| Corentin Noël (5): |
| |
| - mesa: Make sure to fallback to handling the original choose texture format |
| - virgl/ci: Update virglrenderer |
| - virgl/ci: Update virglrenderer |
| - virgl: Add support for passing the supported number of components from virgl |
| - virgl: Only progagate the uniform numbers if the numbers are actually right |
| |
| Cristian Ciocaltea (4): |
| |
| - virgl/ci: Rename virgl-lava-* jobs to virgl-iris-* |
| - virgl/ci: Update checksums for some virgl iris traces |
| - ci: Add CPU frequency adjustment capability |
| - ci: Limit Intel CPU scaling frequency for performance tests |
| |
| Daniel Schürmann (36): |
| |
| - nir/lower_alu_to_scalar: don't set the nir_builder cursor |
| - aco: remove 'max_waves' and use 'num_waves' to adjust for LDS and workgroup size |
| - aco: split num_waves adjustment into separate function |
| - aco: adjust num_waves for LDS before scheduling |
| - aco/optimizer: prevent any overflow between SGPR and const offset on MUBUF |
| - aco/ra: fix live-range splits of phi definitions |
| - aco: don't re-create vectors for load_barycentric_* intrinsics |
| - aco/ra: improve split_vector register assignment if the operand is not killed |
| - aco/optimizer: re-combine and copy-propagate p_create_vector(p_split_vector) |
| - radv: move two NIR passes out of tight optimization loop |
| - radv: call nir_lower_vars_to_ssa later |
| - aco: fix spilling of phis without temp operands |
| - nir/opt_vectorize: add callback for max vectorization width |
| - nir: introduce new nir_alu_alu_width() with nir_vectorize_cb callback |
| - nir/opt_vectorize: refactor src rewriting to avoid unnecessary mov instructions |
| - aco: correctly validate v_fma_mixhi_f16 register assignment |
| - radv: don't lower vectorized instructions to 32bit |
| - radv: use callback for nir_lower_to_scalar |
| - radv: vectorize nir_op_fdiv & nir_op_flrp & nir_op_ineg |
| - radv: vectorize nir_op_fabs |
| - radv: fix swizzles after nir_opt_algebraic_late |
| - radv: reorder some NIR passes |
| - nir/opt_shrink_vectors: fix re-using of components for vecN |
| - radv/shader_info: fix load_frag_coord and load_sample_pos read masks |
| - aco: fix packed 16bit fneg/fsat optimization |
| - aco: avoid unnecessary copies in emit_wqm() |
| - aco: Avoid live-range splits in Exact mode |
| - aco: fix assertion in insert_exec_mask |
| - spirv: handle phis decorated with RelaxedPrecision |
| - radv: Only create noop-fs for graphics pipelines |
| - radv: create RT traversal as separate shader |
| - radv/rt: use derefs for the traversal stack |
| - radv/rt: use stage ID as handle for general and closestHit shaders |
| - radv/rt: Don't load ClosestHit SBT on every hit, but only once after traversal |
| - radv/rt: fix nir_builder cursor in lower_rt_instructions() |
| - radv/rt: simplify lower_rt_instructions() |
| |
| Daniel Stone (10): |
| |
| - dzn/ci: Don't spam conformance warnings |
| - CI: Disable Windows jobs |
| - ci: Allow lima jobs to be globally disabled |
| - ci: Also disable Windows container builds when down |
| - ci/windows: Eliminate duplicate script definition |
| - ci/zink: Add glx-tfp to skips |
| - ci/lava: Sleep before, not after, API calls |
| - ci: freedreno farm is down |
| - CI: Disable Collabora lab |
| - Revert "CI: Disable Collabora lab" |
| |
| Danylo Piliaiev (53): |
| |
| - turnip: Try harder to keep LRZ valid and fix a few edge cases |
| - freedreno/a6xx: Add UNK fields to CP_REG_TEST and CP_COND_REG_EXEC |
| - turnip: Skip load/stores for tiles with no geometry |
| - turnip: Add debug option to print gmem load/store skip stats |
| - turnip: Fix tu_debug_flags values clashing |
| - tu: Implement VK_EXT_image_view_min_lod |
| - tu: Do not flush ccu in clear/blits during renderpass |
| - pps: Open writable renderer node in DrmDevice::create |
| - tu: Fix indices of drm_msm_gem_submit_cmd when filling them |
| - freedreno/a6xx: Add EARLYPREAMBLE flag to all a6xx_sp_xs_ctrl_reg0 |
| - util/u_trace: Add json output |
| - docs/u_trace: document u_trace usage |
| - ir3: handle gl_Layer and gl_ViewportIndex when there is TES + GS |
| - ir3: Force late-z if FS has global store/atomic |
| - tu: Fix D32S8 resolves into D32 or S8 in sysmem mode |
| - spirv: Workaround for RelaxedPrecision on OpLogical* in 3DMark |
| - turnip: Allow nested CP_COND_REG_EXEC |
| - turnip: Fix out-of-bounds builder->shader_iova access |
| - tu/autotune: Prevent division by zero |
| - turnip: copy disasm strings for pipeline executables |
| - tu: Disable sample counting for 3d blits during occlusion query |
| - tu: Reset xfb_used at the end of a renderpass |
| - meson: Define _GNU_SOURCE for android host system |
| - meson/pps: Check if libdrm exists to compile pps |
| - meson/tu: Don't compile libdrm paths if KGSL is selected |
| - tu: Do not expose storage image/buffer features for PACK16 formats |
| - tu: Use hw binning or sysmem with QUERY_TYPE_PRIMITIVES_GENERATED |
| - tu: Don't count 3d blits in QUERY_TYPE_PRIMITIVES_GENERATED |
| - freedreno: Document rest of GRAS_LRZ_CNTL, clarify UNK_25 event |
| - freedreno: document GRAS_UNKNOWN_810A |
| - tu: Overhaul LRZ, implement on-GPU dir tracking and LRZ fast-clear |
| - ir3: Use NIR's info.writes_memory to detect when when to force late-z |
| - tu,freedreno: Refactored START/STOP events for pipeline stats |
| - tu: Fix prim gen query and pipeline stats query interaction |
| - zink: re-enable EXT_primitives_generated_query for Turnip |
| - util/u_trace: Fix iteration over config_control |
| - turnip: Refactor tracepoints generation to reduce duplication |
| - turnip: Add TU_GPU_TRACEPOINT envvar to toggle tracepoints |
| - freedreno: Add the rest of tracepoints with start/end to perfetto |
| - freedreno: Refactor tracepoints generation to reduce duplication |
| - freedreno: Add FD_GPU_TRACEPOINT envvar to toggle tracepoints |
| - docs/u_trace: Add documentation for \*_GPU_TRACEPOINT envvars |
| - turnip: Disable LRZ fast-clear for gen1 and gen2 |
| - tu: Do not dereference descriptorSetLayout in push descriptors tmpl |
| - zink/turnip: Add comments to already triaged failures for tu baseline |
| - turnip: Simple breadcrumbs implementation to debug hangs |
| - tu: WFI after PC_TESSFACTOR_ADDR update |
| - freedreno: WFI after PC_TESSFACTOR_ADDR update |
| - ir3/ra: Always insert interval for precolored inputs |
| - ir3: Never remove GS_HEADER_IR3 sysval input |
| - tu: Disable LRZ write when alpha-to-coverage is enabled |
| - freedreno: Disable LRZ write when alpha-to-coverage is enabled |
| - ir3: Prevent reordering movmsk with kill |
| |
| Dave Airlie (82): |
| |
| - clc: add simple llvm initialise API |
| - clc: initialise one more llvm stage |
| - u_blitter/stencil: take dstbox x/y into accounts for dst fb width |
| - util/stencil: fix stencil fallback blit shader texture types. |
| - llvmpipe: add user memory resources to the debug global list. |
| - gallium/vl: wrap codec support checks in a common function. |
| - gallium/omx: add video codec supported hook for decode paths. |
| - meson: add a video codec support option |
| - gallium/vl: respect the video codecs configure in meson |
| - vulkan/wsi: keep allocate queue families in image, just don't fill them |
| - gallivm: move to new pass manager to handle coroutines change. |
| - radv: precalculate hs offchip parameters. |
| - radv: precalculate tess ring sizes/offsets. |
| - radv/ac: introduce a new common function for hs calcs. |
| - ac: port radeonsi tess factor calculations to common code. |
| - radeonsi: port tess ring calcs to the common helper. |
| - zink: avoid extra submits for empty wait cmdbufs. |
| - zink: don't finish barriers cmd buffer if not used. |
| - radv: cache the slab bo pointer in the pipeline. |
| - radv: only dirty line stipple when necessary. |
| - radv: optimised command buffer reset of vertex bindings. |
| - aco: move info pointer to a copy. |
| - aco: move to a minimal aco shader info struct. |
| - aco/info: reduce the gs ring info to what is needed. |
| - aco: remove radv specific streamout info |
| - aco: convert vs and so info over to aco structs. |
| - aco/info: add some more fields. |
| - aco/radv: convert to aco shader info at the radv level. |
| - aco: remove radv vs prolog key from aco internals. |
| - aco: drop unused radv include |
| - ac/radv: drop info pointer from the ac and radv shader structs |
| - aco/radv: drop radv_nir_compiler_options from aco. |
| - llvmpipe: align scratch size to 64-bit size. |
| - meson: add build-id to pipe libraries |
| - vl: fix codec checks to disable properly |
| - lavapipe: fix depth bias offset flag enables. |
| - llvmpipe: flush resources for kms swrast path. |
| - llvmpipe: handle timespec overflow on fence waits. |
| - nouveau: move codegen into a standalone library. |
| - nouveau: move codegen to a common higher level directory. |
| - nouveau/codegen: drop all ubytes from codegen. |
| - nouveau/codegen: drop gallium headers from the interface. |
| - llvmpipe: disable alpha test branch if multisample is enabled. |
| - egl/x11: split large put image requests to avoid server destroy |
| - egl/x11: add missing put_image cookie cleanups |
| - wsi/x11: add xcb_put_image support for larger transfers. |
| - glx/drisw: use xcb instead of X to query connection |
| - draw/cliptest: add support for guard band + full Z |
| - zink: workaround depth sampler border color when z24 is z32 |
| - aco: refactor the radv binary builder out of the core aco fn. |
| - aco/radv: provide a callback from aco shader building to build binary |
| - aco/radv: provide a vs prolog callback from aco to radv. |
| - aco: drop radv_shader.h include |
| - radv: add a dynamic vertex format cache. |
| - lavapipe: drop unreachable pNext checks. |
| - lavapipe: enable variablePointers |
| - kms/dri: add mutex lock around map/unmap |
| - crocus: fail query begin if upload allocation fails. |
| - lavapipe: state latest conformance tests passed |
| - llvmpipe: fix aniso cube map arrays. |
| - crocus: fix leak in query code. |
| - llvmpipe: make last_fence a screen/rast object not a context one. |
| - llvmpipe: keep context list and use to track resource usage. |
| - gallivm: use progress from subgroup lowering. |
| - gallivm: fix casting around read invocation. |
| - llvmpipe: export ARB_shader_ballot |
| - lavapipe: expose some subgroup extensions |
| - zink: fixup subgroup vote/ballot enables. |
| - zink/llvmpipe/ci: skip broken CTS ballot test. |
| - util: add reallocarray wrapper for win32 |
| - llvmpipe: Allocate scene tiles dynamically |
| - gallium: add a rasterizer state bit for unrestricted depth values. |
| - mesa/st: disable unrestricted fragment depth values for GL/GLES |
| - util/pack_color: clamp depth values outside range for unorm formats. |
| - llvmpipe/fs: handle unrestricted depth values. |
| - lavapipe: enable VK_EXT_depth_range_unrestricted. |
| - util/reallocarray: add errno.h include. |
| - lavapipe: enable VK_FORMAT_R5G5B5A1_UNORM_PACK16 |
| - gallivm: fix printf hook for cached shaders. |
| - draw: don't touch info values that aren't valid. |
| - nir_to_tgsi_info: drop const_buffers_declared |
| - llvmpipe: finish rendering before flushing frontbuffer resources. |
| |
| David Heidelberg (58): |
| |
| - ci: skqp: patch skqp to report also GL results |
| - ci: skqp: upgrade to skqp based on Android CTS 11.0 (r7) |
| - ci: skqp: update URL |
| - ci: intel: Merge anv and iris into src/intel/ci |
| - ci/iris: Enable SKQP on Tiger Lake boards |
| - ci: traces: switch to Valve trimmed traces |
| - ci: traces: re-enable disabled traces which are now trimmed traces |
| - ci: traces: virgl: reenable previously crashing Godot trace |
| - ci: uprev piglit 2022-05-10 |
| - ci/iris: skqp: add default GLES rendertests for TGL |
| - ci/iris: skqp: remove flaking atlastext for TGL |
| - ci/virgl: fix checksum for valve traces which run on iris |
| - ci/freedreno: enable ROR and Nheko traces |
| - ci/i915: add entries for RoR and Nheko traces |
| - ci/intel: add RoR and Nheko traces and reenable most of Valve traces |
| - ci/crocus: add RoR and Nheko traces |
| - ci/llvmpipe: add RoR and Nheko traces |
| - ci/radeonsi: add RoR and Nheko traces |
| - ci/virgl: traces: add RoR and Nheko traces |
| - ci/panfrost: add RoR and Nheko traces |
| - ci/iris: skqp: remove flaking atlastext for TGL (gl version) |
| - ci: traces: temporarily disable nheko trace |
| - ci/virgl: traces: temporarily disable nheko trace |
| - ci: uprev piglit 2022-06-09 |
| - ci/etnaviv: sort fails.txt |
| - ci/etnaviv: add recent fails and remove 3 unexpected passes |
| - ci/etnaviv: disable tests which takes too long |
| - ci/etnaviv: add recent flakes for gc2000 |
| - ci/etnaviv: gc2000: drop flakes from fails |
| - ci/freedreno: add Blender, Warzone2100, Freedoom and Unvanquished traces |
| - ci/freedreno: fix A530 glmark2@ideas:speed=10000 trace |
| - ci/llvmpipe: add Blender, Warzone2100, Freedoom and Unvanquished traces |
| - ci/iris: add Blender, Warzone2100, Freedoom and Unvanquished traces |
| - ci/virgl: add Blender, Warzone2100, Freedoom and Unvanquished traces |
| - ci/virgl: iris: disable Counter Strike 1.6 trace flaking |
| - ci/crocus: add Blender, Warzone2100, Freedoom and Unvanquished traces |
| - ci/panfrost: add Blender, Warzone2100, Freedoom and Unvanquished traces |
| - ci/iris: disable Blender Diorama trace (incompatible with perf. traces) |
| - ci/freedreno: update restricted traces to the brotli compressed version |
| - ci/freedreno: smuggle myself into restricted traces access list |
| - ci/freedreno: disable non-existing trace |
| - ci: Uprev apitrace to 790380e05854 |
| - ci: traces: switch to brotli compressed traces |
| - ci/freedreno: add more restricted traces |
| - ci/freedreno: temporary disable AmnesiaTDD |
| - ci/freedreno: disable Stellaris trace |
| - ci/freedreno: disable SpecOps trace, each run flaky |
| - Revert "ci/freedreno: temporary disable AmnesiaTDD" |
| - ci/traces: piglit, be more verbose |
| - docs: briefly document Gallium Nine |
| - ci: add script for manage Mesa CI |
| - ci: Allow disabling the whole of the Igalia farm |
| - ci: move apitrace instalation outside of x86_test-vk |
| - ci: wine-apitrace bump to 11.1 |
| - ci/freedreno: 3 pixel change in Raven restricted trace |
| - nine: char can be unsigned on non-x86 platforms |
| - mailmap: fix order of emails for Axel Davy |
| - ci: set reasonable amount of wget retries for each download |
| |
| Dawn Han (6): |
| |
| - Add iub features and properties passthrough |
| - Refactor the descriptor enums to be extensible |
| - Add iub type to vn_descriptor_type Verified the pNext VkWriteDescriptorSet is not NULL on iub type |
| - Add the iub binding count tracking |
| - Refactor and add template support for iub |
| - venus: enabled iub in physical device |
| |
| Denis Pauk (1): |
| |
| - panvk: Return VK_ERROR_INCOMPATIBLE_DRIVER for Midgard |
| |
| Dmitry Osipenko (1): |
| |
| - virgl: Fix unmapping of blob resources |
| |
| Dylan Baker (74): |
| |
| - VERSION: bump to 22.2-devel for next cycle |
| - docs: truncate new_features.txt |
| - docs: add release notes for 22.0.2 |
| - docs: Add sha256sums for 22.0.2 |
| - docs: update calendar and link releases notes for 22.0.2 |
| - docs: update calendar for 22.1.0-rc1 |
| - docs: update calendar for 22.1.0-rc2 |
| - winsys/radeon: remove unused simple_list.h |
| - r300: remove unused simple_list.h |
| - gallium/rbug: replace simple_list.h with list.h |
| - gallium/driver_trace: remove simple_list.h |
| - util/list.h: add a function to move an item in a list |
| - gallium/util/u_cache.c: remove trailing whitespace |
| - gallium/util/u_cache: replace simple_list.h with list.h |
| - gallium/draw: remove trailing whitespace |
| - gallium/draw: use list.h instead of simple_list.h |
| - llvmpipe: cleanup trailing whitespace |
| - llvmpipe: replace uses of simple_list.h with list.h |
| - util: remove simple_list |
| - docs: Extend calendar entries for 22.0 by 1 releases. |
| - docs: update calendar and link releases notes for 22.0.3 |
| - docs: Extend calendar entries for 22.1 by 1 releases. |
| - docs: update calendar for 22.1.0-rc3 |
| - docs: update calendar for 22.1.0-rc4 |
| - docs: add release notes for 22.0.3 |
| - docs: Add sh256sum for mesa 20.0.3 |
| - docs: update calendar and link releases notes for 22.1.0 |
| - docs: add release notes for 22.1.0 |
| - relnotes: Add sha256sum and fix minor formatting issues |
| - docs: Add calendar entries for 22.1 release. |
| - docs: add release notes for 22.0.4 |
| - docs: add sha256sum to 22.0.4 notes |
| - docs: Extend calendar entries for 22.0 by 1 releases. |
| - docs: update calendar and link releases notes for 22.0.4 |
| - docs: add release notes for 22.1.1 |
| - docs: Add sh256sums for 22.1.1 |
| - docs: update calendar and link releases notes for 22.1.1 |
| - docs: add release notes for 22.0.5 |
| - docs: update sha256 for 22.0.5 |
| - docs: update calendar and link releases notes for 22.0.5 |
| - docs: add release notes for 21.3.9 |
| - docs Add sha256 sums for 21.3.9 |
| - docs: link releases notes for 21.3.9 |
| - docs: add release notes for 22.1.2 |
| - docs: add sha256sum to 22.1.2 notes |
| - docs: update calendar and link releases notes for 22.1.2 |
| - docs: drop 22.1.4 calendar entry. |
| - docs: Add calendar entries for 22.2 release candidates. |
| - docs: Extend calendar entries for 22.1 by 3 releases. |
| - docs: add release notes for 22.1.3 |
| - docs: add sah256sum for mesa 22.1.3 |
| - docs: update calendar and link releases notes for 22.1.3 |
| - docs: Bump 22.2 branchpoint by two weeks |
| - anv: add gfx version 12.5 flushes to CCS path |
| - iris|anv: gfx version 12.5 data cache flush is not a workaround |
| - VERSION: bump 22.3.0-devel |
| - VERSION: bump for 22.2.0-rc1 |
| - .pick_status.json: Update to 8e6bdb2ed31f27c28d258859ec48e8e67f51ff41 |
| - .pick_status.json: Update to 0a0205f04552ab21971c431a5f8bb7486d664422 |
| - .pick_status.json: Update to 1a3b086b06c6c1ff63a4abe3752ca0a7a854dc77 |
| - .pick_status.json: Update to c67e60ae8fefaeadbe06bf0cbd8ad2ee267a6c0e |
| - .pick_status.json: Update to 70891edd9728e7dd73115171fdb03622ce6450ba |
| - VERSION: update to 22.2.0 |
| - Revert "VERSION: update to 22.2.0" |
| - VERSION: bump to 22.2.0-rc2 |
| - .pick_status.json: Update to a3bf0da1cbd4b10043c80bf44609a3024b5fcc36 |
| - .pick_status.json: Update to 24b9ad7cd5ebc7cfa5d03cf0f243ea4841c971b9 |
| - .pick_status.json: Update to 74fc367127ccf945f4c649dd6ddff955c802e36e |
| - .pick_status.json: Mark 11ab6087797f805cf158048915c67945613c9a72 as denominated |
| - VERSION: bump to 22.2.0-rc3 |
| - .pick_status.json: Update to 0c6fbfca0c91ef012e8ab767a317c07f1f6dc5e6 |
| - .pick_status.json: Update to 8eac45b27446cd9b9eaeb147af97fff1e09832cb |
| - .pick_status.json: Update to baf24dea943202b3a92cad0c9f9648597040955a |
| - .pick_status.json: Update to 3d4c36a3bcc51ed441b2667d92291bea30ef7449 |
| |
| Eli Schwartz (1): |
| |
| - meson: add various generated header dependencies as order-only deps |
| |
| Ella Stanforth (1): |
| |
| - v3dv: Implement VK_KHR_performance_query |
| |
| Emil Velikov (1): |
| |
| - c11: reinstate the original license and authorship |
| |
| Emma Anholt (232): |
| |
| - nouveau/nir: Don't try to emit OP_FMA pre-nvc0. |
| - Revert "ci: Disable Google's lab" |
| - ci/virgl: Add piglit and GL4.3 testing to the virpipe path. |
| - nir: Add lowering for fround_even on r300. |
| - nir: Skip fround_even on already-integral values. |
| - nir: Avoid generating extra ftruncs for array handling. |
| - turnip: Add TU_DEBUG=layout for dumping image layouts. |
| - freedreno/ir3: Make sched nodes before adding deps. |
| - turnip: Add nir_opt_conditional_discard. |
| - freedreno/ir3: Call nir_opt_find_array_copies(). |
| - turnip: Ignore TOP/BOTTOM_OF_PIPE bits in subpass src/dst dep flags. |
| - turnip: Don't disable LRZ in subpasses after the first in the easy case. |
| - nir_to_tgsi: Add a workaround for virglrenderer TG4. |
| - nir_to_tgsi: Fix assertion failures handling 64-bit vec3/vec4 ssa undefs. |
| - nir_to_tgsi: Don't forget to split 64-bit store_per_vertex_output. |
| - nir_to_tgsi: Avoid swizzling from undefined channels in load_output. |
| - nir_to_tgsi: Allocate the primid sysval to num_inputs, not num_outputs. |
| - tgsi: Emit ureg HW_ATOMIC decls in range order. |
| - tgsi_to_nir: Emit load_ubo_vec4 instead of load_ubo on non-integer HW. |
| - nine: Disable optional use of TTN when MUL_ZERO_WINS is available. |
| - r600: Fix up some mis-indentation of blocks. |
| - ci/r600: Manual run updates. |
| - r600: Fix reading back from a temp array immediately after writing on RV770. |
| - r600/sb: Avoid causing an exception when getting the reciprocal of 0u. |
| - r600: Use nir-to-tgsi instead of TGSI when the NIR debug opt is disabled. |
| - nouveau/nv50: Print the number of loops in shader-db output. |
| - ci/nouveau: Add expectations files for GM206. |
| - ci/nouveau: Add nv92 xfails. |
| - nouveau/nir: Disable bitfield ops pre-nvc0. |
| - nouveau/nv50: Enable mesa/st alpha test lowering on nv50 with NIR. |
| - nouveau/nv50: Set the primid sysval flag if it's in the sysval list, too. |
| - nouveau/nir: Fix the inverted sense of usesSampleMaskIn. |
| - nouveau/nir: Fix edgeflag input detection. |
| - nouveau/nir: Set the input for vertex/instance ID like TGSI does. |
| - ci/dzn: Cut the runtime by running 1/3 of the enabled tests. |
| - nir_to_tgsi: Make vec_to_movs avoid unsupported coalescing for 64-bit. |
| - nir_to_tgsi: Lower FS input array indexing since we don't declare input arrays. |
| - tgsi: Extend array lengths when merging usage_masks. |
| - virgl: Switch to nir-to-tgsi by default. |
| - nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional. |
| - nir_to_tgsi: Drop the txl(lod=0)->tex nir_lower_tex() fixup. |
| - nouveau/nir: Move FS output stores to the end of the last block. |
| - ci/nouveau: Add MESA_GLES_VERSION_OVERRIDE=3.1 baseline state. |
| - nouveau/nir: Add support for pre-GF100 images and ssbos. |
| - nouveau/nir: Put the UBO offset indirect into the address reg. |
| - nouveau: Enable the NIR backend by default. |
| - util: Add some unit tests of the half-float conversions. |
| - nir: Eliminate out-of-bounds read/writes in local lowering. |
| - nir: Don't assert on tg4 offset range. |
| - svga: Add support for requesting NIR and translating to TGSI. |
| - ci/iris: Demote APL deqp to manual-only for now. |
| - ci/iris: Add a bunch of APL and KBL flakes recently. |
| - mesa/st: Always generate NIR from GLSL, and use nir_to_tgsi for TGSI drivers. |
| - mesa/st: Remove st_glsl_to_tgsi. |
| - mesa/st: Remove TGSI shader program variant handling. |
| - mesa/st: Drop the attrib remapping in st_prepare_vertex_program(). |
| - mesa/st: Remove TGSI-only shader lowering code. |
| - tgsi_scan: Remove unused input_array_last/output_array_last. |
| - glsl: Remove UBO reference lowering. |
| - mesa: Remove unused Mesa IR PROGRAM_* register files. |
| - gallium/tgsi: Remove tgsi_emulate. |
| - mesa/st: Remove now unused TGSI paths from disk cache. |
| - glsl: Retire the non-NIR GLSL linking paths. |
| - gallium: Remove now-unused shader caps. |
| - glsl: Remove the unused lower_if_to_cond_assign. |
| - glsl: Drop the dead MOD_TO_FLOOR path. |
| - mesa/st: Clean up a bit of st_prepare_vertex_program(). |
| - compiler/glsl: Remove the dead parts of build_program_resource_list(). |
| - glsl: Remove EmitNoLoops and the associated lower_jumps(lower_break=true) code. |
| - glsl: Remove unused lower_variable_index_to_cond_assign. |
| - svga: Set lower_bitops for vpu9 screen. |
| - glsl: Stop lowering ir_quadop_vector. |
| - nouveau: disable fences when running under drm-shim. |
| - Revert "ci: remove nouveau from shader-db runs" |
| - ci/iris: Cut the glk-deqp test coverage in half. |
| - glsl: Make all drivers take the GLSLOptimizeConservatively path. |
| - turnip: Add a TU_DEBUG=perf debug option. |
| - ci/crocus: Manual CI updates after CI was down for a bit. |
| - ci/crocus: Merge the piglit runs with the deqp runs. |
| - i915g/ci: Add depth-clear-precision-check xfails like everyone else. |
| - ci/crosvm: Simplify the CID setup. |
| - ci/crosvm: Rename VSOCK_TEMP_DIR -> VM_TEMP_DIR. |
| - ci/crosvm: Terminate the previous crosvm after a deqp-runner timeout. |
| - freedreno/ir3: Fix 16-bit bit_count. |
| - spirv: Use nir_vec_scalars() to simplify matrix transpose. |
| - spirv_to_nir: Cast RelaxedPrecision ALU op dests to mediump. |
| - turnip: Make RelaxedPrecision-decorated ALU ops 16-bit. |
| - freedreno/ir3: Add support for 16-bit nir_texop_lod. |
| - freedreno/ir3: Lower texture instructions used only for f2f16 to 16-bit. |
| - mesa/st: Only scalarize for doubles lowering if we're lowering doubles. |
| - glsl,nir: Move i/umulExtended lowering to NIR. |
| - panfrost: always print the bad ALU op if we're failing to translate. |
| - ci/turnip: Add missing xfails for a618 full VK run |
| - nir_lower_mediump: Drop assertion about not containing movs. |
| - freedreno/ir3: Enable load/store vectorization for SSBO access, too. |
| - freedreno/ir3: Make the types of tex coord padding match. |
| - freedreno/ir3: Fix validation of half-precision image store values. |
| - freedreno/ir3: Fold 16-bit conversions into image load/store src/dsts. |
| - i915g: Ignore the new SEPARABLE_PROGRAM property. |
| - ci/i915: Merge the piglit and deqp runs. |
| - ci/i915: Update manual piglit job expectations. |
| - turnip: Don't bother creating tile_load/store_cs for sysmem rendering. |
| - turnip: Refactor a bit of tu6_emit_tile_select(). |
| - turnip: Move binning decisions from FB usage time to FB creation time. |
| - turnip: Use fb->binning_possible to decide on conditional tile load/stores. |
| - ci/turnip: Add a little forced touch-testing of XFB with no binning requested. |
| - turnip: Refactor a bit of repeated code for subpass setup. |
| - turnip: Include 3d-based CmdClearAttachments() in binning visibility. |
| - turnip: Refactor a bit of subpass attachment processing. |
| - turnip: Allow load/store skipping in vkCmdClearAttachments(). |
| - turnip: Move tile loads back into the draw CS. |
| - nouveau/nv30: Make sure fsat is lowered in the VS. |
| - gallium+glsl: Remove EmitNoSat/PIPE_CAP_VERTEX_SHADER_SATURATE |
| - glsl: Remove stale lower_instructions comments. |
| - glsl: Move exp/log-to-exp2/log2 lowering to glsl-to-NIR. |
| - tgsi_exec: Fix inf/nan handling for divide by zero. |
| - glsl: Drop the div-to-mul-rcp lowering for floats. |
| - nir: Fix idiv lowering on !NativeIntegers when lower_fdiv is also set. |
| - glsl: Drop INT_DIV_TO_MUL_RCP lowering. |
| - ci/crocus: Drop g41's xfail for rgba_half_float_oes. |
| - ci/iris: Disable skqp until it can be stabilized. |
| - turnip: Reverse the order of walking pipes or tiles on odd rows. |
| - mesa/st: Abort the linking on driver link failure. |
| - .gitignore: Qualify the path for the ignored build directory. |
| - nir: Rename is_arb_asm to use_legacy_math_rules and document its meaning. |
| - gallium: Rename MUL_ZERO_WINS to LEGACY_MATH_RULES. |
| - nir_to_tgsi: Set LEGACY_MATH_RULES cap for use_legacy_math_rules shaders. |
| - r600/sfn: Implement 0*x=0 behavior for use_legacy_math_rules. |
| - nouveau/nir: Implement mul_zero_wins behavior for use_legacy_math_rules. |
| - tgsi_to_nir: Set use_legacy_math_rules for TGSI_PROPERTY_LEGACY_MATH_RULES. |
| - gallium: Rename PIPE_CAP_TGSI_LEGACY_MATH_RULES to drop "TGSI" |
| - Revert "nine: Disable optional use of TTN when MUL_ZERO_WINS is available." |
| - iris: Enable PIPE_CAP_LEGACY_MATH_RULES. |
| - crocus: Enable PIPE_CAP_LEGACY_MATH_RULES. |
| - ci/dzn: Skip a test that intermittently times out at a minute. |
| - ci/iris: Disable blender-demo-cube_diorama on APL. |
| - ci: Terminate capture-devcoredump before tarring up artifacts. |
| - turnip: Document some fields about resolves. |
| - ci/bare-metal: Apply autopep8 to our python scripts. |
| - ci/bare-metal: Add per-boot-stage timeouts for fastboot and poe. |
| - ci/bare-metal: Get rid of servo's serial feed threads. |
| - ci/bare-metal: Add handling of netboot firmwares for servo boards. |
| - ci/bare-metal: Add test phase timeouts to all boards. |
| - ci: Remove the unused .test-manual |
| - ci/vc4: Merge quick_shader in with deqp-gles |
| - ci/vc4: Turn on deqp-egl testing by default. |
| - vc4: Propagate txf_ms's dest_type to the lowered txf. |
| - freedreno/ir3: Disable image/ssbo 16-bit conversion folding pre-a6xx. |
| - freedreno/a5xx: Set the buffer bit appropriately in XS_CTRL_REG0. |
| - ci/freedreno: Turn a530 back on by default and update expectations. |
| - ci/bare-metal: Remove "stage: test" from .baremetal-test. |
| - ci/bare-metal: Consolidate needs declarations in .baremetal-test-\*. |
| - ci/bare-metal: Collapse artifacts wget by default. |
| - ci/traces: GC unused code for DXVK trace replay. |
| - ci/traces: Drop ZINK_USE_LAVAPIPE ICD override. |
| - ci/freedreno: Add vulkan+gl integration testing in piglit. |
| - ci/turnip: Test traces on turnip using zink. |
| - ci/turnip: Trim the a630 VK run a bit. |
| - ci: Use "!references" to manage scheduled pipeline rules. |
| - ci: Pull out farm disables to a reused rule. |
| - ci: Pull common zink frontend rules to a shared rule list. |
| - ci/freedreno: Use !references to clean up restricted traces rules. |
| - ci: Simplify vulkan rules using !references. |
| - ci: Split core GL from core VK-or-GL rules. |
| - ci/freedreno: Filter when we run test jobs for VK or GL changes. |
| - ci/zink: Reuse anv-rules. |
| - ci/zink: Simplify lavapipe rules setup, and clarify what the rules are for. |
| - ci: Remove the trailing "when: never"s from rules. |
| - ir3: Retire the cp postsched pass now that we do RA in SSA. |
| - ci: Uprev deqp to 1.3.3.0. |
| - ci/llvmpipe: Skip ssbo.layout.random.all_shared_buffer.36 |
| - ci: Make the retry policy default for all jobs. |
| - ci: Rename .ci-run-policy rules to .build-rules. |
| - ci: Drop .build-rules from container jobs. |
| - ci: Remove .build-rules from core test job definitions. |
| - ci: Fix non-freedreno performance jobs running during Marge merges. |
| - ci: Remove GIT_STRATEGY from .test-manual-mr. |
| - mesa/st: Disable OES_texture_3D for PIPE_CAP_MAX_TEXTURE_3D_LEVELS==0. |
| - vc4: Disable OES_texture_3D being exposed. |
| - vc4: Move previous existing 3D xfails up to the group of 3d xfails. |
| - vc4: Add notes on the remaining dEQP failures. |
| - turnip: Account for additional_cs_reserve_size for both bin and render. |
| - turnip: Fix up per-stage additional size accounting. |
| - turnip: Fix the reservation for vertex inputs. |
| - turnip: Refactor vertex input setup a little. |
| - ir3: Make sure to pass the interp_mode through in our load_bary lowering. |
| - nir+ir3: Rename load_size_ir3 to load_center_rhw_ir3. |
| - freedreno: Rename the "SIZE" regs for interpolateAtOffset to "CENTERRHW" |
| - ir3: Clarify what's happening in the interpolateAtOffset() math. |
| - ir3: Use non-persp interpolation when appropriate for interpolateAtOffset. |
| - zink: Do the timestamp-to-ns math in a double to have better precision. |
| - mesa: Fix the error check for VertexAttrib*. |
| - vc4: Work around a HW bug with 2-vert line loops. |
| - vc4: Add links to test bug reports. |
| - kopper: Fix the return value of kopperSetSwapInterval(). |
| - kopper: Use the swap interval that was set at swapchain creation time. |
| - glx: Unify dri2/dri3 vblank_mode logic. |
| - kopper: Respect the vblank_mode env var. |
| - egl: Just include driconf.h instead of redefining its values. |
| - loader: Just include driconf.h instead of redefining its values. |
| - ci/freedreno: Mark an occasional flake pass that happens on a530. |
| - ir3: Fix the no-emitted-vertex condition emission in geom lowering. |
| - mesa/arbprog: Move the GLSLFragCoordIsSysVal handling to prog_to_nir. |
| - mesa/arbprog: Use nir_lower_io_to_temporaries. |
| - mesa/arbprog: Stop doing optimization in the ARB program IR. |
| - ci/crocus: Update portal 2 trace shas for the recent fix. |
| - ci/crocus: Disable the blender trace. |
| - ci/crocus: Drop xfails for the recent image external fix. |
| - freedreno/ir3: Enable core NIR's 16-bit ALU optimizations. |
| - intel/fs: Simplify brw_barycentric_mode() args. |
| - ci/bare-metal: Add timeouts to the shell commands called in fastboot. |
| - ci/freedreno: Switch a630 to manual/disabled for lab maintenance. |
| - glsl: Use the same NIR path for shared mem lowering as SPIRV does. |
| - glsl: Remove optimize_swizzles. |
| - ci/turnip: Add a bit of spilling-vs-ballot testing on a618. |
| - ci/freedreno: Add some more known flakes for a630 from our IRC logs. |
| - ci/turnip: Bump up the a630 full run timeout. |
| - ci/turnip: Add a couple of missing a630 fails. |
| - Revert "ci/freedreno: Switch a630 to manual/disabled for lab maintenance." |
| - nir: Split usub_sat lowering flag from uadd_sat. |
| - ci/crocus: Fix the commented YML for blender-demo-cube_diorama. |
| - gallivm: Refactor a bit of UBO/SSBO range checking. |
| - gallivm: Optimize single-invocation SSBO stores. |
| - ci/virgl: Make the VM-based virgl tests report to #virgl-ci, too. |
| - nir: Add an opt pass for phis after if choosing between true/false. |
| - glsl: Remove opt_conditional_discard(). |
| - nir: Use nir_foreach_phi_src consistently. |
| - ci/freedreno: Update a630 s8 resolve xfails. |
| - turnip: Split the tiling config into separate layouts based on CCU usage. |
| - turnip: Use the GMEM CCU space for attachments when the stores won't. |
| - zink: Make sure that we keep the existing ici pNext chain on inserts. |
| - spirv: Mark phis as mediump instead of directly lowering them to 16 bit. |
| |
| Enrico Galli (6): |
| |
| - microsoft/spirv_to_dxil: Add pass to lower dynamic accesses on ubo[1] |
| - microsoft/compiler: Fix for arrays of 1 element |
| - microsoft/compiler: Add support for unnamed ubos |
| - microsoft/compiler: Fix when using a shadow sampler more than once |
| - microsoft/spirv_to_dxil: Fix discard semantics |
| - microsoft/spirv_to_dxil: Fix discard to demote + return pass |
| |
| Eric Engestrom (36): |
| |
| - meson: replace deprecated dep.get_pkgconfig_variable(...) with dep.get_variable(pkgconfig : ...) |
| - docs: show and link to the new macOS CI |
| - broadcom/rpi4-skips: drop duplicated lines |
| - mailmap: update my email to Igalia |
| - vulkan: add script to detect & replace identifiers that have been renamed |
| - vulkan/runtime: use updated tokens from vk.xml |
| - vulkan: use updated tokens from vk.xml |
| - nir: use updated tokens from vk.xml |
| - radv: use updated tokens from vk.xml |
| - v3dv: use updated tokens from vk.xml |
| - turnip: use updated tokens from vk.xml |
| - zink: use updated tokens from vk.xml |
| - lavapipe: use updated tokens from vk.xml |
| - pvr: use updated tokens from vk.xml |
| - anv: use updated tokens from vk.xml |
| - dzn: use updated tokens from vk.xml |
| - panvk: use updated tokens from vk.xml |
| - venus: use updated tokens from vk.xml |
| - zink: fix portability_subset usage after rename from EXTX to KHR |
| - zink: drop unused VkPhysicalDevicePortabilitySubsetPropertiesKHR |
| - zink: add missing guards around \`have_{ext}` |
| - docs/isl: fix typos & formatting |
| - vk/util: handful of pythonic cleanups |
| - vk/util: simplify extensions gen code |
| - util/list: rename LIST_ENTRY() to list_entry() |
| - nouveau: use existing list_first_entry() macro instead of re-writing it |
| - docs/features.txt: mark VK_ANDROID_native_buffer as supported on turnip |
| - docs/features.txt: mark VK_ANDROID_native_buffer as supported on v3dv |
| - docs/ci: fix Emma's name |
| - turnip: expose support for VK_EXT_acquire_drm_display |
| - ci: add \`--print-errorlogs` to \`meson test` |
| - bin/gen_release_notes.py: bump advertised vulkan version to 1.3 |
| - vk/device-select-layer: fix .sType of VkPhysicalDeviceGroupProperties |
| - wsi/x11: fix memleak in wsi_x11_connection_create() |
| - meson: replace manual compiler flags with meson arguments |
| - broadcom: fix dependencies in static_library() calls |
| |
| Erico Nunes (11): |
| |
| - ci: put lima farm back online |
| - docs/ci: fix literalinclude of caching setup files |
| - lima/ppir: optimize branch comparison when possible |
| - lima/ppir: merge branch nodes in simple cases |
| - egl: Fix DETECT_OS macro usage |
| - ci: update linux kernel to v5.17 |
| - ci: update docs for linux kernel uprevs |
| - lima/ci: update piglit ci expectations |
| - lima/ci: reenable lima-mali450-piglit-gpu:arm64 |
| - lima/ci: Don't skip piglit opengl 1 tests |
| - ci: Revert "CI: Lima farm is offline" |
| |
| Erik Faye-Lund (157): |
| |
| - gallium: rename continue shader-cap |
| - gallium: rename dround shader-cap |
| - gallium: rename dfracexp/dldexp shader-cap |
| - gallium: rename ldexp shader-cap |
| - vulkan: explicitly cast object-type enum |
| - meson: turn on -Wno-unused-function project-wide |
| - nir: introduce and use nir_component_mask |
| - dzn: fixup indent |
| - dzn: remove all usage of ComPtr<T> |
| - dzn: always use ID3D12Device1 |
| - dzn: always use ID3D12GraphicsCommandList1 |
| - dzn: use ID3D10Blob instead of ID3DBlob |
| - dzn: pass IDXGIAdapter1 to d3d12_create_device |
| - dzn: port to d3d12 c-api |
| - dzn: add a bunch of missing struct-keywords |
| - dzn: drop auto usage |
| - dzn: drop using references |
| - dzn: no more reinterpret_cast |
| - dzn: use define instead of constexpr |
| - dzn: use c-style for-statement |
| - dzn: use c-style initialization |
| - dzn: c-style casts |
| - dzn: do not set unused default member initializer |
| - dzn: port code to plain c |
| - dzn: merge util sources |
| - dzn: remove needless defines |
| - dzn: remove some needless casts |
| - dzn: clean up pointer syntax |
| - dzn: add space before += operator |
| - dzn: drop needless enum-casts |
| - dzn: fixup indent |
| - dzn: remove stale cpp_args |
| - nir/lower_int64: do not try to clamp floats to int-range |
| - vulkan: do not depend on alignof(void) |
| - lavapipe: quiet non-conformant warning on ci |
| - panvk: quiet non-conformant warning on ci |
| - vulkan: use c_msvc_compat_args for shared code |
| - vulkan: drop empty vulkan_wsi_args |
| - microsoft/clc: fixup indentation |
| - microsoft/compiler: fixup indentation |
| - meson: remove unused defines |
| - meson: deprecate specifying osmesa-bits |
| - gallium/xlib: fix stale comment |
| - mesa: add missing error-path |
| - util: fix test on msvc |
| - lavapipe: tolerate NULL UBO and SSBO descriptors |
| - lavapipe: tolerate NULL image view descriptors |
| - lavapipe: tolerate NULL image-buffer view descriptors |
| - lavapipe: tolerate NULL sampler-buffer view descriptors |
| - lavapipe: add support for null-descriptors from EXT_robustness2 |
| - zink: pass screen to extension getter macro |
| - zink: initialize dispatch-table before queue-init |
| - zink: call vk-functions through dispatch-table |
| - zink: pass screen to zink_create_instance |
| - zink: dynamically load a few functions |
| - zink: use run-time linking to loader |
| - dzn: reset correct list |
| - vulkan/wsi: unbreak win32-support |
| - d3d12: add missing cmath includes |
| - tests/graw: rename shaders from .sh to .txt |
| - dzn: add a debug-flag to wait for the debugger |
| - dzn: add debug option to redirect stdout/stderr |
| - editorconfig: remove scons-ism |
| - editorconfig: remove html-rule |
| - editorconfig: remove m4-rule |
| - editorconfig: remove pl-rule |
| - radeonsi: port amdgcn_glslc build to meson |
| - freedreno: remove stale makefile |
| - intel: remove stale makefile |
| - meson: modernize win_flex stdint.h logic |
| - c99_compat.h: remove inline and __func__ shims |
| - util: add missing c99_compat.h includes |
| - util: remove needless c99_compat.h includes |
| - egl: remove needless c99_compat.h includes |
| - gallium/util: remove needless c99_compat.h includes |
| - mapi: remove needless c99_compat.h includes |
| - gallium: remove needless c99_compat.h includes |
| - include: remove needless c99_compat.h includes |
| - intel/isl: remove needless c99_compat.h includes |
| - vulkan/device_select: remove needless c99_compat.h includes |
| - include: drop c99_math.h |
| - include: drop c11_compat.h |
| - pvr: do not use c_msvc_compat_args |
| - turnip: do not do STATIC_ASSERT on a variable |
| - freedreno: do not use variable in STATIC_ASSERT |
| - freedreno: do not do STATIC_ASSERT on variables |
| - iris: do not do STATIC_ASSERT on variables |
| - freedreno: degrade STATIC_ASSERT to assert |
| - v3dv: do not do STATIC_ASSERT on variables |
| - util: add IS_POT macro |
| - intel/compiler: use macro for power-of-two check |
| - broadcom/compiler: use macro for power-of-two check |
| - util: use static_assert directly |
| - util: implement STATIC_ASSERT using c++11 / c11 primitives |
| - meson: add back -Werror=vla to msvc compat flags |
| - ci/windows: add back build-error detection |
| - d3d12: move cubemap-lowering to common-code |
| - microsoft/compiler: mark image-functions as such |
| - microsoft/compiler: make sampler-lowering optional |
| - microsoft/spirv_to_dxil: lower cube-images to 2d arrays |
| - dzn: enable binding_model tests |
| - dzn: correct assert-condition |
| - dzn: handle stencil-attachment-optimal layout |
| - dzn: use a null-rtv to handle no-attachment |
| - dzn: enable two more exts |
| - util: sort includes |
| - llvmpipe: assume c99 |
| - c99_compat.h: assume c99 support |
| - util: use c11 alignof instead of our own |
| - glcpp: remove outdated msvc hack |
| - vulkan/util: assume c11 |
| - util: use c11 alignas instead of rolling our own |
| - gallium: remove stale comment |
| - gallium: reorganize includes |
| - llvmpipe: use c11 alignas instead of PIPE_ALIGN_VAR |
| - gallium: use c11 alignas instead of PIPE_ALIGN_VAR |
| - gallium: use c++11 alignas instead of PIPE_ALIGN_VAR |
| - gallium: remove repeated check |
| - gallium: remove unused macros |
| - gallium: clean up PIPE_ALIGN_STACK-definition |
| - docs/zink: correct requirement for tbos |
| - dzn: correct order of src_subres and dst_subres args |
| - docs: set language to english |
| - dzn: fill in driver name and info |
| - dzn: fill in float-control details |
| - dzn: fill in non-uniform-indexing props |
| - dzn: fill in bindless props |
| - dzn: fill in depth/stencil resolve props |
| - dzn: fill in minmax props |
| - dzn: fill misc props |
| - dzn: expose VK_KHR_driver_properties |
| - zink: remove pointless test |
| - zink: remove pointless assert |
| - zink: drop pointless comment |
| - llvmpipe: merge all rasterizer-bind functions |
| - llvmpipe: consolidate legacy_points and multisample state |
| - llvmpipe: clean up bresenham implementation |
| - docs/zink: correct name of extension |
| - docs/zink: document required VK_KHR_swapchain_mutable_format |
| - docs/zink: document required VK_EXT_border_color_swizzle |
| - zink: warn about missing image2DViewOf3D feature |
| - zink: fix EXT_color_write_enable check |
| - gallium/hud: do not use texture-rect for font |
| - llvmpipe: remove unused function |
| - zink: clean up extension emitting |
| - zink: set dynamic indexing caps |
| - zink: more accurately set {Sampled,Image}1D caps |
| - zink: more accurately set {Sampled,Image}Buffer caps |
| - zink: more accurately set ImageQuery cap |
| - docs: fixup link to virgl docs |
| - zink: add have_D32_SFLOAT_S8_UINT boolean |
| - zink: do not use VK_FORMAT_D32_SFLOAT_S8_UINT without checking |
| - zink: type_main -> type_void_func |
| - zink: add spirv_builder_function_call |
| - zink: wrap discard in a function |
| - zink: clamp miplodbias when creating sampler |
| - docs/zink: document rgtc requirement |
| |
| Fabrice Fontaine (1): |
| |
| - src/util/futex.h: fix build on 32-bit architectures using 64-bit time_t |
| |
| Feng Jiang (1): |
| |
| - virgl: do not share virgl_screen between different drm_files |
| |
| Filip Gawin (5): |
| |
| - r300: Print warning when stubbing derivatives |
| - r300: keep negation if w is an inline constant |
| - r300: don't check for unitialized reads when rewriting register |
| - r300: prefer old not native swizzle in constant folding |
| - r300: don't read from output transform_r300_vertex_SEQ/SNE |
| |
| Francisco Jerez (2): |
| |
| - intel/dev: Compute pixel pipe information based on geometry topology DRM query. |
| - intel/fs/xehp+: Emit scheduling fence for all NIR barriers on platforms with LSC. |
| |
| Frank Binns (3): |
| |
| - pvr: replace p_compiler.h boolean with stdbool |
| - pvr: shorten error to err in label names |
| - pvr: Add TI AM62 as a supported device. |
| |
| Georg Lehmann (74): |
| |
| - aco: Remove some old comments in aco_opcodes.py. |
| - aco: Reuse previous -1 result in find_msb to avoid using VOP3. |
| - nir/lower_mediump: Add a new pass to fold 16bit image load/store. |
| - aco: Implement image_store d16. |
| - aco: Implement image_load d16. |
| - radv: Use nir_fold_16bit_image_load_store_conversions. |
| - radv: Run copy_prop and dce after folding 16bit sampling/load/store. |
| - aco: Remove D16 zero components from image stores. |
| - meson: Tell glslang to be quiet. |
| - radeonsi: Use nir_fold_16bit_image_load_store_conversions. |
| - wsi/x11: Don't leak xcb_get_geometry_reply_t. |
| - nir/opt_algebraic: Fix mask in shift by constant combining. |
| - nir/opt_algebraic: Simple xor/ishr optimizations. |
| - radv: Don't retile read-only images. |
| - nir/opt_algebraic: Optimize Doom Eternal's word extract by LSB. |
| - aco: Implement uclz. |
| - aco: Implement usub_sat. |
| - aco: Implement isub_sat. |
| - ac/llvm: Implement usub_sat and isub_sat. |
| - ac/llvm: Implement uclz. |
| - nir: Add optional lowering for mul_32x16. |
| - radv: Lower mul_32x16. |
| - radv: Lower hadd/hradd. |
| - radv: Lower 8bit isub_sat/usub_sat. |
| - radv: Enable VK_INTEL_shader_integer_functions2. |
| - radv, aco: Packed usub_sat/isub_sat. |
| - zink: Use VK_USE_64_BIT_PTR_DEFINES to check for 64bit platforms. |
| - radv, aco: Lower txf offset in NIR. |
| - nir/lower_tex: Add lower_array_layer_round_even option. |
| - radv, aco: Round texture array layer in NIR. |
| - vulkan: Spec update to 1.3.217. |
| - radv: Implement VK_EXT_non_seamless_cube_map. |
| - aco: Print r128/a16 MIMG bits separately. |
| - aco: Remove r128_a16 MIMG builder option. |
| - anv: Implement VK_EXT_non_seamless_cube_map. |
| - radv: Remove redundant memset in radv_descriptor_set_create. |
| - radv: Check descriptor pool entry count before allocating a new set. |
| - radv: Don't check if we need to copy immutable samplers for non push templates. |
| - aco/ra: Static assert that changing instruction type to VOP2 is valid. |
| - aco/ra: Move mac encoding optimization to its own function. |
| - aco/ir: Pad SOP2 and SOPC to the same size as SOPK. |
| - aco/ra: Optimize some SOP2 instructions with literal to SOPK. |
| - aco/optimizer: Convert s_add_u32 with literals to s_add_i32 if carry is not used. |
| - aco/optimizer: Optimize SOPC with literal to SOPK. |
| - r600/sfn: Add missing std::array include. |
| - aco: Fix swapping sources in SOPC -> SOPK optimization. |
| - aco/assembler: Fix s_bitreplicate_b64_b32 on GFX9. |
| - nir_to_tgsi: Lower uadd_sat/usub_sat. |
| - zink: Lower uadd_sat/usub_sat. |
| - r600: Lower uadd_sat/usub_sat. |
| - ir3: Implement [iu]sub_sat. |
| - nir/lower_io: Fix array length of buffers larger than INT32_MAX. |
| - nir/opt_algebraic: Optimize [ui](add|sub)_sat with 0. |
| - radv,aco: Don't lower and vectorize 16bit iabs. |
| - radv: Remove ineg from lower_bit_size_callback callback. |
| - radv, aco: Don't lower 16bit isign. |
| - ir3: Only run 16bit tex NIR passes on a5xx+. |
| - ir3: Lower alu to scalar if nir_legalize_16bit_sampler_srcs made progress. |
| - nir: Rewrite and merge 16bit tex folding pass with 16bit image folding pass. |
| - nir/lower_mediump: Add an option to only fold if all tex sources can be folded. |
| - ir3: Stop using nir_legalize_16bit_sampler_srcs. |
| - aco: Add G16 opcodes. |
| - aco: Support 16bit sources for texture ops. |
| - radv: Fold 16bit tex sources. |
| - aco: Add a note about G16 without A16 to README-ISA.md. |
| - aco/ir: Generalize (un)ordered_swapped. |
| - aco/ir: Add integer get_cmp_info. |
| - aco/ir: Fix swapped nle. |
| - aco/ir: Add vcmpx opcode to get_cmp_info. |
| - aco/ir: Add v_cmp_class to get_cmp_info. |
| - aco/ir: Add swapped opcode for v_cmp_u/v_cmp_o. |
| - radeonsi: Stop using nir_legalize_16bit_sampler_srcs. |
| - nir/fold_16bit_tex_image: Default to only_fold_all. |
| - aco: Force tex operand to have the correct sub dword size before packing. |
| |
| Gert Wollny (85): |
| |
| - virgl: Extend integer write out output fix to all non-move integers ops |
| - r600/sfn: Fix store_shared_r600 write masks |
| - ntt: Add option to not optimized register allocation |
| - nir: Add pass to split 64 bit vec3 and vec4 variable access and phis |
| - ntt: Split 64 bit vec3 and vec4 local variables and phis |
| - virgl: Clear all indirect flags when writing to a temporary |
| - nir: Don't optimize to 64 bit fsub if the driver doesn't support it |
| - ntt: remove dead input variables before lowering FS IO |
| - r600/sb: Don't optimize float GT and GE |
| - r600: tune nir options |
| - r600: Add support for TGSI_OPCODE_ATOMIMIN and IMAX |
| - virgl: don't move input gl_SampleMaskIn to a temp |
| - r600: Allow eight bit, three channel formats for vertex buffers |
| - nir: honor lower_double options for ffloor and ffract |
| - r600/sb: Don't create three source ops with all kcache values |
| - r600/sb: Fall back to un-optimized shader if scheduling fails |
| - virgl: Don't let ntt optimize the register allocation |
| - tgsi: Don't bother setting the TGSI_PROPERTY_SEPARABLE_PROGRAM |
| - virgl: Submit the TGSI_PROPERTY_SEPARABLE_PROGRAM when the host understands it |
| - r600: Add number of ALU groups to statistics |
| - r600: Make sure that LDS instructions only use bank swizzle 012 |
| - r600: Update nir options |
| - r600/sfn: rewrite NIR backend |
| - r600: enable sb by default also for NIR |
| - r600: Cleanup nir options |
| - r600/sfn: support nir_op_mulz and legazy math rules |
| - r600/sfn: Add missing copyright information |
| - r600/sfn: Don't use assert to test success of operation |
| - r600: limit loops when trying to merge alu groups |
| - r600/sfn: Never consider an op with register dest as dead |
| - r600/sfn: Add support for fdph |
| - r600/sfn: lower CLIPVERTEX to clip planes |
| - r600/sfn: emulate pmr::monotonic_buffer_resource if needed |
| - r600/sfn: clean up some commented out code |
| - r600/sfn: fix storing shader output array locations |
| - r600/sfn: lower undef to zero |
| - r600/sfn: count LDS queue pop reads separately in assembler |
| - r600/sfn: Fix the kcache failure handling |
| - r600/sfn: Fix emmision of LDS instructions |
| - r600/sfn: Fix used of local shared write mask |
| - r600/sfn: Fix streamout with non-zero component write |
| - r600: don't dump shader info to files on debug |
| - r600/sfn: make sure the memory pool is released after shader translation |
| - r600/sfn: lower the right shader variant |
| - r600: Delete possible old NIR variant when translating from TGSI |
| - r600: don't clone the nir shader, just use it |
| - r600ß/sfn: Don't fall through with VS outputs |
| - r600/dri-tweak: Disable ARB_shader_image_load_store for "Tomb Raider 2013" |
| - r600/sb: fall back to unoptimized shader if RA fails |
| - r600: Don't set TES_EVAL UCP dirty when it is not supported |
| - virgL: lower fneg and fabs |
| - virgl: honor host caps in shader cash sha evaluation |
| - r600/sfn: Factor out fragement shader class for EG |
| - r600/sfn: Support pre-EG style FS input |
| - r600/sfn: Prepare StreamOut instruction for pre EG opcodes |
| - r600/sfn: Handle slot differences between pre-EG and EG |
| - r600/sfn: Handle chipclass with kcache allocation |
| - r600/sfn: Handle buffer size query on pre-EG |
| - r600/sfn: Handle pre-EG GS input fetch |
| - r600/sfn: Handle primitive input handling for pre-EG |
| - r600/sfn: Handle non-compressed MS texel fetch |
| - r600/sfn: Set trans flag for flt_to_int on pre-EG |
| - r600/sfn: Handle pre-EG buffer fetch |
| - r600/sfn: Disable FP64 for pre-CEDAR |
| - r600/sfn: Advertise support for fmulz |
| - r600/sfn: Enable NIR for pre RG hardware |
| - r600/sfn: remove leftover debug output |
| - r600/sfn: Handle R600 class sin/cos |
| - r600: increase possible stack size in binary code |
| - r600/sfn: Schedule shift instruction on R600 in t-slot |
| - r600/sfn: Add GS thread fix just like the TGSI code path |
| - r600/sfn: Sort FS inputs to make interpolated values come first |
| - r600/sfn: Fix color outputs when color0 writes all |
| - r600: Force NOPs when loading AR on R600 class hardware |
| - r600/sfn: Handle R600 scratch read |
| - r600: Don't use SB with R600 style scratch reads |
| - r600/sfn: Handle color0 writes all on R700 like on EG |
| - r600/sfn: Don't tag mem-ring and stream instructions as exports |
| - r600/sfn: Don't schedule GDS instructions early |
| - r600/sfn: Don't scan the whole block for ready instructions |
| - r600/sfn: Use a heuristic to keep SSBO setup and store close |
| - r600: Fix reporting TGSI IR support |
| - r600/sfn: Use a low number for unused target register |
| - virgl: when reading back wait first, then do the transfer |
| - r600/sfn: override register ID when it doesn't matter |
| |
| Glenn Kennard (1): |
| |
| - nv30: Fix non-scissored clears after a scissor has been set |
| |
| Greg Depoire--Ferrer (5): |
| |
| - util/queue: add missing space to comment in util_queue_destroy |
| - zink: check for error when initializing util_queue |
| - lavapipe: check for error when initializing lvp_queue |
| - util/queue: handle thread cration failure in util_queue_adjust_num_threads |
| - zink: fix crash on dlclose when zink_internal_create_screen failed |
| |
| Guilherme Gallo (48): |
| |
| - ci/lava: Encapsulate job data in a class |
| - ci/lava: Filter log lines from LAVA return |
| - ci/lava: Return test-suite result as exit code |
| - ci/lava: Fix LAVA job validation |
| - ci/lava: Set lava-signal to kmsg |
| - ci/lava: Always validate the lava job |
| - ci/lava: Use lava-test-case to run custom scripts in LAVA |
| - ci/lava: Trap init-stage2.sh background processes |
| - ci/lava: Fix shebang in deqp-runner.sh |
| - ci/lava: Parse all test cases from 0_mesa suite |
| - ci/lava: Let LAVA job submitter run without JWT file |
| - ci/lava: Reduce LAVA boot phase timeout to 3 minutes |
| - ci/lava: Cancel the job if the script is interrupted |
| - ci/lava: Improve exception handling |
| - ci/lava: Retry when data fetching log RPC call is corrupted |
| - ci: Fix init-stage2 exit code |
| - ci: Update trace after CI fix |
| - ci: Fix tests expectations |
| - ci/lava: Make job submitter parse the job result |
| - ci/lava: Fix colored LAVA outputs |
| - ci/lava: Fix Gitlab Section markers |
| - ci/lava: Add support for more complex color codes |
| - ci/lava: Filter out undesired messages |
| - ci/lava: Improve result parsing regex |
| - ci/lava: Create Gitlab log sections handler |
| - ci/lava: Create LogFollower and move logging methods |
| - ci/lava: Follow job execution via LogFollower |
| - ci/lava: heartbeat: don't consider kernel message logs |
| - ci/lava: Wrap job info into a collapsed section |
| - ci/lava: Wrap job definition dump into a collapsed section |
| - ci/lava: Add an integration test for LAVA jobs |
| - ci/lava: Don't print LAVA debug messages |
| - ci/lava: Highlight kernel messages in bold |
| - ci/lava: Highlight job retrying message |
| - ci/lava: Stop printing after the result line |
| - ci/lava: Flexibilize section marker regexes |
| - ci/lava: Rename console color names |
| - ci/lava: Update license header |
| - ci/lava: Split lava_log into modules |
| - ci/lava: Detect R8152 issues preemptively and retry |
| - ci/lava: Make hung job status yellow |
| - ci/lava: Color red for fatal and yellow for warning |
| - ci/lava: Add \`slow` pytest marker |
| - ci/lava: Add canceled job status |
| - ci/lava: Add delay before calling lava-test-case |
| - ci/lava: Increase boot timeout |
| - ci/lava: Only parse result within testcase section |
| - ci/lava: Remove MINIO_RESULTS_UPLOAD if requested |
| |
| Hans-Kristian Arntzen (4): |
| |
| - radv: Fix RTPSO hashing of pGroups. |
| - vulkan: Update Vulkan XML and headers to 1.3.219. |
| - vulkan: Add common code for VK_EXT_shader_module_identifier. |
| - radv: Implement VK_EXT_shader_module_identifier. |
| |
| Hoe Hao Cheng (4): |
| |
| - zink/codegen: remember the fields in feats/props structs of extensions |
| - zink/codegen: do not include compilation structs with extension structs |
| - zink/codegen: do not automatically consider extensions promoted |
| - zink/codegen: use the updated identifiers |
| |
| Hyunjun Ko (15): |
| |
| - radv: fix assertion for the count of xfb output |
| - turnip: add an assertion for max descriptor set count. |
| - ir3: handle intrinsic_load_draw_id when scanning driver constants |
| - turnip: clamp to 1 for negative bottom-right of viewport. |
| - turnip: fix wrong size for lrz cs when lrz_track_quirk is enabled. |
| - freedreno,ir3: rename Z_CLAMP_ENABLE to Z_CLIP_DISABLE |
| - turnip: keep the depth_clip_disable state at the pipeline builder. |
| - zink: remove the workaround for depth_clip_enable on turnip |
| - turnip: Remove an unnecessary assert. |
| - turnip: expose VK_EXT_shader_module_identifier |
| - ir3: change maximum size of const files. |
| - ir3: handle shared consts. |
| - turnip: clean up unused parameters for user consts. |
| - turnip: Change to use shared consts for PushConstants |
| - tu: increase maxPushConstantsSize to 256. |
| |
| Iago Toral Quiroga (79): |
| |
| - v3dv: ignore no-op barriers |
| - v3dv: drop obsolete comment |
| - v3dv: don't emit final noop job if we don't have anything to signal |
| - v3dv: document Vulkan requirements for signaling operations |
| - v3dv: ignore barriers for image layout transitions from undefined layouts |
| - v3dv: drop default sampler states if not used |
| - v3dv: flag BCL barriers for all-graphics and all-commands stages |
| - v3dv: skip binning sync if binning shaders don't access external resources |
| - v3dv: use wait stage for wait semaphores to decide about binning syncs |
| - v3dv: check input attachment usage as sampled usage |
| - v3dv: implement VK_EXT_separate_stencil_usage |
| - v3dv: trivially implement VK_KHR_separate_depth_stencil_layouts |
| - docs/features: flag VK_KHR_timeline_semaphore as implemented |
| - v3dv: expose separateDepthStencilLayouts |
| - v3dv: implement vkGetPipelineExecutableInternalRepresentationsKHR |
| - v3dv: implement vkGetPipelineExecutablePropertiesKHR |
| - v3d/compiler: add more stats to prog_data |
| - v3dv: implement vkGetPipelineExecutableStatisticsKHR |
| - v3dv: expose VK_KHR_pipeline_executable_properties |
| - v3dv: don't leak variant QPU when pipeline compile fails |
| - v3dv: drop unsused struct |
| - v3dv: expose VK_KHR_shader_float_controls |
| - v3dv: expose VK_KHR_spirv_1_4 |
| - v3dv: expose VK_KHR_vulkan_memory_model |
| - v3dv: fix format usage checks when extended usage flag is set |
| - broadcom/compiler: don't promote early fragment tests when writing sample mask |
| - v3dv: consume barriers at the right stages |
| - v3dv: only clear BCL barrier state if we don't have pending graphics barriers |
| - v3dv: use an explicit struct type to track barrier state |
| - v3dv: track sources of barriers |
| - v3dv: limit sync for barriers to hw queues selected by source mask |
| - v3dv: don't be too aggressive disabling early Z |
| - v3dv: use the global RCL EZ disable if we don't have any EZ draws in the job |
| - broadcom/compiler: disable flags optimization for loop conditions |
| - v3dv: merge pending secondary barrier state into primary command buffers |
| - v3dv: handle barriers at the end of a command buffer |
| - v3dv: fix leak |
| - broadcom/compiler: fix postponed TMU spills with multiple writes |
| - broadcom/compiler: don't predicate postponed spills |
| - v3dv: fix pool descriptor count for inline uniform buffers |
| - v3dv: fix merge jobs |
| - nir: fix documentation for uadd_carry and usub_borry opcodes |
| - v3dv: don't lower uadd_carry and usub_borrow |
| - v3dv: remove obsolete comment |
| - nir/serialize: fix missing divergence info after deserialization |
| - broadcom/compiler: don't try to hide TMU latency at QPU scheduling |
| - broadcom/compiler: don't over-estimate latency of TMU instructions |
| - v3dv: fix comment in texel buffer shader copy path |
| - v3dv: remove unused lowering for nir_intrinsic_load_layer_id |
| - v3dv: fix comment for point_sprite_mask filed in shader key |
| - v3dv: clean up get_internal_type_bpp_for_image_aspects |
| - v3d,v3dv: stop hardcoding various image limits |
| - v3d: stop using a smaller texture limit in OpenGL |
| - v3d,v3dv: stop copying and pasting the translate_swizzle helper |
| - v3dv: only save/restore push constant data for meta operations if needed |
| - v3dv: move push constant data to the command buffer state |
| - v3dv: limit upload of indirect push constant data |
| - v3dv: don't allocate MAX_PUSH_CONSTANTS_SIZE bytes for the push constants UBO |
| - v3dv: stop tracking push constant buffer references |
| - v3dv: allocate more push constant buffers if needed |
| - v3dv: add a dirty state for pending push constants UBO updates |
| - v3dv: explain why we clear certain state after a draw call |
| - v3dv: remove duplicate condition |
| - nir: add nir_address_format_2x32bit_global |
| - broadcom/compiler: implement 2x32 global intrinsics |
| - broadcom/compiler: track if a shader uses global intrinsics |
| - v3dv: implement VK_KHR_buffer_device_address |
| - v3dv: handle VK_STRUCTURE_TYPE_DRM_FORMAT_MODIFIER_PROPERTIES_LIST_2_EXT |
| - v3dv: remove assert that poolSizeCount must be larger than 0 |
| - v3dv: fix offset reported by vkGetImageSubresourceLayout |
| - v3dv: remove unnecesary FIXME comment |
| - v3d,v3dv: add a common v3d_hw_prim_type helper |
| - v3dv: only check binning sync for semaphores for the first CL job |
| - v3dv: explicitly set sampler min/max filter properties to false |
| - v3dv: fill in Vulkan 1.2 missing device properties |
| - v3dv: enable missing Vulkan 1.2 features for supported extensions |
| - v3dv: enable missing mandatory Vulkan 1.2 features |
| - broadcom/ci: update list of slow tests |
| - v3dv: expose Vulkan 1.2 |
| |
| Ian Romanick (26): |
| |
| - glsl: Fix mixed tabs and spaces in lower_mat_op_to_vec.cpp |
| - glsl: Add flag to disable part of do_vec_index_to_cond_assign |
| - nir: Use nir_vector_extract to generate code for ir_binop_vector_extract |
| - glsl: Delete lower_extracts code |
| - anv: Remove FS executables when applying the null FS optimization |
| - intel/fs: Add missing synchronization for WaW dependency |
| - nir: i32csel opcodes should compare with integer zero |
| - nir/algebraic: Fix NaN-unsafe fcsel patterns |
| - nir: Add and use algebraic property "is selection" |
| - nir/range_analysis: Teach range analysis about fdot opcodes |
| - intel/compiler: Rename vec4 state URB opcodes to have VEC4\_ prefix |
| - intel/eu: Validate some aspects of URB messages |
| - intel/compiler: Move logical-send lowering to a separate file |
| - intel/fs: Add _LOGICAL versions of URB messages |
| - intel/fs: Lower URB messages to SEND |
| - intel/fs: Remove non-_LOGICAL URB messages |
| - intel/vec4: Set lower_usub_sat |
| - spirv: Fix array length of buffers larger than INT32_MAX. |
| - intel/fs: Make logical URB write instructions more like other logical instructions |
| - intel/fs: Make logical URB read instructions more like other logical instructions |
| - intel/fs: Don't pass flags to lower_urb_read_logical_send or lower_urb_write_logical_send |
| - intel/fs: Eliminate "masked" and "per slot offset" URB messages |
| - intel/fs: Use canonical form for "work around" tags |
| - nir: spirv: Allow 32-bit version of nir_intrinsic_is_sparse_texels_resident |
| - radeonsi: r600: d3d12: st: Use NIR lowering for tg4 offset arrays instead of GLSL lowering |
| - glsl: Remove lower_offset_arrays pass |
| |
| Icecream95 (45): |
| |
| - clc: Use stringstream for printing spirv errors |
| - panfrost: Stop overallocating compressed textures |
| - panfrost: Enable NIR lowering of half float packing |
| - panfrost: Fix pack_32_2x16 implementation |
| - pan/bi: Use texture index instead of sampler for message preloading |
| - pan/va: Improvements to LEA_TEX instructions |
| - pan/va: Add more transcendental operations |
| - pan/va: Add SEG_ADD/SEG_SUB operation |
| - pan/va: Add absneg modifier to V2F32_TO_V2F16 |
| - pan/va: Improve texture instructions |
| - nir/lower_tex: Copy more fields in lower_tex_to_txd and friends |
| - pan/mdg: Keep min_bound at 16 when alignment requires it |
| - pan/mdg: Use MAX2 to set min_alignment |
| - pan/mdg: Fix mask usage when filling before a spill |
| - pan/mdg: Fix disassembly of store instructions |
| - pan/mdg: Return the instruction from mir_insert_instruction_*_scheduled |
| - pan/mdg: Fix multiple spilt writes in the same bundle |
| - panfrost: Copy blend constant into variant even when reusing it |
| - panfrost: Increase the limit for blend shader variants |
| - panfrost: Move patched_s out of the pan_blitter_views struct |
| - nir: Add store_combined_output_pan BASE back |
| - pan/bi: Read base for combined stores |
| - pan/mdg: Read base for combined stores |
| - panfrost: Only write depth / stencil once if MRT is used |
| - pan/bi: Reverse linear constraint bits |
| - pan/bi: Add nodearray datastructure |
| - pan/bi: Use nodearrays for linear constraints |
| - panfrost: Constant stencil value tracking |
| - panfrost: Stop using sparse_array for batch BOs |
| - panfrost: Use a macro for checking for a shared bind type |
| - panfrost: Remove sync arguments from panfrost_batch_submit |
| - panfrost: Enable QUAD_STRIP and POLYGON on v6 |
| - pan/va: Use the _safe iterator when adding blend shader calls |
| - pan/decode: fflush buffers after dumping and before aborts |
| - pan/decode: Use tag bits for resource entry count |
| - pan/decode: Change indent when decoding resources |
| - panfrost: Only emit images when they are present |
| - panfrost: Emit the correct number of attributes |
| - panfrost: Clear the GENERAL varying buffer field if unused |
| - panfrost: Correctly calculate prefetch suppresion varying index |
| - panfrost: Add a debug option for checking overflows on pool uploads |
| - panfrost: Don't unbind recently bound streamout targets |
| - panfrost: Allow NULL streamout targets |
| - nine: Make vdecl_index_map always signed |
| - nine: Only enable thread_submit by default on x86 |
| |
| Igor Torrente (2): |
| |
| - venus: Add support to VK_KHR_maintenance4 extension |
| - venus: Use maintenance4 to get max_size_buffer |
| |
| Ikshwaku Chauhan (1): |
| |
| - Revert "radeon: add EFC support to only VCN2.0 devices" |
| |
| Illia Abernikhin (1): |
| |
| - vulkan/runtime: repair thread safety in method vk_queue_start_submit_thread() |
| |
| Indrajit Das (5): |
| |
| - radeonsi/gfx11: VRS changes |
| - radeonsi: save the fs constant buffer to the util blitter context |
| - gallium/u_blitter: clear color buffers using color from a constant buffer |
| - mesa/st: add nir shader to clear color buffers using constant value |
| - mesa/st: clear color buffers using color from a constant buffer |
| |
| Italo Nicola (3): |
| |
| - mesa/st: call pipe->link_shader even if ir loaded from cache when linking |
| - tgsi: Add SEPARABLE_PROGRAM property |
| - virgl: overpropagate precise flags |
| |
| Iván Briano (6): |
| |
| - vulkan/wsi/wayland: Fix double free on error condition |
| - anv: re-alloc push constants after secondary command buffers |
| - anv: do not get rid of empty/useless fragment shaders |
| - anv: assert inheritance_info is not NULL |
| - anv: emit scissors when the pipeline changes |
| - anv: pipelineStageCreationFeedbackCount is allowed to be 0 |
| |
| James Zhu (5): |
| |
| - amd: update headers to support decode software ring |
| - radeonsi/gfx11: update codec support for gfx11 |
| - radeonsi/vcn: add decode software ring support for gfx11 |
| - radeonsi/vcn: add jpeg decode support for gfx11 |
| - amd/common: some ASICs with gfx9 use compute rings for render |
| |
| Jan Beich (1): |
| |
| - vulkan/wsi: treat EBADF as missing DMA_BUF_IOCTL_{EXPORT,IMPORT}_SYNC_FILE |
| |
| Jan Palus (1): |
| |
| - pvr: use UINT64_C for 64-bit constant |
| |
| Jason Ekstrand (312): |
| |
| - clc: Only initialize LLVM once |
| - clc: Rework logging a bit |
| - clc: Declare LLVMContexts on the stack |
| - intel/compiler: Set lower_fisnormal |
| - nir,microsoft: Move scale_fdiv into a common NIR pass |
| - nir/opcodes: fisfinite32 should return bool32 |
| - util: Remove util_cpu_detect |
| - util/blob: Add align helpers |
| - util/set: Respect found in search_or_add_pre_hashed |
| - vulkan: Add a common VkPipelineCache implementation |
| - vulkan/pipeline_cache: Add helpers for storing NIR in the cache |
| - anv: Rename a fail label in CreateDevice |
| - anv: Switch to the new common pipeline cache |
| - anv: Clean up pipeline cache helpers a bit |
| - nir: Lower all bit sizes of usub_borrow |
| - vulkan: Set signals[i].stageMask = ALL_COMMANDS for QueueSubmit2 wrapping |
| - vulkan: Use ALL_COMMANDS_BIT for waits/signals instead of ~0 |
| - spirv: Handle Op*MulExtended for non-32-bit types |
| - vulkan/log: Allow but warn if called with a NULL object |
| - vulkan/log: Allow but warn for client-invisible objects |
| - nir/constant_folding: Break TXB folding into a helper function |
| - nir: Constant fold sampler/texture offsets |
| - panvk: Take buffer offsets into account in BindVertexBuffers |
| - panvk: Eliminate unused vertex attributes |
| - nir/deref: Add an alu-of-cast optimization |
| - panvk: Add address/range helpers for panvk_buffer |
| - vulkan/wsi: Close file descriptors in wsi_destroy_image |
| - vulkan/wsi: Only use a single fd per wsi_image |
| - vulkan/wsi: Hang on to file descriptors |
| - vulkan/wsi: Reset the image fence right before vkQueueSubmit |
| - vulkan/wsi: Set the right stage flags for semaphore waits |
| - vulkan/wsi: Add signal_fence/semaphore_for_image helpers |
| - nir/cf: Return a cursor from nir_cf_extract as well |
| - nir: Fix constant folding for non-32-bit ifind_msb and clz |
| - nir: Set image_buffers and msaa_images in lower_samplers_as_deref |
| - nir: Stop assuming shader_info::textures_used is 32-bit |
| - shader_info: Make images_used a bitset |
| - nir: Gather samplers_used separately from textures |
| - util/bitset: Support larger ranges in BITSET_TEST/CLEAR_RANGE |
| - shader_info: Bump the number of images and textures supported |
| - gallium/u_threaded_context: Use PIPE_MAX_SHADER_SAMPLER_VIEWS for sampler_buffers |
| - mesa/st: Clamp MaxImageUniforms to MAX_IMAGE_UNIFORMS |
| - gallium: Bump PIPE_MAX_SHADER_IMAGES to 64 |
| - panvk: Call nir_lower_io_arrays_to_elements_no_indirects |
| - nir/lower_blend: Be more explicit about deref assumptions |
| - nir/lower_blend: Stop passing the whole options object around |
| - nir/lower_blend: Support SNORM and integer formats for logic ops |
| - nir/builder: Add a nir_trim_vector helper |
| - nir/builder: Add a nir_resize_vector helper |
| - midgard: Handle FB fetch from non-vec4 output variables. |
| - nir/lower_blend: Expand or shrink output variables as needed |
| - panvk: Advertise support for logicOp |
| - iris: Set BindingTableEntryCount for compute shaders |
| - nir: Mark negative re-distribution on fadd as imprecise |
| - clover: Set images/samplers_used when lowering images |
| - lavapipe: Set shader_info::samplers_used |
| - gallium: Set shader_info::samplers_used in pstipple_fs |
| - ttn: Set shader_info::samplers_used |
| - mesa/st: Set samplers_used in lower_tex_src_plane |
| - gallium/draw: Properly handle nr_samplers != nr_sampler_views in keys |
| - llvmpipe: Fill out samplers even if nr_samplers = 0 |
| - lavapipe: Set images_used in lvp_lower_pipeline_layout |
| - ntt: Don't gather samplers_declared twice |
| - panvk: Move CreateDescriptorSetLayout to per-arch |
| - panvk: Break descriptor lowering into its own file |
| - panvk: Add a buffer to each descriptor set |
| - panfrost,panvk: Make fixed_sysval_ubo < 0 mean compiler-assigned |
| - panfrost: Add some sanity checking for sysvals |
| - bifrost,midgard: Allow providing a fixed sysval layout |
| - panvk: Get rid of the per-pipeline sysvals BO |
| - panvk: Use a flat sysvals struct |
| - panvk: Stop calling lower_uniforms_to_ubo |
| - panvk: Put the sysval and push const UBOs at fixed indices |
| - panvk: Interleave UBOs with multiple descriptor sets |
| - panvk: Put SSBO addresses in the descriptor buffer |
| - vulkan: Start collecting enabled features in vk_device |
| - panvk: Enable robustBufferAccess |
| - panvk: Advertise VK_KHR_variable_pointers |
| - panvk: Add an elems field to panvk_buffer_view |
| - pavnk: Pass bind layouts to texture and image descriptor helpers |
| - panvk: Implement texture/image queries |
| - FIXUP: Use 16-bit things for texture sizes |
| - panvk: Re-arrange descriptor set functions |
| - panvk: Rewrite the write portion of vkUpdateDescriptorSets |
| - panvk: Set immutable samplers properly up-front |
| - panvk: Implement descriptor copies properly |
| - panvk: Drop panvk_descriptor |
| - panvk: Round FillBuffer sizes down to a multiple of 4 |
| - nir: Preserve metadata if remove_dead_derefs makes no progress |
| - nir: Add a var set version of lower_indirect_derefs |
| - mesa/st: Use nir_shader_instructions_pass for st_nir_lower_builtin |
| - mesa/st: Check deref modes in lower_builtin_instr |
| - mesa/st: Use lower_indirect_var_derefs in st_nir_lower_builtin |
| - nir: Put the builder first in lower_input_attachments helpers |
| - nir: Use nir_shader_instructions_pass in nir_lower_input_attachments |
| - bifrost: Run nir_lower_global_vars_to_local before nir_lower_vars_to_scratch |
| - panvk: So more nir_lower_tex before descriptor lowering |
| - panvk: Lower blending after lower_var_copies |
| - v3dv: Add a create_image_view helper for internal views |
| - vulkan: Only be clever about vk_image_view::view_format for normal views |
| - v3dv: Don't use color aspects for depth/stencil images |
| - v3dv: Drop the region temporary from blit_shader |
| - vulkan,v3dv: Add a driver_internal flag to vk_image_view_init/create |
| - radv: Only use PLANE_0 in meta when actually needed |
| - radv: Use vk_image as the base for radv_image |
| - radv: Use vk_image_view as the base for radv_image_view |
| - glsl: Drop this != NULL assertions |
| - vulkan/pipeline_cache: Implement deserialize for raw objects |
| - v3dv: Loosen an assert in copy_buffer_to_image_shader |
| - vulkan: Fall back to raw data objects when deserializing if ops == NULL |
| - lavapipe: Use the correct ICD path on Win32 |
| - vulkan,anv,turnip: Add a common CmdBindVertexBuffers wrapper |
| - radv: Add a sqtt entrypoint for CmdBindVertexBuffers2 |
| - radv: Use the common CmdBindVertexBuffers wrapper |
| - lavapipe: Use the common BindVertexBuffers wrapper |
| - nir/algebraic: Add two more pack/unpack rules |
| - intel/fs: Drop fs_visitor::emit_alpha_to_coverage_workaround() |
| - intel/fs: Copy color_outputs_valid into wm_prog_data |
| - anv: Drop alpha_to_coverage from the NULL FS optimization |
| - anv: Handle the null FS optimization after compiling shaders |
| - anv: Don't disable the fragment shader if XFB is enabled |
| - intel: Only set VectorMaskEnable when needed |
| - vulkan: Add a base struct for buffers |
| - anv: Use the base vk_buffer struct |
| - panvk: Use the vk_buffer base struct |
| - vulkan/runtime: Add min_lod to vk_image_view |
| - intel/isl: Add isl_view::min_lod_clamp for IVB+ |
| - anv: Implement VK_EXT_image_view_min_lod |
| - isl: Add some asserts about multisampled surfaces |
| - nir: Handle register sources in lower_phis_to_regs_block |
| - nir: Add a correctness note for nir_lower_phis_to_regs_block |
| - nir: Rename nir_gather_xfb_info to nir_shader_get_xfb_info |
| - nir: Add a nir_xfb_info to nir_shader |
| - anv: Use nir_shader_gather_xfb_info |
| - lavapipe: Use nir_shader_gather_xfb_info |
| - turnip: Use nir_gather_xfb_info |
| - radv: Use nir_gather_xfb_info |
| - nir: Drop nir_shader_get_xfb_info |
| - glsl/nir: Populate nir_shader::xfb_info after linking varyings |
| - glsl/nir: Stop leaking varyings_info |
| - glsl/nir: Stash the xfb_info in the nir_shader when linking XFB |
| - st,nir: Use nir_shader::xfb_info in nir_lower_io_passes |
| - nir/glsl: Use rzalloc for nir_xfb_info |
| - nir: xfb_buffer_info::stride is in bytes |
| - ttn: Populate the images/textures/samplers_used fields in shader_info |
| - vulkan,anv: Move the image offset/extent sanitize helpers to common code |
| - radv: Use the common image offset/extent sanitize helpers |
| - vulkan,radv: Move vk_format_get_plane_format to common code |
| - vulkan/format: Add a vk_format_get_aspect_format helper |
| - vulkan: Add a helper for image<->buffer copies |
| - anv: Use the common image<->buffer copy helper |
| - vulkan,radv: Steal some image offset/extent helpers from radv |
| - radv: Use vk_image_buffer_copy_layout |
| - panvk: Use vk_image_buffer_copy_layout |
| - lavapipe: Use vk_image_buffer_copy_layout |
| - nir/deref: Break out a helper for compare_deref_paths |
| - nir/deref: Use an index in compare_deref_paths |
| - nir/deref: Make compare_deref_paths take a stop callback |
| - nir/deref: Re-arrange variable checks in compare_deref_paths |
| - nir: Increase nir_variable_data::mode to 16 bits |
| - nir/vars_tests: Use nir_var_mem_global instead of ssbo |
| - nir/deref: Handle SSBO array bindings specially |
| - nir/deref: Handle RESTRICT for SSBO deref bindings |
| - compiler/types: Don't place members in the previous substruct's end padding |
| - anv/wsi: Stop resetting semaphores |
| - vulkan/wsi: Fix a signal_semaphore_with_memory check |
| - wsi: Always signal semaphores and fences in wsi_common_acquire_next_image |
| - panvk: Use the common AcquireNextImage implementation |
| - v3dv: Use the common AcquireNextImage implementation |
| - radv: Drop create_sync_for_memory |
| - lavapipe: Use the common AcquireNextImage implementation |
| - pvr: Use the common AcquireNextImage implementation |
| - turnip: Use the common AcquireNextImage implementation |
| - turnip: Use the common QueuePresent implementation |
| - turnip: Set supported_sync_types before wsi_init |
| - vulkan/wsi: Use a STACK_ARRAY for stage_flags |
| - vulkan/wsi: Reset fences earlier in wsi_common_queue_present |
| - vulkan/wsi: Re-arrange QueueSubmit code in wsi_common_queue_present |
| - vulkan/wsi: Set memory ownership after signaling fences and semaphores |
| - vulkan/wsi: Signal semaphores and fences from the dma-buf |
| - intel/compiler: Use NIR_PASS(_, ...) |
| - anv: Use NIR_PASS(_, ...) |
| - anv: Properly clamp attachment layer counts |
| - vulkan/wsi: Use HAVE_LIBDRM to detect DRM instead of !_WIN32 |
| - vulkan: Add some border color helpers |
| - panvk: Use the new border color helpers |
| - lavapipe: Use the new border color helper |
| - turnip: Use the new border color helpers |
| - vulkan/nir: Make spirv_data const in vk_spirv_to_nir |
| - vulkan: Add a vk_pipeline_shader_stage_to_nir helper |
| - gallium: Add a u_default_clear_buffer helper |
| - iris: Use u_default_clear_buffer |
| - panfrost: Use u_default_clear_buffer |
| - nir/opt_memcpy: lower copies to/from tightly packed types |
| - nir: Add an options parameter to deref_instr_has_complex_use |
| - nir/opt_memcpy: Add another case for function_temp |
| - vulkan/render_pass: Add a better helper for render pass inheritance |
| - anv: Use CmdBeginRendering for resumes in BeginCommandBuffer when possible |
| - vulkan/render_pass: Use a special layout for self-dependencies |
| - vulkan/render_pass: Pass sample locations to barriers |
| - vulkan/render_pass: Allow for mixed sample counts |
| - vulkan/render_pass: Support VkAttachmentSampleCountInfoAMD |
| - vulkan: Fix a comment |
| - vulkan: Add a base struct for descriptor set layouts |
| - vulkan: Add a common implementation of pipeline layouts |
| - lavapipe: Use the vk_descriptor_set_layout base struct |
| - lavapipe: Use the vk_pipeline_layout base struct |
| - panvk: Use the vk_descriptor_set_layout base struct |
| - panvk: Use the vk_pipeline_layout base struct |
| - vulkan: Depend on vk_pipeline_layout in vk_cmd_enqueue |
| - anv: Replace an assert() with unreachable() |
| - util: Re-indent util_sign_extend, comment, and add asserts |
| - util: Use shifts in util_sign_extend |
| - util: Use util_sign_extend in fast_idiv_by_const |
| - mesa/bptc: Use util_sign_extend |
| - util,nir: Move mask_sign_extend from opt_load_store_vectorize to util |
| - isl: Use util_sign_extend |
| - intel/decoder: Use util_mask_sign_extend() |
| - panfrost,asahi: Use util_sign_extend for unpacking |
| - nir: Use util_mask_sign_extend when serializing constants |
| - vulkan/wsi: Pass the size to MapMemory in the SW path |
| - vulkan/wsi/wayland: Use a single memcpy in the SW path |
| - vulkan/wsi: Move select_memory_type to common and rework it a bit |
| - vulkan/wsi: Add a helper for creating CPU images |
| - vulkan/wsi/x11: Use wsi_configure_cpu_image |
| - vulkan/wsi/wayland: Use wsi_configure_cpu_image |
| - vulkan/wsi: Delete SW support from configure_native_image |
| - vulkan/wsi: Persistently map CPU images |
| - vulkan/wsi: Compute stride and size in configure_buffer_image |
| - vulkan/wsi: Align buffer image strides to optimalBufferCopyRowPitchAlignment |
| - vulkan/wsi: Support tiled CPU images |
| - vulkan/wsi/x11: Don't leak shm_reply if we don't have dri3 or present |
| - vulkan/wsi/x11: Only use MIT_SHM if the device supports EXT_external_memory_host |
| - vulkan/wsi/wayland: Use host pointer import when available |
| - dozen: Increase optimalBufferCopy*Alignment |
| - dozen: Stop asking for semaphore/fence signaling |
| - vulkan/wsi/wayland: Only memcpy if the swapchain is actually software |
| - vulkan/wsi: Fix structure chaining in wsi_create_buffer_image_mem |
| - shader_info: Move subgroup_size out of cs and make it an enum |
| - vulkan: Re-order pipeline hashing |
| - vulkan: Constify vk_spirv_version |
| - spirv,vulkan: Set shader_info::subgroup_size |
| - mesa,glsl,ttn: Set subgroup_size to UNIFORM |
| - anv: Drop unnecessary parameters to anv_pipeline_compile_cs |
| - anv: Put a VkPipelineShaderStageCreateInfo* in anv_pipeline_stage |
| - anv: Use vk_pipeline_shader_stage_to_nir |
| - vulkan: Add a vk_pipeline_shader_stage_is_null() helper |
| - anv: Use vk_pipeline_shader_stage_is_null() |
| - intel,anv,iris,crocus: Drop subgroup size from the shader key |
| - anv: Implement VK_EXT_shader_module_identifier |
| - anv: Init supported extensions before WSI |
| - vulkan/wsi/wayland: Add a BUFFER_TYPE flag |
| - vulkan/wsi: Add debug variables to force the SW and PRIME buffer blit paths |
| - intel/blorp: Set uses_sample_shading for MSAA blit shaders |
| - radv: Set uses_sample_shading for copy shaders |
| - intel/fs: Use shader_info::fs::uses_sample_shading |
| - nir: Add a pass for lowering shaders to single-sampled |
| - intel/fs: Use nir_lower_single_sampled |
| - glsl/nir: Set uses_sample_shading from glsl_to_nir |
| - spirv/nir: Set uses_sample_shading from spirv_to_nir |
| - nir/gather_info: Stop gathering uses_sample_shading |
| - glsl/nir: Set sample_shading if a FS output ever shows up as an rvalue |
| - intel/fs: Simplify persample_dispatch |
| - intel/fs,anv: Rework handling of coarse and sample shading |
| - panfrost: Simplify sample_shading |
| - anv: Stop compacting surface state tables |
| - vulkan: Detect pNext chain loops in vk_foreach_struct() |
| - anv: Don't use the wrong ARRAY_SIZE |
| - Revert "vulkan: Detect pNext chain loops in vk_foreach_struct()" |
| - vulkan: Add a vk_limits.h file for runtime limits |
| - vulkan: Add an enum for all dynamic graphics states |
| - vulkan: Add standard sample locations |
| - vulkan: Add data structures to store all graphics state |
| - util: Constify __bitset_test_range |
| - vulkan: Add a common data structure for dynamic states |
| - vulkan: Add a common implementation of vkCmdSet* |
| - vulkan: Copy the depth/stencil state optimization code from ANV |
| - Use vk_foreach_struct_const where needed |
| - vulkan: Detect pNext chain loops in vk_foreach_struct() (v2) |
| - anv: Add an instance multiplier to anv_pipeline |
| - anv: Handle vertex buffer sizes in anv_CmdBindVertexBuffers2 |
| - anv: Stop recording sample locations per-sample-count |
| - anv: Convert to using vk_graphics_pipeline_state |
| - anv: Rework setting primitive topology |
| - anv: Stop merging DEPTH_STENCIL state |
| - anv: Switch to using common dynamic state tracking |
| - vulkan/graphics_state: Improve the depth/stencil optimization code |
| - anv: Do depth/stencil optimization for dynamic depth/stencil |
| - vulkan: Call lower_clip_cull_distance_arrays in vk_spirv_to_nir |
| - vulkan/nir: Don't remove dead XFB outputs |
| - vulkan: Call gather_xfb_info in vk_spirv_to_nir |
| - lavapipe: Use more Vulkan NIR heleprs |
| - vulkan: Allow up to 12 pointers in multialloc |
| - vulkan: Record shader stages in vk_graphics_pipeline_state |
| - vulkan: Multisample state isn't always included in fragment shader state |
| - vulkan: Depth/stencil isn't fragment output state but multisample is |
| - vulkan: Add a fully_dynamic_state_groups() helper |
| - vulkan: Fix pipeline libraries with dynamic-only VI or FSR state |
| - vulkan: Input assembly and depth/stencil can also be fully dynamic |
| - vulkan: Don't assert VkGraphicsPipelineLibraryCreateInfoEXT::Flags == 0 |
| - zink: Mark depth/stencil visual tests as flakes |
| - lavapipe: Use shader_info::uses_sample_shading |
| - lavapipe: Use the vk_graphics_pipeline_state |
| - vulkan: Append subpass structures to VkRenderingInfo last |
| - vulkan: Include self-dep info in rendering continues |
| - intel/eu: Don't throw validation errors on float MOV_INDIRECT |
| - lima: Lower undefs to zero and run DCE after from_ssa |
| - nir/from_ssa: Ignore undef sources |
| - vulkan: Always populate vk_render_pass_state::render_pass |
| - vulkan: Improve the docs for vk_subpass_info |
| - radv: Use both aspects for depth/stencil blit destinations |
| - vulkan: Dirty VP_VIEWPORTS/SCISSORS when copying viewports/scissors |
| |
| Jason Volk (4): |
| |
| - r600: Fix userspace pointer support for evergreen compute. |
| - r600: Improve compute memory pool performance; reduce fragmentation. |
| - r600: Elide downloads for discarded and immutable compute memories. |
| - radeon: Support shared memory user pointers. |
| |
| Jeffrey Knockel (1): |
| |
| - egl: dereference XCB drawable pointers |
| |
| Jesse Natalie (158): |
| |
| - d3d12: Don't block DISPLAY based on format for non-Windows |
| - wgl: Implement WGL_ARB_create_context_robustness |
| - d3d12: Hook up robustness extensions |
| - d3d12: Plug context leaks |
| - d3d12: Plug screen leaks |
| - d3d12: Attempt screen reset during context create |
| - d3d12: Add a unit test for context reset recovery |
| - microsoft/compiler: Unload DXIL validator library \*after* calling Release() |
| - d3d12: Correctly key off of polygon stipple enable cap |
| - mesa: Ensure ARB programs end in a newline |
| - bin: Add a script for filtering/sorting refcount logs |
| - u_debug_refcnt: Don't loop for initial refcounts if the initial value is huge |
| - u_debug_symbol: Use correct sizeof for DbgHelp |
| - u_debug_symbol: Allow re-init of DbgHelp to work around already-initialized issues |
| - u_debug_stack: Lock around stack dumps to prevent interleaving |
| - u_primconvert: Refactor to remove recursion from util_primconvert_draw_vbo |
| - u_primconvert: Handle take_index_buffer_ownership |
| - d3d12: Use a pipe_reference in d3d12_bo |
| - d3d12: Fix leaks in map with do-not-wait |
| - d3d12: Fix a couple over-releases from incorrect take-ownership flag |
| - d3d12: Destroy empty residency bo set during early-return |
| - d3d12: Include windows.h on Windows before dxgicommon.h |
| - d3d12: Don't use VLAs |
| - d3d12: MSVC warning around operator precedence causing uint32_t==bool |
| - dzn: Add new parameter to vk_image_view_init call |
| - d3d12: Fix forward decl for sw_winsys |
| - microsoft/compiler: Fixup sampler derefs in tex instrs that don't \*need* samplers |
| - nir_lower_task_shader: Fix return from lower_task_intrin (bool, not void*) |
| - d3d12: Re-add missed bitmasks to shader key comparison |
| - CI/d3d12: Add skips and update baselines |
| - CI/windows: Install Vulkan SDK from LunarG directly isntead of Chocolatey |
| - CI/windows: Delete comment for installing vulkan-runtime |
| - CI/windows: Disable chocolatey progress printing to shrink log size |
| - CI/windows: Disable LLVM CMake developer warnings to shrink log size |
| - CI/windows: Move --fraction for deqp-runner to run command line |
| - dzn: Update fails/flakes |
| - CI: Re-enable Windows builds |
| - nir: Consider PNTC to be a varying |
| - wgl, d3d12: Handle front buffer writes for double-buffered MSAA surfaces backed by swapchains |
| - meson: Use a C99 STDC_VERSION for flex |
| - d3d12: Report PIPE_CAP_ACCELERATED as 0 on WARP |
| - d3d12: Properly set HS input control point count |
| - dzn: Init sync types before wsi |
| - mesa: Use C11 alignas for cross-platform SSE support |
| - CI: Lima farm is offline |
| - dzn: Skip another test that times out occasionally |
| - util/disk_cache: Implement disk_cache_get_function_identifier for Windows |
| - microsoft/clc: Enable tests that pass on server 2022 |
| - microsoft/clc: Fix test double free in the case of compilation failure |
| - microsoft/clc: Remove dead image vars |
| - microsoft/clc: Add a unit test for unused image kernel args |
| - vulkan_entrypoints_gen: For Windows ARM64EC, prefix symbols with '#' |
| - dzn: Support Vulkan loader interface v5 |
| - gallium/util: Move u_dl and u_pointer to src/util |
| - dzn: Annotate sometimes-unused variables. |
| - dzn: Define COBJMACROS before any includes |
| - dzn: Use os_time_sleep instead of Win32 Sleep |
| - dzn: Use u_dl to load D3D12 |
| - dzn: Add an eventfd sync implementation |
| - dzn: Fix maybe-uninitialized warning |
| - dzn: Skip dxil validator and some Windows-only debug flags for Linux |
| - dzn: Use IUnknown instead of IDXGIAdapter1 as the stored adapter |
| - dzn: Don't hash adapter LUID as part of device UUID |
| - dzn: Use a custom adapter desc instead of DXGI adapter desc |
| - dzn: Move DXGI code to a separate file and only build it on Windows |
| - dzn: Add a DXCore enumeration path |
| - dzn: Add ABI helpers for D3D12 functions returning structs |
| - vulkan/wsi/win32: Use the new helpers and persistent map |
| - dzn: Fix winsys reporting |
| - dzn: Only support high/normal queue priorities |
| - dzn: Add for condition to break nested loop |
| - dzn: Re-order includes in dzn_nir to prevent TRUE redefinition |
| - dzn: Only add Werror flags that the compiler supports |
| - dzn: Disable Wunused-value for C++ files |
| - dzn: Fix comparison order for format enums |
| - dzn: Re-order dzn_physical_device_desc so = { 0 } doesn't initialize a struct |
| - dzn: Use fabs on floating point input |
| - CI: Add dzn to debian vulkan/clang/release builds |
| - mesa: Support D3D11/D3D12 memory imports |
| - mapi: Add more EXT_external_objects_win32 functions/enums |
| - gallium: Add a 'name' field to winsys_handle |
| - mesa: Implement ImportMemoryWin32NameEXT |
| - gallium: Add 'name' field to Win32 semaphore import |
| - driver_noop: Remove infinite recursion from create_fence_win32 |
| - gallium: Add a new fence type with a pipe cap to indicate it can be imported |
| - mesa: Support importing D3D12 fences as timeline semaphores |
| - mesa: Implement ImportSemaphoreWin32NameEXT |
| - gallium, mesa: Support setting timeline semaphore values |
| - d3d12: Store the rest of the device IDs in the screen |
| - d3d12: Compute UUIDs required by external objects extension |
| - d3d12: Add pipe getters for Win32 and base external objects device matching |
| - d3d12: Support B4G4R4A4 format |
| - d3d12: Get adapter LUID after device creation |
| - d3d12: PIPE_BIND_SHARED doesn't mean linear and is always on opened resources |
| - d3d12: Support creating memory objects |
| - d3d12: Implement resource_from_memobj |
| - d3d12: Support opening resources and memobj by name |
| - d3d12: Support importing fences / timeline semaphores |
| - d3d12: Implement server signal/wait |
| - d3d12: Implement fence opening and value setting |
| - d3d12: Fix up resource import validation |
| - docs: Update features.txt and new_features.txt |
| - u_atomic: Implement p_atomic_xchg for Windows |
| - u_atomic: Fix MSVC p_atomic_add_return |
| - u_atomic: Add p_atomic_fetch_add which returns the old value |
| - zink: Use p_atomic_fetch_add |
| - simple_mtx: Replace GCC sync intrinsics with u_atomic ops |
| - Fix static glapi on Windows |
| - dzn: Missed ABI fixes for GetCustomHeapProperties |
| - dzn: Remove the cast when the SDK version is high enough |
| - d3d12: Add helpers to build with correct ABI for MinGW |
| - microsoft/clc: Add helpers to build with correct ABI for MinGW |
| - subprojects: Point DirectX-Headers at a specific tag |
| - CI: Update DirectX-Headers dependency for MinGW/Debian |
| - util: Add a Win32 futex impl |
| - d3d12: Support clip halfz |
| - d3d12: Add a blank d3d12_resource_state.h/cpp |
| - d3d12: Move desired resource state to new files |
| - d3d12: Move current resource state to new files |
| - d3d12: Give bos a unique identifier to be used for state tracking |
| - d3d12: Add a list of contexts alive for the current screen |
| - d3d12: Hold lock when removing resources from residency list |
| - d3d12: Notify contexts about deletion of bos |
| - d3d12: Add a context state tracking structure |
| - d3d12: Treat depth/stencil as planar for plane count helper |
| - d3d12: Create/free context state entries |
| - d3d12: Track a global resource state for non-simultaneous-access resources |
| - d3d12: Record a state fixup command list when necessary |
| - d3d12: Swap the remainder of state tracking to new method |
| - d3d12: Rename bind invalidate options to transition flags |
| - d3d12: Extract core barrier logic |
| - d3d12: Optimize transition_subresource_states that covers a whole resource |
| - d3d12: Add a transition flag indicating that state accumulation is needed |
| - d3d12: Remove state tracking implementation details from header |
| - nir_lower_io_to_scalar: Support arrayed (per-vertex) I/O |
| - microsoft/compiler: Add a max shader model option |
| - microsoft/compiler: Add a max validator version |
| - microsoft/compiler: Emit metadata based on long-lived signature data |
| - microsoft/compiler: Split signature processing into two parts |
| - microsoft/compiler: Ensure 4-component position writes via NIR |
| - microsoft/compiler: Lower I/O to scalar |
| - microsoft/compiler: Support DXIL validator version 1.5 |
| - microsoft/compiler: Compute correct usage masks |
| - microsoft/compiler: Correctly compute dynamic indexing I/O masks |
| - microsoft/compiler: DXIL validator 1.6 counts UAVs differently |
| - microsoft/compiler: Set the barycentrics flag for attribute_at_vertex |
| - microsoft/compiler: DXIL validator 1.6 uses a new PSV struct version |
| - microsoft/compiler: DXIL validator 1.6 uses a new PSV resource struct |
| - microsoft/compiler: Support DXIL validator 1.6 |
| - microsoft/compiler: Support DXIL validator 1.7 |
| - microsoft/compiler: Add a getter for a validator version from DXIL.dll |
| - microsoft/compiler: Blacklist DXIL validator 1.6 from 20348 SDK |
| - d3d12: Retrieve validator version |
| - dzn: Retrieve validator version |
| - egl/wgl: Delete unused variables/code |
| - egl/wgl: Fix some awkward sizeof formatting |
| - microsoft/compiler: Discard shouldn't be marked readnone |
| - microsoft/compiler: Fix PSV struct when numthreads is 0 |
| |
| Jiang Feng (1): |
| |
| - virgl: Set res->maybe_busy to true when creating resources |
| |
| Jonathan Gray (2): |
| |
| - intel/dev: sync ADL-S pci ids with linux |
| - intel/dev: add RPL-S pci ids from drm-intel-next |
| |
| Jonathan Marek (2): |
| |
| - freedreno/registers: add a7xx registers for drm/msm kernel driver |
| - freedreno/registers: add missing varset="chip" for new enum values |
| |
| Jonathan Weinstein (1): |
| |
| - lavapipe: support instance rate zero for VK_EXT_vertex_attribute_divisor |
| |
| Jordan Justen (48): |
| |
| - intel/dev: Add device info for RPL-P |
| - vulkan/wsi/x11: Use atomic_int with sent_image_count |
| - drm-uapi/i915_drm.h: Update from drm-next (2022-04-28) |
| - intel/dev: Read hwconfig from i915 |
| - intel_dev_info: Add --hwconfig command line parameter |
| - iris: Fix assertion meant to only target the clear-color stride |
| - nir/divergence: handle more \*_intel intrinsics |
| - intel/dev: Add INTEL_PLATFORM_DG2_G12 |
| - intel/dev: Add DG2 G12 PCI IDs |
| - intel/dev: Enable first set of DG2 PCI IDs |
| - intel: Fix build of mi_builder_tests by including c99_compat.h |
| - intel: Build mi_builder_test whenever build-tests is set |
| - drm-uapi/i915_drm.h: Update from drm-next (2022-05-06) |
| - intel/gem: Add support for I915_ENGINE_CLASS_COMPUTE |
| - anv: Add support for I915_ENGINE_CLASS_COMPUTE in init_device_state() |
| - anv: Move STATE_BASE_ADDRESS programming into init_common_queue_state() |
| - anv, iris: Add support for I915_ENGINE_CLASS_COMPUTE |
| - anv, iris: Enable compute engine with INTEL_COMPUTE_CLASS=1 |
| - intel/dev: Add devinfo::mem to store i915 regions information |
| - anv: Use memory info from devinfo |
| - iris/bufmgr: Use memory info from devinfo |
| - intel/tools: Print memory info in intel_dev_info |
| - vulkan/wsi: Disable dma-buf sync file if ENOSYS is returned |
| - intel/compiler: Don't create vec4 reg-set for gen8+ |
| - intel/dev: Add vram.unmappable.size region info |
| - intel/tools: Print unmappable region info in intel_dev_info |
| - intel/dev: Add intel_vram_all_mappable() |
| - iris/resource: Assert that DG2 CCS buffers don't also try to set BO_ALLOC_SMEM |
| - iris/resource: Avoid mapping when not needed in iris_resource_init_aux_buf() |
| - iris/bufmgr: Add all_vram_mappable which is currently always true |
| - iris/bufmgr: Set mmap_mode to IRIS_MMAP_NONE for lmem in small-BAR mode |
| - iris: Set clear_color_unknown if the bo is not mappable |
| - anv/allocator: Add assert and TODO comment for future small BAR uapi |
| - iris/bufmgr: Add assert and TODO comment for future small BAR uapi |
| - drm-uapi/i915_drm.h: Update from drm-next (2022-07-22) |
| - iris/bufmgr: Add I915_GEM_CREATE_EXT_FLAG_NEEDS_CPU_ACCESS for vram mappable buffers |
| - intel/dev: Use i915 region probed_cpu_visible_size when non-zero |
| - intel/dev: Determine the amount of free vram using small BAR uapi |
| - intel/dev: Enable remaining DG2 and ATS-M device IDs |
| - anv: Fix PHYSICAL_DEVICE_MEMORY_BUDGET_PROPERTIES with large BAR |
| - intel/dev: Fill in system memory info when using INTEL_DEVID_OVERRIDE |
| - iris: Drop extra file-descriptor dup in iris_drm_screen_create() |
| - intel/pci_ids: Update ADL-S strings |
| - intel/pci_ids: Add 0x468b ADL-S PCI-id |
| - intel/pci_ids: Update ATS-M device names |
| - intel/pci_ids: Update DG2 device names |
| - intel/pci_ids: Add dg2 0x5698 pci-id |
| - intel/pci_ids: Drop non-upstream dg2 pci-ids |
| |
| Jose Maria Casanova Crespo (1): |
| |
| - v3dv/ci: increase fraction to 10 on v3dv ci jobs. |
| |
| Josh Billingsley (1): |
| |
| - driconf: add SD Gundam G Generation Cross Rays |
| |
| Joshua Ashton (1): |
| |
| - aco: Use movk for AddressHi bits in vertex prolog |
| |
| José Fonseca (6): |
| |
| - trace: Fix framebuffer state serialization. |
| - lavapipe: Prevent mapping buffers beyond their size. |
| - trace: Dump NIR. |
| - trace: Parse character data more efficiently. |
| - trace: Bring state dump up to speed. |
| - trace: Allow to control nir dumping via an environment variable. |
| |
| Juan A. Suarez Romero (37): |
| |
| - ci/v3dv: remove fixed test |
| - v3d: report the correct unsupported blit format |
| - v3d: do not tile 1D textures |
| - v3d: use surface format defined on pipe_blit |
| - gallium: add hook on getting canonical format |
| - v3d: define our own canonical supported formats |
| - v3d: enable GL_ARB_copy_image extension |
| - gallium: move get_canonical_format hook to pipe_screen |
| - v3dv: duplicate key on hashtable insert |
| - v3dv: store device_id on device init |
| - v3d/simulator: add support for AMD cards |
| - mesa: unref syncobj after wait_sync |
| - v3d: enable early Z/S clears |
| - v3d: disable early-Z on odd frame dimensions |
| - v3d: allow TFU blitting for single layer textures |
| - v3d: add hook to get on-disk shader cache |
| - v3d: add builtin support for white/black clamp-to-border |
| - v3dv: check BO mapping result |
| - docs: update extensions for V3D and VC4 |
| - ci: bump VK-GL-CTS to 1.3.2.0 |
| - v3d: fix blending for mixed RT formats |
| - v3d: expose GL_ARB_draw_buffers_blend |
| - v3d/ci: Add traces |
| - v3d: use function to initialize refcount |
| - v3d: save only required states in blitter |
| - v3d: release all color buffers on context destroy |
| - v3d: add ARB_polygon_offset_clamp extension support |
| - v3dv/ci: Update expected results |
| - v3d/ci: Add flake test |
| - v3d/v3dv/ci: update expected results |
| - v3d/ci: (partially) revert expected list |
| - v3d/ci: update expected list |
| - v3dv/ci: Update expected list |
| - v3d/vc4/ci: Fix typo in skipped list |
| - v3d/ci: enable ARB_texture_cube_map piglit tests |
| - vc4: properly restore vc4 debug option |
| - vc4: store tex sampler in proper register |
| |
| Kai Wasserbäch (1): |
| |
| - fix(gallivm): Replace LLVMConstF* with LLVMBuild* methods. |
| |
| Karmjit Mahil (26): |
| |
| - pvr: Fix clang-format errors caused by vk outarrays. |
| - pvr: Add PVR_WORKGROUP_DIMENSIONS define. |
| - pvr: Add initial implementation of vkCmdDispatch(). |
| - pvr: Add compute update shared kernel in vkCmdDispatch(). |
| - pvr: Add compute update kernel in vkCmdDispatch(). |
| - pvr: Update pvrsrvkm to fw 1.17 . |
| - pvr: Add csb enum helper funciton header. |
| - pvr: Write immutable descriptor words in vkAllocateDescriptorSets(). |
| - pvr: Fix incorrect samples to cr_isp_aa_mode conversion. |
| - pvr: Change compute hard coding infrastructure. |
| - pvr: Remove unused output_selects from vertex shader state. |
| - pvr: Add graphics pipeline hard coding infrastructure. |
| - pvr: Enable 33.15.11.3 (AXE-1-16M). |
| - pvr: Extend hard coding infra to allow per stage per device coding. |
| - pvr: Fix physical device limits. |
| - pvr: Handle vdm degen_cull_enable. |
| - pvr: Fix off by 1 error in buffer_id for ubo pds program. |
| - pvr: Rename loop iterator variable. |
| - pvr: Update pvrsrv build version for fixed size fw. |
| - pvr: Rename uniform pds program to descriptor pds program. |
| - pvr: Add static consts support in descriptor pds program. |
| - pvr: Add static consts to hard coding infra. |
| - pvr: Add initial vkCmdPipelineBarrier skeleton. |
| - pvr: Add IDF/WDF program for compute pipeline barrier. |
| - pvr: Add compute pipeline barrier handling. |
| - pvr: Fix cdm shared reg usage reported to fw. |
| |
| Karol Herbst (20): |
| |
| - nir: add 16 and 64 bit fisnormal lowering |
| - vtn: clamp SpvOpImageQuerySize dest to 32 bit |
| - nv50/nir: align tlsspace to 0x10 |
| - llvmpipe: PIPE_COMPUTE_CAP_GRID_DIMENSION is uint64_t |
| - llvmpipe/fence: make the fence id counter atomic |
| - iris/cs: take buffer offsets into account for CL |
| - nir: add a nir_remove_non_entrypoints helper |
| - ntt: make use of new samplers_used field |
| - clc: parse localSize and localSizeHint |
| - clc: drop parsingComplete check |
| - clc: speed up compilation by not relying on opencl-c.h |
| - clc: fix compiler features_macro CTS Test |
| - clc: disable opaque pointers until they are supported |
| - nv50/ir: recoginse Ampere |
| - nir/gather_info: allow to run it before inlining |
| - nir/gce: pin call instructions |
| - nouveau: use the contexts pushbuf and client where possible |
| - nouveau/mm: make code thread safe |
| - nv50/ir: fix OP_UNION resolving when used for vector values |
| - nv50: properly flush the TSC cache on 3D |
| |
| Ken Johnson (2): |
| |
| - d3d12: Reduce cost of shader key comparisons |
| - mesa/main: Correct alloca() use without #include "c99_alloca.h" |
| |
| Kenneth Graunke (41): |
| |
| - anv: Fix INTEL_DEBUG=bat on XeHP |
| - intel/decoder: Fix binding table pointer decoding with large offsets |
| - iris: Add FLUSH_HDC to PIPE_CONTROL_CACHE_FLUSH_BITS |
| - Revert "st/mesa: Transcode ASTC to BC7 (BPTC) where possible" |
| - intel: Drop Wa_1409226450 (stall before instruction cache invalidation) |
| - intel: Fix analysis invalidation in eliminate_find_live_channel |
| - intel/compiler: Move spill/fill tracking to the register allocator |
| - mesa: Split tx_compress_dxtn into per-format functions |
| - mesa: Avoid temp images in _mesa_texstore_rgb_dxt1 for GL_RGBA source |
| - mesa: Require temp images in _mesa_texstore_*_dxt* with SkipImages |
| - mesa: Avoid temp images in _mesa_texstore_*_dxt* for stride = 0 |
| - isl: Add an isl_format_supports_typed_atomics() helper. |
| - intel/compiler: Fix uncompaction of signed word immediates on Tigerlake |
| - intel: Set a more useful fake devinfo->gtt_size in no-hw mode |
| - intel/compiler: Fix A64 header construction with a uniform address |
| - iris: Update comment about 2GB dynamic state range |
| - iris: Use PIPE_* defines rather than ones from main/config.h |
| - crocus: Use PIPE_* defines rather than ones from main/config.h |
| - intel/compiler: Stop including src/mesa/main/config.h |
| - intel/compiler: Fix brw_gfx_ver_enum.h to be a proper header file |
| - intel/compiler: Split 3DPRIM_* defines out to a separate header. |
| - intel/tools: Stop malloc'ing device info in i965_disasm |
| - intel/compiler: Move opcode_desc handling to a separate header |
| - intel/compiler: Introduce a new brw_isa_info structure |
| - intel/compiler: Remove use of thread_local for opcode tables |
| - intel/compiler: Convert brw_eu.cpp back to brw_eu.c |
| - intel/compiler: Handle split-sends in EOT high-register pinning case |
| - intel/fs: Opportunistically split SEND message payloads |
| - intel/compiler: Avoid copy propagating large registers into EOT messages |
| - intel/compiler: Don't set SBID on EOT send messages |
| - mesa: Mark render-to-texture as unsafe if there's no pipe_resource |
| - intel/eu: Handle compaction when inserting validation errors |
| - intel/eu: Fix XeHP register region validation for hstride == 0 |
| - intel/eu: Clarify spec citations for XeHP region restrictions |
| - intel/eu: Mark header present in URB memory fences on XeHP |
| - intel/compiler: Lower FIND_[LAST\_]LIVE_CHANNEL in IR on Gfx8+ |
| - intel/compiler: Delete unused Gfx8+ code in brw_find_live_channel() |
| - iris: Fix PIPE_CAP_UMA |
| - iris: Use linear for exported resources if we can't convey tiling |
| - iris: Delete unused iris_screen::aperture_bytes field |
| - crocus: Fix memory leaks on iris_resource_create failure paths |
| |
| Konrad Dybcio (1): |
| |
| - freedreno: Enable A619 |
| |
| Konstantin Seurer (106): |
| |
| - radv: Fix ray queries with !15854 |
| - radv: Do not discard hits with t=tmax |
| - radv: Enable rt primitive culling for spirv2nir |
| - ci: Always install glslangValidator for build jobs |
| - radv: Use VAs in radv_meta_buffer.c |
| - radv: Make fill_buffer_shader non-static |
| - radv: Format radv_acceleration_structure.c |
| - radv: Add the fuchsia radix sort |
| - radv: Build accaleration structures using LBVH |
| - radv: Clean up the accel-struct build shaders |
| - radv: Fix lowering ignore_ray_intersection |
| - vulkan: Add a shader module clone helper |
| - radv: Copy shader modules to avoid use after free |
| - radv/radix_sort: Add missing entry points |
| - radv/radix_sort: Make variable names consistent |
| - radv: Use the entire morton code as sort key |
| - ac/nir: Do not set cursor in try_extract_additions |
| - radv: Move RADV_RT_STAGE_BITS to radv_private.h |
| - radv: Treat rt stages like compute stages |
| - radv: Use RADV_RT_STAGE_BITS more often |
| - nir: Add a ray launch size addr intrinsic |
| - radv: Allow indirect dispatches without bo |
| - radv: Indirect unaligned dispatches |
| - radv,aco: Use ray_launch_size_addr |
| - radv: Implement vkCmdTraceRaysIndirectKHR |
| - radv: Advertise indirect ray tracing |
| - radv: Allow radv_fill_buffer to work with VAs only |
| - radv: Use radv_fill_buffer for accel struct builds |
| - Revert "radv: Make fill_buffer_shader non-static" |
| - radv: Add and use radv_bvh_node_type |
| - radv: Replace nir_vector_extract with nir_channel |
| - radv/ci: Skip slow/crashing ray tracing tests |
| - radv: Enable ray queries by default |
| - radv: Add fail/flake lists for navi22 |
| - radv: Fix handling of primitiveOffset |
| - radv: Ignore transformOffset if transformData is 0 |
| - vulkan: Use the correct enum in vk_buffer |
| - radv: Use vk_buffer |
| - radv: Add and use radv_cp_dma_wait_for_stages |
| - radv: Some acceleration structure cleanups |
| - radv: Require an alignment of 64 for accel structs |
| - radv: Fix the accel struct size calculation |
| - vulkan: Spec update to 1.3.216 |
| - spirv: Add plumbing for ray_cull_mask |
| - radv: ray cull mask |
| - radv: Add accel struct queries for maintenance1 |
| - radv: ACCESS_2_SHADER_BINDING_TABLE_READ_BIT_KHR |
| - radv: PIPELINE_STAGE_2_ACCELERATION_STRUCTURE_COPY |
| - nir: Handle ray_launch_size_addr in opt_preamble |
| - radv: vkCmdTraceRaysIndirect2KHR |
| - radv: Advertise VK_KHR_ray_tracing_maintenance1 |
| - nir,spirv: Preserve inbounds access information |
| - nir/nir_lower_io: Optimize 32-bit inbounds access |
| - radv: Move some rt intrinsics to the top |
| - radv: Introduce an enum for the any hit status |
| - radv: Fix the t comparison for triangles |
| - radv: Refactor lower_rt_derefs |
| - nir: Add a nir_test_mask helper |
| - radv: Use nir_test_mask instead of i2b(iand) |
| - nir: Use nir_test_mask instead of i2b(iand) |
| - d3d12: Use nir_test_mask instead of i2b(iand) |
| - intel: Use nir_test_mask instead of i2b(iand) |
| - microsoft: Use nir_test_mask instead of i2b(iand) |
| - dozen: Use nir_test_mask instead of i2b(iand) |
| - radv: Use two bools for ahit_status |
| - radv: Skip pipeline_no_null_shaders_flag tests |
| - radv: Fix acceleration structure size queries |
| - radv: Use a global address for ray_launch_size |
| - radv: Use a global address for sbt_base |
| - radv: Only set rt stack size for dynamic stacks |
| - vulkan: Common vk_format_get_component_bits |
| - nir: Add a common gen_rect_vertices implementation |
| - radv: Use nir_gen_rect_vertices |
| - v3dv: Use nir_gen_rect_vertices |
| - radeonsi: Set vertex_id_zero_based |
| - aco: Remove dead nir_intrinsic_load_vertex_id case |
| - ac/llvm: Remove load_vertex_id handling |
| - util/format: Use an explicit length for the descs |
| - util/format: Assert that formats are valid |
| - radv: Remove format desc null checks |
| - radv: Use desc->format |
| - v3dv: Remove format desc null asserts |
| - turnip: Remove format desc null assert |
| - gallivm: Remove format desc null checks |
| - tgsi: Remove format desc null check |
| - util: Remove format desc null check |
| - agx: Remove format desc null check |
| - etnaviv: Remove format desc null check |
| - llvmpipe: Remove format desc null checks |
| - panfrost: Remove format desc null check |
| - r300: Remove format desc null check |
| - r600: Remove format desc null checks |
| - radeonsi: Remove format desc null checks |
| - softpipe: Remove format desc null check |
| - virgl: Remove format desc null checks |
| - pvr: Remove format desc null check |
| - util/format: Remove format desc null checks |
| - gallium/tests: Remove format desc null checks |
| - radv: Add meta saving/restoring for predicating |
| - radv: Use RADV_META_SUSPEND_PREDICATING |
| - radv: Simplify the meta init fail path |
| - radv: Fix out of bounds access in RTPSO creation |
| - vulkan: Fix descriptor set layout allocation scope |
| - vulkan: Handle descriptor set layout alloc fails |
| - radv: Use vk_descriptor_set_layout |
| - radv: Fix stack size calculation with stage ids |
| |
| Kristian Høgsberg (1): |
| |
| - util/u_debug: Use 'initialized' instead of 'first' |
| |
| Kyle Russell (1): |
| |
| - etnaviv: don't halve max_instructions for GC400 |
| |
| Leandro Ribeiro (1): |
| |
| - Revert "egl/wayland: deprecate drm_handle_format() and drm_handle_capabilities()" |
| |
| Lionel Landwerlin (120): |
| |
| - anv: use shadow surface for stencil input attachment on gfx7 |
| - intel: fix URB programming for GT1s |
| - intel/fs: require memory fence commit bit on Gfx9 |
| - anv: improve INTEL_DEBUG for submit |
| - anv/intel: add a new debug flag for stalling after every draw/dispatch |
| - anv: implement DEBUG_SYNC |
| - anv: allow getting the address of the beginning of the batch |
| - intel/fs: tidy up lower of ray queries |
| - ci/iris: mark another test as flaky |
| - anv: implement VK_EXT_primitives_generated_query |
| - nir/divergence: handle load_global_block_intel |
| - anv: fix INTEL_DEBUG=sync |
| - intel: fixup number of threads per EU on XeHP |
| - anv: remove unused enum |
| - anv: fix acceleration structure descriptor template writes |
| - anv: skip acceleration structure in binding table emission |
| - anv: reemit 3DSTATE_STREAMOUT after memcpy |
| - anv: fix clflush usage on utrace copy batch |
| - anv: limit clflush usage |
| - vulkan/runtime: 0-out pipeline cache object |
| - iris: use new kernel uAPI to compute video memory |
| - anv: fix invalid utrace memcpy l3 config on gfx < 11 |
| - anv: reset all dynamic state after secondary execution |
| - anv: add missing logic op set in pipeline dyn state |
| - Revert "anv: fix dynamic state emission" |
| - anv: rework sample location |
| - anv: never emit 3DSTATE_CPS in the pipeline batch |
| - anv: don't emit 3DSTATE_STREAMOUT in pipeline batch |
| - anv: don't emit 3DSTATE_WM in pipeline batch |
| - anv: don't emit 3DSTATE_BLEND_STATE_POINTERS in pipeline batch |
| - anv: don't emit 3DSTATE_DEPTH_BOUNDS in pipeline batch |
| - anv: don't emit 3DSTATE_VF_TOPOLOGY in pipeline batch |
| - anv: remove static_state_mask |
| - docs: explain state emission in Anv |
| - intel: fix stall debug option |
| - nir/cf: return cursor after insertion of cf_list |
| - nir/lower_shader_calls: don't use nop instructions as cursors |
| - nir/lower_shader_calls: don't insert code after break/continue |
| - nir/lower_shader_calls: put inserted instructions into a dummy block |
| - anv: fix primitives generated queries values |
| - intel/perf: add support for new opcodes in code generation |
| - intel/perf: store a copy of devinfo |
| - intel/perf: add support for dualsubslice count variable |
| - intel/perf: add support for GtSlice/GtSliceXDualsubsliceY variables |
| - intel/perf: stop overriding oa_format |
| - intel/perf: add OA A counter type |
| - intel/perf: add support new variable counting the number of EUs in slice0-3 |
| - intel/perf: add max vfuncs |
| - intel/perf: add new layout for Gfx12.5 products |
| - intel/perf: disable sseu setting on Gfx12.5+ |
| - intel/perf: deal with OA reports timestamp values on DG2 |
| - intel/ds: fix compilation |
| - u_trace/anv/iris: drop cs argument for recording traces |
| - intel/disasm: add missing handling of <1;1,0> |
| - util/debug: add an enable parsing helper |
| - util/perf: add options to enable/disable tracepoints |
| - anv: move internal RT shaders to the internal cache |
| - nir/divergence: handle load_scratch_base_ptr |
| - nir/divergence: handle load_ray_num_dss_rt_stacks_intel |
| - anv: don't expose EXT_border_color_swizzle on gfx7 |
| - intel/validator: validate dst/src types against devinfo support |
| - anv: fix index buffer emission |
| - vulkan/wsi: fix crash with debug names on swapchain |
| - intel/dev: add a helper to update memory info |
| - intel/dev: add a fallback when memory regions are not available |
| - anv: limit RT writes to number of color outputs |
| - anv: disable perf queries on non RCS engines |
| - anv: make apply_pipeline_layout/compute_push_layout visible to NIR debug |
| - anv: silence border color swizzle debug message |
| - clc: add new feature options for intel_clc |
| - intel/clc: enable fp16 & subgroups for GRL |
| - anv: disable non uniform indexing of UBOs |
| - anv: remove device features handling now in runtime |
| - anv: use the right helper to invalidate memory |
| - intel/fs: make sure memory writes have landed for thread dispatch |
| - intel/nir: temporarly disable opt_uniform_atomics for RT/CL |
| - anv/utrace: use a bo pool for utrace buffers |
| - intel/fs: ray query fix for global address |
| - nir/serialize: restore ray query variables |
| - isl: add new helper for format component compatibility |
| - anv: track if images can be fast clear with non-zero color |
| - anv: move gfx12+ CCS_E clear color restriction |
| - anv: deal with isl format swizzles for buffer views |
| - anv: validate image view lowered storage formats for storage |
| - anv: add a comment about handling buffer view swizzles on gfx7 |
| - intel: protect against empty invalidate ranges |
| - anv: skip flush/invalidate faster |
| - spirv: switch to uint64 for rayquery internal type |
| - anv: ensure tile flush before streamout writes |
| - intel/fs: Set NonPerspectiveBarycentricEnable when the interpolator needs it. |
| - anv: remove local computation of dynamic states |
| - anv: simplify dynamic buffer count in pipeline layout |
| - anv: break up anv_pipeline_compile_graphics() |
| - anv: split graphics nir loading |
| - anv: rename internal function for consistency |
| - anv: move CreateGraphicsPipelines to common code |
| - anv: move CreateComputePipelines to common code |
| - anv: move CreateRayTracingPipelines to common code |
| - anv: reorder & document fields of anv_graphics_pipeline |
| - anv: name non dynamic state fields correctly |
| - anv: fix primitive topology dynamic state emission on gfx7 |
| - anv: move restart index to gfx state |
| - anv: remove pipeline setting dynamic state |
| - intel/dev: deal with i915 unallocated_size on smem |
| - anv: allocate RT scratch in local memory |
| - anv: make use of the new smallbar uAPI |
| - anv: decode init batch with INTEL_DEBUG=bat |
| - intel/dev: move verification function to a header |
| - intel/tools/drm-shim: fixup eu_stride for topology |
| - intel/tools/stub: fixup parsing of --platform= |
| - intel/dev: add a test verifying that device override works |
| - intel/dev: remove INTEL_DEVID_OVERRIDE |
| - anv: fixup PIPE_CONTROL restriction on gfx8 |
| - anv: don't return incorrect error code for vkCreateDescriptorPool |
| - anv: fix GetPipelineExecutableStatistics for ray tracing pipelines |
| - anv: fixup assertions on lowered storage formats |
| - anv: fix assert in memory budget code when extension is not supported |
| - intel/fs: fix load_scratch intrinsic |
| - intel/fs: fixup scratch load/store handling on Gfx12.5+ |
| - intel/fs: fixup SEND validation check on overlapping src0/src1 |
| |
| Liviu Prodea (7): |
| |
| - dzn: Fix incompatible pointer type error affecting MSYS2 MINGW32 |
| - ci/MSVC: LLD is not really used anywhere |
| - ci/MinGW: LLD is not really used anywhere |
| - d3d12/dzn/spirv2dxil: Require version library |
| - Microsoft clc: strip lib prefix Otherwise OpenCLon12 ICD can't load it |
| - meson: Microsoft / maybe Intel CLC need the all-targets workaround just like clover |
| - meson: Only draw with llvm depends on native directly Tests, softpipe or AMD drivers don't depend on it directly |
| |
| Louis-Francis Ratté-Boulianne (4): |
| |
| - d3d12: Move d3d12_nir_lower_vs_vertex_conversion() to a common place |
| - microsoft/compiler: Add support for more scaled formats |
| - dzn: Allow some non-native formats to be used as vertex inputs |
| - dzn: Enable dEQP-VK.pipeline.vertex_input.* test cases |
| |
| Lucas Stach (37): |
| |
| - etnaviv: update headers from rnndb |
| - etnaviv: set VIVS_GL_VERTEX_ELEMENT_CONFIG depending on prim type |
| - etnaviv: don't supertile textures if supertiling is disabled via debug option |
| - etnaviv: fill all minor GPU features from the kernel |
| - etnaviv: update headers from rnndb |
| - etnaviv: correct bits per tile and clear value for HALTI5 |
| - etnaviv: use new PE pipe address states on >= HALTI0 |
| - etnaviv: use feature flag to determine which RS states to use |
| - etnaviv: use feature bit for one const src per instuction limitation |
| - etnaviv: clean up tiling setup in etna_compile_rs_state |
| - etnaviv: add support for big tile RS states |
| - etnaviv: properly set additional DEC400 compression states |
| - etnaviv: use feature bit to check for big tile support |
| - etnaviv: add tile size helper |
| - etnaviv: reinstate accidentially commented assert |
| - etnaviv: improve single buffer setup |
| - etnaviv: add linear PE support |
| - etnaviv: switch depth mode when linear PE is used |
| - etnaviv: expose real map buffer alignment |
| - etnaviv: allow mapped buffers during execution |
| - etnaviv: drm: always use hash to track BO index |
| - etnaviv: drm: don't cache mmap offset |
| - etnaviv: drm: make etna_bo_map thread safe |
| - etnaviv: drm: rename etna_drm_table_lock |
| - etnaviv: add function to get resource status |
| - etnaviv: rework resource status tracking (again) |
| - etnaviv: drop scanout assumption for resources allocated via modifier interface |
| - kmsro: add 'imx-lcdif' driver support |
| - etnaviv: move dummy BOs to screen |
| - etnaviv: add texture descriptor suballocator |
| - etnaviv: tex_desc: make error handling more consistent |
| - etnaviv: tex_desc: remove descriptor patch TODO comment |
| - etnaviv: fix use after free in async shader compile |
| - etnaviv: remove etna_align_up() function |
| - etnaviv: move checking for MC2.0 for TS into screen init |
| - etnaviv: use linear PE rendering only on properly aligned surfaces |
| - etnaviv: add debug option to disable linear PE feature |
| |
| M Henning (7): |
| |
| - nouveau/nir: Allow up to 6 nested joins |
| - nouveau: Emit cctl to flush L1 cache for atomics |
| - nouveau: Skip cctl for atomic counters in tgsi |
| - nouveau/nir: Split fewer 64-bit loads |
| - ttn: Make ttn_optimize_nir match gl_nir_opts |
| - nv50: Fix tlsSpace alignment |
| - nouveau: Drop C++03 compat code |
| |
| Manas Chaudhary (2): |
| |
| - panvk: Add PANVK_DEBUG_DUMP dump mappings |
| - panvk: Allow PAN_BO_INVISIBLE to be mapped with dump flag |
| |
| Manuel Stoeckl (4): |
| |
| - dri_interface: add XBGR16161616 format |
| - gbm: add GBM_FORMAT_[AX]BGR16161616 |
| - dri: fix incorrect fourcc for R16G16B16A16_UNORM format |
| - dri: add [ax]bgr16161616 to format lookup tables |
| |
| Marcin Ślusarz (40): |
| |
| - anv: invalidate all metadata in anv_nir_lower_ubo_loads |
| - intel/compiler: invalidate all metadata in brw_nir_lower_intersection_shader |
| - intel/compiler: invalidate metadata in brw_nir_initialize_mue |
| - compiler: add VARYING_SLOT_CULL_PRIMITIVE |
| - intel/compiler: handle VARYING_SLOT_CULL_PRIMITIVE in mesh |
| - intel/compiler: handle gl_Viewport and gl_Layer in FS URB setup |
| - anv: update task/mesh distribution with the recommended values |
| - intel/compiler: print name of the unhandled intrinsic |
| - anv: disable streamout before emitting mesh shading state |
| - intel/compiler: implement primitive shading rate for mesh |
| - anv: handle primitive shading rate for mesh |
| - intel/tools: add support for GEM_CREATE_EXT in intel_dump_gpu |
| - intel/tools: clean up mmio regs definitions |
| - intel/tools: drop wrappers around mmio regs macros |
| - intel/tools/aubinator: add support for 2 "new" subopcodes |
| - intel/tools/aubinator: list all platforms in help message |
| - intel/tools: add macros for gfx12+ variant of VCSUNIT0 |
| - nir: add load_mesh_inline_data_intel intrinsic to nir_divergence_analysis |
| - nir: add load_task_payload intrinsic to nir_divergence_analysis |
| - intel/tools: fix 32-bit build |
| - anv: remove redundant code calculating dynamic states mask |
| - anv: remove invalid copy/pasted comment |
| - anv: mask out not applicable state flags when setting up mesh pipeline |
| - intel/dev: fixup max_cs_workgroup_threads after hwconfig is read |
| - nir/opt_load_store_vectorize: handle task payloads |
| - intel/compiler: vectorize task payload loads/stores |
| - nir/lower_task_shader: insert barrier before/after shared memory read/write |
| - intel/compiler: assert that base is 0 for [load|store]_shared intrins |
| - intel: fix typos found by codespell |
| - intel/common: allocate space for at least one task urb |
| - intel/compiler: adjust task payload offsets as late as possible |
| - anv: disable injection of primitive shading rate for mesh |
| - intel/compiler: print shaders after nir_remove_unused_varyings |
| - nir: convert unused mesh outputs to shared memory |
| - intel/compiler: emit URB fences for TASK/MESH |
| - intel/compiler: there are 4 types of fences on gfx >= 12.5 |
| - intel/compiler: insert URB fence before task/mesh termination |
| - intel/compiler: use NIR_PASS more |
| - intel/compiler: fix mesh urb write regression |
| - anv: disable task redistribution |
| |
| Marek Olšák (311): |
| |
| - gallium: move drivers/radeon/* into drivers/radeonsi/ |
| - gallium: move radeon_winsys.h into gallium/include/winsys/ |
| - gallium/winsys: move {amdgpu,radeon_drm}_public.h contents into radeon_winsys.h |
| - gallium/ddebug: implement pipe_vertex_state callbacks |
| - nir: fix an uninitialized variable valgrind warning in nir_group_loads |
| - ac/surface: fix an addrlib race condition on gfx9 |
| - radeonsi: set more state bits for disabled binning |
| - radeonsi: don't compile monolithic PS for dual source blending |
| - radeonsi: add an SQTT workaround for chips with disabled RBs |
| - radeonsi: set PA_SC_EDGERULE according to bottom_edge_rule state |
| - radeonsi: initialize line stipple config registers |
| - radeonsi: fix ALPHA_IS_ON_MSB determination |
| - radeonsi: handle an out-of-bounds sampler/image index better |
| - radeonsi: don't treat skipped GS copy shader compilation as successful |
| - radeonsi: dump shaders after si_shader_binary_upload to fix printed LDS stat |
| - radeonsi: ignore AMD_DEBUG=nodcc if the texture is imported |
| - radeonsi: run nir_io_add_const_offset_to_base for TES/TCS as late optimizations |
| - radeonsi: scalarize IO instructions |
| - radeonsi: add pipe_stream_output_info into si_shader_context |
| - radeonsi: use nir_shader_get_entrypoint in si_nir_scan_shader |
| - radeonsi: apply key.ge.opt.kill_{outputs,pointsize,clipdistance} in NIR |
| - radeonsi: gather pipe_stream_output_info from NIR intrinsics |
| - radeonsi: move most "info" fields from si_shader_selector into si_shader_info |
| - radeonsi: move si_shader_info::stage into si_shader_selector |
| - ac: add ac_nir_optimize_outputs, a NIR version of ac_optimize_vs_outputs |
| - radeonsi: assign param export indices before compilation |
| - ac/llvm: remove LLVM pass ac_optimize_vs_outputs |
| - amd: document chips |
| - ac/surface/tests: generalize and extend gfx10 tests |
| - ac: add more non-shadowed registers to the lists |
| - ac/llvm: add AC_WAIT_EXP for ac_build_waitcnt |
| - ac/llvm: remove immoffset parameter from ac_build_tbuffer_store |
| - ac/llvm: remove immoffset parameter from ac_build_tbuffer_load |
| - ac/llvm: remove inst_offset parameter from ac_build_buffer_store_dword |
| - ac/llvm: use soffset correctly in ac_build_buffer_load |
| - ac/llvm: remove inst_offset parameter from ac_build_buffer_load |
| - ac/llvm: remove immoffset parameter from ac_build_tbuffer_load_byte/short |
| - ac/llvm: remove unused code from ac_llvm_build.c/h |
| - ac/llvm: don't use tbuffer_store as a fallback for swizzled stores |
| - ac/llvm: rename tbuffer -> buffer where tbuffer instructions are not generated |
| - ac,radeonsi: rework how scratch_waves is used and move it to ac_gpu_info.c |
| - ac/gpu_info: remove old and unused fields from radeon_info |
| - ac/llvm: remove unused LLVM helpers |
| - ac/llvm: don't create the target machine if the LLVM processor is unsupported |
| - radeonsi: remove an obsolete comment about LLVM 14 |
| - radeonsi: fail to create pipe_screen if LLVM doesn't support the GPU |
| - winsys/amdgpu: fix a mutex deadlock when we fail to create pipe_screen |
| - radeonsi/ci: add polaris11 baseline |
| - util: add pipe formats R10G10B10X2_SINT, B10G10R10X2_SINT, G16R16_SINT |
| - util: change util_format_snorm8_to_sint8 -> snorm_to_sint for radeonsi |
| - ac/gpu_info: rework how num_se is derived |
| - ac: remove really_needs_scratch, parameter from ac_parse_shader_binary_config |
| - ac/llvm: set the correct cache policy for sparse buffer loads |
| - ac/llvm: don't set ac_dlc in visit_intrinsic |
| - ac: support GR channel order in ac_choose_spi_color_formats |
| - ac,radeonsi: report SCALED formats as unsupported by samplers and color buffers |
| - radeonsi: always print the lower-case GPU name for radeonsi-run-tests.py |
| - radeonsi: cleanup local includes |
| - radeonsi: align the tessellation ring address to 2MB |
| - radeonsi: remove unused si_cs_shader_state::uses_scratch |
| - radeonsi: simplify setting COMPUTE_PGM_RSRC2.SCRATCH_EN |
| - radeonsi: code style fix |
| - radeonsi: add si_can_use_compute_blit |
| - radeonsi: don't report R64_*INT as a sampler format because it doesn't work |
| - radeonsi: don't allow broken compute image copies into A8R8_UNORM |
| - radeonsi: add color buffer support for 2_10_10_10 pipe formats |
| - radeonsi: add color buffer support for R9G9B9E5_FLOAT on gfx10.3 |
| - radeonsi: implement compute_copy_image between 1D_ARRAY and other texture types |
| - radeonsi: cosmetic changes in si_compute_copy_image |
| - radeonsi: assert format support in si_compute_copy_image |
| - radeonsi: don't expose formats that CB supports, but not shaders |
| - radeonsi: rename si_test_blit -> si_test_image_copy_region |
| - radeonsi/test_image_copy: remove whole image copies |
| - radeonsi/test_image_copy: remove linear and power-of-two size testing |
| - radeonsi/test_image_copy: remove special SDMA codepaths |
| - radeonsi/test_image_copy: simplify texture size generation |
| - radeonsi/test_image_copy: separate generating a random pipe_resource |
| - radeonsi/test_image_copy: separate printing pipe_resource into a function |
| - radeonsi/test_image_copy: test all interesting texture targets |
| - radeonsi/test_image_copy: test copying a block-compressed format |
| - radeonsi/test_image_copy: test mipmaps |
| - radeonsi/test_image_copy: test all formats |
| - radeonsi/ci: update sienna_cichlid results |
| - radeonsi: handle compressed formats in si_compute_copy_image |
| - radeonsi: improve the subsampled copy_image calculation |
| - radeonsi: remove compressed and subsampled gfx copy from resource_copy_region |
| - radeonsi: remove si_create_sampler_view_custom and related code |
| - radeonsi: remove si_create_surface_custom |
| - mesa: consider the sample count when choosing a texture format |
| - frontend/dri: allow swapped BGR->RGB channel order for MSAA color buffers |
| - amd/registers: hardcode GC base offsets in the json generator |
| - amd/registers: add gfx11 to the json generator |
| - amd/registers: add gfx11.json |
| - amd/registers: add gfx11-rsrc.json |
| - amd: enable gfx11 in header generator, fix drivers with renamed gfx6-10 defs |
| - amd: add gfx11 to packet definitions |
| - amd: import gfx11 addrlib |
| - amd: add Mesa-only addrlib changes |
| - ac: don't align VGPRs to 8 or 16 for gfx11 |
| - ac: implement ac_get_tbuffer_format for gfx11 |
| - ac: scratch buffer register changes for gfx11 |
| - ac: implement register shadowing for gfx11 |
| - ac/llvm: export mrt0 instead of null on gfx11 |
| - ac/llvm: don't set DLC on gfx11 because it means something else there |
| - ac/llvm: update pknorm and waitcnt for gfx11 |
| - ac/llvm: don't set GLC for stores on gfx11 |
| - ac/gpu_info: set cu_mask correctly for gfx11 |
| - ac/surface: gfx11 changes |
| - ac/surface: define gfx11 modifiers |
| - ac/surface: add gfx11 support to modifiers tests |
| - radeonsi/gfx11: increase the hw screen offset alignment |
| - radeonsi/gfx11: always allow DCC stores |
| - radeonsi/gfx11: expect packed threadID VGPRs |
| - radeonsi/gfx11: enable NGG-only draw paths |
| - radeonsi/gfx11: enable arbitrary DCC format reinterpretation |
| - radeonsi/gfx11: use the new TCS WaveID SGPR to compute vs_rel_patch_id |
| - radeonsi/gfx11: program inst_pref_size for graphics |
| - radeonsi/gfx11: buffer descriptor changes |
| - radeonsi/gfx11: add CB deltas |
| - ac,radeonsi/gfx11: remove FMASK loads |
| - ac,radeonsi/gfx11: set SWIZZLE_ENABLE correctly |
| - radeonsi/gfx11: don't set non-existent SPI_SHADER_USER_DATA_VS_x |
| - radeonsi/gfx11: don't set COMPR for exports, use 0x3 channel mask instead |
| - radeonsi/gfx11: implement attributes through memory |
| - radeonsi/gfx11: scattered register deltas |
| - radeonsi/gfx11: don't set non-existent CP_COHER_START_DELAY |
| - radeonsi/gfx11: emit SQ_NON_EVENT for tessellation at the end of IBs |
| - radeonsi/gfx11: don't use FLUSH_AND_INV_DB_META |
| - radeonsi/gfx11: TF_RING_SIZE changed to a per-SE size |
| - radeonsi/gfx11: limit MSAA color buffers to the RGBA channel order |
| - radeonsi/gfx11: update the initialization of SGPR0/1 registers for HS and GS |
| - radeonsi/gfx11: change LDS allocation granularity for PS |
| - radeonsi/gfx11: don't count the non-existent scratch_byte_offset SGPR |
| - radeonsi/gfx11: resolve MSAA using u_blitter |
| - radeonsi/gfx11: mark streamout as unimplemented for now |
| - radeonsi/gfx11: limit CP DMA to max 32KB sizes |
| - radeonsi: inline si_cp_dma_prefetch in si_draw_vbo for lower overhead |
| - radeonsi/gfx11: add a workaround for CB perf counters |
| - amd: add chip identification for gfx1100-1103 |
| - radeonsi: remove tautologies when setting CU_EN |
| - amd: rename ring_type --> amd_ip_type and match the kernel enum values |
| - amd: rename RING_* enums to AMD_IP_* |
| - amd: replace num_rings[type] with ip[type].num_queues |
| - ac/gpu_info: simplify HW IP querying |
| - ac/gpu_info: remove dead GDS query code |
| - ac/gpu_info: print all IP versions reported by the kernel |
| - ac/gpu_info: fix incorrect IP versions reported by the kernel |
| - radeonsi: report 10_10_10_2 scaled formats as unsupported on gfx6-9 as well |
| - radeonsi: report more missing/broken texture formats as unsupported on gfx6-9 |
| - ac/llvm: implement memory_barrier_tcs_patch properly to fix LLVM 15 |
| - radeonsi: wait before s_barrier in TCS epilog to fix LLVM 15 |
| - ac/llvm: inline emit_membar |
| - ac/llvm: replace LLVMBuildFence with ac_build_waitcnt(VSTORE) |
| - ac/llvm: add gl_shader_stage parameter into ac_build_s_barrier |
| - ac/llvm: move the gfx6 optimization for TCS barriers into ac_build_s_barrier |
| - ac/llvm: fix the remaining s_barriers for LLVM 15 |
| - radeonsi: add si_get_tcs_epilog_key |
| - ac/llvm: skip s_barrier if tess patches don't cross a wave boundary |
| - radeonsi: inline si_nir_build_llvm |
| - ac/llvm: remove ac_shader_abi::emit_outputs |
| - radeonsi: cleanups getting rid of the ambigous "epilogue" word |
| - radeonsi: get rid of the ambiguous "prologue" word |
| - radeonsi: try to group stage-specific code in si_llvm_translate_nir |
| - ac: replace 5 ac_shader_abi::load_* callbacks with 1 intrinsic_load callback |
| - amd: change chip_class naming to "enum amd_gfx_level gfx_level" |
| - radeonsi: rename RADEON_FLAG_UNCACHED -> RADEON_FLAG_GL2_BYPASS |
| - winsys/amdgpu-radeon: rework how we describe heaps |
| - winsys/amdgpu: handle RADEON_FLAG_ENCRYPTED as a heap modifier |
| - radeonsi: use PIPE_RESOURCE_FLAG_UNMAPPABLE and DRIVER_INTERNAL more |
| - winsys/amdgpu-radeon: clean up code related to heaps |
| - include: update amdgpu_drm.h |
| - radeonsi: use the new flag AMDGPU_GEM_CREATE_DISCARDABLE |
| - radeonsi: use AMDGPU_VM_PAGE_NOALLOC to disable MALL (infinity cache) |
| - radeonsi: fix a crash when failing to create a context |
| - radeonsi: print an error when failing to create a context |
| - radeonsi/gfx11: set BIG_PAGE for the attribute ring |
| - ac/gpu_info: silence a valgrind warning due to amdgpu_query_hw_ip_info |
| - radeonsi: remove GFX9_MERGED_NUM_USER_SGPR definition |
| - ac/gfx11: fix the scratch buffer |
| - radeonsi/gfx11: fix alpha-to-coverage with stencil or samplemask export |
| - radeonsi/gfx11: don't insert shader code for GS_PIPELINE_STATS_EMU |
| - radeonsi/gfx11: fix the value of VGT_GS_OUT_PRIM_TYPE at the beginning of IBs |
| - Revert "radeonsi/gfx11: limit MSAA color buffers to the RGBA channel order" |
| - radeonsi/gfx11: fix VM faults due to the attribute ring |
| - radeonsi/gfx11: optimize attribute stores |
| - radeonsi: merge all preamble states into one |
| - radeonsi: move CS preamble emission into the winsys |
| - winsys/amdgpu: initialize IB_PREAMBLE in advance |
| - radeonsi: fix resource_copy_region with ETC formats (e.g. for Stoney) |
| - Revert "frontend/dri: allow swapped BGR->RGB channel order for MSAA color buffers" |
| - Revert "mesa: consider the sample count when choosing a texture format" |
| - amd: rename fishes to Navi21, Navi22, Navi23, Navi24, and Rembrandt |
| - r300,r600,svga: save the FS constant buffer for u_blitter to fix a regression |
| - st/mesa: remove unused GENERIC input and output from the clear VS |
| - ac/gpu_info: always retile DCC on gfx10 and newer chips |
| - mesa: clamp UINT values greater than INT_MAX for glGetInteger |
| - mesa: change GL_MAX_COMPUTE_WORK_GROUP_COUNT to UINT |
| - mesa: change GL_MAX_UNIFORM_BLOCK_SIZE to UINT |
| - mesa: change GL_MAX_TEXTURE_BUFFER_SIZE to UINT |
| - mesa: change GL_MAX_SHADER_STORAGE_BLOCK_SIZE to UINT |
| - st/mesa: limit MaxComputeWorkGroupCount to INT_MAX - 1 due to failing tests |
| - st/mesa: round down MaxUniformBlockSize to a multiple of 4 for piglit |
| - gallium: rename PIPE_SHADER_CAP_MAX_CONST_BUFFER_SIZE to \*_BUFFER0_* |
| - gallium: add PIPE_CAP_MAX_CONSTANT_BUFFER_SIZE_UINT for UBOs |
| - gallium: rename PIPE_CAP_MAX_TEXTURE_BUFFER_SIZE->MAX_TEXEL_BUFFER_ELEMENTS_UINT |
| - gallium: rename PIPE_CAP_MAX_SHADER_BUFFER_SIZE -> \*_UINT |
| - radeonsi: increase the max compute LDS size to 64KB for gfx7+ |
| - radeonsi: report correct maximum compute grid sizes |
| - radeonsi: clamp against MAX_TEXEL_BUFFER_ELEMENTS correctly |
| - radeonsi: change max TBO/SSBO sizes again and rework max alloc size |
| - radeonsi: compute PIPE_CAP_MAX_TEXEL_BUFFER_ELEMENTS_UINT correctly |
| - radeonsi: set the max UBO size same as the max SSBO size |
| - winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+ |
| - radeonsi: print LDS size in bytes |
| - radeonsi: don't use info.gs.invocations if it's not GS |
| - radeonsi: fix uninitialized wait_mem_scratch_tmz |
| - radeonsi: remove unused dword from wait_mem_scratch |
| - radeonsi: fix polygon stippling without color and Z outputs (v2) |
| - radeonsi: restructure PS no-export fixups |
| - radeonsi: determine DB_SHADER_CONTROL in si_shader_ps |
| - radeonsi: set INTERPOLATE_COMP_Z to 0 to work around an EQAA bug |
| - radeonsi: add BREAK_BATCH at the beginning of IBs |
| - radeonsi: simplify how pipeline statistic offsets are computed |
| - radeonsi: rework how VS_STATE_BITS are set for VS, TES, and GS |
| - radeonsi: rename and regroup VS_STATE definitions |
| - radeonsi: move GS_STATE bits to the end to make space at the beginning |
| - radeonsi: rework how vs_state_bits is set and unpacked |
| - radeonsi: fix an NGG streamout hang with monolithic shaders |
| - radeonsi: fix a crash in gfx10_sh_query_get_result_resource |
| - radeonsi: unconditionally enable the streamout overflow query with NGG |
| - radeonsi: inline gfx10_emit_streamout_begin/end |
| - radeonsi: fix NGG streamout hang by allocating GDS in the right place |
| - radeonsi: remove streamout code from shaders if no streamout buffers are bound |
| - winsys/amdgpu: add a kernel GDS management workaround retrying on -ENOMEM |
| - radeonsi: allocate GDS only once per process |
| - radeonsi: allocate only 1 GDS OA counter for gfx10 NGG streamout |
| - ac,radeonsi: don't export null from PS if it has no effect on gfx10+ |
| - ac/gpu_info: clamp gart_size_kb and vram_size_kb to fix buggy kernel driver |
| - gallium/u_threaded: fix buffer busyness tracking |
| - ac/llvm: fix tcs_wave_id unpacking on gfx11 |
| - amd/gfx11: add PixelWaitSync packet fields |
| - radeonsi/gfx11: compile monolithic PS if it writes memory |
| - radeonsi/gfx11: allocate more space for pipeline statistics |
| - radeonsi/gfx11: synchronize correctly before setting SPI_ATTRIBUTE_RING_* |
| - radeonsi/gfx11: don't use memory for waiting for cache flushes |
| - radeonsi/gfx11: don't allocate unused wait_mem_scratch |
| - radeonsi/gfx11: drop the ES vertex count requirement |
| - radeonsi/gfx11: add missing register shadowing code |
| - radeonsi/gfx11: rework GDS streamout code to single-lane and enable streamout |
| - amd: update addrlib - trivial changes |
| - ac/surface: expose all 64K_R_X and 256K_R_X modifiers on gfx11 |
| - radeonsi: fix random PS wave size |
| - util: remove our redefinition of assert |
| - Change all debug_assert calls to assert |
| - util: remove now unused debug_assert |
| - ttn: set dest_type for TXQ |
| - gallium: add pipe_blit_info::dst_sample, update u_blitter to write that sample |
| - ac/gpu_info: rename info fields to num_cu, memory_bus_width, memory_freq_mhz |
| - ac/gpu_info: remove tabs |
| - ac/gpu_info: rework and extend device info to be more informative |
| - ac/gpu_info: use hw_ip::ip_discovery_version to set IP versions |
| - ac/gpu_info: use the kernel-reported GFX IP version to set gfx_level |
| - ac/gpu_info: remove vram_size and gtt_size in favor of \*_kb variants |
| - ac/gpu_info: remove redundant uvd_decode |
| - ac/gpu_info: remove redundant jpeg_decode |
| - ac/gpu_info: remove redundant uvd_encode |
| - ac/gpu_info: remove redundant vce_encode |
| - ac/gpu_info: remove redundant vcn_encode |
| - amd: require amdgpu DRM 3.2.0 from April 2016 |
| - gallium/radeon: require radeon DRM 2.45.0 from April 2016 |
| - radeonsi: remove workarounds for radeon DRM < 2.45.0 |
| - ac/gpu_info: use drm_amdgpu_device_info instead of amdgpu_gpu_info |
| - ac/gpu_info: remove amdgpu_gpu_info parameter from ac_query_gpu_info |
| - ac/gpu_info: remove unused has_unaligned_shader_loads |
| - gallium/radeon: require radeon DRM 2.50.0 (kernel 4.12) from July 2017 |
| - ac/gpu_info: require amdgpu DRM 3.15.0 (kernel 4.12) from July 2017 |
| - util/u_threaded: fix a CPU storage crash due to an invalid codepath taken |
| - glthread: don't ignore glPushAttrib/glPopAttrib when tracking GL_CULL_FACE |
| - glthread: don't sync on IsEnabled(GL_DEPTH_TEST) by tracking it in glthread |
| - util/format: add util_format_rgbx_to_rgba helper |
| - amd/addrlib: fix 3D texture allocation failures on gfx11 |
| - ac/surface: don't forbid 256KB swizzle modes on smaller gfx11 chips |
| - radeonsi: unify VGT_TESS_DISTRIBUTION programming |
| - radeonsi: move SI_MAX_VRAM_MAP_SIZE to si_debug_options.h |
| - radeonsi: call pipe->blit instead of util_blitter_blit after MSAA resolving |
| - radeonsi: make si_launch_grid_internal static |
| - radeonsi: add common helper si_launch_grid_internal_images that is more robust |
| - radeonsi: remove compute-based DCC decompression because it's broken |
| - radeonsi: don't do image stores with RGBX, L, LA, I, and SRGB formats |
| - radeonsi: follow shader_info.float_controls_execution_mode (mostly) |
| - radeonsi: add need_fmask_expand parameter into si_decompress_subresource |
| - radeonsi: check for 16-bit hw support instead of relying on options.fp16 |
| - radeonsi: move compute-related code from si_blit.c to si_compute_blit.c |
| - radeonsi: fold async_copy into the preceding conditional in si_blit |
| - radeonsi: cosmetic changes around do_hardware_msaa_resolve |
| - radeonsi/gfx11: use correct VGT_TESS_DISTRIBUTION settings |
| - radeonsi/gfx11: set SAMPLE_MASK_TRACKER_WATERMARK = 15 and clean up |
| - radeonsi/gfx11: use better PRIM_GRP_SIZE_GFX11 setting |
| - radeonsi/gfx11: skip code in si_update_shaders that has no effect |
| - radeonsi/gfx11: rename si_calc_inst_pref_size -> si_get_shader_prefetch_size |
| - radeonsi/gfx11: enable shader prefetch except for initial chip revisions |
| - radeonsi/gfx11: use a better workaround for the export conflict bug |
| - radeonsi: make various blit functions non-static |
| - radeonsi: allow texture_map to upload only 1 sample for MSAA instead of all |
| - radeonsi: add a randomized blit test |
| - radeonsi: move the no-AA small prim precision cull constant into an SGPR |
| - radeonsi: move small prim precision computation out of si_emit_cull_state |
| - radeonsi: don't assume that TC_ACTION_ENA invalidates L1 cache on gfx9 |
| - glthread: unbind framebuffers in glDeleteFramebuffers |
| - glthread: call _mesa_glthread_DeleteBuffers unconditionally |
| - ac/surface: disallow 256KB swizzle modes on gfx11 APUs |
| |
| Marek Vasut (3): |
| |
| - dri: Check buffer height and avoid overflow |
| - etnaviv: Rename etna_emit_tex() args |
| - etnaviv: Implement TXD |
| |
| Mark Collins (1): |
| |
| - tu: Clamp priority in DRM submitqueue creation |
| |
| Mark Janes (11): |
| |
| - intel: parse intel_measure environment without side effects |
| - intel: relax assertion in INTEL_MEASURE |
| - anv: fix performance bug in INTEL_MEASURE |
| - intel: provide a callback to clean up after intel_measure_gather |
| - iris: provide a callback to INTEL_MEASURE to clean up snapshots |
| - util: define helpful macros for compiler diagnostic features |
| - iris: pad all structures used in a shader key |
| - iris: reorder to minimize padding |
| - intel/compiler: pad all data structures used by shader cache keys |
| - intel/compiler: reorder shader cache keys to minimize padding |
| - intel/dev: drop warning for unhandled hwconfig keys |
| |
| Martin Roukala (né Peres) (27): |
| |
| - ci/b2c: fix the generation of the IMAGE_UNDER_TEST variable |
| - ci/b2c: bump the kernel to 5.17.1 |
| - ci/b2c: update b2c to enable monitoring support |
| - radv/ci: add a Van Gogh dEQP runner |
| - ci: add ACO_DEBUG to the list of variables to pass down for testing |
| - ci/radv: remove amdgpu.gpu_recovery=0 from the kernel cmdline |
| - radv/ci: reboot the DUTs when hitting a ring timeout |
| - radv/ci: reduce the execution timeout |
| - radv/ci: re-enable fossils testing |
| - radv/ci: enable the experimental support for mesh shaders |
| - radv/ci: update the list of flakes |
| - radv/ci: allow polaris10 to retry more times |
| - radv/ci: bump the timeout of vangogh/renoir |
| - radv/ci: update the vkcts expectations for the CI machines |
| - radv/ci: remove an expected fail from all families |
| - radv/ci: try to guess failure expectations for Fiji/Raven/Navi14/Navi22 |
| - ci/valve: do not extract the install tarball on the runner |
| - ci/valve: uprev boot2container |
| - Revert "ci/valve: uprev boot2container" |
| - radv/ci: test vkd3d on kabini |
| - ci/radv: move a test from the fail list to the flake list |
| - ci/radv: add a missing flake |
| - ci/radv: enable vkcts testing on kabini |
| - ci/x86: update to llvm 13 |
| - radv/ci: test the llvm backend on navi21 |
| - radv/ci: allow specifying a warning pattern |
| - radv/ci: do not reboot on soft-recovered hangs, just warn |
| |
| Matt Coster (20): |
| |
| - pvr: csbgen: Formatting pass (PEP-8 plus other minor changes) |
| - pvr: csbgen: Add a missing @staticmethod decorator |
| - pvr: csbgen: Make some loops more pythonic |
| - pvr: csbgen: Cleanup imports |
| - pvr: csbgen: Remove "dim" functionality from Group |
| - pvr: csbgen: Remove unused function parameters |
| - pvr: csbgen: Add typing information |
| - pvr: csbgen: Isolate "main" function with __name__ check |
| - pvr: csbgen: Replace map() and filter() with list comprehension |
| - pvr: csbgen: Fix "local variable may be undefined" lints |
| - pvr: csbgen: Add \*_unpack() functions for all generated struct types |
| - pvr: debug: Implement PVR_DEBUG variable |
| - pvr: csbgen: Make all generated enums unambiguous |
| - pvr: Add a const qualifier to inline pvr_csb_*() helpers |
| - pvr: Remove pvr_cmd_struct() macro |
| - pvr: Make pvr_cmd_pack() macro clearly internal |
| - pvr: Split out unioned structs from struct pvr_sub_cmd |
| - pvr: Implicitly assert that the correct sub-command type is present |
| - pvr: Split pvr_dev_addr_t into a separate header |
| - pvr: Add helper macros for creating pvr_dev_addr_t instances |
| |
| Matt Turner (3): |
| |
| - mesa: Drop dead #include "sparc/sparc.h" |
| - freedreno/ir3: Move the texture array coord fixup to nir |
| - freedreno/ir3: Pass 16-bit sampler coordinates when possible. |
| |
| Matti Hamalainen (7): |
| |
| - gallium/tools: fixes to option handling |
| - gallium/tools: add option for ignoring junk calls in trace dumper |
| - gallium/tools: reimplement tracediff completely in Python |
| - pytracediff: add per-line difference highlighting for blocks |
| - pytracediff: make -M ("method only") option print arguments for differing calls |
| - pytracediff: change how 'junk' calls are handled |
| - pytracediff: implement pager ('less') invocation internally |
| |
| Max Kellermann (2): |
| |
| - gallium/u_threaded: add missing reference counts for draw_multi slots |
| - gallium/u_threaded: fix offset calculation for draw_multi slots |
| |
| Michael Olbrich (1): |
| |
| - crocus: export GEM handle with RDWR access rights |
| |
| Michael Skorokhodov (2): |
| |
| - glsl: Fix ir_quadop_vector validation |
| - anv: Update line range |
| |
| Michel Zou (7): |
| |
| - gallium: fix unused symbols warnings |
| - meson: call run_command with check=true |
| - vulkan/wsi: fix missing unistd include |
| - zink: fix pointer size conversion warning |
| - lavapipe: allow null handleTypes |
| - meson: drop dladdr check on win32 |
| - util/xmlconfig: fix -Wmaybe-uninitialized warning |
| |
| Mihai Preda (28): |
| |
| - gallium: add opaque pointers shim for LLVM < 8.0 |
| - gallium: use LLVM opaque pointers in draw_llvm.c |
| - gallium/llvmpipe: use LLVM opaque pointers in lp_bld_interp.c |
| - gallium/llvmpipe: use LLVM opaque pointers in lp_bld_depth.c |
| - gallivm: use LLVM opaque pointers in lp_bld_arit.c |
| - gallivm: use LLVM opaque pointers in lp_bld_assert.c |
| - gallivm: use LLVM opaque pointers in lp_bld_format_aos.c |
| - gallivm: use LLVM opaque pointers in lp_bld_conv.c |
| - gallivm: use LLVM opaque pointers in lp_bld_coro.c |
| - gallivm: use LLVM opaque pointers in lp_bld_printf.c |
| - gallivm: use LLVM opaque pointers in lp_bld_flow.c |
| - gallivm: use LLVM opaque pointers in lp_bld_intr.c |
| - gallivm: use LLVM opaque pointers in lp_bld_format_s3tc.c |
| - gallivm: use LLVM opaque pointers in lp_bld_gather.c |
| - gallivm: use LLVM opaque pointers in lp_bld_nir_soa.c |
| - gallivm: use LLVM opaque pointers in lp_bld_sample.c |
| - gallivm: use LLVM opaque pointers in lp_bld_sample_soa.c |
| - gallivm: use LLVM opaque pointers in lp_bld_struct.c |
| - gallivm: use LLVM opaque pointers in lp_bld_tgsi_soa.c |
| - gallivm: use LLVM opaque pointers in lp_bld_tgsi_aos.c |
| - gallivm: LLVM opaque pointers small changes |
| - gallium: refactor a channel loop in draw_llvm.c |
| - Revert "gallivm: use LLVM opaque pointers in lp_bld_tgsi_soa.c" |
| - gallium: LLVM-15 contexts use non-opaque pointers |
| - gallivm: fix a few llvm non-opaque pointers |
| - gallivm: refactor a bit the cache access in view of LLVM opaque pointers |
| - gallivm: add lp_build_struct_get() variants that take the LLVM type |
| - gallivm: push LLVM version guard into assert |
| |
| Mike Blumenkrantz (634): |
| |
| - nir/lower_tex: fix rect queries with lower_rect set |
| - mesa/st: set normalized coords for RECT samplers if rects are unsupported |
| - zink: unset PIPE_CAP_TEXRECT |
| - zink: radv baseline fix |
| - zink: nv ci update |
| - zink: clamp max shader images to 32 |
| - zink: remove tcs patch slot map |
| - zink: disable EXT_extended_dynamic_state2 on AMDPRO |
| - zink: add baseline for amdpro |
| - zink: reject resource creation if format features don't match attachment |
| - zink: use mixed zs renderpass for depth read/write |
| - zink: unconditionally set line width on rasterizer state change |
| - zink: support restart with PIPE_PRIM_LINES_ADJACENCY if ext is available |
| - zink: fix extended restart prim types without dynamic state2 |
| - zink: do not create fences at all if timeline semaphores are supported |
| - zink: require timeline semaphores |
| - zink: remove batch lock |
| - zink: rename zink_query::batch_id |
| - zink: remove driver-based max_fences throttling |
| - zink: hook up VK_KHR_create_renderpass2 |
| - zink: only use VK_DEPENDENCY_BY_REGION_BIT if sync2 is available |
| - zink: require renderpass2 |
| - zink: require KHR_imageless_framebuffer |
| - zink: delete all non-imageless framebuffer code |
| - zink: remove framebuffer indirection |
| - zink: rename imageless framebuffer functions |
| - zink: remove the worst part of basic framebuffer support |
| - zink: remove ZINK_NO_TIMELINES |
| - zink: make a kopper debug print into an error |
| - zink: rework zink_kopper_update() assert |
| - zink: always bind gfx pipeline at the top of draw |
| - zink: store min required stride values on the vertex state |
| - zink: selectively disable dynamic vertex stride |
| - nir/fold_16bit_sampler_conversions: add a mask for supported sampler dims |
| - lavapipe: run nir_fold_16bit_sampler_conversions |
| - zink: set VK_QUERY_RESULT_WAIT_BIT when copying to qbo |
| - zink: NV_linear_color_attachment |
| - zink: nv ci update |
| - zink: fix synchronization when drawing from streamout |
| - zink: fix xfb counter buffer barriers |
| - zink: remove xfb_barrier flag |
| - zink: handle device-local unsynchronized maps |
| - util/draw: fix map size of indirect buffer in util_draw_indirect_read |
| - util/draw: handle draw_count=0 when reading indirect parameters |
| - util/draw: fix indirect draw count readback |
| - gallium: add PIPE_CAP_MULTI_DRAW_INDIRECT_PARTIAL_STRIDE |
| - zink: unset PIPE_CAP_MULTI_DRAW_INDIRECT_PARTIAL_STRIDE |
| - zink: move the kopper present fence to the displaytarget object |
| - wgl: support GL 4.6 |
| - zink: make unsupported blit formats an error |
| - zink: make update_framebuffer_state() public |
| - zink: rework clear_depth_stencil hook |
| - zink: add a stencil blit fallback path |
| - zink: fix tcs control barriers for use without vk memory model |
| - zink: fix semantics mask for compute control barriers |
| - zink: hook up VK_KHR_vulkan_memory_model |
| - zink: use vulkan memory model shader semantics for tcs barriers |
| - zink: always use vk mem model for shaders if available |
| - zink: fix memory_barrier intrinsic |
| - zink: fix stride=0 for dynamic state vertex stride checking |
| - zink: add synchronization for buffer clears |
| - mesa/st: clamp GL_RENDERBUFFER to GL_TEXTURE_2D for sparse queries |
| - aux/trace: make get_sparse_texture_virtual_page_size useful |
| - glsl/nir: set new_style_shadow for sparse tex ops as necessary |
| - zink: fix group memory barrier emission |
| - vulkan: bump layer api versions to current vk header version |
| - vk/cache: unbreak vk_pipeline_cache_create with flags |
| - kopper: add DISPLAY_TARGET bind for depth buffer |
| - zink: fix/improve swapchain surface info updating |
| - zink: fix up swapchain depth buffer geometry during fb update |
| - drisw: remove dead code |
| - kopper: always fetch and store drawable info |
| - kopper: move drawable geometry updating up in function |
| - kopper: store whether screen has dmabuf support |
| - kopper: copy a bunch of code for texture_from_pixmap |
| - zink: add supported present modes to kopper displaytarget |
| - zink: add a util function for finding swapchain resource |
| - zink: flag swapchains when updating fails |
| - zink: handle dead swapchains in acquire |
| - zink: unset image layout when killing swapchain |
| - zink: rework swapchain object check for acquires |
| - zink: fix return for zink_kopper_acquire |
| - zink: handle swapchain acquire failures more directly |
| - zink: flag swapchain resources as swapchains |
| - zink/kopper: add a mechanism for checking swapchain status |
| - zink: radv ci updates |
| - ac: remove gallivm header include |
| - kopper: add some error logging for pixmap->texture failure |
| - kopper: check for modifiers to use modifier functionality |
| - zink/kopper: add a function for determining if running on software |
| - kopper: check whether zink is using sw |
| - kopper: rename a confusing variable |
| - kopper: implement texture-from-pixmap for software (non-dmabuf) |
| - zink: fix dmabuf plane returns |
| - zink: export fd info for all 2d images |
| - zink: ignore KMS handle types |
| - zink: remove drm_fd |
| - llvmpipe: add a ci flake |
| - llvmpipe: disable statistic queries during draw if queries are disabled |
| - llvmpipe: remove misleading query comment |
| - llvmpipe: disable compute statistics queries if queries are disabled |
| - wgl: always set alpha on kopper windows |
| - zink: hook up VK_EXT_primitives_generated_query |
| - zink: pass query object to get_num_queries() |
| - zink: pass query object to get_num_query_pools() |
| - zink: pass query object to get_num_results() |
| - zink: pass screen param to convert_query_type() |
| - zink: add and use a function to detected emulated primgen queries |
| - zink: add a flag to zink_query to trigger rasterizer discard workaround |
| - zink: use VK_EXT_primitives_generated_query when available |
| - wgl: don't auto-load zink before software drivers |
| - lavapipe: lower quad_broadcast intrinsics |
| - zink: add a ci flake |
| - kopper: invalidate drawables when resizing textures in place |
| - kopper: pass the current context to dri_flush |
| - util/blitter: fix sampler restore with 0 saved samplers |
| - radv: emit fewer framebuffer registers |
| - zink: restore conditional ordering for query begin/end |
| - nir/gather_info: flag fbfetch on subpass image loads |
| - llvmpipe: never infer early zs tests when fbfetch is active |
| - zink: split renderpasses for TextureBarrierNV() usage |
| - zink: scale depth bias by factor of 2 |
| - zink: pass sparse backing page offset to binding function |
| - zink: pass sparse bind bo offset through for texture binds |
| - zink: set all usage flags when querying sparse features |
| - zink: fix multisample conditional in sparse image query |
| - zink: bump number of image binds that can be batched to 50 |
| - zink: semi-handle 1D sparse texture rewrites for drivers that don't support them |
| - zink: decouple descriptor templates from layouts |
| - zink: flatten push descriptor template into normal template array |
| - zink: implement indirect buffer indexing |
| - zink: delete nir_lower_dynamic_bo_access |
| - lavapipe: enqueue pipeline destruction |
| - zink: fix null buffer/surface formats |
| - zink: add more image usage for null surfaces |
| - zink: use descriptor surfaces for notemplates ref updating |
| - lavapipe: handle null samplerviews |
| - gallivm: fix ssbo oob reads |
| - lavapipe: fix CmdPushDescriptorSetWithTemplateKHR with refcounting |
| - st/draw_feedback: set constant buffer stride |
| - gallivm/draw: fix oob ubo reads |
| - llvmpipe: always set ssbo data pointers for draw |
| - gallivm: fix oob txf swizzling |
| - lavapipe: set robustBufferAccess2 and robustImageAccess2 |
| - lavapipe: ci updates |
| - docs: EXT_robustness2 for lavapipe |
| - zink: ci updates |
| - kopper: remove unused zink include |
| - zink: remove unused headers/struct members/linkage |
| - zink: outdent some code |
| - zink: add a flag to indicate whether a resource is exportable |
| - zink: break out resource bind-adding into separate function for reuse |
| - zink: introduce copy contexts |
| - zink: create a copy context for the screen on init |
| - Revert "zink: export fd info for all 2d images" |
| - zink: always check for fb rebinds when starting renderpass |
| - zink: unset pipe_resource::next pointers when creating resource copies |
| - zink: add a bind flag to indicate a resource is exported as a dmabuf |
| - zink: fix conditional for modifier usage |
| - zink: force memory exports if dmabuf bind is specified |
| - zink: add a LINEAR drm modifier if rebinding to add dmabuf export |
| - zink: rebind resources for export as needed |
| - zink: create images with modifiers any time there is an import handle |
| - zink: fix up sparse texture sampling for shadow samplers |
| - zink: clamp out min_lod operands for explicit lod ops |
| - zink: fix sparse binding for arrayed textures |
| - zink: set sparse flag in cubemap lowering |
| - zink: remove misleading sparse comment |
| - zink: fix sparse texture depth calcs for arrayed textures |
| - zink: remove implicit query resets |
| - zink: remove refs from shader images |
| - zink: reuse local res pointer in set_shader_images |
| - zink: simplify dumb update flagging in set_shader_images |
| - zink: read shader image r/w usage from incoming data struct |
| - zink: copy incoming shader image struct after doing updates |
| - zink: stop leaking shader image surfaces |
| - zink: fix 3d shader image miplevel calc for depth |
| - zink: directly reuse surface ivci when rebinding |
| - gallium/noop: implement pipe_screen::create_fence_win32 |
| - aux/trace: screen::create_fence_win32 |
| - zink: add win32 guards for device extensions with win32 in name |
| - lavapipe: ignore depth/stencil states if attachment is not present |
| - lavapipe: fix pipeline handling for dynamic render zs state with pipeline library |
| - spirv: fix barrier scope assert |
| - zink: manually validate shaders in debug builds |
| - zink: fix init with MESA_SHADER_CACHE_DISABLE enabled |
| - zink: use copy context to eliminate dependency on EXT_calibrated_timestamps |
| - zink: move get_timestamp to screen |
| - zink: add tu baseline for a630 |
| - zink: add an anv-tgl ci job |
| - zink: never create a sw context unless sw is requested |
| - zink: only reject non-blittable resource creation if it has no other binds |
| - zink: add separate error message for push descriptor set alloc fail |
| - zink: add extra validation for resource creation |
| - zink: remove input attachment usage when pruning surface usage |
| - zink: add extended usage and retry if resource creation fails |
| - zink: fix surface/bufferview cache comparisons |
| - zink: force render target usage for sampler-only resources |
| - zink: clamp 1D_ARRAY sparse textures to 2D as needed |
| - zink: handle PIPE_BUFFER sparse texture queries |
| - zink: fix non-dynamic vertex stride update flagging |
| - zink: add c++ guards for zink_screen.h |
| - zink: add ZINK_DEBUG=sync |
| - zink: update radv baseline |
| - zink: rename a fb surface variables |
| - zink: always use zink_batch_rp to init renderpass |
| - zink: use zink_batch_no_rp to end renderpasses |
| - zink: move renderpass handling to zink_render_pass.c |
| - zink: use global image rebind counter for dmabuf export |
| - zink: remove fb rebind check during renderpass begin |
| - zink: use local variable for getting rt barrier info |
| - zink: break out sample location updating to separate function |
| - zink: break up swapchain renderpass update to util function |
| - zink: remove renderpass state flag for swapchain |
| - zink: pass rt attrib to zink_render_pass_attachment_get_barrier_info() |
| - zink: fix indentation |
| - zink: break out color rt attrib analysis into util function |
| - zink: break out zs rt attrib analysis into util function |
| - zink: track whether there's any transient attachments bound |
| - zink: lift some renderpass mechanics up a level in the api |
| - zink: split out scissor flagging from framebuffer updating |
| - zink: use dynamic rendering (most of the time) |
| - turnip: fix assert for max xfb outputs |
| - zink: only warn once for SRGB fb without KHR_swapchain_mutable_format |
| - zink: clamp dynamic render layerCount to 1 |
| - zink: create all descriptor ref templates at once |
| - zink: add screen-based indexing to descriptor sets |
| - zink: use descriptor indices in compiler |
| - zink: use descriptor set index indirection in program init |
| - zink: use descriptor set index indirection in cache state updates |
| - zink: flag descriptor sets as changed before updating hashes |
| - zink: add a "compact" descriptor mode |
| - zink: use XOR for descriptor hash accumulation |
| - zink: radv ci updates |
| - zink: use a zink_render_pass_pipeline_state bit for fbfetch |
| - zink: add renderpass bits for color/depth r/w |
| - zink: update radv piglit fails |
| - vtn: assert that vector shuffle indices are in-bounds |
| - vtn: assert that composite members have the same bit size as the result |
| - zink: require draw params ext/cap in ntv if drawid is used |
| - zink: emit spirv cap for subgroup vote ops |
| - zink: use correct scope for subgroup vote ops |
| - zink: fix subgroup vote detection |
| - zink: lower subgroup width to 1 for unsupported subgroup vote stages |
| - zink: remove first_frame stalling |
| - zink: drop wideLines requirement |
| - zink: drop largePoints requirement |
| - st/pbo_compute: do pbo readback directly to the buffer object if it exists |
| - st/pbo_compute: use different calc for non-3d compute buffer sizing |
| - st/pbo_compute: make compute download conditional in shader slightly more readable |
| - st/pbo_compute: fix z coords for compute pbos |
| - zink: only infer modifiers if winsys handle has a stride |
| - zink: drop requirement for 6 descriptorsets to use bindless |
| - zink: store the real (non-compact) binding usage for programs |
| - zink: fix change flagging for compact descriptor cache |
| - zink: flag all cached descriptors as needing update on program change |
| - zink: add a #define for descriptor compaction |
| - zink: make swapchain kill message more descriptive |
| - st/texture: use base teximage for compute blits |
| - st/texture: allow compute blits for (some) non-finalized textures |
| - lavapipe: fix dynamic patchControlPoints |
| - mesa: throw a log warning any time a fallback texture is used |
| - zink: fix radv piglit fails |
| - zink: force queries to start/end out of renderpass |
| - zink: tu ci updates |
| - zink: reorder pipeline dynamic state1 substruct |
| - zink: improve packing for pipeline dynamic state1 info |
| - zink: break out dynamic state1 pipeline info to struct |
| - zink: use memcmp for comparing dynamic state1 info |
| - zink: use dynamic cull mode |
| - zink: break out pipeline dynamic state2 into struct |
| - zink: use dynamic patch vertices if available |
| - zink: expand patch_vertices pipeline key bitsize |
| - zink: fix framebuffer attachment usage asserts for dummy attachments |
| - aux/trace: fix sampler view dumping |
| - aux/trace: implement pipe_screen::is_compute_copy_faster |
| - zink: disable compute pbos on turnip |
| - egl/wayland: manually swap backbuffer when using zink |
| - egl/wayland: skip buffer creation on zink |
| - kopper: use get_drawable_info path for non-x11 drawables |
| - ci: disable unit tests |
| - zink: fix dmabuf plane layout struct scoping |
| - zink: represent plane offsets using offset from plane 0 vs size of plane |
| - zink: rename a variable |
| - zink: handle aux plane imports |
| - Revert "zink: remove drm_fd" |
| - zink: add back kms handling |
| - mesa: improve relocation problem message |
| - mesa/st: bump param reservation to 20 |
| - format_utils: properly parenthesize macro params |
| - d3d12: skip time-elapsed piglit tests in ci |
| - virgl: add some ci flakes |
| - mesa/st: bump param reservation to 28 |
| - mesa: add statevar for atomic counter offsets |
| - mesa: conditionally set constants dirty for atomic counter binds |
| - st/glsl_to_nir: call st_set_prog_affected_state_flags() as late as possible |
| - mesa: handle atomic counter lowering for drivers with big ssbo offset aligns |
| - zink: remove ANV depth clip control workaround |
| - zink: disable EXT_primitives_generated_query on turnip |
| - zink: track invalidation for image resources |
| - zink: flag renderpass for change if image resource changes valid state |
| - zink: rename renderpass attrib value |
| - zink: split out a dynamic render ternary |
| - zink: hook up surface invalidation to LOAD_OP_DONT_CARE |
| - zink: invalidate blit dsts if fully covered |
| - zink: remove buffer valid range tracking from blit |
| - mesa/streaming-memcpy: add memcpy fallback |
| - streaming-memcpy: move to src/util/ and compile unconditionally |
| - mesa/st: use util_streaming_load_memcpy for compute pbo copying |
| - zink: break out of zs mixed layout update loop when work is done |
| - zink: only update layout when doing mixed zs attachment renderpass check |
| - aux/trace: add pipe_context::link_shader |
| - aux/tc: handle link_shader |
| - zink: simplify some compiler bo code |
| - zink: fix buffer descriptor hashing |
| - zink: always create descriptor templates |
| - zink: flag optimization pass after inlining uniforms |
| - zink: break out nir alu typing in ntv |
| - zink: run nir_lower_alu_to_scalar in optimizer loop for 64bit lowering |
| - zink: always scalarize pack/unpack alu ops |
| - zink: don't lower pack/unpack intrinsics |
| - zink: run more int64 passes during optimization loop if int64 isn't available |
| - zink: fix ntv partial stores |
| - zink: flag all shader i/o variables as !is_xfb |
| - zink: fix variable locations in manual xfb emission |
| - zink: if a variable is flagged as being xfb, treat it as manual xfb output |
| - zink: improve/expand manual xfb emission |
| - zink: implement nir_op_unpack_64_2x32 in ntv |
| - zink: rewrite 64bit shader i/o as 32bit |
| - zink: scalarize when rewriting explicit 64bit io |
| - zink: avoid adding more 64bit alus during bo rewriting |
| - zink: fix 32bit bo rewriting |
| - zink: update tu a630 baseline |
| - zink: add new flakes |
| - zink: fix feature requirement |
| - zink: add feature req for border colors |
| - zink: more requirement docs |
| - zink: rewrite atomic ssbo intrinsics as atomic derefs |
| - zink: run copy_prop_vars during optimization |
| - zink: warn on missing customBorderColorWithoutFormat |
| - zink: more lavapipe glcts fails |
| - zink: cap driver inlining using ssa allocation limit |
| - zink: use separate pointer for swapchain presents |
| - zink: keep a count of async presents occuring for a given swapchain |
| - zink: defer old swapchain destruction |
| - zink: add implicit sync workaround for non-mesa drivers |
| - zink: unset res->swapchain upon killing a swapchain |
| - zink: fix up KILL to a more sensible log message |
| - lavapipe: VK_EXT_border_color_swizzle |
| - anv: VK_EXT_border_color_swizzle |
| - zink: fix dynamic stride conditional in pipeline creation |
| - zink: fix generated tcs update path |
| - zink: use the patch_vertices value from the shader key, not the context |
| - zink: unset generated tcs pointer from tes on deletion |
| - zink: fix generated tcs deletion |
| - zink: only add necessary binds during dmabuf export |
| - zink: avoid uninit values in renderpass state |
| - zink: don't print VK_EXT_shader_atomic_float warning for nir_intrinsic_image_deref_atomic_exchange |
| - zink: hook up VK_EXT_non_seamless_cube_map |
| - zink: use VK_EXT_non_seamless_cube_map when available |
| - zink: rename nonseamless struct members to emulate_nonseamless |
| - zink: skip some emulated cube code if the extension is present |
| - zink: fix cubemap lowering for array samplers |
| - zink: fix cubemap lowering bit size |
| - zink: clamp dynamic renderArea to framebuffer size |
| - zink: check for pending clears to determine write status of zs attachments |
| - zink: add a function for getting the minimum framebuffer layers |
| - zink: clamp dynamic fb layers better |
| - zink: clamp renderpass layers better |
| - zink: anv ci updates |
| - zink: use uint64_t for batch_id |
| - zink: remove thread_queue |
| - zink: use a dynarray for semaphore wait flags |
| - zink: add batch api for adding a wait semaphore |
| - zink: add semaphore handling for sparse binds |
| - zink: detect sparse-only binding queue |
| - zink: always bind on the sparse queue |
| - zink: only enable vtn debugging if validation is active |
| - zink: tu ci baseline update |
| - zink: radv baseline updates |
| - drisw: probe kms if fd is passed |
| - kmsro: add error message on drm ioctl failure |
| - zink: rename a variable |
| - zink: unify actual map calls in buffer mapping |
| - zink: use only the extents for buffer unmap flushes |
| - zink: fix buffer transfer flushes with offsets |
| - zink: store and reuse memory heap flags for buffer placement |
| - zink: move BAR allocation demotion up the stack |
| - zink: be a little selective about BAR fallback memory type |
| - zink: hook up VK_EXT_border_color_swizzle |
| - zink: use VK_EXT_border_color_swizzle |
| - zink: print runtime warning on missing EXT_border_color_swizzle |
| - mesa/st: add PIPE_CAP_DITHERING |
| - zink: unset PIPE_CAP_DITHERING |
| - mesa/st: increment num_uniforms for hw accel select shaders |
| - zink: move draw-time barrier generation down a little |
| - zink: track vertex buffer bind counts on resources |
| - zink: rework buffer barrier generation |
| - zink: track image binds |
| - zink: add a #define for vk shader bits |
| - zink: collect gfx stages for all bindings during barrier generation |
| - zink: don't short-circuit gfx stage finding during barrier generation |
| - zink: relax zink_resource_buffer_needs_barrier checks |
| - zink: ci updates |
| - glx/drisw: store the flush extension to the screen |
| - glx/drisw: invalidate drawables upon binding context if flush extension exists |
| - zink: fix dual_src_blend driconf workaround |
| - nir/lower_point_size: apply point size clamping |
| - mesa: break out PointSizeIsOne setting to util function |
| - mesa: rename PointSizeIsOne -> PointSizeIsSet |
| - mesa: skip pointsize exports if pointsize attenuation is enabled |
| - mesa: enforce pointsize exports if pointsize is being clamped |
| - mesa: explicitly disallow multiple pointsize exports from generating |
| - zink: ci updates |
| - zink: add flag to indicate if a resource is a dmabuf |
| - zink: flag dmabufs for foreign queue transition on flush_resource call |
| - lavapipe: zero out blend info if blend isn't enabled |
| - lavapipe: copy more pNexts for pipeline creation |
| - lavapipe: fix renderpass info handling during pipeline creation |
| - lavapipe: add a pipeline library assert |
| - lavapipe: skip post-copy pNext checking during pipeline creation for composites |
| - zink: use the bigger of the variable type and interface type for bo sizing |
| - zink: track ssbo bind counts |
| - zink: track overall samplerview bind counts |
| - zink: fix image bind counting |
| - zink: track gfx/compute descriptor barrier info |
| - zink: use tracked barrier info for generated barriers |
| - zink: remove another zink/tu fail |
| - zink: fix-ish depth clipping without VK_EXT_depth_clip_enable |
| - zink: add a turnip driver workaround for EXT_depth_clip_enable |
| - zink: remove swizzle from fbfetch lowering |
| - lavapipe: always set point_tri_clip |
| - zink: fix kopper_acquire return value |
| - zink: track whether current swapchain has data |
| - zink: move to AoS for storing swapchain struct data |
| - zink: move 'acquired' flag onto swapchain struct |
| - zink: use better determination for handling swapchain acquire submits |
| - zink: remove broken kopper assert |
| - zink: remove unnecessary code in kopper readback acquire |
| - zink: improve no-op acquire detection |
| - zink: flag batch as having work during frontbuffer flush |
| - zink: ensure pending present flushes are handled during frontbuffer flush |
| - zink: delete zink_resource_object::acquire |
| - zink: no-op pixmap frontbuffer flush calls |
| - zink: acquire swapchain images on image map |
| - zink: use kopper_displaytarget type directly |
| - zink: delete zink_resource_object::dt_has_data |
| - zink: set PIPE_RESOURCE_FLAG_DONT_MAP_DIRECTLY for device-local resources |
| - lavapipe: break out part of shader optimization into function |
| - lavapipe: streamline xfb shader setup |
| - nir/types: fix glsl_matrix_type_is_row_major() assert |
| - zink: store context flags |
| - zink: don't increment screen->num_contexts for copy context |
| - zink: disable turnip traces temporarily |
| - lavapipe: don't remove xfb outputs |
| - zink: enforce viewport depth clamping |
| - zink: size uniform_0 ubo accurately |
| - zink: further improve bo sizing |
| - zink: force unsized array in bo creation using the last block member type |
| - zink: move/refactor bo_vars struct creation in compiler |
| - zink: handle bitsizes in get_bo_vars() analysis |
| - zink: don't run nir_lower_uniforms_to_ubo repeatedly |
| - zink: pass zink_shader to optimize_nir during final optimize pass |
| - zink: add an optimizer pass to enforce bo bounds |
| - zink: use std430 packing by default |
| - zink: use a std430 interface for bo types |
| - mesa: remove incomplete texture warning |
| - mesa/st: make get_sampler_view_format() public |
| - mesa/st: add PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_FREEDRENO |
| - zink: init driver workarounds earlier in screen creation |
| - zink: disable customBorderColorWithoutFormat on turnip |
| - zink: don't warn for missing customBorderColorWithoutFormat on turnip |
| - zink: support PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_FREEDRENO |
| - zink: flush pending clears for fb texture barriers |
| - mesa: fix SignalSemaphoreEXT behavior |
| - lavapipe: don't overwrite entire VkFormatProperties3 struct |
| - lavapipe: don't crash on null xfb buffer pointer |
| - zink: reject swizzled format blits |
| - lavapipe: support inlined shader spirv for compute |
| - zink: allow multiple tex components for depth tg4 |
| - zink: always use 32bit sample ops |
| - zink: add a compiler pass to match up tex op dest types |
| - zink: use sampler_bind_count to simplify some code |
| - zink: break out samplerview layout reset code |
| - zink: always update sampler descriptor layouts on fb surface unbind |
| - zink: always use 32bit floats for so output types |
| - zink: handle bare matrix types in xfb emission |
| - zink: split xfb block emission from array/matrix handling |
| - zink: fix xfb array inlining |
| - zink: use right glsl length getter for ntv partial stores |
| - zink: bitcast extracted streamout components to uint before creating uvec |
| - zink: add a compiler pass to split xfb block outputs |
| - zink: fix xfb emit check in compiler |
| - mesa: move is_wrap_gl_clamp() to samplerobj.h and deduplicate |
| - mesa: track which sampler wrap params use GL_CLAMP |
| - mesa: update GL_CLAMP emulation when binding/unbinding textures |
| - zink: remove bogus range tracking from texture clear |
| - zink: fix transient attachment rp assert |
| - zink: remove format check from clear texture |
| - zink: fix coverage check for texture clears |
| - zink: stop using u_blitter for texture clears |
| - zink: improve zink_clear_depth_stencil check for current attachment |
| - zink: massively simplify zink_clear_depth_stencil |
| - zink: remove non-renderpass clear path from zink_clear_texture |
| - zink: remove u_blitter usage from zink_clear_render_target |
| - zink: always use storeOp=STORE for depth renderpass |
| - zink: track a bitmask of fb attachments with mismatched layer counts |
| - zink: add explicit (awful) handling for fb layer mismatch clears |
| - zink: clamp color clear values based on format |
| - zink: lift clearing on fb state change up a level |
| - zink: remove out-of-renderpass clears |
| - zink: delete zink_fb_clear_util_unpack_clear_color |
| - zink: delete srgb tracking for clears |
| - zink: simplify zink_framebuffer_clear_data union |
| - zink: inject a 0,0,0,1 clear for RGBX formats |
| - zink: break out tc/trace context unwrapping |
| - zink: nv baseline update |
| - zink: tu a630 baseline update |
| - mesa/st: add implicit zeroing of clipdistance array |
| - zink: move descriptor_mode to be a global |
| - zink: delete ZINK_DESCRIPTOR_MODE_NOFALLBACK |
| - zink: add an explicit ZINK_DESCRIPTOR_MODE_CACHED |
| - zink: default to lazy/templated descriptors |
| - zink: handle max_vertices=0 in geometry shader |
| - zink: handle null samplerview in get_imageview_for_binding() |
| - zink: fix crashing on bo allocation failure |
| - zink: improve oom error message |
| - zink: call bind_last_vertex_stage() when binding vs |
| - zink: fix viewport count tracking |
| - zink: use gfx_barrier for synchronization when binding buffer descriptors |
| - vulkan/wsi: fix multiple acquires for sw without mit-shm |
| - llvmpipe: ignore multisample state for alpha_to_one |
| - zink: don't try to start a renderpass before CmdClearAttachments |
| - zink: don't flag renderpass change on depth write change as often |
| - zink: flag renderpass change if renderpass clears change |
| - zink: do dynamic render implicit clear updates on rp_changed |
| - zink: unflag ctx->rp_changed after dynamic render update |
| - zink: remove redundant renderpass update check for clears |
| - zink: add an assert to verify that deferred clears aren't added in renderpass |
| - zink: split out rp_changed to be more granular for dynamic render |
| - zink: break out renderpass attachment load op calc |
| - zink: break out zs renderpass layout calc |
| - zink: break out color rt layout calc |
| - zink: reuse rt layout functions to refactor zink_render_pass_attachment_get_barrier_info() |
| - zink: add a bits union for zink_render_pass_state comparisons |
| - zink: improve granularity of renderpass switching |
| - zink: allow no-op renderpass updates in zink_batch_rp() |
| - zink: move layout-setting for dynamic render attachments after no-op case |
| - zink: handle null samplerview/image rebinds more gracefully |
| - zink: scale number of shader cache-get threads |
| - zink: invoke descriptor_program_deinit for programs on context destroy |
| - vulkan/wsi: return VK_SUBOPTIMAL_KHR for sw/x11 on window resize |
| - zink: add env var to abort on device-lost if no reset callback is set |
| - util/vbuf: handle multidraws |
| - anv: disable dev.i915.perf_stream_paranoid=0 warning |
| - lavapipe: propagate xfb info for pipeline library rasterization stages |
| - zink: remove deqp fails for lavapipe |
| - zink: only flag fbfetch as rp update if fbfetch wasn't already configured |
| - zink: assert !rp_changed after renderpass has started |
| - zink: avoid crashing when bindless texture updates aren't used |
| - lavapipe: EXT_non_seamless_cube_map |
| - lavapipe: accurately set image/ssbo access based on shader usage |
| - lavapipe: implement extreme uniform inlining |
| - lavapipe: remove some skips |
| - zink: add a ci flake |
| - lavapipe: move vertex input dynamic states into vertex input block |
| - lavapipe: make binding/layout helper functions static inline |
| - lavapipe: use correct offset calculation for flagging ssbo/image write access |
| - lavapipe: use 64bit masks for shader access |
| - lavapipe: set image access on pipeline bind, not descriptor bind |
| - llvmpipe: remove weird clamping for format support with sample counts |
| - llvmpipe: allow resource_copy_region to perform replicate operations |
| - lavapipe: fix stencil clears |
| - llvmpipe: add a shortcut for blitter resolves |
| - vulkan: hook up VK_EXT_multisampled_render_to_single_sampled |
| - lavapipe: VK_EXT_multisampled_render_to_single_sampled |
| - lavapipe: advertise VK_EXT_multisampled_render_to_single_sampled |
| - util/blitter: respect PIPE_TEXTURE_RECT |
| - lavapipe: fix shader access stage conversion from mesa -> pipe |
| - zink: stop ORing unordered barrier access |
| - zink: convert ZINK_DEBUG to enum |
| - zink: add doc for ZINK_DEBUG=compact |
| - zink: add ZINK_DEBUG=noreorder |
| - zink: zink_resource::unordered_barrier -> unordered_exec |
| - zink: explicitly unset unordered_exec when resource can't be unordered |
| - zink: rework unordered_exec handling |
| - zink: add a second resource param to get_cmdbuf() for unordered promotion |
| - zink: promote image/buffer copy operations to unordered |
| - zink: move buffer/image functions down in file |
| - zink: expand unordered_exec |
| - zink: make get_cmdbuf() public |
| - zink: enable unordered blit/resolve ops |
| - virgl: more ci flakes |
| - d3dadapter: fix sw pipe loading |
| - nine: set view_mask=0 |
| - zink: fix gfx program cache pruning with generated tcs |
| - zink: don't flag lazy push constant set dirty on batch change |
| - zink: combine loops for lazy descriptor program deinit |
| - zink: handle mutable swapchain images with dmabuf |
| - zink: store VkFormatFeatureFlags on creation |
| - zink: use modifier feature flags during surface creation when necessary |
| - zink: init cache_put program fence on program creation |
| - zink: force flush clears on fb change if fb geometry changes |
| - zink: force a new framebuffer for clear_depth_stencil if the clear region is big |
| - zink: drop mode_changed check from linewidth/depthbias draw updates |
| - zink: handle unscaled depth bias from nine |
| - zink: handle !half_pixel_center |
| - zink: fix return for PIPE_CAP_DEPTH_CLIP_DISABLE |
| - nine: check return on resource_get_handle |
| - zink: demote dmabuf tiling to linear if modifiers aren't supported |
| - zink: use modifier_aspect to check for modifier plane in zink_resource_get_param |
| - zink: don't add modifiers if EXT_image_drm_format_modifier isn't present |
| - zink: add all format modifiers when adding for dmabuf export |
| - zink: don't fixup sparse texops |
| - vulkan: Update the XML and headers to 1.3.224 |
| - radv: fix return type for meta resolve shaders |
| - nir/validate: clamp unsized tex dests to 32bit |
| - mesa: fix blending when using luminance/intensity emulation |
| - mesa: require render target bind for A/L/I in format selection |
| - vk/render_pass: don't deref null resolve attachments |
| - zink: handle nir_intrinsic_sparse_residency_code_and mechanics |
| - zink: ignore nir_texop_lod for tex dest matching |
| - tu: fix invalid free on alloc failure |
| - zink: don't call util_queue_fence_init in zink_screen_get_pipeline_cache() |
| - zink: don't emit entrypoints for function temp variables |
| - zink: check the variable mode before taking samplemask path in ntv |
| - zink: explicitly use unsigned types for bit shifts |
| - zink: only add srgb mutable for images with modifiers |
| - zink: don't emit illegal interpolation |
| - llvmpipe: don't assume pipe_context is always available in flush_frontbuffer |
| - zink: flag all assigned output slots as mapped |
| - zink: handle split acquire/present |
| - st_pbo/compute: fix 1D_ARRAY offsets |
| - st_pbo/compute: fix 1D coord dimension by pre-trimming vectors |
| |
| Mike Lothian (1): |
| |
| - util: Fix invalid usage of alignas in u_cpu_detect.c |
| |
| Mykhailo Skorokhodov (3): |
| |
| - anv: Use sampleLocationsEnable for sample locations |
| - iris: Move Wa_1806527549 and enable by default |
| - anv: Move Wa_1806527549 and enable by default |
| |
| Nagappa Koppad, Basanagouda (1): |
| |
| - iris:Duplicate DRM fd internally instead of reuse. |
| |
| Nanley Chery (11): |
| |
| - iris: Return a 64B stride for clear color plane |
| - intel/isl: Add a score for I915_FORMAT_MOD_4_TILED |
| - isl,iris: Add DG2 CCS modifier support for XeHP |
| - mesa: Simplify UNORM8 conversions for sRGB ASTC textures |
| - mesa: Follow ASTC decode mode extension for RGBA8 output |
| - iris: Don't leak surface states for compressed resources |
| - iris: Don't leak compressed resources in iris_create_surface |
| - iris: Use fill_surface_states for compressed resources |
| - anv: Make the D16 reg mode single-sampled |
| - iris: Make the D16 reg mode single-sampled |
| - iris: Dedent enum iris_depth_reg_mode |
| |
| Nicolas Caramelli (1): |
| |
| - egl: Fix EGL_EXT_platform_xcb name string to match the registry |
| |
| Panagiotis Apostolou (1): |
| |
| - util: Don't block SIGSEGV for new threads |
| |
| Patrick Lerda (1): |
| |
| - panfrost: Fix unwanted valgrind message related to restart_index |
| |
| Paulo Zanoni (4): |
| |
| - iris: fix race condition during busy tracking |
| - intel/compiler: compute int64_options based on devinfo->has_64bit_int |
| - intel/compiler: split handling of 64 bit floats and ints |
| - intel/compiler: call ordered_unit() only once at update_inst_scoreboard() |
| |
| Pavel Asyutchenko (7): |
| |
| - gallium: add PIPE_CAP_FBFETCH_ZS and expose extension |
| - glsl: add language support for GL_ARM_shader_framebuffer_fetch_depth_stencil |
| - nir_to_tgsi: Don't count ZS fbfetch vars as outputs |
| - llvmpipe: enable per-sample shading when FB fetch is used |
| - llvmpipe: simplify early/late zs tests selection |
| - llvmpipe: implement FB fetch for depth/stencil |
| - llvmpipe: enable PIPE_CAP_FBFETCH_ZS |
| |
| Pavel Ondračka (13): |
| |
| - r300: set PIPE_BIND_CONSTANT_BUFFER for const_uploader |
| - r300: skip draws instead of using a dummy vertex shader |
| - r300: fix vertex shader control flow in loops |
| - r300: deduplicate common NIR options |
| - r300: be less agresive with copy propagate in loops |
| - r300: don't try to use inline constants instead of constant swizzles |
| - r300: Update list of RV515 dEQP failures and add some flakes |
| - r300: merge simple movs with constant swizzles together |
| - r300: expose PIPE_CAP_LEGACY_MATH_RULES |
| - r300: only run merge_movs pass on R500 |
| - r300: Keep rc_rename_regs() from overflowing RC_REGISTER_MAX_INDEX |
| - r300: disable PIPE_CAP_PREFER_IMM_ARRAYS_AS_CONSTBUF |
| - r300: fix variables detection for paired ALU and TEX instructions in different branches |
| |
| Pierre-Eric Pelloux-Prayer (81): |
| |
| - radeonsi: make si_pc_emit_shaders non static |
| - radeonsi: add helpers to deal with spm counters |
| - radeonsi: add spm counters setup code |
| - radeonsi/sqtt: setup spm bo |
| - radeonsi/sqtt: enable spm counters on gfx10+ |
| - radeonsi: make sure profile_peak is used before using sqtt |
| - ac/spm: setup write broadcasting correctly |
| - ac/surface: adjust gfx9.pitch[*] based on surf->blk_w |
| - radeonsi: remove commutative_blend_add option |
| - radeonsi: drop assume_no_z_fights option |
| - drirc: remove i965 entry |
| - gallium: add PIPE_CAP_ALLOW_DRAW_OUT_OF_ORDER |
| - radeonsi: enable PIPE_CAP_ALLOW_DRAW_OUT_OF_ORDER |
| - radeonsi: fix VS kill_outputs handling |
| - radeonsi: don't always strip NIR debug infos |
| - radeonsi: generate unique shader name in si_get_nir_shader |
| - amd: update gfx10_format_table.py for gfx11 |
| - radeonsi/gfx11: add assert in legacy vs path |
| - radeonsi/gfx11: register changes |
| - radeonsi/gfx11: image descriptor changes |
| - radeonsi/tests: allow empty line and comments in csv files |
| - radeonsi/tests: add a heuristic to pick the baseline |
| - radeonsi/tests: reformat with black |
| - u_threaded: clear non-async debug callback correctly |
| - radeonsi: flush VGT streamout like PAL |
| - radeonsi: implement pipeline stats workaround |
| - gallium: add a union to access queries counters |
| - gallium: always return the correct pipeline stats index |
| - gallium: deduplicate target -> stats index code |
| - radeonsi: deduplicate query offsets |
| - radeonsi: store the pipeline stats index |
| - radeonsi: implement GL_GEOMETRY_SHADER_PRIMITIVES_EMITTED_ARB in shaders |
| - radeonsi/test: update sienna_cichlid results |
| - radeonsi/query: use the qbo correct size |
| - radeonsi/ngg: reuse the pipeline stats buffer when using atomics |
| - radeonsi: fix gs_invocation query with NGG |
| - radeonsi: don't use wave32 for GE on gfx10 if culling is used |
| - amd: fix ac_build_mbcnt_add in wave32 mode |
| - util/u_queue: rework UTIL_QUEUE_INIT_SCALE_THREADS to scale faster |
| - radeonsi: remove unused params from si_shader_select_with_key |
| - radeonsi: scale the number of shader compiler threads |
| - radeonsi: store shader variants in an array |
| - glx: set max values for pbuffer width / height |
| - radeonsi: fix glTexBuffer max size handling |
| - radeonsi: wait for PS idle in si_set_framebuffer_state |
| - radeonsi/tests: add a --slow option |
| - radeonsi/tests: use a smaller tests-per-group value |
| - radeonsi: don't use sel->nir in si_check_blend_dst_sampler_noop |
| - ac: export LLVM c++ ResetCommandLineParser method |
| - ac: reset LLVM command line parser |
| - ac/llvm: add use_waterfall_for_divergent_tex_samplers option |
| - radeonsi: enable use_waterfall_for_divergent_tex_samplers |
| - st/mesa: use mutex in st_get_texture_sampler_view_from_stobj |
| - gallium: update winsys_handle documentation |
| - gallium: add size attribute to winsys_handle |
| - va/surface: set the correct size in vaExportSurfaceHandle |
| - radeonsi: set size in si_texture_get_handle |
| - tradeonsi: fix preamble state producing incorrect packets |
| - radeonsi: prevent recursion in si_decompress_dcc |
| - radeonsi: add helper to use si_screen::aux_context |
| - radeonsi: use helpers to access si_screen::aux_context |
| - Revert "winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+" |
| - dri: store internal_format when known |
| - mesa/st: don't guess the internal format if it's known |
| - util/blob: use memcpy in read functions |
| - mesa: skip draws with invalid indices offset |
| - vulkan/wsi: define pWaitDstStageMask in the blit submission |
| - ac: use LLVMContextSetOpaquePointers if available |
| - radeonsi: use LLVMBuildLoad2 where possible |
| - ac: add per output is_16bit flag to ac_shader_abi |
| - ac/llvm: handle opaque pointers in visit_store_output |
| - ac/llvm: use LLVMBuildLoad2 in visit_load |
| - radeonsi: use LLVMBuildLoad2 in llvm PS |
| - radeonsi: use LLVMBuildLoad2 for inter-stage outputs loads |
| - radeonsi: check last_dirty_buf_counter and dirty_tex_counter |
| - amdgpu/bo: update uses_secure_bos when importing buffers |
| - nir: add a nir_opt_if_options enum |
| - radeonsi: prevent u_blitter recursion in si_update_ps_colorbuf0_slot |
| - radeonsi: use nir_opt_large_constants earlier |
| - mesa: avoid reading back textures from VRAM |
| - radeonsi: invalidate L2 when using dcc stores |
| |
| Qiang Yu (63): |
| |
| - radeonsi: lower nir_intrinsic_sparse_residency_code_and |
| - mesa/program: fix nir output reg overflow |
| - mesa/vbo: remove unused vbo_context->binding |
| - nir/builder: add load/store array variable helper functions |
| - mesa: add hardware accelerated select constant |
| - mesa: add _mesa_bufferobj_get_subdata |
| - mesa: refine name stack code to prepare for hw select |
| - mesa: add hw select name stack code path |
| - mesa/vbo: enclose none-vertex functions with HW_SELECT_MODE |
| - mapi: add api setup header for hw select mode |
| - mesa: set CurrentServerDispatch too when glBegin/End |
| - mesa: add HWSelectModeBeginEnd dispatch table |
| - mesa: pass select result buffer offset as attribute/varying |
| - mesa/st: implement hardware accelerated GL_SELECT |
| - zink: reset PIPE_CAP_ACCELERATED when cpu soft rendering |
| - virgl: return -1 for PIPE_CAP_ACCELERATED |
| - mesa: enable HardwareAcceleratedSelect |
| - ac/llvm: get back nir_intrinsic_load_tess_rel_patch_id_amd |
| - nir: add nir_intrinsic_load_lshs_vertex_stride_amd |
| - ac/nir: use nir_intrinsic_load_lshs_vertex_stride_amd |
| - radeonsi: implement load_lshs_vertex_stride abi |
| - radeonsi: add tcs_vgpr_only_inputs parameter to si_get_nir_shader |
| - ac/nir: skip gl_Layer/gl_ViewportIndex write for LS |
| - radeonsi: replace llvm ls/hs interface lds ops with nir lowered ones |
| - gallium: disable hardware select for crocus |
| - ac/llvm: conditionally check wave id in gs sendmsg |
| - radv: no need to do gs_alloc_req for newer chips in ngg vs/tes |
| - radeonsi: deserialize nir binary in si_check_blend_dst_sampler_noop |
| - radeonsi: use si_shader as parameter in si_get_nir_shader |
| - radeonsi: support multi stage shader state creation in nir shaderlib |
| - radeonsi: add si_create_passthrough_tcs |
| - radeonsi: replace llvm based fixed tcs with nir |
| - radeonsi: implement nir_intrinsic_load_tcs_num_patches_amd |
| - ac/nir: remove unused parameter in tes input lower |
| - ac/nir: add nir_intrinsic_load_hs_out_patch_data_offset_amd |
| - ac/nir: use nir_intrinsic_load_hs_out_patch_data_offset_amd in tess lower |
| - ac/nir: add pass_tessfactors_by_reg param to hs output lower |
| - ac/nir: add wave_size parameter to ac_nir_lower_hs_outputs_to_mem |
| - radeonsi: preload tess offchip ring for tcs |
| - radeonsi: implement nir_intrinsic_load_ring_tess_offchip(_offset)_amd |
| - radeonsi: implement nir_intrinsic_load_tess_rel_patch_id_amd for both tcs and tes |
| - ac/llvm: handle write mask for nir_intrinsic_store_buffer_amd |
| - ac/nir: add no_input_lds_space param to hs output lower |
| - radeonsi: replace llvm tcs output with nir lower pass |
| - radeonsi: enable PIPE_CAP_GLSL_TESS_LEVELS_AS_INPUTS |
| - radeonsi: set uses_vmem_load_other for nir_intrinsic_load_buffer_amd |
| - radeonsi: replace llvm tes input load with nir lowering |
| - ac/llvm: fix code format alignment in visit_load_local_invocation_index |
| - radeonsi: implement two esgs ring nir intrinsic |
| - ac/nir: skip gl_ViewportIndex and gl_Layer write in ES |
| - radeonsi: set lds for gs/es to handle nir shared memory load/store |
| - ac/nir: remove unused param num_reserved_es_outputs from gs input lower |
| - ac/nir: change es output lower param to esgs_itemsize |
| - radeonsi: replace llvm es output with nir lowering |
| - ac/nir: add triangle_strip_adjacency_fix to gs input lower |
| - ac/llvm: get back nir_intrinsic_load_gs_vertex_offset_amd |
| - radeonsi: replace llvm gs input handle with nir lowering |
| - ac/nir/ngg: fix nogs culling scratch size |
| - ac/nir/ngg: add a barrier before prim id export |
| - ac/nir/ngg: Decouple primitive ID store and primitive export. |
| - nir/lower_gs_intrinsics: fix primitive count for points |
| - radeonsi: fix tcs_out_lds_offsets arg alignment |
| - winsys/amdgpu: fix non-page-aligned sparse buffer creation |
| |
| Rajnesh Kanwal (29): |
| |
| - pvr: Remove double error reporting in error path. |
| - pvr: Change phase_rate_change type to bool from uint32_t. |
| - pvr: Setup common nop shader. |
| - pvr: Add services winsys transfer context support. |
| - pvr: Add support to create transfer context and setup required shaders. |
| - pvr: Add vk_sync support and remove service winsys syncobjs interface. |
| - pvr: Fix argument comments in render job submission. |
| - pvr: Pass device info struct in pds generator. |
| - pvr: Add support for R32G32B32A32_UINT, R32_UINT and R8_UINT formats. |
| - pvr: Allow signal_sync pointer to be NULL in job submission. |
| - pvr: Remove vk_format_is_pure_integer and use common helper. |
| - pvr: Add transfer 3d parameter heap support. |
| - pvr: Fix ISP_RENDER_MODE_TYPE enum in rogue_cr.xml |
| - pvr: Restrict argument to dev_info where possible. |
| - pvr: Fix UTIL_ARCH_BIG_ENDIAN check in vk_format.h. |
| - pvr: Change ALIGN macro to ALIGN_ATTR. |
| - pvr: Add services winsys transfer cmd submit interface. |
| - pvr: Move transfer logic to pvr_job_transfer.[ch] |
| - pvr: Remove clang-format off comment from vk_format.h |
| - pvr: Implement vkCmdDraw API. |
| - pvr: Init and store default sampler in pvr_device. |
| - pvr: Use common vk_buffer implementation. |
| - pvr: Implement vkCreateBufferView and vkDestroyBufferView. |
| - pvr: Move binding related checks in common code. |
| - pvr: Add support for sampler descriptor. |
| - pvr: Add support for combined image and sampler descriptor. |
| - pvr: Add support for image descriptor. |
| - pvr: Add support for storage and uniform texel descriptor. |
| - pvr: Add support for input attachment descriptor. |
| |
| Renato Pereyra (4): |
| |
| - venus: add support for vk_xwayland_wait_ready |
| - venus: Add driconf option to enable implicit fencing |
| - Revert "wsi/x11: Don't leak xcb_get_geometry_reply_t." |
| - Revert "wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes" |
| |
| Rhys Perry (94): |
| |
| - aco: fix disassembly of SMEM with both SGPR and constant offset |
| - ac/nir: properly handle large global access constant offsets |
| - nir/gather_info: fix system_value_read for rt/mesh system values |
| - nir/builder: add nir_{ine,ibfe,ubfe}_imm() helper |
| - radv: remove unnecessary align_mul/offset initialization |
| - radv: initialize meta shader options earlier |
| - radv: use radv_meta_build_nir_vs_generate_vertices() more |
| - radv: use nir_op_imm helpers |
| - radv: use inot(a) instead of ine(a, true)/ieq(a, false) |
| - ac/nir: fix 64-bit NGG GS output stores |
| - radv: always use nir_lower_io_lower_64bit_to_32 |
| - nir/copy_prop_vars: fix non-vector shader call payloads |
| - radv: fix clearing of TRUNC_COORD with tg4 and immutable samplers |
| - radv: consider radix sort shaders to be internal/meta |
| - aco/ra: fix vgpr_limit |
| - aco: ensure that definitions fixed to operands have matching regclasses |
| - radv: add radv_has_shader_buffer_float_minmax |
| - radv: allow LBVH on GFX6+ |
| - radv: remove subtractions in address calculations |
| - radv: use SMEM for a few load_global |
| - aco: split load_sbt_amd result |
| - aco: skip s_barrier if TCS patches are within subgroup |
| - ac/nir: skip s_barrier if TCS patches are within subgroup |
| - aco: fix cmpswap global atomic definition on GFX6 |
| - nir: allow NIR_PASS(_, ) |
| - nir: print file when validation fails |
| - nir: call nir_metadata_preserve in nir_io_add_const_offset_to_base |
| - nir: call nir_metadata_preserve in nir_vectorize_tess_levels |
| - nir: call nir_metadata_preserve in nir_lower_memory_model |
| - radv: call nir_metadata_preserve in various lowering passes |
| - radv: add missing NIR_PASS() and switch from NIR_PASS_V() |
| - radv: validate shaders after linking passes |
| - aco: fix p_constaddr with a non-zero offset |
| - aco/tests: add test for p_constaddr with a non-zero offset |
| - aco: only add/subtract low bits of program addresses |
| - aco/ra: fix usage of invalidated iterator |
| - aco: use p_parallelcopy for uniform reduction with zero source |
| - aco: clarify a portion of do_pack_2x16 |
| - aco/tests: disable regalloc.subdword_alloc.reuse_16bit_operands on GFX11 |
| - aco/tests: fix gfx11 variants printed as gfx12 |
| - aco: disable sdwa on gfx11 |
| - aco: use v_perm_b32 to copy 0xff00/0x00ff/0xff/0x00 |
| - aco/tests: update for GFX11's removal of SDWA |
| - radv,aco: disable shader cache when ACO debug options are used |
| - aco: fix SMEM load_global_amd with non-zero offset |
| - aco: fix SMEM load_global with VGPR address and non-zero offset |
| - aco: consider fma with multiplication by power-of-two unfused |
| - ac: use ResetAllOptionOccurrences instead of ResetCommandLineParser |
| - gitlab: ask for useful titles |
| - gitlab: ask that reporters don't include long logs in descriptions |
| - nir/deref: stop assuming coherent accesses of different SSBOs may alias |
| - aco: update SMEM offset workaround for LLVM 15 |
| - aco: don't skip VS->TCS barrier if TCS output vertices doesn't match input |
| - aco: cleanup force-waitcnt output |
| - aco/ra: update register file when updating phi definition |
| - radv/ci: re-enable vega10 fossils testing |
| - aco: add and use constantValue16() |
| - aco: don't accept med3 opcodes in get_minmax_info() |
| - aco: include _e64 variants of 16-bit min/max in minmax optimizations |
| - aco: ignore precise flag when optimizing integer clamps |
| - aco: fix single-alignbyte do_pack_2x16() path with fp inline constants |
| - aco: fix propagate_constants_vop3p with integer vop3p and 16-bit constants |
| - aco: don't use opsel to fold constants into dot accumulation sources |
| - aco: fix redirect combine in propagate_constants_vop3p() with negatives |
| - aco: try sign-extending or shifting constants in propagate_constants_vop3p |
| - aco: don't use 32-bit fp inline constants for fp16 vop3p literals |
| - aco/tests: add vop3p constant combine tests |
| - nir: rename fsin_r600/fcos_r600 to fsin_amd/fcos_amd |
| - nir/algebraic: optimize bcsel(c, fsin/cos_amd(a), fsin/cos_amd(b)) |
| - nir: allow 16-bit fsin_amd/fcos_amd |
| - radv,aco,ac/llvm: use nir_op_f{sin,cos}_amd |
| - aco: make flat access latency match mtbuf/mubuf/mimg |
| - aco: include flat-like in vmem clause statistics |
| - aco: make FLAT_instruction::offset signed |
| - aco: improve support for scratch_* instructions |
| - aco: combine additions and constants into scratch load/store |
| - aco: handle subtractions in parse_base_offset |
| - aco: refactor VGPR spill/reload lowering |
| - aco: avoid WAW hazard with BVH MIMG and other VMEM |
| - aco: include scratch/global in VMEM WAW optimization |
| - aco: treat flat-like as vmem in some scheduling heuristics |
| - aco: initialize scratch base registers on GFX9-GFX10.3 |
| - aco: use scratch_* for scratch load/store on GFX9+ |
| - aco: use scratch_* for VGPR spill/reload on GFX9+ |
| - radv: fix vbo_bound_mask indexing |
| - radv: don't update misaligned_mask in CmdBindVertexBuffers2 |
| - aco: remove unnecessary exp_cnt increments |
| - aco: fix LdsBranchVmemWARHazard with 2+ branch chains |
| - aco: fix hash statistic |
| - radv: remove claimed support for sRGB vertex buffer formats |
| - radv: fix 16-bit support in radv_lower_vs_input |
| - aco: fix 16-bit VS inputs |
| - aco: don't expand vec3 VS input load to vec4 on GFX6 |
| - aco: add SCC clobber in build_cube_select |
| |
| Ricardo Garcia (1): |
| |
| - vulkan: allow null descriptor set layouts in pipeline layouts |
| |
| Riteo (1): |
| |
| - vulkan/device_select_wayland: fix a memory leak with DRM device handling |
| |
| Rob Clark (64): |
| |
| - freedreno: Misc indent fixes |
| - freedreno/drm/virtio: Appease valgrind |
| - freedreno/drm: Drop FD_PP_PGTABLE |
| - freedreno/drm: Close bo handle after bo->destroy() |
| - freedreno/drm: Move bo common init |
| - freedreno/drm/virtio: Support ring_idx |
| - freedreno/drm/virtio: Use userspace IOVA allocation |
| - freedreno/drm/virtio: Drop blocking in host |
| - freedreno/drm/virtio: Fix SHAREABLE+MAPPABLE |
| - freedreno/drm/virtio: Protocol updates |
| - freedreno/drm/virtio: Pass guest handles to execbuf |
| - freedreno/drm/virtio: Async ccmd batching |
| - freedreno/drm/virtio: Split up large uploads |
| - freedreno/a6xx: A couple more FD_BO_NOMAP |
| - freedreno: Split out helper for staging blit |
| - freedreno: Use staging transfer if mmap fails |
| - freedreno/drm: Fix bos_on_stack calculation |
| - drm-shim: Add GET_UNIQUE support |
| - freedreno/drm-shim: Update to latest uapi version |
| - freedreno/drm-shim: Robustify error handling |
| - drm-shim: Better mmap offsets |
| - drm-shim: Add error handling for bo_init() |
| - freedreno/drm-shim: Better iova handling |
| - drm-shim: Cleanup on device file close |
| - freedreno/drm: Use DEBUG_GET_ONCE_OPTION() |
| - freedreno/drm/virtio: Don't try to mmap imported bo's |
| - freedreno/devices: Add another SKU |
| - Revert "ci: freedreno farm is down" |
| - freedreno: Add pkt4 assert |
| - freedreno/registers: Small cleanup |
| - freedreno/a6xx: Split VFD_FETCH[] if needed |
| - freedreno/a6xx: Initialize VFD_FETCH[n].SIZE to zero |
| - freedreno/a6xx: assert valid vertex_flags reg |
| - turnip: assert valid vertex_flag reg |
| - freedreno/ir3: Don't lower_gs multiple times |
| - freedreno/ir3: Add copy_vars() helper |
| - freedreno/ir3: Copy vars if needed on EndPrimitive() |
| - freedreno/ir3: Add more tess varying slots |
| - freedreno/a6xx: Fix indentation |
| - freedreno/a6xx: Fix VS const packet size |
| - freedreno/ir3: Fix GS clip-plane lowering |
| - freedreno: rename ir3_emit_driver_params() |
| - freedreno/a6xx: Handle driver-params in GS/DS |
| - freedreno/a6xx: Do clip-plane lowering in backend |
| - freedreno/gmem: Reverse order of alternative tile rows |
| - util: Fix c++ enum casting pickiness |
| - libsync: Fix compiler warning |
| - freedreno/a6xx: Remove fd6_format.[ch] |
| - freedreno/a6xx: Fix enum tag |
| - freedreno: Extract common helper macros |
| - freedreno/ir3: Remove unneeded forward declaration |
| - freedreno/autotune: Make 'offset' macro "private" |
| - freedreno/drm: Combine upper and lower 32b of OR val |
| - freedreno/a6xx: De-open-code CACHE_FLUSH enum |
| - freedreno: Use enum for primtypes table |
| - freedreno/ci: Update unit test reference decodes |
| - freedreno/registers: Whitespace fix for gen_header.py |
| - freedreno/registers: Allow varset to be specified on enum |
| - freedreno/registers: Move varset to <enum> |
| - freedreno/registers/a6xx: Some reg64 conversion |
| - freedreno/drm: Fix potential bo cache vs export crash |
| - freedreno/gmem: Fix col0 calc |
| - llvmpipe: Add some missing locking |
| - freedreno: We really don't need aligned vbo's |
| |
| Roman Stratiienko (3): |
| |
| - Android.mk: Fix c11-related build failures |
| - v3dv: Enable sync_fd importing/exporting on Android |
| - v3dv: Limit API version to v1.0 for Android |
| |
| Ruijing Dong (6): |
| |
| - radeonsi/vcn: Add support of array_mode for gfx11 |
| - radeonsi/vcn: update av1 decoding to support vcn4 |
| - radeonsi/vcn: prepare for unified queue in vcn4 |
| - radeonsi/vcn: support unified queue in vcn4 |
| - frontends/va: do texture_map when needed |
| - frontends/va: WA for ffmpeg 10bit encoding crash |
| |
| Ryan Neph (5): |
| |
| - venus: update venus-protocol with VK_EXT_image_view_min_lod |
| - venus: enable VK_EXT_image_view_min_lod |
| - venus: support VK_KHR_copy_commands2 |
| - venus: refactor VkCommandBufferBeginInfo fixups to function |
| - venus: add support for VK_KHR_dynamic_rendering |
| |
| Sagar Ghuge (1): |
| |
| - anv: Disable storage image compression for possible atomic ops |
| |
| Sami Kyöstilä (2): |
| |
| - subprojects: Roll Perfetto to v27.1 |
| - util: Shut down Perfetto before driver unload |
| |
| Samuel Pitoiset (213): |
| |
| - radv: exclude PRIMITIVE_{COUNT,INDICES} from the per-vertex output mask |
| - radv: use shader_info::writes_memory |
| - radv: use shader_info::gs::active_stream_mask |
| - radv: use shader_info::inputs_read/outputs_written for FS IO |
| - radv: use shader_info::outputs_written/per_primitive_outputs for VS outputs |
| - radv: use shader_info::system_values_read |
| - radv: fix enabling adjust_frag_coord_z and apply per-pipeline |
| - radv: add support for independent descriptor set layouts |
| - radv: add support for VkShaderModuleCreateInfo as pNext |
| - radv: fix initializing pipeline_key::topology for GFX9 and older |
| - radv: mark all active stages earlier in the pipeline creation path |
| - radv: use the hardware primitive topology everywhere |
| - radv: initialize the vertex input interface state in only one place |
| - radv: do not support UNIFORM_TEXEL_BUFFER with SRGB |
| - radv: only apply enable_mrt_output_nan_fixup for 32-bit float MRTs |
| - radv/ci: update the flakes list for GFX9 chips |
| - aco: fix load_barycentric_at_{sample,offset} on GFX6-7 |
| - nir: fix marking XFB varyings as always active IO |
| - nir: mark XFB varyings as unmoveable to prevent them to be remapped |
| - radv: clarify why STAGE_2_CLEAR_BIT needs to wait for CP DMA to be idle |
| - radv: remove redundant VK_PIPELINE_STAGE_2_TRANSFER_BIT for CP DMA idle |
| - radv: add an SQTT workaround for chips with disabled RBs |
| - radv: fix handling divisor == 0 with dynamic vertex input state |
| - radv: allow to disable sinking of load inputs for FS via drirc |
| - radv: enable radv_disable_sinking_load_input_fs for Grid Autosport |
| - radv: use correct push constants range for internal operations |
| - radv/ci: update list of expected failures for Bonaire (GFX7) |
| - radv/ci: remove empty flakes lists for GFX9 |
| - radv: fix the number of generated primitive queries with NGG GS vs legacy |
| - radv/ci: stop skipping dEQP-VK.synchronization.* on Bonaire |
| - radv: use 3D views for 3D internal operations on GFX6-8 |
| - radv: re-emit dynamic line stipple state if the primitive topology changed |
| - radv/radix: handle intentional allocation failures properly |
| - ac,radeonsi: add has_sqtt_auto_flush_mode_bug |
| - radv/sqtt: fix configuring AUTO_FLUSH_MODE on GFX10.3 |
| - radv: only init acceleration structure if RT is enabled |
| - radv: disable instance packing to fix pipeline query stats |
| - radv: remove useless check against nir_texop_samples_identical |
| - radv/winsys: allow to use RADV_FORCE_FAMILY=gfx1100 |
| - radv: disable DCC for Senra Kagura Shinovi Versus |
| - aco: recognize GFX11 in few places |
| - aco: do not align VGPRS to 8 or 16 on GFX11 |
| - radv,aco: add support for packed threadID VGPRs on GFX11 |
| - radv,aco: use the new TCS WaveID SGPR to compute vs_rel_patch_id on GFX11 |
| - aco: export MRT0 instead of NULL on GFX11 |
| - aco: do not set DLC for loads on GFX11 |
| - aco: do not set GLC stores on GFX11 |
| - aco: update LDS allocation granularity for PS on GFX11 |
| - aco: update waitcnt on GFX11 |
| - aco: do not set RESOURCE_LEVEL for buffer descriptors on GFX11 |
| - aco: do not set COMPR for exports but use 0x3 channel mask on GFX11 |
| - radv,aco: export alpha-to-coverage via MRTZ on GFX11 |
| - aco: only retrieve the scratch offset when it's declared |
| - radv,aco: do not implicitly export the primitive ID for mesh shaders |
| - aco: remove unreachable code about viewport index/layer and mesh shaders |
| - radv,aco: add a workaround for binding 2D views of a 3D image on GFX9 |
| - radv: implement VK_EXT_image_2d_view_of_3d |
| - radv: advertise VK_EXT_image_2d_view_of_3d |
| - radv: do not declare the scratch offset on GFX11 |
| - aco: do not emit the primitive ID twice for NGG VS or TES with GS |
| - radv: do not align VGPRS to 8 or 16 on GFX11 |
| - radv: use the new format table on GFX11 |
| - radv: update scratch buffer registers on GFX11 |
| - radv: update sampler registers on GFX11 |
| - radv: update buffer descriptor registers on GFX11 |
| - radv: update image descriptor registers on GFX11 |
| - radv: use PIXEL_PIPE_STATE_DUMP event instead of ZPASS_DONE on GFX11 |
| - radv: configure DB_RENDER_CONTROL on GFX11 |
| - radv: do not emit non-existent CP_COHER_START_DELAY on GFX11 |
| - radv: do not emit FLUSH_AND_INV_DB_META on GFX11 |
| - radv: update TF_RING_size to a per-SE size on GFX11 |
| - radv: update VRS registers on GFX11 |
| - radv: apply a workaround for CB perf counters on GFX11 |
| - radv: limit CP DMA to max 32KB sizes on GFX11 |
| - radv: more register changes on GFX11 |
| - radv: do not lower loading TESS/ESGS rings using the ABI for LLVM |
| - aco: use ac_is_llvm_processor_supported() for checking LLVM asm support |
| - radv: rename shader compile functions to spirv_to_nir/nir_to_asm |
| - radv: do not try to dump the NIR of the trap handler shader |
| - radv: add new pipeline helpers for NIR->ASM compilation |
| - radv: init states from pTessellationState at only one place |
| - radv: init states from pViewportState at only one place |
| - radv: init states from pRasterizationState at only one place |
| - aco: remove unnecessary intrinsics that are lowered at the ABI level |
| - radv: determine if shaders use NGG before linking |
| - radv: export implicit primitive ID in NIR for legacy VS or TES |
| - radv: constify radv_pipeline in more radv_pipeline_generate_XXX() helpers |
| - radv: rename radv_pipeline_generate_XXX() to radv_pipeline_emit_XXX() |
| - radv: remove couple of useless pCreateInfo pointers in the emit path |
| - radv: remove unused radv_get_conservative_raster_mode() helper |
| - radv: move gs_table_depth to the physical_device |
| - radv: cleanup using device/physical_device during pipeline creation |
| - radv: split radv_pipeline into radv_{graphics,compute,library}pipeline |
| - radv: introduce radv_graphics_pipeline_info and split existing info structs |
| - radv/ci: fix fails list for NAVI21 |
| - radv: move HS info and task_num_entries to the physical device |
| - radv: rename radv_cmd_state::pipeline to graphics_pipeline |
| - radv: unify radv_pipeline_has_XXX() helpers |
| - radv: move streamout_shader to radv_graphics_pipeline |
| - radv: move active_stages to radv_graphics_pipeline |
| - radv: rename VERT_GRP_SIZE to VERTS_PER_SUBGRP on GFX11 |
| - radv: do not emit more non-existent registers on GFX11 |
| - radv: update framebuffer registers on GFX11 |
| - radv: update VRS rates on GFX11 |
| - radv: use the fragment resolve path by default on GFX11 |
| - radv: report adjusted LDS size for fragment shaders on GFX11 |
| - radv: fix configuring COLOR_INVALID on GFX11 |
| - radv: fix VK_BLEND_FACTOR_CONSTANT_COLOR translation on GFX11 |
| - radv: configure DB_Z_INFO.NUM_SAMPLES correctly on GFX11 |
| - radv: disable VK_AMD_shader_fragment_mask on GFX11 |
| - radv: disable attachementFragmentShadingRate on GFX11 |
| - radv: use pipeline->slab_bo in more places |
| - radv: fix writing buffer markers with non-zero memory offset |
| - radv: copy viewport/scissor when initializing radv_viewport_info |
| - radv: init states from pMultisampleState at only one place |
| - radv: init states from pDepthStencilState at only one place |
| - radv: init states from VkPipelineRenderingCreateInfo at only one place |
| - radv: use AMD values for computing blend related state |
| - radv: init states from pColorBlendState at only one place |
| - radv: init states from VkAttachmentSampleCountInfo at only one place |
| - radv: init states from VkPipelineDiscardRectangleState at only one place |
| - radv: init states from VkPipelineFragmentShadingRateState at only one place |
| - radv: ignore DYNAMIC_STENCIL_OP if stencil test isn't enabled |
| - radv: reduce radv_tessellation_info::patch_control_points to 8-bit |
| - radv: reduce radv_input_assembly_info::primitive_topology to 8-bit |
| - radv: reduce radv_rasterization_info::polygon_mode to 8-bit |
| - radv: reduce radv_discard_rectangle_info::count to 8-bit |
| - radv: fix lowering GS intrinsics if NGG is disabled per pipeline |
| - radv/ci: update list of failures for Pitcairn |
| - radv: enable radv_zero_vram for Hammerting |
| - radv/winsys: add get_chip_name() to the null winsys |
| - radv: report the marketing name as part of the device name |
| - radv: rename ngg_gs_state to ngg_query_state |
| - radv: track if primitives generated query features are enabled |
| - radv: add primitives_generated_query to the graphics pipeline key |
| - radv: declare the NGG query argument for primitives generated query |
| - ac/nir/ngg: count the number of generated primitives for VS and TES |
| - radv: flush the NGG query state when the argument is declared |
| - radv: add few helpers related to streamout |
| - radv: implement VK_EXT_primitives_generated_query |
| - radv: advertise VK_EXT_primitives_generated_query |
| - radv/ci: rename deqp-XXX jobs to vkcts-XXX |
| - zink/ci: skip two KHR-GL46 tests that timeout most of the time with RADV |
| - zink/ci: update list of expected failures with RADV |
| - radv: remove redundant check when importing vertex input info |
| - radv: update radv_is_vrs_enabled() to use radv_graphics_pipeline_info |
| - radv: update the check to determine if dynamic discard rectangle is needed |
| - radv: update the check to determine if dynamic sample location is needed |
| - radv: update the check to determine if rasterization is enabled |
| - radv: remove remaining unused pCreateInfo pointers |
| - ci: bump the hang-detection tool in the test image for Vulkan |
| - ci: fix passing down MESA_LOADER_DRIVER_OVERRIDE for zink-anv-tgl |
| - zink/ci: change the surface type used for dEQP |
| - ci/valve: split .b2c-test into .b2c-test-vk and .b2c-test-gl |
| - radv/ci: add GLES/GLCTS testing with Zink on NAVI10 |
| - zink/ci: update list of failures again |
| - radv/ci: refactor test machines and dEQP rules to reduce copy&paste config |
| - ci: update vkd3d-proton builder/runner |
| - ci: print a message when vk3d-proton returns successfully |
| - ci: add VKD3D_PROTON_RESULTS to the list of variables to pass down for testing |
| - radv/ci: add vkd3d-proton testing |
| - radv: advertise VK_EXT_border_color_swizzle on GFX10+ |
| - ci: uprev vkd3d-proton to 39d07dea2cef34bfb3ed39741f026bc637e3eec4 |
| - radv/amdgpu: make sure to reset the number of BO when there is no ranges |
| - radv/amdgpu: do not add "global" BO to the virtual BO list |
| - aco: fix validation of SOP1 instructions without definitions |
| - radv,aco: rename radv_prolog_binary/radv_shader_prolog to shader_part |
| - radv/llvm: always emit a null export even if the FS doesn't discard |
| - radv: disable small primitive culling for user sample locations |
| - radv: use RADEON_FLAG_VA_UNCACHED for the trace BO |
| - radv: fix command line for dumping waves with UMR |
| - radv: dump UMR waves before UMR rings |
| - radv: do not emit SQTT user data packets on the SDMA queue |
| - radv/ci: add CI lists for LLVM on NAVI21 |
| - radv: use LOAD_CONTEXT_REG to load the opaque buffer size on GFX10+ |
| - radv: do not abort if SPM isn't supported for the current GPU |
| - radv: remove old workaround for HTILE layers with F1 2021 |
| - zink/ci: update list of failures for piglit with RADV |
| - radv: disable DCC for Melty Blood Actress Again Current Code |
| - radv: always enable VK_EXT_debug_utils |
| - zink/ci: update list of failures |
| - radv/ci: update list of failures against CTS 1.3.3.0 |
| - aco: fix load_barycentric_at_sample without MSAA |
| - radv: fix dumping VS prologs assembly |
| - radv/ci: enable fossils testing for GFX1100 |
| - radv: remove the radv_report_apu_as_dgpu workaround for Red Dead Redemption 2 |
| - radv: fix wide points/lines by configuring the guardband correctly |
| - zink/ci: update list of CTS flakes for RADV |
| - ci: uprev vkd3d-proton to 5b73139f182d86cd58a757e4b5f0d4cfad96d319 |
| - radv,aco: track if a fragment shader needs an epilog |
| - radv,aco: introduce {radv,aco}_ps_epilog_key |
| - radv: declare a new user SGPR arg in FS for the epilog PC |
| - radv: add a function that declares PS epilog shader arguments |
| - aco: add new pseudo instruction p_jump_to_epilog |
| - aco: emit p_jump_to_epilog if the main fragment shader has an epilog |
| - aco: do not abort if the FS doesn't export anything but has an epilog |
| - aco: prevent adding DONE/VM to the last export if the FS has an epilog |
| - aco: add aco_postprocess_shader() helper |
| - radv,aco: rename radv_aco_build_prolog to radv_aco_build_shader_part |
| - aco: refactor export_fs_mrt_color() for PS epilogs preparation |
| - aco: add support for compiling PS epilogs |
| - radv: implement PS epilogs |
| - radv: disable viewport depth clamping only when necessary |
| - aco: requires Exact for p_jump_to_epilog |
| - radv: only force 1x sample for Bresenham lines when pipeline draws lines |
| - vulkan: add support for VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT |
| - radv: implement VK_EXT_attachment_feedback_loop_layout |
| - radv: ignore out-of-order rasterization if stencil write mask is dynamic |
| - radv: fix gathering XFB info if there is dead outputs |
| - radv: fix cleaning the meta query state if an error occured |
| - radv: re-emit viewports if negative one to one or depth clamp mode changed |
| - radv: fix bogus assertion with RADV_FORCE_VRS |
| - radv: fix pipelineStageCreationFeedbackCount when it's 0 |
| |
| Sarah Walker (2): |
| |
| - pvr: Update for firmware 1.17@6256262 |
| - pvr: Move BRN 44079, 48492 and 66011 code into pvrsrvkm specific directory |
| |
| Sathishkumar S (2): |
| |
| - radeon/vcn: skip create and destroy message for jpeg |
| - radeon/vcn: engage all available jpeg engines |
| |
| Sebastian Keller (1): |
| |
| - egl/wayland: Don't try to access modifiers u_vector as dynarray |
| |
| Sergi Blanch Torne (1): |
| |
| - ci/crosvm: clean crosvm socket |
| |
| Sidney Just (19): |
| |
| - wgl: add a flag to determine if running on zink |
| - wgl: add zink to the list of auto-loaded drivers |
| - zink: support VK_KHR_win32_surface |
| - kopper: add win32 loader interface |
| - zink: support win32 wsi |
| - wgl: support kopper |
| - gallium: support for implementing EXT_external_objects_win32 |
| - mapi: added EXT_external_objects_win32 definitions |
| - gallium/noop: implement win32 device LUID and node mask getters |
| - gallium/trace: implement win32 device LUID and node mask getters |
| - mesa: implement GL_DEVICE_LUID_EXT and GL_DEVICE_NODE_MASK_EXT getters |
| - mesa: implement import win32 memory and semaphore |
| - mesa: pipe cap for EXT_memory_object_win32 and EXT_semaphore_win32 |
| - zink: codegen for Win32 external object extensions |
| - zink: implement win32 fence import |
| - zink: implement win32 memory handle import |
| - zink: implement device LUID and node mask |
| - zink: enable pipe cap for win32 external memory and fences |
| - docs: updated zink features and release notes |
| |
| Sil Vilerino (39): |
| |
| - gallium/vl: Add software winsys and offscreen winsys |
| - util/u_format: Drop assert that has valid/well-defined behavior |
| - util/vl_vlc: Support compiling in C++ |
| - st_vdpau: Pass format when opening resource from handle in st_vdpau_resource_from_description |
| - d3d12: Add .clang_format file |
| - gallium/va/radeonsi: Using private as a parameter name conflicts with C++ keywords |
| - gallium va: VaDeriveImage support stride/offset being different for NV12 planes |
| - gallium: Add values to pipe_video_cap for multi-slice and multi-reference encode |
| - gallium va: Handle new VA attributes with new pipe video caps |
| - gallium: Add multiple slice support to pipe_h264_enc_picture_desc |
| - gallium va: Add support for multiple slices encoding |
| - gallium radeon/r600/omx/va: Adds support for multiple reference encoding |
| - gallium: Add MinLumaBiPredSize8x8 to pipe_h264_sps, pic_init_qs_minus26 to pipe_h264_pps |
| - gallium omx: Fill out MinLumaBiPredSize8x8 and pic_init_qs_minus26 |
| - gallium vdpau: Fill out level_idc and MinLumaBiPredSize8x8 |
| - gallium va: VaHandlePictureParameterBufferH264 fill out new pipe params MinLumaBiPredSize8x8, pic_init_qs_minus26, chroma_format_idc, bit_depth_chroma, bit_depth_chroma_minus8 |
| - d3d12: Add d3d12_promote_to_permanent_residency |
| - ci: Update x86 debian build to pick up tag v1.602.0-r1 from DirectX-Headers |
| - d3d12: Add video decode implementation of pipe_video_codec |
| - d3d12: Add video encode implementation of pipe_video_codec |
| - d3d12: Add create_video_codec and create_video_buffer entrypoints |
| - d3d12: Add util video functions to d3d12_format |
| - d3d12: Improve planar resource support to handle video requirements |
| - d3d12: Add support for d3d12 video in d3d12_screen |
| - d3d12: Add GLSL singleton refcounting to screen for offscreen non-GL rendering |
| - meson: Support d3d12 as a video-supporting driver |
| - winsys: Do not use Display type when X11 is not present in build |
| - meson: Update DirectX-Headers dependency to 1.602.0 version with fallback to the wrap |
| - meson: Add build option for gallium-d3d12-video feature |
| - d3d12: Align output buffer offset access to D3D12_FEATURE_DATA_VIDEO_ENCODER_RESOURCE_REQUIREMENTS.CompressedBitstreamBufferAccessAlignment |
| - d3d12: Fix debug_printf format strings for cross platform builds |
| - d3d12: Video - Remove unused spCopyQueues from enc/dec objects |
| - pipe/video: Add PIPE_VIDEO_ENTRYPOINT_PROCESSING |
| - vl: Replace usage of entrypoint UNKNOWN with PROCESSING for VP |
| - va: Replace usage of entrypoint UNKNOWN with PROCESSING for VP |
| - gallium/video: Add video post processing interface |
| - va: Add support for VPP rotation, flip, alpha blend, crop, scaling |
| - d3d12: Add pipe_video_codec::process_frame implementation |
| - d3d12: Enable VPP rotation, flip, alpha blend, crop, scaling via pipe_video_codec::process_frame |
| |
| SoroushIMG (4): |
| |
| - zink: Fix BO size when it's not aligned to 16 bytes |
| - zink: Fix spirv stream 0 vertex emit for multistream shaders |
| - Zink: Fix clear being missed when using emulated draws in zink_blit |
| - zink: Fix incorrect emission of SPIR-V shift ops |
| |
| SureshGuttula (4): |
| |
| - radeonsi: Set display_remote for non-refernced frames |
| - radeonsi/vcn : update enc->dpb ref_use for index 0 |
| - Revert "radeonsi: Set display_remote for non-refernced frames" |
| - Revert "radeon: hardcode uvd/vce encoder not_referenced value to false" |
| |
| Sviatoslav Peleshko (7): |
| |
| - mesa: flush bitmap caches when changing scissors or window rects state |
| - anv: workaround apps that assume full subgroups without specifying it |
| - intel/blorp/gen6: Set BLEND_STATEChange only if emitting the blend state |
| - anv: Dirty all dynamic state bits when creating command buffer state |
| - intel/blorp: Dirty depth bounds dynamic state bits after blorp |
| - intel/nullhw: Use correct macro to fix build regression |
| - iris: Always initialize shader compilation queue ready fence |
| |
| TSnake41 (1): |
| |
| - zink: print result code string on vulkan failure |
| |
| Tapani Pälli (5): |
| |
| - isl: disable mcs (and mcs+ccs) for color msaa on DG2 |
| - anv: use anv_cmd_dirty_mask_t type for dynamic state |
| - intel/fs: setup SEND message descriptor from nir scope |
| - iris: implement Wa_14015264727 for DG2 |
| - anv: implement Wa_14015264727 for DG2 |
| |
| Tatsuyuki Ishi (9): |
| |
| - radv: Fix redundant subpass barriers due to erroneous comparison |
| - radv/ci: skip dEQP-VK.fragment_operations.transient_attachment_bit |
| - radv/ci: Move transient_attachment_bit from fail to skip list |
| - radv/ci: skip image.sample_texture.*_compressed_format* |
| - radv/ci: Move sample_texture.*_compressed_format_* to faillist for gfx<=9 |
| - radv: Fix vkCmdCopyQueryResults -> vkCmdResetPool hazard. |
| - amd: Revert gfx10 addrlib changes |
| - radv: Only set pstate for the first hw_ctx. |
| - radv: Implement radv_flush_before_query_copy to workaround UE Vulkan bugs. |
| |
| Thomas Debesse (2): |
| |
| - gallium/clover: LLVM setLangDefaults moved from clangFrontend to clangBasic |
| - gallium/clover: pass -no-opaque-pointers to Clang |
| |
| Tiago Koji Castro Shibata (2): |
| |
| - d3d12: fallback to integrated adapter instead of arbitrary one |
| - d3d12: add more formats to supported conversions |
| |
| Timothy Arceri (71): |
| |
| - glsl/st: vectorise interfaces of SSO shader programs |
| - nir: fix setting varying from uniform as flat |
| - nir: fix sorting before assigning varying driver locations |
| - nir: always set the exact_trip_count_unknown loop terminator property |
| - nir: remove unreachable loop terminators |
| - svga: remove duplicate nir compile options |
| - nir/glsl: add glsl_record_compare() wrapper |
| - nir/glsl: wrapper contains_{double,interger}() |
| - nir/glsl: wrapper field_index() |
| - nir/glsl: wrap component_slots_aligned() |
| - glsl/mesa: move parse_program_resource_name() to common linker_util code |
| - glsl: add new build program resource helpers |
| - nir: add variable data fields required for NIR glsl varying linking |
| - nir: abort io info gathering if location is not set or is a temp value |
| - nir: skip lowering io to scalar for must_be_shader_input |
| - glsl: implement lower_packed_varyings() as a NIR pass |
| - glsl: add a NIR based varying linker |
| - glsl: implement opt_dead_builtin_varyings() as a NIR pass |
| - glsl: implement lower_xfb_varying() as a NIR pass |
| - glsl: enable the use of the nir based varying linker |
| - glsl: remove now unused GLSL IR varying linker code |
| - glsl: simplify finding cursor in varying packing code |
| - nir: add support for forced sampler indirect loop unrolling |
| - gallium/drivers: set force_indirect_unrolling_sampler for all required drivers |
| - glsl: move validation of sampler indirects to the nir linker |
| - i915g: use a valid setting for force_indirect_unrolling |
| - softpipe: switch to NIR loop unrolling |
| - asahi: switch to NIR loop unrolling |
| - d3d12: switch to NIR loop unrolling |
| - etnaviv: switch to NIR loop unrolling |
| - i915g: switch to NIR loop unrolling |
| - r300: disable GLSL IR loop unrolling |
| - r600: switch to NIR loop unrolling |
| - vc4: disable GLSL IR loop unrolling |
| - nir/i915g/r300/nv30: skip marking varyings as flat in some drivers |
| - etnaviv: assert if etna_shader_io_file reg overrun |
| - etnaviv: vectorise io |
| - nir/gcm: fix pushing instructions into if blocks |
| - ci: uprev piglit 2022-05-31 |
| - nir: add nir based version of the lower_const_arrays_to_uniforms pass |
| - glsl/nir: skip adding hidden uniforms to the remap tables |
| - glsl/nir: allow the nir linker to remove dead uniforms we created |
| - glsl: move common link time optimisation calls to linker code |
| - glsl: move gl_nir_link_opts() call out of the st code |
| - glsl: switch to NIR based implementation of lower_const_arrays_to_uniforms() |
| - glsl: remove now unused lower_const_arrays_to_uniforms() |
| - gallivm: disable GLSL IR loop unrolling in LLVMPIPE |
| - freedreno/ir3: tidy up duplication of common nir options |
| - freedreno: switch to NIR loop unrolling |
| - lima: lower all undefs to zero in vs |
| - lima: fixup nir indirect unroll options to match gallium CAP |
| - lima: switch to NIR loop unrolling |
| - nouveau/nv30: disable GLSL IR loop unrolling |
| - nouveau/nv50: disable GLSL IR loop unrolling |
| - nouveau/nvc0: disable GLSL IR loop unrolling |
| - svga: disable GLSL IR loop unrolling |
| - gallium: remove PIPE_SHADER_CAP_MAX_UNROLL_ITERATIONS_HINT CAP |
| - glsl: remove the now unused GLSL IR loop unrolling code |
| - glsl: drop extra optimise swizzles call |
| - glsl: tidy up link_varyings_and_uniforms() |
| - glsl: remove never true do_dead_code() parameter |
| - glsl: inline do_common_optimization() call |
| - util: add dri config option to disable GL_MAP_UNSYNCHRONIZED_BIT |
| - util: use force_gl_map_buffer_synchronized workaround with RAGE |
| - glsl: merge lower_buffer_access with lower_shared_reference |
| - glsl: drop non-nir path for atan in builtin functions |
| - st/glsl: fix broken vertex attrib mapping |
| - nir/loop_unroll: clean up after complex_unroll_single_terminator() |
| - glsl: correctly track cross slot component packing |
| - Revert "nir: Preserve offsets in lower_io_to_scalar_early" |
| - glsl: dont lower precision for textureGatherOffsets |
| |
| Timur Kristóf (119): |
| |
| - radv: Minor formatting fix in radv_device.c file. |
| - radv: Move queue submit sparse bindings to a separate function. |
| - radv: Move empty queue submit code path to a separate function. |
| - radv: Move normal (non-empty) queue submit to a separate function. |
| - radv: Move up early exit for transfer queues in radv_get_preamble_cs. |
| - radv: Simplify some boolean code in radv_get_preamble_cs. |
| - radv: Initialize BO pointers when creating preambles. |
| - radv: Refactor cache flush code for the initial preambles. |
| - radv: Don't create continue preamble when it's not needed. |
| - radv: Rename fill_geom_tess_rings to radv_fill_shader_rings. |
| - radv: Don't use pointers to pointers when updating the preambles. |
| - radv: Only emit what is appropriate to the queue family for preambles. |
| - ac/nir: Add ac_nir_load_arg helper for shader arguments. |
| - radv: Move radv_nggc_settings enum out of radv_cmd_buffer. |
| - nir: Add upper bound for AMD shader arg intrinsics. |
| - radv: Lower ABI in NIR for tess/ESGS/NGG shader arguments. |
| - ac/nir: Remove now-superfluous ac_nir_lower_tess_to_const. |
| - aco: Remove now-superfluous intrinsics. |
| - ac/llvm: Remove now-superfluous intrinsics and ABI callbacks. |
| - radv/amdgpu: Group queue submit info into a structure. |
| - radv/amdgpu: Initialize semaphore info with a designated initializer. |
| - radv/amdgpu: Pass new queue submit info structure to internal function. |
| - radv/amdgpu: Pass correct struct type instead of repeating the cast. |
| - radv/amdgpu: Use scheduled dependency for submitting to multiple queues. |
| - ac/nir: Add I/O lowering for task and mesh shaders. |
| - ac: Add task ring entry shader argument. |
| - radv: Add radv_pipeline_has_task helper. |
| - radv: Set user data register for task shaders. |
| - radv: Set wave size for task shaders. |
| - radv: Fill task shader info. |
| - radv: Add task shader arguments. |
| - radv: Add task ring entry argument for mesh shaders. |
| - radv: Implement task shader intrinsics in the ABI. |
| - radv: Enable nir_opt_offsets for task shaders. |
| - radv: Use I/O lowering for task and mesh shaders. |
| - radv: Postprocess task shader configuration. |
| - radv: Allow linking task shaders. |
| - radv: Lower shared and task_payload variables in task/mesh shaders. |
| - radv: Print task shader stage name before disasm. |
| - radv: Fix task shader info. |
| - radv: Fix loading task shader ring buffer addresses. |
| - aco: Fix scratch with task shaders. |
| - radv: Introduce ring info structure for queues, refactor preamble generation. |
| - radv: Use a ws variable during preamble creation. |
| - radv: Pass radv_device pointer to some functions. |
| - radv: Refactor queue state to separate structure. |
| - radv: Clarify emitting graphics shader pointers. |
| - nir: Add explicit task payload atomic intrinsics. |
| - radv: Implement conditional rendering for async compute queue. |
| - ac: Remove trailing white space. |
| - aco: Remove trailing whitespace. |
| - radv: Remove trailing whitespace. |
| - radv: Disable predication for supass clear and image clears. |
| - nir: Add new launch_mesh_workgroups intrinsic. |
| - nir: Add common task shader lowering to make the backend's job easier. |
| - radv: Add mesh and task stage names to pipeline executable properties. |
| - nir: Keep track of cross-invocation mesh shader output access. |
| - radv, ac/nir: Fix multiview layer export for mesh shaders. |
| - ac/nir/taskmesh: Use task shader lowering from common NIR code. |
| - ac/nir/taskmesh: Use 3 dimensional workgroup ID. |
| - ac/nir: Add remappability to tess and ESGS I/O lowering passes. |
| - ac: Add RDNA2 task+mesh shader draw packet opcodes. |
| - ac: Add task shader ring information. |
| - radv: Implement task shader draw and payload rings. |
| - nir: Introduce new intrinsics for AMD specific mesh shader task ring. |
| - ac/nir/ngg: Clean up mesh shader output LDS layout. |
| - ac/nir/ngg: Use mesh shader scratch ring when outputs don't fit LDS. |
| - ac/nir/ngg: Use variables for outputs without cross-invocation access. |
| - radv: Implement mesh shader scratch ring. |
| - radv: Enable NGG wave ID for mesh shader scratch ring. |
| - ac/nir/ngg: Add mesh shader workgroup index. |
| - radv: Lower mesh shader 3D workgroup ID to 1D index. |
| - ac/nir/taskmesh: Preserve workgroup ID Y and Z when applying firstTask. |
| - radv: Always use 3D block ID and grid size in task shaders. |
| - gallium/u_blitter: Fix depth. |
| - zink: Enable the VK_EXT_depth_clip_enable extension. |
| - zink: Always enable depth clamping, make depth clipping independent. |
| - nir/lower_task_shader: don't use base index for shared memory intrinsics |
| - ac/nir/ngg: Refactor LDS instructions in NGG GS vertex emit and export. |
| - ac/nir/ngg: Ignore driver location for mesh shader outputs. |
| - radv: Don't assign driver locations to mesh shader outputs. |
| - radv: Add CULL_PRIMITIVE to special output mask. |
| - radv: Use NIR optimization to move discards to the top. |
| - radv: Only initialize DGC state when DGC is enabled. |
| - radv: Remove trailing whitespace introduced by DGC commits. |
| - radv: Refactor predication for compute queues. |
| - radv: Refactor some CP DMA functions to work with radeon_cmdbuf. |
| - radv: Refactor radv_emit_userdata_address to work with radeon_cmdbuf. |
| - radv: Refactor radv_emit_descriptor_pointers to work with radeon_cmdbuf. |
| - radv: Refactor radv_emit_inline_push_consts to work with radeon_cmdbuf. |
| - radv: Refactor view index emit to use a per-stage function. |
| - radv: Move inline push constants to a new function. |
| - radv: Minor cleanup of radv_queue_submit_normal. |
| - radv: Allow reusing pipeline compute state emit functions. |
| - radv: Add dispatch_initiator_task field to radv_device. |
| - radv: Create internal cmdbuf when a graphics pipeline needs compute. |
| - radv: Flush descriptors and push constants for task shaders. |
| - radv: Introduce radv_before_taskmesh_draw. |
| - radv: Implement mesh shading draw calls with task shaders. |
| - radv: Synchronization for task shaders. |
| - radv: Support task shaders in secondary cmd buffers. |
| - radv: Workaround MEC taskmesh dispatch hang when count buffer has zero. |
| - radv: Submit internal compute cmdbuf. |
| - radv: Copy BO list to ACE internal CS. |
| - radv: Enable task shader feature for NV_mesh_shader. |
| - ac/nir/ngg: Move primitive ID workgroup barrier to proper place. |
| - aco: Remove hack for primitive ID export. |
| - ac/nir/ngg: Create output variable for primitive ID export. |
| - ac/llvm: Implement load_num_subgroups for NGG shaders. |
| - ac/llvm: Add LLVM bug workaround to ac_build_mbcnt_add. |
| - ac/llvm: Use gs_prim_id for NGG VS. |
| - ac/nir/ngg: Copy comment about LDS layout for NGG GS. |
| - ac/nir/ngg: Remember proper bit sizes of GS output variables. |
| - ac/llvm: Implement GDS atomic add NIR intrinsic. |
| - radv/llvm: Remove incorrect hardcoded workgroup size from NGG GS. |
| - radv: Properly set LDS size for LLVM NGG shaders. |
| - radv: Use NIR-based NGG lowering with LLVM. |
| - aco: Fix p_init_scratch for task shaders. |
| - nir/gather_info: Clear cross-invocation output mask. |
| |
| Tomeu Vizoso (4): |
| |
| - lima/ci: Disable lima-mali450-piglit-gpu:arm64 |
| - panvk/ci: Disable CI for a while |
| - ci: Disable jobs to the Collabora lab |
| - Revert "ci: Disable jobs to the Collabora lab" |
| |
| Vadym Shovkoplias (4): |
| |
| - anv: Fix geometry flickering issue when compute and 3D passes are combined |
| - driconf: Add a limit_trig_input_range option |
| - drirc: Set limit_trig_input_range option for the Wolfenstein Youngblood |
| - drirc: Set limit_trig_input_range option for glmark2 |
| |
| Vasily Khoruzhick (7): |
| |
| - lima: Implement texture_barrier |
| - lima: enable PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT |
| - lima: Lower nir_op_idiv with other scalar math operations. |
| - u_transfer_helper: flush temporary transfer first for MSAA |
| - lima: implement lima-specific blitter |
| - lima: wire up MSAA 4x support |
| - lima: fix buffer overallocation for index, vertex and constant buffers |
| |
| Victor Hermann Chiletto (1): |
| |
| - radv: always check entry count in descriptor pool when allocating |
| |
| Viktoriia Palianytsia (1): |
| |
| - anv: Add workaround for sample mask with multisampling |
| |
| Vinson Lee (14): |
| |
| - panfrost: Fix memory leaks on unit test failures. |
| - zink: Fix memory leak on error path. |
| - radeonsi: Move NULL check before dereference. |
| - dri2: Fix memory leak. |
| - d3d12: Initialize d3d12_video_encoder_bitstream member m_uiOffset. |
| - amd: Initialize Gfx11Lib members in constructor. |
| - d3d12: Remove unnecessary NULL check. |
| - clc: Fix build with llvm-15. |
| - microsoft/spirv_to_dxil: Fix missing-prototypes errors. |
| - radv: Remove unnecessary null check. |
| - microsoft/compiler: Fix assert. |
| - tu: Check dereferenced value of rop_reads_dst. |
| - aco: Initialize spill_ctx members in constructor. |
| - r600/sfn: Initialize TestShaderFromString member m_instr_factory. |
| |
| Vlad Zahorodnii (2): |
| |
| - winsys/amdgpu-radeon: Allow specifying context priority |
| - radeonsi: Add support for EGL_IMG_context_priority |
| |
| Yevhenii Kolesnikov (1): |
| |
| - nir: Remove single-source phis before opt_if_loop_last_continue |
| |
| Yiwei Zhang (51): |
| |
| - venus: override aspectMask for internal tiling modifier |
| - venus: use linear modifier for legacy common wsi path |
| - venus: fix view format for ahb image |
| - venus: refactor to add struct vn_env |
| - venus: add env perf options and introduce no_async_set_alloc |
| - venus: add VN_PERF option no_async_buffer_create |
| - venus: add VN_PERF option no_async_queue_submit |
| - venus: refactor vn_cmd_submit |
| - venus: flush when batched draw calls reach a threshold |
| - venus: disable VK_EXT_extended_dynamic_state2 |
| - venus: refactor android gralloc pieces |
| - venus: cache front_rendering_usage bit at gralloc init |
| - venus: update vn_GetSwapchainGrallocUsage2ANDROID for shared present |
| - venus: enable ANB shared presentable image prop |
| - venus: add VN_RELAX_BASE_SLEEP_US env var for tuning cpu utils |
| - venus: log performance envs upon instance creation with debug init |
| - venus: extend buffer cache to cover layering usage |
| - vulkan: remove the VkPhysicalDeviceMemoryProperties workaround |
| - radv: remove VkPhysicalDeviceMemoryProperties2 workaround |
| - venus: use common entry point for VkPhysicalDeviceMemoryProperties |
| - venus: resolve AHARDWAREBUFFER_FORMAT_Y8Cb8Cr8_420 to external format |
| - venus: fix a bug on buffer cache init failure path |
| - venus: fix the queue init failure path |
| - venus: fix a missing mtx_destroy in vn_device_init |
| - venus: adjust device resources init order |
| - venus: let device track queue families used for device creation |
| - venus: add vn_feedback_pool backed by coherent buffer (part 1) |
| - venus: add vn_feedback_pool backed by coherent buffer (part 2) |
| - venus: add feedback check/set/reset helpers |
| - venus: add event feedback |
| - venus: add NO_FENCE_FEEDBACK perf option and disable sparse resource |
| - venus: create per queue family command pool for feedback cmd |
| - venus: renderer to store allow_vk_wait_syncs capset |
| - venus: add fence feedback |
| - venus: resolve YV12 AHB to VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM |
| - venus: use narrow range to match up with mesa EGL |
| - venus: swizzle the chroma channels for YVU420 to match the VkFormat |
| - venus: GetPhysicalDeviceSparseImageFormatProperties to match sparse support |
| - venus: suballocate more for layering |
| - venus: add more tracepoints for perf analysis |
| - venus: filter out VK_EXT_physical_device_drm on the driver side |
| - venus: check dynamic state for raster enablement |
| - venus: reenable VK_EXT_extended_dynamic_state2 |
| - Revert "venus: suballocate more for layering" |
| - anv: enable VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM for modifier support |
| - venus: fix external memory ext filtering |
| - venus: avoid feedback for external fence |
| - venus: allow no external memory handle when renderers lacks support |
| - zink: fix zink_create_fence_fd to properly import |
| - zink: fix in-fence lifecycle |
| - venus: ignore pInheritanceInfo if not secondary command buffer |
| |
| Yogesh Mohan Marimuthu (12): |
| |
| - radeonsi/gfx11: make flat_scratch changes for graphics |
| - radeonsi/gfx11: make flat_scratch changes for compute |
| - radeonsi/gfx11: interp changes for 32bit |
| - radeonsi/gfx11: interp changes for 16bit |
| - radeonsi/gfx11: instruction cache line size is 128 bytes |
| - radeonsi/gfx11: program inst_pref_size for compute |
| - radeonsi/gfx11: program db render control register |
| - radeonsi/gfx11: export alpha through mrtz for alpha-to-coverage if mrtz is there |
| - ac,radeonsi/gfx11: swizzle MRT0/1 for dual source blending |
| - vulkan/wsi: fix extra free if buffer_blit_queue |
| - radeonsi: remove tabs from code |
| - radeonsi: no need to call si_pm4_clear_state() in si_pm4_free_state() |
| |
| Yogesh Mohanmarimuthu (1): |
| |
| - radeonsi/gfx11: use PIXEL_PIPE_STATE_DUMP event instead of ZPASS_DONE |
| |
| Yonggang Luo (119): |
| |
| - zink: Remove redundant framebuffer_mtx from zink_screen.h |
| - win32: Fixes 32 bits visual studio module definition files by add script gen_vs_module_defs.py |
| - win32: Do not use BUILD_GL32, we use def file to export win32 dll symbols. |
| - misc: Replace \`#ifdef\\t__cplusplus` with \`#ifdef\\s\\s__cplusplus` |
| - misc: replace \`#ifdef\\s\\s__cplusplus` with \`#ifdef\\s__cplusplus` |
| - zink: Trim spaces in zink.rst |
| - zink: Improve zink.rst with links |
| - st: Improve _mesa_error about GL_OUT_OF_MEMORY in st_cb_texture.c |
| - llvmpipe: Do not use _Atomic keyword that doesn't support by MSVC |
| - radv: fixes msvc compiling error |
| - radv: Fixes compiling error with msvc |
| - radv: Add losing member hw_ip_version_major and hw_ip_version_minor for win32 |
| - radv: Add macros for win32 that accessed |
| - nir: Fixes MSVC compiling error about unused variable \`_` |
| - egl/wgl: On win32, there is no support for EGL_EXT_device and EGL_EXT_platform_device |
| - util: Fixes test_util_get_process_exec_path on windows host with msys2/mingw |
| - util: Fixes typo in test_util_get_process_exec_path |
| - ci/vs2019: Clear CI_COMMIT_MESSAGE and CI_COMMIT_DESCRIPTION for please meson |
| - radv: Remove usage of \`cnd_monotonic.h` |
| - d3d12: Fixes d3d12 compiling errors in \`D3D12ResourceState.h` for mingw |
| - d3d12: Move shared code that include d3d12 headers into d3d12_common.h |
| - d3d12: Fixes compiling error by include \`<unknwn.h>` |
| - d3d12: \`#include <windows.h>` instead \`#include <Windows.h>` for building under linux with mingw |
| - d3d12/wgl/test: Fixes wgl_tests.cpp for d3d12 with mingw |
| - dzn: Fixes compiling error when build with msys2/mingw |
| - dzn: Fixes compiling error by include \`<unknwn.h>` |
| - microsoft/clc: Fixes compiling of microsoft clc with mingw |
| - docs: Fixes dispatch.rst with __THREAD_INITIAL_EXEC |
| - meson: Fixes timespec_get detect code |
| - c11/threads: Remove the detecting code for \`HAVE_TIMESPEC_GET` in threads_win32.h |
| - c11: Implement c11/time.h with c11/impl/time.c |
| - c11: Implement timespec_get on win32 properly when not available |
| - util: Use timespec_get directly, it's always present |
| - util: Remove usage of designated initializers in timespec_test.cpp |
| - util: always enable timespec_test.cpp |
| - util: Remove unused function timespec_passed |
| - util: Add unittest for timespec_get |
| - ci/vs2019: There is no meaning to set $env:ErrorActionPreference |
| - ci/vs2019: switch to powershell 7 |
| - ci/vs2019: Update editorconfig for ps1 files |
| - ci/vs2019: Give each option its own line in mesa_build.ps1 |
| - ci/vs2019: Rename to vsInstallPath |
| - ci/vs2019: Do not install subprojects |
| - ci/vs2019: Ignore cert install output, it's too long |
| - ci/vs2019: Ignore error when removing directory recursively |
| - ci/vs2019: Disable progress when install python |
| - ci/vs2019: Disable progess bar for pip3 install |
| - ci/vs2019: Use shared VULKAN_SDK_VERSION for install and build vulkan related packages |
| - ci/vs2019: Add a dedicated script for entering VS dev shell |
| - ci/vs2019: Split call to C:\\vs_buildtools.exe into multiple lines |
| - ci/vs2019: Show the total vs2019 install time |
| - ci/vs2019: Reduce the vs2019 build tools components to be installed |
| - ci/vs2019: ignore _build and _install directory by write * into .gitignore |
| - ci/vs2019: Install all build artifacts into C:\\mesa-deps |
| - ci/vs2019: Split choco install output |
| - ci/vs2019: Split install of vs2019 and choco into separate Docker image |
| - ci/vs2019: Remove comment for windows_test_vs2019 that not true |
| - ci/vs2019: Convert mesa_build.ps1 to using PowerShell 7 && operator |
| - microsoft/compiler: Include stddef.h before using size_t |
| - microsoft/compiler: Fix error from double extern |
| - dxil: Fixes compiling dxcapi.h with mingw. |
| - dxil: Sync dxil_spirv_shader_stage with gl_shader_stage |
| - dxil: Fixes warning about comparing different enum value and enum literal |
| - microsoft/spirv_to_dxil: Delete unused local variable |
| - microsoft/clc: Delete unused local variables |
| - microsoft/clc: Add missing void to no-parameter function signature |
| - microsoft/spirv_to_dxil: Fixes maybe-uninitialized compiling error in dxil_spirv_nir.c |
| - ci: Building spirv-to-dxil on both mingw and debian/x64 |
| - intel: using C++11 keyword thread_local |
| - glx: using C++11 keyword thread_local |
| - util: Define WIN32_LEAN_AND_MEAN before include of \`windows.h` in u_thread.h |
| - c11: Move the implementation of threads.h into c source code |
| - c11: Fixes memory leak of pack in thrd_create on win32 |
| - c11: Implement thread_local in c11/threads.h |
| - util/c11: Update function u_thread_create to be c11 conformance |
| - meson: Using get_argument_syntax as the \`--compiler_id` option for gen_vs_module_defs.py |
| - meson: Use cc.get_argument_syntax instead cc.get_id when possible. |
| - meson/rtti: Using get_argument_syntax to check msvc compatible compiler |
| - meson/clover: Getting clover can be built with no-rtti llvm in mingw |
| - ci/vs2019: Upgrade Windows 10 SDK version to 20348 |
| - docs: Update windows build requirement with Visual Studio |
| - d3d12: Use static_cast instead of dynamic_cast in d3d12_video_enc_h264.cpp |
| - d3d12: Convert #include <Windows.h> to #include <windows.h> for mingw on linux |
| - d3d12: Fixes compiling error in d3d12/wgl/d3d12_wgl_framebuffer.cpp with gcc |
| - microsoft/clc: Fixes narrowing error in clc_compiler_test.cpp with mingw/gcc |
| - microsoft/clc: Disable clc_compiler_test on non-windows platform |
| - dzn: Fixes incompatible pointer type error |
| - ci/x86_build: Getting pushd popd be paired, avoid using cd |
| - ci: Prepare the container for building all mesa components with mingw under linux |
| - ci: Trigger the new mingw/linux dockers to be build |
| - d3d12: Turn d3d12_format.h to include d3d12_common.h |
| - ci: Building all mesa functional with mingw on debian |
| - meson: Enable wgl tests on mingw |
| - CODEOWNERS: Update c11 code owners |
| - CODEOWNERS: evelikov renamed to xexaxo |
| - gallium: Move -DHAVE_PIPE_LOADER_DRI and -DHAVE_PIPE_LOADER_KMS to be pre_args |
| - gallium: Do not call to pipe_loader_sw_probe_kms when not present |
| - gallium: Rename macros HAVE_PIPE_LOADER* |
| - gallium: Disable dri2 interface on OSX |
| - glx: __glX_tls_Context should use __THREAD_INITIAL_EXEC |
| - util: thread_local doesn't support to be exported on OSX |
| - glx: Convert tab to space in applegl_glx.c |
| - dri: Trim trailing space in dri/dri_util.* |
| - ci: Building osmesa on OSX |
| - util: Disable tests/sparse_array_test.cpp on MacOS as it's too slow |
| - mapi: Trim trailing spaces in stub.c and u_current.c |
| - meson: Remove usage of use_elf_tls |
| - egl: Remove usage of USE_ELF_TLS macro |
| - mapi: Remove usage of USE_ELF_TLS |
| - loader: Remove usage of USE_ELF_TLS in loader.c |
| - glx: Remove usage of USE_ELF_TLS |
| - util: Remove usage of USE_ELF_TLS in u_thread.h |
| - meson: Remove pre_args += '-DUSE_ELF_TLS' |
| - glx: Remove usage of pipe_tsd_set and pipe_tsd_get |
| - auxiliary: Remove pipe_tsd |
| - d3d12: Fixes compile error with mingw/gcc-x64 when static linkage to runtime library |
| - microsoft/clc: Fixes compiling errors with clang/mingw64 in clc/clc_compiler_test.cpp |
| - util: Fixes invalid assumption that return non null by function util_format_fetch_rgba_func |
| - c11: #include <threads.h> when the os/platform provide it |
| |
| Yurii Kolesnykov (2): |
| |
| - Remove Travis CI config |
| - Init macOS GitHub Action |
| |
| Zack Rusin (4): |
| |
| - svga: Don't try to build x86/x64 assembly on different arch's |
| - svga: finish readbacks before mapping resources |
| - svga: Use direct maps when GB objects are present |
| - svga: Add support for SVGAv3 |
| |
| Zhang, Jianxun (1): |
| |
| - iris: Wa_14016820455 for GFX_VERx10 == 12.5 |
| |
| illiliti (1): |
| |
| - Use proper types for meson objects |
| |
| jheaff1 (1): |
| |
| - build(glx): Fix build by adding missing deps |
| |
| naveen (1): |
| |
| - github/ci: Set permissions for GitHub actions |
| |
| newbluemoon (1): |
| |
| - nine: replace ulimit with sysconf call |
| |
| prashanth (1): |
| |
| - vulkan/device_select: force default device if environment variable set |
| |
| sjfricke (2): |
| |
| - isl: fix bug where sb.MOCS is not being set |
| - anv: fix assert to build with shader cache disabled |
| |
| xperia64 (1): |
| |
| - wgl: Add driver_zink as a dependency of the wgl frontend |
| |
| İlhan Atahan (1): |
| |
| - Add Adreno 616 and 620 to use turnip on these GPU's . |