| Mesa 24.2.0 Release Notes / 2024-08-14 |
| ====================================== |
| |
| Mesa 24.2.0 is a new development release. People who are concerned |
| with stability and reliability should stick with a previous release or |
| wait for Mesa 24.2.1. |
| |
| Mesa 24.2.0 implements the OpenGL 4.6 API, but the version reported by |
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / |
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. |
| Some drivers don't support all the features required in OpenGL 4.6. OpenGL |
| 4.6 is **only** available if requested at context creation. |
| Compatibility contexts may report a lower version depending on each driver. |
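
Because the reported version varies by driver and context type, applications usually parse the string returned by glGetString(GL_VERSION) rather than assume 4.6. The sketch below shows one way to extract the major and minor numbers. It assumes the string starts with ``major.minor`` (optionally preceded by an ``OpenGL ES`` prefix), and the sample strings in the usage note are illustrative, not captured from a specific driver. ``parse_gl_version`` is a hypothetical helper name.

```python
import re

# Matches "4.6 ...", "OpenGL ES 3.2 ..." and "OpenGL ES-CM 1.1 ...".
# Assumption: the version number is the first "major.minor" token in the string.
_GL_VERSION_RE = re.compile(r"^(?:OpenGL ES(?:-[A-Z]+)? )?(\d+)\.(\d+)")


def parse_gl_version(version_string):
    """Return (major, minor) parsed from a GL_VERSION-style string."""
    match = _GL_VERSION_RE.match(version_string)
    if match is None:
        raise ValueError(f"unrecognized GL_VERSION string: {version_string!r}")
    return int(match.group(1)), int(match.group(2))


def supports_at_least(version_string, major, minor):
    """True if the reported version is greater than or equal to major.minor."""
    return parse_gl_version(version_string) >= (major, minor)
```

For example, ``supports_at_least("4.6 (Core Profile) Mesa 24.2.0", 4, 5)`` evaluates to ``True``, while a driver reporting ``"3.1 Mesa 24.2.0"`` in a compatibility context would fail the same check.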
| |
| Mesa 24.2.0 implements the Vulkan 1.3 API, but the version reported by |
| the apiVersion property of the VkPhysicalDeviceProperties struct |
| depends on the particular driver being used. |
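
The apiVersion property is a packed 32-bit integer, so it has to be decoded before it can be compared against 1.3. The sketch below mirrors the bit layout used by the Vulkan headers' ``VK_MAKE_API_VERSION`` and ``VK_API_VERSION_*`` macros. The Python function names are hypothetical, and obtaining the value from a real device is left out.

```python
def make_api_version(variant, major, minor, patch):
    """Pack a version the same way VK_MAKE_API_VERSION does."""
    return (variant << 29) | (major << 22) | (minor << 12) | patch


def decode_api_version(api_version):
    """Split a packed apiVersion into (variant, major, minor, patch)."""
    variant = (api_version >> 29) & 0x7   # top 3 bits
    major = (api_version >> 22) & 0x7F    # next 7 bits
    minor = (api_version >> 12) & 0x3FF   # next 10 bits
    patch = api_version & 0xFFF           # low 12 bits
    return variant, major, minor, patch


def driver_supports(api_version, major, minor):
    """True if the packed version is at least major.minor (variant ignored)."""
    _, drv_major, drv_minor, _ = decode_api_version(api_version)
    return (drv_major, drv_minor) >= (major, minor)
```

A driver that reports a value decoding to 1.2 would make ``driver_supports(value, 1, 3)`` return ``False`` even though Mesa 24.2.0 as a whole implements Vulkan 1.3.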
| |
| SHA256 checksum |
| --------------- |
| |
| :: |
| |
| c02bb72cea290f78b11895a0c95c7c92394f180d7ff66d4a762ec6950a58addf mesa-24.2.0.tar.xz |
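
A downloaded tarball can be checked against the checksum above before building. This is a minimal sketch using Python's standard ``hashlib``. The function names and the local file path are assumptions, and the file is hashed in chunks so the whole archive is never held in memory.

```python
import hashlib
import hmac

# Checksum published in these release notes for mesa-24.2.0.tar.xz.
EXPECTED_SHA256 = "c02bb72cea290f78b11895a0c95c7c92394f180d7ff66d4a762ec6950a58addf"


def sha256_of_stream(stream, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a binary stream, read in chunks."""
    digest = hashlib.sha256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_hex=EXPECTED_SHA256):
    """Hash the file at `path` (a hypothetical local path) and compare."""
    with open(path, "rb") as tarball:
        actual = sha256_of_stream(tarball)
    return hmac.compare_digest(actual, expected_hex.lower())
```

Calling ``matches_checksum("mesa-24.2.0.tar.xz")`` in the download directory returns ``True`` only if the file's digest equals the published value.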
| |
| |
| New features |
| ------------ |
| |
| - VK_KHR_dynamic_rendering_local_read on RADV |
| - VK_EXT_legacy_vertex_attributes on lavapipe, ANV, Turnip and RADV |
| - VK_MESA_image_alignment_control on RADV |
| - VK_EXT_shader_replicated_composites on ANV, dozen, hasvk, lavapipe, nvk, RADV, and Turnip |
| - VK_KHR_maintenance5 on v3dv |
| - VK_KHR_maintenance7 on RADV |
| - VK_EXT_depth_clamp_zero_one on v3dv |
| - GL_ARB_depth_clamp on v3d |
| - The shader cache now defaults to a new implementation that reduces filesystem overhead. |
| |
| |
| Bug fixes |
| --------- |
| |
| - u_debug_stack_test.capture_not_overwritten fails on i386 |
| - turnip: vulkaninfo crashed |
| - turnip-freereno Build error (/usr/local/include/xf86drm.h:40:10: fatal error: drm.h: No such file or directory) |
| - tu: compilation failing when compiling turnip with only kgsl and X11 enabled |
| - samplerCube constructor in fragment shader no longer converting uvec2 bindless texture handles (segmentation fault, core dumped) |
| - anv: gen9.5 flakiness in dEQP-VK.multiview.dynamic_rendering.depth.* |
| - vaapi decoding corruption with green blocks |
| - interpolateAt precision lowering unhandled by glsl_to_nir() |
| - [anv] CS2 crashes on LNL |
| - [anv] Dota2 does not start on LNL |
| - [radeonsi][bisected][regression] glClientWaitSync() quickly times out with INT64_MAX timeout |
| - d3d10umd: Build regression in 24.2.0-devel |
| - zink/tu: glcts flake on a750 |
| - nouveau: advertises GL_EXT_memory_object without implementing \`*UID\` callbacks |
| - LIBGL_DRIVERS_PATH gone |
| - [Regression][Vulkan][TGL][Bisected]vkCmdCopyQueryPoolResults failed to write buffer with compute pipeline on Mesa 24.1 |
| - Worms Revolution: not rendering explosion effects |
| - crash on pushbuf_validate nvc0_blit do_blit_framebuffer |
| - piglit: cl-api-build-program crashes |
| - i915g: glGenerateMipmap() fails with 2048×2048 textures |
| - [radeonsi] Asterix & Obelix XXLRomastered: river misrendered (completely black) |
| - Build fails without Vulkan |
| - No dependency check for PyYAML in meson.build |
| - GPU Hang in Metal Gear Rising Revengeance |
| - VK_ERROR_DEVICE_LOST A770 DXVK Fallout 3 |
| - [Bisected] Recent compile issue in libnak |
| - anv: Wrong push constant values for bytes? |
| - anv: dEQP-VK.protected_memory tests GPU hang on MTL |
| - RustiCL (or maybe not...): radeonsi freezes after 2 hours of simulation, zink works just fine |
| - ci_run_n_monitor.py doesn't monitor manual jobs |
| - Crash in util_idalloc_resize due to glBindTexture with a way-too-large ID |
| - mesa-24.1.2 fails to compile: ast_to_hir.cpp:5371:39: error: '%s' directive argument is null |
| - [regression][bisected] VMware Xv video displays as black rectangle |
| - Blender 4.2,4.3 crashes when rendering with motion blur on RDNA3 cards (OpenGL/radeonsi) |
| - nvk: regression with multiple games crashing |
| - Transparent background in Blender 3D view with nouveau |
| - turnip: latest git does not build anymore |
| - ACO Unimplemented intrinsic instr |
| - RADV/ACO: assert on per-sample interpolation |
| - radv: large descriptor layout creation is slow |
| - Gnome shell (wayland) crashes when opening any window |
| - DRI Intel drivers fix a problem in Redhat 7 (Mesa 18), but are not included for Redhat 8 (Mesa versions v23, v24) |
| - Vulkan: ../src/nouveau/vulkan/nvk_physical_device.c:1109: VK_ERROR_INCOMPATIBLE_DRIVER |
| - RADV: Smooth lines affect triangle rendering |
| - [armhf build error][regression] error: ‘StringMapIterator’ was not declared in this scope; did you mean ‘llvm::StringMapIterator’? |
| - Build fails with latest llvm 19: error: no matching function for call to unwrap(LLVMOpaqueModule*&) |
| - tu: support KHR_8bit_storage |
| - Incorrect colours on desktop and apps |
| - nir: Incorrect nir_opt_algebraic semantics for signed integer constants causing end-to-end miscompiles |
| - src/gallium/frontends/clover/meson.build:93:40: ERROR: Unknown variable "idep_mesaclc". |
| - panfrost: mpv is broken on T604 |
| - Nightly CI is broken |
| - [radv] GPU hang in Starfield on RX 5700 XT |
| - anv, isl, iris: Clarify and improve CCS + FCV on gfx12 |
| - isl: CPCB horizontal and vertical alignment requirements unknown |
| - Indika: flickering black artifacting on the snow |
| - intel/isl: Split Xe2 changes into new files of Xe2. |
| - rusticl: Generated rusticl_mesa_bindings.c fails to find include |
| - isl: Remove 512B pitch requirement for non-displayable CCS |
| - MESA 24.1 - broken zink OpenGL under Windows |
| - Blue flickering rectangles on AMD RX 7600 |
| - GPU hangs on AMD Radeon RX 6400 on a fragment shader |
| - v3dv: vkcube-wayland crashes |
| - intel/brw: scoreboarding regression |
| - regression in !29436 for radv+angle on stoney |
| - [radv][regression] Starfield invisible terrain on a 7900 XTX |
| - free_zombie_shaders() leave context in a bad state (access violation occurs) |
| - r300: X11 fails to start with the modesetting driver (glamor is broken with R300/R400 gpus). |
| - [NINE]Far Cry 1 trees flicker regression[bisected][traces] |
| - Vulkan: Most sync2 implementations are missing new access flags |
| - Incorrect buffer_list advance when writing disjoint image descriptors |
| - ANV: Block shadows in Cyberpunk on Intel A770 |
| - ACO ERROR: Temporary never defined or are defined after use |
| - [ANV] Graphics memory allocation in Total War: Warhammer 3 |
| - DG2: God of War trace fails to play |
| - Borderlands trace fails to play on dg2 |
| - NVK: Vulkan apps simply terminated with segfault under wayland and Xwayland |
| - NVK: VK_ERROR_OUT_OF_DEVICE_MEMORY on swapchain creation |
| - anv/zink regression: piglit.spec.arb_fragment_layer_viewport.layer-no-gs |
| - [anv] failures when upgrading vulkancts 1.3.6 -> 1.3.7 on intel mesa ci |
| - RustiCL: deadlock when calling clGetProfilingInfo() on callbacks |
| - [Intel][Vulkan][Gen12] Vulkan compute shader is 3x slower than the same OpenCL kernel |
| - turnip: Broken AHB support |
| - zink: nir validation failures in Sparse code |
| - nir: nir_opt_varyings uses more stack than musl libc has |
| - dEQP-VK.pipeline.pipeline_library.shader_module_identifier.pipeline_from_id.graphics regression |
| - freedreno + perfetto missing dependency on adreno_common.xml.h |
| - anv: unbounded shader cache |
| - radv: Crash due to nir validation fail in Enshrouded |
| - vulkan/wsi/wayland: valgrind reports invalid read in \`vk_free\` call in \`wsi_wl_surface_analytics_fini\` |
| - android: sRGB configs no longer exist after !27709 |
| - bisected: turnip: deqp regressions |
| - aco: Radeonsi unable to use rusticl |
| - anv: clean up default_pipeline_cache in anv_device |
| - [24.1-rc4] fatal error: intel/dev/intel_wa.h: No such file or directory |
| - Turnip driver is crashing since turnip: ANB/AHB support got merged |
| - vcn: rewinding attached video in Totem cause [mmhub] page fault |
| - When using amd gpu deinterlace, tv bt709 properties mapping to 2 chroma |
| - a530: ir3_context_error assertion (unknown vertex shader output name: VARYING_SLOT_EDGE) |
| - VCN decoding freezes the whole system |
| - [RDNA2] [AV1] [VAAPI] hw decoding glitches in Thorium 123.0.6312.133 after https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28960 |
| - radv regression bisected: Flickering textures (vega) |
| - [Regression][Bisected] EGL/Wayland: QT applications terminated by SIGSEGV (Address boundary error) when using dGPU |
| - WSI: Support VK_IMAGE_ASPECT_MEMORY_PLANE_i_BIT_EXT for DRM Modifiers in Vulkan |
| - nvk: Tracker issue for gamescope support |
| - nvk: Implement VK_EXT_image_drm_format_modifier |
| - nvk: NVK_DEBUG=zero_memory is hitting an assert |
| - nvk: Implement VK_EXT_conservative_rasterization |
| - zink sparse: Improper semaphore handling |
| - zink sparse: Reference to mip tails should be refcounted |
| - radv: Enshrouded GPU hang on RX 6800 |
| - NVK Zink: Wrong color in Unigine Valley benchmark |
| - intel vulkan incremental build takes forever |
| - 24.0.6: build fails |
| - shader with multidimensional array in shader storage buffer takes too long to compile |
| - panfrost: T604 issue with using u32 for flat varyings |
| - lp_screen.c:601:4: error: 'snprintf' will always be truncated; specified size is 16, but format string expands to at least 17 |
| - [anv] FINISHME: support YUV colorspace with DRM format modifiers |
| - gen9/11 test became flaky: piglit.spec.!opengl 1_4.blendminmax |
| - mesa 24 intel A770 KOTOR black shadow smoke scenes |
| - nvk: Implement VK_EXT_pipeline_robustness |
| - [bisected][regression] kitty fails to start due to \`glfwWindowHint(GLFW_SRGB_CAPABLE,true)\` |
| - r600: bisected 5eb0136a3c561 breaks a number of piglits |
| - [bdw][bisected][regression] assertion failure in nir_validate.c |
| - Graphical glitches in RPCS3 after updating Vulkan Intel drivers |
| - [R600] OpenGL and VDPAU regression in Mesa 23.3.0 - some bitmaps get distorted. |
| - VAAPI radeonsi: VBAQ broken with HEVC |
| - tu: weird fail in packing tests |
| - radv/video: 10-bit support |
| - radv: vkCmdWaitEvents2 is broken |
| - anv: add support for EDS3::extendedDynamicState3AlphaToCoverageEnable |
| - ci: switch from CI_JOB_JWT to id_tokens |
| - Zink: enabled extensions and features may not match |
| - anv: share embedded samplers |
| |
| |
| Changes |
| ------- |
| |
| Adam Jackson (8): |
| |
| - mesa: Enable EXT_shadow_samplers for GLES2 |
| - gallium: Rename ${target}/target.c to ${target}/{$target}_target.c |
| - treewide: Include mesa_interface.h not dri_interface.h |
| - mesa_interface: Set ourselves free |
| - mesa_interface: Move out of GL/internal/ |
| - gallium/dril: Compatibility stub for the legacy DRI loader interface |
| - dri: Let dril handle the DRI driver link farm |
| - gallium/meson: Deconflate swrast/softpipe/llvmpipe |
| |
| Adrian Perez de Castro (1): |
| |
| - Revert "egl/wayland: Remove EGL_WL_create_wayland_buffer_from_image" |
| |
| Alejandro Piñeiro (29): |
| |
| - v3dv/cmd_buffer: always bind pipeline static state |
| - v3dv/ci: dEQP-VK.dynamic_state.*.double_static_bind are fixed now |
| - v3dv: port dynamic state tracking to use Mesa Vulkan |
| - v3dv: provide implementation for vkCmdBindVertexBuffers2 |
| - v3dv: provide implementation for CmdSetViewportWithCount |
| - v3dv: CullMode and FrontFace are dynamic now |
| - v3dv: DepthBoundsTestEnable is dynamic now |
| - v3dv: move depth CFG bits setting to cmd buffer emission |
| - v3dv: ez_state/incompatible_ez_test could be recomputed at cmd_buffer |
| - v3dv: PrimitiveTopology is now dynamic |
| - v3dv: StencilOp and StencilTestEnable are now dynamic |
| - v3dv/ci: update expected list due VK_EXT_extended_dynamic_state |
| - v3dv: enable VK_EXT_extended_dynamic_state |
| - v3dv/cmd_buffer: missing updates due PrimitiveTopology being dynamic now |
| - v3dv: fixes StencilTestEnable handling |
| - v3dv: PrimitiveRestartEnable is now dynamic. |
| - v3dv: DepthBiasEnable is dynamic now |
| - v3dv: SetRasterizerDiscardEnable is dynamic now |
| - v3dv: enable VK_EXT_extended_dynamic_state2 |
| - v3dv: add debug option to disable custom pipeline caches for meta operations |
| - v3dv/meta_clear: take into account multiview for the custom clear pipeline caches |
| - v3dv/meta_clear: use v3dv_renderpass used as parameter |
| - v3dv/device: compute maxDescriptorSet*Limits multiplying per-stage by 4 |
| - v3dv/device: set DescriptorUpdateAfterBind limits |
| - v3d/devinfo: unify comment style |
| - broadcom: move HW-dependant constants to v3d_device_info |
| - v3d,v3dv: document cl_emit_with_prepacked |
| - v3dv/pipeline: ensure vk_graphics_pipeline_all_state alive when still needed |
| - drm-shim: stub synobj_timeline_wait and query ioctl |
| |
| Aleksi Sapon (5): |
| |
| - lavapipe: fixes for macOS support |
| - lavapipe: build "Windows" check should use the host machine, not the \`platforms\` option. |
| - util: fix memory related OS calls on macOS |
| - wsi: fix compilation on macOS |
| - util: macOS support for cnd_monotonic |
| |
| Alessandro Astone (1): |
| |
| - egl/gbm: Walk device list to initialize DRM platform |
| |
| Alex Deucher (1): |
| |
| - ac/surface: fix version check for gfx12 DCC |
| |
| Alexandre Marquet (2): |
| |
| - pan/mdg: quirk to disable auto32 |
| - panfrost: implement SFBD raw format support on v4 |
| |
| Alexandros Frantzis (2): |
| |
| - egl/wayland: Pass dri2_wl_formats to create_dri_image |
| - egl/wayland: Fail EGL surface creation if opaque format is unsupported |
| |
| Ali Homafar (1): |
| |
| - lavapipe: Set ICD api_version to 1.3 |
| |
| Alyssa Rosenzweig (222): |
| |
| - vulkan: optimize vk_dynamic_graphics_state_any_dirty |
| - vulkan: add helper to fill out spirv caps automatically |
| - nir/lower_subgroups: add filter |
| - nir/lower_subgroups: add generic scan/reduce lower |
| - nir/lower_subgroups: relax ballot_type_to_uint |
| - nir/lower_robust_access: also handle image derefs |
| - docs: add header-stub for vk_enum_to_str |
| - vulkan: add vk_debug_ignored_stype helper |
| - nvk: use common stype debug |
| - broadcom: use common stype debug |
| - pvr: use common stype debug |
| - anv,hasvk: use common stype debug |
| - dzn: use common stype debug |
| - nir: add is_first_fan_agx sysval |
| - nir: add texops for AGX border colour emulation |
| - nir: add quad_ballot_agx intrinsic |
| - nir,agx: add depth=never workaround |
| - nir,agx: fix load_active_subgroup_index |
| - compiler: add ACCESS_IN_BOUNDS_AGX |
| - agx: optimize and/or with booleans |
| - agx: enable more lowering |
| - agx: fix query LOD of array |
| - agx: fix some ms texture packing |
| - agx: handle cross-workgroup memory barriers |
| - agx: allow 8-bit bcsel |
| - agx: fix phi translation corruption |
| - agx: fix load_helper_invocation with sample shading |
| - agx: fix frag sidefx with sample shading |
| - agx: handle subgroup barriers |
| - agx: fix spilling inside sample loop |
| - agx: switch to demote internally |
| - agx: lower nir_intrinsic_load_num_subgroups |
| - agx: delete unreachable blocks |
| - agx: model more subgroup ops |
| - agx: lower shuffle |
| - agx: handle non-immediate shuffles in divergent CF |
| - agx: handle quad_broadcast |
| - agx: handle quad swaps |
| - agx: add missing b2b16 implementation |
| - agx: forbid uniforms on ballots |
| - agx: lower 8-bit subgroups |
| - agx: flesh out subgroup lowering |
| - agx: report uses_txf |
| - agx: expose agx_link_libagx |
| - agx: document another sample_mask restriction |
| - agx: reserve scratch registers for mem<-->mem swaps |
| - agx: optimize txf with lod 0 |
| - agx: fix bogus unit test |
| - agx: stash early_fragment_test info |
| - agx: handle quad reduce |
| - agx: implement quad_ballot |
| - agx: lower more quad ops |
| - agx: optimize elect() |
| - agx: fix UB in cursor comparison |
| - ail: constify everything |
| - asahi: mark eMRT loads as in-bounds |
| - asahi: calculate validity when unpacking |
| - asahi: agx_translate_sample_count |
| - asahi: assert bo size > 0 |
| - asahi: unwrap pointless null check |
| - asahi: implement PIPE_CAP_QUERY_MEMORY_INFO |
| - asahi: rm unused #include |
| - asahi: resize key |
| - asahi: cleanup fs epilog link info |
| - asahi: move agx_link_varyings_vs_fs |
| - asahi: fix prolog emit |
| - asahi: pack UVS key properly |
| - asahi: plumb shader stage into info |
| - asahi: get debug in common |
| - asahi: rm deadcode |
| - asahi: drop rgb10a2_sint rendering |
| - asahi: add missing rgba4 format |
| - asahi: fix 1D array atomics |
| - asahi: fix txf/image_load robustness with arrays |
| - asahi: rework VBO lower for divisor=0 |
| - asahi: mv AGX_MAX_OCCLUSION_QUERIES define |
| - asahi: handle agx_ppp_fragment_face_2 with no info |
| - asahi: clarify format code in image lowering |
| - asahi: fix rgb565 blending |
| - asahi: fix store_output component/offset |
| - asahi: fix sample ID with multiblock epilogs |
| - asahi: lower texture instructions with epilogs |
| - asahi: fix cull unknown bits |
| - asahi: simplify image atomic lowering |
| - asahi: move primitive MSAA field |
| - asahi: free libagx if we don't use a ralloc memctx |
| - asahi: eliminate troublesome empty uniforms |
| - asahi: rearrange VS uniforms |
| - asahi: set src_type for store_output |
| - asahi: rm dead code |
| - asahi: add agx_index_size_to_B helper |
| - asahi: move some GS lowering into lower_gs |
| - asahi: don't use load_num_vertices in geometry shaders |
| - asahi: mv vertex_id_for_topology_class into GS lowering |
| - asahi: rm another num_vertices use |
| - asahi: rm dated comment |
| - asahi: rm unused lower |
| - asahi: rm num_vertices uses |
| - asahi: rm redundant input_vertices |
| - asahi: mv initialization of grid z for indirect GS |
| - asahi: rm more dead lowering |
| - asahi: rm always true param |
| - asahi: update comment |
| - asahi: update comment for maint5 |
| - asahi: eliminate num_workgroups for VS->GS + VS->TCS |
| - asahi: drop bogus assertion |
| - asahi: pack tilebuffer usc word ahead-of-time |
| - asahi: add agx_ppp_push_merged helper |
| - asahi: use ppp_merge |
| - asahi: don't allocate varyings ourselves |
| - asahi: don't allocate for ppp updates |
| - asahi: extend varying linking for tri fan weirdness |
| - asahi: plumb tri fan flatshading through common |
| - asahi: don't ralloc in agx_fast_link |
| - asahi: extend epilog key for force early frag handling |
| - asahi: don't reserve extra UVS space for layer |
| - libagx: use sub_group_scan_inclusive_add |
| - libagx: add query copy kernel |
| - libagx: don't use get_group_id() |
| - asahi/decode: QoL improvements |
| - asahi: track imports for decode |
| - asahi: clean up bg/eot counts |
| - asahi: rename meta -> bg/eot |
| - asahi: don't allocate for USC words |
| - asahi: split frag shader words |
| - asahi: split CDM Launch words |
| - asahi: unify naming for COUNTS structs |
| - nir/tex_instr_result_size: handle subpass_ms |
| - nir/lower_point_size: support lowered i/o |
| - asahi/decode: drop Apple-specific decode check |
| - libagx: rm unused field |
| - libagx: fix static assert |
| - libagx: fix triangle fan + prim restart + GS/XFB |
| - libagx: drop unused !indexed path |
| - libagx: add libagx_copy_xfb_counters helper |
| - asahi: be robust against out of sync shader info |
| - agx: fix draw param gather for sw vs |
| - asahi: split param structs for GS internal kernel |
| - agx: rework libagx I/O lowering |
| - asahi: add missing lowerings |
| - asahi: force bindless for eMRT |
| - asahi: bounds check eMRT stores |
| - asahi: support bigger buffer textures |
| - asahi: add AGX_TEXTURE_FLAG_CLAMP_TO_0 flag |
| - agx: handle discard with force early tests |
| - asahi: pack blend key |
| - agx: switch to combined clip/cull |
| - asahi: add flag controlling sample mask without MSAA |
| - asahi: use scalar outputs for rast shaders |
| - asahi: move null descriptor routines to common |
| - asahi: implement rba2 for uniform texel buffers |
| - asahi: implement rba2 for storage texel buffers |
| - agx: prepare for lower_wpos_center |
| - asahi: extract agx_calculate_vbo_clamp |
| - agx: fix indirect CF accounting |
| - mesa: fix duplicate initializer |
| - nir/lower_wpos_center: clean up |
| - nir/builtin_builder: factor out nir_build_texture_query |
| - asahi: use nir_build_texture_query |
| - gallium: remove ability to probe asahi |
| - asahi: add broken bits of unstable Linux UAPI |
| - agx: fix 64-bit bcsel ingestion |
| - agx: fix fmin/fmax with (-0, 0) pair |
| - libagx: fix uint8_t definition |
| - libagx: make index buffer fetch robust |
| - libagx: generalize query copies |
| - asahi: implement rba2 semantics for vbo |
| - asahi: fix vbo clamp with stride=0 |
| - asahi: implement robustness2 for msaa image stores |
| - asahi: be more clever about GS side effects |
| - nir/lower_robust_access: handle MSAA images |
| - nir: add nir_metadata_control_flow |
| - treewide: use nir_metadata_control_flow |
| - nir: document restriction on load_smem_amd constantness |
| - vulkan: reference count vk_descriptor_update_template |
| - vulkan: handle push DUT with emulated secondaries |
| - vulkan: fix potential UAF with vk_cmd_enqueue_CmdPushDescriptorSetKHR |
| - vulkan: handle enqueueing CmdPushConstants2KHR |
| - vulkan: handle enqueueing CmdPushDescriptorSet2KHR |
| - lvp: use common push constant enqueue |
| - lvp: use common push descriptor set enqueue |
| - lvp: fix silly casting for sampler desc updates |
| - lvp: use common descriptor update templates |
| - nir/format_convert: remove unorm bit size assert |
| - nir: add nir_def_replace helper |
| - treewide: use nir_def_replace sometimes |
| - agx: fix insidious ballot optimizer bug |
| - agx: add unit test for ballot bug |
| - agx: set discard_is_demote |
| - nir: add nir_break_if helper |
| - treewide: use nir_break_if |
| - nir: fix miscompiles with rules with INT32_MIN |
| - nir/algebraic: explicitly suffix constants |
| - nir/opt_constant_folding: fix array size define |
| - zink: move print_pipeline_stats |
| - zink: print pipeline stats for compute shader-db |
| - util: add dui/uid helpers |
| - nir: add nir_alu_instr float controls queries |
| - nir/search: use ALU float control helpers |
| - nir: use MIN2/MAX2 opcodes for imin/umax folding |
| - nir: strengthen fmin/fmax definitions with signed zero |
| - glsl/float64: handle signed zero with min/max |
| - nir/lower_double_ops: handle signed zero with min/max |
| - nir/lower_alu: remove dead #define |
| - nir: add lower_fminmax_signed_zero |
| - agx: set lower_fminmax_signed_zero |
| - agx: do not flush denorms for fp16 fmin/fmax |
| - asahi: drop old comment |
| - asahi: drop stale comment |
| - asahi: make agx_pack opencl compatible |
| - asahi: tuck in null query check |
| - libagx: specify heap size explicitly |
| - asahi,libagx: tessellate on device |
| - libagx: add kernel for incrementing CS counter |
| - asahi: handle CS pipeline stat with indirect dispatch |
| - libagx: handle VS/IA pipeline stats on GPU |
| - asahi: eliminate load_num_workgroups from TCS unrolled ID |
| - nouveau/drm-shim: set ram_user |
| - nvk: add instruction count exec property |
| - nir/opt_constant_folding: fix array size define, pt 2 |
| - zink: remove extraneous \\n with shaderdb |
| - zink: match shader-db report.py format |
| |
| Amber (1): |
| |
| - tu: Disable depth and stencil tests when attachment state requires it |
| |
| Amit Pundir (1): |
| |
| - android: Fix zink build failure |
| |
| Amol Surati (1): |
| |
| - nine: avoid using post-compacted indices with state expecting pre-compacted ones |
| |
| Antoine Coutant (1): |
| |
| - drisw: fix build without dri3 |
| |
| Antonio Ospite (2): |
| |
| - meson: fix deprecation warning in create-android-cross-file.sh |
| - android: simplify building libgallium_dri on Android |
| |
| Arthur Huillet (4): |
| |
| - nvk: update 3d classes for conservative raster |
| - nvk: implement VK_EXT_conservative_rasterization |
| - nvk: import SPH headers files from open-gpu-doc |
| - nvk: generate Rust bindings from SPH header files |
| |
| Asahi Lina (1): |
| |
| - asahi: Make asahi_clc build work on x86_64->x86 builds |
| |
| Bas Nieuwenhuizen (5): |
| |
| - radv: Use zerovram for Enshrouded. |
| - util/disk_cache: Fix cache marker refresh. |
| - util/disk_cache: Delete the old multifile cache if using the default. |
| - util/cache_test: Add tests for old cache deletion. |
| - relnotes: Add an entry about the new cache default. |
| |
| Benjamin Lee (1): |
| |
| - vk/graphics_state: Add last bits for extraPrimitiveOverestimationSize |
| |
| Boris Brezillon (81): |
| |
| - panvk: Prepare things so we can push sysvals to our push uniform buffer |
| - panvk: Put dynamic uniform buffers at the end of the UBO array |
| - panvk: Move the dynamic SSBO descriptors to their own UBO |
| - panvk: Prepare dynamic buffer descriptors at bind time |
| - panvk: Lower sysvals to push uniforms |
| - panvk: Kill panvk_sysval_vec4 |
| - panfrost: do not write outside num_wg_sysval |
| - panfrost: Add the BO containing fragment program descriptor to the batch |
| - pan/kmod: Fix a syncobj leak in the panthor backend |
| - pan/kmod: Make default allocator thread-safe |
| - panvk: clang-format fixups |
| - panvk: Kill panvk_queue_get_device() |
| - panvk: Get rid of panvk_descriptor_state::dirty |
| - panvk: Move panvk_cmd_state::batch to panvk_cmd_buffer::cur_batch |
| - panvk: Kill unused dynamic state bits |
| - panvk: Get rid of special attribute support |
| - panvk: Split the graphics and compute state at the cmd_buffer level |
| - panvk: Split compute/graphics pipeline objects |
| - panvk: Use memory pools to store pipeline shaders/descriptors |
| - panvk: Kill the panvk_pipeline_builder object |
| - panvk: Transition the graphics pipeline logic to vk_graphics_pipeline_state |
| - panvk: Fully transition to vk_viewport_state |
| - panvk: Fully transition to vk_rasterization_state |
| - panvk: Fully transition to vk_input_assembly_state |
| - panvk: Use vk_color_blend_state to fill our blend constant |
| - panvk: Fully transition to vk_depth_stencil_state |
| - panvk: Fully transition to vk_multisample_state |
| - panvk: Set unused attribute buffers descriptors to zero |
| - panvk: Rename non_vs_attribs into img_attribs |
| - panvk: Prevent re-emission of image attributes used in vertex shaders |
| - panvk: Move VS attribute/buffer state to panvk_cmd_graphics_state |
| - panvk: Emit VS-accessible image attributes at a fixed offset |
| - panvk: Leave holes in the attribute locations used by a shader |
| - panvk: Fully transition to vk_vertex_binding_state |
| - pan/blend: Move constant inlining out of pan_blend_create_shader() |
| - pan/blend: Expose pan_blend_create_shader() |
| - panvk: Add a blend library to deal blend shaders/descriptors |
| - panvk: Don't pass the stage to shader_create() |
| - panvk: Simplify shader initialization in the pipeline logic |
| - panvk: Fix/simplify the shader linking logic |
| - panvk: Replace the stages array in panvk_draw_info by vs/fs fields |
| - panvk: Move fs_rsd fields to an fs sub-struct |
| - panvk: Emit the fragment shader RSD dynamically |
| - panvk: Lower global memory IOs |
| - panvk: Clean Midgard leftovers in the cmd_close_batch() path |
| - panvk: Use vk_pipeline_shader_stage_to_nir() |
| - panvk: Kill cmd_get_tiler_context() |
| - panvk: Make sure we dump memory mappings before crashing |
| - pan/decode: Be robust to NULL texture payload |
| - pan/desc: Add missing format in translate_s_format() |
| - pan/jc: Drop unused pool argument passed to pan_jc_add_job() |
| - panvk: Add a render state to panvk_cmd_graphics_state |
| - panvk: Take VK_RENDERING_{RESUM,SUSPEND}ING_BIT flags into account |
| - panvk: Force a preload when the render area is not 32x32 aligned |
| - panvk: Skip depth/stencil attachments with non-matching aspect mask |
| - panvk: Fix dynamic rendering with images containing both depth and stencil |
| - panvk: Make sure we don't lose clear-only operations |
| - panvk: Make sure we run the fragment shader if alpha_to_coverage is enabled |
| - panvk: Make sure replay of command buffers containing Dispatch calls works |
| - panvk: Override the default GetRender[in]AreaGranularityKHR() |
| - egl: Use gbm_bo_create_with_modifiers2() when the surface has non-zero flags |
| - panvk: Fix formatting around OpaqueCaptureAddress implementation |
| - panvk/ci: Flag exact_sampling.*.edge_right test as fails |
| - pan/bi: Make sure global loads/stores don't exceed 16 bytes |
| - pan/bi: Fix dynamic indexing of push constants |
| - panvk: Fix Cube/2DArray/3D img -> buf copies |
| - panvk: Don't bail out when allocationSize is zero in AllocateMemory() |
| - panvk: Prepare for Valhall image views |
| - panvk: Prepare for Valhall buffer views |
| - panvk: Prepare things for compiling valhall source files |
| - panvk: Extend Valhall descriptor set implementation to support Bifrost |
| - panvk: Overhaul the Bifrost descriptor set implementation |
| - panvk: Refcount private BOs |
| - panvk: Store private BOs in lists instead of dynarrays |
| - panvk: Prepare panvk_mempool for shared device memory pools |
| - panvk: Use memory pools for internal GPU data attached to vulkan objects |
| - pan/desc: Extend pan_emit_fbd() to support multilayer rendering |
| - pan/desc: Prepare things for fragment job chaining |
| - pan/blitter: Let pan_preload_fb() callers queue the jobs to the job chain |
| - panvk: Use IDVS jobs when we can |
| - panvk: Add support for layered rendering |
| |
| Caio Oliveira (35): |
| |
| - intel/brw: Don't print IP as part of the dump |
| - intel/brw: Hide register pressure information in dumps |
| - intel/brw: Use \`vNN\` instead of \`vgrfNN\` when printing instructions |
| - intel/brw: Fix commas when dumping instructions |
| - spirv: Add MESA_SPIRV_DEBUG=values to dump all values |
| - intel/brw: Track the number of uses of each def in def_analysis |
| - intel/brw: Fix typo in DPAS emission code |
| - intel/brw: Add unit tests for scoreboard handling FIXED_GRF with stride |
| - intel/brw: Make component_size() consistent between VGRF and FIXED_GRF |
| - glsl: Fix warning related to tg4_offsets in release mode |
| - intel/brw: Print SWSB information when dumping instructions |
| - intel/brw: Reorganize lowering of LocalID/Index to handle Mesh/Task |
| - anv: Use brw_nir_lower_cs_intrinsics for lowering Mesh/Task LocalID |
| - intel/brw: Remove unused brw_reg related functions |
| - intel/brw: Remove RALLOC helper from fs_reg |
| - intel/brw: Remove unused variable from test |
| - intel/brw: Move fs_reg data members up to brw_reg |
| - intel/brw: Use public inheritance for fs_reg/brw_reg |
| - intel/brw: Move most member functions from fs_reg to brw_reg |
| - intel/brw: Remove conversion from fs_reg to brw_reg |
| - intel/brw: Replace some fs_reg constructors with functions |
| - intel/brw: Remove duplicated functions between fs_reg/brw_reg |
| - intel/brw: Rename brw_reg() helper to brw_make_reg() |
| - intel/brw: Make fs_reg an alias of brw_reg |
| - intel/brw: Replace uses of fs_reg with brw_reg |
| - intel/brw: Rename fs_reg_* helpers to brw_reg_* |
| - intel/brw: Move brw_reg helpers into brw_reg.h |
| - intel/brw: Don't set acc_wr_control for Xe2 |
| - intel/brw: Use brw_inst_set_group() to set QtrCtrl and NibCtrl |
| - intel/brw: Account for reg_unit() in assembler |
| - intel/brw: Don't print extra newlines in assembler |
| - intel/brw: Split off assembler logic into library |
| - spirv: Don't warn about FPFastMathMode if not OpenCL |
| - intel/brw: Convert missing uses of ralloc to linear in fs_live_variables |
| - intel/elk: Convert missing uses of ralloc to linear in fs_live_variables |
| |
| Chia-I Wu (8): |
| |
| - gallium: add pipe_picture_desc::flush_flags |
| - frontends/va: track whether there are imported/exported surfaces |
| - frontends/va: set PIPE_FLUSH_ASYNC when possible |
| - radeonsi: prep for pipe_picture_desc::flush_flags |
| - radeonsi: respect pipe_picture_desc::flush_flags |
| - radv: check gs_copy_shader directly for executable props |
| - radv: make radv_pipeline_has_ngg static |
| - drm-shim: intercept access as well |
| |
| Christian Gmeiner (42): |
| |
| - isaspec: Add method to get the displayname of BitSetEnumValue |
| - isaspec: Improve 'meta' handling |
| - etnaviv: isa: Drop capturing of python output |
| - etnaviv: isa: Add clang-format special comments |
| - etnaviv: isa: Print dst_full for ALU |
| - etnaviv: isa: Switch to enum isa_thread |
| - etnaviv: isa: Add more flags to etna_inst |
| - etnaviv: isa: Rework modeling of left shift for store/load |
| - etnaviv: isa: Add name for full writemask |
| - mr-label-maker: Add teflon marker |
| - etnaviv: isa: Do src swizzle with isaspec |
| - clc: Always use spir for 32 bit |
| - etnaviv: Zero init all srcs passed to etna_emit_alu(..) |
| - ci: uprev mold to 2.32.0 |
| - gallium: Add vkms entrypoint |
| - nak: Move nak_optimize_nir declaration to nak_private.h |
| - meson: Update proc-macro2 subproject |
| - meson: Update syn subproject |
| - meson: Add pest rust dependencies |
| - meson: Add roxmltree rust dependency |
| - meson: Add indexmap rust dependencies |
| - etnaviv: isa: Add meta elements to instructions |
| - etnaviv: isa: Generate Rust FFI bindings for asm.h |
| - etnaviv: isa: Make header C++ safe |
| - etnaviv: isa: Add meson version check |
| - etnaviv: isa: Add IsaParser proc_macro_derive |
| - etnaviv: isa: Add struct etna_asm_result |
| - etnaviv: isa: Make etna_asm_result usable in Rust |
| - etnaviv: isa: Add EtnaAsmResultExt trait |
| - etnaviv: isa: Add parser module |
| - etnaviv: isa: Add C function impl |
| - etnaviv: isa: Add cli assembler |
| - etnaviv: isa: Extend disasm test |
| - ci/etnaviv: Drop shaders\@glsl-bug-110796 line |
| - etnaviv: isa: Drop 1:1 mapping of opc to bits |
| - etnaviv: isa: Add support for extended instructions |
| - nak: Update comment about explicit padding |
| - etnaviv: isa: Add support for bitset's displayname |
| - etnaviv: isa: Rework branch instruction |
| - nak: Set has_imad32 conditionally |
| - nak: Move imad late optimization to nir |
| - dri: fix driver names |
| |
| Christopher Michael (3): |
| |
| - v3d: Move spec\@arb_texture_view\@rendering-formats, Crash in broadcom-rpi4-fails |
| - v3d: Move spec\@!opengl 1.1\@getteximage-formats, Fail in broadcom-rpi4-fails |
| - broadcom: fix issue of ‘addr’ is used uninitialized |
| |
| Colin Marc (3): |
| |
| - radv/video: don't truncate frame_num and POC to 32 |
| - vulkan/video: generate profile_tier_level structure correctly |
| - vulkan/video: correctly set sub-layer ordering in H.265 VPS/SPS |
| |
| Collabora's Gfx CI Team (6): |
| |
| - Uprev Piglit to 7aa7bc1b01d57b4b091c4fc82a94a6ff47f38ebf |
| - Uprev Piglit to 8a6ce9c6fc5c8039665655bca4904d5601c6dba0 |
| - Uprev Piglit to e180f96239edba441f22f58dfc852cafb902844a |
| - Uprev Piglit to fdf3fc09deb6beecdf212e65a16c645112540b59 |
| - Uprev Piglit to 647d0725024f72bc49bbc91c686c5f61168a1fe8 |
| - Uprev Piglit to 582f5490a124c27c26d3a452fee03a8c85fa9a5c |
| |
| Cong Liu (1): |
| |
| - nir: Fix out-of-bounds access in ntt_emit_store_output() |
| |
| Connor Abbott (92): |
| |
| - ir3: Add scan_clusters_macro to ir3_valid_flags() |
| - ir3: Add scan_clusters.macro to is_subgroup_cond_mov_macro() |
| - ir3: Validate tied sources better |
| - ir3/ra: Don't demote movmsk instructions to non-shared |
| - ir3: Rewrite postsched dependency handling |
| - ir3/legalize: Use define for register size |
| - ir3: Rewrite regmask implementation |
| - ir3/ra: Prepare for shared half-regs |
| - ir3/ra: Fix printing shared reg file |
| - ir3/ra: Prepare for shared phis |
| - ir3: Fix lowering shared parallel copies with immed src |
| - ir3/lower_pcopy: Fix immed/const flags for copy from shared |
| - ir3: Fix shared parallel copy validation |
| - ir3: Don't use swz with shared registers |
| - ir3/lower_copies: Handle HW bug with shared half-floats |
| - ir3/lower_copies: Fix "inaccessible" half reg lowering with shared regs |
| - ir3/ra: Use ra_reg_get_num() for validating num |
| - ir3: Use INVALID_REG in array store |
| - ir3: Reset num when creating parallel copies |
| - ir3: Validate that shared registers are in-bound |
| - ir3: Allow propagation of normal->shared copies |
| - ir3: Moves with shared destination are always legal |
| - ir3/legalize: Take (ss) into account in WaR hazards |
| - ir3/legalize: Remove bad (eq) micro-optimization |
| - ir3/legalize: any/all/getone are non-prefetch helper users |
| - ir3: Use correct category for OPC_PUSH_CONSTS_LOAD_MACRO |
| - ir3: Add support for "scalar ALU" |
| - ir3: Implement source restrictions for shared ALU |
| - ir3: Validate scalar ALU sources |
| - ir3: Immediate source for stc is invalid |
| - ir3: Don't emit single-source collects |
| - ir3/cp: Support swapping mad srcs for shared regs |
| - ir3/cf: Don't fold shared conversions |
| - ir3: Distinguish lowered shared->normal moves |
| - ir3: Add support for ldc.u |
| - ir3: Add builder support for shared immediates |
| - ir3: Create reduce identity directly |
| - ir3: Make type_flags() return a bitmask enum |
| - ir3: Support scalar ALU in the builder |
| - ir3: Add scalar ALU-specific passes |
| - ir3: Get sources before emitting scan_clusters.macro |
| - ir3: Rewrite shared reg handling when translating from NIR |
| - ir3: Directly use shared registers when possible |
| - ir3/nir: Fix imadsh_mix16 definition |
| - ir3: Use scalar ALU instructions when possible |
| - ir3: Don't scalarize all SSBO instructions |
| - ir3: Don't manually scalarize SSBO loads |
| - freedreno/a7xx: Add AQE-related registers from kgsl |
| - freedreno/a7xx: Add A7XX_HLSQ_DP_STR location from kgsl |
| - freedreno/crashdec: Initial a7xx support |
| - freedreno: Update HLSQ_*_CMD registers for a7xx |
| - docs/android: Fix example meson cross file again |
| - ir3: Put VS->TCS barrier after preamble |
| - ir3/legalize: Insert dummy bary.f after preamble |
| - freedreno,ir3: Add has_early_preamble |
| - tu: Workaround early preamble HW bug |
| - freedreno/a6xx: Workaround early preamble HW bug |
| - ir3: Add ir3_info::early_preamble |
| - tu: Implement early preamble |
| - freedreno/a6xx: Implement early preamble |
| - ir3: Enable early preamble |
| - tu: Use image aspects for feedback loops |
| - tu: Support VK_EXT_attachment_feedback_loop_dynamic_state |
| - tu: Use a7xx terminology for flushes |
| - freedreno, tu: Use CLEAN events on a7xx |
| - tu: Fix unaligned indirect command synchronization |
| - tu: Don't WFI after every dispatch |
| - freedreno/a7xx: Fix register file size |
| - ir3: Make sure constlen includes stc/ldc.k/ldg.k instructions |
| - freedreno: Disable early preamble on a6xx gen4 |
| - ir3, tu, freedreno: Move early_preamble to ir3_shader |
| - tu: Add early preamble statistic |
| - ir3: Introduce elect_any_ir3 |
| - ir3: Use elect_any_ir3 in preambles |
| - freedreno: Fix RBBM_NC_MODE_CNTL variants |
| - tu: Add support for aligned substreams |
| - ir3: Fix UBO size with indirect driver params |
| - tu: Make cs writeable for GMEM loads when FDM is enabled |
| - tu: Fix fdm_apply_load_coords patchpoint size |
| - tu: Support VK_EXT_fragment_density_map on a750 |
| - tu: Support bufferDeviceAddressCaptureReplay on kgsl |
| - freedreno: Fix decoding primitive counter events on a7xx |
| - tu: Add VPC hardware workaround for a750 |
| - ir3: Fix stg/ldg immediate offset on a7xx |
| - nir/instr_set: Return the matching instruction |
| - nir/instr_set: Don't remove matching instruction |
| - ir3: Split out bindless tex/samp encoding |
| - ir3: Don't consider r63.x as a GPR |
| - ir3: Plumb through descriptor prefetch intrinsics |
| - ir3: Make preamble rematerialization common code |
| - ir3: Expand preamble rematerialization |
| - ir3: Add descriptor prefetching optimization on a7xx |
| |
| Constantine Shablia (5): |
| |
| - pan/bi: fix 1D array tex coord lowering |
| - pan/bi: clean up tex coord lowering |
| - panfrost: report correct MAX_VARYINGS |
| - panvk: remove descriptor pool counters |
| - panvk: enable KHR and EXT BDA |
| |
| Corentin Noël (8): |
| |
| - zink: Always call deinit_multi_pool_overflow when destroying zink_descriptor_pool_multi |
| - ci: Allow to override the virglrenderer render server |
| - venus: sync protocol for VK_KHR_maintenance5 |
| - venus: enable VK_KHR_maintenance5 |
| - venus/ci: add more recently found flakes |
| - wsi: Make sure to return a valid wayland id string |
| - venus/ci: Update expectations |
| - ci: Make sure to install libraries in the right directory on debian |
| |
| Craig Stout (8): |
| |
| - util: detect_os: add DETECT_OS_FUCHSIA and DETECT_OS_POSIX_LITE |
| - util: u_thread: add Fuchsia support |
| - util: os_misc: add Fuchsia support |
| - util: u_dl: add Fuchsia support |
| - util: os_time: add Fuchsia support |
| - vulkan/util: add missing dependencies |
| - meson: remove unnecessary line continuation |
| - vulkan/runtime: add spirv_info_h to vulkan_lite_runtime_header_gen_deps |
| |
| Daniel Lundqvist (1): |
| |
| - radeonsi: Fix unused variable when LLVM is not used for AMD. |
| |
| Daniel Schürmann (69): |
| |
| - aco/ra: fix kill flags after renaming fixed Operands |
| - aco/ra: assert that the register file is empty after register allocation completed |
| - aco/lower_phis: simplify check for uniform predecessors |
| - aco: introduce aco_opcode::p_boolean_phi |
| - aco/vn: copy-propagate trivial phis |
| - aco/lower_phis: generalize init_state() so that it works with any scalar phis |
| - aco/lower_phis: implement SGPR phi lowering |
| - aco: use SGPR phi lowering for uniform phis in divergent merge blocks |
| - aco: use SGPR phi lowering for all loop header phis |
| - aco: use SGPR phi lowering for all scalar phis |
| - aco/optimizer: remove p_linear_phi handling from optimizer |
| - radv: mark nir_opt_loop() as not idempotent |
| - radv: move nir_opt_dead_cf() before nir_opt_loop() |
| - panfrost: skip gles-3.0-transform-feedback-uniform-buffer-object on Mali G52 and G57 |
| - nir/loop_analyze: adjust negative (or huge) iteration count check for bit size |
| - nir/opt_if: don't split ALU of phi into otherwise empty blocks |
| - nir/opt_loop: add loop peeling optimization |
| - aco/ra: fix handling of killed operands in compact_relocate_vars() |
| - aco/ra: Fix array access when finding register for subdword variables |
| - aco/ra: refactor get_reg_simple() with increased stride. |
| - aco/ra: move can_write_m0() check into get_reg_specified() |
| - aco/ra: re-use registers from killed operands |
| - aco/ra: change heuristic to first fit |
| - aco/ra: use round robin register allocation |
| - aco/assembler: fix MTBUF opcode encoding on GFX11 |
| - aco/assembler: slightly refactor MTBUF assembly for more readability |
| - aco/assembler: fix GFX67 MTBUF opcode encoding |
| - aco/scheduler: remove unused register_demand parameter |
| - aco: move live var information into struct Program |
| - aco/reindex_ssa: replace live_var parameter with boolean |
| - aco: make aco::monotonic_buffer_resource declaration visible for aco::IDSet |
| - aco: use aco::monotonic_allocator for IDSet |
| - spirv: make gl_HelperInvocation volatile if demote is being used |
| - radv: emit discard as demote by default |
| - nir: introduce discard_is_demote compiler option |
| - nir/opt_peephole_select: handle nir_terminate{_if} |
| - nir: remove nir_intrinsic_discard |
| - zink: pass zink_screen to nir_to_spirv(). |
| - nir/shader_info: remove uses_demote |
| - spirv: workaround for tests assuming that OpKill terminates invocations or loops |
| - aco/scheduler: fix register_demand validation debug code |
| - aco/spill: Unconditionally add 2 SGPRs to live-in demand |
| - aco: calculate register demand per instruction as maximum necessary to execute the instruction |
| - aco: track and use the live-in register demand per basic block |
| - aco: remove get_demand_before() |
| - aco/live_var_analysis: slightly refactor handling of additional register demand for Operand copies |
| - aco/live_var_analysis: ignore dead phis |
| - aco/spill: don't remove spilled phis |
| - aco/ra: use live_in_demand in should_compact_linear_vgprs() |
| - aco: add RegisterDemand member to Instruction |
| - aco/util: skip empty blocks in IDSet::insert(IDSet) |
| - aco/live_var_analysis: refactor using ctx struct |
| - aco/live_var_analysis: ignore phi definition and operand demand at predecessors |
| - aco/live_var_analysis: inline block->register_demand updates |
| - aco/live_var_analysis: remove unused includes |
| - aco/live_var_analysis: use separate allocator for temporary live sets |
| - aco/ra: remove special-casing of p_logical_end |
| - nir: implement loop invariant code motion (LICM) pass |
| - radv: use NIR loop invariant code motion pass |
| - nir/opt_sink: ignore loops without backedge |
| - aco: compute live-in variables in addition to live-out variables |
| - aco/ra: use live-in variables directly rather than computing them |
| - aco/spill: use live-in variables directly rather than computing them |
| - aco/cssa: use live-in variables instead of live-out variables |
| - aco/validate: use live-in variables for RA validation |
| - aco/print_ir: print live-in instead of live-out variables |
| - aco: remove live-out variables from IR |
| - aco/spill: Don't add phi definitions to live-in variables |
| - util/disk_cache: enable Mesa-DB disk cache by default |
| |
| Daniel Stone (27): |
| |
| - Revert "ci: disable g52" |
| - gbm: Support fixed-rate compression allocation |
| - venus/ci: Fix timeout |
| - venus/ci: Significantly reduce CTS fraction |
| - venus/ci: Temporarily disable jobs |
| - dri: Fix BGR format exclusion |
| - egl/surfaceless: Enable RGBA configs |
| - egl/gbm: Enable RGBA configs |
| - egl/dri2: Use createImageFromNames for DRM buffers |
| - dri: Remove old createImageWithModifiers |
| - dri: Remove createImageFromFds |
| - dri: Stop answering DRI_IMAGE_ATTRIB_FORMAT |
| - gallium/dri: Delete unused helper function |
| - gallium/dri: Drop mesa_format indirection for lookup |
| - loader/dri3: Use FourCC for create-image entrypoints |
| - egl/x11: Update to createImageFromNames |
| - dri: Delete createImageFromName |
| - dri: Unify createImage and createImageWithModifiers |
| - egl/x11: Remove __DRI_IMAGE_FORMAT remnants |
| - loader/dri3: Use FourCC for buffer allocations |
| - u_format: Rewrite format table to use YAML |
| - format: Generate endian-independent format aliases |
| - format: Generate sRGB<->linear conversions from table |
| - u_format: Reword introduction |
| - build: Check for PyYAML in Meson build |
| - dri: Allow INVALID for modifier-less drivers |
| - gbm/dri: Remove erroneous assert |
| |
| Danylo Piliaiev (38): |
| |
| - tu: Handle non-overlapping WaW hazard with buffer copy/fill/update |
| - tu/a7xx: Don't set FLUSH_PER_OVERLAP_AND_OVERWRITE for feedback loops |
| - tu/a750: Disable HW binning when there is GS |
| - freedreno/devices: Add support for Adreno A32 (G3x Gen 2) |
| - util/u_trace: Allow mixing of ArgStruct and Arg |
| - tu: Add more info to renderpass tracepoint |
| - vulkan/wsi: Make current_frame usable in all cases |
| - util/u_trace: Pass explicit frame_nr argument to delimit frames |
| - tu: Use current_frame from vk device to delimit u_trace frames |
| - anv: Use current_frame from vk device to delimit u_trace frames |
| - freedreno: Make fd_pps_driver.h usable without including other FD sources |
| - turnip/msm: Do rd dump only when there are commands in submission |
| - turnip: Implement VK_EXT_depth_clamp_zero_one |
| - freedreno/a7xx: Update TPL1_DBG_ECO_CNTL1 to fix UBWC corruption |
| - ir3/a7xx: Fix FS consts corruption when other FS has zero constlen |
| - tu: Add LRZ disable reason to renderpass tracepoint |
| - util/u_trace: Add support for fixed-length string params in tracepoints |
| - tu: Add attachments' UBWC info to renderpass tracepoint |
| - freedreno/rddecompiler: Make possible to use original shader |
| - freedreno/replay: Fix replaying without SET_IOVA |
| - freedreno/ir3: mova has special meaning for (r) flag |
| - ir3: Correctly assemble mova1 with (r) on const |
| - tu: Fix issues with render_pass tracepoint |
| - freedreno: Rename TPL1_DBG_ECO_CNTL1.UBWC_WORKAROUND into TP_UBWC_FLAG_HINT |
| - tu: Add enable_tp_ubwc_flag_hint feature to a7xx |
| - freedreno/devices: Turn off enable_tp_ubwc_flag_hint for a740 by default |
| - freedreno/devices: Fix magic regs for Adreno A32 |
| - freedreno: Describe LRZ feedback mechanism |
| - freedreno/devices: Define and appropriately set has_lrz_feedback |
| - tu: Use LRZ feedback in gmem |
| - tu: Enable LRZ feedback in sysmem |
| - freedreno: Use LRZ feedback in gmem |
| - ir3: Print bindless samp/tex ids for tex prefetch |
| - ir3/tests: Make possible to specify raw instr value as uint64 |
| - ir3/tests: Make possible to add generated disasm tests |
| - ir3: Fix decoding of stib.b/ldib.b with offset |
| - turnip/kgsl: Support external memory via ION/DMABUF buffers |
| - tu: Have single Flush/Invalidate memory entrypoints |
| |
| Dave Airlie (27): |
| |
| - radv/video/encode: fix quality params on v2 hw. |
| - Revert "zink: use a slab allocator for zink_kopper_present_info" |
| - nvk: Only enable WSI modifiers if the extension is supported. |
| - draw/texture: handle mip_offset[0] being != 0 for layered textures. |
| - nouveau/nvc0: increase overallocation on shader bo to 2K |
| - nvidia: fixup classes import and import new classes. |
| - nouveau/push: add support for m2mf/i2mf to dumper |
| - nouveau/nvc0: add support for using common pushbuf dumper |
| - radv/video: fix layered decode h264/5 tests. |
| - radv/video: use vcn ip versions for encoder detection. |
| - ac/radv/radeonsi: move av1 ctx/probs size/filling to common code. |
| - ac/radv/radeon: move film grain init to common code. |
| - st/mesa: drop u_simple_shaders.h include where not used. |
| - gallivm: create a pass manager wrapper. |
| - gallivm: move ppc denorm disable to inline |
| - gallivm: split some code out from init module. |
| - gallivm: make lp_bld_coro.h c++ include safe. |
| - gallivm: export target init code for orc-jit to reuse |
| - gallivm: split out generating LLVM Mattrs |
| - llvmpipe: Introduce llvmpipe_memory_allocation |
| - nvk: use 2k overallocation for shader heap. |
| - anv/video: use correct offset for MPR row store scratch buffer. |
| - radv/video: advertise mutable/extended for dst video images. |
| - draw/orcjit: supply stub function for tcs coro |
| - llvmpipe/cs/orcjit: add stub function name for coro |
| - gallivm/sample: fix sampling indirect from vertex shaders |
| - nvc0: fix null ptr deref on fermi due to debug changes. |
| |
| David (Ming Qiang) Wu (2): |
| |
| - radeonsi/vcn: set accurate size for dec header and index_codec |
| - radeonsi/vcn: support DPB_MAX_RES on VCN5 |
| |
| David Heidelberg (58): |
| |
| - turnip: rename tu_queue_submit struct to follow ODR |
| - ci: fail pipeline for users who got access to restricted traces |
| - ci/traces: majanes has no longer access to the restricted traces |
| - ci/deqp: correct EGL_EXT_config_select_group detection |
| - egl/x11: Move RGBA visuals in the second config selection group |
| - mailmap: add Freya Gentz entry |
| - etnaviv: migrate from piglit include to generic deqp and toml spec |
| - freedreno/ci: move platform to the deqp toml file for a530 |
| - freedreno/ci: move the disabled jobs from include to the main file |
| - freedreno/ci: Switch a306_* to deqp-runner |
| - freedreno/ci: do not depend on single job rules for another jobs |
| - freedreno/ci: switch a306 to weston |
| - freedreno/ci: re-enable a306_piglit |
| - ci/panfrost: disable G52 until machines gets fixed |
| - ci: drop unused piglit-test and integrate it into piglit-traces-test |
| - freedreno/ci: Drop duplicated include and add missing stages |
| - freedreno/ci: Implement nightly piglit job for Adreno 630 and 618 |
| - ci/freedreno: update expectations from the nightly run |
| - ci: bump ANGLE |
| - ci: Revert "ci: update failures list with angle for jsl, tgl" |
| - ci/intel: add new jsl flake |
| - ci/panfrost: Revert "ci/panfrost: disable G52 until machines gets fixed" |
| - ci/alpine: re-enable Mold linker |
| - ci/etnaviv: add flakes from nightly runs |
| - winsys/i915: depends on intel_wa.h |
| - subprojects: uprev perfetto to v45.0 |
| - ci/r300: update flake list from nightly reports |
| - ci/nouveau: move disabled jobs back from include into main gitlab-ci.yml |
| - ci/nouveau: separate HW definition from SW |
| - ci/nouveau: adjust and add DEVICE_TYPE |
| - ci/freedreno: a3xx will never have Vulkan support |
| - docs: correct svga3d redirected URLs |
| - ci/radv: dEQP-GLES3.functional.polygon_offset.fixed16_render_with_units passes now |
| - ci: re-enable shader-db for nouveau |
| - ci: do not build Nine in debian-build-testing |
| - ci/piglit: be explicit about what we building |
| - ci/lava: enable Piglit OpenCL tests so we can test rusticl on the HW |
| - ci/lava: do not build Vulkan for armhf images |
| - ci/lava: move wayland-protocols to the main section |
| - ci/freedreno: document new failure after piglit update |
| - ci/etnaviv: skip Vulkan tests on GC2000 |
| - ci/etnaviv: remove duplicated line from skips |
| - mailmap: update my email |
| - ci/arm64: rustify the build |
| - ci/lava: add support for RustiCL |
| - ci/meson: reuse meson installation |
| - ci: move (c)bindgen to own shell script |
| - ci/radv: Document recent flake |
| - ci/lava: the containers take sometimes more than 60m |
| - ci: propagate RUSTICL_ENABLE and DEBUG variables to the DUTs |
| - rusticl: add -cl-std only when it's not defined |
| - ci/freedreno: some A306 tests now pass/skip since proper GL detection in Piglit |
| - ci: introduce tool for comparing nightly runs |
| - util: bump blake3 from 1.3.3 to 1.5.1, improve armv7 and aarch64 performance |
| - build: pass licensing information in SPDX form |
| - intel/debug: allow silencing CL warnings |
| - llvmpipe: Silence "possibly uninitialized value" warning for ssbo_limit (cont) |
| - ci/alpine: use llvm variables |
| |
| David Rosca (39): |
| |
| - radv/video: Set correct bit depth and format for 10bit input |
| - radv/video: Check encode profiles and bit depth in capabilities query |
| - radv/video: Report maxBitrate in encode capabilities |
| - radeonsi/vcn: Allocate session buffer in VRAM |
| - radeonsi/vcn: Fix 10bit HEVC VPS general_profile_compatibility_flags |
| - radeonsi/vcn: Only enable VBAQ with rate control mode |
| - frontends/va: Fix AV1 slice_data_offset with multiple slice data buffers |
| - Revert "radeonsi/vcn: AV1 skip the redundant bs resize" |
| - frontends/va: Only increment slice offset after first slice parameters |
| - radeonsi: Update buffer for other planes in si_alloc_resource |
| - frontends/va: Store slice types for H264 decode |
| - radeonsi/vcn: Ensure DPB has as many buffers as references |
| - radeonsi/vcn: Allow duplicate buffers in DPB |
| - radeonsi/vcn: Ensure at least one reference for H264 P/B frames |
| - frontends/va: Fix leak when destroying VAEncCodedBufferType |
| - radeonsi/vcn: Avoid copy when resizing bitstream buffer |
| - frontends/va: Send all bitstream buffers to driver at once |
| - frontends/va: Fix crash in vaRenderPicture when decoder is NULL |
| - radv/video: Add missing VCN 3.0.2 to decoder init switch |
| - radeonsi: Make si_compute_clear_image work with 422 subsampled formats |
| - gallium/vl: Init shaders on first use |
| - frontends/va: Don't require exact match for packed headers |
| - gallium: Add is_video_target_buffer_supported |
| - radeonsi: Implement is_video_target_buffer_supported |
| - frontends/va: Use is_video_target_buffer_supported for EFC |
| - frontends/va: Rework EFC logic |
| - frontends/va: Check if target buffer is supported in vlVaEndPicture |
| - gallium: Remove PIPE_VIDEO_CAP_EFC_SUPPORTED |
| - frontends/va: Simplify AV1 slice parameters handling |
| - frontends/va: Move slice_data_offset to context |
| - frontends/va: Rename slice_idx to have_slice_params and move to context |
| - frontends/va: Support multi elements slice parameter buffers for H264/5 |
| - gallium: Remove pipe_h264_picture_desc.slice_parameter.slice_count |
| - radeonsi/vcn: Limit size to target size in AV1 decode |
| - radeonsi: Add debug option to enable low latency encode |
| - radeonsi/vcn: Add low latency encode support |
| - frontends/va: Support frame rate per temporal layer for AV1 |
| - radeonsi/vcn: Support 10bit RGB for EFC input |
| - radeonsi/vcn: Add decode DPB buffers as CS dependency |
| |
| Deborah Brouwer (1): |
| |
| - ci/lava: Detect a6xx gpu recovery failures |
| |
| Derek Foreman (13): |
| |
| - wsi/wayland: refactor wayland dispatch |
| - egl/wayland: Use loader_wayland_dispatch |
| - perfetto: Add flows |
| - wsi/wayland: Add perfetto flows to image acquisition and presentation |
| - wsi/wayland: Add flow id to presentation feedback |
| - wsi/wayland: Add timing debugging |
| - perfetto: Add simple support for counters |
| - wsi/wayland: Add latency information to perfetto profiling |
| - perfetto: Add some functions for timestamped events |
| - wsi/wayland: Add a perfetto track for image presentation |
| - wsi/wayland: Add tracepoint in wsi_wl_swapchain_wait_for_present |
| - wsi/wayland: Fix use after free from improperly stored VkAllocationCallbacks |
| - wsi/wayland: Use different queue names for different queries |
| |
| Dmitry Baryshkov (1): |
| |
| - freedreno/registers: drop display-related register files |
| |
| Dmitry Osipenko (1): |
| |
| - venus: make cross-device optional |
| |
| Doug Brown (1): |
| |
| - xa: add missing stride setup in renderer_draw_yuv |
| |
| Dr. David Alan Gilbert (1): |
| |
| - treewide: Cleanup unused structs |
| |
| Dylan Baker (12): |
| |
| - meson: use glslang --depfile argument when possible |
| - clc: remove check for null pointer that cannot be true in llvm_mod_to_spirv |
| - compiler/glcpp: don't recalculate macro |
| - intel/compiler: move predicated_break out of backend loop |
| - anv/grl: add some validation that we're not going to overflow |
| - egl/wayland: fix memory leak in error handling case |
| - compilers/clc: Add missing break statements. |
| - mesa: fix memory leak when using shader cache |
| - util/glsl2spirv: fixup the generated depfile when copying sources |
| - tgsi_to_nir: free disk cache value if the size is wrong |
| - crocus: properly free resources on BO allocation failure |
| - crocus: check for depth+stencil before creating resource |
| |
| Echo J (3): |
| |
| - nvk: Add sha1_h as a dependency |
| - d3d10umd: Use pipe_resource_usage enum in translate_resource_usage() |
| - util: Fix the integer addition in os_time_get_absolute_timeout() |
| |
| Eli Schwartz (2): |
| |
| - meson: create libglsl declared dependency to propagate order-only deps |
| - meson: add various generated header dependencies as order-only deps |
| |
| Emma Anholt (13): |
| |
| - nir,panfrost,agx: Fix driver PIXEL_COORD_INTEGER setting and drop workaround. |
| - dri: Fix a pasteo in dri2_from_names() |
| - dri: Consistently use createImageWithModifiers2() |
| - dri: Consistently use createImageFromFds2(), not createImageFromFds() |
| - dri: Replace createImageFromDmaBufs() with createImageFromDmaBufs3() |
| - dri: Drop old createImageFromRenderbuffer() |
| - dri: Consistently use createImageFromDmabufs() not createImageFromFds() |
| - dri: Drop createImageFromFds2() in favor of createImageFromDmaBufs() |
| - dri: Move EGL image lookup/validate setup to dri_init_screen() |
| - mesa: Drop some version checking around ValidateEGLImage |
| - dri: Collapse dri2_validate_egl_image() into dri_validate_egl_image() |
| - dri: Fold lookup_egl_image_validated into its one caller |
| - dri: Drop the old lookupEGLImage wrapper function. |
| |
| Eric Engestrom (295): |
| |
| - VERSION: bump to 24.2
| - docs: reset new_features.txt |
| - docs: add release notes for 24.0.6 |
| - docs: update calendar for 24.0.6 |
| - docs: add an extra 24.0.x release |
| - docs: add sha256sum for 24.0.6 |
| - docs: update calendar for 24.1.0-rc1 |
| - ci: fix container rules on release branches and tags |
| - panvk/ci: add WSI testing to all the deqp-vk jobs |
| - lavapipe/ci: add WSI testing to all the deqp-vk jobs |
| - freedreno/ci: add flake |
| - lavapipe/ci: add flakes |
| - ci: pass MESA_VK_ABORT_ON_DEVICE_LOSS through to the DUT |
| - rpi3/ci: drop duplicate comment without any corresponding actual skip line |
| - v3dv/ci: skip all the WSI tests, they are way too flaky to be worth it |
| - spirv: deduplicate default debug log level |
| - v3dv/ci: add rpi5 failure |
| - ci: mark microsoft farm as offline |
| - meson: simplify \`-gsplit-dwarf\` compiler argument check
| - egl+glx: fix two #ifdef that should be #if like the rest |
| - meson: always set USE_LIBGLVND |
| - meson: use bool.to_int() instead of manually converting |
| - lavapipe/ci: drop fixed test from failures |
| - lavapipe/ci: add the rest of the failures introduced by the 1.3.8.2 uprev |
| - lavapipe/ci: skip another test that goes over the timeout |
| - meson: move tsan-blacklist.txt to build-support with the other build support files |
| - llvmpipe/ci: fix indentation |
| - llvmpipe/ci: only run jobs when their corresponding files are changed |
| - lavapipe/ci: fix indentation |
| - lavapipe/ci: avoid running all lavapipe jobs when llvmpipe ci is changed |
| - lavapipe/ci: only run jobs when their corresponding files are changed |
| - docs: update calendar for 24.1.0-rc2 |
| - llvmpipe/ci: trigger jobs on draw & gallivm changes |
| - lavapipe/ci: trigger jobs on draw & gallivm changes |
| - lavapipe/ci: add flakes seen lately |
| - lavapipe/ci: generalize flakes list to all formats for these flaky tests |
| - lavapipe/ci: skip ray tracing tests that sometimes time out |
| - vc4/ci: add fails seen overnight |
| - ci: uprev mold to 2.31.0 |
| - lavapipe/ci: skip two more timing out ray query tests |
| - ci: backport fix for gl_PointSize bug in CTS |
| - lavapipe/ci: move a few skips out from under the "llvm jit" comment |
| - mr-label-maker: fix yaml syntax |
| - docs: add release notes for 24.0.7 |
| - docs: update calendar for 24.0.7 |
| - docs: add sha256sum for 24.0.7 |
| - docs: update calendar for 24.1.0-rc3 |
| - ci/debian-build-testing: drop extra nesting section |
| - ci/shader-db: drop extra nesting section |
| - rpi4/ci: use deqp-runner suite for vk job as well |
| - rpi5/ci: use deqp-runner suite for vk job |
| - microsoft/clc: fix incorrect changes that got through while the Windows CI was down |
| - llvmpipe: wrap the push/pull in the ifdef as well |
| - radv/ci: add navi21 flakes |
| - zink: avoid designated initializers as they are not supported in C++ < 20 |
| - Revert "ci: fail pipeline for users who got access to restricted traces" |
| - radeonsi/ci: document new crash (assert) |
| - util/format: add missing null check in util_format_is_srgb() |
| - ci: drop default VKD3D_PROTON_RESULTS file name |
| - ci: hardcode \`-vkd3d\` namespace for VKD3D_PROTON_RESULTS
| - amd/ci: track changes to VKD3D_PROTON_RESULTS files |
| - mr-label-maker: mark \*-vkd3d.txt files as CI results expectations files |
| - ci: reuse dead .vkd3d-proton-test to make vkd3d less radv-specific |
| - ci: fix section_end in debian-build-testing |
| - ci: rename debian version variable job to include the word "version" |
| - ci: factor out all the deps to build the debian containers into .debian-container |
| - ci: inherit the debian container building infra for test container images |
| - ci/b2c: rename B2C_TIMEOUT_FIRST_* to B2C_TIMEOUT_FIRST_CONSOLE_ACTIVITY_* |
| - ci/b2c: rename B2C_TIMEOUT_* to B2C_TIMEOUT_CONSOLE_ACTIVITY_* |
| - ci/b2c: allow setting timeouts in seconds |
| - ci: drop dead VK_CPU option |
| - ci/piglit-traces: drop re-definition of VK_DRIVER_FILES |
| - ci/init-stage2: set VK_DRIVER_FILES for both xorg and wayland |
| - ci/vkd3d: un-hardcode architecture |
| - ci/vkd3d: fix version sanity check |
| - ci/vkd3d: fail job when failing to get driver version |
| - ci/b2c: remove dead rules: that's always overwritten |
| - ci/env: move dead-code-with-comment to the end of the list to make it clearer |
| - zink/ci: rename .zink-lvp-venus-rules to .zink-venus-lvp-rules to match the rest of the names |
| - README: update links to our own docs |
| - docs: update calendar for 24.1.0-rc4 |
| - mailmap: add entry to unify Roman Stratiienko's contributions |
| - nvk/ci: add nvk job on a GA106 (RTX 3060) |
| - zink/ci: add zink+nvk glcts+piglit job on a GA106 (RTX 3060) |
| - zink+nvk/ci: skip glx piglit tests as they all fail |
| - zink+nvk/ci: skip timing out test |
| - zink+nvk/ci: skip more tests that times out |
| - zink+nvk/ci: document flakes seen during stress-testing |
| - zink+nvk/ci: update expected failures |
| - docs: add release notes for 24.0.8 |
| - docs: update calendar for 24.0.8 |
| - docs: add sha256sum for 24.0.8 |
| - docs: add release notes for 24.1.0 |
| - docs: add sha256sum for 24.1.0 |
| - docs: update calendar for 24.1.0 |
| - ci: fix build-kernel.sh -> download-prebuilt-kernel.sh |
| - ci: drop dead variables (see previous commit) |
| - ci: rename debian/arm*_test to debian/baremetal_arm*_test to be clear about which infra uses that |
| - ci: prepare base debian test image for multi-arch |
| - ci: prepare GL debian test image for multi-arch |
| - ci: prepare VK debian test image for multi-arch |
| - ci/image-tags: rename DEBIAN_X86_64_TEST_*_TAG to drop the x86 mention |
| - ci: add debian/arm64_test images for gl & vk |
| - zink/ci: rename zink-turnip collabora rule to make it unambiguous |
| - ci/b2c: add aarch64 tests for gl & vk |
| - turnip/ci: add vkcts jobs on the a750 |
| - turnip+zink/ci: add gl & gles CTS jobs on the a750 |
| - nvk/ci: adjust the regex for "dut is broken and needs to be rebooted" |
| - nvk/ci: mark the job as failing in case of hangs, instead of silently rebooting |
| - nvk/ci: add missing .test rules to avoid running nvk tests in post-merge pipeline |
| - radv/ci: move amdgpu-specific kernel message warning to src/amd/ci/ |
| - ci/b2c: make B2C_JOB_WARN_REGEX optional |
| - zink+nvk/ci: more KHR-GL46.packed_pixels.varied_rectangle.* flakes, so mark the group as flaky |
| - zink+nvk/ci: add more flakes seen in nightly |
| - zink+nvk/ci: spec\@ext_external_objects\@vk-vert-buf-reuse has been fixed |
| - mr-label-maker: label src/vulkan/wsi/ as wsi |
| - .mailmap: fix email address for \@cpmichael |
| - v3dv/ci: fix typo in \`renderer_check` |
| - ci: disable debian-build-testing until it can be fixed |
| - vc4/ci: skip VK piglit tests |
| - freedreno/a6xx: fix kernel -> compute handling |
| - zink+nvk/ci: add flakes seen in latest nightly run |
| - docs/calendar: add 24.2 branchpoint and release candidates schedule |
| - panfrost/ci: add missing genxml trigger path |
| - panfrost: mark tests as fixed |
| - etnaviv/ci: skip VK piglit tests |
| - radv/ci: document angle regressions from !29436 on stoney |
| - zink+nvk/ci: add flakes seen in latest nightly run |
| - docs/meson: replace deprecated pkgconfig with pkg-config |
| - zink+nvk/ci: add flakes seen in latest nightly run |
| - v3dv: add missing bounds check in VK_EXT_4444_formats |
| - docs: add release notes for 24.1.1 |
| - docs: add sha256sum for 24.1.1 |
| - docs: update calendar for 24.1.1 |
| - turnip/ci: add a750 flakes seen in the latest nightly |
| - radv/ci: fix manual rules |
| - radv/ci: move radv manual rules into their own group |
| - nvk+zink/ci: add another flake seen in nightly |
| - docs: add release notes for 24.0.9 |
| - docs: update calendar for 24.0.9 |
| - docs: add sha256sum for 24.0.9 |
| - venus/ci: add flake that's been blocking MRs |
| - v3d/drm-shim: emulate a rpi4 instead of a rpi3 |
| - nvk+zink/ci: add another flake seen in nightly |
| - radv/ci: document navi31 regression from !29235 |
| - ci: set a common B2C_JOB_SUCCESS_REGEX with the message that's printed for all jobs |
| - ci/deqp: uprev gl & gles cts |
| - radeonsi/ci: mark a bunch of tests as fixed on vangogh |
| - radv/ci: drop duplicate navi21-aco flakes line |
| - radv/ci: drop duplicate navi31-aco flakes line |
| - turnip+zink/ci: mark a dEQP-GLES(2|3).functional.rasterization.(fbo|primitives).line_(strip_|)wide as fixed |
| - turnip/ci: add a750 flakes seen in the latest nightly |
| - panfrost/ci: remove duplicate path |
| - nvk+zink/ci: mark KHR-GL46.sparse_texture2_tests.SparseTexture2* as fixed |
| - nvk+zink/ci: add flakes seen in nightly pipeline |
| - nvk+zink/ci: consider all the \`double` tests in spec\@glsl-4.00\@execution\@built-in-functions to be flaky |
| - freedreno/ci: disable mid-testing reboot on a750 |
| - driconf: drop param for setting default gpu vendor id in DRI_CONF_FORCE_VK_VENDOR() |
| - egl: fix teardown when using xcb |
| - egl: move android-specific code into an android branch |
| - egl: ensure future platforms get their teardown implemented |
| - egl/device: drop unnecessary intermediate variable |
| - ci: fix meson install script |
| - lavapipe/ci: update trace checksum following nir change |
| - lavapipe/ci: document regression while it's being worked on |
| - turnip+zink/ci: mark dEQP-GLES3.functional.fbo.depth.depth_test_clamp.* tests as fixed |
| - bin/ci: escape literal url in regex |
| - glx: fix build -D glx-direct=false |
| - nvk+zink/ci: mark spec\@ext_image_dma_buf_import\@ext_image_dma_buf_import-refcount-multithread as fixed |
| - nvk+zink/ci: add flakes seen over the last few nightlies |
| - asahi/lib: generate git_sha1.h for agx_device.c |
| - ci/vkd3d: deduplicate the diff between the expectation and the results |
| - ci/vkd3d: print a message when the expected failures file is missing |
| - ci/vkd3d: drop override of job artifacts |
| - ci/vkd3d: fix error message printing |
| - ci/vkd3d: stop ignoring errors in a block where errors can't happen |
| - ci/vkd3d: don't ignore errors |
| - ci/vkd3d: group version check lines together |
| - ci/vkd3d: limit the vulkaninfo capture to the driverInfo line |
| - ci/vkd3d: print a real error message when failing to get the list of failing tests |
| - ci/vkd3d: rename vkd3d test log file to end in .txt |
| - ci/vkd3d: print URL to the vkd3d-proton.log file to make it easier to access |
| - ci/vkd3d: put \`then` on the same line as the \`if` to match the rest of the code style |
| - ci/vkd3d: drop the "clear results folder without deleting the folder" logic |
| - ci/vkd3d: drop \`quiet` wrapper |
| - ci/vkd3d: drop redundant "vkd3d-proton execution: SUCCESS" |
| - docs: add release notes for 24.1.2 |
| - docs: add sha256sum for 24.1.2 |
| - docs: update calendar for 24.1.2 |
| - venus/ci: fix indentation of list nested in a dict item |
| - venus/ci: add manual/nightly venus-lavapipe-full |
| - venus/ci: skip timed out test |
| - nvk+zink/ci: add flakes seen over the last two nightly runs |
| - nvk+zink/ci: catch more \`double` flakes |
| - venus+zink/ci: drop fraction and add missing timeout on zink-venus-lvp |
| - loader: use os_get_option() to allow android to set LIBGL_DRIVERS_PATH, GBM_BACKENDS_PATH, GALLIUM_PIPE_SEARCH_DIR |
| - gallium/hud: use os_get_option() to allow android to set GALLIUM_HUD and related vars |
| - egl: use os_get_option() to allow android to set EGL_LOG_LEVEL |
| - venus/ci: make sure nightly job doesn't get retried |
| - venus/ci: drop fixed test from fails list |
| - docs/ci: fix indentation of list nested in a dict item |
| - docs/ci: merge test-docs and test-docs-mr |
| - docs/ci: auto-run test-docs in fork pipelines |
| - docs/ci: drop .no_scheduled_pipelines-rules from test-docs |
| - ci: reorder alpine/x86_64_build rules to fix the nightly pipelines |
| - drm-shim: stub syncobj_timeline_signal ioctl |
| - llvmpipe/ci: add comment for later on weird-looking code |
| - llvmpipe/ci: fix indentation of list nested in a dict item |
| - llvmpipe/ci: set rusticl variables in deqp-runner instead of passing them down from the job |
| - ci: include rusticl in the arm64 build |
| - llvmpipe,rusticl/ci: move rusticl files rule out of llvmpipe |
| - v3d/ci: add nightly job for rusticl testing |
| - panfrost/ci: drop duplicate job rules |
| - panfrost/ci: split gl & vk jobs rules |
| - radeonsi/ci: mark test as fixed |
| - lavapipe/ci: skip timing out test |
| - broadcom/ci: disable auto-retry on manual jobs |
| - docs/features: mark VK_KHR_maintenance7 as implemented on anv and lvp |
| - docs: add release notes for 24.1.3 |
| - docs: update calendar for 24.1.3 |
| - docs: add sha256sum for 24.1.3 |
| - ci_run_n_monitor: add support for new \`canceling` job status |
| - ci_run_n_monitor: be coherent about using sets for \`element in group` checks |
| - ci_run_n_monitor: use COMPLETED_STATUSES in more places |
| - ci_run_n_monitor: add RUNNING_STATUSES and use it where appropriate |
| - bin/ci: allow bugfixes in requirements.txt |
| - ci: split .no-auto-retry out of .scheduled_pipeline-rules |
| - ci: simplify setting .no-auto-retry now that it isn't bundled with unrelated rules: |
| - v3d/ci: include results of GL full run in expectations |
| - v3d/ci: include results of CL run in expectations |
| - zink+nvk/ci: ascii-sort fails |
| - zink+nvk/ci: document regression from !30033 |
| - turnip+zink/ci: add two more CS related flakes |
| - lvp+zink/ci: document a flake seen in a merge pipeline |
| - v3d/ci: add disabled job for GL testing on the RPi5 |
| - v3d/ci: rename "rusticl on v3d" suite to \`v3d-rusticl` |
| - v3d/ci: add disabled job for CL testing on the RPi5 |
| - features.txt: specify that VK_EXT_depth_clamp_zero_one is only supported on v3dv/vc7+ |
| - features.txt: specify that VK_EXT_depth_clip_enable is only supported on v3dv/vc7+ |
| - features.txt: specify that GL_ARB_depth_clamp is only supported on v3d/vc7+ |
| - docs: add release notes for 24.1.4 |
| - docs: update calendar for 24.1.4 |
| - docs: add sha256sum for 24.1.4 |
| - ci: replace gallium-drivers=swrast with gallium-drivers=llvmpipe,softpipe |
| - bin/ci_run_n_monitor: explain that the 'Universal Recycling symbol' ♲ emoji means these jobs were cancelled |
| - bin/ci_run_n_monitor: add text labels next to the emojis |
| - bin/ci_run_n_monitor: replace ♲ with 🗙 to represent cancelled jobs |
| - meson: fix filename printed when generating devenv files |
| - meson/megadriver: fix install message to match the rest of meson |
| - meson/megadriver: stop removing the "master" .so file |
| - meson/megadriver: replace hardlinks with symlinks |
| - ci/vkd3d: fix LD_LIBRARY_PATH |
| - v3d/ci: mark spec\@amd_performance_monitor\@vc4 tests as fixed |
| - llvmpipe/ci: mark spec\@!opengl 1.1\@gl_select tests as fixed |
| - Revert "bin/ci_run_n_monitor: explain that the 'Universal Recycling symbol' ♲ emoji means these jobs were cancelled" |
| - VERSION: bump for 24.2.0-rc1 |
| - .pick_status.json: Update to 0cc23b652401600e57c278d8f6fe6756b13b9f6a |
| - radeonsi/ci: skip timing out test |
| - freedreno/ci: double job timeout for a306 |
| - freedreno/ci: document extra variants of failing tests on a618 and a630 |
| - anv+zink/ci: mark some tests as fixed |
| - anv+zink/ci: document two tests, one failing and one crashing |
| - anv+zink/ci: mark a couple of tests as flaky |
| - venus/ci: skip timing out test |
| - loader: gc loader_get_extensions_name() and __DRI_DRIVER_{GET\_,}EXTENSIONS defines |
| - .pick_status.json: Update to 3b6867f53a6718de80bbff4acb84ffd5aca8a8c8 |
| - nak: fix meson typo |
| - venus: initialize bitset in CreateDescriptorPool() |
| - v3d/ci: mark spec\@amd_performance_monitor\@vc4 tests as flaky |
| - meson: xcb & xcb-randr are needed by the loader whenever x11 is built |
| - .pick_status.json: Update to c30e5d44b1027ed03a8fd542829df0055d3e1a96 |
| - .pick_status.json: Update to 6cd4372460b197fea98d257217328ddc3406e6ad |
| - docs: add stub header for u_format_gen.h |
| - .pick_status.json: Update to c33d2db06ac0ea4d3d5372caa93bee3bbbe028c7 |
| - VERSION: bump for 24.2.0-rc2 |
| - .pick_status.json: Update to ad90bf0500e07b1bc35f87a406f284c0a7fa7049 |
| - ci/baremetal: fix logic for retrying boot when it failed |
| - meson: don't select the deprecated \`swrast` option ourselves |
| - meson: improve wording of "incompatible llvm options" error |
| - ci: remove llvmpipe in the job that disables llvm |
| - .pick_status.json: Update to aa9745427b917bb0613b753ccd59c6c1e6f07584 |
| - VERSION: bump for 24.2.0-rc3 |
| - .pick_status.json: Update to 366e7e2ddc7d3b340bbf040eca1d3223219e6122 |
| - meson,ci: remove dead \`kmsro` option in \`gallium-drivers` |
| - .pick_status.json: Mark 93f9afa1e039cbf681adcc6d170aec987d9f0f65 as denominated |
| - .pick_status.json: Mark f427c9fe233e862bfa30d0c7441ce77592ce4654 as denominated |
| - .pick_status.json: Update to d58f7a24d1be7b8b50ebdc0c1c3ce26bd65317a5 |
| - .pick_status.json: Update to d9849ac46623797a9f56fb9d46dc52460ac477de |
| - .pick_status.json: Update to ef88af846761ca9e642f7ed46011db7d3d6b61fd |
| - VERSION: bump for 24.2.0-rc4 |
| - .pick_status.json: Update to c90e2bccf756004e48f9e7e71e555db0d03c1b98 |
| - ci: pass MESA_SPIRV_LOG_LEVEL from job to the test |
| - android: fix build in multiple ways |
| - .pick_status.json: Update to 214b6c30406f844560bdf35a54ff8a51ee248709 |
| - .pick_status.json: Update to cc2dbb8ea5329b509d79eedb6c0cbb9a1903b5ad |
| |
| Eric R. Smith (8): |
| |
| - panfrost: add a barrier when launching xfb jobs in CSF |
| - get_color_read_type: make sure format/type combo is legal for gles |
| - glsl: test both inputs when sorting varyings for xfb |
| - glsl: make the xfb varying sort stable |
| - panfrost: fix some omissions in valhall flow control |
| - panfrost: change default rounding mode for samplers |
| - panfrost: fix texture.border_clamp regression for valhall |
| - panfrost: use RGB1 component ordering for R5G6B5 pixel formats |
| |
| Erico Nunes (6): |
| |
| - ci: lima farm maintenance |
| - lima/ci: update piglit ci expectations |
| - Revert "ci: lima farm maintenance" |
| - lima: fix surface reload flags assignment |
| - mesa/st: don't set lower_fdot in draw_nir_options |
| - dri: fix sun4i-drm driver name |
| |
| Erik Faye-Lund (106): |
| |
| - panfrost: add PAN_MAX_TEXEL_BUFFER_ELEMENTS define |
| - panfrost: clamp buffer-size to max-size |
| - panfrost: remove nonsensical assert |
| - panfrost: do not deref potentially null pointer |
| - panfrost: check return-value from u_trim_pipe_prim |
| - panfrost: assert that drmSyncobjWait returns 0 |
| - panfrost: check return-code of drmSyncobjWait |
| - panfrost: correct first-tracking for signature |
| - panvk: drop needless null-check |
| - panvk: do not leak bindings |
| - panvk: drop needless null-checks |
| - panvk: avoid dereferencing a null-pointer |
| - docs/panfrost: compact gpu-table |
| - docs/panfrost: move details to separate articles |
| - docs/panfrost: link to conformant products |
| - panfrost: simplify panfrost_texture_num_elements |
| - panfrost: explicitly loop over surfaces |
| - panfrost: untangle faces from layers |
| - util/format: correct a typo |
| - mesa/main: rewrite mipmap generation code |
| - mesa/main: remove unused function |
| - mesa/main: rework GL_IMAGE_PIXEL_TYPE query |
| - mesa/main: clean up _mesa_uncompressed_format_to_type_and_comps |
| - mesa/main: clean up switch statement |
| - mesa/main: do not return _REV format for uncompressed format |
| - mesa/main: prefer non-suffixed enums |
| - mesa/main: fixup indent |
| - mesa/main: updates for EXT_texture_format_BGRA8888 |
| - docs: wrap long words instead of overflowing |
| - meson: bump test-timeout |
| - mesa/main: remove unused function |
| - panfrost: lower maxVertexInputStride to match vulkan runtime |
| - mesa/main: remove stale prototype |
| - mesa/main: remove duplicate error-checks |
| - mesa/main: require EXT_texture_integer for GL 3.0 |
| - mesa/main: do not allow RGBA_INTEGER et al in gles3 |
| - mesa/main: factor out format/type enum checking |
| - mesa/main: use extension-helper |
| - mesa/main: tighten rg/half-float interaction |
| - mesa/main: use _mesa_is_gles1()-helper |
| - mesa/main: remove needless check |
| - mesa/main: simplify conditions |
| - mesa/main: merge identical checks |
| - panvk: move macro-definition to header |
| - mailmap: invert tomeu's mapping |
| - mailmap: merge Robert and Bob Beckett into one |
| - mailmap: invert my mailmapping |
| - mailmap: map collabora.co.uk to collabora.com |
| - mailmap: move konstantin to the right sorted position |
| - mailmap: use consistent spelling for constantine |
| - mailmap: update rohan's primary email address |
| - nir: fix utf-8 encoding-issue |
| - Revert "docs: use html_static_path for static files" |
| - docs: edgeflag -> edge flag |
| - docs: zink -> Zink |
| - docs: Anv -> ANV |
| - docs: tgsi -> TGSI |
| - docs: hw -> HW |
| - docs: mooth -> smooth |
| - docs: unify spelling of front/back-facing |
| - docs: eg. -> e.g. |
| - docs: url -> URL |
| - docs: nabled -> enabled |
| - docs: sommelier -> Sommelier |
| - docs: remove apostrophe from uppercased |
| - docs: utrace -> trace |
| - docs: google -> Google |
| - docs: Nvidia -> NVIDIA |
| - docs: ssbo/ubo -> SSBO/UBO |
| - docs: cpu -> CPU |
| - docs: gpu -> GPU |
| - docs: renderpass -> render pass |
| - docs: spell out "stencil reference" |
| - docs: submision -> submission |
| - docs: Steamos -> SteamOS |
| - docs: colour -> color |
| - docs: occured -> occurred |
| - docs: precidence -> precedence |
| - docs: undifined behaviour -> undefined behavior |
| - docs: debian -> Debian |
| - docs: zink -> Zink |
| - docs: vulkan -> Vulkan |
| - docs: attachements -> attachments |
| - docs: acress -> across |
| - docs: pluggins -> plug-ins |
| - docs: pusbuf -> pushbuf |
| - docs: metadatas -> metadata |
| - docs: use os.pardir |
| - docs: allow out-of-tree docs build |
| - meson: build html-docs |
| - docs: automatically generate depfile |
| - meson: error when missing hawkmoth |
| - meson: allow specifying html-docs-path |
| - ci: build docs using meson |
| - panvk: support x11 wsi |
| - vulkan/runtime: tne -> the |
| - vulkan/runtime: initizlie -> initialize |
| - vulkan/runtime: abreviation -> abbreviation |
| - vulkan/runtime: multiesample -> multisample |
| - vulkan/runtime: implementaiton -> implementation |
| - docs: fix bootstrap-extension |
| - docs/panfrost: fix numbered list |
| - docs/panfrost: fix math-notation |
| - docs/panfrost: use math-role more |
| - docs/panfrost: use c:func-role for function |
| - docs/panfrost: quote identifiers |
| |
| Esdras Tarsis (1): |
| |
| - nvk: Enable 8bit and 16bit access in VK_KHR_workgroup_memory_explicit_layout. |
| |
| Faith Ekstrand (297): |
| |
| - nak: Don't saturate depth writes |
| - nvk: Only clip Z with the guardband |
| - nouveau/class_parser.py: Fix the docs for --out-rs |
| - nvk: Advertise VK_EXT_pipeline_robustness |
| - nouveau/headers: Clean up the meson a bit |
| - spirv: Auto-generate spirv_info.h |
| - spirv: Update the JSON and headers |
| - spirv: Better handle duplicated enums in the JSON parser |
| - spirv: Generate a spirv_capabilities struct |
| - spirv: Record capabilities rather than ad-hoc bools |
| - mesa: Stop pretending to support SPV_AMD_gcn_shader in OpenGL |
| - spirv: Move the old AMD extensions out of capabilities |
| - spirv: Move the printf enable out of capabilities |
| - spirv: Add supported_capabilities to vtn_builder |
| - spirv: Use supported_capabilities for various checks |
| - spirv: Drop the SubgroupUniformControlFlow check |
| - spirv: Add a table of all implemented capabilities |
| - spirv: Check capabilities using the supported_capabilities table |
| - spirv: Add support for specifying caps through the new struct |
| - spirv: Use spirv_capabilities in tests |
| - mesa: Flip the script on SPIR-V extension enabling |
| - mesa: Use the new spirv_capabilities struct |
| - clover: Use the new spirv_capabilities struct |
| - rusticl: Use the new spirv_capabilities struct |
| - vulkan: Set SPIR-V caps from supported features |
| - radv: Use vk_physical_device_get_spirv_capabilities() |
| - intel/kernel: Use the new capabilities struct |
| - asahi/clc: Use the new spirv_capabilities struct |
| - zink: Use the new spirv_capabilities struct |
| - anv: Use spirv_capabilities for the float64 shader |
| - ir3: Use spirv_capabilities in ir3_cmdline |
| - microsoft: Use spirv_capabilities for spirv_to_dxil |
| - spirv: Get rid of the old caps struct |
| - nvk: Re-emit sample locations when rasterization samples changes |
| - nvk/meta: Restore set_sizes[0] |
| - nvk: Get rid of sets_dirty |
| - nvk: Don't rely on push_dirty for which push sets exist |
| - nouveau/headers: Add a bool for whether or not to dump offsets |
| - nvk/upload_queue: Only upload one line of data |
| - nvk/upload_queue: Add some useful asserts |
| - nvk/upload_queue: Add a _fill method |
| - nvk: Use the upload queue for NVK_DEBUG=zero_memory |
| - nvk: Improve the GetMemoryFdKHR error |
| - nouveau/winsys: Take a reference to BOs found in the cache |
| - nouveau/winsys: Make BO_LOCAL and BO_GART separate flags |
| - nvk: Allow GART for dma-bufs |
| - nil: Use the right PTE kind for Z32 pre-Turing |
| - nvk: Set color/Z compression based on nil_image::compressed |
| - nil: Default to NV_MMU_PTE_KIND_GENERIC_MEMORY on Turing+ |
| - nvk: Allow VK_IMAGE_ASPECT_MEMORY_PLANE_0_BIT |
| - drm-uapi: Sync nouveau_drm.h |
| - nouveau/winsys: Add back nouveau_ws_bo_new_tiled() |
| - nvk: Support image creation with modifiers |
| - nvk: Set tile mode and PTE kind on dedicated dma-buf BOs |
| - nvk: Implement DRM format modifier queries |
| - nvk: Advertise VK_EXT_queue_family_foreign |
| - nvk: Advertise VK_EXT_image_drm_format_modifier |
| - vulkan/wsi: Bind memory planes, not YCbCr planes. |
| - nvk/wsi: Advertise modifier support |
| - zink: Set workarounds.can_do_invalid_linear_modifier for NVK |
| - nvk: Fix misc. whitespace and style issues |
| - nvk: Go wide for query copies |
| - nvk: Store descriptor set addresses in descriptor state |
| - nvk: Add static asserts for nvk_buffer_address layout |
| - nvk: Store an nvk_buffer_address for each set in the root table. |
| - nvk: Advertise 32 descriptor sets |
| - nvk: Move and better document set_dynamic_buffer_start |
| - nvk: Add an NVK_MAX_SAMPLES #define |
| - nvk: Refactor nvk_meta_begin() to use a desc helper |
| - nvk/meta: Save and restore set_dynamic_buffer_start |
| - nak: Emit !PT for carries on IADD3 |
| - nak: Add with -0 for fabs() |
| - nak: Don't emit a plop3 for immediate shift sources |
| - nak: Encode LDC directly |
| - vulkan: Update XML and headers to 1.3.286 |
| - spirv: Update the JSON and headers |
| - nir: Handle cmat types in lower_variable_initializers |
| - spirv: Handle constant cooperative matrices in OpCompositeExtract |
| - spirv: Assert that non-vector composites have the right length |
| - spirv: Implement SPV_EXT_replicated_composites |
| - nvk: Advertise VK_EXT_shader_replicated_composites |
| - anv: Advertise VK_EXT_shader_replicated_composites |
| - hasvk: Advertise VK_EXT_shader_replicated_composites |
| - radv: Advertise VK_EXT_shader_replicated_composites |
| - turnip: Advertise VK_EXT_shader_replicated_composites |
| - lavapipe: Advertise VK_EXT_shader_replicated_composites |
| - dozen: Advertise VK_EXT_shader_replicated_composites |
| - nir/print: Improve divergence information |
| - nak: Fix NAK_DEBUG=serial for warp barriers |
| - nak: Only convert the written portion of the buffer in NirInstrPrinter |
| - nak: Fix BasicBlock::phi*() for OpAnnotate |
| - nak: BMov is always variable-latency |
| - nak: Only copy-prop neg into iadd2/3 if no carry is written |
| - nak: Get rid of OpINeg |
| - nak: Expose a BasicBlock::map_instrs() helper |
| - nak: Add some helpers for uniform instructions and registers |
| - nak: Add OpR2UR |
| - nak: Clean up bindless cbuf handles |
| - nak/ra: Move an assert |
| - nak: Make SSARef::file() return Option<RegFile> |
| - nak: Drop BasicBlock::new() |
| - nak: Add a concept of uniform blocks |
| - nak/to_cssa: Resolve phi register file mismatches |
| - nak/ra: Spill UGPRs and UPreds |
| - nak/ra: Never move uniform regs in non-uniform blocks |
| - nak: Support uniform regs in lower_copy_swap() |
| - nak/sm70: Defer ALU src processing until encode_alu() |
| - nak/sm70: Rework ALU source encode helpers |
| - nak/sm70: Add support for encoding uniform ALU ops |
| - nak/sm70: Fix encoding of fadd/fsetp and friends with UGPRs |
| - nak/sm70: Implement a bunch of uniform ops on SM75+ |
| - nak/legalize: Fold immediate sources before instructions |
| - nak/legalize: Drop some pointless plop3 logic |
| - nak/legalize: Be more precise about shfl and out |
| - nak/legalize: Fix imad and ffma legalization on SM50 |
| - nak/legalize: Patch a RegFile through to copy helpers |
| - nak/legalize: Handle uniform sources in warp instructions |
| - nak/legalize: Ensure all SSA values for a given ref are in the same file |
| - nak/legalize: Copy uniform vectors in non-uniform control-flow |
| - nak/legalize: Uniform instructions can't have cbuf sources |
| - nak/legalize: Explicitly ignore OpPhiSrcs and OpPhiDsts |
| - nak/calc_instr_deps: Rename a couple variables |
| - nak/calc_instr_deps: Rewrite calc_delays() again |
| - nak/calc_instr_deps: Add latencies for uniform instructions |
| - nak: Add a opt_uniform_instrs() pass |
| - nak/copy_prop: Rewrap a couple comments |
| - nak/copy_prop: Don't propagate UBOs into uniform instructions |
| - nak/lower_cf: Parent scopes are never NULL |
| - nak/lower_cf: Track block divergence |
| - nak: Convert to LCSSA before divergence analysis |
| - nak/lower_cf: Flag phis as convergent when possible |
| - nak/from_nir: Clean up phi annotations |
| - nak: Add a UniformBuilder |
| - nak/from_nir: Emit uniform instructions when !divergent |
| - nak/sm70: Properly encode bindless cbufs |
| - nak/dce: Account for bindless CBuf handles |
| - nak/calc_instr_deps: Account for bindless CBufs |
| - nak/bitset: Add an iterator |
| - nak/ra: Handle bindless CBufs |
| - nak/ra: Pull searching for unused/unpinned regs into a helper |
| - nak/ra: Rename PinnedRegAllocator to VecRegAllocator |
| - nak/ra: Add a concept of pinned registers to RegAllocator |
| - nak: Add OpPin and OpUnpin |
| - nak/legalize: Allow pinned uniform vectors in non-uniform blocks |
| - nak/legalize: Bindless cbufs must be pinned in non-uniform blocks |
| - nak/copy_prop: Don't propagate bindless cbufs into non-uniform blocks |
| - nir: Add some new _nv intrinsics |
| - nvk,nak: Switch to nir_intrinsic_ldc_nv |
| - nak: Implement r2ur_nv |
| - nak: Implement [un]pin_cx_handle_nv |
| - nir: Add nir_foreach_block_in_cf_node_safe() iterators |
| - nak: Lower non-uniform ldcx_nv to global loads |
| - nak: Implement nir_intrinsic_ldcx_nv |
| - nvk: Split SSBO and UBO address formats |
| - nvk: Split write_[dynamic\_]buffer_desc into UBO and SSBO variants |
| - nvk: Align buffer descriptors |
| - nvk: Rename nvk_cmd_buffr_get_cbuf_descriptor() |
| - nvk: Make nvk_min_cbuf_alignment() inline |
| - nvk/lower_descriptors: Add a descriptor_type_is_ubo/ssbo() helper |
| - nvk: Move the zero offset optimization to load_descriptor_for_idx_intrin() |
| - nvk: Allow the cbuf optimization for VK_DESCRIPTOR_TYPE_MUTABLE_EXT |
| - nvk/descriptor_set_layout: Record which dynamic buffers are UBOs |
| - nvk: Use bindless cbufs on Turing+ |
| - nvk: Be much more conservative about rebinding cbufs |
| - nvk: Use cbuf loads for variable pointers dynamic SSBO descriptors |
| - nvk: s/draw_idx/draw_index/g |
| - nvk: Pass the base workgroup and global size to flush_compute_state() |
| - nvk: Use helper macros for accessing root descriptors |
| - nvk: Pass the queue to draw/dispatch_state_init() |
| - nvk: Use inline constant buffer updates for CB0 |
| - nvk: Only write draw parameters to cb0 when they change |
| - nvk: Refactor build_cbuf_map() |
| - nak,nir: Drop r2ur_nv in favor of as_uniform |
| - nouveau: Fix a race in nouveau_ws_bo_destroy() |
| - nvk: Use NVK_VK_GRAPHICS_STAGE_BITS in dirty_cbufs_for_descriprots() |
| - nvk: Dirty cbufs in CmdPushDescriptorSetWithTemplate2KHR |
| - intel/blorp: Set nir_shader::options up-front before building |
| - util/format_pack: Fix packing of signed 1010102 SSCALED formats |
| - util/format_pack: Also use iround for SCALED formats |
| - util/format_pack: Clamp SNORM values to [-1, 1] when unpacking |
| - util/format: Round to nearest even when converting to R11G11B10F |
| - util/format: Handle denorms when converting to R11G11B10F |
| - nir/format_convert: Smash NaN to 0 in pack_r9g9b9e5() |
| - nir/format_convert: Use fmin/fmax to clamp R9G9B9E5 data |
| - nir: Add a nir_intrinsic_use for unit tests |
| - nir: Move most of nir_format_convert to a C file |
| - nir: Support 0 and 32 bits in some format conversion helpers |
| - util: Make format_srgb.h C++ safe |
| - nir: Add a format pack helper and tests |
| - nir: Add a format unpack helper and tests |
| - nir/format_convert: Assert that UNORM formats are <= 16 bits |
| - ci: Update trace SHAs |
| - vulkan/meta: Use demote instead of discard |
| - nvk: Fix whitespace issues around conservative rasterization |
| - nvk: Re-order conservative rasterization checks |
| - nvk: Don't emit conservative rasterization before Maxwell B |
| - nvk: Silently fail to enumerate if not on nouveau |
| - util/cnd_monotonic: Move the guts to a c file |
| - util/cnd_monotonic: Use a void * on Windows |
| - vulkan/wsi/wayland: Use mtx_t and u_cnd_monotonic |
| - vulkan/wsi/x11: Use c11/threads for thread spawning |
| - vulkan/wsi/x11: Use mtx_t and u_cnd_monotonic |
| - vulkan/wsi/display: Use mtx_t and u_cnd_monotonic |
| - vulkan/wsi/queue: Use mtx_t and u_cnd_monotonic |
| - vulkan/wsi: Delete wsi_init_pthread_cond_monotonic |
| - vulkan: Use u_cnd_monotonic for vk_sync_timeline |
| - nvk: Why are nvk_image.c/h writeable? |
| - nvk: Bump the sparse alignment requirement on buffers to 64K |
| - nvk: Align sparse-bound images to the sparse binding size |
| - nvk: Fetch debug flags from the physical device |
| - nvk: Initialize the debug flags in nvk_instance |
| - nvk: Add the start of a KMD abstraction |
| - nvk/nvkmd: Implement dev and pdev for nouveau |
| - nvk: Use the NVKMD interface for device enumeration |
| - nvk/nvkmd: Add memory and virtual address interfaces |
| - nvk/nvkmd: Implement the mem and va interfaces for nouveau |
| - nvk: Add static wrappers for image/buffer binding |
| - nvk: Use nvkmd_mem for nvk_device_memory |
| - nvk: Use nvkmd_mem for nvk_image::linear_tiled_shadow_mem |
| - nvk: Use nvkmd_mem for nvk_cmd_pool |
| - nvk: Use nvkmd_mem for nvk_descriptor_pool |
| - nvk: Use nvkmd_mem in nvk_upload_queue |
| - nvk: Use nvkmd_mem for descriptor tables |
| - nvk: Use nvkmd_mem for shader and event heaps |
| - nvk: Use nvkmd_mem for query pools |
| - nvk: Use an nvkmd_mem for the SLM area |
| - nvk: Drop extra_bos from nvk_queue_submit_simple() |
| - nvk: Use nvkmd_mem for the nvk_queue_state::push |
| - nvk: Use nvkmd_mem for the zero page, VAB, and CB0 |
| - nvk/nvkmd: Add a context interface |
| - nvk/nvkmd: Implement nvkmd_ctx for nouveau |
| - nvk: Convert the upload queue to nvkmd_ctx |
| - nvk: Use an nvkmd_ctx for sparse binding |
| - nvk: Use nvkmd_ctx for queue submit |
| - nvk: Remove the last vestiges of nouveau/winsys from core NVK |
| - nouveau/mme: Don't dereference an empty vector |
| - nouveau/mme: Don't leak data_bo |
| - nouveau/mme: Use fixed BO addresses in the MME test |
| - nvk: Move Heaps and BO binding into nvkmd |
| - nvk: Move debug flags into nvk_debug.h |
| - nvk/nvkmd: Plumb parent pointers through everywhere |
| - nvk/nvkmd: Re-implement NVK_DEBUG=vm |
| - nvk: Do mem maps directly in nvkmd on nouveau |
| - nvk/nvkmd: Add real mem<->bo flag translation |
| - nvk/nvkmd: Flip the script on NO_SHARED |
| - nvk: Drop nvk_buffer::is_local |
| - nvk/nvkmd: Rework memory placement flags |
| - nvk/nvkmd: Be more specific about memory alignments |
| - nvk/nvkmd: Be a lot more pedantic about VA alignments |
| - nvk: Put CB0 in VRAM |
| - nvk: Put descriptors in VRAM |
| - nouveau/push: Cache the last header DW to avoid read-back |
| - nak/sph: Stop storing the shader model in ShaderProgramHeader |
| - nak: Move encode_sm* to sm*.rs |
| - nak/sm50: Get rid of the hand-rolled align_up/down() helpers |
| - nak: Plumb a ShaderModel trait through everywhere |
| - nak/ra: Move the NAK_DEBUG=spill logic into RA |
| - nak: Move RegFile::num_regs() into ShaderModel |
| - nak: Move Instr::can_be_uniform() into ShaderModel |
| - nak: Move instruction encoding into ShaderModel |
| - nak/sm70: Move instruction encoding into a trait |
| - nak/sm70: Re-organize the code a bit |
| - nak/legalize: Move a bunch of helpers to a trait |
| - nak/legalize: Handle OpBreak and OpBSSy specially |
| - nak/legalize: Handle RA instructions up-front |
| - nak/sm70: Move legalization into SM70Op |
| - nak/sm50: Move instruction encoding into a trait |
| - nak/sm50: Move legalization into SM50Op |
| - nak: Add a legalize() method to ShaderModel |
| - nak/sm50: Re-order all the ops |
| - nak/sm50: Fix immediates for IMnMx |
| - zink/kopper: Set VK_COMPOSITE_ALPHA_OPAQUE_BIT when PresentOpaque is set |
| - nak: gather instr count explicitly |
| - nvk/nvkmd: nouveau uses the OS page size |
| - nvk: Drop the sparse alignment back down to 4096 |
| - nvk: Use the page size queried from NVKMD |
| - nak/nir: Use an indirect load for sample locations |
| - nak/copy_prop: Propagate OpSel with a selector of SrcRef::Zero |
| - nak/copy_prop: Ignore the top 16 bits of OpPrmt::sel |
| - nak: Don't print the destination of OpIpa twice |
| - nir,nak: Add a nir_op_prmt_nv |
| - nak/nir: Use prmt in texture lowering |
| - nak/nir: Use prmt for barycentric offset lowering |
| - nak/nir: Make interpolate_at_sample more efficient |
| - nak: Add some helpers for working with OpPrmt selectors |
| - nak: Optimize nested OpPrmt |
| - nak: Add a pass macro for more consistent debug printing |
| - nak: Run copy-prop again after opt_prmt and opt_lop |
| - nvk: Fix indirect cbuf binds pre-Turing |
| - nvk: Don't advertise sparse residency on Maxwell A |
| - nvk: Reject sparse images on Maxwell A and earlier |
| - nak/spill_values: Don't assume no trivial phis |
| - meson/megadriver: Don't invoke the megadriver script with no drivers |
| - nak: Sample locations are byte-aligned |
| - nvk: Require color or depth/stencil attachment support for input attachments |
| - nvk: Support STORAGE_READ_WITHOUT_FORMAT on buffers |
| - zink: Align descriptor buffers to descriptorBufferOffsetAlignment |
| |
| Francisco Jerez (33): |
| |
| - intel/brw/xe2+: Keep PS sample mask in the f1.0 register whether or not kill is used. |
| - intel/brw: Don't emit Z coordinate interpolation if CPS isn't in use. |
| - intel/brw/xe2+: Fix indirect extended descriptor setup for scratch space. |
| - iris: Allocate fixed amount of space for blend state. |
| - blorp: Allocate fixed amount of space for blend state. |
| - intel/brw/xe2+: Don't use SEL peephole on 64-bit moves. |
| - intel/brw/xe2+: Fix 64-bit subgroup scan intrinsics not to rely on SEL instructions. |
| - intel/brw/xe2+: Lower 64-bit SHUFFLE and CLUSTER_BROADCAST. |
| - intel/xe2+: Enable native 64-bit integer arithmetic. |
| - nir: Add option to lower 64-bit uadd_sat. |
| - intel/brw/xe2+: Lower 64-bit integer uadd_sat. |
| - intel/brw/xe2+: Round up spill/unspill data size to nearest reg_size multiple. |
| - intel/xe2+: Enable native 64-bit integer arithmetic. |
| - iris,anv/xe2+: Enable the DX10/OGL border mode for YCrCb as per Wa_14014226147. |
| - iris,anv/xe2+: Set tessellation redistribution regions per patch to recommended values. |
| - iris,anv/xe2+: Use pipelined variant of 3DSTATE_DRAWING_RECTANGLE. |
| - intel/brw/xe2+: Use active-thread-only barriers available since Xe2+. |
| - iris/xe2+: Fix format of scratch space surface address in various 3DSTATE packets. |
| - anv/xe2+: Fix format of scratch space surface address in various 3DSTATE packets. |
| - intel/fs/gfx20+: Fix surface state address on extended descriptors for NIR scratch intrinsics. |
| - intel/fs/xe2+: Ask driver for PS payload registers based on barycentric load intrinsics in use. |
| - iris/gfx11+: Request PS payload fields for ALU-based interpolation via 3DSTATE_PS_EXTRA. |
| - anv/gfx11+: Request PS payload fields for ALU-based interpolation via 3DSTATE_PS_EXTRA. |
| - intel/fs/xe2+: Don't lower barycentric load offsets to fixed-point format on Xe2+. |
| - intel/fs/xe2+: Add ALU-based implementation of barycentric interpolation at a per-channel offset. |
| - intel/fs/xe2+: Add ALU-based implementation of barycentric interpolation at a per-channel sample. |
| - intel/dev: Add GRF size information to the intel_device_info struct. |
| - anv/xe2+: Align push constant ranges to GRF boundaries. |
| - intel/brw: Implement null push constant workaround. |
| - intel/dev: Add devinfo flag for TBIMR push constant workaround. |
| - anv/gfx12.5: Pass non-empty push constant data to PS stage for TBIMR workaround. |
| - iris/gfx12.5: Pass non-empty push constant data to PS stage for TBIMR workaround. |
| - iris: Pin pixel hashing table BO from iris_batch submission instead of from iris_state. |
| |
| Friedrich Vock (7): |
| |
| - aco/tests: Insert p_logical_start/end in reduce_temp tests |
| - aco/spill: Insert p_start_linear_vgpr right after p_logical_end |
| - radv: Use max_se instead of num_se where appropriate |
| - radeonsi: Use max_se instead of num_se where appropriate |
| - radv/rt: Fix memory leak when compiling libraries |
| - aco/spill: Don't spill phis with all-undef operands |
| - aco: Limit rt stages to 128 vgprs |
| |
| GKraats (3): |
| |
| - i915g: fix generation of large mipmaps |
| - i915g: fix mipmap-layout for npots |
| - i915g: fix max_lod at mipmap-sampling |
| |
| Ganesh Belgur Ramachandra (4): |
| |
| - radeonsi: add GL_EXT_texture_filter_minmax extension |
| - radeonsi: add GL_ARB_texture_filter_minmax extension |
| - radeonsi: fix epitch on chips without image opcodes (e.g. gfx940) |
| - amd/common: skip lane size determination for chips without image opcodes (e.g. gfx940) |
| |
| Georg Lehmann (88): |
| |
| - aco/tests: don't use undef for descriptors |
| - aco/tests/post_ra: fix various validation errors |
| - aco/lower_to_hw: fix v_cvt_pk_u16_u32 instruction format |
| - aco/lower_to_hw: fix 16bit p_insert on gfx8 |
| - aco/tests: validate before and after post-ra tests |
| - spirv: preserve signed zero in modf |
| - aco/lower_to_hw: don't use regClass to identify subdword reductions |
| - aco: add a subdword lowering pass |
| - aco: add tests for lower_subdword |
| - aco/ra: remove gfx6/7 subdword paths |
| - aco/lower_to_hw: remove gfx6/7 subdword paths |
| - ac/nir: explicitly use pack_half_2x16_rtz |
| - radv, radeonsi: don't use D16 for f2f16_rtz |
| - radv: always run nir_opt_16bit_tex_image |
| - nir/opt_16bit_tex_image: pass options to opt_16bit_dest |
| - nir/opt_16bit_tex_image: optimize packed conversions too |
| - aco/gfx11+: use v_cvt_pk_u8_f32 for 8bit constant copies |
| - aco/gfx10: use v_add_u16 with literal for constant copies |
| - aco/tests: simplify small constant copy test |
| - aco/gfx11+: optimize v_fma_mix throughput |
| - zink: use bitcasts instead of pack/unpack double opcodes |
| - aco/gfx11: use v_swap_b16 |
| - aco/optimizer: remove ineffective vcc opt |
| - aco/optimizer: remove ineffective undef opt |
| - aco: remove perfwarn |
| - aco: don't pass program to emit_bpermute |
| - aco/lower_to_hw: add copy_constant_sgpr |
| - aco: small constant copy optimizations |
| - aco/lower_to_hw: use copy_constant_sgpr for masks |
| - aco/lower_to_hw: optimize split 64bit constant copies |
| - aco/optimizer: use p_create_vector to create mask when a copy can't be used |
| - nir: remove unpack_half_flush_to_zero |
| - nir/opt_uniform_atomics: handle inverse_ballot when detecting single lane ifs |
| - aco: optimize branching sequence with p_create_vector exec producer |
| - nir: sink/move inverse_ballot like moves |
| - ac: set has_pack_32_4x8 |
| - nir: lower pack_uvec4_to_uint to pack_32_4x8 if supported |
| - nir/opt_algebraic: alternative 8bit pack_[us]norm_4x8 lowering |
| - aco: rework how affinities for acc operands are determined |
| - aco: add affinities for possible sopk optimizations |
| - aco/gfx11+: fix inline constants for v_pk_fmac_f16 |
| - aco: move literal unswizzle opt to RA |
| - aco/ra: use a switch to check vop2acc instruction support |
| - aco: move s_add_u32 -> s_addk_i32 optimization fully to ra |
| - amd/common: set COMPUTE_STATIC_THREAD_MGMT_SE2-3 correctly on gfx10-11 |
| - aco: add more anonymous namespaces |
| - aco: make local functions static in files without anonymous namespace |
| - radv: inline partial push constant loads |
| - nir: add ford, funord, fneo, fequ, fltu, fgeu |
| - aco: implement ford, funord, fneo, fequ, fltu, fgeu |
| - ac/llvm: implement ford, funord, fneo, fequ, fltu, fgeu |
| - ac/nir: enable ford, funord, fneo, fequ, fltu, fgeu |
| - nir/opt_algebraic: look through fabs/fneg when matching fmulz/ffmaz |
| - nir/optimize cmp(a, -0.0) |
| - nir/opt_algebraic: optimize cmp(fneg(a), #b) and feq with fabs |
| - nir/opt_algebraic: add various unordered/ordered patterns from aco |
| - aco: remove ordered/unordered optimizations |
| - aco/ir: remove unused vopc helpers |
| - iris/ci: update trace checksums |
| - aco/ra: fix affinity for s_addk |
| - aco: fix s_delay_alu with salu and trans dependency |
| - aco,nir: add dpp16_shift_amd intrinsic |
| - radv/nir: add a pass to optimize shuffle/booleans dependent only on tid/consts |
| - radv: use radv_nir_opt_tid_function for shuffles |
| - radv: use radv_nir_opt_tid_function to create inverse_ballot |
| - aco/gfx12: use trans s_delay_alu for pseudo scalar |
| - aco/gfx12: don't allow vgpr operands for pseudo scalar |
| - aco/gfx11.5: select s_cvt_[ui]32_f32 |
| - aco/gfx11.5: select s_(ceil|floor|trunc|rndne) |
| - aco: add aco_opcode::p_s_cvt_f16_f32_rtne |
| - aco/gfx11.5: select SALU float conversions |
| - aco/gfx11.5: fix s_fmac acc to definition |
| - aco/gfx11.5: select SOP2 float instructions |
| - aco/gfx11.5: select SOPC float instructions |
| - aco/gfx11.5: select SALU fsat |
| - aco/gfx11.5: select SALU fsign |
| - aco/gfx11.5+: allow sgpr dst for trans ops and use pseudo scalar ops on gfx12 |
| - aco/gfx11.5: select SALU fneg/fabs |
| - aco/gfx11.5: select SALU fquantize2f16 |
| - aco: micro optimize VALU fquantize2f16 |
| - aco: handle clustered uniform reductions correctly |
| - nir: constant fold inverse_ballot |
| - aco: remove optimize_cmp_subgroup_invocation |
| - spirv: ignore more function param decorations |
| - aco/optimizer: update temp_rc when converting to uniform bool alu |
| - aco/gfx11+: don't use VOP3 v_swap_b16 |
| - nir/lower_int64: replace uadd_sat with ior for find_lsb64 and ufind_msb64 |
| - aco/gfx10+: set lateKill for sgprs used by wave64 VALU writing a mask |
| |
| Gert Wollny (4): |
| |
| - zink/kopper: Wait for last QueuePresentKHR to finish before acquiring for readback |
| - mesa/st: don't use base shader serialization when uniforms are not packed |
| - r600/sfn: Set bit size for newly created store intrinsic |
| - zink: limit minSampleShading to a maximum value of 1.0 |
| |
| Guilherme Gallo (3): |
| |
| - ci: Add S3 id_token for all jobs |
| - ci: Use id_tokens for JWT auth |
| - ci/lava: Fix cmdline for UART/fastboot devices |
| |
| Hans-Kristian Arntzen (5): |
| |
| - vulkan: Update XML and headers to 1.3.285. |
| - ac/surface: Add surface flags to prefer 4K and 64K alignment. |
| - radv: Implement VK_MESA_image_alignment_control |
| - wsi/common: Do not update present mode with MESA_VK_WSI_PRESENT_MODE. |
| - wsi/x11: Bump maximum number of outstanding COMPLETE events. |
| |
| Heinrich Fink (1): |
| |
| - zink: remove workaround of FB modifiers forcing present state |
| |
| Iago Toral Quiroga (53): |
| |
| - v3dv: fix VK_KHR_vertex_attribute_divisor |
| - v3d,v3dv: stop hard-coding max attrib divisor |
| - broadcom/compiler: assert on array overflow |
| - v3d: fix array_len when precompiling outputs for shader-db |
| - broadcom/compiler: fix num_textures for precompiled shaders |
| - broadcom/compiler: don't read excess channels on image loads |
| - broadcom/compiler: simplify v3d_vir_emit_tex |
| - broadcom/cle: fix up shader record for V3D 7.1.10 / 2712D0 |
| - v3d: support 2712D0 |
| - v3dv: support 2712D0 |
| - broadcom/compiler: make add_node return the node index |
| - broadcom/compiler: don't assign payload registers to spilling setup temps |
| - broadcom/compiler: apply payload conflict to spill setup before RA |
| - broadcom/compiler: check if vertex shader writes point size |
| - v3dv: only flag 'shader writes point size' if the shader actually writes it |
| - v3dv: emit a default point size when drawing points |
| - v3dv: drop unused stride field from v3dv_pipeline_vertex_binding |
| - v3dv: fix incorrect index buffer size |
| - v3dv: use pSizes parameter in vkCmdBindVertexBuffers2 |
| - v3dv: implement vkCmdBindIndexBuffer2KHR |
| - v3dv: handle VkBufferUsageFlags2CreateInfoKHR |
| - v3dv: handle VkPipelineCreateFlags2CreateInfoKHR |
| - v3dv: lower maxVertexInputBindingStride to match vulkan runtime |
| - v3dv: shader modules are deprecated with VK_KHR_maintenance5 |
| - v3dv: implement vkGetImageSubresourceLayout2KHR |
| - v3dv: refactor create_image |
| - v3dv: add a get_image_subresource_layout helper |
| - v3dv: implement vkGetDeviceImageSubresourceLayoutKHR |
| - v3dv: implement vkGetRenderingAreaGranularityKHR |
| - v3dv: fix pipeline leaks when meta pipeline cache is disabled |
| - v3dv: fix a few asserts that check layerCount instead of array_layers |
| - v3dv: allow VK_REMAINING_ARRAY_LAYERS in VkImageSubresourceLayers |
| - v3dv: remove blit shader restriction on depth/stencil not being linear |
| - v3dv: disable some TLB paths for cases of linear depth/stencil stores |
| - v3dv: support VK_FORMAT_A1B5G5R5_UNORM_PACK16_KHR |
| - v3dv: add more checks for device loss |
| - v3dv: fix handling of pipeline flags when pipeline init fails |
| - v3dv: expose VK_KHR_maintenance5 |
| - broadcom/compiler: initialize payload_conflict for all initial nodes |
| - v3dv: don't call wsi_device_init too early |
| - broadcom/compiler: don't spill in between multop and umul24 |
| - broadcom/compiler: fix per-quad spilling |
| - broadcom/compiler: validate rtop + thrsw hazard |
| - broadcom/compiler: drop multop if we dce umul24 |
| - broadcom/compiler: add missing signal compatibilities for V3D 7.x |
| - broadcom/compiler: add new float32 unpack modifiers in V3D 7.x |
| - broadcom/compiler: disallow copy propagation of FMOV exclusive modifiers |
| - broadcom/compiler: implement nir_op_fsat |
| - v3d: don't lower fsat on V3D 7.x |
| - v3dv: make nir helpers receive nir compiler options from caller |
| - v3dv: don't lower fsat on V3D 7.x |
| - v3d: skip tlb loads when emitting clears with a draw call |
| - v3d: rename job->clear to job->clear_tlb |
| |
| Ian Romanick (33): |
| |
| - intel/brw: Fix optimize_extract_to_float for i2f of unsigned extract |
| - intel/brw: Avoid optimize_extract_to_float when it will just be undone later |
| - intel/elk: Fix optimize_extract_to_float for i2f of unsigned extract |
| - nir/algebraic: Optimize some extract_* expressions |
| - spirv: Use fp16 fp_fast_math settings when lowering fp16 asin and acos |
| - intel/brw: Remove dsign optimization |
| - intel/elk: Remove dsign optimization |
| - intel/brw: Use fs_inst::resize_sources in brw_fs_opt_algebraic |
| - intel/brw: Add support for fcsel opcodes |
| - intel/brw: Handle fsign optimization in a NIR algebraic pass |
| - intel/brw: Update CSEL source type validation |
| - intel/brw: Combine constants and constant propagation for CSEL |
| - intel/brw: Algebraic optimizations for CSEL |
| - intel/brw: Implement more strictly correct fsign lowering |
| - intel/brw: Use range analysis to optimize fsign |
| - nir/algebraic: Add nir_lower_int64_options::nir_lower_iadd3_64 |
| - nir/search: Fix is_16_bits for vectors |
| - nir/search: Refactor is_16_bits |
| - nir/algebraic: More patterns to generate iadd3 |
| - nir/algebraic: intel/fs: Optimize some patterns before lowering 64-bit integers |
| - intel/brw: Temporarily disable result=float16 matrix configs |
| - intel/brw: Major rework of lower_cmat_load_store |
| - intel/brw/xe2+: Catch invalid uses of writes_accumulator earlier |
| - intel/brw/xe2+: Adjust size_read() for DPAS |
| - intel/brw/xe2+: Scale size_written by reg_unit for DPAS |
| - intel/brw/xe2+: Adjust DPAS lowering to DP4A to accommodate larger GRF and SIMD16 |
| - intel/brw/xe2+: Allow vec16 for cooperative matrix |
| - nir: dpas_intel second source can have different number of components |
| - intel/brw/xe2+: Add LNL cooperative matrix configurations |
| - intel/tools: Advertise I915_PARAM_HAS_EXEC_TIMELINE_FENCES |
| - intel/brw: Test corner case CSE of ADD3 instructions |
| - intel/brw: Don't propagate saturate to an instruction that writes flags |
| - intel/elk: Don't propagate saturate to an instruction that writes flags |
| |
| Icenowy Zheng (7): |
| |
| - llvmpipe: add shader cache support for ORCJIT implementation |
| - gallivm: orcjit: use a mutex to protect symbol looking up |
| - util: detect LoongArch architecture |
| - gallivm: add LoongArch support to the mattrs setting code |
| - llvmpipe: add LoongArch support in ORCJIT |
| - gallivm: orcjit: keep the ownership of tm for LPJit |
| - gallivm: orcjit: use atexit to release LPJit singleton at exit |
| |
| Italo Nicola (1): |
| |
| - nir: add {load,store}_global_etna intrinsics |
| |
| Iván Briano (21): |
| |
| - compiler: reorder FLOAT_CONTROLS enums |
| - nir: track some float controls bits per instruction |
| - spirv: gather some float controls bits per instruction |
| - nir: check inf/nan/sz preserve per-instruction |
| - nir/algebraic: support float controls conditions per instruction |
| - nir/algebraic: move float control conditions to be per instruction |
| - vtn: support float controls2 |
| - anv: enable VK_KHR_shader_float_controls2 |
| - anv: check requirements for VK_IMAGE_USAGE_FRAGMENT_SHADING_RATE |
| - anv: fix casting to graphics_pipeline_base |
| - anv: consolidate DestroyPipeline for graphics and graphics_lib |
| - intel/brw: fix subgroup size of geometry stages for lnl+ |
| - anv: check cmd_buffer is on a transfer queue more properly |
| - intel/brw: add fetch_viewport_index function |
| - intel/brw: always read LAYER/VIEWPORT from the FS payload |
| - vulkan/runtime: pColorAttachmentInputIndices is allowed to be NULL |
| - vulkan/properties: handle LayeredApiPropertiesListKHR |
| - anv: enable VK_KHR_maintenance7 |
| - anv: get scratch surface from the correct pool |
| - anv: set MOCS for protected memory when needed |
| - intel/rt: fix terminateOnFirstHit handling |
| |
| JCWasmx86 (1): |
| |
| - meson: Fix invalid kwarg name |
| |
| Jeremy Gebben (1): |
| |
| - radv: Return hang status from radv_check_gpu_hangs() |
| |
| Jesse Natalie (14): |
| |
| - nir_opt_algebraic: Add a couple optimizations for lowered unpack(pack()) |
| - wgl: Delete pixelformat support query |
| - wgl: Fix flag check for GDI compat |
| - nir_range_analysis: Use fmin/fmax to fix NAN handling |
| - d3d12: Use GetResourceAllocationInfo instead of GetCopyableFootprints for residency sizes |
| - nir: Remove assert-only variable by inlining its single use |
| - zink: Add ASSERTED to assert-only local variable |
| - mesa: Add ASSERTED to assert-only local variable |
| - subprojects: Use depth=1 in the git wrap files |
| - blake3: fix Windows ARM64 build and detect ARM64EC as ARM64 |
| - ci/windows: Disable zlib in LLVM |
| - ci/windows: Specify numpy < 2.0 to prevent breaking changes |
| - microsoft/clc: Split struct copies before vars_to_ssa in pre-inline optimizations |
| - meson: Add an error message for llvmpipe without llvm draw support |
| |
| Jessica Clarke (3): |
| |
| - Revert "meson: Do not require libdrm for DRI2 on hurd" |
| - Revert "meson: fix with_dri2 definition for GNU Hurd" |
| - meson: egl: Build egl_dri2 driver even for plain DRI |
| |
| Jianxun Zhang (43): |
| |
| - intel/isl: Allow multi-sample on depth aux usage (xe2) |
| - isl: Add a heading 4KB to MCS surface (xe2) |
| - isl: Add AUX MCS encoding into aux modes (xe2) |
| - blorp: Scaledown rectangle of MSAA fast clear (xe2) |
| - blorp: Fix offset when ambiguating MCS buffer (xe2) |
| - isl: Clone from isl_gfx12.* files (xe2) |
| - isl: Update isl_gfx20 code (xe2) |
| - isl: Add isl_gfx20 into build (xe2) |
| - isl: Add dispatching in isl.c (xe2) |
| - isl: Implement a part of WA_22018390030 (xe2) |
| - isl: Remove code for Xe2 from isl_gfx12.c |
| - isl: Update render CMF mapping (xe2) |
| - isl: Don't set clear values or their address (xe2) |
| - blorp: Get fast clear rectangle of non-MSAA surfaces (xe2) |
| - blorp: Pass down fast clear color value (xe2) |
| - intel/genxml,blorp,common: Update 3DSTATE_PS command (xe2) |
| - iris: Update aux state for color fast clears (xe2) |
| - iris: Limit FCV_CCS_E to platforms that enable it |
| - anv: Don't enable compression with modifiers (xe2) |
| - iris: Add more restrictions on compression (Xe2) |
| - anv: Don't enable compression on external bos (xe2) |
| - iris: Disable PAT-based compression on depth surfaces (xe2) |
| - anv: Disable PAT-based compression on depth images (xe2) |
| - iris: Update synchronization of fast clear (xe2) |
| - iris: Workaround: Don't allocate compressed bo from cache (xe2) |
| - isl: Remove restriction of CCS_E support on formats (xe2) |
| - blorp: Don't convert ccs_e formats for copy (xe2) |
| - isl: Initialize the last usage in isl_encode_aux_mode[] (xe2) |
| - anv: Update synchronization of fast clear (xe2) |
| - iris: Disable predraw resolve (xe2) |
| - blorp: Ensure MSAA fast clear in correct modes (xe2) |
| - intel/dev: Select a compressed PAT entry (xe2) |
| - isl: Add some formats not covered in CMF table (xe2) |
| - anv: Disable tracking fast clear and aux state (xe2) |
| - anv: Fix Vulkan CTS failure related to MCS (xe2) |
| - anv: Support arbitrary fast-clear value on all layouts (xe2) |
| - anv: Disable tracking of clear color on color attachment |
| - intel/common: Ensure SIMD16 for fast-clear kernel (xe2) |
| - intel/common: Remove blank lines in intel_set_ps_dispatch_state() (xe2) |
| - anv: Fix assertion failures on BMG (xe2) |
| - iris: Fix an assertion failure with compressed format |
| - anv: Disable compression on legacy modifiers (xe2) |
| - anv: Disable legacy CCS setup in binding (xe2) |
| |
| Job Noorman (33): |
| |
| - ir3: simplify cat5 parsing |
| - ir3: add encoding for isam.v |
| - ir3: use isam.v for multi-component SSBO loads |
| - ir3: add encoding of ldib/stib offsets |
| - ir3: lower SSBO access imm offsets |
| - nir/opt_offsets: add callback for max base offset |
| - nir/opt_offsets: add option to allow offset wrapping |
| - nir/opt_offsets: add load/store_ssbo_ir3 |
| - ir3: use nir_opt_offsets for SSBO accesses |
| - ir3: optimize SSBO offset shifts for nir_opt_offsets |
| - ir3: remove spilled splits in shared RA |
| - ir3: set wrmask for spilled splits in shared RA |
| - ir3: print sharedness/halfness of merge set regs |
| - ir3: print intervals when dumping merge sets |
| - ir3: print dst_offset of spill.macro |
| - ir3: debug print limit pressure and post-spill max pressure |
| - ir3: set current instruction before all validation asserts |
| - ir3: fix crash in try_evict_regs with src reg |
| - ir3: fix handling of early clobbers in calc_min_limit_pressure |
| - ir3: set offset on splits created while spilling |
| - ir3: correctly set wrmask for reload.macro |
| - ir3: don't remove intervals for non-killed tex prefetch sources |
| - ir3: don't remove collects early while spilling |
| - ir3: expose instruction indexing helper for merge sets |
| - ir3: make indexing instructions optional in ir3_merge_regs |
| - ir3: index instructions before fixing up merge sets after spilling |
| - ir3: move liveness recalculation inside ir3_ra_shared |
| - ir3: restore interval_offset after liveness recalculation in shared RA |
| - ir3: add ir3_cursor/ir3_builder helpers |
| - ir3: refactor ir3_spill.c to use the ir3_cursor/ir3_builder API |
| - ir3: only add live-in phis for top-level intervals while spilling |
| - ir3: print rounding mode for cov |
| - ir3: set rounding mode for all floating point conversions |
| |
| Jordan Justen (33): |
| |
| - blorp: Update programming for XY_FAST_COLOR_BLT on xe2 |
| - intel/genxml: Add XY_FAST_COLOR_BLT for xe2 |
| - intel/genxml: Update 3DSTATE_BTD for xe2 |
| - intel/dev: Allow setting FORCE_PROBE for intel PCI IDs |
| - intel/dev: Support INTEL_FORCE_PROBE env-var |
| - docs: Document INTEL_FORCE_PROBE env-var |
| - intel/dev: Add LNL device info |
| - pci_ids/intel: Add LNL PCI IDs (with FORCE_PROBE set) |
| - anv/grl: Set INTEL_FORCE_PROBE=* when running intel_clc |
| - intel/brw: Simplify enabling brw_fs_test_dispatch_packing |
| - intel/brw: Allow xe2 in brw_stage_has_packed_dispatch() |
| - intel/brw: Fix SSBO/shared load offset register size for Xe2 |
| - anv/grl: Build for xe2 |
| - Revert "anv: Disable Ray Tracing on xe2 until our compiler supports Xe2 RT" |
| - intel/dev/mesa_defs.json: Update LNL WA entries |
| - intel/dev: Add INTEL_PLATFORM_BMG enum, BMG WA info |
| - intel/dev: Add BMG device info |
| - intel/dev: Add BMG PCI IDs (with FORCE_PROBE set) |
| - intel/dev: Silence INTEL_FORCE_PROBE warning for intel_clc |
| - intel/dev: If building the driver, always allow getting device info |
| - Revert "anv/grl: Set INTEL_FORCE_PROBE=* when running intel_clc" |
| - intel/compiler: Don't set size written in brw_lower_logical_sends.cpp |
| - intel/tools: Fix intel_dev_info --hwconfig switch |
| - isl: Move isl_get_render_compression_format in isl_genX_helpers.h |
| - isl: Implement isl_get_render_compression_format for xe2 |
| - intel/brw: Retype some regs to BRW_TYPE_UD for Xe2 indirect accesses |
| - intel/perf/xe: Fix free pointer location in xe_add_config() |
| - intel/dev: Enable LNL PCI IDs without INTEL_FORCE_PROBE |
| - anv/generated_indirect_draws: Adjust xe2 simd32 sends_count_expectation |
| - intel/dev: Disable LNL PCI IDs on Mesa 24.2 (require INTEL_FORCE_PROBE) |
| - intel/brw/validate: Simplify grf span validation check by not using a mask |
| - intel/brw/validate: Update dst grf crossing check for Xe2 |
| - intel/brw/validate: Convert access mask to be grf based |
| |
| Jordan Petridis (1): |
| |
| - Revert "ci: mark microsoft farm as offline" |
| |
| Jose Maria Casanova Crespo (9): |
| |
| - v3d: fix CLE MMU errors avoiding using last bytes of CL BOs. |
| - v3dv: fix CLE MMU errors avoiding using last bytes of CL BOs. |
| - v3d: Increase alignment to 16k on CL BO on RPi5 |
| - v3dv: Increase alignment to 16k on CL BO on RPi5 |
| - v3dv: V3D_CL_MAX_INSTR_SIZE bytes in last CL instruction not needed |
| - v3dv/ci: Add more dEQP-VK subgroups that are currently skipped |
| - v3dv: Emit stencil draw clear if needed for GFXH-1461 |
| - v3dv: really fix CLE MMU errors on 7.1HW Rpi5 |
| - v3d: really fix CLE MMU errors on 7.1HW Rpi5 |
| |
| Josh Simmons (3): |
| |
| - radv: Fix crash when using SQTT and NO_COMPUTE |
| - radv: Add \`RADV_PROFILE_PSTATE\` envvar |
| - radv: Fix shader mask for SQ_WGP SPM counters |
| |
| José Expósito (2): |
| |
| - meson: Update proc_macro2 meson.build patch |
| - llvmpipe: Init eglQueryDmaBufModifiersEXT num_modifiers |
| |
| José Roberto de Souza (87): |
| |
| - intel/perf: Nuke platform_supported |
| - intel/perf: Remove i915_drm.h include from gen_perf.py |
| - intel/perf: Fix the error check of i915_add_config() |
| - intel/perf: Change oa_format to uint64_t |
| - intel/perf: Store pointer to intel_device_info in intel_perf_config |
| - intel/perf: Add intel_perf_free() |
| - intel/perf: Add intel_perf_free_context() |
| - intel/ds: Free perf config and context |
| - intel/ds: Nuke ralloc_ctx and ralloc_cfg |
| - anv: Free intel_perf_config when destroying physical device |
| - hasvk: Free intel_perf_config when destroying physical device |
| - iris: Free intel_perf_config and intel_perf_context |
| - crocus: Free intel_perf_config and intel_perf_context |
| - intel/perf: Add and use a function to return platform OA format |
| - intel/perf: Add function to open perf stream |
| - intel/perf: Fix return of read_oa_samples_until() |
| - anv: Nuke perf_query_pass from anv_execbuf |
| - intel/perf: Replace I915_OA_FORMAT_* usage by platform check |
| - intel/perf: Move code that will be shared by both KMDs |
| - intel/perf: Move i915 specific code from common code |
| - intel/perf: Move i915 specific code to load configurations to i915 file |
| - intel/perf: Allocate sseu in heap memory |
| - intel/perf: Replace drm_i915_perf_record_header by intel_perf_record_header |
| - intel/perf: Add a macro with header + sample length |
| - intel/perf: Add intel_perf_stream_read_samples() |
| - intel/dev: Add LNL stepping mapping |
| - intel/dev: Add BMG stepping mapping |
| - intel: Move slm functions from brw_compiler.h to intel_compute_slm.c/h |
| - intel/common: Implement Xe2 SLM encode |
| - intel/common: Implement preferred SLM encode |
| - intel/dev: Use topology variables to calculate strides in Xe KMD |
| - intel/dev: Add function to get the number of EUs per subslice |
| - intel: Set preferred SLM allocation size >= than SLM size for Xe2 |
| - anv: Set maxComputeSharedMemorySize value for Xe2 platforms |
| - intel: Compute the optimal preferred SLM size per subslice |
| - anv: Initialize variable to fix static analyzer warning |
| - intel/genxml/gfx20: Sync POSTSYNC_DATA struct with spec |
| - anv/xe2: Enable compute walker and BTD thread preemption |
| - anv/xe2: Add STATE_COMPUTE_MODE individual masks |
| - anv: Remove block promoting non CPU mapped bos to coherent |
| - intel/isl: Set dummy_aux_address to implement Wa_14019708328 |
| - anv: Implement Wa_14019708328 |
| - iris: Implement Wa_14019708328 |
| - anv: Implement Wa_14019857787 |
| - iris: Implement Wa_14019857787 |
| - intel/dev: Add compressed PAT entry |
| - anv: Add support for compressed images allocation in Xe2 |
| - anv: Give apps the choice of compressed or uncompressed but cpu visible images |
| - iris: Add support for compressed images allocation in Xe2 |
| - anv: Fix assert in xe_gem_create() |
| - intel/perf: Change order of if blocks |
| - intel/perf: Add assert to check if allocated enough query fields |
| - intel/dev: Add engine_class_supported_count to intel_device_info |
| - intel/perf: Add LNL OA XML |
| - intel/perf: Add INTEL_PERF_QUERY_FIELD_TYPE_SRM_OA_PEC |
| - intel/perf: Extend intel_perf_query_result_read_gt_frequency() to gfx 20 |
| - intel: Sync xe_drm.h |
| - intel/perf: Implement function that returns OA format for Xe KMD |
| - intel/perf: Add function to check if OA/perf is supported by Xe KMD |
| - intel/perf: Replace i915_perf_version and i915_query_supported by a feature bitmask |
| - intel/perf: Refactor and add Xe KMD support to add and remove configs |
| - intel/perf: Add Xe KMD perf stream open function |
| - intel/perf: Refactor and add Xe KMD support to enable and disable perf stream |
| - intel/perf: Refactor and add Xe KMD support to change stream metrics id |
| - tool/pps: Add Xe KMD support |
| - intel/perf: Remove i915_drm.h includes from common code |
| - intel/perf: Implement Xe KMD perf stream read |
| - anv: Implement Xe KMD query pools |
| - intel/perf: Enable perf on Xe KMD |
| - intel/perf: Implement intel_perf_query_result_accumulate() for gfx 20+ |
| - intel/perf: Add support for LNL OA sample format size |
| - intel/perf: Return LNL OA sample format |
| - intel/perf: Do not add INTEL_PERF_QUERY_FIELD_TYPE_SRM_OA_PEC |
| - intel/perf: Adjust EU count for Xe2+ |
| - intel/dev: Replace intel_device_info::apply_hwconfig by a gfx version check |
| - intel: Rename XE_PERF to XE_OBSERVATION |
| - anv: Fix return of PAT index for compressed bos for discrete GPUs |
| - intel/dev: Drop DG1 PAT entries |
| - intel/dev: Add documentation about intel_device_info_pat_entry::mmap |
| - intel/dev: Drop coherency from intel_device_info_pat_entry |
| - intel/dev: Add comment documenting the PAT entries |
| - intel/dev: Use GPU WB PAT for Xe2 writecombining |
| - intel/dev: Drop writeback_incoherent from Xe2 |
| - isl: Fix Xe2 protected mask |
| - anv: Propagate protected information to blorp_batch_isl_copy_usage() |
| - intel: Sync xe_drm.h |
| - intel/dev: Support new topology type with SIMD16 EUs |
| |
| Juan A. Suarez Romero (57): |
| |
| - vc4/ci: update results |
| - vc4/v3d/ci: update expected list |
| - vc4: set src type on storing sample mask |
| - broadcom/compiler: remove unused parameters in vpm read |
| - broadcom/compiler: do not run lowering I/O for FS |
| - v3d/vc4/ci: set full renderer version check |
| - nir,v3d: rename tlb_color_v3d intrinsic |
| - vc4: use tlb_color_brcm intrinsic |
| - .gitignore: add .cache folder |
| - vc4: use IO semantics for location |
| - v3d: use BITSET for the masks |
| - v3d: remove handled cases for devices <= 42 |
| - ci: define SNMP base interface on runner |
| - v3d: use screen name in disk cache |
| - v3d,v3dv: add compatibility revision in GPU name |
| - broadcom/ci: update expected results |
| - v3dv/ci: add expected failure |
| - v3dv/ci: fix spurious line in expected |
| - v3dv/ci: add new timeouts |
| - dri: cast constant to uint for bitshift |
| - util: do not access member of a NULL structure |
| - util: use unsigned types when performing bitshift |
| - vulkan: do not access member of a NULL structure |
| - nir: fix overflow when negating maxint in constant expressions |
| - nir: use unsigned types when performing bitshifting |
| - glsl: fix downcasting addresses to wrong object types |
| - egl: do not access member of a NULL structure |
| - mesa: use unsigned types when performing bitshifting |
| - mesa: do not pass NULL pointer to function not expecting NULLs |
| - ci: disable Igalia farm |
| - broadcom/compiler: use unsigned types when performing bitshifting |
| - v3dv: do not access member of a NULL structure |
| - v3dv: do not pass NULL pointer to function not expecting NULLs |
| - v3dv: restrict to channels when encoding border color |
| - v3dv: fix misalignment in descriptor layout structure |
| - v3d: do not access member of a NULL structure |
| - v3d: do not pass NULL pointer to function not expecting NULLs |
| - vc4: use unsigned types when performing bitshifting |
| - vc4: do not access member of a NULL structure |
| - vc4: do not pass NULL pointer to function not expecting NULLs |
| - vc4: do not create 0-bytes variable length arrays |
| - vc4: fix out-of-bounds access to array |
| - Revert "ci: disable Igalia farm" |
| - v3d: use original enabled_mask on setting vertex buffers |
| - broadcom/ci: read 32-bit kernel from arm32 path |
| - broadcom/ci: remove arch from hardware name |
| - vc4/ci: run tests in 64-bits |
| - broadcom/ci: run some GL tests in arm32 arch |
| - broadcom/qpu: clean all versions not supported |
| - broadcom: follow version naming convention |
| - broadcom/ci: add more jobs to test with rpi5 |
| - broadcom/ci: update traces for rpi4 |
| - v3d/ci: update expected list |
| - v3dv: advertise VK_EXT_depth_clamp_zero_one |
| - v3d: expose ARB_depth_clamp in V3D 7.x |
| - v3dv: free temp image created when copying with blit |
| - v3dv: don't leak cache key |
| |
| Julian Orth (1): |
| |
| - egl/wayland: ignore unsupported driver configs |
| |
| Juston Li (8): |
| |
| - venus: refactor out image requirements helpers |
| - venus: extend image cache to vkGetDeviceImageMemoryRequirements |
| - sync protocol for VkRingPriorityInfoMESA |
| - venus: forward nice priority when creating ring |
| - zink: disable cpu_storage for PIPE_USAGE_STREAM |
| - venus: add missing sTypes for vk_set_physical_device_properties_struct |
| - venus: sync protocol for conditionally ignored dyn arrays |
| - anv/android: set ANV_BO_ALLOC_EXTERNAL for imported AHW |
| |
| Karmjit Mahil (6): |
| |
| - ir3: Don't set saturation on \`flat.b` |
| - zink: Add missing currentExtent special value handling |
| - turnip: Remove workaround for CTS bug zero-sized inline uniform block |
| - mailmap: Add Karmjit Mahil |
| - freedreno/isa: Fix isaspec map for a3xx-ld |
| - tu: Set \`TU_ACCESS_CCHE_READ` for transfer ops with read access |
| |
| Karol Herbst (159): |
| |
| - nir: add SYSTEM_VALUE_BASE_WORKGROUP_ID |
| - nir/divergence_analysis: handle load_base_global_invocation_id |
| - intel/compiler: lower workgroup id to index only for mesh shaders |
| - v3d: call nir_lower_compute_system_values to get rid of base intrinsics |
| - lavapipe: lower base_workgroup_id to zero |
| - mesa/st: lower base invoc and workgroup id |
| - nir: remove global_invocation_id_zero_base |
| - nir: remove workgroup_id_zero_base |
| - nir: document base_global_invocation_id and base_workgroup_id |
| - core/kernel: skip validating unique kernel signatures |
| - rusticl/program: Arc the stored KernelInfo |
| - rust/program: remove Program::kernels |
| - nouveau: fix potential double-free in nouveau_drm_screen_create |
| - nir: fix nir_shader_get_function_for_name for functions without names. |
| - rusticl: use stream uploader for cb0 if preferred |
| - rusticl/kernel: properly handle grid and offsets being usize |
| - rusticl: lower huge grids |
| - rusticl: add RUSTICL_MAX_WORK_GROUPS |
| - rusticl/event: use Weak refs for dependencies |
| - rusticl/icd: remove CLObject |
| - rusticl/spirv: enable more caps |
| - Revert "rusticl/event: use Weak refs for dependencies" |
| - event: break long dependency chains on drop |
| - rusticl/device: add DeviceCaps and move timestamp stuff into it |
| - rusticl/device/caps: move enough for has_images |
| - rusticl/device: properly handle devices with no support for images |
| - rusticl/mesa/context: flush context before destruction |
| - rusticl: merge rusticl_nir and rusticl_mesa_bindings_inline_wrapper targets |
| - rusticl: move mesa_version_string out of the inline wrapper |
| - rusticl: bump bindgen req to 0.65 |
| - rusticl: bump meson req to 1.4 |
| - rusticl: make use of new \`output_inline_wrapper` meson.rust.bindgen feature |
| - nir/lower_cl_images: set binding also for samplers |
| - nouveau: import nvif/ioctl.h file from libdrm_nouveau |
| - gallium/vl: stub vl_video_buffer_create_as_resource |
| - gallium/vl: remove stubs which are defined in mesa_util |
| - meson: centralize galliumvl_stub handling |
| - rusticl: link against libgalliumvl_stub |
| - wgl: link against libgalliumvl_stub |
| - gallium/drivers: do not link against libgalliumvl directly |
| - rusticl/event: fix deadlock when calling clGetEventProfilingInfo inside callbacks |
| - iris: fix PIPE_RESOURCE_PARAM_STRIDE for buffers |
| - rusticl/icd: make sure returned function pointers are of the right type |
| - rusticl/kernel/launch: fix mapping usize types to GPU pointer sizes |
| - rusticl/kernel/launch: remove useless upload of the input |
| - rusticl/kernel: move most of the code in launch inside the closure |
| - rusticl/kernel/launch: move allocation of resources vec |
| - rusticl/kernel/launch: rework how the printf buffer is allocated |
| - rusticl/kernel/launch: get rid of Arc clones for global resources |
| - rusticl/kernel/launch: add helper to bind global buffers |
| - broadcom/compiler: handle load_workgroup_size |
| - v3d: add support for load_workgroup_size |
| - rusticl/spirv: do not pass a NULL pointer to slice::from_raw_parts |
| - rusticl/memory: copies might overlap for host ptrs |
| - gallium: reduce pipe_resource.usage to 4 bits |
| - gallium: properly type pipe_resource.usage with the enum |
| - gallium: properly type fields of pipe_resource.usage |
| - nir_lower_mem_access_bit_sizes: support unaligned store_scratch |
| - nir: add global_atomic_2x32 variants to nir_get_io_offset_src_number |
| - broadcom/compiler: support global load/store intrinsics |
| - broadcom/compiler: use nir_lower_mem_access_bit_sizes for memory lowering |
| - broadcom/compiler: convert 2x32 global operations to scalar variants |
| - broadcom/compiler: only handle load_uniform explicitly in v3d_nir_lower_load_store_bitsize |
| - broadcom/compiler: rework scratch lowering |
| - rusticl/meson: add build root dir to the include dirs of rusticl_c |
| - rusticl: depend on the spirv_info target |
| - util/u_printf: properly handle %% |
| - rusticl/memory: assume minimum image_height of 1 |
| - rusticl/memory: fix clFillImage for buffer images |
| - rusticl: add new CL_INVALID_BUFFER_SIZE condition for clCreateBuffer |
| - rusticl: add bsymbolic to linker flags |
| - rusticl/icd: rename all entry points to the actual correct name |
| - radeonsi: set bo_size for user memory allocations |
| - rusticl/queue: gracefully stop the worker thread |
| - rusticl/queue: run rustfmt |
| - nir/lower_alu: support 8 and 16 bit bit_count |
| - nir/opt_sink: add load_kernel_input |
| - gallium: add PIPE_CAP_TEXTURE_SAMPLER_INDEPENDENT |
| - rusticl/device: require PIPE_CAP_TEXTURE_SAMPLER_INDEPENDENT for image support |
| - rusticl/mesa/context: handle clear_buffer not set by driver |
| - rusticl/mesa/screen: handle get_timestamp not set by driver |
| - rusticl/kernel/launch: fix global work offsets for 32 bit archs again |
| - broadcom/compiler: add generated v3d_nir_lower_algebraic |
| - broadcom/compiler: handle fp16 conversion ops |
| - broadcom/compiler: fix iu2f32 for 8 and 16 bit inputs |
| - broadcom/compiler: try handling 8/16 bit alu operations |
| - broadcom/compiler: handle up to vec16 load_uniforms |
| - broadcom/compiler: abort on unknown intrinsics |
| - broadcom/compiler: implement load_kernel_input |
| - broadcom/compiler: call nir_lower_64bit_phis |
| - broadcom/compiler: handle variable shared memory |
| - v3d: implement gallium APIs for OpenCL support |
| - v3d: treat SHADER_KERNEL as SHADER_COMPUTE |
| - v3d: lower CL alus |
| - v3d: lower 64 bit ALUs |
| - v3d: support variable shared memory |
| - v3d: fix MAX_GLOBAL_SIZE and MAX_MEM_ALLOC_SIZE |
| - v3d: never replace a mapped bo |
| - rusticl: enable v3d |
| - nir/schedule: add write dep also for shared_atomic |
| - meson: rename with_gallium_opencl to with_gallium_clover |
| - rusticl/program: move binary parsing into its own function |
| - rusticl/program: make binary API not crash on errors |
| - rusticl/program: use blob.h to parse binaries |
| - rusticl/program: update binary format |
| - rusticl/program: use default in more places |
| - Revert "rusticl/queue: run rustfmt" |
| - Revert "rusticl/queue: gracefully stop the worker thread" |
| - rusticl/buffer: harden bound checks against overflows |
| - rusticl/context: move SVM pointer tracking into own type |
| - rusticl/ptr: add a few APIs to TrackedPointers |
| - rusticl/memory: complete rework on how mapping is implemented |
| - rusticl: remove unused interfaces to simplify code |
| - rusticl/mesa: remove ResourceType::Cb0 |
| - rusticl/memory: optimize sw_copy when the row_pitch matches the height |
| - rusticl/mesa: make PipeResource repr(transparent) |
| - v3d: support unnormalized coords |
| - rusticl/spirv: support more caps |
| - rusticl/device: fix image_3d_write_supported for embedded |
| - rusticl/device: turn image_3d_write_supported into a cap |
| - rusticl/device: fix advertisement of 3d write images support |
| - rusticl: require PIPE_CAP_IMAGE_STORE_FORMATTED for image support. |
| - rusticl/event: make set_status handle error status properly |
| - rusticl/queue: do not overwrite event error states |
| - rusticl/queue: properly check all dependencies for an error status |
| - rusticl/event: properly implement CL_EXEC_STATUS_ERROR_FOR_EVENTS_IN_WAIT_LIST |
| - rusticl/queue: properly implement in-order queue error checking |
| - rusticl/event: return execution errors when doing a blocking enqueue |
| - rusticl/mesa: handle failures with u_upload_data |
| - rusticl/mesa: set take_ownership to true in set_constant_buffer_stream |
| - rusticl/event: fix outdated comment in call |
| - rusticl/queue: format file |
| - zink: fix OpenCL read_write images |
| - rusticl: support read_write images |
| - spirv: generate info for FunctionParameterAttribute |
| - spirv: initial parsing of function parameter decorations |
| - spirv: handle function parameters passed by value |
| - nak: allow clippy::not_unsafe_ptr_arg_deref lints |
| - clc: force linking of spirvs with mismatching pointer types in signatures |
| - rusticl: fix clippy lint having bounds defined in multiple places |
| - rusticl/program: protect against 0 length in slice::from_raw_parts |
| - rusticl/api: protect against 0 length in slice::from_raw_parts |
| - rusticl/spirv: protect against 0 length in slice::from_raw_parts |
| - nouveau: handle realloc failure inside cli_kref_set |
| - mesa: check for enabled extensions for \*UID enums |
| - nouveau/winsys: fix handling of NV_DEVICE_TYPE_IGP |
| - nouveau: use nv_device_info and fill in PCI and type information |
| - nouveau: add nv_device_uuid |
| - nouveau: implement driver_uuid and device_uuid |
| - nvk: use nv_device_uuid |
| - zink: lower 64 bit find_lsb, ufind_msb and bit_count |
| - zink: lower 8/16 bit alu ops vk spirv doesn't allow |
| - rusticl/kernel: properly respect device thread limits per dimension |
| - rusticl/memory: Fix memory unmaps after rework |
| - rusticl/image: take pitches into account when allocating memory for maps |
| - rusticl/image: properly sync mappings content for 1Dbuffer images |
| - rusticl/queue: add clSetCommandQueueProperty |
| - util/u_printf: do not double print format string with unused arguments |
| - rusticl/memory: fix sampler argument size check |
| |
| Kenneth Graunke (63): |
| |
| - isl: Set MOCS to uncached for Gfx12.0 blitter sources/destinations |
| - intel/brw: Delete gfx10 table for align1 3src type encoding |
| - intel/brw: Drop NF type support |
| - intel/brw: Rework BRW_REGISTER_TYPE's representation semantics |
| - intel/brw: Stop using long BRW_REGISTER_TYPE enum names |
| - intel/brw: Reindent after shortening BRW_REGISTER_TYPE_* to BRW_TYPE_* |
| - intel/brw: Use newer brw_type_is_* shorter names |
| - intel/brw: Replace brw_reg_type_from_bit_size by brw_type_with_size |
| - intel/brw: Replace type_sz and brw_reg_type_to_size with brw_type_size_* |
| - intel/brw: Combine a1/a16 3src type encoding functions |
| - intel/brw: Combine a1/a16 3src type decoding functions |
| - intel/brw: Rename brw_reg_type_to_hw_type to brw_type_encode |
| - intel/brw: Don't use inst return value when it isn't needed |
| - intel/brw: Make a helper for finding the largest of two types |
| - intel/brw: Add builder helpers for math functions |
| - intel/brw: Add builder helpers that allocate temporary destinations |
| - intel/brw: Use new builder helpers that allocate a VGRF destination |
| - intel/brw: Print W/UW immediates correctly |
| - intel/brw: Do not create empty basic blocks when removing instructions |
| - intel/brw: Support CSE on more ops |
| - intel/brw: Don't include unnecessary undefined values in texture results |
| - intel/brw: Add a new VEC() helper. |
| - intel/brw: Use VEC for load_const |
| - intel/brw: Use VEC for gl_FragCoord |
| - intel/brw: Use VEC for TCS/TES/GS input/output loads |
| - intel/brw: Use VEC for FS outputs |
| - intel/brw: Use VEC for output stores |
| - intel/brw: Use VEC for NIR vec*() sources |
| - intel/brw: Use VEC for emit_unzip() |
| - intel/nir: Set src_type on TCS quads workaround store_output |
| - intel/brw: Blockify convergent load_shared on Gfx11-12 as well |
| - intel/brw: Recreate GS output registers after EmitVertex |
| - intel/brw: Skip fs_nir_setup_outputs for compute shaders |
| - intel/brw: Handle scratch address swizzling of constants |
| - intel/brw: Add a idom_tree::dominates(a, b) helper. |
| - intel/brw: Make brw_reg::bits publicly accessible from fs_reg |
| - intel/brw: Update instructions_match() to compare more fields |
| - intel/brw: Drop compiler parameter from try_constant_propagate() |
| - intel/brw: Drop BRW_OPCODE_IF from try_constant_propagate |
| - intel/brw: Refactor try_constant_propagate() |
| - intel/brw: Refactor code to commute immediates into legal positions |
| - intel/brw: Delete SAD2 and SADA2 opcodes |
| - intel/brw: Make VEC() perform a single write to its destination. |
| - intel/brw: Make gl_SubgroupInvocation lane index loading SSA |
| - intel/brw: Skip LOAD_PAYLOADs after every texture instruction if possible |
| - intel/brw: Add a new def analysis pass |
| - intel/brw: Print defs in dump_instructions |
| - intel/brw: Write a new global CSE pass that works on defs |
| - intel/brw: Switch to the new defs-based global CSE pass |
| - intel/brw: Delete old local common subexpression elimination pass |
| - intel/brw: Introduce a new SSA-based copy propagation pass |
| - intel/brw: Use the defs-based copy propagation along with the old one |
| - intel/brw: Make opt_copy_propagation_defs clean up its own trash |
| - intel/brw: Build the scratch header on the fly for pre-LSC systems |
| - intel/brw: Skip discarding the interference graph |
| - intel/brw: Delay liveness calculations in saturate propagation |
| - intel/brw: Make an alu2 builder helper |
| - intel/brw: Make bld.ADD(x, 0) emit no instructions and return x directly |
| - intel/brw: Support CSE of ADD3 |
| - intel/brw: Add a lower_csel pass and allow building it for all types |
| - intel/nir: Don't needlessly split u2f16 for nir_type_uint32 |
| - intel/brw: Don't mix types for unary extended math instructions |
| - intel/brw: Disallow scalar byte to float conversions on DG2+ |
| |
| Kevin Chuang (6): |
| |
| - anv: Properly fetch partial results in vkGetQueryPoolResults |
| - anv: Properly handle cases for different query types in copy_query_results_with_shader |
| - intel/genxml: add task/mesh shader statistics registers |
| - anv: Update pipeline statistics mask for task/mesh shader invocations |
| - anv: implement mesh shader queries |
| - anv: toggle meshShaderQueries based on whether we support mesh_shader or not |
| |
| Khem Raj (1): |
| |
| - amd: Include missing llvm IR header Module.h |
| |
| Konstantin (4): |
| |
| - docs: Add documentation about debugging GPU hangs on RADV |
| - ac/debug,radv: Read UMR wave dumps into memory before parsing |
| - radv: Use a struct for the trace_bo layout |
| - radv: Trace indirect dispatch sizes |
| |
| Konstantin Seurer (59): |
| |
| - radv: Handle all dependencies of CmdWaitEvents2 |
| - nir/print: Do not access invalid indices of load_uniform |
| - radv: Fix radv_shader_arena_block list corruption |
| - radv: Remove arenas from capture_replay_arena_vas |
| - radv: Zero initialize capture replay group handles |
| - radv/ci: Add back pipeline library flakes |
| - radv/ci: Document recent flakes |
| - gitlab: Reference hang debugging documentation |
| - radv: Remove radv_cmd_dirty_dynamic_bits |
| - llvmpipe: Use a second LLVMContext for compiling sample functions |
| - radv: Add locking to radv_replay_shader_arena_block |
| - radv: Replace is_rt_shader with RADV_SHADER_TYPE_RT_PROLOG |
| - radv: Remove uses_dynamic_rt_callable_stack |
| - radv/rt: Track ray_launch_id reads |
| - radv/rt: Track ray_launch_size reads |
| - radv/rt: Remove load_rt_dynamic_callable_stack_base_amd |
| - radv: Return a block from radv_replay_shader_arena_block |
| - ac/llvm: Fix DENORM_FLUSH_TO_ZERO with exact instructions |
| - ac/llvm: Enable helper invocations for vote_all/any |
| - radv/ci: Bring back vkcts-navi21-llvm-valve |
| - khronos-update: Add ANDROID guards to vk_android_native_buffer.h |
| - zink: Always include renderdoc_app.h |
| - zink: Blit using one triangle for nearest filtering |
| - llvmpipe: Lock shader access to sample_functions |
| - llvmpipe: Stop using a sample_functions pointer as cache key |
| - llvmpipe: Only evict cache entries if a fence is available |
| - lavapipe: Always call finish_fence after lvp_execute_cmd_buffer |
| - radv: Clean up pipeline barrier handling |
| - radv: Remove dead access bits |
| - radv/meta: Use READ access for dst_access_flush |
| - radv/rra: Detect BVHs with back edges |
| - radv/rra: Move some code into handle_accel_struct_write |
| - radv/rra: Fix disabling the ray history |
| - radv/rra: Fix reporting the isec invocations |
| - radv/rra: Bump rt_driver_interface_version to 8.0 |
| - radv/rra: Reduce the memory requirement of copy_after_build |
| - radv/rra: Rework calculating the ray history size |
| - radv/rra: Enable RADV_RRA_TRACE_COPY_AFTER_BUILD by default |
| - util: Add a helper for querying sparse tile sizes |
| - lavapipe: Do not allocate 0 sized buffers for descriptor sets |
| - gallium: Add a memory range parameter to resource_bind_backing |
| - llvmpipe: Use an anonymous file for memory allocations |
| - lavapipe: Implement sparse buffers and images |
| - lavapipe: Implement shaderResourceResidency |
| - venus: Refactor hiding sparse features and properties |
| - venus: Disable sparse binding on lavapipe |
| - vulkan: Handle group stages in vk\_.*_access2_for_pipeline_stage_flags2 |
| - vulkan: Add vk_expand_(dst|src)_access_flags2 |
| - radv: Use vk_expand_(src|dst)_access_flags2 |
| - radv: Remove no-op access flag handling |
| - radv: Remove handling for expanded access flags |
| - radv: Remove write access handling from radv_dst_access_flush |
| - radv: Handle AS access bits like shader storage access bits |
| - radv: Refactor radv_(dst|src)_access_flush |
| - radv: Fix smooth lines with dynamic polygon mode and topology |
| - radv: Always use dynamic line smoothing |
| - nir: Stop using "capture : true" for nir_opt_algebraic |
| - nir: Add FLOAT_CONTROLS\_.*_PRESERVE |
| - aco: print s_delay_alu INSTSKIP>3 correctly |
| |
| Leo Liu (4): |
| |
| - ac/surface: add GFX12 256B tile mode for video |
| - ac/surface/tests: add the test for ADDR3_256B_2D |
| - radeon/vcn: use pipe video buffers for dpb |
| - radeon/vcn: enable dpb to use pipe video buffer with swizzle mode |
| |
| Lionel Landwerlin (125): |
| |
| - anv: disable dual source blending state if not used in shader |
| - anv: reuse embedded samplers across shaders |
| - anv: simplify multisampling check |
| - anv: fixup indentation |
| - anv: factor out wm_prog_data get in runtime flush |
| - intel/brw: fixup wm_prog_data_barycentric_modes() |
| - intel/fs: decouple alphaToCoverage from per sample dispatch |
| - intel/brw: add min_sample_shading value in wm_prog_data |
| - anv: track sample shading enable & min sample shading |
| - anv: add dirty tracking of fs_msaa_flags in runtime |
| - anv: move 3DSTATE_WM::BarycentricInterpolationMode programming to runtime |
| - anv: move more PS_EXTRA programming to runtime |
| - anv: move 3DSTATE_PS to partial packing |
| - anv: move 3DSTATE_MULTISAMPLE to partial emission |
| - anv: remove fs_msaa_flags from the graphics pipeline |
| - anv: enable EDS3 AlphaToCoverageEnable & RasterizationSamples |
| - anv: fixup alloc failure handling in reserved_array_pool |
| - anv: fix leak of custom border colors |
| - anv: avoid requirement to put flush_data as first field |
| - anv: move device initialization as the last step of vkCreateDevice |
| - anv: move empty_vs_input to physical device |
| - anv: VK_EXT_legacy_vertex_attributes |
| - docs: update anv features |
| - anv: fix ycbcr plane indexing with indirect descriptors |
| - intel/hang_replay: use newer API of i915 execbuffer |
| - intel/hang_replay: use hw image param |
| - intel/tools: add README file |
| - brw: add more condition for reducing sampler simdness |
| - intel: move debug identifier out of libintel_dev |
| - brw: drop dependency on libintel_common |
| - anv: fix push constant subgroup_id location |
| - nir/divergence: add missing load_printf_buffer_address |
| - nir: add a base offset for printf indexing |
| - nir: add ptr_bit_size parameter to nir_lower_printf |
| - nir: add a low level printf emission helper |
| - intel/nir: remove unused prototypes |
| - intel/nir: add reloc delta to load_reloc_const_intel intrinsic |
| - intel/compiler: store u_printf_info in prog_data |
| - intel/nir: add printf lowering |
| - anv: add debug shader printf support |
| - intel/clc: enable printfs support |
| - anv: shader printf example |
| - anv: switch to vk_device::mem_cache field for default cache |
| - anv: use weak_ref mode for global pipeline caches |
| - anv: fix shader identifier handling |
| - intel/brw: ensure find_live_channel doesn't access arch register without sync |
| - anv: fix utrace compute walker timestamp captures |
| - anv: fix timestamp copies from secondary buffers |
| - anv: move last compute command pointers to the state structure |
| - u_trace: extend tracepoint end_of_pipe bit into flags |
| - anv: optimize POSTSYNC_DATA rewrites in timestamp emissions |
| - intel: fix HW generated local-id with indirect compute walker |
| - brw: use a single virtual opcode to read ARF registers |
| - brw: limit dependencies on SR register |
| - brw: better model READ_ARF_REG opcode |
| - anv: fix Gfx9 fast clears on srgb formats |
| - anv: rewrite Wa_18019816803 tracking to be more like state |
| - anv: factor out some more gpu_memcpy setup |
| - anv: fix pipeline flag fields |
| - anv: expose VK_MESA_image_alignment_control |
| - anv: support setting CFE_STATE::StackIDControl per application |
| - anv: limit aux invalidations to primary command buffers |
| - anv: ensure completion of surface state copies before secondaries |
| - anv: simplify TRTT initialization |
| - anv: reuse setup_execbuf_fence_params for utrace submissions |
| - anv: rework utrace submission |
| - anv: move trtt submissions over to the anv_async_submit |
| - anv: use reserved array pool for legacy custom border colors |
| - anv: make device initialization more asynchronous |
| - mi-builder: rename relocated api |
| - mi-builder: c++ warning fix |
| - mi-builder: make instruction pointer manipulation more obvious |
| - mi-builder: add missing write completion check |
| - mi-builder: add relocated register/memory writes |
| - mi-builder: add a write check parameter |
| - anv: centralize mi_builder setup |
| - anv: use the new relocated write mi-builder api |
| - anv: move more MI_SDI to mi_builder |
| - anv: use default mocs for memory bits only touched by CS |
| - anv: set query mi-builder mocs only once |
| - anv: use new mi-builder write check API to avoid stalls |
| - genxml: add MI_MEM_FENCE for Gfx20 |
| - mi-builder: add read/write memory fencing support on Gfx20+ |
| - intel/fs: fix lower_simd_width for MOV_INDIRECT |
| - anv: add custom mi write fences |
| - anv: emit conditional after gfx state flushing |
| - anv: factor out STATE_BASE_ADDRESS filling to helper function |
| - anv: predicate emission of STATE_BASE_ADDRESS |
| - anv: reuse device local variable |
| - anv: avoid initializing TRTT stuff without sparseBinding |
| - anv: fix vkCmdWaitEvents2 handling |
| - anv: don't apply descriptor array bound checking |
| - brw: add missing break |
| - brw: factor out source extraction for rematerialization |
| - brw: improve rematerialization of surface/sampler handles |
| - brw: bound the amount of rematerialized NIR instructions |
| - brw: remove rematerialization assert |
| - brw: remove some brackets |
| - brw: enable rematerialization of non 32bit uniforms |
| - brw: always use new registers for load address increments |
| - brw: annotate send instructions with surface handles generated with exec_all |
| - brw: avoid Wa_1407528679 in uniform cases |
| - brw: blockify load_global_const_block_intel |
| - brw: enable A64 loads source rematerialization |
| - anv: limit vertex fetch invalidation on indirect read |
| - anv: add a protected scratch pool |
| - anv: prepare 2 variants of all shader instructions |
| - anv: allocate compute scratch using the right scratch pool |
| - anv: emit the right shader instruction for protected mode |
| - anv: workaround flaky xfb query results on Gfx11 |
| - anv: fix u_trace on < Gfx12.0 |
| - intel/ds: remove duplicate arguments |
| - hasvk: move cmd_emit_timestamp initialization to genX |
| - hasvk: pass anv_address to predicate helper |
| - brw: fix uniform rebuild of sources |
| - anv: get rid of the second dynamic state heap |
| - isl: account for protection in base usage checks |
| - anv: properly flag image/imageviews for ISL protection |
| - anv: propagate protected information for blorp operations |
| - anv: fix check on pipeline mode to track buffer writes |
| - vulkan/runtime: allow null/empty debug names |
| - anv: reuse object string for RMV token |
| - anv: add missing MEDIA_STATE_FLUSH for internal shaders |
| - anv/blorp: force CC_VIEWPORT reallocation when programming 3DSTATE_VIEWPORT_STATE_POINTERS_CC |
| - brw/rt: fix ray_object_(direction|origin) for closest-hit shaders |
| |
| Louis-Francis Ratté-Boulianne (20): |
| |
| - dri_interface: add interface for EGL_EXT_surface_compression |
| - gallium: add interface for fixed-rate surface/texture compression |
| - egl/wayland: factor out common part of DRI image creation |
| - egl: wire up EGL_EXT_surface_compression extension |
| - st/dri2: add support for fixed-rate compression interface |
| - egl/dri2: add support for EGL_EXT_surface_compression |
| - mapi: add EXT_texture_storage_compression extension |
| - mesa/st: add compression parameter to st_texture_create |
| - mesa: implement EXT_texture_storage_compression extension |
| - mesa: implement EXT_EGL_image_storage_compression extension |
| - panfrost: Add AFRC overlay in v10 xml specification |
| - panfrost: add device querying for AFRC support |
| - panfrost: add utils for AFRC fixed-rate support |
| - panfrost: encode component order as an inverted swizzle (v10) |
| - panfrost: add support for AFRC textures |
| - panfrost: add support for AFRC render targets |
| - panfrost: add support for AFRC modifiers |
| - panfrost: add translation between modifier and compression rates |
| - panfrost: add support for fixed-rate compression |
| - panfrost: add PAN_AFRC_RATE env var to force a compression rate |
| |
| Luc Ma (4): |
| |
| - loader: silence implicit-load zink error by the loader |
| - gallium: properly propagate the usage of resource |
| - gallium: inline trivial needs_pack() |
| - meson: Build pipe-loader when build-tests is true |
| |
| Lucas Fryzek (7): |
| |
| - llvmpipe: query winsys support for dmabuf mapping |
| - u_gralloc/fallback: Set fd from handle directly |
| - egl/x11/sw: Implement swapbuffers with damage |
| - vulkan/wsi: Update sw x11 wsi to only copy damage regions |
| - egl/x11/sw: Implement shm support |
| - egl/x11: Remove force software check for exporting SBWD |
| - lp: only map dt buffer on import from dmabuf |
| |
| Lucas Stach (2): |
| |
| - etnaviv: drm: don't skip flush when there are active PMRs |
| - etnaviv: always flush pending queries on get_query_result |
| |
| M Henning (2): |
| |
| - nir: Handle texop_*_nv in nir_tex_instr_is_query |
| - nak: Add minimum bindgen requirement |
| |
| Maaz Mombasawala (2): |
| |
| - svga: Retry DRM_VMW_SYNCCPU ioctl on failure. |
| - svga: Replace shared surface flag and simplify surface creation |
| |
| Marcin Ślusarz (2): |
| |
| - intel/genxml/xe2: update MESH_CONTROL |
| - anv,intel/compiler/xe2: fill MESH_CONTROL.VPandRTAIndexAutostripEnable |
| |
| Marek Olšák (174): |
| |
| - ac/gpu_info: set tcc_rb_non_coherent only if number of TCCs != number of RBs |
| - ac/surface: disable DCC for 3D textures on gfx9 to improve performance |
| - ac/surface: enable thick tiling for 3D textures for better perf on gfx6-8 |
| - radeonsi: don't invalidate L2 for internal compute without DCC stores |
| - radeonsi: fix KHR-GL46.texture_lod_bias.texture_lod_bias_all on gfx10-11 |
| - radeonsi: validate IO semantics in scan_io_usage |
| - radeonsi: add workarounds for DCC MSAA for gfx9-10 |
| - radeonsi: enable DCC for MSAA on gfx10-10.3 |
| - radeonsi: check for FMASK correctly in gfx10_get_bin_sizes |
| - amd/ci: 17 piglit failures are fixed for raven |
| - nir: add ACCESS_CP_GE_COHERENT_AMD |
| - nir: add nir_atomic_op_ordered_add_gfx12_amd |
| - nir: add streamout intrinsics for AMD GFX12 |
| - nir: add sleep intrinsics for AMD |
| - nir: validate src_type of store_output intrinsics, require bit_size >= 16 |
| - nir: add more build helpers |
| - nir: add shader_info::use_aco_amd |
| - nir/lower_tex: support FMASK loads with a 16-bit sample index |
| - nir/lower_image: support FMASK loads with a 16-bit sample index |
| - drm-uapi: update amdgpu_drm.h and drm_fourcc.h for gfx12 |
| - amd: import gfx12 addrlib |
| - amd: add gfx12 register definitions |
| - amd: add gfx12 register definitions into the register header generator |
| - amd: add initial common code for gfx12 |
| - ac/nir: update ac_nir_lower_resinfo for gfx12 |
| - ac/nir,llvm: add GS VGPR changes for gfx12 |
| - ac/llvm: use new s_wait instructions and split the existing ones for gfx12 |
| - ac/llvm: add new cache flags for gfx12 |
| - ac/llvm: add CS SGPR changes for gfx12 |
| - ac/llvm: update inline assembly for buffer_load_format_xyzw with TFE for gfx12 |
| - ac/nir: add ac_nir_sleep and handle the intrinsics |
| - ac/nir: add gfx12 streamout NIR code |
| - ac/llvm: handle nir_atomic_op_ordered_add_gfx12_amd |
| - ac/llvm: implement nir_intrinsic_ordered_xfb_counter_add_gfx12_amd |
| - ac/llvm: add a workaround for nir_intrinsic_load_constant for LLVM on gfx12 |
| - ac/surface: add gfx12 |
| - ac/surface/tests: add gfx12 tests |
| - radeonsi: add gfx12 |
| - util: shift the mask in BITSET_TEST_RANGE_INSIDE_WORD to be relative to b |
| - ac/llvm: improve/simplify/fix load_ssbo |
| - radeonsi: serialize shader disassembly string to fix asm dumps for ACO |
| - radeonsi: fix the size of the query result SSBO |
| - radeonsi: validate the buffer range in si_set_shader_buffer |
| - radeonsi: remove GDS tests |
| - radeonsi: set flags directly instead of having needs_db_flush |
| - radeonsi/gfx11: use a lighter workaround for Navi31 dEQP failures |
| - radeonsi: get NIR options from si_screen instead of calling get_compiler_options |
| - radeonsi: minor simplifications of clear/copy_buffer shaders |
| - radeonsi: simplify the complex clear/copy_buffer shader |
| - radeonsi: use set_work_size for all internal compute dispatches |
| - radeonsi: replace the clear_12bytes_buffer shader with the DMA compute shader |
| - radeonsi: remove slow code from si_msaa_resolve_blit_via_CB |
| - radeonsi/ci: fix caselists for vk-gl-cts/main |
| - radeonsi/ci: update failures for all generations |
| - radeonsi/ci: remove some gfx11 flakes |
| - radeonsi: constify struct pipe_vertex_buffer * |
| - nir/algebraic: eliminate pack+unpack and unpack+pack pairs |
| - ac: move radv_mem_vectorize_callback to common code |
| - ac/llvm: global stores should have no holes in the writemask |
| - radeonsi: call nir_lower_int64 later to fix ACO failure with Tomb Raider |
| - radeonsi: vectorize load/stores and shrink stores |
| - amd: update addrlib |
| - amd: add more gfx11 APUs |
| - amd: enable 32B minimum DCC block size for gfx1151 |
| - ac/llvm: fix incorrect parameter type in llvm.amdgcn.s.nop |
| - radeonsi: vectorize loads/store after ABI lowering and optimizations |
| - radeonsi/gfx12: fix the alpha ref value |
| - radeonsi/gfx12: fix incorrect condition for when to do clear_buffer via compute |
| - radeonsi/gfx12: disable CU1 instead of CU0 for GS due to SQTT |
| - radeonsi/gfx12: fix a regression in si_set_mutable_tex_desc_fields |
| - radeonsi/gfx12: fix depth bounds register values |
| - radeonsi/gfx12: fix a regression in si_init_depth_surface |
| - radeonsi: don't lower UBO/SSBOs to descriptors if they are already lowered |
| - radeonsi: lower NIR resource srcs to descriptors last |
| - ac/descriptors: fix gfx12 regressions |
| - ac/nir/lower_ngg: use global_atomic_amd to fix gfx12 streamout |
| - ac/nir/lower_ngg: use voffset in global_atomic_add for xfb |
| - ac: add gfx12 DCC shared code |
| - radeonsi/gfx12: fix GPU deadlocks due to query result incoherency |
| - radeonsi: assume si_set_ring_buffer is only used by gfx6-10.3 |
| - radeonsi: remove cp_to_L2 and L2_to_cp, inline the values |
| - radeonsi: remove RADEON_FLAG_READ_ONLY |
| - radeonsi: allow RADEON_HEAP_BIT_GL2_BYPASS for VRAM |
| - radeonsi: remove leftover comment of non-existent RADEON_FLAG_MALL_NOALLOC |
| - radeonsi/gfx12: add DCC |
| - ac/surface: pass the correct addrlib handle to Addr3GetPossibleSwizzleModes |
| - amd: update addrlib |
| - ac/nir/lower_ngg: don't use gfx12 xfb defs outside their basic block on gfx11 |
| - radeonsi/gfx12: fix stencil corruption |
| - gallium/u_blitter: add option to override fragment shader for util_blitter_blit |
| - radeonsi: don't declare 3D coordinates in the compute blit if they aren't needed |
| - radeonsi: use better workgroup sizes for compute blits to improve perf |
| - radeonsi: ignore PIPE_SWIZZLE_1 for 40% VGPR usage reduction for compute blits |
| - radeonsi: remove fp16_rtz from the compute blit |
| - radeonsi: use MIMG D16 (16-bit data) for image instructions in compute blits |
| - radeonsi: optimize unaligned compute blits |
| - radeonsi: fix sample0_only for the compute blit |
| - radeonsi: reject unsupported parameters as the first thing in the compute blit |
| - radeonsi: don't use si_can_use_compute_blit in the compute blit |
| - radeonsi: don't fail due to DCC when using the compute blit on compute queues |
| - radeonsi/gfx11: enable MSAA image stores in the compute blit |
| - radeonsi: document better how X/Y flipping in the compute blit works |
| - radeonsi: cosmetic and robustness changes for the compute blit |
| - radeonsi: extend the compute blit to do image clears as well |
| - radeonsi: switch compute image clears to the compute blit shader |
| - radeonsi: rename si_compute_blit "testing" parameter to "fail_if_slow" |
| - radeonsi: rename si_compute_copy_image -> si_compute_copy_image_old |
| - radeonsi: add a new version of si_compute_copy_image using the compute blit |
| - radeonsi: switch the old compute image copy to the new one using the blit |
| - radeonsi: remove the old si_compute_copy_image |
| - radeonsi: convert the compute blit shader hash table to u64 keys |
| - radeonsi: split xy_clamp_to_edge to separate X and Y flags for the compute blit |
| - radeonsi: restructure (rewrite) the compute blit shader |
| - radeonsi: adds flags parameter into si_compute_blit to replace fail_if_slow |
| - radeonsi: change the compute blit to clear/blit multiple pixels per lane |
| - radeonsi: extend NIR compute helpers to allow returning 16-bit results |
| - radeonsi: use MIMG A16 (16-bit image coordinates) in compute blits |
| - radeonsi: print the compute shader blit key for AMD_DEBUG |
| - radeonsi: use shader_info::use_aco_amd to determine whether to use ACO |
| - radeonsi: add use_aco into CS blit shader key |
| - radeonsi: clear color buffers via compute for special tiling cases |
| - radeonsi: add a custom MSAA resolving pixel shader |
| - radeonsi: add fail_if_slow parameter into si_msaa_resolve_blit_via_CB |
| - radeonsi: add a new blit microbenchmark |
| - radeonsi: add decision code to select when to use CB_RESOLVE for performance |
| - radeonsi: add decision code to select when to use compute blit for performance |
| - ac/nir: import the MSAA resolving pixel shader from radeonsi |
| - ac/nir: import the universal compute clear/blit shader |
| - ac/nir: import the dispatch logic for the universal compute clear/blit shader |
| - Revert "radeonsi: fix initialization of occlusion query buffers for disabled RBs" |
| - radeonsi/ci: update gfx10.3 failures |
| - nir/lower_io_to_scalar: add new_component temporary variable |
| - nir/lower_io_to_scalar: don't create output stores that have no effect |
| - nir: add nir_opt_vectorize_io, vectorizing lowered IO |
| - glsl/linker: vectorize lowered IO |
| - nir: add a NIR option flag nir_io_prefer_scalar_fs_inputs |
| - ac/nir/cdna: allow 16-bit coordinates |
| - ac/nir/cdna: ignore image_descriptor intrinsics |
| - ac/nir/cdna: don't use image_descriptor intrinsics if the src is a descriptor |
| - mesa: switch remaining shader functions from SHA1 to BLAKE3 |
| - radeonsi: replace shader SHA1 hashes with BLAKE3 |
| - radeonsi: don't use CP DMA on GFX940 |
| - nir: rename ordered_xfb_counter_add_gfx12_amd -> ordered_add_loop_gfx12_amd |
| - ac/nir: remove sleeps from gfx12 streamout code |
| - ac/llvm: remove s_nop from ordered_add_loop_gfx12_amd |
| - ac/llvm: fix inline assembly register constraints for ordered_add_loop_gfx12_amd |
| - ac/llvm: add s_nops before the ordered add loop and s_wait_alu workaround |
| - radeonsi: implement nir_intrinsic_load_ssbo_address |
| - radeonsi: expose internal buffer bindings to compute shaders |
| - radeonsi/gfx12: always set BO metadata, not just during export |
| - radeonsi/gfx12: fix compute register settings for global_atomic_ordered_add |
| - ac/surface: finish display DCC for gfx11.5 |
| - ac/surface: finish display DCC for gfx12 |
| - radeonsi: add fail_if_slow parameter into compute_clear/copy_buffer |
| - radeonsi: use a hash_table and define a shader key for the DMA compute shader |
| - radeonsi: add dwords_per_thread parameter into si_compute_clear_copy_buffer |
| - radeonsi: clear buffers with a 12B clear value by clearing 4 dwords per thread |
| - radeonsi: rewrite the clear/copy_buffer microbenchmark |
| - radeonsi/ci: update gfx11 failures |
| - radeonsi: replace si_shader::scratch_bo with scratch_va, don't set it on gfx11+ |
| - radeonsi: don't update compute scratch if the compute shader doesn't use it |
| - ac: add radeon_info::has_scratch_base_registers |
| - radeonsi: lock a mutex when updating scratch_va for compute shaders |
| - util: make util_idalloc_exists private |
| - util: don't use variable names that can appear in args of idalloc foreach macros |
| - util: add util_idalloc_sparse, solving the excessive virtual memory usage |
| - mesa: switch ID allocation to util_idalloc_sparse to reduce virtual memory usage |
| - nir/opt_algebraic: use fmulz for fpow lowering to fix incorrect rendering |
| - radeonsi/gfx12: fix a GPU hang due to an invalid packet with window rectangles |
| - radeonsi: ensure TC_L2_dirty is set if we don't sync after internal SSBO blits |
| - radeonsi: fix buffer coherency issues on gfx6-8,12 due to missing PFP->ME sync |
| - radeonsi/gfx12: fix register programming to fix GPU hangs |
| - radeonsi/gfx12: fix VS output corruption with streamout |
| - ac/surface/gfx12: turn off HiZ for pre-production samples |
| |
| Mark Burton (1): |
| |
| - gallivm: Fix compilation errors when using LLVM 13. |
| |
| Mark Collins (21): |
| |
| - vdrm: Add fixed VA parameter for mapping memory |
| - tu: Handle VkDeviceMemory BO unmapping in VkUnmapMemory |
| - tu: Implement VK_EXT_map_memory_placed |
| - docs/features: Add VK_EXT_map_memory_placed |
| - tu/shader: Allow LRZ when write pos with explicit early frag test |
| - tu/lrz: Emit GRAS_LRZ_CNTL2 on A7XX |
| - tu/lrz: Use actual CHIP rather than hardcoding A6XX |
| - fd/a7xx: Initialize magic register 8C34 to 0 |
| - fd/a7xx: Initialize magic register 8008 to 0 |
| - tu: Allow LRZ on A7XX |
| - tu/lrz: Add structure for LRZ FC layout |
| - tu: Update LRZ FC allocation for A7XX layout |
| - tu: Update LRZ FC dirty clear for A7XX |
| - tu: Specify LRZ FC depth clear value on A7XX |
| - tu: Enable LRZ fast-clear for A7XX |
| - fd/a7xx: Document \`LRZ_FLIP_BUFFER\` event |
| - docs/freedreno: Add documentation on A7XX LRZ |
| - tu: Emit GRAS_LRZ_DEPTH_BUFFER_INFO correctly |
| - tu/kgsl: Spin until KGSL reports queue timestamp during profiling |
| - tu/kgsl: Fix profiling buffer GPU IOVA |
| - fd/meson: Only build 'ds' when system has DRM |
| |
| Martin Krastev (2): |
| |
| - svga: convert license block to SPDX |
| - svga: update timespan in copyright message |
| |
| Martin Roukala (né Peres) (9): |
| |
| - ci/b2c: Reduce the length of the kernel cmdline |
| - nvk+zink/ci: rename the ga106 jobs to be more in line with RADV |
| - nvk+zink/ci/ga106: make the expectations codename-specific |
| - nvk+zink/ci: document more flakes in the ga106 |
| - turnip/ci: document a missing flake from the a750_vk job |
| - turnip/ci: bump the a750_vk timeout |
| - turnip+zink/ci: add more flakes to the expectations |
| - radv+zink/ci: document recent flakes |
| - radv/ci: add a bunch of flakes |
| |
| Mary Guillemard (86): |
| |
| - nak: Pass has_mod to all form of src2 requiring it |
| - panvk: Ensure we lower load_base_workgroup_id to 0 |
| - panfrost: Skip new failure from VKCTS 1.3.8.x |
| - nvk, nak: Wire up conservative rasterization underestimate |
| - docs/features: Add EXT_conservative_rasterization for NVK |
| - agx: speed-up dce |
| - panvk: Only clear UBOs descriptors when set isn't present |
| - nouveau: nvidia_header: Add AMPERE_B class generation |
| - nak: Set SPH version to 4 on SM75+ |
| - nak: Migrate sph.rs to use SPH headers definition |
| - bi: Reformat code |
| - midgard: Reformat code |
| - bi: Alloc replacement array once in opt_cse |
| - pan/lib, panvk: Ensure data_size is on 64 bits |
| - panvk: Fix shader destruction when vk_shader_module_to_nir fails |
| - panvk: Remove panvk_lower_blend |
| - panvk: Remove dynarray from panvk_shader |
| - panvk: Keep panvk_shader alive in panvk_pipeline_shader |
| - panvk: Upload shader in panvk_shader |
| - panvk: Upload copy tables in panvk_shader |
| - panvk: Upload render state in panvk_shader |
| - panvk: Move the linking bits to panvk_shader |
| - panvk: Kill panvk_pipeline_shader and use panvk_shader directly |
| - panvk: Link shaders at draw time |
| - panvk: Move compile logic out of shader_create |
| - panvk: Move NIR lower logic out of shader_create |
| - panvk: Move preprocess logic out of shader_create |
| - panvk: Implement vk_shader |
| - panvk: Remove panvk_pipeline |
| - pan/va: Ensure no clash with other defs in disassembler |
| - bi: Make disassembler take a const void* |
| - midgard: Make disassembler take a const void* |
| - bi: Move bi_disasm definitions to their own header |
| - panfrost: Add pan_shader_disassemble |
| - panvk: Implement executable IR reporting |
| - panvk: Advertise VK_KHR_pipeline_executable_properties |
| - panvk: Generate proper device and driver UUIDs |
| - panvk: Advertise VK_EXT_pipeline_creation_cache_control and VK_EXT_pipeline_creation_feedback |
| - panvk: Advertise VK_EXT_shader_module_identifier |
| - panvk: Advertise VK_KHR_pipeline_library and VK_EXT_graphics_pipeline_library |
| - panvk: Enable pipeline library in CI for Mali-G52 |
| - docs: Update features.txt to add panvk for BDA extensions |
| - panvk: Advertise VK_KHR_device_group and VK_KHR_device_group_creation |
| - panvk: Reorder extensions by name |
| - panvk: Advertise VK_KHR_maintenance3 |
| - panvk: Add missing null check in DestroyCommandPool |
| - panvk: Add missing clean up in blend_shader_cache_init |
| - panvk: Make mempool detect NULL BOs |
| - panvk: Check for maxBufferSize in panvk_CreateBuffer |
| - panvk: Make panvk_kmod_zalloc use correct allocation scope on non-transient |
| - panvk: Ensure to unref transient bo in reset for mempools |
| - panvk: Fix device mempool leaks |
| - panvk: Add more allocation checks in create_device |
| - panvk: Implement CmdDispatchBase |
| - panvk: Enable device_init, null_handle and object_management in CI for Mali-G52 |
| - panvk: Advertise shaderModuleIdentifier feature |
| - panvk: Report correct min value for discreteQueuePriorities |
| - panvk: Enable dEQP-VK.info tests in CI for Mali-G52 |
| - panvk: Clamp viewport scissor to valid range |
| - panvk: Enable offscreen_viewport tests in CI for Mali-G52 |
| - panvk: Skip dispatch on empty workgroup |
| - panvk: Report proper workgroup invocation and size |
| - panvk: Enable compute pipeline in CI for Mali-G52 |
| - panvk: Advertise VK_EXT_private_data |
| - panvk: Do not emit blend shader when color_mask is 0 |
| - panvk: Run nir_lower_io_to_vector for fragment shader |
| - panvk: Enable glsl.440.linkage in CI for Mali-G52 |
| - panvk: Implement and advertise anisotropy support |
| - panvk: Advertise VK_KHR_sampler_mirror_clamp_to_edge |
| - panvk: Enable texture filtering in CI for Mali-G52 |
| - pan/kmod: Avoid deadlock on VA allocation failure on panthor |
| - panfrost: Handle context_init errors correctly |
| - panfrost: Handle gracefully resource BO alloc failures |
| - ci/panfrost: Update t760 fails |
| - rusticl: Add panthor when panfrost is present in RUSTICL_ENABLE |
| - bi: Clean up mem_access_size_align_cb |
| - bi: Enable lower_pack_64_4x16 |
| - bi: Lower pack_32_4x8_split and pack_32_2x16_split in algebraic |
| - bi: Enable lower_pack pass in compiler |
| - bi: Implement basic 8-bit vec support |
| - panfrost: Rewrite set_global_binding to make resources truly global |
| - panfrost: Do not recreate bo if already mapped |
| - panfrost: Increase address space to 48-bit |
| - panfrost: Fetch available system memory |
| - panvk: Fix image support in vertex jobs |
| - panvk: Pass attrib_buf_idx_offset to desc_copy_info |
| |
| MastaG (1): |
| |
| - gallivm: Call StringMapIterator from llvm:: scope |
| |
| Matt Coster (1): |
| |
| - docs: List VK_EXT_debug_utils |
| |
| Matt Turner (8): |
| |
| - intel: Build float64 shader only for Vulkan |
| - intel/clc: Free parsed_spirv_data |
| - intel/clc: Free disk_cache |
| - intel/brw: Use REG_CLASS_COUNT |
| - intel/elk: Use REG_CLASS_COUNT |
| - docs: Drop references to LIBGL_DRIVERS_PATH |
| - util: Add ATTRIBUTE_OPTIMIZE(flags) |
| - util: Force emission of stack frame in stack unit test |
| |
| Mauro Rossi (1): |
| |
| - intel/common: fix building error in intel_common.c |
| |
| Maíra Canal (7): |
| |
| - v3dv: Use errno when logging an error to stderr |
| - drm-uapi: Update v3d_drm.h |
| - broadcom/common: Add maximum number of perf counters to v3d_device_info |
| - v3dv: Use DRM_IOCTL_V3D_GET_COUNTER to get perfcnt information |
| - v3d: Use DRM_IOCTL_V3D_GET_COUNTER to get perfcnt information |
| - broadcom/simulator: Add DRM_V3D_PARAM_MAX_PERF_COUNTERS parameter support |
| - broadcom/simulator: Add DRM_IOCTL_V3D_GET_COUNTER to simulator |
| |
| Michel Dänzer (4): |
| |
| - wsi/wayland: Dispatch event queue in wsi_wl_swapchain_queue_present |
| - wsi: Call drmSyncobjQuery only once for all images |
| - egl/dri: Use packed pipe_format |
| - dri: Go back to hard-coded list of RGBA formats |
| |
| Mike Blumenkrantz (162): |
| |
| - glthread: check for invalid primitive modes in DrawElementsBaseVertex |
| - zink: reconstruct features pnext after determining extension support |
| - zink: prune zink_shader::programs under lock |
| - zink: fully wait on all program fences during ctx destroy |
| - kopper: fix bufferage/swapinterval handling for non-window swapchains |
| - zink: slightly better swapinterval failure handling |
| - kopper: don't set drawable buffer age |
| - zink: handle swapchain currentExtent special value |
| - zink: clean up accidental debug print |
| - dri: rename 'implicit' param from earlier series |
| - tu: support VK_EXT_legacy_vertex_attributes |
| - llvmpipe: add KHR-Single-GL45.arrays_of_arrays_gl.AtomicUsage skip |
| - ci: disable lavapipe-vk-asan job |
| - lavapipe: VK_EXT_legacy_vertex_attributes |
| - zink: clamp buffer_indices_hashlist resets to used region |
| - zink: delete GS conditional in update_so_info |
| - zink: use zink_shader_key_optimal unions for pipeline state asserts |
| - zink: use info.fs.uses_sample_qualifier instead of manual scan |
| - zink: simplify confusing return in rewrite_tex_dest |
| - zink: simplify flagging legacy shadow samplers |
| - zink: rename zink_shader variable in create functions |
| - zink: break out shadow sampler scanning |
| - zink: always block the precompile threads when pruning shaders |
| - zink: more effectively synchronize separate shader program precompiles |
| - zink: use zink_shader type directly in zink_create_gfx_shader_state() |
| - zink: split shader create into 2-stage functions |
| - zink: reorder precompile_separate_shader_job() in file |
| - zink: split generated tcs creation into 2-stage functions |
| - zink: move gfx shader init to thread |
| - zink: reorder some code in zink_create_gfx_program() |
| - zink: reorder fencing in zink_create_gfx_program() |
| - zink: split gfx program creation into 2-stage functions |
| - zink: precompile_job() -> gfx_program_precompile_job() |
| - zink: move blocking gfx program init functions to thread |
| - ci: disable g52 |
| - egl/x11: disable dri3 with LIBGL_KOPPER_DRI2=1 as expected |
| - zink: add a batch ref for committed sparse resources |
| - u_blitter: stop leaking saved blitter states on no-op blits |
| - freedreno/replay: use inttypes format string for 64bit |
| - zink: delete unused zink_batch struct member |
| - zink: move in_rp to zink_context |
| - zink: move ref_lock from zink_batch to zink_batch_state |
| - zink: move has_work from zink_batch to zink_batch_state |
| - zink: rename last_was_compute -> last_work_was_compute |
| - zink: move last_work_was_compute from zink_batch to zink_context |
| - zink: move work_count from zink_batch to zink_context |
| - zink: move swapchain from zink_batch to zink_context |
| - zink: rename zink_batch::state -> zink_batch::bs |
| - zink: delete all zink_batch uses from zink_query.c |
| - zink: remove zink_batch usage from zink_clear.c |
| - zink: remove all uses of zink_batch from zink_batch.c |
| - zink: remove all zink_batch usage from zink_resource.h |
| - zink: remove all zink_batch usage from zink_draw.cpp |
| - zink: remove all zink_batch usage from zink_render_pass.c |
| - zink: remove all zink_batch usage from zink_context.c |
| - zink: delete zink_batch |
| - zink: zink_batch_state::has_barriers -> has_reordered_work |
| - zink: reset all the has_work flags in the same place |
| - zink: check all has_work flags for flushes |
| - zink: rely on zink_get_cmdbuf() to set has_work flags |
| - zink: flag has_work in a few more places |
| - zink: stop flagging has_work on batch tracking |
| - zink: don't submit main cmdbuf if has_work is not set |
| - frontends/dri: only release pipe when screen init fails |
| - frontends/dri: always init opencl_func_mutex in InitScreen hooks |
| - zink: use u_minify for sparse calcs |
| - zink: always commit full miptails |
| - zink: refcount miptails |
| - zink: clean up semaphore arrays on batch state destroy |
| - zink: add a batch array for tracked semaphores |
| - zink: stop leaking sparse semaphores |
| - zink: rework sparse semaphore waits |
| - ci: bump VVL to snapshot-2024wk19 |
| - zink: hook up VK_EXT_legacy_vertex_attributes |
| - zink: set all spirv caps for the vvl vtn pass |
| - ci: bump VVL to v1.3.285 |
| - zink: make unassigned io variables unreachable |
| - zink: minor tweaks to shader io assignment |
| - zink: outdent assign_producer_var_io() |
| - zink: outdent assign_consumer_var_io() |
| - zink: pass a struct through io assignment functions |
| - zink: track masks of io locations used during linking |
| - zink: unify io assignment |
| - zink: move 'reserved' into io assign struct |
| - zink: split slot map between regular varyings and patch |
| - zink: ci updates |
| - egl/dri2: fix error returns on dri2_initialize_x11_dri3 fail |
| - nir/lower_aaline: fix for scalarized outputs |
| - nir/linking: fix nir_assign_io_var_locations for scalarized dual blend |
| - lavapipe: split out DGC into separate file |
| - lavapipe: plumb print_cmds through NV DGC |
| - lavapipe: lvp_indirect_command_layout -> lvp_indirect_command_layout_nv |
| - zink: remove dgc debug mode |
| - zink: add atomic image ops to the ms deleting pass |
| - build/amd: add amd-use-llvm build option |
| - ir3: flag progress from nir_lower_io_to_scalar |
| - ir3: assert that no further optimizations can be done if !progress |
| - gallium: add drawid_offset to draw_mesh_tasks interface |
| - gallium: stop dropping drawid_offset param with util_draw_indirect |
| - vulkan: Update XML and headers to 1.3.287 |
| - zink: add HKP to tiler mode switch |
| - lavapipe: fix mesh+task binding with shader objects |
| - mesa/st: fix zombie shader handling for non-current programs |
| - zink: null check pipe loader config before use |
| - zink: split out msaa replication |
| - zink: implement msaa replication with dynamic rendering |
| - radeonsi: enable compute pbo blits |
| - ci: kill filament trace globally |
| - zink: add a driver workaround to disable 2D_VIEW_COMPATIBLE+sparse |
| - zink: free sparse page for miptail on uncommit |
| - zink: remove adreno from broken_cache_semantics driver workaround |
| - egl: deduplicate MESA_image_dma_buf_export enablement |
| - egl: only enable MESA_image_dma_buf_export with PIPE_CAP_DMABUF |
| - lavapipe: maint7 |
| - st/pbo: fix MESA_COMPUTE_PBO=spec crash on shutdown |
| - st/pbo_compute: special case stencil extraction from Z24S8 |
| - mesa/st: use compute pbo download for readpixels |
| - ci: bump vvl to v1.3.289 |
| - zink: add an a750 skip |
| - zink: enable compute pbos for turnip |
| - aux/tc: update docs to indicate replaced buffers have multiple pipe_resources |
| - zink: don't lower fpow |
| - zink: propagate valid buffer range to real buffer when mapping staging |
| - zink: track the "real" buffer range from replacement buffers |
| - zink: modify some buffer mapping behavior for buffer replacement srcs |
| - zink: move all driverID checks to a helper function |
| - zink: hook up maintenance7 |
| - zink: use maint7 to capture venus driver and more accurately use workarounds |
| - mesa/st: load state params for feedback draws with allow_st_finalize_nir_twice |
| - egl/x11/sw: fix partial image uploads |
| - egl/x11/sw: plug in swap_buffers_with_damage handling |
| - winsys/radeon: take the full winsys struct in radeon_get_drm_value() |
| - winsys/radeon: wrap fd access with util function |
| - winsys/radeon: switch to rendernode when card node doesn't work |
| - winsys/radeon: revert recent changes |
| - glx: directly link to gallium |
| - egl: link with libgallium directly |
| - gbm: link directly with libgallium |
| - loader: delete loader_open_driver() |
| - loader/dri3: check xfixes version in loader_dri3_open() |
| - loader/dri3: avoid killing the xcb connection if dri3 not found |
| - loader/glx: move multibuffers check to loader |
| - egl: use loader's multibuffer check to deduplicate lots of code |
| - vl/dri3: use loader's dri3 init code and delete everything else |
| - zink: move image aoa access to nir pass |
| - zink: use PIPE_CAP_NIR_SAMPLERS_AS_DEREF |
| - gallium: install gallium-$version.so to libdir |
| - ci: prune dri from LD_LIBRARY_PATH |
| - dril: rework config creation |
| - llvmpipe: handle vma allocation failure |
| - llvmpipe: only use vma allocations on linux |
| - dri: fix kmsro define |
| - Revert "vl/dri3: use loader's dri3 init code and delete everything else" |
| - glx: include src/gallium for apple |
| - dri: link with libloader |
| - kopper: check swapchain size after possible loader image resize |
| - pipe-loader: fix driconf memory management |
| - dril: always take the egl init path |
| - egl: fix zink init |
| - dri: fix kms_swrast screen fail |
| - egl/wayland: bail on zink init in non-sw mode if extension check fails |
| - zink: fix partial update handling |
| |
| Mike Lothian (2): |
| |
| - radeonsi,aco: Run ac_nir_lower_global_access pass |
| - ac/llvm: Remove global access ops handling |
| |
| Mingcong Bai (2): |
| |
| - meson: set default drivers for ppc, ppc64 |
| - meson: set default Vulkan drivers for ppc, ppc64 |
| |
| Mohamed Ahmed (4): |
| |
| - nil: Add a nil_image::compressed bit |
| - nil: Add some helpers for DRM format modifiers |
| - nil: Support creating images with DRM modifiers |
| - nvk: enable rendering to DRM_FORMAT_MOD_LINEAR images |
| |
| Mykhailo Skorokhodov (2): |
| |
| - egl/wayland: Fix sRGB format look up for config |
| - ci/lima: expect fail of window_8888_colorspace_srgb on wayland |
| |
| Nanley Chery (29): |
| |
| - intel/isl: Add and use _isl_surf_info_supports_ccs |
| - intel/isl: Reduce halign for disabled CCS on XeHP |
| - intel/isl: Update quote for XeHP's CCS halign rule |
| - intel/isl: Allow sampling from 3D HIZ_CCS_WT |
| - intel/blorp: Factor bpb into the fast-clear rect |
| - intel/blorp: Allow gfx12 fast-clears without CCS surf |
| - intel/isl: Add and use ISL_DRM_CC_PLANE_PITCH_B |
| - anv: Refactor modifier plane layout queries |
| - intel/aux_map: Add and use INTEL_AUX_MAP_MAIN_PITCH_SCALEDOWN |
| - intel/aux_map: Add and use INTEL_AUX_MAP_META_ALIGNMENT_B |
| - intel/aux_map: Add and use INTEL_AUX_MAP_MAIN_SIZE_SCALEDOWN |
| - intel/isl: Add and use ISL_MAIN_TO_CCS_SIZE_RATIO_XE |
| - intel/isl: Add and use multi-engine surf usage bits |
| - iris: Simplify bo import in memobj_create_from_handle |
| - intel/isl: Assert alignments of surface addresses |
| - anv: Rely on the primary surf usage to disable aux |
| - anv,hasvk: Drop anv_get_isl_format_with_usage |
| - anv: Support multiple aspects in anv_formats_ccs_e_compatible |
| - anv: Rely more on ISL_SURF_USAGE_DISABLE_AUX_BIT |
| - anv: Restrict CCS ISL surface creation to gfx9-11 |
| - iris: Add and use comp_ctrl_surf_offset on gfx12 |
| - intel/isl: Drop support for the gfx12 CCS ISL surf |
| - intel/isl: Add and use isl_drm_modifier_needs_display_layout |
| - iris,anv: Disable gfx12.0 fast-clears with unaligned pitch |
| - intel/isl: Consolidate some tiling checks for CCS |
| - intel/isl: Require display flag for 512B pitch alignment |
| - intel/isl: Pad the pitch on gfx12.0 for fast-clears |
| - anv+zink/ci: Change sparse test result from crash to fail |
| - intel/isl: Enable Tile4 for CPB surfaces |
| |
| Natanael Copa (1): |
| |
| - nir/opt_varyings: reduce stack usage |
| |
| Neha Bhende (2): |
| |
| - svga: Retrieve stride info from hwtnl->cmd.vdecl for swtnl draws |
| - dri: fix macro name check to detect svga driver |
| |
| Oskar Viljasaar (8): |
| |
| - vulkan/properties: support Android in the property generator |
| - v3dv: constify arguments of vendor/device id getters |
| - v3dv: Use common runtime vk_properties |
| - vulkan/properties: Document RENAMED_PROPERTIES in the property generator |
| - anv: Move completely over to common runtime GetPhysicalDeviceProperties2 |
| - hasvk: switch to use runtime physical device properties infrastructure |
| - vulkan: add a property struct setter function |
| - venus: Use common physical device properties |
| |
| Patrick Lerda (8): |
| |
| - gallium/auxiliary/vl: fix typo which negatively impacts the src_stride initialization |
| - clover: fix pipe_box update regression |
| - clover: fix memory leak related to optimize |
| - r600: fix vertex state update clover regression |
| - mesa/main: fix stack overflow related to the new mipmap code |
| - radeonsi: fix assert triggered on gfx6 after the tessellation update |
| - clover: fix meson opencl-spirv option |
| - st/pbo_compute: fix async->nir memory leak |
| |
| Paulo Zanoni (31): |
| |
| - isl: add ISL_TILING_64_XE2 to isl_tiling_to_name() |
| - anv/sparse: add the MSAA block shape tables |
| - anv/sparse: we can't do multi-sampled depth/stencil sparse images |
| - anv/sparse: properly reject sample counts we don't support |
| - anv/sparse: reject all sample flags that non-sparse doesn't support |
| - anv/sparse: fix block_size_B when the image is multi-sampled |
| - anv/sparse: exclude Xe2's Tile64's non-standard block shapes |
| - anv/sparse: flush the tile cache when resolving sparse images |
| - anv/sparse: enable MSAA for Sparse when applicable |
| - anv: check for VK_RENDERING_SUSPENDING_BIT once at CmdEndRendering |
| - anv+zink/ci: add failures related to multi-sampled sparse binding |
| - anv/sparse: assert a format can't be standard and non-standard |
| - anv/xe: fix declaration of memory flags for integrated non-LLC platforms |
| - anv/sparse: reject 1D sparse residency images |
| - anv/sparse: fix the image property sizes for multi-sampled images |
| - anv/sparse: fix reporting of VK_SPARSE_IMAGE_FORMAT_SINGLE_MIPTAIL_BIT |
| - intel/isl: pass struct isl_tile_info to choose_image_alignment_el() |
| - anv/sparse: dump info about opaque binds when DEBUG_SPARSE |
| - anv/sparse: fix TR-TT page table bo size and flags |
| - anv/sparse: remove obsolete linear tiling code path |
| - anv/sparse: unify and rework tile size calculation |
| - anv/sparse: use ANV_SPARSE_BLOCK_SIZE instead of tile_size when possible |
| - anv: properly store the engine_class_supported_count values |
| - anv: LNL+ doesn't need the special flush for sparse |
| - anv: reimplement the anv_fake_nonlocal_memory workaround |
| - iris: fix iris_xe_wait_exec_queue_idle() on release builds |
| - anv/trtt: fix the process of picking device->trtt.queue |
| - anv/xe: try harder when the vm_bind ioctl fails |
| - anv: don't expose the compressed memory types when DEBUG_NO_CCS |
| - anv: disable CCS for Source2 games on Xe2 |
| - intel: fix compute SLM sizes on Xe2 and newer |
| |
| Pavel Ondračka (13): |
| |
| - r300: replace constant size field with usemask |
| - r300: move dead constants pass earlier for vertex shaders |
| - r300: switch to a new constant remap table format |
| - r300: compact scalar uniforms into empty slots |
| - r300: better packing for immediates |
| - r300/ci: fails list update |
| - r300: fix cycles counting for KIL |
| - r300: fix writemask rewrite when converting to omod |
| - r300: fix for output modifier and DDX/DDX |
| - r300: fix RC_OMOD_DIV_2 modifier |
| - r300: missing whitespace in shader stats |
| - r300: vectorization tweaks for R300/R400 |
| - r300: bias presubtract fix |
| |
| Philipp Zabel (7): |
| |
| - etnaviv: drm: Suppress get-param error message for non-existent core |
| - etnaviv: drm: Stop after model query failure |
| - etnaviv: Pass npu to etna_screen_create in a separate parameter |
| - etnaviv: Add a separate NPU pipe |
| - etnaviv: Allow collecting both GPU and NPU specs |
| - etnaviv/nn: Pipe through input/accumulation buffer depth from hwdb |
| - etnaviv: update headers from rnndb |
| |
| Pierre-Eric Pelloux-Prayer (34): |
| |
| - radeonsi/sqtt: use ac_sqtt_get_shader_mask for spm counters |
| - radeonsi/sqtt: cleanup si_sqtt_add_code_object a bit |
| - radeonsi/sqtt: support sqtt buffer auto-resizing |
| - radeonsi: add new si_shader_binary_upload_at method |
| - radeonsi/sqtt: use si_shader_binary_upload_at to reupload shaders |
| - radeonsi: allocate sqtt and spm buffers in GTT |
| - radeonsi: add testmemperf mem bandwidth test |
| - radv/sqtt: use radeon_check_space before emit_spm_* |
| - radeonsi: use the common SQTT implementation |
| - radeonsi/sqtt: add AMD_THREAD_TRACE_INSTRUCTION_TIMING |
| - ac/sqtt: make VA helpers static |
| - ac/llvm: implement WA in nir to llvm |
| - radeonsi: store the total binary size in si_shader |
| - radeonsi: handle DBG(TEX) after tc_compatible_htile is set |
| - radeonsi/tests: don't match gfx10_3 baseline for gfx10 family |
| - radeonsi/tests: add a shortcut to re-run only failing tests |
| - ac/surface: reject modifiers with retile_dcc and bpe != 32 |
| - radeonsi: add gfx11 workaround for upgraded_depth |
| - ac/nir: don't use the compute blit for PIPE_FORMAT_R5G6B5_UNORM |
| - radeonsi/tests: update tests baseline |
| - radeonsi/tests: clarify the output when results changes |
| - radeonsi: fix buffer_size in si_compute_shorten_ubyte_buffer |
| - Revert "ac, radeonsi: remove has_syncobj, has_fence_to_handle" |
| - ac/info: remove has_syncobj |
| - winsys/radeon: fill lds properties |
| - radeonsi: fix crash in si_update_tess_io_layout_state for gfx8 and earlier |
| - radeonsi/tests: correctly parse the family name |
| - radeonsi: fix ac_create_shadowing_ib_preamble parameter |
| - radeonsi, radv: bump libdrm_amdgpu version requirement |
| - ci: bump Fedora and Android libdrm2 to 2.4.122 |
| - radeonsi: fix si_get_dmabuf_modifier_planes for gfx12 |
| - frontends/dri: add error logs to dri2_create_image_from_fd |
| - amd: use a valid size for ac_pm4_state allocation |
| - egl,gbm,glx: fix log message spam |
| |
| Qiang Yu (8): |
| |
| - glsl: respect GL_EXT_shader_image_load_formatted when image is embedded in a struct |
| - radeonsi: add missing nir_intrinsic_bindless_image_descriptor_amd |
| - nir: fix lower array to vec metadata preserve |
| - nir: fix clip cull distance lowering metadata preserve |
| - nir: add filter parameter to nir_lower_array_deref_of_vec |
| - nir: nir_vectorize_tess_levels support indirect access |
| - nir: consider more deref types when fixup deref |
| - glsl: fix indirect tess factor access for compact_arrays=false drivers |
| |
| Rebecca Mckeever (9): |
| |
| - panvk: Add jm and bifrost dirs |
| - panvk: Add push_uniform/constant helpers |
| - panvk: Make helper functions panvk_cmd_buffer agnostic |
| - panvk: Move panvk_descriptor_state to bifrost subdir |
| - panvk: Move vkCmdDraw* functions to their own file |
| - panvk: Move vkCmdDispatch* functions to their own file |
| - panvk: Move vkCmd*Event functions to their own file |
| - panvk: Add Valhall DescriptorSetLayout implementation |
| - panvk: Add Valhall Descriptor{Set,Pool} implementations |
| |
| Renato Pereyra (2): |
| |
| - anv: Attempt to compile all pipelines even after errors |
| - intel/perf: Move sysmacros.h include from header to implementation |
| |
| Rhys Perry (95): |
| |
| - aco/tests: add tests for hidden breaks/continues |
| - aco/tests: add tests for divergent merge phi with undef |
| - nir/dead_cf: stop reindexing blocks for each non-block cf node |
| - aco/stats: fix s_waitcnt parsing |
| - aco/stats: don't use VS counter pre-GFX10 |
| - aco/waitcnt: fix DS/VMEM ordered writes when mixed |
| - aco: make wait_imm indexable |
| - aco/waitcnt: add target_info |
| - aco/waitcnt: refactor for indexable wait_imm |
| - aco/stats: refactor for indexable wait_imm |
| - aco: add wait_imm::unpack and wait_imm::max |
| - radv: keep track of unaligned dynamic vertex access |
| - aco: form hard clauses in VS prologs |
| - aco: copy VS prolog constants after loads |
| - aco: support VS prologs with unaligned access |
| - aco/util: improve small_vec assertion |
| - radv: advertise VK_EXT_legacy_vertex_attributes |
| - aco: don't count certain pseudo towards VMEM_STORE_CLAUSE_MAX_GRAB_DIST |
| - aco/tests: support GFX12 |
| - aco: add SFPU/ValuPseudoScalarTrans instr class |
| - aco: add GFX11.5+ opcodes |
| - aco: support GFX12 in assembler |
| - aco/tests: add GFX12 assembler tests |
| - aco: don't change prefetch mode on GFX11.5+ |
| - aco/gfx12: disable s_cmpk optimization |
| - aco: add GFX12 wait counters |
| - aco/waitcnt: support GFX12 in waitcnt pass |
| - aco/stats: support GFX12 in collect_preasm_stats() |
| - aco: update VS prolog waitcnt for GFX12 |
| - aco/lower_phis: create loop header phis for non-boolean loop exit phis |
| - aco: create lcssa phis for continue_or_break loops when necessary |
| - aco: use scalar phi lowering for lcssa workaround |
| - aco: remove nir_to_aco |
| - aco/lower_phis: don't create boolean loop header phis in some situations |
| - radv: malloc graphics pipeline stages |
| - aco: support GFX12 in insert_NOPs |
| - aco/gfx12: implement subgroup shader clock |
| - aco/gfx12: implement workgroup barrier |
| - aco/gfx12: sign-extend s_getpc_b64 |
| - aco/gfx12: don't create v_fmac_legacy_f32 |
| - aco/gfx12: use ttmp9/ttmp7 for workgroup id |
| - radv/gfx12: don't add workgroup id shader args |
| - aco/gfx12: remove MIMG vector affinity |
| - aco/gfx12: decrease max_nsa_vgprs for VSAMPLE |
| - aco/gfx12: disallow SCC and most constants for BUF SOFFSET |
| - aco: fix fddx/y with uniform inf/nan input |
| - meson: remove --depfile for aco_tests |
| - ac/llvm: implement load_subgroup_id |
| - aco/gfx12: implement load_subgroup_id |
| - ac/nir: skip subgroup_id/local_invocation_index lowering for gfx12 |
| - aco/gfx12: fix s_wait_event immediate |
| - aco: don't combine vgpr into writelane src0 |
| - aco: implement nir_atomic_op_ordered_add_gfx12_amd |
| - aco: implement nir_intrinsic_nop_amd and nir_intrinsic_sleep_amd |
| - ac/nir: support lowering of sub-dword push constants |
| - radv: lower sub-dword push constants |
| - ac/llvm: remove support for sub-dword push constants |
| - aco: remove support for sub-dword push constants |
| - aco/gfx6: set glc for buffer_store_byte/short |
| - aco: inline store_vmem_mubuf/emit_single_mubuf_store |
| - aco: use ac_hw_cache_flags |
| - aco: use GFX12 scope/temporal-hint |
| - ac: stop using radeon_info for ac_get_hw_cache_flags |
| - aco: use ac_get_hw_cache_flags() |
| - aco: remove some missing label resets |
| - nir/opt_loop: rematerialize derefs instead of creating phis |
| - nir/opt_loop: fix formatting |
| - aco: insert s_nop before discard early exit sendmsg(dealloc_vgpr) |
| - radv: lower push constants in NIR |
| - ac/llvm: remove push constants |
| - aco: remove push constants |
| - aco/insert_exec_mask: ensure top mask is not a temporary at loop exits |
| - vtn: ensure TCS control barriers have a large enough memory scope |
| - aco: use 1.5x vgprs for gfx1151 and gfx12 |
| - aco: skip continue_or_break LCSSA phis when not needed |
| - aco: use s_pack_ll_b32_b16 for pack_32_2x16_split |
| - aco: combine extracts into s_pack_ll_b32_b16 |
| - aco: use s_pack_*_b32_b16 more in p_insert/p_extract lowering |
| - aco: turn split(vec()) into p_parallelcopy instead of p_create_vector |
| - aco: add missing isConstant()/isTemp() checks |
| - aco: fix follow_operand with combined label_extract and label_split |
| - aco: use alignment information in visit_load_constant() |
| - aco: fix wmma raw hazard |
| - aco: replace constant v_bfrev_b32 with v_mov_b32 to create vopd |
| - aco/gfx11: don't use v_bfrev_b32 with wave64 |
| - glsl: always lower non-TCS outputs to temporaries |
| - gallium: remove PIPE_CAP_SHADER_CAN_READ_OUTPUTS |
| - nir/linking_helpers: remove special case for read mesh outputs |
| - nir/linking_helpers: remove varying accesses in nir_remove_unused_io_vars |
| - nir/linking_helpers: remove nested IF |
| - radv: remove unnecessary nir_remove_unused_varyings cleanup passes |
| - aco/gfx11.5: workaround export priority issue |
| - aco: fix validation of v_s\_ opcodes |
| - docs: update ACO_DEBUG documentation for scheduler options |
| - docs: update ACO_DEBUG documentation for perfwarn |
| |
| Rob Clark (63): |
| |
| - tu: Add missing error path cleanup |
| - tu: Fix a6xx lineWidthGranularity |
| - freedreno/ir3: Skip DAG validation on release builds |
| - llvmpipe: Fix build error with clang-18 |
| - freedreno/ci: Switch a618_piglit to deqp-runner |
| - vulkan/android: Add helper to probe AHB support |
| - vulkan: Don't request Ycbcr conversion for rgb |
| - vulkan: Add helper to resolve Android external format |
| - tu: Skip YUV conversion for RGB formats |
| - tu: Support VkExternalFormatANDROID |
| - freedreno/ci: Remove some skips |
| - freedreno/ci: Remove some obsolete skips |
| - freedreno/ci: Refactor out common a6xx skips list |
| - freedreno/ci: Skip unsupported legacy gl stuff |
| - freedreno/ci: Skip max-texture-size |
| - freedreno/ci: Add a common skips file to a618_piglit. |
| - freedreno/ci: Skip built-in-functions VS/GS tests |
| - freedreno/ci: Skip some slow tests |
| - freedreno/ci: Increase a630/a618 piglit fraction |
| - freedreno/ir3: Fix ldg/stg offset |
| - egl/android: Fix sRGB visuals |
| - docs/features: Add missing AHB for tu |
| - tu: Don't advertise AHB handle time on non-android |
| - freedreno: Namespace DEFINE_CAST() |
| - virgl: Update headers |
| - loader: Add better support for virtgpu nctx driver loading |
| - freedreno/loader: Switch over to probe_nctx |
| - vulkan/android: Fix suggestedYcbcrModel with !mapper4 |
| - tu: Fix imageview + ahb |
| - vulkan/android: Fix YcbcrRange for !mapper4 |
| - ir3: Add some more missing progress accumulation |
| - gallium/tc: Add optional buffer replacement limit |
| - freedreno: Use buffer replacement limit |
| - gallium/tc: Allow replacement if replacing valid_range |
| - freedreno/drm: Add rd dumper support |
| - st/mesa/pbo: Set src type on image_store |
| - freedreno: Handle non-null cb with null buffer |
| - u_blitter+d3d12: Move stencil fallback clear to caller |
| - freedreno/a6xx: Implement S8 support |
| - freedreno: Implement stencil blit fallback |
| - freedreno: Use LINEAR for staging resources |
| - freedreno/a6xx: Drop 16b packed image formats |
| - freedreno/bc: Rework flush order |
| - freedreno/a6xx: Tweak blitter traces |
| - freedreno/a6xx: Skip blitter for L/A conversions |
| - freedreno/a6xx: Add more format swizzles |
| - freedreno/a6xx: Allow blit based transfers |
| - freedreno: Enable the X1-85 |
| - tu: Fix issues with 16k (or larger) page sizes |
| - freedreno/drm/virtio: Fix issues with 16k (or larger) page sizes |
| - freedreno/a6xx: Implement reg stomper support |
| - freedreno/a7xx: Fix GRAS_UNKNOWN_80F4 writes |
| - freedreno/cffdec: Fix a7xx CP_EVENT_WRITE decoding |
| - tu/drm/virtio: Add missing a7xx case |
| - freedreno/drm: Handle a7xx case |
| - freedreno: Move GENX/CALLX magic to common |
| - freedreno: Extract out common UBWC helper |
| - freedreno: Extract out shared LRZFC layout helpers |
| - freedreno/a6xx: Allocate lrcfc when needed for direction tracking |
| - freedreno/a6xx: Refactor CP_EVENT_WRITE emit |
| - freedreno/a6xx: Rework CCU_CNTL emit for a7xx |
| - freedreno/a6xx: Initial a7xx support |
| - gallium: Add option to not add version to libgallium filename |
| |
| Robert Mader (3): |
| |
| - egl: Implement EGL_EXT_config_select_group |
| - egl: Implement EGL_MESA_x11_native_visual_id |
| - egl/x11: Allow all RGB visuals to match 32-bit RGBA EGLConfigs |
| |
| Rohan Garg (21): |
| |
| - anv: formatting fix when printing pipe controls |
| - anv: allocate space for generated indirect draw id's using the temporary allocation helper |
| - intel/brw: update Xe2 max SIMD message sizes |
| - Revert "iris: slow clear higher miplevels on single sampled 8bpp resources that have TILE64" |
| - intel/eu/xe2+: Fix src1 length bits of SEND instruction with UGM target. |
| - intel/brw: Advertise fp64 atomic add's when we have 64 bit float support and a LSC |
| - intel/brw: We no longer have atomic fmin/fmax ops for fp64 in xe2 |
| - intel/genxml: add the new state byte stride instruction |
| - intel/genxml: update 3DSTATE_CPSIZE_CONTROL_BUFFER for xe2+ |
| - isl: enable compression for CPS buffers on xe2+ |
| - intel/genxml: update CFE_STATE for LNL |
| - intel/genxml: Update XY_BLOCK_COPY_BLT |
| - intel/genxml: update MI_SEMAPHORE_WAIT for Xe2 |
| - intel/genxml: Update STATE_COMPUTE_MODE for Xe2 |
| - anv: 3D stencil surfaces have fewer layers for higher miplevels |
| - isl: disable CCS for 3D depth/stencil surfaces when WA is applicable |
| - isl: Enable volumetric STC_CCS,HiZ+CCS on gfx12.0 |
| - intel/genxml: Add RESOURCE_BARRIER for xe2 |
| - intel/compiler: fix shuffle generation on LNL |
| - anv: flag WSI images as scanout images for ISL |
| - anv: reuse existing macro to query for flushes |
| |
| Roland Scheidegger (1): |
| |
| - lavapipe: add option to enable snorm blending |
| |
| Romain Naour (1): |
| |
| - glxext: don't try zink if not enabled in mesa |
| |
| Roman Stratiienko (11): |
| |
| - vulkan/android: Add basic u_gralloc support |
| - vulkan/android: Add common vkGetSwapchainGrallocUsage{2}ANDROID |
| - vulkan/android: Add android buffer classification to vk_image |
| - vulkan/android: Add common helpers for the ANB extension |
| - vulkan/android: Add common helpers for the AHB extension |
| - vulkan/android: Add common vkGetAndroidHardwareBufferPropertiesANDROID |
| - turnip/android: Migrate to common ANB code |
| - v3dv/android: Migrate ANB and AHB to use common helpers |
| - u_gralloc/fallback: Extract modifier from QCOM native_handle |
| - turnip/android: Use DETECT_OS_ANDROID in tu_device |
| - turnip/android: Use DETECT_OS_ANDROID in freedreno_rd_output |
| |
| Romaric Jodin (1): |
| |
| - intel/brw: allocate large table in the heap instead of the stack |
| |
| Ruijing Dong (14): |
| |
| - radeonsi/vcn: add vcn5 encoding interface change |
| - radeonsi/vcn: add vcn5.0 for h264 enc only |
| - radeonsi/vcn: add hevc support for vcn5 |
| - radeonsi/vcn: enable decoding in vcn5. |
| - radeonsi/vcn: correct tile_size_bytes_minus1 |
| - radeonsi/vcn: add cdef modes for vcn5 encoding |
| - radeonsi/vcn: apply cdef mode to vcn5 |
| - radeonsi/vcn: share functions between vcn4/vcn5 |
| - frontends/va: parsing uniform_tile_spacing flag |
| - radeonsi/vcn: add header files for vcn5 av1 tile |
| - radeonsi/vcn: enable av1 encoding in vcn5 |
| - radeonsi/vcn: enable roi feature for vcn5 |
| - radeonsi/vcn: remove tile_config_flag |
| - radeonsi/vcn: update vcn4 tile processing logic |
| |
| Ryan Neph (7): |
| |
| - venus: reclaim signal semaphore feedback resources for wasteful clients |
| - venus: sync headers for VK_EXT_external_memory_acquire_unmodified |
| - venus: enable VK_EXT_external_memory_acquire_unmodified |
| - venus: factor image memory barrier fixes to common implementation |
| - venus: refactor image memory barrier fix storage and conventions |
| - venus: skip barrier fixes as early as possible |
| - venus: chain VkExternalMemoryAcquireUnmodifiedEXT for wsi ownership transfers |
| |
| Rémi Bernon (2): |
| |
| - zink: Add VKAPI_PTR specifier to zink_stub_function_not_loaded. |
| - zink: Add VKAPI_PTR specifier to generated stub functions. |
| |
| Sagar Ghuge (8): |
| |
| - intel/compiler: Fix destination type for CMP/CMPN |
| - intel/disasm: Fix cache load/store disassembly for URB messages |
| - iris: Load 32-bit MMIO PREDICATE register from buffer |
| - intel/compiler: No need to re-type the destination register |
| - intel/fs: Adjust destination register size for untyped atomic on Xe2+ |
| - intel/fs: Adjust destination register size for global atomic on Xe2+ |
| - intel/compiler: Don't use half float param for sample_b |
| - intel/compiler: Add indirect mov lowering pass |
| |
| Samuel Pitoiset (399): |
| |
| - radv: fix image format properties with fragment shading rate usage |
| - docs: Add an alternative way to debug GPU hangs with RADV |
| - radv/rt: add radv_ray_tracing_state_key |
| - radv/rt: pass radv_ray_tracing_state_key to radv_rt_pipeline_compile() |
| - radv/rt: rework the helper that hashes a ray tracing pipeline |
| - radv/ci: add more flakes |
| - radv: simplify DB_Z_INFO.NUM_SAMPLES with null ds target on GFX11 |
| - radv: remove bogus VkShaderCreateInfoEXT::flags being 0 assert for compute |
| - radv: simplify radv_emit_primitive_restart_enable() |
| - radv: inline radv_get_pa_su_sc_mode_cntl() in radv_emit_culling() |
| - radv: remove useless DB_Z_INFO.NUM_SAMPLES when emitting the MSAA state |
| - radv: pre-compute VGT_TF_PARAM.DISTRIBUTION_MODE |
| - radv: use the bound GS copy shader when emitting shader objects |
| - radv: add GS copy shader BO to the cmdbuf BO list at bind time |
| - radv: add RT prolog BO to the cmdbuf BO list at bind time |
| - radv: add shaders BO to the cmdbuf BO list at bind time |
| - radv: emit compute pipelines directly from the cmdbuf |
| - radv: precompute compute/task shader register values |
| - radv: clear unwritten color attachments for monolithic PS earlier |
| - radv: compact SPI_SHADER_COL_FORMAT as late as possible |
| - radv: rename col_format_non_compacted to spi_shader_col_format |
| - radv: store cb_shader_mask for fragment shaders and epilogs |
| - radv: add a new dirty state for emitting the color output state |
| - radv/ci: document a recent regression on GFX6-8 |
| - radv: split cmdbuf dirty flags into dirty/dirty_dynamic |
| - radv: precompute existing legacy GS register values later |
| - radv: precompute fragment shader register values |
| - radv: precompute mesh shader register values |
| - radv: precompute legacy GS register values |
| - radv: precompute vertex shader register values |
| - radv: precompute DB_SHADER_CONTROL for fragment shaders later |
| - vulkan: Update XML and headers to 1.3.284 |
| - aco: add support for remapping color attachments |
| - radv: implement VK_KHR_dynamic_rendering_local_read |
| - radv: advertise VK_KHR_dynamic_rendering_local_read |
| - radv: add a new mechanism for tracking registers per cmdbuf |
| - radv: move common registers between VS/GS and NGG |
| - radv: precompute NGG register values |
| - radv: remove unused parameter to radv_pipeline_emit_pm4() |
| - radv: stop recomputing the last VGT API stage when emitting graphics shaders |
| - radv: do not emit non-context registers to radv_pipeline::ctx_cs |
| - radv: track and bind more VRS states from the graphics pipeline |
| - radeonsi: remove the _unused parameter in all radeon_xxx macros |
| - radv: remove gfx10_emit_ge_pc_alloc() |
| - radv: do not emit VGT_GS_OUT_PRIM_TYPE to ctx_cs on GFX11 |
| - radv: simplify radv_emit_hw_ngg() slightly |
| - radv: simplify radv_emit_hw_vs() slightly |
| - radv: simplify radv_emit_hw_gs() slightly |
| - radv: fix the late scissor workaround for GFX9 since a recent refactoring |
| - radv: make radv_conv_gl_prim_to_gs_out() a non-static function |
| - radv: emit graphics pipelines directly from the cmdbuf |
| - radv: add graphics shaders context registers that need to be tracked |
| - radv: add more radeon_opt_set_xxx variants |
| - radv: track all graphics shaders context registers |
| - radv: simplify radv_emit_ps_inputs() slightly |
| - radv: stop using radv_physical_device for radeon helpers |
| - radv: introduce radeon_set_reg_seq() |
| - radv: remove redundant radeon_set_perfctr_reg() helper |
| - radv: rename radeon perfctr uconfig helpers |
| - radv: add a helper to configure ring buffer descriptors |
| - radv: only enable VK_MESA_image_alignment_control on GFX9-11.5 |
| - radv: reject unsupported buffer formats earlier |
| - ac,radv,radeonsi: add a helper to translate buffer numformat |
| - ac,radv,radeonsi: add a helper to translate buffer dataformat |
| - radv: simplify radv_emit_default_sample_locations() |
| - radv: pass radv_physical_device to radv_emit_default_sample_locations() |
| - radv: use float instead of double for viewport zscale/ztranslate |
| - radv: add more helpers to emit viewports |
| - radeonsi: refactor si_translate_border_color() |
| - ac,radv,radeonsi: introduce a helper to build a sampler descriptor |
| - radv: stop checking the output value of radv_translate_tex_numformat |
| - radv: use PIPE_FORMAT in radv_translate_tex_numformat() |
| - ac,radv,radeonsi: add a function to translate tex numformat |
| - radv: use PIPE_FORMAT in radv_translate_colorswap() |
| - ac,radv,radeonsi: add a function to translate colorswap |
| - radv: use PIPE_FORMAT in radv_translate_dbformat() |
| - ac,radv,radeonsi: add a function to translate db format |
| - ac,radv,radeonsi: add a function to get the color format endian swap |
| - radv: allow 3d views with VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT |
| - radv: simplify creating gfx10 texture descriptors for sliced 3d/2d view of 3d |
| - radv: remove redundant check for VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 on GFX6-10 |
| - radv: stop checking the return value of ac_get_cb_number_type() |
| - radv: add radv_is_colorbuffer_format_blendable() |
| - amd/common: move some format related helpers to ac_formats.c |
| - ac,radv,radeonsi: add helper to know if a format is supported by CB |
| - ac,radv,radeonsi: add helper to know if a format is supported by DB |
| - ac,radv,radeonsi: introduce a helper to build a FMASK descriptor |
| - radeonsi: reject some texture formats but only on GFX8/CARRIZO |
| - radv: mark some formats as unsupported on GFX8/CARRIZO |
| - radv: set image view descriptors as buffer for non-graphics GPU |
| - ac,radv,radeonsi: add a helper to get the tile mode index |
| - radv: tidy up swizzle in radv_set_mutable_tex_desc_fields() |
| - radv: tidy up custom pitch for gfx10.3 in radv_set_mutable_tex_desc_fields() |
| - radv: tidy up meta_va in radv_set_mutable_tex_desc_fields() |
| - radv: set ITERATE_256 for GFX10+ in radv_set_mutable_tex_desc_fields() |
| - radv: stop clearing unnecessary bitfields in radv_set_mutable_tex_desc_fields() |
| - ac,radv,radeonsi: add a helper to set mutable tex desc fields |
| - ac,radv,radeonsi: add a function for getting border color swizzle |
| - radv: only set ALPHA_IS_ON_MSB if the image has DCC on GFX6-9 |
| - radv: adjust determining if alpha should be on MSB |
| - ac,radv,radeonsi: add function to determine if alpha should be on MSB |
| - ac,radv,radeonsi: add a common helper for translating swizzle |
| - amd/common: only pass gfx_level to ac_get_gfx10_format_table() |
| - amd/common: add new helpers to build buffer descriptors |
| - radv: use new common helpers for building buffer descriptor |
| - aco: use new common helpers for building buffer descriptors |
| - radv: remove unused radv_translate_buffer_dataformat() |
| - radv: remove useless check about FIXED formats |
| - radv: simplify radv_is_vertex_buffer_format_supported() |
| - radv: replace vk_to_non_srgb_format() by vk_format_no_srgb() |
| - radv: fix setting a custom pitch for CB on GFX10_3+ |
| - radv: replace db_{z,stencil}_{read,write}_base by db_{depth,stencil}_base |
| - radv: tidy up radv_initialise_ds_surface() |
| - radv: separate non-mutable vs mutable fields for ds surface |
| - amd/common: add a function to initialize ds surface |
| - radv: use the common helper for initializing DS surfaces |
| - radeonsi: use the common helper for initializing DS surfaces |
| - radv: only enable DB_STENCIL_INFO.ITERATE_FLUSH when necessary |
| - radv: do not check image usage for ITERATE256 with TC-compat HTILE |
| - ac,radv,radeonsi: add function to get the number of ZPLANES |
| - ac,radv,radeonsi: a function that sets mutable DS surface fields |
| - ac,radv,radeonsi: move ZRANGE_PRECISION to mutable DS fields |
| - amd/common: add ac_gpu_info::has_etc_support |
| - radv: use PIPE_FORMAT in radv_translate_tex_dataformat() |
| - ac,radv,radeonsi: add a function to translate texture data format |
| - amd/common: add a helper to set the third word of buffer descriptor |
| - ac,radv: add a stride parameter to ac_build_attr_ring_descriptor() |
| - radeonsi: use common build buffer descriptor helpers |
| - radv: align DCC control settings to RadeonSI for GFX8 |
| - amd/common: add a function to initialize cb surface |
| - radv: use the common helper for initializing CB surfaces |
| - radeonsi: use the common helper for initializing CB surfaces |
| - ac,radv,radeonsi: a function that sets mutable CB surface fields |
| - radv: apply the workaround for no PS inputs and LDS on GFX11 only |
| - radv: apply the SQ_THREAD_TRACE_WPTR workaround on GFX11 only |
| - radv: fix flushing DB meta cache on GFX11.5 |
| - radv: only emit streamout enable for legacy streamout |
| - amd/common: define SDMA v7.0 for GFX12 |
| - ac,radv,radeonsi: add ac_gpu_info::has_tc_compatible_htile |
| - radv: update NUM_THREAD_FULL bitfields |
| - radv: enable GS_FAST_LAUNCH_2 by default on GFX12 |
| - radv: do not emit non-existent registers on GFX12 |
| - radv: update configuring sample locations on GFX12 |
| - radv: update configuring viewport/scissor on GFX12 |
| - radv: update configuring PS states on GFX12 |
| - radv: update configuring NGG states on GFX12 |
| - radv: update configuring VGT states on GFX12 |
| - radv: update configuring DB states on GFX12 |
| - radv: update configuring rasterization states on GFX12 |
| - radv: update configuring some CB states on GFX12 |
| - radv: update configuring occlusion query state on GFX12 |
| - radv: update configuring MSAA state on GFX12 |
| - radv: update configuring GFX preamble on GFX12 |
| - radv: update configuring tess rings on GFX12 |
| - radv: update binning settings on GFX12 |
| - radv: update emitting discard rectangles on GFX12 |
| - radv: update shader input arguments for GS stage on GFX12 |
| - aco: adjust loading local invocation ID for GS on GFX12 |
| - radv: do not emulate clear state for shadowed regs on GFX12 |
| - radv: update cache flush emission on GFX12 |
| - radv: update emitting stipple line on GFX12 |
| - radv: disallow merging multiple draws into one wave on GFX12 |
| - radv: emit SQ_NON_EVENT packets after drawing with streamout on GFX12 |
| - radv: update configuring the number of patch control points on GFX12 |
| - radv: update configuring VGT_SHADER_STAGES_EN on GFX12 |
| - radv: enable GE_CNTL.DIS_PG_SIZE_ADJUST_FOR_STRIP on GFX12 |
| - radv: update NUM_THREAD_FULL bitfields on GFX12 |
| - radv: update global graphics shader pointers on GFX12 |
| - radv: update SDMA resource type on GFX12 |
| - radv: update VS input VGPRs on GFX12 |
| - radv: do not enable HTILE for depth/stencil storage images |
| - radv: allow STORAGE for depth formats |
| - radv: fix configuring the number of patch control points on GFX6 |
| - radv: configure DB_Z_INFO.NUM_SAMPLES on GFX12 |
| - radv: configure DB_RENDER_CONTROL to zero on GFX12 |
| - radv: do not enable MEM_ORDERED on GFX12 |
| - radv: update configuring the attribute ring on GFX12 |
| - radv: do not flush L2 metadata on GFX12 |
| - radv: mark all images coherent with TC L2 on GFX12 |
| - radv: update configuring SPI_SHADER_PGM_LO_LS on GFX12 |
| - radv: update configuring SPI_SHADER_PGM_LO_ES on GFX12 |
| - radv: update configuring SPI_SHADER_PGM_RSRC4_{HS,GS,PS} on GFX12 |
| - radv: update configuring GE_CNTL.PRIM_GRP_SIZE_GFX11 on GFX12 |
| - radv: update configuring SPI_PS_IN_CONTROL on GFX12 |
| - radv: configure PA_SC_HISZ_CONTROL on GFX12 |
| - radv: configure SPI_SHADER_GS_OUT_CONFIG_PS on GFX12 |
| - radv: update configuring GS_VGPR_COMP_CNT on GFX12 |
| - radv: do not set DX10_CLAMP on GFX12 |
| - radv: fix VRS subpass attachments with mipmaps |
| - ac,radeonsi: set COLOR_SW_MODE for mutable CB surfaces on GFX12 |
| - radv: configure PA_SC_SAMPLE_PROPERTIES on GFX12 |
| - radv: update number of input VGPRs for VS on GFX12 |
| - radv: update configuring color buffers on GFX12 |
| - radv: update configuring depth stencil buffers on GFX12 |
| - radv: update configuring PA_SC_WINDOW_SCISSOR on GFX12 |
| - radv: do not emit SPI_SHADER_PGM_RSRC3_GS on GFX12 |
| - radv: fix configuring NGG registers on GFX12 |
| - radv: do not set VGT_PRIMITIVEID_EN.PRIMITIVEID_EN on GFX12 |
| - radv: cleanup radv_precompute_registers_hw_{ngg,fs} |
| - radv: assert that GDS/GDS OA buffers can't be created on GFX12 |
| - radv: only set valid bitfields for CB/DS surfaces address |
| - radv: only emit VGT_GS_MAX_PRIMS_PER_SUBGROUP on GFX9 |
| - radv: only emit SQ_PERFCOUNTER_MASK on GFX7-9 |
| - radv: do not set VGT_SHADER_STAGES_EN.DYNAMIC_HS on GFX9 |
| - radv: only emit SPI_SHADER_PGM_SRC3_GS on GFX7+ |
| - radv: only emit CB_COLOR0_DCC_CONTROL on GFX8 |
| - radv: use pipe_format when building image view descriptors |
| - ac,radv,radeonsi: add a function to build texture descriptors |
| - amd/common: add MIN_LOD for texture descriptors on GFX12 |
| - Revert "radv/ci: Bring back vkcts-navi21-llvm-valve" |
| - radv: update configuring depth clamp enable on GFX12 |
| - radv: update configuring COVERAGE_TO_SHADER_SELECT on GFX12 |
| - radv: fix emitting VGT_PRIMITIVEID_RESET in the GFX preamble on GFX12 |
| - radv: only set valid bitfields for CB/DS surfaces address on GFX12 |
| - radv: add a helper to get image VA |
| - ac,radeonsi: import PM4 state from RadeonSI |
| - ac,radeonsi: add a function to initialize compute preambles |
| - radv: initialize compute preambles with the common helper |
| - radv: fix creating unlinked shaders with ESO when nextStage is 0 |
| - radv: pass a radv_shader to radv_get_compute_pipeline_metadata() |
| - radv: don't assume that TC_ACTION_ENA invalidates L1 cache on gfx9 |
| - ac,radv: add a helper for SQTT control register |
| - ac,radv,radeonsi: add more parameters to ac_sqtt |
| - amd: allow to emit privileged config registers in PM4 |
| - amd: mark more registers that need RESET_FILTER_CAM in PM4 |
| - amd: add a common implementation for SQTT using PM4 |
| - radv: emit more consecutive registers for SQTT on GFX8-9 |
| - radv: use the common SQTT implementation |
| - radv: update VGT_TESS_DISTRIBUTION.ACCUM_ISOLINE value |
| - radv: do not set registers set by CLEAR_STATE in the preamble on GFX10-11.5 |
| - radv: emit SPI_GS_THROTTLE_CNTL1 when the attr ring is emitted |
| - radv: fix incorrect buffer_list advance for multi-planar descriptors |
| - radv: use BDA in the DGC prepare shader |
| - radv: remove dynamic uniform/storage buffers support with DGC |
| - radv: do not use nir_pkt3() when the packet len is constant with DGC |
| - radv: add new macros for emitting packets in DGC |
| - radv: remove redundant nir_builder param in some DGC helpers |
| - radv: add a helper to load the pipeline VA for DGC |
| - radv: store a pointer to the logical device in dgc_cmdbuf |
| - radv: allow VK_NV_device_generated_commands_{compute} with LLVM |
| - radv: always save/restore all shader objects for internal operations |
| - radv: update configuring WALK_ALIGN8_PRIM_FITS_ST on GFX12 |
| - ac/surface: add NBC view support on GFX12 |
| - radv: declare a new user SGPR for the streamout state buffer on GFX12 |
| - radv/nir: lower nir_intrinsic_load_xfb_state_address_gfx12_amd |
| - radv: implement streamout on GFX12 |
| - radv: force using indirect descriptor sets for indirect compute pipelines |
| - radv: emit indirect sets for indirect compute pipelines with DGC |
| - radv: fix emitting indirect descriptor sets in the DGC prepare shader |
| - radv: cleanup getting AC_UD_TASK_RING_ENTRY for mesh shader |
| - radv: use radv_shader_info::user_data_0 for task shaders |
| - radv: remove dead mesh shader code for indirect draws |
| - radv: remove useless masking in radv_cs_emit_indirect_mesh_draw_packet() |
| - radv: remove useless draw_id to radv_emit_userdata_task() |
| - radv: add the DGC preprocess BO to the cmdbuf BO list |
| - radv/amdgpu: allow cs_execute_ib() to pass a VA instead of a BO |
| - radv/amdgpu: use the non-IB path for dumping CS with external IBs |
| - ac/parse_ib: dump PKT3_DISPATCH_{TASKMESH_GFX,TASKMESH_DIRECT_ACE} |
| - radv/amdgpu: fix chaining CS with external IBs on compute queue |
| - radv: add a helper to execute a DGC IB |
| - radv: add support for computing the DGC ACE IB size |
| - radv: prepare for DISPATCH_TASKMESH_GFX emission in the DGC shader |
| - radv: prepare for DISPATCH_TASKMESH_DIRECT_ACE emission in the DGC shader |
| - radv: refactor some DGC helpers in preparation for the ACE IB |
| - radv: add a helper to pad DGC IB |
| - radv: add support for preparing the ACE IB in DGC |
| - radv: add support for executing the DGC ACE IB |
| - radv: fix incorrect cache flushes before decompressing DCC on compute |
| - radv: improve clarity of DGC offset computations |
| - radv: pre-compute the base upload offset in radv_prepare_dgc() |
| - radv: add a helper that determines if DGC uses task shaders |
| - radv: split allocating and emitting push constants with DGC |
| - radv: rework emitting push constants with DGC |
| - radv: reserve space for push constants in the DGC ACE IB |
| - radv: adjust the base upload offset when DGC uses task shaders |
| - radv: emit push constant for task shaders with DGC |
| - radv: disable conditional rendering with DGC and task shaders |
| - radv: fix a synchronization issue with non-preprocessed DGC with task shader |
| - radv: enable task shaders support with NV DGC |
| - radv: suspend user conditional rendering when DGC has task shaders |
| - radv: rename radv_get_user_sgpr() to radv_get_user_sgpr_info() |
| - radv: add radv_get_user_sgpr{_loc}() helpers |
| - radv: use radv_get_user_sgpr_loc() for the GS copy shader too |
| - radv: remove unused parameter to dgc_emit_draw_mesh_tasks_ace() |
| - radv: do not emit compute userdata for empty dispatches |
| - radv: cleanup using vtx_base_sgpr for userdata with DGC |
| - radv: use radv_dgc_with_task_shader() more |
| - radv: move radv_CmdPreprocessGeneratedCommandsNV() to radv_cmd_buffer.c |
| - radv: use the graphics pipeline from the DGC info |
| - radv: use radv_get_user_sgpr() more in DGC |
| - vulkan: Update XML and headers to 1.3.289 |
| - radv: advertise VK_KHR_maintenance7 |
| - ci: bump vkd3d-proton to 3d46c082906c77544385d10801e4c0184f0385d9 |
| - radv: remove unused parameter to radv_pipeline_import_retained_shaders() |
| - radv: simplify importing libraries with retained shaders |
| - radv: remove unused get_vs_output_info() function |
| - radv: remove unnecessary radv_pipeline_has_ngg() function |
| - radv: move radv_hash_shaders() to radv_graphics_pipeline.c |
| - radv: simplify determining when the rasterization primitive is unknown |
| - radv: simplify determining when a VS prolog is needed |
| - radv: stop passing a pipeline to some graphics related helpers |
| - radv: rework generating all graphics state for compiling pipelines |
| - radv: remove radv_descriptor_set_layout::shader_stages |
| - radv: use blake3 for hashing descriptor set layouts |
| - radv: use blake3 for hashing pipeline layouts |
| - radv: disable VK_EXT_sampler_filter_minmax on TAHITI and VERDE |
| - ac,radeonsi: add ac_is_reduction_mode_supported() |
| - radv: use ac_is_reduction_mode_supported() |
| - radv: fix marking RADV_DYNAMIC_COLOR_ATTACHMENT_MAP as dirty |
| - nir/gather_info: handle uses_fbfetch_output for sparse image loads |
| - nir/gather_info: handle uses_fbfetch_output for texture operations |
| - radv: destroy the perf counter BO in radv_device_finish_perf_counter() |
| - radv: add radv_device_init_perf_counter() |
| - radv: add helpers for init/deinit device memory cache |
| - radv: add helpers for init/deinit RGP |
| - radv: simplify keeping shader info for GPU hangs debugging |
| - radv: add radv_device_init_trap_handler() |
| - radv: add helpers for init/deinit device fault detection |
| - radv: add radv_device_init_rmv() |
| - radv: regroup all tools initialization in one helper |
| - radv: use zero allocation for the device queues |
| - radv/meta: remove non-valuable comments |
| - radv/meta: remove unnecessary blit2d_dst_temps struct |
| - radv/meta: remove redundant check for hw resolve pipelines |
| - radv/meta: remove unused number of rectangles for internal operations |
| - radv/meta: remove useless checks for NULL handles before destroying |
| - radv/meta: add a helper to create compute pipeline |
| - radv/meta: add a helper to create pipeline layout |
| - radv/meta: add a helper to create descriptor set layout |
| - zink/ci: skip arb_shader_image_load_store also on NAVI31/VANGOGH |
| - zink/ci: remove redundant arb_shader_image_load_store skips on POLARIS10 |
| - radv: do not expose ImageFloat32AtomicMinMax on GFX11_5 |
| - radv: fix programming DB_RENDER_CONTROL for NULL depth/stencil on GFX11_5 |
| - radv: expose BufferFloat32AtomicMinMax on GFX11_5 |
| - radv: disable SPM trace on GFX11_5 |
| - ac/rgp: assume GFX11_5 use the same SQTT/RGP versions as GFX11 |
| - radv: allow to capture with RGP on GFX11_5 |
| - radv/meta: fix potential race condition when creating the copy VRS pipeline |
| - radv/meta: rework creating the VRS copy HTILE pipeline |
| - radv/meta: remove the depth resummarize operation |
| - radv/meta: avoid potential NULL deref with the gfx depth decompress pipeline |
| - radv/meta: move locking around the gfx depth decompress pipeline |
| - radv/meta: remove unused parameter to radv_get_depth_pipeline() |
| - radv/meta: rework creating the gfx depth decompress pipeline |
| - radv/meta: create the compute depth decompress pipeline on-demand |
| - radv/meta: cleanup creating the compute depth decompress pipeline |
| - radv/meta: separate creating the fill/copy pipelines |
| - radv/meta: create the fill/copy pipelines on-demand |
| - radv/meta: cleanup radv_device_init_meta_blit_{color,depth,stencil}() |
| - radv/meta: move the locking around creating blit pipelines |
| - radv/meta: cleanup meta_emit_blit() |
| - radv/meta: rework creating blit pipelines |
| - radv/meta: create fmask expand layouts regardless on-demand |
| - radv/meta: rework creating FMASK expand pipelines |
| - radv/meta: create fmask copy layouts regardless on-demand |
| - radv/meta: rework creating copy expand pipelines |
| - radv/meta: fix potential race condition when creating DCC retile pipelines |
| - radv/meta: fix potential memleak when creating DCC retile pipelines |
| - radv/meta: rework creating DCC retile pipelines |
| - radv/meta: remove useless memset when destroying DCC retile state |
| - radv/meta: rework creating GFX depth/stencil resolve pipelines |
| - radv/meta: rework creating GFX color resolve pipelines |
| - radv/meta: rework creating compute color resolve pipelines |
| - radv/meta: rework creating compute depth/stencil resolve pipelines |
| - radv/meta: cleanup creating HW resolve pipelines |
| - radv/meta: rework creating HW resolve pipelines |
| - radv/meta: rework creating DCC decompress compute pipelines |
| - radv/meta: rework creating clear HTILE mask pipeline |
| - radv/meta: create clear HTILE mask pipeline on-demand when needed |
| - radv/meta: create DCC comp-to-single pipelines on-demand when needed |
| - radv/meta: add a helper to create itob pipelines |
| - radv/meta: create itob pipelines on-demand when needed |
| - radv/meta: add a helper to create btoi pipelines |
| - radv/meta: create btoi pipelines on-demand when needed |
| - radv/meta: add a helper to create btoi r32g32b32 pipeline |
| - radv/meta: create btoi r32g32b32 pipeline on-demand when needed |
| - radv/meta: update the helper that creates itoi pipelines |
| - radv/meta: create itoi pipelines on-demand when needed |
| - radv/meta: add a helper to create itoi r32g32b32 pipeline |
| - radv/meta: create itoi r32g32b32 pipelines on-demand when needed |
| - radv/meta: update the helper that creates clear pipelines |
| - radv/meta: create clear pipelines on-demand when needed |
| - radv/meta: add a helper to create clear r32g32b32 pipeline |
| - radv/meta: create clear r32g32b32 pipelines on-demand when needed |
| - radv: fix shaders cache corruption with indirect pipeline binds |
| - radv/meta: stop checking that creating NIR shaders failed |
| - radv/meta: remove unnecessary goto |
| - radv/meta: stop creating similar pipeline layouts for depth decompress |
| - radv/meta: create the layouts for blit pipelines on-demand |
| - radv/meta: create the layouts for FS resolve pipelines on-demand |
| - radv/meta: create the layouts for depth decompress on-demand |
| - radv/meta: create the layouts for FMASK copy on-demand |
| - radv/meta: create the layouts for FMASK expand on-demand |
| - radv/meta: create the layouts for compute resolve on-demand |
| - radv/meta: create the layouts for DCC comp-to-single clear on-demand |
| - radv/meta: rework getting clear color pipelines |
| - radv/meta: create the layout for clear color on-demand |
| - radv/meta: rework getting depth stencil clear pipelines |
| - radv/meta: create the layout for clear depth/stencil on-demand |
| |
| Saroj Kumar (2): |
| |
| - mesa: Add functions to print blake3 |
| - mesa: replace shader_info::source_sha1 |
| |
| Sathishkumar S (3): |
| |
| - util/format: add planar3 y8_u8_v8_440 pipe format |
| - frontends/va,gallium/vl: add support for yuv440 format |
| - radeonsi/vcn: enable yuv440 jpeg decode |
| |
| Sebastian Wick (1): |
| |
| - vulkan/wsi/wayland: refactor wsi_wl_swapchain_wait_for_present |
| |
| Sergi Blanch Torne (21): |
| |
| - mr-label-maker: specialize CI labels |
| - ci: kernel stored in a different s3 bucket |
| - ci: identify and label S3 buckets |
| - ci: disable Collabora's farm due to maintenance |
| - Revert "ci: disable Collabora's farm due to maintenance" |
| - ci: fix stress counter in run'n'monitor |
| - ci: disable Collabora's farm due to maintenance |
| - Uprev Piglit to cf8daaf5ba90fc9b8a0e144355026e2a14c79944 |
| - Revert "ci: disable Collabora's farm due to maintenance" |
| - ci: disable Collabora's farm due to runners maintenance |
| - Revert "ci: disable Collabora's farm due to runners maintenance" |
| - ci: continue stress run'n'monitor |
| - ci: Fix parse GitLab pipeline url |
| - ci: run_n_monitor, collect and summarize |
| - ci: disable Collabora's farm due to maintenance |
| - Revert "ci: disable Collabora's farm due to maintenance" |
| - ci: run_n_monitor, arguments review and unicode |
| - ci: run_n_monitor, pretty duration with padding |
| - ci: run_n_monitor, listing job names with a padding |
| - ci: run_n_monitor, sort by name when listing jobs |
| - ci: fix run_n_monitor single execution |
| |
| Sil Vilerino (5): |
| |
| - d3d12: Fix static analysis issues due to bad parenthesis closing |
| - nir: Mark variable as ASSERTED to fix unused variable warning treated as error |
| - d3d12: Video Encode - Fix inputs for older OS support query cap |
| - d3d12: Add missing case for CQP in d3d12_video_encoder_disable_rc_qualitylevels |
| - Revert "d3d12: Video Encode - Remove PIPE_VIDEO_PROFILE_MPEG4_AVC_BASELINE as not supported" This reverts commit d6bb4ddc638f3ee37fbbe066c631dad80aaeb2d3. Fixes: d6bb4ddc638 ("d3d12: Video Encode - Remove PIPE_VIDEO_PROFILE_MPEG4_AVC_BASELINE as not supported") |
| |
| Simon Ser (1): |
| |
| - glapi: fix param type in TexGenxOES |
| |
| Sushma Venkatesh Reddy (4): |
| |
| - drm-uapi: Sync i915_drm.h with a78313bb206e |
| - anv/drirc: add option to provide low latency hint |
| - anv: Fix I915_PARAM_HAS_CONTEXT_FREQ_HINT check |
| - intel/clflush: Utilize clflushopt in intel_invalidate_range |
| |
| Sviatoslav Peleshko (5): |
| |
| - anv: Fix descriptor sampler offsets assignment |
| - anv,driconf: Add fake non device local memory WA for Total War: Warhammer 3 |
| - intel/brw: Actually retype integer sources of sampler message payload |
| - intel/elk: Actually retype integer sources of sampler message payload |
| - mesa: Fix PopAttrib not restoring states that changed on deeper stack level |
| |
| Tapani Pälli (14): |
| |
| - iris: change stream uploader default size to 2MB |
| - anv: skip gfx push constants alloc optimization on gfx9/11 |
| - iris: ForceZeroRTAIndexEnable if last geom stage does not write layer id |
| - vulkan/runtime: add a subpass bit for legacy dithering |
| - anv: VK_EXT_legacy_dithering support |
| - docs/features: add VK_EXT_legacy_dithering |
| - ci: update failures list with angle for jsl, tgl |
| - anv/android: enable emulated astc for applications |
| - anv: implement WA 14018283232 |
| - mesa: remove some conditions in mipmap code |
| - isl: fix condition for enabling sampler route to lsc |
| - isl/iris/anv: provide drirc toggle intel_sampler_route_to_lsc |
| - anv: move some pc WAs to batch_emit_pipe_control_write |
| - anv: fix a cmd_buffer reference in simple shader |
| |
| Tatsuyuki Ishi (5): |
| |
| - radv: Remove radv_queue::device again |
| - vk_entrypoints_gen: Add missing ATTR_WEAK for instance and physdev entrypoints |
| - vk_entrypoints_gen: Rework ATTR_WEAK to unify Unix and MinGW |
| - vk_entrypoints_gen: Apply hidden visibility to generated symbols |
| - vk_cmd_queue_gen: Exclude CmdDispatchGraphAMDX |
| |
| Thomas H.P. Andersen (2): |
| |
| - nvk: advertise EXT_depth_range_unrestricted |
| - nvk/upload_queue: fix the _fill method |
| |
| Tim Huang (2): |
| |
| - amd: add GFX v11.5.2 support |
| - amd/vpelib: support VPE IP v6.1.3 |
| |
| Timothy Arceri (36): |
| |
| - glsl: wrap nir_opt_loop in NIR_PASS() |
| - glsl: use hash table when serializing resource data |
| - glsl: move geom input array sizing to nir linker |
| - lima: drop unrequired opt from standalone compiler |
| - glsl: remove unused detect_recursion_linked() |
| - lima: remove the standalone compiler |
| - glsl: add support for glsl es 310/320 to standalone compiler |
| - nir: clarify and update loop conditional instruction |
| - nir: more aggressively remove in loop during partial unroll |
| - nir: support more loop unrolling for logical operators |
| - nir: add merge loop terminators optimisation |
| - nir: add test for opt_loop_merge_terminators |
| - nir: correctly track current loop in nir_opt_loop() |
| - nir: test opt_loop_merge_terminators() skips unhandled loops |
| - nir: add additional opt_loop_merge() test of deref handling |
| - glsl: drop dump-builder support from standalone compiler |
| - glsl: remove Par-linking from the standalone linker |
| - glsl: remove do_function_inlining() |
| - glsl: make glsl_to_nir() more generic |
| - glsl: remove unused symbol table functionality |
| - glsl: remove out of date TODO |
| - glsl: move call to create explicit ifc layout out of glsl_to_nir |
| - glsl: drop glsl ir optimisation from the standalone compiler |
| - glsl: make warning tests pass linking |
| - glsl/mesa: remove UniformHash field |
| - glsl/standalone: init EmptyUniformLocations |
| - glsl/tests: fix test_gl_lower_mediump |
| - mesa: remove _mesa_get_log_file() wrapper |
| - util/mesa: move mesa/main log code to util |
| - mesa: add unreachable to _mesa_shader_stage_to_subroutine_prefix() |
| - glsl: set how_declared to hidden for compiler temps |
| - glsl: fix cross validate globals |
| - glsl: remove out of date comment |
| - nir: set disallow_undef_to_nan for legacy ARB asm programs |
| - glsl: fix glsl to nir support for lower precision builtins |
| - glsl: always copy bindless sampler packing constructors to a temp |
| |
| Timur Kristóf (25): |
| |
| - ac/nir/esgs: Slightly refactor emitting IO loads and stores. |
| - ac/nir/tess: Slightly refactor emitting LS outputs. |
| - ac/nir: Add helper macros for emitting IO code. |
| - ac/nir/esgs: Implement packed 16-bit ES->GS I/O using helper macros. |
| - ac/nir/tess: Implement packed 16-bit LS->HS I/O using helper macros. |
| - ac/nir/tess: Implement packed 16-bit HS->TES I/O using helper macros. |
| - aco: Add missing nir_builder include. |
| - ac/nir: Move some helpers to new file. |
| - ac/nir: Add helper for pre-rasterization output info. |
| - ac/nir/ngg: Use new pre-rasterization output info helper. |
| - ac/nir/legacy: Use new pre-rasterization output info helper. |
| - nir: Add nir_opt_load_store_update_alignments. |
| - radv: Add TES num_linked_patch_inputs. |
| - radv: Add shader stats for inputs and outputs. |
| - radv: Fix TCS -> TES I/O linking typo of VARYING_SLOT vs. BIT. |
| - nir/opt_varyings: Print FS VEC4 type when debugging relocate_slot. |
| - nir/opt_varyings: Don't promote flat inputs when moving post-dominator. |
| - ac/nir/tess: Adjust TCS->TES output mapping for linked shaders. |
| - radv: Properly link TCS->TES IO again. |
| - nir/lower_io: Add option to implement mediump as 32-bit. |
| - radv: Ignore mediump IO flag. |
| - ac/nir/tess: Only write tess factors that the TES reads. |
| - ac/nir/tess: Fix per-patch output LDS mapping. |
| - ac/nir/tess: Fix per-patch output VRAM mapping. |
| - radv: Use number of TES inputs for TCS-TES linking. |
| |
| Tomeu Vizoso (2): |
| |
| - etnaviv/nn: Make parallel jobs disabled by default |
| - etnaviv: handle missing alu conversion opcodes |
| |
| Turo Lamminen (1): |
| |
| - radv: Optimize memcpy in write_image_descriptor |
| |
| Tvrtko Ursulin (1): |
| |
| - intel/hang_replay: fix batch address |
| |
| Valentine Burley (40): |
| |
| - docs: Update VK_EXT_legacy_vertex_attributes entries |
| - tu: Add missing VK_EXT_legacy_vertex_attributes feature |
| - tu: Change commas to semicolons in VK_EXT_map_memory_placed features |
| - drm-shim: Stub syncobj reset ioctl |
| - tu: Expose VK_EXT_nested_command_buffer |
| - freedreno/devices: Fix indentation for Adreno A32 |
| - freedreno/ci: Update expectations |
| - wsi: Guard DRM-dependent function implementations with HAVE_LIBDRM |
| - tu: Add support for VkBindMemoryStatusKHR |
| - tu: Add support for NULL index buffer |
| - tu: Add support for version 2 of all descriptor binding commands |
| - tu: Advertise VK_KHR_maintenance6 |
| - tu: Move event related code to tu_event.cc/h |
| - tu: Handle all dependencies of CmdWaitEvents2 |
| - mr-label-maker: Update nouveau directories |
| - mr-label-maker: Separate freedreno and turnip labels |
| - tu: Handle the new sync2 flags |
| - tu: Remove declaration of unused update_stencil_mask function |
| - tu: Switch to vk_ycbcr_conversion |
| - tu: Use vk_sampler |
| - tu: Use device->vk.enabled_features instead of iterating twice |
| - tu: Move sampler related code to tu_sampler.cc/h |
| - tu: Drop tu_init_sampler helper function |
| - tu: Advertise VK_KHR_shader_float_controls2 |
| - tu: Use the common version of vkGetBufferMemoryRequirements2 |
| - tu: Move buffer related code to tu_buffer.cc/h |
| - tu: Use the common version of vkQueueBindSparse |
| - tu: Use vk_buffer_view |
| - tu: Drop tu_buffer_view_init helper function |
| - tu: Move buffer view related code to tu_buffer_view.cc/h |
| - tu: Rename tu_query.cc/h to tu_query_pool.cc/h |
| - tu: Use the common versions of vkBegin/EndQuery() |
| - tu: Use vk_query_pool |
| - tu: Don't disable 2 10-bit formats |
| - freedreno,tu,ir3: Move threadsize_base and max_waves to fd_dev_info |
| - freedreno/ci: Use the common a6xx-skips on a750 |
| - tu: Enable VK_KHR_shader_subgroup_uniform_control_flow |
| - tu/kgsl: Remove unused variable |
| - vulkan/wsi: Refactor can_present_on_device |
| - tu: Always report that we can present on kgsl |
| |
| Vignesh Raman (3): |
| |
| - virtio/ci: separate hidden jobs to -inc.yml files |
| - ci: add farm variable for devices in collabora farm |
| - ci/lava: add farm in structured log files |
| |
| Vinson Lee (2): |
| |
| - panvk: Remove duplicate variable src_idx |
| - panvk: Fix assert |
| |
| Vlad Schiller (2): |
| |
| - pvr: Handle VK_STRUCTURE_TYPE_EXPORT_MEMORY_ALLOCATE_INFO |
| - pvr: Handle VK_STRUCTURE_TYPE_IMAGE_FORMAT_LIST_CREATE_INFO |
| |
| WANG Xuerui (2): |
| |
| - meson: Force use of LLVM ORCJIT for hosts without MCJIT support |
| - meson: Additionally probe -mtls-dialect=desc for TLSDESC support |
| |
| Weifeng Liu (1): |
| |
| - anv/android: Query gralloc for tiling mode |
| |
| X512 (2): |
| |
| - egl/haiku: fix double free of BBitmap |
| - egl/haiku: fix synchronization problems, add missing header |
| |
| Yiwei Zhang (31): |
| |
| - venus: avoid client allocators for ring internals |
| - venus: silence a stack array false alarm |
| - venus: workaround excessive dma-buf import failure on turnip |
| - venus: fix to destroy all pipeline handles on early error paths |
| - meson: disallow Venus debug + LTO build via GCC |
| - turnip: msm: clean up iova on error path |
| - turnip: msm: fix racy gem close for re-imported dma-buf |
| - venus: drop the workaround for excessive dma-buf import oom on turnip |
| - turnip: virtio: fix error path in virtio_bo_init |
| - turnip: virtio: fix iova leak upon found already imported dmabuf |
| - turnip: virtio: fix racy gem close for re-imported dma-buf |
| - vulkan: cast to avoid -Wswitch for Android struct beyond VkStructureType |
| - venus: directly use vk drm and pci props in renderer info |
| - venus: move custom props fill from GPDP2 to props init |
| - venus: move props sanitization to a separate helper |
| - venus: define VN_SET_VK_PROPS(_EXT) to simplify vk props init |
| - vulkan: drop redundant core props query and copy helpers |
| - venus: drop internal memory pools |
| - venus: allow non-wsi image alias path to passthrough upon bind memory |
| - ci/venus: skip a timeout test |
| - anv: use os_get_option instead of getenv |
| - venus: defer qfb buffer init upon query being used |
| - venus: refactor vn_android_image_from_anb |
| - venus: refactor to add vn_android_image_from_anb_internal |
| - venus: support VK_ANDROID_NATIVE_BUFFER_SPEC_VERSION 8 |
| - vulkan: properly ignore unsupported feature structs |
| - venus: tentative fix for test flakiness from invalid ring wait |
| - venus: simplify cached mem type emulation |
| - venus: clarify wsi image ownership |
| - venus: fix a race condition between gem close and gem handle tracking |
| - Revert "meson: disallow Venus debug + LTO build via GCC" |
| |
| Yogesh Mohan Marimuthu (4): |
| |
| - radeonsi: remove si_query_hw_ops table and call func directly |
| - radeonsi: use results_end instead of unprepared to init query buffer |
| - radeonsi: rename query_hw_ops to hw_query_ops to match sw |
| - radeonsi: add more comments in si_query.c |
| |
| Yonggang Luo (2): |
| |
| - util: Rename DETECT_OS_UNIX to DETECT_OS_POSIX |
| - gallivm: add lp_context_ref for combine usage of LLVMContextSetOpaquePointers |
| |
| Yukari Chiba (7): |
| |
| - llvmpipe: add gallivm_add_global_mapping |
| - llvmpipe: make unnamed global have internal linkage |
| - util: detect RISC-V architecture |
| - gallivm: add riscv support to the mattrs setting code |
| - llvmpipe: add function name to gallivm_jit_function |
| - llvmpipe/tests: add a new test for multiple symbols for orc jit testing |
| - llvmpipe: add an implementation with llvm orcjit |
| |
| Yusuf Khan (7): |
| |
| - nouveau: Fix crash when destination or source screen fences are null |
| - nouveau/headers: Make nvk_cl**** turn to nv_push_cl**** |
| - nvk: remove NVK_MME_COPY_QUERIES |
| - zink/query: begin time elapsed queries even if we aren't in a rp |
| - nvc0/vbo: wrap draw_vbo for multidraw performance |
| - nv50/vbo: wrap draw_vbo to avoid overhead from multidraw |
| - aux/draw: Use the draw info we get passed in instead of our own |
| |
| Zach Battleman (2): |
| |
| - intel/brw: update comment to accurately reflect intended behavior |
| - intel/brw: update Wa_1805992985 to use workarounds mechanism |
| |
| Zack Middleton (2): |
| |
| - gles1: fix GL_OES_vertex_array_object |
| - gles1: fix glBufferSubData() |
| |
| Zan Dobersek (14): |
| |
| - fdperf: use snprintf instead of asprintf |
| - fdperf: select_counter() should work with a countable value |
| - fdperf: prettify logic around the reserved CP counter |
| - fdperf: improve reads of counter values |
| - fdperf: simplify counter value output |
| - freedreno: add a7xx perfcounter support |
| - tu: fix ZPASS_DONE interference between occlusion queries and autotuner |
| - tu: avoid memory polling in occlusion query endings using ZPASS_DONE |
| - tu: use either the 16-bit or 32-bit descriptor |
| - ir3_nir_opt_preamble: handle 8-bit preamble loads and stores |
| - ir3: rework TYPE_S8 as TYPE_U8_32 |
| - tu: support KHR_8bit_storage |
| - tu: add format feature flag checks for VK_IMAGE_USAGE_INPUT_ATTACHMENT_BIT |
| - freedreno/drm: add mesautil dependency |
| |
| bbhtt (1): |
| |
| - nvk: Clean up unused header from libdrm_nouveau |
| |
| chiachih (9): |
| |
| - amd/vpelib: Resolve mismatch with shader |
| - amd/vpelib: Remove linear_0_125 TF |
| - amd/vpelib: Remove gamma cached table |
| - amd/vpelib: Remove support for non-linear FP16 |
| - amd/vpelib: adding blend gamma bypass |
| - amd/vpelib: Remove checks for pitch alignment |
| - amd/vpelib: Fix Color Adjustment Failing Test Cases |
| - amd/vpelib: Fix blndgam bypass flag assignment |
| - amd/vpelib: Bypass de/regam on HLG |
| |
| msizanoen (1): |
| |
| - egl/wayland: Fix direct scanout with EGL_EXT_present_opaque |
| |
| nyanmisaka (1): |
| |
| - frontends/va: add support for A2RGB10/X2RGB10/A2BGR10/X2BGR10 |
| |
| tarsin (4): |
| |
| - turnip: Change tu_image to use common initialization helpers |
| - turnip: Convert tu_device_memory to use vk_device_memory |
| - turnip: Split tu_image_init to use layout setting logic separately |
| - turnip: Support AHardwareBuffer |