| Mesa 22.3.0 Release Notes / 2022-12-02 |
| ====================================== |
| |
| Mesa 22.3.0 is a new development release. People who are concerned |
| with stability and reliability should stick with a previous release or |
| wait for Mesa 22.3.1. |
| |
| Mesa 22.3.0 implements the OpenGL 4.6 API, but the version reported by |
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / |
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. |
| Some drivers don't support all the features required in OpenGL 4.6. OpenGL |
| 4.6 is **only** available if requested at context creation. |
| Compatibility contexts may report a lower version depending on each driver. |
| |
| Mesa 22.3.0 implements the Vulkan 1.3 API, but the version reported by |
| the apiVersion property of the VkPhysicalDeviceProperties struct |
| depends on the particular driver being used. |
| |
| SHA256 checksum |
| --------------- |
| |
| :: |
| |
| 644bf936584548c2b88762111ad58b4aa3e4688874200e5a4eb74e53ce301746 mesa-22.3.0.tar.xz |
| |
| |
| New features |
| ------------ |
| |
| - GL_ARB_shader_clock on llvmpipe |
| - VK_KHR_shader_clock on lavapipe |
| - Mesa-DB, the new single file cache type |
| - VK_EXT_attachment_feedback_loop_layout on RADV, lavapipe |
| - VK_KHR_global_priority on RADV |
| - GL_KHR_blend_equation_advanced_coherent on zink |
| - VK_EXT_load_store_op_none on RADV |
| - VK_EXT_mutable_descriptor_type on RADV |
| - VK_EXT_shader_atomic_float on lvp |
| - VK_EXT_shader_atomic_float2 on lvp |
| - GL_NV_shader_atomic_float on llvmpipe |
| - VK_EXT_image_robustness on v3dv |
| - VK_EXT_extended_dynamic_state3 on lavapipe |
| - VK_EXT_extended_dynamic_state3 on RADV |
| - VK_EXT_pipeline_robustness on v3dv |
| - Mali T620 on panfrost |
| - Shader disk cache on Panfrost |
| - support for R8G8B8, B8G8R8, R16G16B16 and 64-bit vertex buffer formats on RADV |
| - initial GFX11/RDNA3 support on RADV |
| - various ray tracing optimizations on RADV |
| - extendedDynamicState2PatchControlPoints on RADV (VK_EXT_extended_dynamic_state2 feature) |
| - Radeon Raytracing Analyzer integration (using RADV_RRA_* environment variables) |
| |
| |
| Bug fixes |
| --------- |
| |
| - r600,regression: Wrong color on a Minecraft shaderpack on Cayman |
| - [iris/crocus] Native CS:GO OpenGL: fire produces black rendering artifacts |
| - VDPAU GL interop is broken, causing Black and White video |
| - panfrost: application crashes with 22.2.4/22.3.0-rc3 |
| - etnaviv: compiler assertion fail in glmark2 terrain |
| - anv: dEQP-VK.pipeline.monolithic.multisample.variable_rate.8_1_fb_1 failure |
| - DG2: Age of Empire IV incorrect rendering |
| - r600: Sky not rendered correctly with the NIR backend on Transport Fever 2 |
| - "intel/compiler: Signed integer range analysis for imul_32x16 generation" regresses several OpenCL tests |
| - aco: Missing 8-bit uadd_carry |
| - !16168 broke Xephyr -glamor fallback to swrast |
| - panfrost/rk3399: performance drop in emulationstation and in RetroArch menu [mesa >= 21.3.8, batocera.linux, bisected] |
| - d3d12/VA-API: [-wmissing-prototypes] with MSYS2 MinGW-W64 UCRT64 GCC |
| - clover: windows: library filename has \`-1` suffix and a \`lib` prefix when built with mingw |
| - freedreno: !19236 broke anitchamber trace |
| - radv, dxvk: Rendering errors in World of Tanks after "Switch to dynamic rendering only" |
| - gen9 gt3e/gt4e skus fail dEQP-VK.pipeline.multisample.sample_locations_ext.* |
| - Intel - GPU Hang when starting new game in Hellblade: Senua's Sacrifice |
| - v3d: Wrong colors (pink) in videos in Firefox (likely YUV->RGB shader issue) |
| - anv/zink: cts failures on ICL |
| - panfrost t860 glmark-es2 regression |
| - panfrost_drm.h build error |
| - radv: Artifacts in multiple games with yuzu (regression, bisected) |
| - Doom (1993) Ray-Traced has render glitches. |
| - radv: Flickering in Spider-Man Remastered (Regression) (Bisected) |
| - radv: Hitman 2 using Direct3D 12 has discolored squares on RDNA2 with DCC enabled |
| - panfrost/midgard - on Duckstation PSX emulator: segfault on GLES 3.0 and bad shader compilations on 3.3 |
| - nouveau: Updating to Mesa*22.2.0 makes videos have corrupted colors in Firefox |
| - [radeonsi] OpenGL rendering issue with Unreal Tournament 99 & OldUnreal 4.69 rc4 |
| - Replace the functionals in util/debug.h with util/u_debug.h |
| - gallium: nir_opt_access desynchronizes image access modifiers in NIR with Program->sh.ImageAccess |
| - [radv/navi21] No sprites being rendered in Quake 2 RTX |
| - Hacknet does not render on Panfrost |
| - !19259 broke some piglit arb_tessellation_shader tests with radeonsi if TCS passthrough shader is cached |
| - panfrost g52 issue |
| - [Intel] Doom Eternal crashes shortly after launching |
| - GravityMark segfault when using ray tracing ANV |
| - nir: Incorrect idiv lowering |
| - intel-clc fails with "Unsupported SPIR-V capability" and SEGFAULT. |
| - Navi 14 hangs running "rendercheck -t cacomposite -f a8r8g8b8" since !18170 |
| - glx=xlib can not be built under ubuntu 22.04 |
| - radv: incorrect query result for VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT |
| - crocus: EGL_ANDROID_native_fence_sync extension is missing |
| - freedreno/a6xx: spec@arb_tessellation_shader@execution@tess-instance-id fail |
| - r300: wrong primitive positions in CXBX-R emulator with nine and RADEON_DEBUG=notcl |
| - EGL re-initialization can fail silently |
| - Rusticl: fails to create program if source size is stated, but last char is a null terminator |
| - r300: register allocation sometimes generate invalid swizzles |
| - Mesa git Upgrade causes cursor detection issues in Wayland, issue was not in 22.3.0-git.20220920.8d66c45.fc36 |
| - Proposal: Remove rbug, graw, graw tests, xa, xvmc |
| - intrinsics.h:112:21: error: use of undeclared identifier 'intel_sub_group_shuffle' |
| - tu: broken since dynamic pcp |
| - radv: Crash in dEQP-VK.ray_query.misc.dynamic_indexing |
| - t860: mediapipe application failed |
| - glthread: radeonsi: offset textures in game starsector with glthread enabled |
| - Crashing on Windows VM |
| - zink: descriptor-handling regression |
| - hasvk: noperspective is broken on Haswell hardware (Intel(R) HD Graphics 4400) |
| - util/u_atomic.h can not be included in util/u_debug.h |
| - radv: dynamic patch control points broken |
| - Exanima renders with the wrong colors. |
| - Missing tile cache flushes causing corruptions |
| - turnip: compute shader hang in the Monster Hunter World |
| - anv_private.h:118:10: fatal error: anv_entrypoints.h: No such file or directory |
| - freedreno: mis-rendering on gardensofkadesh/homeworld |
| - piglit.spec.egl 1_4.egl-ext_egl_image_storage times out |
| - nouveau: tegra124: GL_OUT_OF_MEMORY error |
| - r600: Shadows incorrecly rendered on Counter-Strike: Global Offensive with the NIR backend |
| - r300: regalloc doesn't handle brk correctly |
| - Recent WSI commit breaks every Vulkan application on RDNA 2 GPUs |
| - 8775f08bf11 makes warcraft 3 crash with free(): double free detected in tcache 2 |
| - anv: probable ccs issue on icl |
| - iris: remap_tess_levels assertion failure in shader-db runs |
| - RADV: Mafia & Mafia III :Definitive Edition graphical issues with certain effects |
| - Cyberpunk 2077 ground reflectiveness issues using RADV |
| - [RADV] Grand Theft Auto V: Shadow rendering bug on Steam Deck |
| - rusticl: fails to build on non-Linux: cannot find value \`stderr` in this scope |
| - llvmpipe-piglit-rusticl fail at api/clenqueuefillimage |
| - Broken build with Clang-16 due to: aco_register_allocation.cpp:382:7: error: no matching function for call to 'printf' |
| - src/gallium/drivers/r600/sfn/sfn_instr.cpp:92:33: error: expected primary-expression before 'int' |
| - r600: Make NIR the default backend |
| - undefined reference to \`intel_ds_begin_xfb' |
| - data-race leads to crash in u_trace_context_init() |
| - radeonsi/radv: Discrepancy in handling wd_switch_on_eop |
| - Shadows rendering issue in Deathloop |
| - RADV: Regression with RPCS3 where nothing renders |
| - [RADV][bisected] commit 3714c89d0ebf0c541d8f7a7a807ce7c1bfe3cfea at least lead to GPU hang when I run the game "Quake II RTX" |
| - eglCreateContext doesn't generate EGL_BAD_MATCH when share_context is on a different EGLDisplay |
| - Intel - Homerun Clash misrendering issue |
| - r300: dEQP-GLES2.functional.shaders.indexing.tmp_array.float_dynamic_write_dynamic_loop_read_fragment regression |
| - turnip/a660: dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.subgroup_ballot failure due to "movmsk" reordered with "kill" |
| - WSI reports 11 RGB bits is unsupported in X11, using zink/swrast |
| - [radeonsi] Graphical corruption in Valve Source-based games |
| - RADV: VK_EXT_load_store_op_none support? |
| - radv: extendedDynamicState2PatchControlPoints |
| - Vega 8 rendering GDM grey blank login screen after recent change |
| - r300: possibility of merging ADD and MUL into MAD when effectively only 2 sources are used |
| - MESA_DEBUG=incomplete_tex shows unnecessary validation during glClear |
| - anv: Performance drop with Vulkan on Wayland |
| - lima: crash during dEQP-EGL execution |
| - Regression from https://gitlab.freedesktop.org/mesa/mesa/-/commit/8fcb4aa0ebd7b9d0d8f80986fb817afea2fc4a87 |
| - radv: Enable instruction timing with RGP for RT |
| - Updates to 22.2 RCs cause artifacts on nouveau and blank screen on VirtIO |
| - nouveau: corruption on NVAC after switching to the NIR backend |
| - egl: Remove big-display-lock |
| - DG2: vulkancts tests spin forever |
| - [regression][bisected] v3dv: dEQP-VK.api.object_management.alloc_callback_fail_multiple.graphics_pipeline fails |
| - v3dv: evaluating reducing current value of DEFAULT_MAX_BO_CACHE_SIZE |
| - venus-lavapipe flakes |
| - amd: LPDDR4 and LPDDR5 hits an assertion failure in ac_memory_ops_per_clock |
| - zink: GL_TEXTURE_LOD_BIAS vs mipLodBias, and GTF-GL46.gtf21.GL3Tests.texture_lod_bias.texture_lod_bias_all failure on Turnip |
| - ci: radeonsi-raven-skqp also tests RADV but only runs on radeonsi changes |
| - Gallium OpenGL megadriver fails to build if LLVM is enabled but not required by any component built |
| - panfrost: crash in dEQP test |
| - bitpack_helpers.h:34:10: fatal error: valgrind.h: No such file or directory |
| - [gen9_iris] piglit causes crash on 4GB-memory machine when running spec/!opengl 1.1/max-texture-size |
| - Abort due to "Unknown intrinsic: intrinsic store_deref" (firefox, radeonsi) |
| - radv: missing support for some 3-component vertex formats in RDNA2 |
| - radv: radv_translate_buffer_dataformat overhead |
| - nir: Replacing GC with manual memory management |
| - turnip: dEQP-VK.subgroups.vote.graphics.subgroupallequal_*int regression on a635 |
| - regression from !17949 |
| - r600/sfn: xonotic shows a big performance regression when using the NIR backend with sb disabled |
| - r600/sfn: register allocation fails for a number of piglits and deqp tests |
| - DG2 corruption in gnome desktop from F36 |
| - d3d12: Delete custom OpenGLOn12 target, add option to override libgallium_wgl.dll name |
| - radeonsi: error allocating sparse buffers.. bug? |
| - RADV: vulkaninfo fails to report extended VkShaderStageFlagBits for VK_KHR_ray_tracing_pipeline |
| - [radeonsi] Transform feedback with array outputs fails |
| - [BISECTED/radeonsi] Flickering triangles with wayfire cube |
| - re-evaluate the linker for CI |
| - [RADV] (Vega/Navi) Halo: The Master Chief Collection: Texture issue on Halo 3 and Reach |
| - ../src/util/u_queue.c:218: util_queue_fence_destroy: Assertion \`fence->signalled' failed for iris when disabled UTIL_FUTEX_SUPPORTED |
| - Freedreno turnip mesa cause emulation close after 15-20min gameplay |
| - glsl: textureGatherOffset param broken |
| - turnip: broken after switching to userspace vma |
| - turnip: gfxbench aztec ruins crashes on zink |
| - !16863 disabled all meson tests in CI |
| - intel unittests failed a lot because recents commits Job Failed #27063524 |
| - Microsoft CLC runtime crash (at least on 32-bit) since !12510 |
| - Job Failed #26465985 |
| - v3dv: android: VTS big regression after 316728a55bc8fe398be1ac2796a22f8c91fb886c |
| - radeonsi: driver crashes with kwin-effects-cube |
| - [Turnip] Final Fantasy X: Incorrect hair rendering |
| - iris: Suboptimal usage of HiZ + CCS for mipmapped buffers |
| - Document state of Dozen Vulkan support in features.txt for MesaMatrix site.. |
| - turnip: gfxbench car chase crash on zink |
| - [r300] - osu! - Shader optimizations cause black and glitchy rendering |
| - [Bisected] Microsoft CLC: build failure with MSVC and Dshared-llvm=disabled with AMDGPU target available |
| - Android.mk: meson enables flags based on the libraries installed on the host system |
| - anv: invalid returns from vkCreateDescriptorPool |
| - [Bisected][RadeonSI] Blender crashes on startup |
| - lavapipe CI times out |
| - turnip: depth/stencil regressions |
| - radv regression: Hitman 2 rendering issue |
| - llvmpipe: assert in KHR-Single-GL45.enhanced_layouts.xfb_all_stages |
| - android: vulkan: Disable blocked KHR_performance_query extension (v3dv, turnip, anv) |
| - Poor performance/utilization with Splitgate |
| - [Bisected][RADV] Rendering issues and performance drop with GFX10 on RPCS3 |
| - freedreno_gmem.c crashes Firefox Nightly in Khronos |
| - freedreno / mesa 22.1.3 crashes Firefox Nightly |
| - !17693 broke piglit oes_egl_image_external_essl3 |
| - VK_PRESENT_MODE_IMMEDIATE_KHR rendering artifacts |
| - dzn: Build failure when it's the only driver built - 22.2-devel regression |
| - d3d12: Unresolved external symbols from Version.lib |
| - a618 vk_full timing out at 60 minutes instead of 120 |
| |
| |
| Changes |
| ------- |
| |
| Adam Jackson (25): |
| |
| - egl: Remove some can't-happen supported API checks |
| - egl: Move an error check earlier in EGL_BUFFER_AGE_EXT query |
| - wsi/x11: Defer clearing image busy flag for non-shm upload |
| - egl/kopper: Don't add EGL_SWAP_BEHAVIOR_PRESERVED_BIT configs |
| - glx: Use XSaveContext, delete glxhash.c |
| - Revert "glx: Use XSaveContext, delete glxhash.c" |
| - wsi/x11: Fix the is-visual-supported check |
| - nouveau: const cleanup |
| - egl/dri2: Respect the arguments to dri2_set_blob_cache_funcs |
| - egl/dri2: Fix a typo in a comment |
| - egl/dri2: Fix some thinkos in old context release |
| - glx/dri*: Unify glx_context subclassing |
| - glx/dri: Avoid a weird indirection in driFetchDrawable |
| - glx: Remove some excess work from the GLX_FBCONFIG_ID fallback |
| - egl: Fix a bunch of maybe-uninitialized warnings |
| - egl: Promote _eglLock/UnlockDisplay to internal API |
| - egl: Untangle some spaghetti around _eglLookupDisplay |
| - egl: Factor out the eglTerminate write-lock pattern |
| - egl: Rewrite eglSetBlobCacheFuncsANDROID to use the standard macros |
| - mesa/fbo: Don't force both read/draw bindings to zero |
| - egl: Formatting fix |
| - egl: Factor some common terminate cleanup up to common code |
| - egl/dri2: Fix a weird conditional in dri2_make_current |
| - egl: Remove a bogus restriction from eglMakeCurrent |
| - rusticl: meson devenv support |
| |
| Adrián Larumbe (5): |
| |
| - panfrost: Handle Job VA cycles when decoding a dump file |
| - panfrost: Remove documentation reference to deprecated parameter |
| - panfrost: Add compiler args option for building usermode tools |
| - panfrost: Sync panfrost_drm.h from drm-misc-next |
| - panfrost: Add userspace crash dump decoder and analyser |
| |
| Alejandro Piñeiro (11): |
| |
| - v3dv/bo: reduce DEFAULT_MAX_BO_CACHE_SIZE to 64 |
| - v3dv/pipeline: use a array instead of individual pointer to stages |
| - v3dv/pipeline: keep qpu_insts around if we expect them to be used later |
| - v3d: replace all TODO around for FIXME |
| - broadcom/compiler: don't call nir_opt_load_store_vectorize on all v3d_optimize_nir calls |
| - broadcom/compiler: update how we compute return_words_of_texture_data on non-ssa |
| - broadcom/compiler: add more lowerings/optimizations on v3d_optimize_nir |
| - v3dv/pipeline: use v3d_optimize_nir |
| - broadcom/compiler: pass a strategy struct to vir_compile_init |
| - broadcom/compiler: call nir_opt_gcm with a custom strategy |
| - v3dv/bo: reset bo and then call gem close |
| |
| Alejandro Tafalla (1): |
| |
| - freedreno: Fix graphic glitches on a4xx and a5xx |
| |
| Alessandro Astone (3): |
| |
| - Android.mk: Disable lmsensors |
| - Android.mk: Filter out --build-id=md5 linker flag |
| - Android.mk: Provide a custom entry name to ensure meson checks succeed |
| |
| Alex Brachet (1): |
| |
| - nir: Fix qsort comparator function |
| |
| Alyssa Rosenzweig (414): |
| |
| - docs/asahi: Document varying interpolation |
| - agx: Fix packing of samplers in texture instrs |
| - agx: Fix ld_var cf packing |
| - asahi: Plumb through lower_clip_fs |
| - asahi: Extend counts in BIND packets |
| - asahi: Allow large uniform records |
| - asahi: Dump all textures&samplers |
| - asahi: Fix using multiple textures/samplers |
| - asahi: Use a single bind texture/sampler per pipeline |
| - asahi: Split vertex/fragment pipeline binds |
| - asahi: Correct bind fragment pipeline size |
| - asahi: Encode known bits of Linkage in the XML |
| - asahi: Fix varying XML |
| - asahi: Decode Interpolation packets |
| - asahi,agx: Rewrite varying linking |
| - agx: Add AGX_MESA_DEBUG=noopt option |
| - agx: Model perspective coefficient reg in the IR |
| - agx: Rename varying load instructions |
| - agx: Use split instead of extract for ldcf |
| - agx: Implement noperspective interpolation |
| - agx: Lower more ALU operations |
| - agx: Align 64-bit register pairs |
| - agx: Assert that registers are naturally aligned |
| - agx: Extract umul_high implementation |
| - agx: Implement nir_op_umul_high |
| - agx: Stop using broken idiv lowering |
| - agx: Handle type-changing splits |
| - agx: Remove p_extract |
| - agx: Only emit the used components of gl_FragCoord |
| - asahi: Don't advertise ARB_clip_control yet |
| - asahi: Identify depth clip mode bit |
| - pan/decode: Don't pass around memory handles |
| - pan/decode: Remove pandecode_msg |
| - pan/decode: Stop passing job index around |
| - pan/decode: Stop passing suffixes around |
| - pan/decode: Simplify pandecode_fbd |
| - pan/decode: Reorder MFBD decoding |
| - pan/decode: Unify SFBD/MFBD decoding |
| - pan/decode: Defeature disassembler stats |
| - pan/decode: Centrally declare pandecode entrypoints |
| - pan/decode: Clean up _bifrost_ decode routines |
| - pan/mdg: Remove disassembler stats |
| - panfrost: Don't segfault on unknown models |
| - pan/bi: Implement some extracts and inserts |
| - pan/bi: Fuse [US][8|16]_TO_F32 ops |
| - pan/bi: Test int8/16 -> float32 opts |
| - pan/bi: Optimize bitwise arithmetic of booleans |
| - pan/bi: Optimize pattern from nir_lower_idiv |
| - pan/bi: Don't use the broken idiv lowering |
| - pan/bi: Use nir_opt_idiv_const |
| - asahi: Fix warning building for macOS 12.0 |
| - pan/bi: Don't reorder image loads across stores |
| - pan/bi: Don't allow ATEST to take a temporary |
| - pan/bi: Handle info.fs.untyped_color_outputs on Valhall |
| - panfrost: Handle untyped_color_outputs on Bifrost |
| - panfrost: Set blit output variable types correctly |
| - panfrost: Don't saturate in Bifrost blend shaders |
| - panfrost: Promote blend shader outputs 8->16-bit |
| - panfrost: Avoid blend shader when not blending |
| - panfrost: Don't compile empty blend shaders |
| - panfrost: Invert no_colour to enabled |
| - panfrost: Simplify blitter blend shader creation |
| - panfrost: Include mask in replace blend shader name |
| - panfrost: Assert that blend shaders are nontrivial |
| - r600/sfn: Don't use broken idiv lowering |
| - gallium: Inline away util_make_fragment_tex_shader_writemask |
| - gallium: Inline away util_make_fragment_tex_shader interp_mode |
| - gallium: Remove util_make_fragment_tex_shader_xrbias |
| - pan/mdg: Replicate swizzles for scalar sources |
| - pan/mdg: Reexpress umul_high packing |
| - pan/mdg: Use correct idiv lowering |
| - pan/mdg: Always write return address to r1.w |
| - pan/mdg: Print 3 sources for CSEL |
| - panfrost/ci: Update G72 xflakes list |
| - gallium: Add u_default_set_debug_callback |
| - v3d: Use u_default_set_debug_callback |
| - vc4: Use u_default_set_debug_callback |
| - softpipe: Use u_default_set_debug_callback |
| - lima: Use u_default_set_debug_callback |
| - etnaviv: Use u_default_set_debug_callback |
| - r300: Use u_default_set_debug_callback |
| - panfrost: Route shader-db to debug, not stderr |
| - pan/bi: Implement general 8-bit vector construction |
| - pan/va: Handle 8-bit lane when lowering constants |
| - pan/bi: Implement nir_op_vec8 and nir_op_vec16 |
| - pan/bi: Assert that vectors are sufficiently small |
| - docs/asahi: Document image layouts |
| - asahi: Identify compressed render target fields |
| - asahi: Rename 'tiled 64x64' to 'twiddled' |
| - asahi: Add XML for multisampled textures |
| - asahi: Fix bind sizes |
| - asahi: Fix ASTC enums |
| - ail: Introduce image layout module |
| - ail: Rewrite tiled memcpy for correctness |
| - asahi: Handle 2-channel sRGB textures |
| - asahi: Fix is_format_supported returns |
| - asahi: Advertise ETC1 |
| - asahi: Advertise ASTC formats |
| - asahi: Rename our fake twiddled DRM modifier |
| - ail: Add unit tests for miptree layouts |
| - pan/bi: Fix dual texturing with uniforms |
| - pan/bi: Fix out-of-bounds write in va_lower_split_64bit |
| - pan/bi: Consider all dests in helper_block_update |
| - pan/bi: Model [IF]CMP_{OR,AND,MULTI} ops |
| - pan/va: Split out compare instructions |
| - pan/bi: Set I->nr_dests, I->nr_srcs |
| - pan/bi: Model 3rd source for ATEST |
| - pan/bi: Consider nr_srcs/dests in CSE |
| - pan/bi: Use nr_{srcs,dests} in bi_foreach_{src,dest} |
| - pan/bi: Add validation for nr_srcs/nr_dests |
| - pan/bi: Remove trivial dest[0] = bi_null() |
| - pan/bi: Clean up destination printing |
| - pan/bi: Improve register printing |
| - pan/bi: Register allocate BLEND dest on Valhall |
| - pan/bi: Don't remove dests in DCE |
| - pan/bi: Use bi_emit_split_i32 for trimming vecs |
| - pan/bi: Introduce TEXC_DUAL psuedoinstruction |
| - pan/bi: Assert destinations are non-NULL |
| - pan/bi: Assume destinations are non-NULL |
| - pan/bi: Don't read nonexistant destinations |
| - pan/bi: Don't read nonexistant sources |
| - pan/bi: Add variable dest/src support to builder |
| - pan/bi: Use variable src/dest for collect/split |
| - pan/bi: Use safe helpers to remove srcs/dests |
| - pan/bi: Rebuild some instructions when lowering |
| - pan/bi: Rebuild DISCARD when optimizing |
| - pan/va: Use builder in va_lower_isel |
| - pan/bi: Use builder for MUX -> CSEL opt |
| - pan/bi: Use builder for IADD -> IADDC opt |
| - pan/bi: Use builder for atomic lowering in scheduler |
| - pan/bi: Reduce memory of scheduler access array |
| - pan/bi: Dynamically allocate source/dests |
| - pan/bi: Don't write registers in optimizer tests |
| - pan/bi: Add bi_after_block_logical helper |
| - pan/bi: Lift bi_predecessor_index helper |
| - pan/bi: Add phi nodes |
| - pan/bi: Validate phi ordering |
| - pan/bi: Don't lower FAU for phis |
| - pan/bi: Remove bogus assert lowering branches |
| - pan/bi: Don't optimize if without else |
| - pan/bi: Don't reorder phis in pre-RA scheduling |
| - pan/bi: Validate before&after pre-RA sched |
| - pan/bi: Inline DCE into bi_lower_vector |
| - pan/bi: Translate phis from NIR |
| - pan/bi: Add SSA-based liveness pass |
| - pan/bi: Add brainless out-of-SSA pass |
| - pan/bi: Reduce some moves when going out-of-SSA |
| - pan/bi: Preserve SSA form from NIR |
| - pan/bi: Clean up after converting to SSA |
| - pan/bi: Clear reg in squeeze_index |
| - pan/bi: Don't use bi_temp_reg |
| - pan/bi: Assume SSA in copyprop |
| - pan/bi: Assume SSA in mod prop |
| - pan/bi: Assume SSA in CSE |
| - pan/bi: Assume non-null sources in constant folding |
| - pan/bi: Assume SSA in minor passes |
| - pan/bi: Assume SSA when scheduling for pressure |
| - pan/bi: Assume SSA when translating NIR |
| - pan/bi: Assume SSA for helper invocation analysis |
| - pan/bi: Move non-SSA liveness into RA |
| - pan/bi: Remove NIR registers from the IR |
| - pan/bi: Get rid of bi_get_node |
| - pan/bi: Inline node_to_index |
| - pan/bi: Remove bi_max_temp |
| - pan/bi: Simplify bi_get_index prototype |
| - pan/bi: Add and use bi_foreach_ssa_src macro |
| - pan/bi: Strengenth assert in the validator |
| - pan/bi: Remove assert(bi_is_ssa(dest)) |
| - pan/bi: Add and use bi_replace_src helper |
| - pan/bi: Add and use bi_num_successors helper |
| - agx: Fix tib access in internal shaders |
| - agx: Fix float copyprop of neg(neg) case |
| - agx: Add more unit tests for float copyprop |
| - agx: Don't hardcode uniform enums |
| - agx: Defeature indirect vbufs |
| - agx: Plumb through nir_op_txf |
| - asahi: Fix Indexed Draw command |
| - asahi: Split up unk 2b |
| - asahi: Note some more unknowns in the XML |
| - asahi: Make ctx->zs a pointer |
| - asahi: Export agx_blitter_save |
| - asahi: Handle empty fragment shaders |
| - asahi: Use util_blitter_clear |
| - asahi: Trim garbage at end of set shader |
| - asahi: Relax assert in decoder |
| - asahi: Label batch flush reasons |
| - asahi: Fix depth for cube maps |
| - asahi: Fix nonmipmapped array textures |
| - asahi: Fix "stride" for tiled textures |
| - asahi: Identify and use first level field of texture |
| - asahi: Correct SET_SHADER_EXTENDED disambig bit |
| - asahi: Add 1D and 1D Array enums |
| - asahi: Handle out-of-bounds clear_stencil |
| - asahi: Fix depth/stencil buffers |
| - asahi: Identify stencil test enable |
| - asahi: Split RASTERIZER into constituent words |
| - asahi: Identify object type field via PowerVR |
| - asahi: Don't use lower_wpos_pntc |
| - asahi: Match PPP data structures with PowerVR |
| - asahi: Hoist constant PPP state to start of batch |
| - asahi: Dirty track everything |
| - asahi: Make BO list growable |
| - asahi: Annotate VDM/CDM commands as per PVR |
| - asahi: Express VDM commands according to PowerVR |
| - asahi: Handle Stream Link VDM commands |
| - asahi: Allocate new cmdbufs if out of space |
| - docs/envvars: Document Asahi variables |
| - pan/decode: Fix job cycle detection |
| - nir/lower_blend: Avoid emitting unnecessary fsats |
| - panfrost: Respect buffer offset for OpenCL |
| - panfrost: Honour cso->req_local_mem |
| - nir: Add nir_intrinsic_texture_base_agx sysval |
| - agx,asahi: Implement nir_intrinsic_load_texture_base_agx |
| - agx: Make p_combine take a dynamic src count |
| - agx: Implement texture offsets and comparators |
| - agx: Implement txd |
| - agx: Implement load_global(_constant) |
| - agx: Lower txs to a descriptor crawl |
| - agx: Don't use nir_find_variable_with_driver_location |
| - asahi: Don't crash on <4 channel render targets |
| - asahi: Handle blending with MRT |
| - u_transfer_helper: Handle Z24X8 for drivers that don't use the interleaved transfer_map |
| - u_transfer_helper: Pack Z24S8 to Z24-in-Z32F and S8 |
| - asahi: Assert cache line alignment on Z/S buffers |
| - asahi: Identify ZLS Control word from PowerVR |
| - asahi: Identify CDM block types |
| - asahi: Decode CDM commands separate from VDM |
| - asahi: Shuffle IOGPU structs |
| - asahi: Identify IOGPU compute header |
| - asahi: Decode IOGPU compute header |
| - asahi: Assert that u_transfer_helper is well-behaved |
| - asahi: Use the internal format internally |
| - asahi: Identify spill buffer histogram |
| - asahi: Simplify IOGPU attachment packing |
| - asahi: Identify shared memory fields |
| - asahi: Consolidate magic numbers for USC controls |
| - asahi/genxml: Overflow up to words when packing |
| - asahi: Overhaul USC control packing |
| - asahi: Identify pixel stride |
| - asahi: Identify shared memory layouts |
| - panfrost: Evict the BO cache when allocation fails |
| - rusticl: Build Panfrost |
| - panfrost: Adapt emit_shared_memory for indirect dispatch |
| - agx: Convert and clamp array indices in NIR |
| - panfrost: Default pipe->clear_texture impl |
| - panfrost: Allow compiling MESA_SHADER_KERNEL |
| - panfrost: Upload default sampler for txf |
| - panfrost: Bump PIPE_CAP_MAX_TEXTURE_ARRAY_LAYERS |
| - panfrost: Advertise PIPE_CAP_INT64 |
| - panfrost: Honour flush-to-zero controls on Valhall |
| - panfrost: Assume launch_grid parameters always change |
| - pan/va: Fix missing swizzle on CLZ.v2u16 |
| - pan/bi: Unit test swizzle lowering |
| - pan/bi: Lower some 8-bit swizzles |
| - pan/bi: Test some 8-bit swizzle lowering |
| - pan/bi: Lower swizzles for 8-bit shifts |
| - pan/bi: Strip negate when lowering swizzles |
| - pan/va: Pack IADD.sat bit |
| - pan/va: Fix v4s8 form of R2 opcodes |
| - pan/bi: Handle uhadd, urhadd opcodes |
| - pan/va: Pack .rhadd bit |
| - pan/bi: Add HADD.v4s8.rhadd packing test cases |
| - pan/va: Add 8-bit integer max assembler case |
| - pan/bi: Lower 8-bit min/max to bcsel+comparison |
| - pan/bi: Lower f2i8, f2u8 |
| - pan/bi: Remove bogus assert for pack_32_2x16 |
| - pan/bi: Allow selecting from an 8-bit vec8 |
| - pan/bi: Handle swizzles in unpack_64_2x32_split_{x,y} |
| - pan/bi: Lower <32-bit bit_count |
| - pan/bi: Lower fisnormal |
| - pan/bi: Scalarize phis before the opt loop |
| - pan/bi: Call nir_lower_64bit_phis |
| - pan/bi: Use .auto for image stores |
| - pan/bi: Fix 1D array indexing on Valhall |
| - pan/bi: Stub out scoped_barrier |
| - pan/bi: Implement unpack_64_2x32 |
| - agx: Fix bfi_mask packing |
| - agx: Emit fewer combines for intrinsics |
| - agx: Use 16-bit immediates |
| - agx: Validate immediates |
| - agx: Test fsat+f2f16 together |
| - agx: Ensure that the optimizer sees legitimate SSA |
| - agx: Pass mask into ld/st_tile instructions |
| - agx: Account for mask when writing registers |
| - agx: Remove load_kernel_input path |
| - panfrost: Remove load_kernel_input path |
| - nir/lower_system_values: Fix cs_local_index_to_id with variable workgroups |
| - asahi: Eliminate "Pixel Format" type from GenXML |
| - asahi/genxml: Defeature uint/float |
| - panfrost,asahi: Remove unused prepare macro |
| - panfrost,asahi: Remove unused function |
| - asahi,panfrost: Remove exact attribute |
| - asahi: Use a header more like Intel's GenXML |
| - asahi: Remove no-direct-packing |
| - agx: Add iterators for phi/non-phi instructions |
| - agx: Set PIPE_SHADER_CAP_INDIRECT_CONST_ADDR |
| - agx: Emit smaller combines for nir_op_vec2/3 |
| - agx: Don't prefix pseudo-ops |
| - agx: Improve printing of immediate sources |
| - agx: Switch to dynamic allocation of srcs/dests |
| - agx: Dynamically size split instruction |
| - agx: Rename "combine" to "collect" |
| - agx: Emit trap at pack-time, not during isel |
| - agx: Print instructions as "dest = src" |
| - agx: Print agx_dim appropriately |
| - agx: Preload vertex/instance ID only at start |
| - agx: Write to r0l with a "nesting" instruction |
| - agx: Introduce ra_ctx data structure |
| - agx: Pass in max regs as a paramter to RA |
| - agx: Reserve live-in regs at the start of block |
| - panfrost/ci: Disable t720 jobs |
| - nir: Be smarter fusing ffma |
| - mesa,gallium: Revert "Make point coord origin a CAP" |
| - panfrost/ci: Remove stale fail |
| - pan/mdg: Lower PIPE_COMPUTE_CAP_MAX_THREADS_PER_BLOCK on Midgard |
| - pan/mdg: Limit work registers for large workgroups |
| - pan/mdg: Fix 16-bit alignment with spiller |
| - nir: Fix nir_fmax_abs_vec_comp |
| - nir: Usher nir_normalize_cubemap_coords into 2022 |
| - mesa: Fix multiple matrix pops in a row |
| - nir/lower_io: Set interpolated_input dest_type |
| - u_transfer_helper: Handle Z24S8 with z24_in_z32f but no interleaving |
| - asahi: Lower point sprites in driver |
| - gallium: Default to PIPE_CAP_MIXED_COLORBUFFER_FORMATS |
| - panfrost: Remove NIR_SERIALIZED support |
| - asahi: Hold a reference to BOs in a batch |
| - nir/opt_preamble: Move load_texture_base_agx |
| - agx: Remove command line compiler |
| - asahi: Remove AGX_FAKE_DEVICE environment variable |
| - asahi: Precompile for shader-db |
| - asahi: Route shader-db stats to debug callback |
| - asahi: Identify "Uniform high" USC word |
| - asahi: Identify more shader-related fields |
| - asahi: Remove "padding" field |
| - asahi: Make agx_varyings a union |
| - asahi: Use USC_UNIFORM_HIGH |
| - asahi: Rename LOD clamps tests to fit other packing |
| - asahi: Don't use unnecessary test fixture |
| - asahi: Add group tests |
| - asahi: Support preamble shaders |
| - agx: Be less sloppy about high uniforms |
| - agx: Check that we don't push too much |
| - agx: Make push_count public |
| - agx: Handle multiple functions |
| - agx: Align preamble shaders |
| - agx: Add uniform_store instruction |
| - agx: Allow larger indices for ld/st |
| - agx: Strengthen assert for packing ld/st instructions |
| - agx: Implement scalar load/store_preamble |
| - agx: Handle uniforms passed to COLLECT |
| - agx: Restrict copyprop of uniforms |
| - agx: Handle 64-bit moves |
| - agx: Avoid reading high uniforms from device_load |
| - agx: Report GPRs to the driver |
| - asahi: Set uniform counts accurately |
| - asahi: Set GPR count accurately |
| - agx: Add agx_nir_opt_preamble pass |
| - agx: Use agx_nir_opt_preamble |
| - panfrost: Don't use lower_wpos_pntc on Midgard |
| - panfrost: Use proper formats for pntc varying |
| - etnaviv: Use correct idiv lowering |
| - nir/lower_idiv: Remove imprecise_32bit_lowering |
| - nir/lower_idiv: Inline convert_instr_precise |
| - panfrost: Zero polygon list for fragment-only |
| - panfrost: Don't allow VS side effects on midgard |
| - pan/bi: Clean up sysval handling a bit |
| - pan/mdg: Fix 64-bit address arithmetic |
| - panfrost: Use compute-based XFB on Midgard |
| - panfrost: Avoid a XFB special case |
| - panfrost: Lower MAX_BLOCK_SIZE on Midgard |
| - pan/mdg: Don't schedule across memory barrier |
| - panfrost,asahi: Support ARB_buffer_storage |
| - gallium: Stub support for Asahi + DRM |
| - gallium: Only use Asahi's software path on macOS |
| - panfrost/ci: Disable trace-based testing |
| - panfrost: Enable rendering to 16-bit and 32-bit |
| - panfrost: Add lots of perf_debug annotations |
| - asahi: Identify counts for compute kernels |
| - softpipe: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT |
| - i915g: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT |
| - etnaviv: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT |
| - gallium: Default to PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT |
| - nir/opt_algebraic: Fuse c - a * b to FMA |
| - pan/bi: Don't lower outputs for compute |
| - panfrost: Don't use nir_variable in the compilers |
| - pan/mdg: Disassemble the .a32 bit |
| - pan/mdg: Don't pair ST_VARY.a32 with other instrs |
| - pan/mdg: Use .u32 for flat shading |
| - pan/mdg: Use bifrost_nir_lower_store_component |
| - panfrost: Don't allocate space for empty varyings |
| - panfrost: Don't use nir_variable to link varyings |
| - panfrost: Delete set_global_resources |
| - panfrost: Move small compute functions to pan_context.c |
| - panfrost: Remove bogus assert |
| - panfrost: Consolidate all shader compiling code |
| - panfrost: Merge pan_assemble.c into pan_shader.c |
| - panfrost: Remove unused req_input_mem copy |
| - panfrost: Rename structs to panfrost_(un)compiled_shader |
| - panfrost: Remove uncompiled_shader->active_variant |
| - panfrost: Use u_dynarray for variants |
| - panfrost: Make fixed_varying_mask a fragment-only key |
| - panfrost: Precompile transform feedback program |
| - panfrost: Precompile shaders |
| - panfrost: Implement a disk cache |
| - docs/new_features: Add Panfrost disk cache |
| - panfrost: Don't copy resources if replaced |
| - panfrost: Replace resource shadowing flush |
| - panfrost: Fix build with Perfetto (again) |
| - panfrost: Copy resources when necessary |
| - panfrost: Require 64-byte alignment on imports |
| - panfrost: Fix reference counting with batch->resources |
| - panfrost: Use PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY |
| - pan/mdg: Fix out-of-order execution |
| - panfrost: Revert "Require 64-byte alignment on imports" |
| |
| Andri Yngvason (1): |
| |
| - gallium/vl: Add opaque rgb pixel formats |
| |
| Anton Bambura (1): |
| |
| - panfrost: Enable Mali-T620 |
| |
| António Monteiro (2): |
| |
| - freedreno: get timestamp from os_time_get_nano instead of os_time_get |
| - gallium/util: remove network class |
| |
| Axel Davy (6): |
| |
| - frontend/nine: Skip invalid swvp calls |
| - frontend/nine: Fix buffer tracking out of bounds |
| - frontend/nine: Fix ATOC handling |
| - frontend/nine: Fix cso restore bug |
| - frontend/nine: Fix shader multi-use crash |
| - frontend/nine: Fix ff position_t fallback when w = 0 |
| |
| Bas Nieuwenhuizen (56): |
| |
| - vulkan/wsi: Take max extent into consideration for modifier selection. |
| - radv: Remove redundant radv_QueuePresentKHR. |
| - radv: Add binding code for 3d sparse images. |
| - radv: Add 3d tile shapes for sparse binding. |
| - radv: Expose 3d sparse images. |
| - amd/common: Don't rely on DCN support checks with modifiers. |
| - amd/common: Disable DCC retile modifiers on RDNA1 |
| - radv: Don't flatten bottom AS exit if statement. |
| - radv: Store top of stack in a register. |
| - Revert "radv/rt: use derefs for the traversal stack" |
| - radv: Use constant for ray traversal exit condition. |
| - radv: Use nested ifs for pushing child nodes in traversal loop. |
| - radv: Add scratch stack to reduce LDS stack in RT traversal. |
| - ac/surface: Fix some warnings. |
| - radv: Hardcode root node id. |
| - radv: Translate the BVH copy shader to glsl from nir_builder. |
| - radv: Remove aabb bounds from instance nodes. |
| - radv: Use proper matrices for instance nodes. |
| - radv: Use GLSL matrices for instance transforms in BVH. |
| - radv: Use deterministic order for dumping acceleration stuctures. |
| - radv: Properly initialize all memory in RRA dumps. |
| - radv: Do an early check of distance in triangle culling. |
| - vulkan/wsi/x11: Fix double free of modifiers. |
| - radv: Clean up unused fields in BVH IR. |
| - radv: Add a field for the offset of the bvh in the blas. |
| - radv: Simplify buffer copy address generation. |
| - radv: Start the BVH after the header. |
| - radv: Generate parent links in BVH. |
| - radv: Only emit parents from parents that actually end up in the tree. |
| - radv: Split global & local bvh node variable. |
| - radv: Add traversal backtracking with a short stack. |
| - radv: Properly annotate all the invalid node usage. |
| - radv: Implement pipeline-wide skiptriangles/skipaabbs. |
| - radv: Consider inactive internal nodes never in the tree. |
| - radv: Constify entry_size. |
| - radv: Use a VkPipelineCache handle for meta shaders. |
| - radv: Rework modification detection of the meta cache. |
| - radv: Mark dEQP-VK.ray_query.misc.dynamic_indexing as crashing in CI. |
| - aco: Pre-split result of bvh64_intersect_ray_amd. |
| - radv: Improve gfx11 clear colors for DCC. |
| - radv: Use correct primgrp size for gfx11. |
| - radv: Add BVH IR header. |
| - radv: Make the number of internal nodes be written on the GPU. |
| - radv: Add PLOC shader |
| - radv: Use PLOC for BVH building |
| - radv: Speculatively tune RT pipelines for GFX11. |
| - radv: Use correct types for loading the rings with LLVM. |
| - radv: Handle GSVS ring intrinsic correctly with LLVM. |
| - radv: Handle attribute ring intrinsic correctly with LLVM. |
| - radv: Fix compute scratch buffer emission. |
| - radv: Make the compute scratch waves per SE as well. |
| - radv: Fix custom build id with C90 compilation. |
| - radv: Use correct init order for ETC2 image views |
| - radv: Fix sampler types in ETC2 decode. |
| - vulkan: Remove asserts that weren't valid for RADV ETC2 emulation. |
| - radv: reserve space for the scissor in vkCmdBeginRendering. |
| |
| Benjamin Tissoires (1): |
| |
| - CI: convert to use the new S3 server instead of the legacy minio |
| |
| Boris Brezillon (1): |
| |
| - ci/panvk: Skip dEQP-VK.api.object_management.max_concurrent.query_pool |
| |
| Brian Paul (113): |
| |
| - st_pbo/compute: fix memset() warning |
| - lavapipe: s/u_foreach_bit/u_foreach_bit64/ in handle_pipeline_access() |
| - llvmpipe: further bump LP_MAX_TGSI_SHADER_IMAGES to 64 |
| - llvmpipe: always pass non-zero writemask to assign_reg() |
| - util: allow GALLIUM_LOG_FILE=stdout |
| - gallivm: move lp_build_nir_aos_context declaration, etc |
| - gallivm: change texture/sampler_index params to unsigned |
| - llvmpipe: asst. formatting, clean-ups in lp_state_fs.c |
| - gallivm: asst. clean-ups in lp_bld_sample_soa.c |
| - gallivm: fix nir AOS swizzling issues |
| - lavapipe: remove continue statements in emit_state() to be more consistent |
| - lavapipe: zero-init sampler objects |
| - cso: asst. clean-ups in cso_hash.[ch] |
| - cso: use util_bitcount |
| - cso: asst. clean-ups in cso_cache.[ch] |
| - cso: asst. clean-ups in cso_context.[ch] |
| - draw: asst. clean-ups in draw_context.[ch] |
| - draw: asst. clean-ups in draw_vs_llvm.c |
| - draw: asst. clean-ups in draw_vs_variant.c |
| - draw: asst. clean-ups in draw_vs.[ch] |
| - draw: remove goto in llvm_pipeline_generic() |
| - draw: remove unused draw_has_llvm() |
| - draw: asst. clean-ups in draw_pipe.h |
| - draw: asst. clean-ups in draw_pt_fetch.c |
| - draw: asst. clean-ups in draw_pt_post_vs.c |
| - draw: asst. clean-ups in draw_pipe_cull.c |
| - draw: asst. clean-ups in draw_pipe_clip.c |
| - draw: asst. clean-ups in draw_pipe_flatshade.c |
| - draw: asst. clean-ups in draw_pipe_offset.c |
| - draw: asst. clean-ups in draw_pipe_pstipple.c |
| - draw: asst. clean-ups in draw_pipe_twoside.c |
| - draw: asst. clean-ups in draw_pipe_unfilled.c |
| - draw: asst. clean-ups in draw_pipe_user_cull.c |
| - draw: asst. clean-ups in draw_pipe_util.c |
| - draw: asst. clean-ups in draw_pipe_wide_line.c |
| - draw: asst. clean-ups in draw_pipe_wide_point.c |
| - draw: asst. clean-ups in draw_vertex.[ch] |
| - draw: asst. clean-ups in draw_pt_so_emit.c |
| - draw: asst. clean-ups in draw_pipe_aapoint.c |
| - draw: asst. clean-ups in draw_llvm.c |
| - draw: asst. clean-ups in draw_cliptest_tmp.h |
| - draw: asst. clean-ups in draw_gs.[ch] |
| - draw: asst. clean-ups in draw_prim_assembler.[ch] |
| - draw: asst. clean-ups in draw_pipe.c |
| - draw: asst. clean-ups in draw_pt_vsplit_tmp.h |
| - draw: asst. clean-ups in draw_split_tmp.h |
| - draw: asst. clean-ups in draw_pt.c |
| - draw: clean-up count assignment in draw_pt_arrays() |
| - draw: asst. clean-ups in draw_private.h |
| - draw: asst. clean-ups in draw_pt_fetch_shade_pipeline_llvm.c |
| - draw: replace double conditional expression with if/else |
| - draw: make draw_geometry_shader_run() return void |
| - draw: fix some const-correctness in draw_vbo() |
| - draw: move vs_slot var in tgsi/llvm_fetch_gs_input() |
| - draw: s/start_or_maxelt/start/ in llvm VS code |
| - draw: fix vertex id offset bug |
| - draw: fix base vertex bug |
| - llvmpipe: asst. clean-ups in lp_rast.h |
| - llvmpipe: asst. clean-ups in lp_rast_debug.c |
| - llvmpipe: asst. clean-ups in lp_rast_priv.h |
| - llvmpipe: asst. clean-ups in lp_rast_rect.c |
| - llvmpipe: asst. clean-ups in lp_rast.c |
| - llvmpipe: asst. clean-ups in lp_rast_linear.c |
| - llvmpipe: asst. clean-ups in lp_rast_linear_fallback.c |
| - gallium: whitespace fixes in p_context.h, p_screen.h |
| - llvmpipe: asst. clean-ups in lp_context.[ch] |
| - llvmpipe: asst. clean-ups in lp_surface.c |
| - llvmpipe: asst. clean-ups in lp_screen.[ch] |
| - llvmpipe: asst. clean-ups in lp_query.c |
| - llvmpipe: asst. clean-ups in lp_setup.c |
| - llvmpipe: asst. clean-ups in lp_setup_rect.c |
| - llvmpipe: asst. clean-ups in lp_setup_vbuf.c |
| - llvmpipe: asst. clean-ups in lp_setup_line.c |
| - llvmpipe: asst. clean-ups in lp_setup_point.c |
| - llvmpipe: remove gotos in do_rect_ccw() |
| - llvmpipe: asst. clean-ups in lp_setup_tri.c |
| - llvmpipe: asst. clean-ups in lp_bld_alpha.c |
| - llvmpipe: asst. clean-ups in lp_bld_blend_aos.c |
| - llvmpipe: asst. clean-ups in lp_bld_depth.c |
| - llvmpipe: asst. clean-ups in lp_bld_interp.c |
| - llvmpipe: asst. clean-ups in lp_debug.h |
| - llvmpipe: asst. clean-ups in lp_draw_arrays.c |
| - llvmpipe: asst. clean-ups in lp_fence.c |
| - llvmpipe: asst. clean-ups in lp_flush.[ch] |
| - llvmpipe: asst. clean-ups in lp_linear_interp.c |
| - llvmpipe: asst. clean-ups in lp_linear_sampler.c |
| - llvmpipe: asst. clean-ups in lp_scene.c |
| - llvmpipe: asst. clean-ups in lp_scene.h |
| - llvmpipe: asst. clean-ups in lp_state_sampler.c |
| - llvmpipe: asst. clean-ups in lp_state_derived.c |
| - llvmpipe: asst. clean-ups in lp_state_clip.c |
| - llvmpipe: asst. clean-ups in lp_state_cs.c |
| - llvmpipe: asst. clean-ups in lp_state_gs.c |
| - llvmpipe: asst. clean-ups in lp_state_rasterizer.c |
| - llvmpipe: asst. clean-ups in lp_state_so.c |
| - llvmpipe: asst. clean-ups in lp_state_surface.c |
| - llvmpipe: asst. clean-ups in lp_state_tess.c |
| - llvmpipe: asst. clean-ups in lp_state_vertex.c |
| - llvmpipe: asst. clean-ups in lp_test_blend.c |
| - llvmpipe: asst. clean-ups in lp_test_conv.c |
| - llvmpipe: asst. clean-ups in lp_test_format.c |
| - llvmpipe: asst. clean-ups in lp_test_main.c |
| - llvmpipe: asst. clean-ups in lp_tex_sample.c |
| - llvmpipe: asst. clean-ups in lp_texture.[ch] |
| - llvmpipe: improve some if/switch code in llvmpipe_get_shader_param() |
| - frontend/dri: assorted clean-ups in dri-screen.c |
| - frontends/dri: clean-ups in dri_util.c |
| - glx: clean-ups in create_context.c |
| - glx: clean-ups in drisw_glx.c |
| - llvmpipe: s/uint/enum pipe_prim_type/ in lp_setup_context.h |
| - llvmpipe: fix comment typo |
| - llvmpipe: asst. clean-ups in lp_state_fs.c |
| - llvmpipe: check rectangle vertices have equal W components |
| |
| Caio Oliveira (38): |
| |
| - intel/compiler: Print more details when fs_visitor::validate() fails |
| - intel/compiler: Call get_mesh_urb_handle one level up in call-stack |
| - intel/compiler: Get URB handle in emit_task/mesh_intrinsic functions |
| - intel/compiler: Make component() work for FIXED_GRF/ARF |
| - intel/compiler: Remove INTEL_DEBUG=tcs8 |
| - intel/compiler: Rename 8_PATCH to MULTI_PATCH |
| - intel/compiler: Use fs_reg helpers for TCS icp_handle selection |
| - intel/compiler: Use fs_reg helpers for GS icp_handle selection |
| - intel/compiler: Create fs_visitor::emit_tcs_barrier() |
| - intel/compiler: Add helper for barrier message payload setup for gfx >= 125 |
| - intel/compiler: Use builder to allocate fs regs for TCS store output |
| - intel/compiler: Make a type for Thread Payload and FS variant |
| - intel/compiler: Use FS thread payload only for FS |
| - intel/compiler: Create struct for TCS thread payload |
| - intel/compiler: Store Patch URB output in TCS thread payload struct |
| - intel/compiler: Store Primitive ID in TCS thread payload struct |
| - intel/compiler: Store start of ICP handles in TCS thread payload struct |
| - intel/compiler: Create and use struct for TES thread payload |
| - intel/compiler: Create and use struct for VS thread payload |
| - intel/compiler: Create and use struct for TASK and MESH thread payloads |
| - intel/compiler: Create and use struct for GS thread payload |
| - intel/compiler: Store start of ICP handles in GS thread payload struct |
| - intel/compiler: Create and use struct for Bindless thread payload |
| - intel/compiler: Export brw_get_subgroup_id_param_index() |
| - intel/compiler: Create and use struct for CS thread payload |
| - intel/compiler: Make thread_payload struct abstract |
| - intel/compiler: Add a few more brw_ud* helpers |
| - intel/compiler: Use brw_ud* helpers in thread payload code |
| - nir/lower_task_shader: Don't fail adding a launch when last instruction is a jump |
| - spirv: Pass vtn_block into vtn_emit_branch() |
| - spirv: Add workaround to ignore OpReturn after OpEmitMeshTasksEXT |
| - spirv: Call vtn_emit_ret_store() only when handling return branch |
| - spirv: Make a helper function to check glslang/shaderc |
| - spirv: Enable OpEmitMeshTasksEXT workaround for Clay Shader Compiler |
| - intel/compiler: Explicitly include build-id when linking intel_clc |
| - nir: Don't reorder volatile intrinsics |
| - intel/compiler: Fix dynarray usage in intel_clc |
| - intel/compiler: Fix missing tie-breaker in brw_nir_analyze_ubo_ranges() ordering code |
| |
| Caleb Cornett (3): |
| |
| - wgl: Homogenize read framebuffer reference counting logic |
| - wgl: Always release local references in stw_make_current_by_handles |
| - wgl: Release the context before deleting it in wglDeleteContext |
| |
| Chad Versace (43): |
| |
| - venus: Remove cryptic pointer arithmetic on present attachments |
| - venus: Drop vn_present_src_attachment::acquire |
| - venus: Track subpass properties |
| - venus/pipeline: Refactor vn_fix_graphics_pipeline_create_info |
| - venus/pipeline: Add pipeline state bools for ignore rules |
| - venus/pipeline: Split ignore rules for pViewportState, pMultisampleState |
| - venus/pipeline: Fix ignore rule for pDepthStencilState |
| - venus/pipeline: Fix ignore rule for pColorBlendState |
| - venus/pipeline: Ignore pViewportState::pViewports |
| - venus/pipeline: Ignore pViewportState::pScissors |
| - venus/pipeline: Ignore basePipelineHandle |
| - venus/pipeline: Fix ignore rules for VK_KHR_dynamic_rendering |
| - venus/pipeline: Fix ignore rules for VK_EXT_extended_dynamic_state |
| - venus: Dedupe pipeline handle creation |
| - venus: Fix failure path on pipeline creation |
| - venus: Enable VK_EXT_pipeline_creation_cache_control |
| - venus: Sort passthrough extensions |
| - vulkan: Make vk_copy_struct_guts() public |
| - venus: Simplify vn_GetPhysicalDeviceFeatures2 |
| - venus: Simplify vn_GetPhysicalDeviceProperties2 |
| - venus: Enable VK_KHR_shader_integer_dot_product |
| - venus: Enable VK_KHR_shader_non_semantic_info |
| - venus: Enable VK_KHR_shader_terminate_invocation |
| - venus: Enable VK_KHR_zero_initialize_workgroup_memory |
| - venus: Enable VK_EXT_subgroup_size_control |
| - venus: Enable VK_EXT_texel_buffer_alignment |
| - venus: Enable VK_EXT_ycbcr_2plane_444_formats |
| - venus: Enable VK_KHR_format_feature_flags2 |
| - venus: Enable VK_EXT_private_data |
| - venus: Enable VK_EXT_tooling_info |
| - venus: Document VK_EXT_texture_compression_astc_hdr support |
| - venus: Enable VK_EXT_pipeline_creation_feedback |
| - venus: Rename some feature/property structs |
| - venus: Refactor VN_ADD_TO_PNEXT |
| - venus: Add macros VN_SET_CORE_* |
| - venus: Fix features/properties for unavailable extensions |
| - venus: Use VkPhysicalDeviceVulkan13{Features,Properties} |
| - venus/queue: Rename some local vars |
| - venus: Change signature of vn_get_fence_feedback_cmd |
| - venus: Add vn_queue_wait_idle_before_present() |
| - venus: Refactor vn_command_buffer_builder temp storage |
| - venus: Enable VK_KHR_synchronization2 |
| - venus: Enable Vulkan 1.3 |
| |
| Charmaine Lee (6): |
| |
| - svga: fix mksstats build |
| - svga: fix SetConstantBufferOffset command opcode |
| - svga: compare graphics shader stages against COMPUTE |
| - mesa/st: fix reference to nir->info after nir_to_tgsi |
| - svga: support TGSI_SEMANTIC_TEXCOORD in swtnl draw context |
| - svga: fix invalid component access of domain location |
| |
| Chia-I Wu (83): |
| |
| - vulkan: sort ALLOWED_ANDROID_VERSION by api levels |
| - vulkan: fix ALLOWED_ANDROID_VERSION for api level 26 |
| - vulkan: fix ALLOWED_ANDROID_VERSION up to api level 29 |
| - vulkan: extend ALLOWED_ANDROID_VERSION up to api level 31 |
| - turnip: add tu_common.h as the common header |
| - turnip: remove includes that are already in tu_common.h |
| - turnip: add tu_drm.h |
| - turnip: add tu_suballoc.h |
| - turnip: update tu_cs.h |
| - turnip: add tu_query.h |
| - turnip: add tu_image.h |
| - turnip: add tu_formats.h |
| - turnip: update tu_descriptor_set.h |
| - turnip: add tu_shader.h |
| - turnip: add tu_pipeline.h |
| - turnip: add tu_clear_blit.h |
| - turnip: add tu_dynamic_rendering.h |
| - turnip: add tu_lrz.h |
| - turnip: add tu_pass.h |
| - turnip: add tu_wsi.h |
| - turnip: update tu_autotune.h |
| - turnip: add tu_device.h |
| - turnip: add tu_cmd_buffer.h |
| - turnip: add tu_android.h |
| - turnip: update tu_util.h |
| - turnip: move away from tu_private.h |
| - turnip: remove tu_private.h |
| - turnip: remove headers from libtu_files |
| - turnip: use SPDX-License-Identifier |
| - vulkan: disallow VK_KHR_performance_query on android |
| - turnip: fix a use-after-free in autotune |
| - turnip: remove unused member in tu_submission_data |
| - turnip: handle fence wrapping in autotune |
| - freedreno/pps: improve interaction with turnip |
| - turnip: lower the queue priority to 1 |
| - turnip: pCreateInfo->pPoolSizes[i] is wordy |
| - turnip: trim down host size of tu_descriptor_pool |
| - turnip: trim down bo size of tu_descriptor_pool |
| - turnip: improve perfetto sync_timestamp |
| - turnip: fix a missing trace_end_gmem_clear |
| - turnip: move trace_start_gmem_store before cond exec |
| - turnip: fix gem_store tracepoint |
| - turnip: tidy up tracepoint header includes |
| - util/u_trace: include the generated header first |
| - util/u_trace: add PERFETTO HeaderScope |
| - turnip: add tu_clone_trace_range helper |
| - turnip: convert tu_perfetto_state to a stack |
| - turnip: clean up tu_perfetto.h |
| - turnip: rename some tracing stages |
| - turnip: add cmd_buffer tracepoint |
| - turnip: improve tracing of secondary cmd buffers |
| - ir3: fix predicate splitting in scheduler |
| - util/perf: move u_perfetto to here |
| - util/perf: add cpu_trace.h |
| - util/perf: fix multiple tracepoints in a scope |
| - util/perfetto: add a simple C wrapper for track events |
| - util/perf: support and prefer perfetto for cpu trace |
| - venus: use MESA_TRACE_* |
| - pps: enable track_event in system.cfg |
| - docs: update perfetto with the latest status |
| - vulkan/wsi: add cpu tracepoints to swapchain functions |
| - vulkan/wsi: add cpu tracepoints for internal waits |
| - turnip: add some cpu tracepionts |
| - freedreno: get cpu timestamp again after FD_TIMESTAMP |
| - util/perf: ignore ut->enabled for iterators |
| - util/perf: remove u_trace::enabled |
| - util/perf: add u_trace_instrument |
| - util/perf: simplify u_trace_instrument |
| - util/perf: use ALWAYS_INLINE for tracepoints |
| - turnip: reduce MAX_STORAGE_BUFFER_RANGE |
| - vulkan: update ALLOWED_ANDROID_VERSION for api level 33 |
| - subprojects: uprev perfetto to v29.0 |
| - turnip: fix error handling for tu_queue_init |
| - turnip: add tu_physical_device::submitqueue_priority_count |
| - turnip: add support for VK_KHR_global_priority |
| - turnip: use vk_buffer |
| - turnip: use vk_descriptor_set_layout |
| - vulkan: update comments to device enumeration callbacks |
| - turnip: fix kgsl tu_enumerate_devices return code |
| - turnip: advertise VkExternalFenceProperties correctly |
| - venus: fix VkImageFormatListCreateInfo for AHB |
| - freedreno/pps: use 64-bit reads when possible |
| - freedreno/pps: loop countables by references |
| |
| Christian Gmeiner (18): |
| |
| - lima/ci: There is no need to redefine HWCI_TEST_SCRIPT |
| - lima/ci: Make use of .lava-piglit |
| - isaspec: Move isa_decode(..) declaration |
| - ci: include etnaviv support in ARM64 container |
| - ci: switch to mkbootimg.py |
| - ci/bare-metal: introduce BM_MKBOOT_PARAMS |
| - ci/etnaviv: add GC7000 support |
| - etnaviv: Remove possibility to force MSAA |
| - etnaviv: rs: Color resolve works |
| - etnaviv: rs: Directly call etna_try_rs_blit(..) |
| - etnaviv: rs: msaa: Set cache mode for SMALL_MSAA |
| - etnaviv: Update headers from rnndb |
| - etnaviv: blt: Add color resolve support |
| - etnaviv: blt: Directly call etna_try_blt_blit(..) |
| - Revert "etnaviv: completely turn off MSAA" |
| - etnaviv: Support negative float inline immediates |
| - etnaviv: nir: lower extract byte/word |
| - etnaviv: Hide MSAA support behind debug flag |
| |
| Christophe (2): |
| |
| - zink: add profile documentation |
| - Zink: add Zink profiles file |
| |
| Connor Abbott (49): |
| |
| - tu: Decouple vertex input state from shader |
| - tu: Implement VK_EXT_vertex_input_dynamic_state |
| - tu: Fix sysmem depth attachment clear flushing |
| - ir3/spill: Fix extracting from a vector at the end of a block |
| - nir: Free instructions more often |
| - tu/lrz: Fix multiple subpass case with secondaries |
| - tu/lrz: Fix multiple depth attachment case with secondaries |
| - tu: Fix descriptor set size bounds |
| - tu: Don't preload variable-count descriptors |
| - tu: Initial implementation of VK_EXT_inline_uniform_block |
| - tu: Also set SP_FS_MRT_REG for unused attachments |
| - tu: Fix case where vertex input is set but not vertex buffers |
| - tu: Remove unused need_indirect_descriptor_sets |
| - tu: Group pipeline state based on VkPipeline*StateCreateInfo |
| - tu: Rename RB_MSAA_CNTL to RB_BLIT_GMEM_MSAA_CNTL |
| - tu: Always set RB_BLIT_GMEM_MSAA_CNTL when clearing GMEM |
| - tu: Make MSAA emission always dynamic |
| - tu: Split out primtype calculation for tess |
| - ir3: Don't use msaa key for a6xx+ |
| - tu: Nuke tu_pipeline_cache |
| - tu: Don't use layer_zero/view_zero in shader key |
| - tu, ir3: Move multi_pos_output to ir3_shader_variant |
| - tu: Don't use output state to compute render components |
| - tu, ir3: Determine dual-src blend from shader for FS outputs |
| - tu: Emit \*_OUTPUT_CNTL1 as part of blend state |
| - tu: Move no_earlyz computation to blend/msaa state |
| - tu: Split up prim order computation |
| - tu: Use vk_pipeline_shader_stage_to_nir |
| - tu: Abstract driver-specific const state |
| - tu: Initial implementation of VK_EXT_graphics_pipeline_library |
| - tu: Fix maxPerStageDescriptorUpdateAfterBindInputAttachments |
| - tu: Set textures_used for input attachments correctly |
| - ir3/analyze_ubo_ranges: Account for reserved consts |
| - Revert "freedreno,ir3: rename Z_CLAMP_ENABLE to Z_CLIP_DISABLE" |
| - freedreno/a6xx, tu: GRAS_CL_CNTL::UNK5 is Z_CLAMP_ENABLE |
| - tu: Fix setting RB_DEPTH_CNTL::Z_CLAMP_ENABLE |
| - ir3: Delete outputs from fixup_regfootprint() |
| - ir3: Move fixup_regfootprint() to ir3_collect_info() |
| - ir3/analyze_ubo_ranges: Account for reserved consts better |
| - ir3: Add missing cat5 encoding to asm parser |
| - tu: Fix param_stride placement |
| - tu: DS primitive stride does not use patch control points |
| - tu: Implement extendedDynamicState2PatchControlPoints |
| - freedreno: Add LABEL flush |
| - tu: Add compute shader instrlen workaround |
| - tu: tu: Clear patchControlPoints dirty state with static patchControlPoints |
| - tu: Use right enum for compute active_shader_stages |
| - tu: Fix binding NULL descriptor sets |
| - ir3: Don't save/restore disasm string pointer |
| |
| Constantine Shablya (3): |
| |
| - nir: add a pass to remove non-uniform access qualifier when the operands are uniform |
| - radv: use nir_opt_uniform_access |
| - anv: use nir_opt_uniform_access |
| |
| Corentin Noël (9): |
| |
| - virgl/ci: Update virglrenderer |
| - ci: disable the freedreno farm. |
| - virgl/ci: Update virglrenderer |
| - gallivm: avoid the use of an uninitialized value |
| - virgl: Add TEXTURE_SHADOW_LOD capability support |
| - virgl: Add nir_lower_tex pass |
| - mesa/main: do not copy the exact size of the string |
| - gallium/hud: use snprintf(..., "%s", ...) instead of strncpy |
| - ci: Remove Wno-error=stringop-truncation from debian-arm64-asan |
| |
| Damian Korczowski (1): |
| |
| - d3d12: Fix placed footprint of subresources |
| |
| Daniel Schürmann (20): |
| |
| - nir/opt_algebraic: a & ~a -> 0 |
| - aco: use std::vector::reserve() more often |
| - aco/live_var_analysis: implement faster merging of live_out sets for some cases |
| - aco/optimizer: disallow can_eliminate_and_exec() with s_not |
| - aco/optimizer: do can_eliminate_and_exec() optimization later |
| - aco/optimizer: optimize s_and(exec, s_and(x, y)) more aggressively |
| - aco/optimizer: change inverse_comparison in-place |
| - radv/rt: create separate radv_rt_pipeline struct |
| - radv/rt: inline radv_rt_pipeline_create_() helper into radv_rt_pipeline_create() |
| - aco: simplify operands_offset calculation in create_instruction() |
| - aco: implement custom memory resource |
| - aco: use monotonic_buffer_resource for instructions |
| - aco: implement allocator_traits for monotonic_allocator<T> |
| - aco/opt_value_numbering: use monotonic_allocator for unordered_map |
| - aco/spill: Fix spilling of Phi operands |
| - aco/ra: only rename fixed Operands if the copy-location matches |
| - aco: change thread_local memory resource to pointer |
| - nir: add AMD RT traversal intrinsics |
| - radv/rt: create traversal shader independent from main shader |
| - radv/rt: overwrite hit args with undef in case of a miss |
| |
| Daniel Stone (4): |
| |
| - ci: Use mold for x86-64 and AArch64 builds |
| - CI: Collabora farm down for maintenance |
| - CI: Re-enable Collabora devices |
| - Revert "panfrost/ci: Disable t720 jobs" |
| |
| Danilo Krummrich (10): |
| |
| - nv50/ir/nir: add conversion ops for bit width < 32 |
| - nv50/ir/nir: convert to 32 bit for all OP_SET opcodes |
| - nv50/ir/nir: avoid 8/16 bit dest regs for OP_MOV |
| - nv50/ir: add isUnsignedIntType() and isIntType() helpers |
| - nv50/ir/nir: convert 8/16 bit src to 32 bit for {i,u}2f64 |
| - nv50/ir/nir: always round towards zero for f2i/f2u |
| - nv50/ir: add intermediate conversion for f2{i,u}{8,16} |
| - nv50/ir: split and cvt 64bit integers for {i,u}2{i,u}{8,16} |
| - nv50/ir: handle S8/S16 integers converting to S64 |
| - nv50/ir: handle U8/U16 integers converting to U64 |
| |
| Danylo Piliaiev (26): |
| |
| - tu: Flush depth on depth img transition from undef layout |
| - tu: Disable LRZ write when alpha-to-coverage is enabled |
| - freedreno: Disable LRZ write when alpha-to-coverage is enabled |
| - tu: Assert that if draw state is enabled it has valid iova |
| - tu: Switch to userspace iova allocations if kernel supports it |
| - tu: Enable bufferDeviceAddressCaptureReplay |
| - tu: Disable userspace iova allocations |
| - tu: Implement VK_EXT_attachment_feedback_loop_layout |
| - freedreno: PC_SO_STREAM_CNTL_STREAM_ENABLE has per-stream enable bits |
| - tu: Fix streamout with tess_use_shared |
| - tu: Update HS_WAVE_INPUT_SIZE formula |
| - util/u_trace: Add locking for ctx_list |
| - freedreno: Name more \*_DBG_ECO_CNTL regs |
| - freedreno: Add all variable magic regs to device-info tables |
| - tu: Use newly obtained magic reg values |
| - tu: Trivially expose VK_EXT_rasterization_order_attachment_access |
| - ir3: Prevent reordering movmsk with kill |
| - turnip: implement VK_EXT_multi_draw |
| - turnip: Implement VK_EXT_mutable_descriptor_type |
| - tu: Expose Vulkan 1.3 |
| - tu: Do not DCE unused output vars used for transform feedback |
| - tu: Lazily init VSC to fix dynamic rendering in secondary cmdbufs |
| - tu: Reset whether there is DS resolve for dynamic subpass |
| - tu: Fix the size of patch control points state |
| - freedreno: Add 'replay' tool which allows to replay cmdstreams |
| - freedreno/fdl: Increase alignment for UBWC images |
| |
| Dave Airlie (85): |
| |
| - gallivm: fix printf hook for cached shaders. |
| - llvmpipe: add shader clock support |
| - lavapipe: enable shader clock |
| - draw: fix up jit type creation for gs/tcs/tes |
| - llvmpipe/gallivm/draw: introduce a buffer type. |
| - zink: add defines for the shader stage templates. |
| - virgl: abstract virgl shader stages from pipe shader stages. |
| - svga: compare shader type against compute not tess eval. |
| - gallium/tgsi: reorder pipe shader type defines. |
| - gallium/iris/crocus: collapse a bunch of conversion functions. |
| - gallium: fixup some inconsistent uses of enum pipe_shader_type. |
| - gallium: use gl shader types as the basis for the gallium ones |
| - gallivm/nir/st: lower image derefs in advance. |
| - lavapipe: lower images to non-derefs in vulkan side |
| - lavapipe: scan shader for info before lowering images. |
| - llvmpipe/ci: update ci results for clover. |
| - draw: don't touch info values that aren't valid. |
| - nir_to_tgsi_info: drop const_buffers_declared |
| - crocus: sync performance monitor code with iris. |
| - vulkan: update rest of the headers to v1.3.225 |
| - gallivm: drop unused parameter to lp_build_sample_aos |
| - gallivm/sample: remove unused base parameter from dynamic callbacks. |
| - gallivm: don't indirect image/sampler destroy. |
| - llvmpipe: finish rendering before flushing frontbuffer resources. |
| - radv: move to using common record_result. |
| - turnip: use common command record result. |
| - panvk: use common command buffer record result. |
| - dozen: move to use common command buffer result code. |
| - lavapipe: fix 3d depth stencil image clearing. |
| - gallivm: add atomic 32-bit float support |
| - llvmpipe: export GL_NV_shader_atomic_float |
| - lavapipe: export VK_KHR_shader_atomic_float |
| - docs: add new llvmpipe/lavapipe atomic float extensions |
| - lavapipe: use slow clear path if conditional rendering. |
| - llvmpipe: handle different blocksizes for sampler/image views |
| - lavapipe: take layers into account around render surface creation. |
| - lavapipe: handle view mask layer count properly for begin rendering. |
| - llvmpipe: memset empty images in the key |
| - llvmpipe: bind sampler views/images properly. |
| - gallivm: handle llvm coroutines for llvm > 15 |
| - gallivm: fix buffer_type for LLVM 15 |
| - lavapipe: add fmin/fmax to image lowering. |
| - gallivm/nir: fix fmin/fmax translation |
| - gallivm/nir: bitcast when non-float ptr type. |
| - llvmpipe/fs: start passing explicit context pointer type. |
| - llvmpipe/fs: convert thread data ptr to opaque ptr friendly apis |
| - llvmpipe/fs: convert linear context to opaque pointers friendly |
| - llvmpipe/cs: convert cs context to opaque friendly api |
| - llvmpipe/cs: convert cs thread data to opaque friendly api |
| - llvmpipe/fs: fix invocations access for opaque ptrs. |
| - llvmpipe/setup: move setup code to explicit pointers. |
| - llvmpipe/cs: move compute code to explicit pointer types |
| - gallivm/struct: add opaque ptr friendly pointer accessors. |
| - llvmpipe/fs: add mask_type for mask_store accesses. |
| - llvmpipe/fs: pass explicit mask_type into interp code. |
| - llvmpipe/fs: port depth code to opaque pointer api |
| - llvmpipe/fs: add sample position type to the interp interface |
| - llvmpipe/fs: pass mask type to alpha to coverage handler |
| - llvmpipe/fs: use explicit api in viewport code. |
| - llvmpipe/fs: handle explicit types around blending and c/zs bufs calcs |
| - llvmpipe/fs: cleanup some remaining mask handling and reuse types |
| - llvmpipe/fs: convert outputs handling to explicit api |
| - gallivm/nir: move to explicit pointer interfaces. |
| - gallivm/nir: fixup atomic path for explicit pointers. |
| - gallivm/sample: move border color load to explicit pointers |
| - gallivm/llvmpipe: hand sample position type in for loading. |
| - llvmpipe/fs/linear: port to explicit pointers. |
| - draw: fix gs/tes explicit pointer handling |
| - draw: handle primid for tess being a different pointer type |
| - gallivm/tgsi: port tgsi to explicit types. |
| - gallium/tgsi: handle temps/outputs array. |
| - vulkan: update beta and video headers to 1.3.230 |
| - llvmpipe/tests: port to new pointer interfaces. |
| - gallivm/llvmpipe: add opaque pointers support to sampler |
| - draw/llvmpipe: move texture/sampler/image member load to opaque. |
| - draw/llvmpipe: add way to return pointer types to generic code. |
| - gallivm/sample: use retrieved types to do opaque pointer loads. |
| - gallivm: remove legacy pointer_get apis |
| - gallivm/sample: move some first_level/last_level calcs out |
| - gallivm/nir: drop some unused struct members. |
| - gallivm/sample: refactor multisample offset calcs code. |
| - glsl/types: fix dword slots calc for float16 matricies. |
| - clover/llvm: add kernel arg info for opaque pointers |
| - nir/lower_bool_to_int32: add support for lowering functions. |
| - gallivm: add coro malloc hooks earlier and always. |
| |
| David Heidelberg (71): |
| |
| - ci: compress LAVA rootfs with zstd instead of gzip |
| - ci: replace gzip usage with zstd where posible |
| - ci/bin: ignore Gitlab GQL cache |
| - ci: Turn off the entire Lima farm (devices report out-of-space) |
| - ci: move DXVK instalation outside of x86_test-vk |
| - ci: make shellcheck happy on dxvk script |
| - ci: separate wine setup into own script |
| - ci: Windows runner is experiencing DNS issues; disable Microsoft farm |
| - ci: prepare piglit-traces for WINE and DXVK |
| - ci/lavapipe: implement traces |
| - docs/ci: show how to run traces locally |
| - ci/lavapipe: wine flakes, switch to manual for now |
| - ci: use apt-get in x86_test-vk instead of apt |
| - ci: do not install i386 sub-architecture in VK images |
| - ci: traces: update freedreno and broadcom to brotli compressed traces |
| - ci: GraphQL was already implemented in ci_run_n_monitor.py |
| - ci: implement stress testing jobs in ci_run_n_monitor.py |
| - ci: fix leftover tag in image-tags.yml |
| - ci: rename debian-gallium to debian-build-testing |
| - ci: move lava-test into debian-build-testing |
| - ci: make shellcheck happy about .gitlab-ci/container/ directory |
| - ci: use shellcheck for .gitlab-ci/container/ directory |
| - ci: drop last bits of wine32 |
| - ci: bump cross compilation to LLVM 13 where possible |
| - ci: uprev piglit 2022-08-30 |
| - ci: set venus on lavapipe to manual due to flakes |
| - ci: print env as other setups do |
| - ci/lava: print set-job-env-vars.sh as other setups do |
| - ci/traces: do not keep a cache, which will be wiped a few steps later |
| - ci/test: collapse printing env |
| - ci/lava: collapse printing env |
| - ci/freedreno: convert traces to new YAML format |
| - ci/virgl: convert traces to new YAML format |
| - ci/llvmpipe: convert traces to new YAML format |
| - ci/lavapipe: convert traces to new YAML format |
| - ci/broadcom: convert traces to new YAML format |
| - ci/panfrost: convert traces to new YAML format |
| - ci/radeonsi: convert traces to new YAML format |
| - ci/intel: convert traces to new YAML format |
| - ci/i915: convert traces to new YAML format |
| - ci/crocus: convert traces to new YAML format |
| - ci: check traces.yml files with yamllint |
| - ci: performance traces: make use of no-perf label |
| - ci: export whole results/ directory for all traces jobs |
| - ci: split ci_run_n_monitor into script and shared parts |
| - ci: introduce update_traces_checksum.py |
| - ci: uprev piglit 2022-09-08 |
| - ci: use xargs instead of find -exec |
| - ci: add jq utility |
| - ci/traces: remove first line with YAML version to prevent failure |
| - ci/radeonsi: drop glmark2 terrain trace |
| - ci/panfrost: drop glmark2 terrain trace |
| - ci/intel: drop glmark2 terrain trace |
| - ci/lava: add wine and apitrace into amd64 rootfs container |
| - ci/lava: prepare wineprefix for GL and DXVK |
| - ci/dxvk: fix DXVK 64-bit only wine setup |
| - ci: also handle default wine rootfs (for testing) |
| - ci/traces: prevent showing wine instalation dialog |
| - ci/radeonsi: add traces for radeonsi Zork |
| - ci/amd: move radeonsi in amd directory |
| - ci: uprev DXVK to 1.10.3 |
| - ci/traces: Blender demo (Cube Diorama) flakes on Intel APL |
| - ci/freedreno: add Guilherme Gallo into restricted traces access list |
| - ci/panfrost: deduplicate gitlab-ci.yml |
| - ci/panfrost: enable piglit-gl on g52 again and deparalelize |
| - ci/panfrost: Humus Portal trace got fixed, update checksum |
| - ci/panfrost: re-enable traces on t760 |
| - ci/linker: update mold to 1.6 and enable on s390x |
| - ci/zink: rename zink job to zink-lvp to better describe it |
| - ci/freedreno: disable antichambers trace |
| - ci/update_traces_checksum.py: check if checksum is in the array, not it's value |
| |
| David Riley (1): |
| |
| - drm-shim: Allow drm-shim to work with glibc fortify. |
| |
| Dawn Han (11): |
| |
| - Update venus-protocol to add extension \`VK_VALVE_mutable_descriptor_type` |
| - Increase enum type in vn_descriptor_set. |
| - Update bitset when init descriptor layout. |
| - Track bitset when create descriptor pool |
| - Enable .VALVE_mutable_descriptor_type in physical_device |
| - venus: support the promoted VK_EXT_mutable_descriptor_type |
| - venus: Update mutable descriptor struct type |
| - venus: extend lifetime of push descriptor set layout |
| - venus: extend VkPipelineLayout lifetime for batched VkCmdPushConstants() |
| - venus: implement vkCmdPushDescriptorSetWithTemplateKHR |
| - venus: enable VK_KHR_push_descriptor |
| |
| Diogo Ivo (1): |
| |
| - nouveau: treat DRM_FORMAT_INVALID as implicit modifier |
| |
| Dmitry Baryshkov (1): |
| |
| - freedreno/registers: update hdmi registers to add more 8x74 regs |
| |
| Dmitry Osipenko (11): |
| |
| - util/disk_cache: Add option to disable compression |
| - util/disk_cache: Enable testing of uncompressed caches |
| - util/disk_cache: Append to disk_cache_load_cache_index() with _foz |
| - util/disk_cache: Move struct cache_entry_file_data to the disk_cache_os.h |
| - util/disk_cache: Make disk_cache_os.h usable by C++ code |
| - util/disk_cache: Add new mesa-db cache type |
| - virgl: Fix unmapping of blob resources |
| - llvmpipe: Align persistent mappings to page size |
| - ci: Bump crosvm version |
| - ci: Update virgl-iris-traces checksums |
| - ci/virgl: Enable virgl-iris-traces |
| |
| Dylan Baker (47): |
| |
| - docs: reset new features |
| - docs: add release notes for 22.1.5 |
| - docs: Add sha256 sum for 22.1.5 |
| - docs: add release notes for 22.1.4 |
| - docs: Add sha256 sum for 22.1.4 |
| - docs: update calendar and link releases notes for 22.1.4 |
| - docs: update calendar and link releases notes for 22.1.5 |
| - docs: update calendar for 22.2.0-rc1 |
| - docs: add release notes for 22.2.0 |
| - docs: Add sha256 sum for 22.2.0 |
| - docs: update calendar and link releases notes for 22.2.0 |
| - docs: Add calendar entries for 22.2 release. |
| - intel/genxml: use a single pattern for res |
| - intel/genxml: use a set for lookups |
| - intel/genxml: don't use parens with python assert statement |
| - intel/genxml: add type annotations to gen_sort_tags.py |
| - intel/genxml: use \`a not in {x, y}` instead of \`a != x or a != y` |
| - intel/genxml: remove unused variable in gen_sort_tags.py |
| - intel/genxml: don't overwrite the input of gen_sort_tags in place |
| - intel/genxml: add 'nonzero' to gen_sort_tags |
| - intel/genxml: add a validation mode to gen_sort_tags |
| - intel/genxml: run gen_sort_tags on all of the xml |
| - meson: run genxml sort tests |
| - docs: add release notes for 22.2.1 |
| - docs: Add sha256 sum for 22.2.1 |
| - docs: update calendar and link releases notes for 22.2.1 |
| - intel/genxml: write a space before the '/>` closing tag |
| - intel/genxml: re-process with space before \`/>` |
| - intel/genxml: remove newlines between xml islands |
| - intel/genxml: re-process with extra whitespace removed |
| - intel/genxml: use ElementTree.write and .indent |
| - intel/genxml: reprocess xml with elementree output |
| - intel/genxml: split the processing code up |
| - intel/genxml: use a main() function |
| - intel/genxml: use Path.replace instead of unlink and rename |
| - docs: add release notes for 22.2.2 |
| - docs: Add sha256 sum for 22.2.2 |
| - docs: update calendar and link releases notes for 22.2.2 |
| - docs: Extend calendar entries for 22.2 by 2 releases. |
| - util/indicies: move common static functions to private header |
| - util/indicies: use itertools.product instead of deeply nested loops |
| - util/indicies: write a file with u_indicies_gen.py |
| - util/indicies: simplify some of u_indicies_gen.py |
| - util/indicies: use itertools.product in u_unfilled_gen.py |
| - util/indicies: convert u_unfilled_gen.py to write a file |
| - util/glsl2spirv: fix type error in argument handling |
| - util/glsl2spirv: fix appending extra flags |
| |
| Ella Stanforth (2): |
| |
| - util: fix missing fcntl.h on musl |
| - vulkan: wsi: fix musl build |
| |
| Emma Anholt (108): |
| |
| - ci/bare-metal: Re-open serial and everything after test phase timeout. |
| - zink: Take the tiling path for resources with modifiers. |
| - zink: Replace the "optimal_tiling" flag with a "linear" flag instead. |
| - ci/zink: Re-enable a630 zink-on-turnip traces job using gbm as the backend. |
| - ci/swrast: Add some flakes I've noticed in the IRC channel. |
| - ci: Add testing of the khr-single tests. |
| - ci: Upgrade deqp-runner to 0.15.0. |
| - zink: Make sure that we keep the existing ici pNext chain on inserts. |
| - tu: Move the vkCreateImage format list checks to helper functions. |
| - tu: Treat viewFormatCount==0 as no format list. |
| - tu: Use the format list to decide whether we can do UBWC with MUTABLE. |
| - ci/tu+zink: Re-enable the traces that were asserting about UBWC compatibility. |
| - ir3: Suppress disasm of internal shaders unless IR3_SHADER_DEBUG=internal. |
| - tu: Only emit as many bindless regs as we have seen descriptor sets. |
| - tu: Only emit as many VPC interp/repl regs as will be referenced. |
| - tu: Only emit FS output regs for as many MRTs as we have (but at least 1). |
| - tu: Emit only as many VBs as we've ever seen bound on the command buffer. |
| - ci/tu: Add another cwe_after_bind fail that appeared in the reshuffle. |
| - ci: Move 'never' rules includes above "on_success" rules includes. |
| - ci: disable the freedreno farm. |
| - freedreno/regs: Add a bit of documentation of what SKIP_IB2 does. |
| - turnip: Fix enabling of IB2 skipping. |
| - nir/lower_mediump: Lower FS outputs to 16-bit when the value was upconverted. |
| - nir_to_tgsi: Add support for TG4 with explicit offsets. |
| - nir_to_tgsi: Add support for 64-bit fsign/isign. |
| - nir_to_tgsi: Add support for demote, is_helper_invocation, and subgroup ops. |
| - nir: Make nir_lower_discard_if() handle demotes and terminates, too. |
| - nir_to_tgsi: Use nir_lower_discard_if for demote_if. |
| - nir_to_tgsi: Add support for bindless textures and images. |
| - turnip: Make sure bandwidth config per pixel starts from 0. |
| - freedreno/ir3: Move chip-specific nir compiler options to C code. |
| - freedreno/ir3: Switch to NIR for a3xx/a4xx's vertex id lowering. |
| - gallium,glsl: Delete PIPE_CAP_VERTEXID_NOBASE and lower_vertex_id. |
| - nir: Add a pass to lower mediump temps and shared mem. |
| - spirv: Mark phis as mediump instead of directly lowering them to 16 bit. |
| - turnip: Enable lowering of mediump temps/CS shared to 16-bit. |
| - turnip: Treating non-d/s-write pipelines as not having d/s feedback loops. |
| - mesa: Remove PIPE_CAP_CS_DERIVED_SYSTEM_VALUES_SUPPORTED/lower_cs_derived. |
| - zink: Don't upload shader immediate arrays through UBO 0. |
| - zink: Don't lower indirect derefs of temp arrays. |
| - Revert "ci: disable the freedreno farm." |
| - ci: Add support for Jetson TK1. |
| - nouveau: Fix compiler warnings about silly address checks in ir_print. |
| - ci/turnip: Add missing a618 full-run bypass fails. |
| - turnip: Only emit descriptor loads for active stages in the pipeline. |
| - freedreno/ir3: Consistently lower mediump inputs to 16-bit (when we can). |
| - nir/lower_mediump_vars: Don't lower mediump shared vars with atomic access. |
| - mesa: Lower mediump temps and CS shared when the driver supports FP16+INT16. |
| - turnip: Add some missing LRZ disable debug. |
| - turnip: Ignore dynamic color write enables past our number of attachments. |
| - turnip: Don't look at RB.Z_READ_ENABLE for setting LRZ.Z_TEST_ENABLE. |
| - turnip: Skip rather than invalidate LRZ on gl_FragDepth writes. |
| - turnip: Ignore pDescriptorCounts[] for non-variable-count layouts. |
| - turnip: Keep a host copy of push descriptor sets. |
| - ci/turnip: Use all 9 a618 runners at once for VK testing. |
| - ci/turnip: Bump the full-run a618 runner count to 3. |
| - ci/turnip: Re-enable the compressed cubemap tests. |
| - ci/turnip: Allow running spirv_ids_abuse in full VK runs. |
| - ci/turnip: Drop a couple of spillall skips. |
| - ci/turnip: Move some 15-second-ish test skips to pre-merge skips. |
| - ci/nouveau: Drop BM_POE_TIMEOUT. |
| - ci/nouveau: Rearrange job setup variables. |
| - ci/nouveau: fix up Jetson Nano |
| - ci/nouveau: Update gm20b xfails. |
| - ci: Update kernel to drm-next-2022-08-12-1 |
| - turnip: Fix leak of autotune fence BOs. |
| - turnip: Add TU_DEBUG=bos to print stats of BOs live at submit time. |
| - turnip: Fix the "written stencil is unmodified" check. |
| - turnip: Use the simplified stencil write flags for the LRZ-allowed check. |
| - ci/zink+turnip: Add a manual full run of the dEQP CTS. |
| - turnip: Fix busy-waiting on syncobjs with OS_TIMEOUT_INFINITE. |
| - util/dag: Move the callback function pointer to the state. |
| - util/dag: Add a validation function. |
| - freedreno/ir3: Validate our scheduling DAGs after construction. |
| - nir/nir_opt_copy_prop_vars: Don't leak dynarray memory during the pass. |
| - nir/vars_to_ssa: Always do OOB load/store removal. |
| - glsl: Remove lower_output_reads. |
| - glsl: Remove do_set_program_inouts. |
| - glsl: Remove opt_array_splitting. |
| - glsl: remove opt_structure_splitting. |
| - glsl: Remove lower_vec_index_to_swizzle. |
| - ci/iris: Update iris traces checksums. |
| - turnip: Don't use the dynamic color write enable during non-dynamic. |
| - ci/zink: Clear stale xfails for turnip. |
| - zink: Skip border color clamping for compressed formats. |
| - zink: Enable ASTC texture format translation. |
| - zink: Map ETC1 to ETC2 to avoid uncompressing in the frontend. |
| - ci/zink: Add some more flakes for turnip. |
| - zink: Fix RelaxedPrecision decoration of texture samples. |
| - zink: Decorate mediump outputs as RelaxedPrecision. |
| - turnip: Add a perf_debug for feedback-related performance traps. |
| - turnip: Move the ubwc_possible check before mutable formats. |
| - turnip: Add perf_debug for UBWC being disabled due to mutable formats. |
| - turnip: Add perf debug for more UBWC-disable cases that we could support. |
| - zink: Fix dummy CB path decision for VK_EXT_cwe presence. |
| - turnip: Be sure we blit depth, not stencil, for Z32FS8 -> Z32F resolves. |
| - nir/opt_phi_precision: Fix missing swizzles when narrowing phi srcs. |
| - turnip: Enable LRZ testing (not writing) in the presence of discards. |
| - ci/freedreno: Update CivV trace expectation. |
| - ci/freedreno: Update known flakes/timeouts. |
| - turnip: Fix reservation for indirect compute's IR3_DP_SUBGROUP_ID_SHIFT. |
| - zink: No need to use a 2-sample dummy image for bindless without null descs. |
| - zink: Lazily allocate the dummy surfaces. |
| - zink: Enable mesa/st frontend shader caching. |
| - rusticl: Fix the invalid memory migration flags check. |
| - iris: Disable GLSL lower_const_arrays_to_uniforms. |
| - gallium: update docs about PIPE_CAP_PREFER_IMM_ARRAYS_AS_CONSTBUF. |
| - ir3/ra: Make sure we don't pick a preferred reg overflowing the file. |
| |
| Eric Engestrom (56): |
| |
| - bin/gen_release_notes.py: bump advertised vulkan version to 1.3 |
| - pvr: fix memleak in error paths |
| - vk/device-select-layer: fix .sType of VkPhysicalDeviceGroupProperties |
| - docs/features: add missing extensions supported by anv |
| - docs/features: add missing extensions supported by lavapipe |
| - docs/features: add missing extensions supported by radv |
| - docs/features: add missing extensions supported by turnip |
| - docs/features: add missing extensions supported by venus |
| - docs/features: start tracking extensions supported by dozen |
| - docs/features: start tracking extensions supported by panvk |
| - docs/features: start tracking extensions supported by powervr |
| - docs/features: drop unsupported VK_KHR_display & VK_KHR_get_display_properties2 for lavapipe |
| - ci: rename GitHub CI workflow from \`ci` to \`macos` to be a bit more explicit about what it covers |
| - meson: drop duplicate \`lib` in lib name |
| - wsi/x11: fix memleak in wsi_x11_connection_create() |
| - aco: drop unused variable |
| - anv: convert assert into unreachable to avoid fallthrough error |
| - meson: replace manual compiler flags with meson arguments |
| - vc4: introduce VC4_DBG() macro to make VC4_DEBUG checks consistent |
| - v3d: introduce V3D_DBG() macro to make V3D_DEBUG checks consistent |
| - wayland: avoid writing interface names as free-form strings |
| - vc4: drop dead qir_POW() helper |
| - v3dv: implement VK_EXT_shader_module_identifier |
| - docs: move rusticl readme into docs/ |
| - broadcom: fix dependencies in static_library() calls |
| - docs: add release notes for 22.1.6 |
| - docs: update calendar and link releases notes for 22.1.6 |
| - docs: add release notes for 22.1.7 |
| - docs: update calendar and link releases notes for 22.1.7 |
| - ci: unexport local variable (and fix formatting) |
| - ci: bump mold to 1.5 |
| - docs/release-calendar: add 22.3 schedule |
| - v3dv: drop error overwrite |
| - vc4: don't lower uadd_carry |
| - vc4: don't use imprecise_32bit_lowering for idiv lowering |
| - vc4: pre-compile shaders to make up for the lack of draw calls in shader-db |
| - vc4: use PIPE_MASK_RGBA name instead of its value 0xf |
| - vc4: send shaderdb output through KHR_debug |
| - vc4: mark piglit copypixels-(draw-)sync as flaky |
| - VERSION: bump to 22.3.0-rc1 |
| - .pick_status.json: Update to 1174f376096ed6ceebb0fb2810456f1501a68df7 |
| - zink: add missing build_id linker args |
| - vk/runtime: drop incorrect UNUSED annotation |
| - v3dv: avoid freeing already-freed memory |
| - VERSION: bump for 22.3.0-rc2 |
| - .pick_status.json: Update to 9bd11f65238ce101bf846f5528f9088630e983f7 |
| - ci: avoid triggering vc4 & v3d tests on v3dv-only MRs |
| - VERSION: bump for 22.3.0-rc3 |
| - .pick_status.json: Update to f2e535e4fed5bdc13e11a443316a4b77cd5eb81a |
| - v3d(v): account for debug flags when using the cache |
| - .pick_status.json: Mark 56bd81ee21cb42e4a5afa2939514b570a1283754 as denominated |
| - VERSION: bump for 22.3.0-rc4 |
| - .pick_status.json: Update to 22be0d09a005b4b955a46b65a919cfd786d6814f |
| - .pick_status.json: Mark 470fbb35efe1935242b346427ec0fa22b40fff2b as denominated |
| - commit_in_branch.py: add support for checking staging branches |
| - docs: add release notes for 22.3.0 |
| |
| Erico Nunes (1): |
| |
| - ci: enable lima farm |
| |
| Erik Faye-Lund (226): |
| |
| - docs: fixup link to virgl docs |
| - zink: add have_D32_SFLOAT_S8_UINT boolean |
| - zink: do not use VK_FORMAT_D32_SFLOAT_S8_UINT without checking |
| - zink: remove needless check |
| - glsl/tests: do not perform eol-conversion on windows |
| - zink: type_main -> type_void_func |
| - zink: add spirv_builder_function_call |
| - zink: wrap discard in a function |
| - zink: fix conditions for draw-parameters |
| - zink: fixup indent in caps-check |
| - mesa/main: simplify implmementation of _mesa_is_format_etc2 |
| - mesa/main: simplify implementation of _mesa_is_format_astc_2d |
| - mesa/st: simplify st_compressed_format_fallback |
| - mesa/st: avoid double-mapping if both images are the same |
| - mesa/st: do not use memcpy when using compressed fallback |
| - mesa/st: add context-flag for bptc-support |
| - mesa: add format-helper for bptc |
| - mesa: add _mesa_unpack_bptc-function |
| - mesa/st: implement fallback for bptc |
| - mesa/st: enable bptc extension with fallback |
| - zink: allow X32 -> A32 |
| - mesa/main: remove always-true define |
| - mesa/main: add fp16-versions of bptc decompression functions |
| - mesa: fall-back to fp16 for bptc-textures |
| - gallium/u_transfer_helper: clean up boolean flags |
| - gallium/u_transfer_helper: add in-place interleave flag |
| - zink: use full transfer-helper api |
| - gallium/u_transfer_helper: hide deinterleave helpers |
| - mesa/st: also override mip filter for integer textures |
| - mesa/st: always force integer filtering to nearest |
| - mesa/st: force nearest-filtering for fp32 textures |
| - zink: request forcing of fp32-samplers to nearest when needed |
| - mesa/st: do not blit when using compressed fallback |
| - zink: clamp miplodbias when creating sampler |
| - zink: add ntv support for demote |
| - zink: add spirv_builder support for is_helper_invocation |
| - zink: add ntv support for is_helper_invocation |
| - zink: expose demote when supported |
| - zink: lower discard/demote depending on derivatives |
| - Revert "Revert "ci: set venus on lavapipe to manual due to flakes"" |
| - mesa/st: use _mesa_generate_mipmap for fallback-formats |
| - mesa/st: do not fallback to srgba formats for dxt |
| - util: allow unpacking less than a block from dxt |
| - mesa/st: add context-flags for s3tc |
| - mesa: add format-helpers for s3tc |
| - mesa/main: add _mesa_unpack_s3tc |
| - mesa/st: implement fallback for s3tc |
| - mesa/st: enable s3tc extensions with fallback |
| - v3d: do not pretend to fake rgtc-support |
| - vc4: respect z-offset in tiled blits |
| - vc4: do not attempt to do deep tiled blits |
| - docs/zink: remove bptc from required formats for gl4.2 |
| - docs/zink: document rgtc requirement |
| - mesa/st: add context-flag for rgtc |
| - mesa: add format-helper for rgtc |
| - util/format: fix broken indentation |
| - util/format: allow unpacking less than a block from rgtc |
| - util/format: implement rgtc -> r8 / r8g8 unpack |
| - mesa/main: add _mesa_unpack_rgtc |
| - mesa/st: implement fallback for rgtc |
| - mesa/st: do not fall back to uncompressed for rgtc |
| - mesa/st: enable rgtc extension with fallback |
| - panfrost: do not fake rgtc-support |
| - docs: update staus of mark GL_ARB_texture_compression_bptc |
| - lima: don't store width in resource-level |
| - lima: do not align width/height for non-shared resources |
| - mesa/st: add context-flag for latc |
| - mesa: add format-helper for latc |
| - mesa/main: add support for latc in _mesa_unpack_rgtc |
| - mesa/st: implement fallback for latc |
| - mesa/st: do not fall back to uncompressed for latc |
| - mesa/st: enable latc extensions with fallback |
| - freedreno: do not fake rgtc-support |
| - u_transfer_helper: rip out fake_rgtc code |
| - mesa/st: always use normalized coords for samplers |
| - mesa/st: always use normalized coords for samplers |
| - mesa/st: always use normalized coords for samplers |
| - zink: assert on unnormalized texcoords |
| - docs: add an amber article |
| - zink: emulate latc formats with rgtc |
| - zink: fix luminance clears |
| - zink: fixup indent |
| - zink: refactor swizzle-clamping code |
| - zink: fix luminance/luminance-alpha emulation |
| - zink: correct emulation of luminance-alpha formats |
| - ci/crocus: update ci results |
| - mesa/main: fix broken indent |
| - mesa/main: don't copy signed rgtc via float |
| - docs: fixup broken rst syntax |
| - gallium/u_threaded_context: remove stale comment |
| - gallium: normalized_coords -> unnormalized_coords |
| - zink: fix broken pool-alloc consolidation |
| - zink: fixup dynarray-type |
| - zink: use util_dynarray_clear |
| - mesa/st: rip out point-sprite cap |
| - mesa/main: remove driver-cap for ARB_point_sprite |
| - mesa/st: move static asserts out of translate_prim |
| - mesa/st: remove translate_mode |
| - docs: correct spelling of "implemented" |
| - docs: correct spelling of "attempts" |
| - docs: correct spelling of "bug fixes" |
| - docs: correct spelling of "specified" |
| - docs: correct spelling of "hierarchical" |
| - docs: correct spelling of "implementation" |
| - docs: correct spelling of "to" |
| - docs: correct spelling of "Khronos" |
| - docs: correct spelling of "choosing" |
| - docs: correct spelling of "subtly" |
| - docs: correct spelling of "Raspbian" |
| - docs: correct spelling of "shareable" |
| - docs: correct spelling of "simplification" |
| - docs: correct spelling of "synced" |
| - docs: correct spelling of "maximum" |
| - docs: correct spelling of "hierarchy" |
| - docs: correct spelling of "commutativity" |
| - docs: correct spelling of "compression" |
| - docs: correct spelling of "laid" |
| - docs: correct spelling of "average" |
| - docs: correct spelling of "addition" |
| - docs: correct spelling of "explicitly" |
| - docs: correct spelling of "rasterizer" |
| - docs: fractionary -> fractional |
| - docs: ie. -> i.e. |
| - docs: correct spelling of "correctly" |
| - docs: correct spelling of "traveling" |
| - docs: correct spelling of "freedreno" |
| - zink: rework primitive rasterization type logic |
| - zink: reuse rast_prim for line-rasterization check |
| - docs: correct spelling of "addition" |
| - docs: correct spelling of "passes" |
| - docs: correct spelling of "irrespective" |
| - docs: correct spelling of "semantic" |
| - docs: correct spelling of "specifies" |
| - docs: correct spelling of "Gouraud" |
| - docs: correct spelling of "Copr" |
| - docs: spell out "build directory" |
| - docs: spell out "SHA256 checksum" |
| - docs: spell out "environment variable" |
| - docs: spell out "distribution" |
| - docs: use the official spelling of SHA-1 |
| - docs: eg. -> e.g. |
| - docs: XOR'd -> XORed |
| - docs: consistently upper-case VC4, V3D etc |
| - docs: consistently spell DRI in all-caps |
| - docs: consistently spell ANV in allcaps |
| - docs: consistently spell URL in allcaps |
| - docs: consistently spell OK in allcaps |
| - docs: consistently use CMake |
| - docs: consistently spell API all-caps |
| - docs: consistently spell VirGL in camel-case |
| - docs: consistently spell Haswell in camel-case |
| - docs: use consistent casing for dEQP |
| - docs: ip -> IP |
| - docs: irc -> IRC |
| - docs: xorg -> X.Org |
| - docs: pci -> PCI |
| - docs: cpu -> CPU |
| - docs: gpu -> GPU |
| - docs: rgb -> RGB |
| - docs: perfetto -> Perfetto |
| - docs: lavapipe -> Lavapipe |
| - docs: freedreno -> Freedreno |
| - docs: adreno -> Adreno |
| - docs: radeon -> Radeon |
| - docs: nouveau -> Nouveau |
| - docs: llvmpipe -> LLVMpipe |
| - docs: vulkan -> Vulkan |
| - docs: lima -> Lima |
| - docs: ppir -> PPIR |
| - docs: autotools -> Autotools |
| - docs: ArchLinux -> Arch Linux |
| - docs: meson -> Meson |
| - docs: Gitlab -> GitLab |
| - docs: xyzw -> XYZW |
| - docs: hw -> HW |
| - docs: lava -> LAVA |
| - docs: skqp -> SkQP |
| - docs: servo -> Servo |
| - docs: Linux fbdev -> Linux Framebuffer |
| - docs: virtualisation -> virtualization |
| - docs: dependant -> dependent |
| - docs: healthcheck -> health check |
| - docs: vmware -> VMware |
| - docs: spell out "distribution" |
| - docs: spell out "transform and lighting" |
| - docs: de-duplicated -> deduplicated |
| - docs: de-nominate -> denominate |
| - docs: freedreno -> Freedreno |
| - docs: llvmpipe -> LLVMpipe |
| - docs: softpipe -> Softpipe |
| - docs: panfrost -> Panfrost |
| - docs: piglit -> Piglit |
| - docs: ini -> INI |
| - docs: swr -> OpenSWR |
| - docs: yml -> YAML |
| - docs: xml -> XML |
| - docs: Adreno aXXX -> Adreno XXX |
| - docs: remove non-existent directory |
| - docs: do not mention classic swrast |
| - docs: name correct driver |
| - docs: clean up labels |
| - docs: add missing link to mesa3d.org |
| - docs: add links to usenet groups |
| - docs: link to replacement forum |
| - docs: Android.mk -> ndk-build |
| - docs: gitlab -> GitLab |
| - docs: nVidia -> NVIDIA |
| - docs: spell ATI codenames in allcaps |
| - docs: nfs -> NFS |
| - docs: fixup rebase mistake |
| - zink: correct depth-bias enable condition |
| - zink: only set line-width if drawing lines |
| - zink: consider polygon-mode for rast_prim |
| - zink: do not read is_generated unless in tcs shader |
| - zink: put union fields into structs named by the shader-stages |
| - docs/zink: fix and cleanup rst syntax |
| - docs: fixup broken link syntax |
| - docs: do not mention EGL_MESA_drm_display |
| - docs: remove stale envvar-reference |
| - zink: fix json-errors in profile-file |
| - zink: remove needless requirements |
| - zink: fix incorrect requirements |
| - zink: update textureCompressionBC requirement |
| - docs/zink: add missing required device-feature |
| - mesa: treat unsupported queries as dummies |
| - d3d12: fix max-array-layers |
| |
| Fabian Vogt (1): |
| |
| - gallivm: Fix LLVM optimization with the new pass manager |
| |
| Feng Jiang (5): |
| |
| - virgl/vtest: fix memory overwrite problem in virgl_vtest_send_get_caps() |
| - gallium: add key size to the structure pipe_picture_desc |
| - vl: change vl_video_buffer_destroy() to non-static |
| - gallium/util: add a new helper function pipe_buffer_size() |
| - virgl: add support for hardware video acceleration |
| |
| Filip Gawin (5): |
| |
| - r300: add list of deqp gles2 r400 failures |
| - r300: add khr r400 failures |
| - gallium: avoid using float based conditions in loops |
| - r300: don't use smooth line if not requested |
| - r300: update r400 tests |
| |
| Francisco Jerez (4): |
| |
| - intel/fs: Fix horiz_offset() to handle FIXED_GRFs with non-trivial 2D regions. |
| - nir/lower_int64: Enable lowering of 64-bit float to 64-bit integer conversions. |
| - nir/lower_int64: Implement lowering of 64-bit integer to 64-bit float conversions. |
| - nir/lower_int64: Fix float16 to int64 conversions. |
| |
| Frank Binns (13): |
| |
| - pvr: get free list min size from the runtime info |
| - pvr: reset all command buffer state in pvr_cmd_buffer_reset() |
| - pvr: Implement vkResetCommandBuffer |
| - pvr: remove redundant TODO |
| - pvr: remove image pointer from image view struct |
| - pvr: set samplerAnisotropy to false |
| - pvr: cube map tex state packing fixes |
| - pvr: don't setup attachment tex state for images with input attachment bit unset |
| - pvr: finish render job sample count setup |
| - pvr: add required pixel formats |
| - pvr: remove implicit sync support |
| - pvr: setup buffer and image format feature bits |
| - CODEOWNERS: remove rajnesh-kanwal as an Imagination maintainer |
| |
| Friedrich Vock (28): |
| |
| - amd/common: move ac_memory_ops_per_clock into ac_gpu_info.h |
| - amd/common: Remove redundant code for determining memory ops per clock |
| - radv/winsys: Return VK_ERROR_INVALID_EXTERNAL_HANDLE if buffer imports fail |
| - radv: Generalize instance count in BVH states to leaf node count |
| - radv: Add metadata to acceleration structures |
| - radv: Add Radeon Raytracing Analyzer trace dumping utilities |
| - radv: Add Radeon Raytracing Analyzer capturing layer |
| - radv: Enable Radeon Raytracing Analyzer traces |
| - docs: Add documentation about RADV RRA tracing |
| - radv: Use radv_CmdUpdateBuffer instead of radv_update_buffer_cp for geometry info writes |
| - radv: Free geometry infos in radv_CmdBuildAccelerationStructuresKHR |
| - radv: Correct accel struct header size |
| - radv: Rename internal node shader to lbvh_internal |
| - radv: Split CmdBuildAccelerationStructuresKHR into different functions |
| - radv: Add BVH IR types |
| - radv: Add conversion shader for internal nodes |
| - radv: Add conversion shader for leaf nodes |
| - radv: Create pipelines for conversion meta shaders |
| - radv: Build acceleration structures using BVH IR |
| - radv: Rename emulated float helpers |
| - radv: Use a struct for AABBs |
| - radv: Add radv_indirect_unaligned_dispatch |
| - radv/rt: Fix internal converter synchronization |
| - radv/rt: Dispatch internal converter indirectly |
| - radv/rt: Track number of inactive leaf nodes |
| - radv: Add global sync utilities |
| - radv: Add REF as a typename macro to .clang-format |
| - radv: Use spirv1.5 instead of vulkan1.2 |
| |
| Georg Lehmann (40): |
| |
| - aco: Check that we don't override exec_val operands during branching sequence optimization. |
| - aco/assembler: Fix v_cmpx with SDWA. |
| - aco: Fix optimizing branching sequence with s_and_saveexec. |
| - aco/assembler: Fix v_cmpx pre GFX10. |
| - aco: Use v_cmpx pre GFX10. |
| - aco: Force tex operand to have the correct sub dword size before packing. |
| - nir: Add nir_ssa_scalar_is_undef. |
| - nir/fold_16bit_tex_image: Add an option to fold image sources. |
| - aco: Implement storage image A16. |
| - aco: Combine 16bit undef and constants instead of using s_pack. |
| - radv: Fold 16bit image sources. |
| - nir: Print selection control for nir_if. |
| - nir/opt_algebraic: Optimize check for single bit. |
| - nir/opt_algebraic: Optimize d3d9 pow with fmulz. |
| - aco: Fix image instructions with lod when 2d_view_of_3d is enabled on GFX9. |
| - ac/llvm: Fix image instructions with lod for 2d on GFX9. |
| - aco: Use plain VOPC for vcmpx when possible. |
| - nir/opt_algebraic: Optimize unpacking of upcasts to 64bit integers. |
| - radv: Enable VK_EXT_load_store_op_none. |
| - aco: Use v_fmaak/v_fmamk if two operands are the same literal. |
| - aco: Unswizzle v_pk_fma_f16 literals to produce more v_pk_fmac_f16. |
| - radv: Fix GLSL BDA struct alignment and use pointer arithmetic SIZEOF. |
| - nir/opt_algebraic: Optimize more (a cmp b ? a : b) to min/max. |
| - aco: Use s_pack_ll for s_bfe operand on GFX9+. |
| - nir: Print nir_selection_control_divergent_always_taken. |
| - nir/opt_algebraic: Mirror optimizations for find_msb_rev. |
| - aco: Implement [ui]find_msb_rev. |
| - ac/llvm: Implement [ui]find_msb_rev. |
| - nir/opt_algebraic: Add an option to lower uclz. |
| - radv,aco: Lower uclz in NIR. |
| - nir/opt_algebraic: Optimize various find_msb_rev patterns. |
| - nir: Fix ifind_msb_rev constant folding. |
| - aco: fmaak/fmamk can't use SDWA. |
| - aco: Don't use opsel for p_insert. |
| - ac/llvm: Implement signed idot on GFX11. |
| - aco: Implement signed idot instructions on GFX11. |
| - radv: Use available dot product instructions on gfx11. |
| - radeonsi: Use available dot product instructions on gfx11. |
| - aco: Use opsel for the third operand. |
| - aco: Use s_pack_ll_b32_b16 for scalar zero extend. |
| |
| Gert Wollny (95): |
| |
| - radeonsi-raven/ci: Move spec@arb_timer_query@timestamp-get to flakes |
| - llvmpipe: Double number of SSBOs |
| - r600/sfn: Schedule shift instruction on R600 in t-slot |
| - r600/sfn: Add GS thread fix just like the TGSI code path |
| - r600/sfn: Sort FS inputs to make interpolated values come first |
| - r600/sfn: Fix color outputs when color0 writes all |
| - r600/sfn: Initialize out buffer when printing op |
| - r600: Force NOPs when loading AR on R600 class hardware |
| - r600/sfn: Handle R600 scratch read |
| - r600: Don't use SB with R600 style scratch reads |
| - r600: Fix SCRATCH OP de-assembly |
| - nir_lower_atomics_to_ssbo: Initialize deref struct |
| - r600/sfn: Handle color0 writes all on R700 like on EG |
| - r600/sfn: Don't tag mem-ring and stream instructions as exports |
| - r600/sfn: Don't schedule GDS instructions early |
| - r600/sfn: Don't scan the whole block for ready instructions |
| - r600/sfn: Use a heuristic to keep SSBO setup and store close |
| - r600: Fix reporting TGSI IR support |
| - r600/sfn: Use a low number for unused target register |
| - virgl: Fix buffer overflow warning: |
| - virgl: Fix ubsan warnings: |
| - virgl: when reading back wait first, then do the transfer |
| - virgl/ci: remove some flakes |
| - r600/sfn: override register ID when it doesn't matter |
| - r600/sfn: ACK all image stores, also at end of loop |
| - r600/sfn: more peephole optimization with conditionals |
| - r600/sfn: Override VPM if access in helpers is requested |
| - r600/sfn: Don't scalarize fdd instructions |
| - virgl: Add some formats that the CTS uses |
| - mesa/glsl: Add support for NV_shader_noperspective_interpolation |
| - r600: set nir option lower_cs_local_index_to_id |
| - virgl: Report CONSTANT_BUFFER_SIZE according to GL_MAX_UNIFORM_BLOCK_SIZE |
| - r600/sfn: Lower tex,txl,txb and txf to backend |
| - r600/sfn: lower txd to backend in nir |
| - r600/sfn: lower txf_ms in nir |
| - r600/sfn: copy-propagate single source texture values |
| - r600/sfn: print tex prepare instructions |
| - r600/sfn: Don't allocate un-used components in texture ops |
| - r600/sfn: Add an easy access to get an instruction as ALU |
| - r600/sfn: Don't assert when setting one value |
| - r600/sfn: Copy propagate into TEX source |
| - r600/sfn: drop some unused code |
| - r600/sfn: copy propagate register load chains |
| - r600/sfn: VS inputs are effectively SSA |
| - r600/sfn: Add a free-channel mask when testing whether a register can switch channel |
| - r600/sfn: fix some channel pinning |
| - r600/sfn: Handle shifts on Cayman |
| - r600/sfn: only use 3 channels on Cayman for trans ops |
| - r600/sfn: Make sure texture lowering is done in the right order |
| - r600/sfn: lower tg4 to backend in NIR |
| - r600: Add slot to ALU disassambly |
| - r600: Make NIR the default shader IR |
| - nir_lower_to_source_mods: Don't sneek in an abs modifier from parent |
| - r600/sfn: Only run 64 bit ops lowering passes when really needed |
| - r600/sfn: Always start a new CF after a KILL instruction |
| - r600/sfn: don't propagate registers into conditional test |
| - r600/sfn: Add peephole optimization for kill instructions |
| - r600/sfn: Delete final lowered nir shader early |
| - r600/sfn: Fix typo |
| - r600/sfn: Use the correct allocator for loop lists |
| - r600/sfn:explicitly initialize the memory pool |
| - r600/sfn: assert on use of abs modifier in op3 |
| - r600/sfn: Unroll loops after doing some optimizations |
| - r600/sfn: Always enforce LDS operation order |
| - r600/sfn: Make sure all components are usable when lowering TF inputs |
| - r600/sfn: run cleanup passes after late algebraic opt |
| - nir: move fusing csel and comparisons to opt_late_algebraic |
| - r600/sfn: Unify the handling of resource IDs in instruction |
| - r600/sfn: Handle nir_op_seq and nir_op_sne |
| - r600: Fix printing t-channel in diss-assambly |
| - r600/sfn: Add test for channel changes in TEX source from opt |
| - r600/sfn: Allow copy-prop of group dest into origin |
| - r600/sfn: Trigger TEX CF based on max TEX CF size |
| - r600/sfn: improve scheduling of tex sources |
| - r600/sfn: Increase scheduling priority on uniform reads and non-ssa writes |
| - r600/sfn: elimiate dead registers too |
| - r600: Account for color and clipvertex when evaluating LDS space |
| - r600/sfn: evaluate LDS location for color and clip-vertex too |
| - r600/sfn: Add .clang-format file and apply style |
| - r600: declare for counter locally and fix signed/unsigned warning |
| - r600: Only count ALU registers that are not clause local |
| - r600: Print MOVA_INT dest on Cayman |
| - r600/sfn: Handle load_workgroup_size |
| - r600/sfn: lower uniforms to UBOs |
| - r600/sfn: remove load_uniform handling |
| - etnaviv: Create MSAA surfaces with PE compatible tiling |
| - nir/algeraic_opt: use double options too for lowering ftrunc@64 |
| - r600/sfn: Fix source modifiers for ffract64 |
| - r600/sfn: Fix f2u32 and remove backend lowring of f2u64 and f2i64 |
| - r600/sfn: Honor shader key w.r.t. atomic counter layout |
| - r600/sfn: Fix location for reading cube array image dimensions |
| - r600/sfn: sort FS color outputs before all other outputs |
| - virgl: don't a use staging when a resources created with the shared flag |
| - virgl: Fix injection of double from const mov instruction |
| - r600/sfn: always use four slots for Cayman trans ops |
| |
| Giancarlo Devich (4): |
| |
| - mesa: Fix stack corruption for PIPE_QUERY_TIMESTAMP |
| - d3d12: Add cache managers for separate buffer usage cases |
| - gallium/pipe: Align allocation size in \`pb_cache_manager_create_buffer` |
| - d3d12: Don't align already-aligned size in \`d3d12_bufmgr_create_buffer` |
| |
| Glenn Kennard (1): |
| |
| - nv30: Fix non-scissored clears after a scissor has been set |
| |
| Guilherme Gallo (29): |
| |
| - ci/lava: Customise sections timeouts via envvars |
| - ci/lava: Add timeout header info for LAVA GL Sections |
| - ci/bin: Add utility to find jobs dependencies |
| - ci/bin: Sort imports from ci_run_n_monitor.py |
| - ci/bin: Make ci_run_n_monitor finds dependencies automatically |
| - ci/bin: Add script to expand jobs manifest |
| - ci/bin: Cache GQL queries |
| - ci/bin: Print job needs DAG in ci_run_n_monitor |
| - ci/skqp: Add an option to run all tests |
| - ci/skqp: Show reports on crashes |
| - ci/skqp: Fix paths in skqp-runner |
| - ci/skqp: Fix Nima-Cpp fetching error |
| - ci/skqp: Build list_gpu_unit_tests and list_gms |
| - ci/skqp: Add support for commenting tests files |
| - ci/skqp: Use SKQP_BIN_DIR instead of hardcoded /skqp dir |
| - ci/skqp: Put generated tests files in artifacts |
| - ci/skqp: Supress irrelevant shellcheck warnings |
| - ci/skqp: Remove .baremetal-skqp-test in favor of .skqp-test |
| - ci/skqp: Add gitlab sections for uncluttering |
| - ci/freedreno: skqp: run with new tests files |
| - ci/radeonsi: skqp: Add fail test files for raven |
| - ci/radeonsi: Add zork jobs and rules |
| - ci/bin: Fix requirements.txt |
| - ci/bin: Remove whitespace from token files |
| - ci: Fix kernel+rootfs.* jobs |
| - ci: Update piglit with s3 support |
| - ci: Update piglit-traces tests expectations |
| - ci: Update ci-fairy in CI, rootfs and containers |
| - freedreno/ci: Skip civilization-v/CivilizationV-trim trace |
| |
| Hans-Kristian Arntzen (3): |
| |
| - vulkan: Update to 1.3.228 headers. |
| - radv: Implement VK_EXT_mutable_descriptor_type. |
| - vk/runtime: Fix narrowing of timeline signal and wait value to u32. |
| |
| Iago Toral Quiroga (87): |
| |
| - broadcom/compiler: don't use imprecise_32bit_lowering for idiv lowering |
| - broadcom/compiler: use nir_opt_idiv_const |
| - broadcom/compiler: simplify code emitted for centroid coordinates |
| - nir/lower_alu: drop unnecessary iand on uadd_carry result |
| - broadcom/compiler: track number of TMU operations in prog data |
| - v3dv: move check_needs_load/store helpers to unversioned code |
| - v3dv: track if a job is compatible with double-buffer mode |
| - v3dv: fix tile state allocation |
| - v3dv: add a v3dv_job_allocate_tile_state helper |
| - v3dv: postpone tile state allocation for render pass jobs |
| - v3dv: tell job_compute_frame_tiling whether we want to use double-buffer mode |
| - v3dv: add a heuristic for double-buffer mode |
| - v3dv: vkCmdClearAttachments no longer generates its own RCL |
| - v3d,v3dv: lower texel buffer aligment requirements |
| - v3dv: implement VK_EXT_texel_buffer_alignment |
| - v3dv: add a helper to destoy pipeline layouts |
| - v3dv: ref/unref pipeline layout objects |
| - v3dv: set maxBufferSize property |
| - v3dv: add a v3dv_image_init helper |
| - v3dv: add a get_image_memory_requirements helper |
| - v3dv: implement vkGetDeviceImageMemoryRequirementsKHR |
| - v3dv: add a get_buffer_memory_requirements helper |
| - v3dv: add a buffer_init helper |
| - v3dv: implement vkGetDeviceBufferMemoryRequirementsKHR |
| - v3dv: implement vkGetDeviceImageSparseMemoryRequirementsKHR |
| - v3dv: expose VK_KHR_maintenance4 |
| - nir/lower_variable_initializers: implement non-scoped barrier path |
| - v3dv: implement VK_KHR_zero_initialize_workgroup_memory |
| - v3dv: also check GS stage on image access for binning barriers |
| - v3dv: implement vkCmdPipelineBarrier2 |
| - v3dv: port remaining entry points to KHR_synchronization2 |
| - v3dv: use VkPipelineStageFlagBits2 when checking semaphore wait stage |
| - v3dv: expose VK_KHR_synchronization2 |
| - v3dv: implement VK_KHR_workgroup_memory_explicit_layout |
| - v3dv: switch to common code for command buffer lifecycles |
| - v3dv: expose VK_EXT_tooling_info |
| - v3dv: fix swap_rb and channel_reverse flags in image views |
| - v3dv: expose VK_EXT_border_color_swizzle |
| - v3dv: ref pipeline layout earlier during pipeline init |
| - v3dv: implement VK_EXT_depth_clip_control |
| - v3dv: expose VK_EXT_attachment_feeback_loop_layout |
| - v3dv: fix variable type |
| - v3dv: limit heap size to 4GB |
| - broadcom/simulator: add a helper to get the amount of free heap memory |
| - v3dv: implement VK_EXT_memory_budget |
| - v3dv: expose VK_EXT_primitive_topology_list_restart |
| - v3dv: don't load an attachment for unaligned render area if we are not storing |
| - v3dv: expose VK_EXT_load_store_op_none |
| - v3dv: don't return incompatible driver if GPU is not present |
| - v3dv: fix program id for binning shaders |
| - broadcom/compiler: detect unifa write from signal |
| - broadcom/compiler: check signal writes to magic regs when updating scoreboard |
| - v3dv: optimize ldunif load into unifa write |
| - v3dv: fix VK_EXT_texel_buffer_alignment |
| - broadcom/compiler: increase V3D_MAX_BUFFE_RANGE to 2^30 bytes |
| - broadcom/compiler: fix robust buffer access |
| - broadcom/compiler: handle shared stores with robust buffer access |
| - broadcom/compiler: don't apply robust buffer access to shared variables |
| - broadcom/compiler: rename v3d_nir_lower_robust_buffer_access.c |
| - broadcom/compiler: rename static helpers involved with robust buffer access |
| - broadcom/compiler: add a lowering for robust image access |
| - v3dv: expose VK_EXT_image_robustness |
| - v3dv: use NIR_PASS with v3d_nir_lower_robust_image_access |
| - v3dv: use enabled features from vk_device |
| - broadcom/compiler: trivial code clean-up |
| - vulkan/runtime: include robustness info when hashing a shader stage |
| - v3dv: implement VK_EXT_pipeline_robustness |
| - v3dv: drop layout refs for all allocated sets from a pool on destroy / reset |
| - broadcom/compiler: handle vec2 load/store index |
| - v3dv: refactor events |
| - v3dv: re-enable sync_fd import/export |
| - v3dv: return out of host memory if we fail to create event pipelines |
| - v3dv: split event implementation to a separate file |
| - v3dv: remove unnecessary check for NULL |
| - v3dv: do a better job at cleaning up the device on init failure |
| - v3dv: fix incorrect return type |
| - v3dv: handle allocation failure during pipeline initialization |
| - v3dv: do better cleanup on failure during pipeline cache operation |
| - v3dv: increase limit for active event objects |
| - v3dv: always check VK_ACCESS_2_MEMORY_READ_BIT for read accesses |
| - v3dv: make the helper to emit pipeline barriers public to other files |
| - v3dv: fix event synchronization |
| - v3dv: vkCmdWaitEvents2 takes an array of VkDependencyInfo |
| - v3dv: use vk_alloc instead of malloc |
| - broadcom/compiler: avoid using ldvary sequence to hide latency of branching |
| - v3dv: fix debug dump on BO free |
| - v3dv: ignore imported BOs when tracking BO memory usage |
| |
| Ian Romanick (8): |
| |
| - nir: spirv: Allow 32-bit version of nir_intrinsic_is_sparse_texels_resident |
| - radeonsi: r600: d3d12: st: Use NIR lowering for tg4 offset arrays instead of GLSL lowering |
| - glsl: Remove lower_offset_arrays pass |
| - nir/comparison_pre: See through an inot to apply the optimization |
| - intel/fs: Fix constant propagation into 32x16 integer multiplication |
| - nir/range_analysis: Set higher default maximum for max_workgroup_count |
| - nir/loop_analyze: Fix get_iteration for nir_op_ine |
| - nir/loop_analyze: Fix get_iteration for nir_op_fneu |
| |
| Igor Torrente (3): |
| |
| - vulkan: Add vk_clock_gettime and vk_time_max_deviation |
| - radv: anv: Use the new vk_clock_gettime and vk_time_max_deviation functions |
| - venus: Fix dEQP-VK.pipeline.timestamp.calibrated.host_domain_test failure |
| |
| Ikshwaku Chauhan (1): |
| |
| - Revert "radeon: add EFC support to only VCN2.0 devices" |
| |
| Illia Abernikhin (2): |
| |
| - utils: Move functions from debug.* to u_debug.* |
| - utils: Merge util/debug.* into util/u_debug.* and remove util/debug.* |
| |
| Illia Polishchuk (2): |
| |
| - mesa: skip extra state updates for clear calls |
| - driconf/Intel: Add lower_depth_range_rate option workaround for Homerun Clash misrendering issue |
| |
| Isaac Bosompem (1): |
| |
| - virgl: Set use_staging in resource_from_handle |
| |
| Italo Nicola (6): |
| |
| - etnaviv: skip disk cache initialization on standalone compiler |
| - etnaviv: fix standalone compiler to work without a tgsi backend |
| - clc: add 32-bit target |
| - rusticl: use 32-bit address format for 32-bit devices |
| - rusticl: correctly check global argument size |
| - rusticl: fix MemConstant invalid arg size check |
| |
| Iván Briano (13): |
| |
| - anv: emit scissors when the pipeline changes |
| - anv: pipelineStageCreationFeedbackCount is allowed to be 0 |
| - anv: populate rt shader groups if they were found in the cache |
| - anv: Set meshShaderQueries for mesh shader ext features |
| - intel/utrace: create the callback events for xfb trace points |
| - anv/grl: Add a GRL file parser |
| - anv: set READ/WRITE_WITHOUT_FORMAT for buffer views |
| - vulkan_hasvk: set READ/WRITE_WITHOUT_FORMAT for buffer views |
| - anv: compile_upload_rt_shader expects a valid pointer |
| - anv: use DX rules for point rasterization |
| - hasvk: use DX rules for point rasterization |
| - anv: support VK_PIPELINE_CREATE_RAY_TRACING_SKIP_* |
| - anv: enable rayTraversalPrimitiveCulling feature |
| |
| James Park (2): |
| |
| - vulkan: Augment _WIN32 stub comparison |
| - meson,amd: Remove Windows libelf wrap |
| |
| James Zhu (4): |
| |
| - amd/common: some ASICs with gfx9 use compute rings for render |
| - util/format: add util format y8_400_unorm |
| - frontends/va: add support for yuv400 and yuv444 |
| - radeonsi/vcn: enable jpeg decode of yuv444 and yuv400 |
| |
| Jami Kettunen (1): |
| |
| - freedreno/ir3: Switch to NIR for a5xx's vertex id lowering. |
| |
| Jan Beich (1): |
| |
| - util: unify FreeBSD futex_wait signature with Linux/OpenBSD/Windows |
| |
| Jason Ekstrand (115): |
| |
| - nir: Clean up and improve nir_dedup_inline_samplers |
| - intel/fs_reg_allocate: Improve compressed instruction self-interference |
| - intel/rt: Handle halts in any-hit shaders properly |
| - intel/rt: Handle multiple exits in lower_shader_returns |
| - genxml: Add BVH data structures |
| - anv: Don't require 32-bit addresses for scratch on Gen12.5+ |
| - vulkan,anv,dozen: Use VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT |
| - radv: Use both aspects for depth/stencil blit destinations |
| - radv: Use VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT for render loops |
| - radv/meta: Respect render area for MSAA resolves |
| - radv/meta: Fix a stack use-after-free with sample locations |
| - radv/meta: Drop subpass_att from radv_fast_clear_color |
| - radv/meta: Use Begin/EndRendering for resolves |
| - radv/meta: s/dest/dst/ in a few functions |
| - radv/meta: Use Begin/EndRendering for clears |
| - radv: Drop set/restore_subpass |
| - radv/meta: Drop redundant depth_view_can_fast_clear |
| - radv: Add a image_transition helper for separate depth/stencil layouts |
| - radv: Don't assume layout == stencilLayout for stencil-only |
| - util,intel: Pull the bit packing helpers from genxml to a common header |
| - broadcom: Use util/bitpack_helpers.h |
| - util: Add util_bitpack_[su]fixed_clamp helpers |
| - asahi: Use util/bitpack_helpers.h |
| - panfrost: Use util/bitpack_helpers.h |
| - anv: Use nir_intrinsic_load_layer_id for ViewId in fragment shaders |
| - anv: Decouple primitive replication check from the fragment shader |
| - anv: Stop looking at the pipeline in multiview lowering |
| - nir: Track per-view outputs in shader_info |
| - intel/compiler: Store the number of position slots in the VUE map |
| - anv: Drop anv_pipeline::use_primitive_replication |
| - pvr: Only destroy the physcial device if it exists |
| - vulkan/cmd_buffer: add record_state to the common command buffer. |
| - vulkan/cmd_queue: Drop vk_cmd_queue::error |
| - vulkan: Add a concept of recycling an object |
| - vulkan: Re-order arguments to vk_command_pool_init |
| - vulkan: Re-order arguments to vk_command_buffer_init |
| - vulkan: Add a vk_command_buffer_ops struct |
| - vulkan: Add a common vkResetCommandBuffer() implementation |
| - vulkan: Add a common vkAllocateCommandBuffers() implementation |
| - vulkan: Recycle command buffers in vk_command_pool |
| - radv: Move to the common command pool framework |
| - tu: Move to the common command pool framework |
| - lvp: Move to the common command pool framework |
| - vulkan,docs: Add documentation Vulkan command pools |
| - vulkan: Dirty VP_VIEWPORTS/SCISSORS when copying viewports/scissors |
| - radv: Set the window scissor to the render area, not framebuffer |
| - radv: Only copy the render area from VRS to HTILE |
| - radv: Leave image layouts alone when doing HW MSAA resolves |
| - radv: Switch to dynamic rendering only |
| - rusticl: Call nir_scale_fdiv |
| - vulkan/runtime: Add a comon vk_descriptor_update_template |
| - anv: Switch to the common descriptor update template struct |
| - vulkan/runtime: Compact descriptor update templates |
| - panvk: Fix buffer views |
| - spirv: Don't use libclc for wait_group_events |
| - nir: Add a helper for finding a function by name |
| - nir/load_libclc: Don't add generic variants that already exist |
| - vulkan: Record more enabled robustness features |
| - vulkan: Add a helper for gathering pipeline robustness |
| - iris: Use a larger alignment for buffer allocations |
| - iris: Fix more BO alignments |
| - iris: Handle resource offsets in buffer copies |
| - iris: Stop looking at textures_used for samplers |
| - iris: Split max #defines for textures/samplers/images |
| - iris: Support up to 64 images |
| - iris: Support up to 128 textures |
| - intel/fs: Always use integer types for indirect MOVs |
| - intel/fs: SEL_EXEC uses the integer pipe for 64-bit stuff |
| - intel/mi_builder: add a way to reserve a register |
| - intel/mi_builder: Add a helper for incrementing reference counts |
| - anv/formats: Advertise ACCELERATION_STRUCTURE_VERTEX_BUFFER_BIT |
| - anv: Add an anv_address_map helper |
| - anv: Implement VK_KHR_pipeline_library |
| - anv: Set up the memory-backed FIFO buffer |
| - anv: Add extern "C" guards |
| - anv: Add support for compiling OpenCL-style kernels |
| - anv: Add support for OpenCL-style kernel dispatch |
| - anv: Import GRL |
| - anv/grl: Build OpenCL kernels |
| - anv/grl: Add a helper for dispatching our pre-built kernels |
| - intel/grl: Parse GRL files and generate C |
| - anv: Build BVHs on the GPU with GRL |
| - anv: Advertise ray-tracing on DG2 |
| - intel/devinfo: DG2 supports ray-tracing |
| - vulkan/wsi: Support configuring swapchain images as part of swapchain init |
| - vulkan/wsi/x11: Configure images via params passed to wsi_swapchain_init() |
| - vulkan/wsi/wayland: Configure images via params passed to wsi_swapchain_init() |
| - vulkan/wsi/display: Configure images via params passed to wsi_swapchain_init() |
| - vulkan/wsi/win32: Configure images via params passed to wsi_swapchain_init() |
| - vulkan/wsi: Always configure images in wsi_swapchain_init() |
| - vulkan/wsi: Don't expose as many image configure helpers |
| - vulkan/wsi: Add a typedef for memory type select callbacks |
| - vulkan/wsi: Add a supports_scanout flag |
| - vulkan: Use altlen when available for array lengths |
| - vulkan: Rename viewport_state::negative_one_to_one |
| - vulkan: Add a dynamic state for vp.depth_clip_negative_one_to_one |
| - vulkan: Add more dynamic rasterizer state |
| - vulkan: Add dynamic state for tessellation domain origin |
| - vulkan: Add more dynamic multisample states |
| - vulkan: Add more dynamic color blend states |
| - vulkan,lavapipe: Use a tri-state enum for depth clip enable |
| - iris: Set SamplerCount in shader packets |
| - nir: Reformat a comment |
| - nir: Use nir_shader_instructions_pass in nir_lower_readonly_images_to_tex |
| - nir: Misc. style fixes to nir_lower_readonly_images_to_tex |
| - nir: Unconditionally call nir_trim_vector in nir_lower_readonly_images_to_tex |
| - nir/types: Add some asserts to glsl_get_struct_field() |
| - nouveau/codegen: Support bindless texture queries |
| - intel/lower_mem_access_bit_sizes: Compute alignments automatically |
| - intel: Don't cross DWORD boundaries with byte scratch load/store |
| - vulkan: Handle VK_SUBPASS_EXTERNAL at the end of a subpass |
| - vulkan: Unconditionally add barriers for missing external subpass deps |
| - vulkan: Add state for extraPrimitiveOverestimationSize |
| - vulkan: Add a common implementation of CmdSetSampleLocationsEnableEXT |
| - vulkan: Add a dummy vk_common_CmdSetColorBlendAdvancedEXT() |
| |
| Jeremy Rand (1): |
| |
| - llvmpipe: bump LP_MAX_THREADS to 32 |
| |
| Jesse Natalie (82): |
| |
| - ci/windows: Re-enable Windows runners |
| - dzn: Disable extensions that require multi-view |
| - dzn: Use D3D12 constants instead of naked numbers for device limits |
| - ci/windows: Disable Windows builds while the runner is offline again |
| - ci/windows: Windows runner is back online |
| - ci/windows: Install Graphics-Tools package on the base image |
| - egl/wgl: Don't require a native display to support window surfaces |
| - egl/wgl: Implement QuerySurface |
| - wgl: Split pbuffer creation into two functions |
| - egl/wgl: Implement PBuffer surfaces |
| - egl/wgl: Set YInvert attribute to always true |
| - egl/wgl: Implement bind/release texture buffer |
| - wgl: Allow per-framebuffer swap interval overrides |
| - egl/wgl: Support eglSwapInterval |
| - egl/wgl: Support eglWaitClient |
| - egl/wgl: Support eglWaitNative |
| - egl,wgl: Support eglCreateImageKHR |
| - wgl: Pass smapi explicitly to context creation |
| - wgl: Pass smapi explicitly to framebuffer creation |
| - egl/wgl: Use a per-display st_manager |
| - egl/wgl: Hook up image validate/get in smapi |
| - egl/wgl: Support EGL_MESA_query_driver |
| - egl/wgl: Support sync objects |
| - wgl: Use pfi instead of iPixelFormat more often |
| - egl/wgl: Delete unused variables/code |
| - egl/wgl: Fix some awkward sizeof formatting |
| - meson: Don't include glsl compiler if flex/bison aren't found |
| - microsoft/compiler: Discard shouldn't be marked readnone |
| - dxil_nir_lower_int_cubemaps: When not lowering samplers, don't touch sampler types |
| - microsoft/compiler: Fix PSV struct when numthreads is 0 |
| - microsoft/clc: Handle a null clc logger |
| - gallium/windows: Delete OpenGLOn12.dll target |
| - meson: Add an option to specify the WGL gallium megadriver filename |
| - mesa: Expose GL_NV_ES1_1_compatibility |
| - microsoft/compiler: Always emit a shader at the max-supported shader model |
| - microsoft/compiler: Support up to shader model 6.5 |
| - microsoft/compiler: Add struct and function defs for SM6.6 handle funcs |
| - microsoft/compiler: Add dynamic create handle helper |
| - microsoft/compiler: Add getters for res bind/props structs |
| - microsoft/compiler: Delete double-assignment of sampler metadata field |
| - microsoft/compiler: Pass lower_bound, upper_bound, space to createhandle |
| - microsoft/compiler: Handle SM6.6 handles |
| - microsoft/compiler: SM6.6 is supported |
| - microsoft/compiler: Support SM6.7 |
| - d3d12: Get max supported shader model |
| - dzn: Get max supported shader model |
| - u_atomic: Add a helper for pointer compare-exchange |
| - d3d12: Change displayable format logic |
| - ci/windows: Update build container image with all deps so wraps can be disabled |
| - deps: Update DirectX-Headers req to 1.606.4 |
| - d3d12: Move some things from screen late-init to early-init |
| - d3d12: Store the d3d12 module reference on the screen |
| - d3d12: Support device factories in addition to global device creation |
| - d3d12: Attempt to use the Agility SDK |
| - d3d12: Use ID3D12DeviceConfiguration for root signature serialization |
| - d3d12: Release sharing contract in flush_frontbuffer |
| - dzn: Store the d3d12 module reference on the instance |
| - dzn: Support device factories in addition to global device creation |
| - dzn: Use architecture props to return correct device type |
| - dzn: Attempt to use the Agility SDK |
| - dzn: Use ID3D12DeviceConfiguration for root signature serialization |
| - ci/windows: Update vk-gl-cts to pick up new zlib dependency |
| - d3d12: Don't put permanently-resident resources in the residency bo list |
| - d3d12: Don't multiply cube array sizes by 6 |
| - d3d12: Set PIPE_CAP_MAX_TEXTURE_UPLOAD_MEMORY_BUDGET |
| - dzn: Set factory flags before creating device |
| - d3d12: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT |
| - include: Add WGL version of GL interop extensions |
| - mesa, dri2: Move interop helpers from dri2 to state tracker |
| - include: Add a flush interop method |
| - mesa: Make fence_sync not static |
| - mesa/st: Split interop object lookup from export |
| - mesa/st: Implement the new flush method |
| - dri: Implement the new flush method |
| - glx: Implement the new flush method |
| - egl/dri2: Implement the new flush method |
| - wgl: Implement interop methods |
| - egl/wgl: Implement interop methods |
| - include: Add driver data to v2 of mesa_glinterop_device_info |
| - gallium: Add screen methods for driver-private data on interop interface |
| - mesa/st: Query driver for interop data |
| - d3d12: Hook up driver-private interop data |
| |
| Jessica Clarke (1): |
| |
| - panfrost/blend: Fix invalid const values leading to NIR validation errors |
| |
| Joan Bruguera (1): |
| |
| - nv50/ir/nir: ignore sampler for TXF/TXQ ops. |
| |
| John Brooks (5): |
| |
| - radv: Add get_addrlib function to radv_radeon_winsys |
| - vulkan: Introduce vk_format_is_block_compressed function |
| - radv: Only apply mipmap view adjustments to block compressed images |
| - radv: Fix corrupted mipmap copies on GFX9+ |
| - radv: Fix mipmap views on GFX10+ |
| |
| Jonathan (3): |
| |
| - gallium/u_threaded: buffer subdata merging (v2) |
| - gallium/u_threaded: Prepare for buffer sharedness tracking |
| - gallium/u_threaded: buffer sharedness tracking |
| |
| Jonathan Gray (2): |
| |
| - iris: check i915 features after hw gen |
| - util: include sys/time.h for timespec functions |
| |
| Jordan Justen (15): |
| |
| - iris: Drop extra file-descriptor dup in iris_drm_screen_create() |
| - intel/pci_ids: Update ADL-S strings |
| - intel/pci_ids: Add 0x468b ADL-S PCI-id |
| - intel/pci_ids: Update ATS-M device names |
| - intel/pci_ids: Update DG2 device names |
| - intel/pci_ids: Add dg2 0x5698 pci-id |
| - intel/pci_ids: Drop non-upstream dg2 pci-ids |
| - intel/compiler: Use builder to allocate fs regs for gs control data bits |
| - intel/devinfo: Add MTL platforms enums and intel_device_info_is_mtl() |
| - intel/l3: Use L3 full-way allocation setting for gfx12.5 (DG2, MTL) |
| - anv/meson: Use anv_flags and anv_cpp_flags in genX compiles |
| - meson: Deprecate vulkan-rt-drivers intel |
| - meson: Define with_intel_vk_rt based on with_intel_clc |
| - intel/compiler: Broadcast lower code should check 64-bit int support |
| - ci/d3d12: Update quick_shader results with 24 fixes from !19128 |
| |
| Jose Maria Casanova Crespo (4): |
| |
| - CI: Igalia farm is down |
| - Revert "CI: Igalia farm is down" |
| - vc4: consolidate shader-db output |
| - v3d: Minor fixes on sand8 blit based on sand30 modifications |
| |
| Joshua Ashton (2): |
| |
| - radv: Remove workarounds for D3D9 feedback loops |
| - freedreno: Disable 8bpp_ubwc on a6xx gen2 |
| |
| José Fonseca (1): |
| |
| - util: Add macro util_get_cpu_caps in u_cpu_detect.c for avoid accidentally call it in u_cpu_detect.c |
| |
| José Roberto de Souza (34): |
| |
| - intel: Simply intel_gem_create_context_engines() |
| - anv: Remove duplicated memset() in physical device creation |
| - anv: Remove anv_app_info |
| - anv: Free vmas in case device creation fails in pthread_mutex_init(&device->mutex |
| - anv: Do not duplicate intel_device_info memory in each logical device |
| - anv: Only wait for queue sync if execbuf was properly executed |
| - anv: Do not copy garbage to batch_bo |
| - anv: Nuke anv_execbuf_init() |
| - anv: Group all context operations during device creation |
| - anv: Nuke dead code |
| - anv: Return earlier in anv_gem_get_tiling() when not supported |
| - intel/compiler/fs: Fix compilation of shaders with SHADER_OPCODE_SHUFFLE of float64 type |
| - intel/compiler/fs: Use DF to load constants when has_64bit_int is not supported |
| - vulkan_hasvk: Nuke dead code around I915_ENGINE_CLASS_COMPUTE |
| - anv: Nuke cmd_parser_version |
| - intel/dev: Adjust prefetch_size values for MTL engines |
| - intel: Share code to read render timestamp |
| - intel: Make engine related functions and types not i915 dependent |
| - intel/dev: Split i915 specific parts of intel_get_device_info_from_fd() |
| - anv: Move fetch of i915 physical device parameters |
| - anv: Split i915 specific parts of anv_queue_submit_simple_batch() |
| - anv: Use Vulkan types for priority as much as possible |
| - anv: Split the debug part of anv_queue_exec_locked() |
| - intel: Convert missing i915 engine types to intel |
| - intel: Convert i915 engine type to intel in tools/ common/ and ds/ |
| - intel: Add and use intel_engines_class_to_string() |
| - hasvk: Nuke code around local memory |
| - hasvk: Fix build around intel_measure_state_changed() call |
| - iris: Fix enablement of protected contexts |
| - intel/perf: Use intel_device_info functions to compute subslice and eu totals |
| - iris: Nuke pci_id from iris_screen |
| - iris: Drop duplicated errno handling in iris_bo_wait() |
| - iris: Do not export iris_bo_wait() |
| - iris: Set priority for replaced engine context |
| |
| Juan A. Suarez Romero (3): |
| |
| - vc4/simulator: use i915/amd ioctls for BO |
| - vc4: properly restore vc4 debug option |
| - vc4: store tex sampler in proper register |
| |
| Julia Tatz (1): |
| |
| - zink: Advertise PIPE_CAP_NATIVE_FENCE_FD |
| |
| Juston Li (6): |
| |
| - venus: Enable VK_EXT_texture_compression_astc_hdr |
| - venus: sync venus protocol headers for VK_EXT_primitive_topology_list_restart |
| - venus: add support for VK_EXT_primitive_topology_list_restart |
| - venus: sync to latest venus protocol headers |
| - venus: add support for VK_EXT_multi_draw |
| - venus: use buffer cache for vkGetDeviceBufferMemoryRequirements |
| |
| Kai Wasserbäch (22): |
| |
| - chore(deps): clover: raise the minimum LLVM version to 11.0.0 |
| - fix(FTBFS): meson: raise C++ standard to C++17 |
| - chore(docs): rusticl: improve list of build dependencies |
| - fix: util/format: unused variable ‘desc’ [-Wunused-variable] |
| - fix: nir: unused variable ‘else_block’ [-Wunused-variable] |
| - fix: ac/llvm: unused variable ‘offset’ [-Wunused-variable] |
| - fix: mesa: unused variable ‘ret’ [-Wunused-variable] |
| - fix: mesa/st: variable ‘texobj’ set but not used [-Wunused-but-set-variable] |
| - fix: mesa/st: unused variable ‘src’ [-Wunused-variable] |
| - fix: gallivm: variable ‘type_kind’ set but not used [-Wunused-but-set-variable] |
| - fix: r600/sb: unused variable ‘repdep2’ [-Wunused-variable] |
| - fix: r600/sb: unused variable ‘fop’ [-Wunused-variable] |
| - fix: r600/sb: unused variable ‘r’ [-Wunused-variable] |
| - fix: r600/sfn: variable ‘opinfo’ set but not used [-Wunused-but-set-variable] |
| - fix: r600/sb: warning: unused variable ‘{b,nl}’ [-Wunused-variable] |
| - fix: r600/sfn: unused variable ‘splitpos’ [-Wunused-variable] |
| - fix: r600/sfn: unused variable ‘{splitpos,param}’ [-Wunused-variable] |
| - fix: r600/sfn: variable ‘fail’ set but not used [-Wunused-but-set-variable] |
| - fix: r600/sfn: unused variable ‘spi_sid’ [-Wunused-variable] |
| - fix: iris: unused variable ‘devinfo’ [-Wunused-variable] |
| - fix: zink: unused variable ‘intr’ [-Wunused-variable] |
| - fix: frontends/va: unused variable ‘max_pipe_hevc_slices’ [-Wunused-variable] |
| |
| Karmjit Mahil (30): |
| |
| - pvr: Fix bo mapping on alloc with PVR_BO_ALLOC_FLAG_CPU_ACCESS. |
| - pvr: Compete pvr_calc_fscommon_size_and_tiles_in_flight(). |
| - pvr: Fix calculation in rogue_max_compute_shared_registers(). |
| - pvr: Change indentation from tab to spaces for xml files. |
| - pvr: Set descriptor dirty flag based on other flags. |
| - pvr: Finish setting up job resolve info. |
| - pvr: Remove unimplemented push descriptor code. |
| - pvr: Add depth_bias_array handling on dbenable. |
| - pvr: Add clear program in pvr_device. |
| - pvr: Add static clear control stream templates in pvr_device. |
| - pvr: Add static clear VDM state in pvr_device. |
| - pvr: Add graphics pipeline barrier handling. |
| - Revert "pvr: Make pvr_cmd_pack() macro clearly internal" |
| - pvr: Implement clear ppp state emission from template. |
| - pvr: Add mid fragment pipeline barrier if needed. |
| - pvr: Handle VK_CULL_MODE_FRONT_AND_BACK. |
| - pvr: Emit cs words for load op on vkCmdBeginRenderPass(). |
| - pvr: Add basic skeleton for event sub cmd. |
| - pvr: Remove struct pvr_emit_state and emit header directly. |
| - pvr: Add csb helpers macros to write into raw buffer. |
| - pvr: Make control stream word writing stricter. |
| - pvr: Complete pvr_emit_ppp_state(). |
| - pvr: Add assert for texturestate being 0 in when emitting ppp state. |
| - pvr: Add EMIT_MASK in pvr_emit_ppp_state(). |
| - pvr: Fix possible seg fault on csb copy. |
| - pvr: Fix overflow before widen warning for rgn headers size calculation. |
| - pvr: Handle pipeline barrier vk_sync. |
| - pvr: Remove outdated comments. |
| - pvr: Add PBE accum format size in bytes. |
| - pvr: Change pvr_get_hw_clear_color() to pack based on accum formats. |
| |
| Karol Herbst (189): |
| |
| - printf: extract clovers printf impl |
| - printf: add some unit tests |
| - nir: serialize printf metadata for CL kernels |
| - nir/lower_images: extract from clover |
| - nir: extract the clc inline sampler dedup pass from clc |
| - microsoft/clc: drop inline sampler reordering |
| - clc: undefine spirv defs to work around LLVMs headers |
| - vtn: silence warning about linkage |
| - gallium: mark the input data as const in pipe_grid_info |
| - util/memstream: insert null terminator on windows |
| - ci: update fails list |
| - ci: bumping all tags to make sure we don't have hidden fails |
| - nvc0: limit max global and alloc size |
| - ci: update CI to reflect clovers LLVM version bump |
| - nv50: fix code heap after pipe_shader_enum change |
| - nouveau: use the contexts pushbuf and client where possible |
| - nouveau/mm: make code thread safe |
| - nouveau/buffer: simplify uses of nouveau_fence_work |
| - nouveau/fence: rework nouveau_fence_emit so we can call it on emitted fences |
| - nouveau: move nouveau_context initialization to common code |
| - nouveau: wrap nouveau_bo_map |
| - nouveau: wrap nouveau_bo_wait |
| - nouveau: wrap all nouveau_pushbuf_space calls |
| - nouveau: wrap nouveau_pushbuf_validate |
| - nouveau: wrap nouveau_pushbuf_refn |
| - nouveau: use PUSH_KICK instead of nouveau_pushbuf_kick |
| - nouveau/fence: make the fence struct part of the fence API |
| - nv50: remove nouveau_fence_signalled prior nouveau_fence_wait |
| - nv50: remove nv50_bufctx_fence call in vbo_kick_notify |
| - nouveau/fence: per context fence |
| - nouveau: per context client and pushbuf |
| - nouveau: make fencing race free |
| - nvc0: make state handling race free |
| - nv50: race free state tracking |
| - nv50/ir: fix OP_UNION resolving when used for vector values |
| - nv50: properly flush the TSC cache on 3D |
| - rusticl: added |
| - rusticl: finish implementing clBuildProgram |
| - rusticl/util: add static_assert macro |
| - rusticl: implement clFinish and clFlush |
| - rusticl: implement clEnqueueReadBuffer |
| - rusticl/mem: support ops on subbuffers |
| - rusticl/kernel: basic implementation |
| - rusticl: nir bindings |
| - rusticl: translate spirv to nir and first steps to kernel arg handling |
| - rusticl/kernel: more clGetKernelWorkGroupInfo props |
| - rusticl/kernel: clGetKernelWorkGroupInfo allows a NULL device |
| - rusticl/mem: use helper context for COPY_HOST_PTR buffers |
| - rusticl/event: wrong but non crashing impl of clWaitForEvents |
| - rusticl: hack for CL 3.0 |
| - rusticl/mesa: add fencing support |
| - rusticl/event: proper eventing support |
| - rusticl/kernel: implement clCreateKernelsInProgram |
| - rusticl/kernel: implement clEnqueueTask |
| - rusticl/program: undefine __IMAGE_SUPPORT__ if images are unsupported |
| - rusticl/queue: fix clReleaseCommandQueue |
| - rusticl/mem: implement clCopyBuffer |
| - rusticl/mem: implement clFillBuffer |
| - rusticl/event: implement marker and barrier |
| - rusticl/mem: add clEnqueueMigrateMemObjects stub |
| - rusticl/program: support compiling libraries |
| - rusticl: implement clUnloadPlatformCompiler |
| - rusticl/api: add param to query which contains application provided values |
| - rusticl/program: implement clCreateProgramWithBinary |
| - rusticl/event: add fake impl of clGetEventProfilingInfo |
| - rusticl/kernel: implement CL_KERNEL_COMPILE_WORK_GROUP_SIZE |
| - rusticl/mem: finish clEnqueueMigrateMemObjects |
| - rusticl/kernel: run driver requested lowering passes |
| - rusticl/kernel: add support for offsets |
| - rusticl/kernel: add missing preprocessor definitions |
| - rusticl: advertize CL 1.1 and CL 1.2 extensions |
| - rusticl/context: implement clSetContextDestructorCallback |
| - rusticl/kernel: implement clCloneKernel |
| - rusticl/device: set required double fp config values |
| - rusticl/device: advertize atomic caps |
| - rusticl: claim support for generic pointers |
| - rusticl/program: add il stubs |
| - rusticl: add svm func stubs |
| - rusticl/icd: add more func pointers |
| - rusticl/kernel: report SIMD width as work group sizes |
| - rusticl/kernel: sweep nir to reduce peak memory usage |
| - rusticl/device: expose cles_khr_int64 for compatible embedded devices |
| - rusticl/mem implement the memory \*WithProperties API |
| - rusticl/pipe: add clGetPipeInfo stub end return CL_INVALID_MEM_OBJECT |
| - rusticl/program: add stubs for program ctors and dtors |
| - rusticl/kernel: add stub for clGetKernelSubGroupInfo |
| - rusticl/event: timeout wait on condvar to abort on reaped worker threads |
| - rusticl: add support for printf |
| - rusticl/spirv: add print method |
| - rusticl/mesa: support mapping textures |
| - rusticl/mem: support read/write/copy ops for images |
| - rusticl/memory: rework mapping tracking |
| - rusticl/mem: implement maping images |
| - rusticl/mem: implement fill image |
| - rusticl/mem: implement copy image |
| - rusticl/mem: implement clCreateImage2D and 3D |
| - rusticl/mem: implement copies between buffers and images |
| - rusticl/mesa/context: un Arc the PipeContext |
| - rusticl/mem: implement clCreateSamplerWithProperties |
| - rusticl/kernel: support for images |
| - rusticl/kernel: inline samplers |
| - rusticl/kernel: lower memcpy |
| - rusticl/device: add retain/release callbacks for luxmark v4 |
| - rusticl/device: implement cl_khr_3d_image_writes |
| - rusticl/kernel: set CL_DEVICE_PREFERRED_WORK_GROUP_SIZE_MULTIPLE |
| - rusticl/kernel: run some more opt passes |
| - rusticl: get rid of a few clones |
| - rusticl/memory: don't map more than necessary for copy operations |
| - rusticl/device: fix api consistency_device_and_host_timer test |
| - rusticl/queue: implement missing CL 3.0 bits |
| - rusticl/device: fix compiler features_macro |
| - rusticl/mem: add user_ptr fallback shadow buffer |
| - rusticl/mesa: make resource_from_user_memory an optinal feature |
| - rusticl/kernel: implement image_format and image_order |
| - rusticl/mem: fix IMAGE1D_BUFFER |
| - rusticl/mem: only write pitch when required |
| - rusticl: port to Rust 2018 |
| - rusticl: kernel caching |
| - rusticl/kernel: prepare for nir caching |
| - rusticl/kernel: cache the nir as well |
| - rusticl: fix compiler features_macro |
| - rusticl/kernel: optimize local size |
| - rusticl: the CTS is a piece of shit |
| - rusticl/kernel: implement CL_KERNEL_ATTRIBUTES |
| - rusticl/program: some boilerplate code for SPIR-V support |
| - rusticl/memory: kernel read_write images prep work |
| - rusticl: disable fp64 support |
| - rusticl/device: allow overwriting the device_type via env |
| - rusticl/program: parse quoted paths in args |
| - rusticl/kernel: fix local buffers |
| - rusticl/device: report mesas version for CL_DRIVER_VERSION |
| - rusticl/icd: implement clGetExtensionFunctionAddressForPlatform |
| - rusticl/icd: add some way of debugging CL function calls |
| - rusticl/kernel: use real references for arguments |
| - rusticl: proper PIPE_MAP flags for internal maps |
| - rusticl: call glsl_type_singleton_init_or_ref |
| - rusticl/program: some more API validation |
| - ci: rusticl with llvmpipe |
| - rusticl: add README file |
| - rusticl/device: print error when libclc fails to load |
| - rusticl/mem: return errors for OOB accesses |
| - rusticl/device: fix custom device detection |
| - rusticl/mem: fix image OOB checks |
| - rusticl/mem: align fill pattern buffer to 4 bytes |
| - rusticl: limit global mem to 2GB |
| - rusticl/kernel: assign locations before passing the nir to drivers |
| - ac/llvm: support non 32 bit sized workgroup ids |
| - ac/llvm: fix load_const for vectors with more than 4 elements |
| - ac/llvm: fix load/store_shared for vectors with more than 4 elements |
| - ac/llvm: fix load_ubo for vectors with more than 4 elements |
| - radeonsi: stop vectorizing unpack_32_2x16_split |
| - iris: use images_used instead of num_images |
| - iris: speed up walking global bindings |
| - iris: bump IRIS_MAX_GLOBAL_BINDINGS to 128 |
| - rusticl/kernel: preserve fp16 denorms to fix vload/vstore_half |
| - rusticl: add support for coherent resources |
| - rusticl/mesa: add bx() method to PipeTransfer |
| - rusticl: rework resource mappings a little |
| - rusticl: add helper ctx wrapper for coherent and direct mapping |
| - rusticl/mem: rewrite the (un)mapping code |
| - lp: claim being UMA |
| - rusticl/mem: propper CL_MEM_ALLOC_HOST_PTR support |
| - radeonsi: fail creating textures from user memory |
| - nir/lower_cl_images: set binding |
| - radeonsi: lower hadd and fisnormal |
| - rusticl: remove blit code |
| - rusticl: add RUSTICL_ENABLE to have a global way of enabling devices |
| - docs: Add documentation for Rusticl's env variables |
| - ci/llvmpipe: update to new RUSTICL_ENABLE var |
| - zink: handle more opcodes for CL |
| - zink: handle f2f16_rtz |
| - zink: fix spirv_builder_spec_const_uint |
| - aux/trace: add set_global_binding |
| - rusticl: force BIND_LINEAR on staging resources |
| - rusticl/mem: can only map staging textures directly |
| - rusticl: add small bitset impl |
| - rusticl/nir: add reads_sysval wrapper |
| - rusticl/kernel: lower system values before gathering info |
| - rusticl/kernel: add work_dim lowering |
| - zink: remove leftover work_dim lowering code |
| - rusticl/kernel: fix more 32 bit problems |
| - nir/algebraic: add 8 and 64 bit urol and uror lowering |
| - nir/algebraic: support CL vector accessors |
| - nir/algebraic: generalize vector_cmp lowering |
| - nir/algebraic: add vec8/16 cmp lowering |
| - iris: invalidate sysvals if grid dimension changes |
| - glsl: fix buffer texture type |
| - rusticl/device: put space at the end of CL_DEVICE_VERSION |
| - nir/lower_int64: fix shift lowering |
| |
| Kenneth Graunke (53): |
| |
| - intel/compiler: Use named NIR intrinsic const index accessors |
| - iris: Pass devinfo to iris_resource_level_has_hiz() |
| - iris: Enable HiZ for non-8x4 aligned miplevels on Icelake and later |
| - iris: Always retain ISL_AUX_USAGE_HIZ_CCS_WT in texture aux usage |
| - iris: Clean up iris_sample_with_depth_aux() |
| - intel/compiler: Drop variable group size lowering |
| - iris: Fix PIPE_CAP_UMA |
| - iris: Use linear for exported resources if we can't convey tiling |
| - intel/compiler: Change dg2_plus check to devinfo->verx10 >= 125 |
| - iris: Delete unused iris_screen::aperture_bytes field |
| - iris: Don't print out XXX messages if resource creation fails |
| - iris: Fall back if iris_map_copy_region can't create a staging resource |
| - iris: don't create staging resources larger than half the aperture |
| - crocus: Don't print out XXX messages if resource creation fails |
| - crocus: Fall back if iris_map_copy_region can't create a staging resource |
| - crocus: Fix memory leaks on iris_resource_create failure paths |
| - anv: Fail to create a device on ver < 9 |
| - anv: Drop checks for version 8 or 9 |
| - anv: Delete image param support. |
| - anv: Delete batch buffer growing code. |
| - anv: Delete shader constants UBO from descriptor sets |
| - anv/tests: Don't use relocations in a test case |
| - anv: Delete "back" allocation from state pool |
| - anv: Delete relocation support from anv_block_pool |
| - anv: Delete "back" allocation from anv_block_pool |
| - anv: Drop state pool relocation munging |
| - anv: Delete use_relocations flag |
| - anv: Delete softpin checks |
| - anv: Delete anv_reloc_list_add() |
| - anv: Delete wrapper BOs for relocations |
| - anv: Delete relocation support from batch submission |
| - anv: Delete has_a64_buffer_access flag |
| - anv: Delete has_bindless_images and has_bindless_samples flags |
| - anv: Make a helper function for pinning a state pool's BOs |
| - anv: Drop offset from anv_reloc_list_append |
| - anv: Inline write_reloc into the only remaining caller |
| - anv: Remove anv_batch_emit_reloc and just open-code it |
| - intel/compiler: Use subgroup invocation for ICP handle loads |
| - iris: Ignore aux for copy_region source if there's no unresolved color |
| - intel/compiler: Use an existing URB write to end TCS threads when viable |
| - st/mesa: Optionally call nir_vectorize_tess_levels() |
| - intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes |
| - intel/genxml: Add XY_FAST_COLOR_BLT |
| - blorp: Make blitter_supports_aux accessible from multiple files. |
| - blorp: Implement blitter clears via XY_FAST_COLOR_BLT |
| - blorp: Fix typo in blorp_xy_block_copy_blt |
| - Revert "intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes" |
| - iris: Add miplevel parameters to iris_resource_texture_aux_usage |
| - intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes [v2] |
| - iris: Use nir_intrinsic_load_global_constant for large constants |
| - intel/compiler: Run nir_opt_large_constants before scalarizing consts |
| - st/mesa: Let nir_opt_access() infer non-readable |
| - nir: Drop infer_non_readable option for nir_opt_access() |
| |
| Konrad Kleine (1): |
| |
| - Update docs/drivers/llvmpipe.rst |
| |
| Konstantin Kharlamov (5): |
| |
| - loader/dri3: simplify check for reply |
| - loader/dri3: remove a no-op free call |
| - meson: remove source_root() call in nir compiler path |
| - meson: remove source_root() call in main meson.build |
| - .mailmap: change spelling for Constantine Kharlamov |
| |
| Konstantin Seurer (63): |
| |
| - radv/nir_lower_abi: Use instructions_pass |
| - radv: Move accel struct structs to bvh/bvh.h |
| - radv: Move radv_acceleration_structure |
| - radv: Remove acceleration structure host builds |
| - radv: Always emulate fmin/fmax |
| - radv: Remove accel_struct_build |
| - radv: Remove unused push constant structs |
| - radv: Add create_build_pipeline_spv helper |
| - radv: Add the basics for GLSL bvh kerrnels |
| - radv: Add a GLSL morton kernel implementation |
| - radv: Switch to the GLSL morton implementation |
| - radv: Add a GLSL internal kernel implementation |
| - radv: Switch to the GLSL internal implementation |
| - radv: Add a GLSL leaf kernel implementation |
| - radv: Switch to the GLSL leaf implementation |
| - lavapipe: Remove st_shader_stage_to_ptarget |
| - radv: Fix stack size calculation with stage ids |
| - radv: Inline bvh build headers |
| - radv/ci: Remove host build related fails |
| - lavapipe: Set ss_dirty in emit_state |
| - radv: Advertise subgroup ops for rt stages |
| - vulkan: Add common physical device management |
| - radv: Use the common physical device enumeration |
| - anv: Use the common physical device enumeration |
| - turnip: Use the common physical device enumeration |
| - lvp: Use the common physical device enumeration |
| - panvk: Use the common physical device enumeration |
| - v3dv: Use the common physical device enumeration |
| - radv: Proper handling for inactive instance nodes |
| - radv: Deduplicate push constant structs |
| - radv/rra: Map accel struct VAs to handles |
| - radv/rra: Remove redundant bounds validation |
| - radv: Make the radv_buffer_get_va parameter const |
| - radv/rra: Replace aliasing assert with a warning |
| - radv: Explicitly store the VA of accel structs |
| - radv: Cleanup radv_GetInstanceProcAddr |
| - radv: Add a common traversal build helper |
| - radv/rq: Use the common traversal helper |
| - radv/rtpso: Use the common traversal helper |
| - radv: Use scalar layout for BDA references |
| - radv: Add and use AS and scratch layout structs |
| - radv: Fix incorrect build info indexing |
| - radv: Use cache_uuid for accel struct compatibility |
| - radv: Remove main_loop_case_visited |
| - radv/rra: Transcode nodes recursively |
| - radv: Rename node_internal to node_box32 |
| - radv: Add radv_bvh_node_box16 |
| - radv: Use half floats for box16 coords |
| - radv/rra: Handle box16 nodes |
| - radv/rra: Calculate bvh size requirements recursively |
| - radv: Remove create_accel_build_shader |
| - radv/rra: Fix dumps in the case of aliasing |
| - radv/rra: Validate before gathering bvh info |
| - radv/rra: Add basic header validation |
| - radv/rra: Continue dumping accel structs if validation fails |
| - radv/rra: Use the accel struct type for header validation |
| - radv/rt: Fix setting tmax for opaque AABBs |
| - radv/rt: Load instance id and custom index on demand |
| - radv/rt: Set vars.arg in the traversal shader |
| - radv/rt: Restore prev barycentrics when rejecting hits |
| - radv/rra: Fix copying accel structs that were not built yet |
| - radv/ray_queries: Fix AABB handling |
| - radv/rt: Check space before emitting descriptors |
| |
| Kostiantyn Lazukin (1): |
| |
| - vulkan/cmd_queue: Do not generate unreachable vk_free_* calls. |
| |
| Kuixi Ren (1): |
| |
| - radeonsi/vcn: Add ability to encode with ltr |
| |
| Laurent Bigonville (1): |
| |
| - Try to fix FTBFS on kfreebsd architecture |
| |
| Leandro Ribeiro (9): |
| |
| - vulkan/wsi/wayland: move some structs to beginning of code |
| - vulkan/wsi/wayland: remove unnecessary spaces in struct fields |
| - vulkan/wsi/wayland: introduce struct wsi_wl_surface |
| - vulkan/wsi/wayland: take ownership of wsi_wl_surface when creating chain |
| - vulkan/wsi/wayland: move wl_surface and wl_display from chain to struct wsi_wl_surface |
| - vulkan/wsi/wayland: remove refcount from struct wsi_wl_display |
| - vulkan/wsi/wayland: add default dma-buf feedback support |
| - vulkan/wsi/wayland: add per-surface dma-buf feedback support |
| - vulkan/wsi/wayland: avoid useless re-allocations when receiving per-surface dma-buf feedback |
| |
| Leo Liu (2): |
| |
| - meson: add with_gallium_virgl to allow it as VA backend driver |
| - frontends/va: fix build error for vaSyncBuffer with older VA |
| |
| LingMan (10): |
| |
| - rusticl: Fix compilation if stdout and/or stderr aren't symbols |
| - rusticl/bindgen: Use \`allowlist-\*\` instead of \`whitelist-\*\` switches |
| - docs/rusticl: Document minimum required bindgen version |
| - rusticl/api: Use iterators in create_program_with_source |
| - rusticl/api: Drop UTF-8 conversion of input source |
| - rusticl/api: Fix creating a program if a nul byte is within the specified string length |
| - rusticl/api: Factor pushing to the Vec out of the if |
| - rusticl/api: Shrink unsafe block |
| - rusticl/api: Don't check the program source for nul bytes needlessly |
| - rusticl/api: Interpret \`lengths\` as a slice of Option<NonZeroUsize> |
| |
| Lionel Landwerlin (152): |
| |
| - anv: fixup PIPE_CONTROL restriction on gfx8 |
| - intel/ds: track untyped dataport flushes |
| - intel/fs: store num of resume shaders in prog_data |
| - intel/fs: fixup simd selection with shader calls |
| - intel/compiler: document units of brw_ubo_range fields |
| - intel/nir: specify synchronous value for tracing op |
| - anv: don't return incorrect error code for vkCreateDescriptorPool |
| - intel/drm-shim: update shim to support DG2 |
| - anv: fix GetPipelineExecutableStatistics for ray tracing pipelines |
| - anv: fixup assertions on lowered storage formats |
| - anv: fix assert in memory budget code when extension is not supported |
| - anv: prevent trying to mmap non host visible memory |
| - anv: remove the LOCAL_MEM allocation bit |
| - intel/fs: fix load_scratch intrinsic |
| - intel/fs: fixup scratch load/store handling on Gfx12.5+ |
| - intel/fs: switch compute push constant loads to LSC |
| - intel/fs: remove unused opcode |
| - intel/fs: fixup SEND validation check on overlapping src0/src1 |
| - intel/fs: switch register allocation spilling to use LSC on Gfx12.5+ |
| - intel/fs: port block a64/surface messages to use LSC |
| - intel/fs: bump max SIMD size for A64 atomics with LSC |
| - intel: add a hasvk vulkan driver |
| - hasvk: remove entrypoints for gfx9+ |
| - hasvk: stop advertising Vk 1.3 on non-softpin |
| - hasvk: remove acceleration structure code |
| - hasvk: remove ray tracing code |
| - hasvk: remove mesh code |
| - anv: silence fedora build warning |
| - anv: remove support for gfx7/8 |
| - anv: remove unused gfx7 code |
| - anv/tests: remove back allocation tests |
| - anv/hasvk: tweak loading failure messages |
| - anv: remove non present field in upstream spec |
| - vulkan: fixup 1.3.226 update |
| - anv: enable EXT_depth_clamp_zero_one |
| - hasvk: expose VK_EXT_depth_clamp_zero_one |
| - anv: add a new NO_LOCAL_MEM allocation flag |
| - anv: don't export flush_compute_state |
| - anv: don't export gfx state flushing helper |
| - anv: combine flushes in Draw/DrawIndexed/DrawIndirectByteCountEXT |
| - anv: add missing tracepoint |
| - blorp: defined operations for debug purposes |
| - intel/utrace: make blorp tracepoints more readable |
| - anv: trace xfb queries |
| - anv: clear descriptorsets if AllocateDescriptorSets fails |
| - anv: add support for EXT_mutable_descriptor_type |
| - anv: add missing wokraround for texture cache invalidate |
| - intel/nir/rt: fixup generate hit |
| - intel/nir/rt: spill/fill the entire ray query data |
| - intel/nir/rt: remove ray query mem hit writes at initialization |
| - intel/nir/rt: fix ray query proceed level |
| - intel/nir/rt: change scratch check validation |
| - intel/nir/rt: store ray query state in scratch |
| - intel/fs: fixup a64 messages |
| - intel/nir/rt: fixup alignment of memcpy iterations |
| - ci: add python3-ply to debian/fedora images |
| - ci: disable intel-clc on fedora |
| - ci: build our own version of the LLVM SPIRV translator |
| - ci/debian: don't use libclc from the system |
| - ci: bump llvm to 13 for some builders |
| - ci: disable intel-clc on debian-vulkan |
| - anv: remove HDC flush from invalidate bits |
| - meson: bump required llvm-spirv version with intel-clc |
| - intel/devinfo: Rename & implement num_dual_subslices |
| - intel/rt: fix procedural primitive ID access |
| - intel/nir: reuse rt helper |
| - intel/fs: take a builder arg for resolve_source_modifiers() |
| - intel/nir: fixup preserved metadata in rayquery lowering |
| - intel/nir: fix potential invalid function impl ptr usage |
| - intel/nir: disable assert on async stack id |
| - intel/fs: disable split_array_vars on opencl kernels |
| - genxml: add missing no duplicate anyhit flag |
| - intel/mi_builder: allow half GP registers for dereferencing |
| - anv: disable SIMD16 for RT shaders |
| - anv: add new command buffer space allocation |
| - anv: setup scratch space correctly for RT shaders |
| - anv: bump client visible address heap to 32GiB |
| - anv: use the right dispatch size for tracing shaders |
| - anv: flag BO for write combine when CPU visible and potentially in lmem |
| - iris: better error message with GuC loading failures |
| - intel: add INTEL_DEBUG=capture-all to capture everything upon hang |
| - pps: enable track_event in intel.cfg |
| - anv: add grl build dependency on entrypoints |
| - nir: fix NIR_DEBUG=validate_ssa_dominance |
| - vulkan/runtime: don't lookup the pipeline disk cache if disabled |
| - anv: remove unused code |
| - anv: comment out the Gfx8/9 VB cache key workaround for newer Gens |
| - anv: limit calls into cmd_buffer_flush_dynamic_state |
| - anv: optimize 3DSTATE_VF emission |
| - anv: remove multiple push descriptors |
| - isl: avoid gfx version switch cases on the hot path |
| - anv: limit push constant reemission |
| - anv: track descriptor set layout flags |
| - anv: initialization pipeline layout to 0s |
| - anv: add analysis for push descriptor uses and store it in shader cache |
| - anv: reduce BT emissions & surface state writes with push descriptors |
| - anv: add a layer for hitman3 |
| - nir/lower_shader_calls: rematerialize more trivial values |
| - nir/lower_shader_calls: lower scratch access to format internally |
| - nir/lower_shader_calls: avoid respilling values |
| - nir/lower_shader_calls: rematerialize values in more complex cases |
| - nir/lower_shader_calls: add NIR_PASS_V internally |
| - nir/lower_shader_calls: cleanup shaders a bit more post split |
| - nir/lower_shader_calls: add an option structure for future optimizations |
| - nir/lower_shader_calls: add a pass to trim scratch values |
| - nir/lower_shader_calls: add a pass to sort/pack values on the stack |
| - nir/lower_shader_calls: move scratch loads closer to where they're needed |
| - nir/lower_shader_calls: run opt_cse after lower stack intrinsics |
| - radv: tweak lower_shader_calls parameters |
| - anv: enable localized loads for lower_shader_calls |
| - anv: disable mesh in memcpy |
| - anv: init major/minor before WSI |
| - vulkan/wsi/wl: correctly find whether the compositor uses the same GPU |
| - nir/divergence_analysis: add some missing RT intrinsics |
| - nir/divergence_analysis: add missing desc_set_address_intel |
| - intel/clc: assert when libclc shader is not found |
| - drm-uapi: bump headers |
| - isl: add new MOCS field for protected buffers |
| - intel/common: add detection of protected context support |
| - dri: rename PROTECTED_CONTENT in PROTECTED_SURFACE |
| - gallium: rename PROTECTED_CONTENT cap into PROTECTED_SURFACE |
| - egl: Add EGL_EXT_protected_content support |
| - st/gallium: plumb protected context creation |
| - iris: handle protected BO creation |
| - iris: Emit protection & session ID on protected command buffers |
| - iris: enable protected contexts |
| - intel/fs: require UNDEFs register offsets to be aligned to REG_SIZE |
| - intel/fs: make split_virtual_grfs deal with partial undefs |
| - intel/fs: reduce liveness of variables in lowering passes |
| - intel/fs: use fs implementation of dump_instructions |
| - intel/compiler: don't allocate compaction arrays on the stack |
| - anv: remove shader fp64 inspection after parsing |
| - anv: Reduce RHWO optimization (Wa_1508744258) |
| - anv: fix missing VkPhysicalDeviceExtendedDynamicState3PropertiesEXT handling |
| - anv: fixup invalid enum for nir environment |
| - anv: split internal surface states from descriptors |
| - anv: bump pool bucket max allocation size |
| - anv: get rid of ilog2_round_up |
| - intel/perf: allocate cleared counter infos |
| - intel/perf: fix B/C counters accumulation in non query mode |
| - nir/lower_explicit_io: fix metadata preserve |
| - nir/lower_shader_calls: update metadata before validation |
| - nir/lower_shader_calls: wrap only jumps rather than entire code blocks |
| - Revert "nir/lower_shader_calls: put inserted instructions into a dummy block" |
| - intel/fs: put scratch surface in the surface state heap |
| - anv: fix 3d state initialization |
| - anv: ensure CPS is initialized when KHR_fragment_shading_rate is disabled |
| - anv: fixup context initialization on DG2 |
| - intel/fs: improve Wa_22013689345 workaround |
| - blorp: support negative offsets in addresses |
| - anv: generate correct addresses for state pool offsets |
| - nir/divergence: add missing btd_shader_type_intel |
| |
| Liviu Prodea (5): |
| |
| - d3d12/dzn/spirv2dxil: Require version library |
| - Microsoft clc: strip lib prefix Otherwise OpenCLon12 ICD can't load it |
| - meson: Microsoft / maybe Intel CLC need the all-targets workaround just like clover |
| - meson: Only draw with llvm depends on native directly Tests, softpipe or AMD drivers don't depend on it directly |
| - clc/clover: Link clang statically when shared-llvm is disabled |
| |
| Lucas Stach (38): |
| |
| - etnaviv: properly check number of layers in surface creation |
| - etnaviv: fix TS buffer allocation for 3D textures |
| - etnaviv: allow 3D textures with TS in transfer |
| - etnaviv: don't expose array and 3D texture support on pre-halti GPUs |
| - etnaviv: expose ARB_draw_instanced |
| - etnaviv: mark instanced draw extensions as supported in docs/features.txt |
| - etnaviv: move checking for MC2.0 for TS into screen init |
| - etnaviv: use linear PE rendering only on properly aligned surfaces |
| - etnaviv: add debug option to disable linear PE feature |
| - etnaviv: fill alpha channel for DXT1_RGB textures |
| - etnaviv: don't use TS with multi-layer resources |
| - etnaviv: texture-desc: sample 1D textures using 2D mode |
| - etnaviv: texture-desc: drop superfluous condition |
| - etnaviv: rewrite sampler TS check |
| - etnaviv: slim down etna_shader_key |
| - etnaviv: pass shader key by reference |
| - etnaviv: properly reference flush_resources |
| - etnaviv: move etna_layout_multiple into etnaviv_resource.c |
| - etnaviv: assert valid layout in etna_layout_multiple |
| - etnaviv: compute linear resource Y alignment in etna_layout_multiple |
| - etnaviv: handle compressed texture formats in etna_layout_multiple |
| - etnaviv: increase alignment for MSAA resources |
| - etnaviv: rs: fix MSAA alignment adjustment |
| - etnaviv: rs: try to find exact format match first |
| - etnaviv: properly size TS buffer for MSAA resources |
| - etnaviv: set LOGIC_OP_UNK24 for MSAA rendering on SMALL_MSAA GPUs |
| - etnaviv: disable PE_COLOR_FORMAT_OVERWRITE with MSAA |
| - etnaviv: blt: scale operations by MSAA mode |
| - etnaviv: blt: try to find exact format match first |
| - etnaviv: update headers from rnndb |
| - etnaviv: fix late Z with MSAA active |
| - etnaviv: fix shader register control with MSAA |
| - etnaviv: switch to late Z when linear PE is used |
| - etnaviv: fix tile status interaction with write mappings |
| - etnaviv: always use RS align when GPU has TEXTURE_HALIGN feature |
| - etnaviv: rs: fix blits with insufficient alignment for dual pipe operation |
| - etnaviv: blt: use correct TS offset in clear operations |
| - etnaviv: fix wrong surface TS clear size |
| |
| Luis Felipe Strano Moraes (8): |
| |
| - anv: fix FTBFS on grl due to changes in clang 15 |
| - anv: reword info flag in intel_clc's getopt to avoid clash |
| - anv: add missing separator to help for intel_clc |
| - anv: fixing typo on description of output flag for intel_clc |
| - anv: added proper handling for input argument in intel_clc |
| - anv: adding parsetab.py to the .gitignore for grl |
| - meson: simplified meson for enabling ray-tracing on Intel |
| - meson: only enable intel-clc for x86_64 builds |
| |
| M Henning (1): |
| |
| - nv/nir: Set ssbo CacheMode from intrinsic access |
| |
| Marcin Ślusarz (53): |
| |
| - intel/compiler: fix mesh urb write regression |
| - nir/lower_task_shader: print shader after each step |
| - intel/compiler: add support for non-zero base in [load|store]_shared intrins |
| - Revert "nir/lower_task_shader: don't use base index for shared memory intrinsics" |
| - anv: disable task redistribution |
| - spirv, compiler: add "bool nv" to shader_info.mesh |
| - intel/compiler: implement EXT_mesh_shader |
| - anv: check EXT_mesh_shader whenever NV_mesh_shader is checked |
| - anv: replace VK_SHADER_STAGE_[TASK|MESH]_BIT_NV with VK_SHADER_STAGE_[TASK|MESH]_BIT_EXT |
| - anv: implement EXT_mesh_shader |
| - anv: implement draw calls for EXT_mesh_shader |
| - anv: enable EXT_mesh_shader |
| - anv: fix emission of primitive replication packet for mesh stage |
| - nir: add uses_wide_subgroup_intrinsics to task/mesh shader_info |
| - anv: add support for anv_assume_full_subgroups to task & mesh stages |
| - anv: small cleanup of anv_graphics_pipeline_compile |
| - nir, anv, hasvk, radv: pull uses_wide_subgroup_intrinsics into shader_info |
| - intel/compiler: remove second shading rate lowering for mesh |
| - intel/compiler: print shader after successful brw_nir_lower_shading_rate_output |
| - intel/compiler: use nir_lower_task_shader pass |
| - nir/lower_task_shader: lower small stores & loads to shared when requested |
| - intel/compiler/task: use shared memory for small task payload loads & stores |
| - intel/compiler: refactor brw_nir_lower_mem_access_bit_sizes |
| - intel/compiler: add support for 8/16 bits task payload loads |
| - radv: use nir_shader_instructions_pass in radv_nir_lower_ycbcr_textures |
| - iris: remove invalid nir_metadata_preserve from iris_fix_edge_flags |
| - crocus: remove invalid nir_metadata_preserve from crocus_fix_edge_flags |
| - glsl: use nir_shader_instructions_pass in gl_nir_lower_atomics |
| - glsl: use nir_shader_instructions_pass in gl_nir_lower_images |
| - glsl: use nir_shader_instructions_pass in gl_nir_lower_samplers_as_deref |
| - nir: use nir_shader_instructions_pass in nir_lower_alu |
| - nir: use nir_shader_instructions_pass in nir_lower_bool_to_bitsize |
| - nir: use nir_shader_instructions_pass in nir_lower_bool_to_float |
| - nir: use nir_shader_instructions_pass in nir_lower_bool_to_int32 |
| - nir: use nir_shader_instructions_pass in nir_lower_64bit_phis |
| - nir: use nir_shader_instructions_pass in nir_lower_clamp_color_outputs |
| - nir: use nir_shader_instructions_pass in nir_lower_clip_disable |
| - nir: use nir_shader_instructions_pass in nir_lower_clip_halfz |
| - nir: use nir_shader_instructions_pass in nir_lower_drawpixels |
| - nir: use nir_shader_instructions_pass in nir_lower_fb_read |
| - nir: use nir_shader_instructions_pass in nir_lower_frexp |
| - nir: use nir_metadata_none instead of its value |
| - nir: use nir_shader_instructions_pass in nir_lower_interpolation |
| - nir: use nir_shader_instructions_pass in nir_lower_samplers |
| - nir: use nir_shader_instructions_pass in nir_split_var_copies |
| - nir: use nir_shader_instructions_pass in nir_split_per_member_structs |
| - intel/compiler: fix loading of draw_id from task & mesh payload |
| - anv: add support for mesh shading in INTEL_MEASURE |
| - intel/ds: add new category/stage for draw mesh events |
| - anv: add mesh shading tracepoints |
| - intel/genxml: fix width of 3DSTATE_TASK_CONTROL.MaximumNumberofThreadGroups |
| - anv: set 3DSTATE_[MESH|TASK]_CONTROL.MaximumNumberofThreadGroups |
| - anv: program 3DSTATE_MESH_DISTRIB with the recommended values |
| |
| Marek Olšák (135): |
| |
| - nir: add nir_intrinsic_image_samples_identical |
| - nir: add nir_intrinsic_image_descriptor_amd |
| - nir: add nir_texop_descriptor_amd |
| - nir: add shader_info::uses_resource_info_query for txs, levels, samples, etc. |
| - ac/llvm: implement nir_intrinsic_image_deref_samples_identical |
| - ac/llvm: implement nir_intrinsic_image_deref_descriptor_amd |
| - ac/llvm: implement nir_texop_descriptor_amd |
| - ac/nir: add ac_nir_lower_resinfo |
| - radeonsi,radv: run ac_nir_lower_resinfo |
| - ac/llvm: remove all resinfo code now that it's lowered |
| - radeonsi: implement a non-scaled compute blit+resolve and use it on gfx11 |
| - radeonsi: don't assume that TC_ACTION_ENA invalidates L1 cache on gfx9 |
| - radeonsi/ci: update failing tests on navi21 |
| - radeonsi: fix a regression due to reordering PIPE_SHADER_* |
| - ac/llvm: handle external textures in ac_nir_lower_resinfo |
| - radeonsi: merge both fail paths in si_set_vb_descriptor |
| - radeonsi: add vertex buffers into the BO list in set_vertex_buffers |
| - radeonsi: remove vb_descriptors_gpu_list only used for debugging |
| - radeonsi: remove temporary si_context::vb_descriptor_user_sgprs |
| - radeonsi: use si_cp_dma_prefetch_inline for prefetching VBO descriptors |
| - radeonsi: use do..while loops and other cosmetic changes in display list path |
| - ttn: set the correct sampler declaration type in the presense of txs and lod |
| - gallivm: fix TXQ crash with MSAA samplers |
| - gallivm: initialize texture_unit_offset in emit_size_query |
| - gallium/u_blitter: remove unused code for integer MSAA resolve |
| - gallium/u_blitter: fix the has_txf support condition |
| - gallium/u_blitter: treat Z scaling as scaled blits |
| - gallium/u_blitter: clean up IMMs in util_make_fs_blit_msaa_gen |
| - gallium/u_blitter: make nearest filtering emulation using TXF conformant |
| - gallium/u_blitter: make the bilinear filter for MSAA resolving conformant |
| - glthread: unbind framebuffers in glDeleteFramebuffers |
| - glthread: call _mesa_glthread_DeleteBuffers unconditionally |
| - glthread: track GL_READ_FRAMEBUFFER bindings too |
| - glthread: track glBindFramebufferEXT, not just glBindFramebuffer |
| - glthread: add missing code for GL_ARB_sparse_texture |
| - util/xmlconfig: allow drivers to override option values |
| - radeonsi: rename stop_exec_on_failure -> allow_context_lost |
| - radeonsi: allow lost context with aux_contexts |
| - winsys/amdgpu: terminate process on CS rejection when unrobust context is lost |
| - winsys/amdgpu: flatten huge if and reorder code in amdgpu_cs_submit_ib |
| - winsys/amdgpu: change num_rejected_cs to a bool flag |
| - radeonsi/ci: make the running script easy to use |
| - Revert "mesa: implement a display list / glBitmap texture atlas" |
| - mesa: create glBitmap textures while creating display lists |
| - st/mesa: fix potential use-after-free in draw_bitmap_quad |
| - ci: update pass/fail results for spec@!opengl 1.0@gl-1.0-dlist-bitmap |
| - winsys/amdgpu: use cached GTT for command buffers and don't set the 32BIT flag |
| - ci: disable the freedreno farm. |
| - radeonsi/ci: add skips of tests not built on Linux |
| - radeonsi: don't pass num_patches via derived_tess_state, pass it via si_context |
| - radeonsi: make the primitive type constant with tessellation |
| - radeonsi: move \*rs to its only use in si_draw |
| - radeonsi: remove the prim_restart_tri_strips_only option |
| - radeonsi: move fixing ngg_culling into si_update_shaders |
| - radeonsi: unify the logic that sets rast_prim |
| - radeonsi: precompute GS_OUT_PRIM in advance |
| - radeonsi: set GS_STATE_OUTPRIM and PROVOKING_VTX_INDEX only when they change |
| - radeonsi: cosmetic changes in si_emit_rasterizer_prim_state |
| - radeonsi: remove 1 draw packet order codepath, keep the first one |
| - radeonsi: move patch_vertices-related tessellation updates out of si_draw |
| - radeonsi: move set_patch_vertices into si_state_shaders.cpp |
| - ac/gpu_info: handle LPDDR4 and 5 in ac_memory_ops_per_clock |
| - ac/surface: disallow 256KB swizzle modes on gfx11 APUs |
| - radeonsi: set VS_OUT_MISC_SIDE_BUS_ENA=1 for clip distance exports on gfx10.3 |
| - mesa: fix glDrawRangeElementsBaseVertex should be compiled into display list |
| - glthread: fix draws not compiled into a display list should generate an error |
| - glthread: generate errors for glGet functions between glBegin/End |
| - glthread: always sync for glShaderSource because invalid params can crash |
| - glthread: fix glGetIntegerv(GL_CLIENT_ACTIVE_TEXTURE) |
| - glthread: work around GL_INVALID_OPERATION with OpenGL ES 1.x draws |
| - frontend/dri: make the backgroundCallable extension optional |
| - frontend/dri: sync glthread when calling from the app side |
| - radeonsi: simplify radeonsi_zerovram implementation |
| - driconf: disable glthread for DeusEx:MD and F1 2015 due to a perf drop |
| - radeonsi/ci: add glx@glx-swap-event_async to CI failures for glthread |
| - radeonsi: don't flush asynchronously for fence_server_signal |
| - glthread: execute glSignalSemaphoreEXT synchronously |
| - radeonsi: enable glthread by default |
| - gallium/u_threaded: add an option for unsychronized create_fence_fd |
| - mesa: replace ALLOC_PRIMS with persistently-allocated context arrays |
| - mesa: remove _mesa_draw_gallium_fallback by adapting st_feedback_draw_vbo |
| - mesa: remove unused code using _mesa_prim |
| - mesa: clean up st_indirect_draw_vbo interface and callers |
| - mesa: trivial simplifications in _mesa_reference_buffer_object_ |
| - mesa: inline _mesa_reference_buffer_object |
| - mesa: inline _mesa_handle_bind_buffer_gen and correct things |
| - mesa: restructure bind_buffer_object for better performance |
| - mesa: remove unused gl_buffer_object::Data and Written |
| - mesa: reorder and pack gl_buffer_object |
| - vbo: don't call _glapi_set_dispatch for GL_COMPILE_AND_EXECUTE loopback |
| - mesa: don't restore the dispatch at the end of glCallList |
| - mesa: remove unused function _mesa_get_dispatch |
| - mesa: initialize OutsideBeginEnd directly instead of through Exec |
| - glapi: rename 'vtxfmt' to 'beginend' to make it clear |
| - api_hw_select_init_h.py: simplify the code |
| - mesa: remove deprecated comments |
| - mesa: rename GL dispatch initialization functions to make the intent clear |
| - glthread: use GLenum16 for enums, but clamp it to 0xffff to get correct errors |
| - glthread: use 8-bit GLenum for prim modes where it results in better packing |
| - glthread,hud: draw per-frame values and expose the number of batches per frame |
| - glthread: skip glMultMatrixf if it's identity |
| - mesa: use memcmp instead of floating-point comparisons in glMultMatrixf |
| - mesa: make glPopMatrix a no-op if the matrix hasn't changed |
| - radeonsi: rename si_create_multi_fence -> si_alloc_fence |
| - winsys/amdgpu: fix (enable) preemption for chained IBs |
| - radeonsi: fix max_dw computation for CS preambles |
| - gl_marshal.py: remove/simplify parameters |
| - gl_marshal.py: move the unmarshal table into a separately generated file |
| - gl_marshal.py: rework how the marshal dispatch table is initialized |
| - gl_marshal.py: remove the -O1 hack and manual SET_* inlining |
| - gl_marshal.py: inline functions for readability |
| - gl_marshal.py: simplify print_sync_call and rename to print_call |
| - glthread: use a constant expression instead of cmd_size in custom functions |
| - glthread: add more DrawArrays/Elements variants with fewer fields |
| - glapi: remove EXT and ARB suffixes from Draw functions |
| - glthread: demystify Draw function names |
| - glthread: rewrite CallList merging and do it in the app thread |
| - gallium/u_threaded: don't call simplify_draw_info redundantly |
| - gl_marshal.py: C style fixups |
| - glthread: merge and collapse glBindBuffer calls that unbind and then bind |
| - glthread remove the unused \*last pointer from unmarshal functions |
| - glthread: don't sync for glIsEnabled(GL_BLEND, GL_LIGHTING, GL_POLYGON_STIPPLE) |
| - cso: start without u_vbuf by default if it's not always used |
| - cso: constify some parameters to remove typecasts |
| - cso: fix broken optimization for sampler state lookups |
| - cso: don't destroy CSOs that are saved |
| - cso: inline more functions because some parameters like key_size are literals |
| - cso: make memcmp and hash computation use a literal key size for blend state |
| - driconf: add a workaround for a cursor issue with kwin_wayland |
| - driconf: add a workaround for Cossacks 3 |
| - nir: add nir_intrinsic_optimization_barrier_vgpr_amd for LLVM |
| - radeonsi: force the MSAA resolve shader to use 1 clause for MSAA loads |
| - radeonsi/gfx11: fix compute scratch buffer - WAVES is always per SE |
| - st/mesa: suppress async glthread flushing for GLX_EXT_texture_from_pixmap |
| - st/vdpau: fix interop with GL |
| |
| Marek Vasut (1): |
| |
| - etnaviv: Use old set of state registers for PE configuration on GC880 |
| |
| Mario Kleiner (1): |
| |
| - vulkan/wsi/display: Reset connector state in vkReleaseDisplay(). |
| |
| Mark Collins (8): |
| |
| - tu: Implement VK_EXT_non_seamless_cube_map |
| - tu: Allocate private memory per-device instead of per-pipeline |
| - tu: Clamp priority in DRM submitqueue creation |
| - tu: Expose VK_EXT_tooling_info using common implementation |
| - tu: Retain allocated CSes in tu_autotune_on_submit |
| - tu: Optimize hash_renderpass_instance by removing XXH64_update |
| - tu: Only write \`A6XX_PC_PRIMITIVE_CNTL_0` if changed |
| - tu: Clean up variable usage in \`tu6_draw_common` |
| |
| Mark Janes (6): |
| |
| - Revert "anv: Do not copy garbage to batch_bo" |
| - anv: Track BOs that need a write-combined mapping |
| - anv: Use WC mapped local memory for block pool BO |
| - anv: Allocate buffers with write-combined local memory |
| - anv: compile anv_acceleration_structure.c |
| - anv: work around improper buffer usage in hitman3 |
| |
| Martin Krastev (1): |
| |
| - meson: add svga gallium driver to 'auto' group on arm/aarch64 |
| |
| Martin Roukala (né Peres) (20): |
| |
| - radv/ci: document a recent regression |
| - radv/ci: move some tests from the vega10 fail to its flake list |
| - radv/ci: add more tests to the flake list of vega10 |
| - radv/ci: skip host_write_transfer_src.1048576 on renoir |
| - radv/ci: bump the console activity timeout of VanGogh to 3 minutes |
| - radv/ci: uprev boot2container |
| - radv/ci: disable the POLARIS10 jobs |
| - radv/ci: disable kabini's vkd3d job |
| - radv/ci: run vkcts on the two steam decks in parallel |
| - radv/ci: document an unstable test |
| - radv/ci: move some tests from the renoir fail to its flake list |
| - ci/b2c: add support for the new format of CI_RUNNER_TAGS |
| - radv/ci: bump the job count on navi21 |
| - radv/ci: run two jobs in parallel to reduce the execution time |
| - zink: update the RADV expectations |
| - zink: mark a test as a flake in RADV expectations |
| - zink/ci: mark another test as fixed after the frontend caching series |
| - zink/ci: add another subtest to the list of known failures for radv |
| - Revert "glx: Fix drawable refcounting for naked Windows" |
| - Revert "glx/dri: Fix DRI drawable release at MakeCurrent time" |
| |
| Matt Coster (3): |
| |
| - util: Include stddef.h in util/macros.h |
| - pvr: Add pvr_csb_unpack(). |
| - pvr: Fix assignment to ISP back_a.sref |
| |
| Matt Turner (4): |
| |
| - intel/tools: Also look for 'batch' tag |
| - freedreno/ir3: Don't use the broken idiv lowering |
| - freedreno/ir3: Use nir_opt_idiv_const |
| - intel/dev: Set display_ver = 13 on all ADL/RPL/DG2 |
| |
| Mauro Rossi (5): |
| |
| - util: glsl2spirv.py: ensure '#endif' is printed in new line |
| - hasvk: fix android build and reported API version |
| - Android.mk: Fix gnu++14 related build failures |
| - r600/sfn: allow building with clang 6 (Android 9) |
| - AOSP: Add intel_hasvk vulkan library suffix |
| |
| Max Kellermann (2): |
| |
| - gallium/u_threaded: add missing reference counts for draw_multi slots |
| - gallium/u_threaded: fix offset calculation for draw_multi slots |
| |
| Michael Skorokhodov (1): |
| |
| - egl: Return EGL_BAD_MATCH for invalid share_list |
| |
| Michael Tang (3): |
| |
| - microsoft/compiler: Add dxil_logger type and parameter to nir_to_dxil |
| - nir: add nir_instr_as_str |
| - microsoft/compiler: Change calls to NIR_INSTR_UNSUPPORTED to log to dxil_logger instead |
| |
| Michel Dänzer (6): |
| |
| - gallivm: Fix LLVMAtomicRMWBinOpFMax spelling |
| - lavapipe: Fix float32_atomic_min_max spelling |
| - winsys/amdgpu: Set RADEON_FLAG_32BIT again |
| - radeonsi/ci: Test piglit quick_shader profile as well |
| - Revert "egl/glx: add fallback for zink loading" |
| - vulkan/wsi/wayland: Set num_modifier_lists = 0 if num_drm_modifiers == 0 |
| |
| Mihai Preda (21): |
| |
| - gallivm: LLVM-15 opaque pointers: disable LLVMGetElementType(ptr_type) |
| - gallivm: drop unused "emit_load" in lp_llvm_buffer_member() |
| - gallium: drop unused macros in draw_llvm.h |
| - gallivm: fixes for LLVM-15 opaque pointers in lp_bld_nir_aos.c |
| - gallivm: fixes for LLVM-15 opaque pointers in lp_bld_nir_soa.c |
| - gallivm: fixes for LLVM-15 opaque pointers in lp_bld_jit_types.c |
| - gallivm: fixes for LLVM-15 opaque pointers in lp_bld_format_s3tc.c |
| - gallivm: fixes for LLVM-15 opaque pointers in lp_bld_struct.c |
| - gallivm: fixes for LLVM-15 opaque pointers in lp_bld_format_soa.c |
| - gallium/draw: fix LLVM opaque pointer warnings in store_clip() |
| - gallium/draw: fix LLVM opaque pointer warnings in generate_clipmask() |
| - gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_fetch_input() |
| - gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_end_primitive() |
| - gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_epilogue() |
| - gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_fetch_input() |
| - gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_fetch_output() |
| - gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_store_output() |
| - gallium/draw: fix LLVM opaque pointer warnings in store_aos() |
| - gallivm: LLVM opaque pointer fixes in lp_bld_sample.c |
| - gallivm: LLVM opaque pointers: add lp_build_array_get[_ptr]2 in lp_bld_struct.c |
| - gallivm: LLVM opaque pointers: use lp_build_array_get_ptr2() in lp_bld_nir_soa.c |
| |
| Mike Blumenkrantz (443): |
| |
| - zink: handle mutable swapchain images with dmabuf |
| - zink: store VkFormatFeatureFlags on creation |
| - zink: use stored format features for blit functionality |
| - zink: use modifier feature flags during surface creation when necessary |
| - zink: init cache_put program fence on program creation |
| - zink: always set vertex dynamic states |
| - zink: hook up pipeline_library extensions |
| - zink: change zink_create_gfx_program signature |
| - zink: break out program primtype->idx conversion function |
| - zink: reorganize zink_gfx_pipeline_state a little |
| - zink: set program pipeline array idx earlier in zink_get_gfx_pipeline |
| - zink: add a graphics pipeline library implementation |
| - tgsi_to_nir: fix clipdistance store writemask |
| - tgsi_to_nir: handle compact arrays for clipdistance |
| - zink: force flush clears on fb change if fb geometry changes |
| - zink: force a new framebuffer for clear_depth_stencil if the clear region is big |
| - zink: use maintenance4 |
| - zink: allow sw loading with D3D_ALWAYS_SOFTWARE for nine |
| - zink: drop mode_changed check from linewidth/depthbias draw updates |
| - zink: handle unscaled depth bias from nine |
| - zink: handle !half_pixel_center |
| - zink: fix return for PIPE_CAP_DEPTH_CLIP_DISABLE |
| - zink: explicitly set nir cursor in rewrite_and_discard_read |
| - zink: handle invalid gl_TexCoord reads |
| - zink: rewrite all undefined shader reads as 0001 instead of undef |
| - zink: export PIPE_CAP_MULTISAMPLE_Z_RESOLVE |
| - zink: support PIPE_QUERY_TIMESTAMP_DISJOINT |
| - nine: check return on resource_get_handle |
| - zink: add u_queue.h to kopper header |
| - zink: demote dmabuf tiling to linear if modifiers aren't supported |
| - zink: use modifier_aspect to check for modifier plane in zink_resource_get_param |
| - zink: don't add modifiers if EXT_image_drm_format_modifier isn't present |
| - zink: add all format modifiers when adding for dmabuf export |
| - zink: don't fixup sparse texops |
| - zink: remove an old RGBA4 nvidia workaround |
| - vulkan: Update the XML and headers to 1.3.224 |
| - radv: fix return type for meta resolve shaders |
| - nir/validate: clamp unsized tex dests to 32bit |
| - nir/validate: add some (light) validation for sampler type matching |
| - radv: remove stupid conditional from draw path |
| - radv: force inlining on another draw function |
| - mesa: fix blending when using luminance/intensity emulation |
| - mesa: require render target bind for A/L/I in format selection |
| - mesa/st: add ignore_srgb_decode param to st_convert_sampler |
| - mesa/st: plumb glsl130_or_later through sampler creation |
| - mesa/st: use plumbed value for srgb_skip_decode in sampler conversion |
| - gallium: convert PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE to enum |
| - gallium: add PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_ALPHA_NOT_W |
| - util/format: handle SRGB in util_format_luminance_to_red() |
| - zink: stop modifying samplerview swizzle on base struct |
| - zink: add some format functions for emulating alpha formats |
| - zink: reject native blits for emulated alpha formats |
| - zink: handle emulated alpha format clears |
| - zink: support emulating alpha formats using new border color quirk |
| - zink: add a driver workaround for broken L4A4 on nvidia |
| - mesa: track format swizzles on teximage objects |
| - mesa/st: use pre-calculated format swizzle for samplerviews |
| - mesa: track gl_texture_object swizzle internally |
| - mesa/st: use tracked samplerview swizzle values |
| - mesa/st: simplify sampler border color swizzling |
| - zink: remove cached descriptor mode |
| - zink: remove checks for lazy descriptor mode + templates ext |
| - zink: remove descriptor vtable |
| - zink: delete a lot of descriptor code |
| - zink: move lazy descriptor functions into zink_descriptors.c |
| - zink: delete unused struct types |
| - zink: move all types/enums into zink_types.h |
| - zink: move lazy descriptor data types into zink_types.h |
| - zink: delete some unused descriptor struct members |
| - zink: remove descriptor dummy_set |
| - zink: merge context descriptor data structs |
| - zink: merge batch descriptor data structs |
| - zink: merge context descriptor data onto context struct |
| - zink: merge program descriptor data onto program struct |
| - zink: merge batch descriptor data onto batch state |
| - zink: reorder some descriptor structs |
| - zink: rename _lazy descriptor stuff |
| - zink: use statically sized array for descriptor allocation |
| - zink: ZINK_SHADER_COUNT -> ZINK_GFX_SHADER_COUNT |
| - zink: explicitly define ZINK_GFX_SHADER_COUNT |
| - zink: PIPE_SHADER_TYPES -> MESA_SHADER_STAGES |
| - zink: remove all pipe_shader_type usage |
| - zink: delete zink_shader_stage() |
| - zink: fix program cache comparisons |
| - zink: require EXT_non_seamless_cube_map for GPL support |
| - zink: (correctly) require extendedDynamicState2PatchControlPoints for GPL |
| - zink: use a maybe more accurate wild guess for pcp-less gpl |
| - zink: require VK_KHR_descriptor_update_template |
| - zink: add an id member for zink_descriptor_pool_key |
| - zink: use a dynarray instead of hash table for tracking pools |
| - zink: use a single allocation for zink_descriptor_layout_key |
| - zink: rework descriptor pool overflow |
| - nir: add uses_bindless flag for shader_info |
| - zink: bail out of dmabuf resource creation when srgb explosion is expected |
| - zink: iterate over all modifiers and nuke the ones that aren't supported |
| - zink: try to reuse swapchain modifier for dmabuf export |
| - ci: remove broken device-select-layer from build |
| - vk/render_pass: don't deref null resolve attachments |
| - Revert "zink: try to reuse swapchain modifier for dmabuf export" |
| - Revert "zink: add all format modifiers when adding for dmabuf export" |
| - zink: deduplicate some program creation code |
| - zink: deduplicate some program destroy code |
| - zink: move gfx program update/creation into zink_program.c |
| - zink: remove cpu conditional render hack |
| - zink: add c++ guards to zink_pipeline.h |
| - zink: move pipeline cache structs to zink_program.h |
| - zink: move gfx pipeline recalc to c++ |
| - zink: move pipeline state comparison to c++ template |
| - zink: use template to eliminate more instructions in pipeline state comparison |
| - zink: remove some gfx pipeline state members |
| - zink: fix shader key struct packing |
| - zink: rework bit iteration in update_gfx_shader_modules to use for loop |
| - zink: simplify shader key comparison by splitting out non-generated tcs |
| - zink: plumb some screen info through gfx shader creation |
| - zink: plumb screen info through shader key comparison |
| - zink: split out nonseamless/inline shader module info calculation |
| - zink: split out gfx shader module creation from cache searching |
| - zink: split out first-time shader module creation |
| - zink: use a list for free batch states |
| - zink: add templates for pipeline stage comparison |
| - zink: use stage comparison template to ignore vertices_per_patch |
| - zink: cache dynamic render vs renderpass pipelines separately |
| - zink: make program cache values more intelligible |
| - zink: change u_foreach_bit to regular for loop in zink_gfx_program_update |
| - zink: use dynarray instead of list for internal shader cache |
| - zink: pass shader stage as param for gfx module update |
| - zink: constify shader module variable |
| - zink: make zink_bo.h compile in c++ |
| - zink: add more c++ guards to headers |
| - zink: remove c++ guard from zink_resource.h |
| - zink: move zink_batch_resource_usage_set to be static inline |
| - zink: add is_buffer param to zink_batch_resource_usage_set |
| - zink: use PIPE_CAP_RGB_OVERRIDE_DST_ALPHA_BLEND |
| - zink: merge all the batch state sets onto the struct |
| - zink: make batch internal api static again |
| - zink: split resource tracking into two sets for buffer/texture |
| - zink: track which stages in gfx programs are using inlined uniforms |
| - zink: add gfx pipeline lookup shortcut |
| - st_pbo/compute: handle download failures with fallback |
| - st_pbo/compute: fix 1D_ARRAY offsets |
| - st_pbo/compute: fix 1D coord dimension by pre-trimming vectors |
| - st_pbo/compute: pre-clamp shader geometry to 16bit |
| - st_pbo/compute: pre-clamp loaded geometry based on coord components |
| - zink: handle nir_intrinsic_sparse_residency_code_and mechanics |
| - zink: ignore nir_texop_lod for tex dest matching |
| - vulkan: Update the XML and headers to 1.3.225 |
| - zink: support PIPE_CAP_FBFETCH_COHERENT |
| - tu: fix invalid free on alloc failure |
| - lavapipe: stop tracking descriptor image layout |
| - lavapipe: use cso for compute samplers too |
| - lavapipe: remove lvp_sampler::state |
| - lavapipe: replace lvp_sampler internals with pipe_sampler_state |
| - lavapipe: use pipe_sampler_state directly in descriptor info |
| - lavapipe: delete lvp_buffer::offset |
| - lavapipe: use gallium buffer descriptor types directly |
| - lavapipe: apply VK_WHOLE_SIZE to bufferviews at creation |
| - lavapipe: create gallium descriptor image/sampler view types for views |
| - lavapipe: delete some code |
| - zink: use screen param for zink_descriptor_program_deinit |
| - zink: allow programs to solely manage descriptor deinit |
| - zink: create compute programs from compute shaders directly |
| - zink: initialize cache_get_thread unconditionally |
| - zink: add use_local_size flag to compute programs |
| - zink: don't call util_queue_fence_init in zink_screen_get_pipeline_cache() |
| - zink: add param to allow cache_get jobs to run directly |
| - zink: add a pipeline shortcut for basic compute programs |
| - zink: merge compute program hash table onto program struct |
| - zink: don't add compute base pipeline to hash table |
| - zink: remove use_local_size from compute pipeline state |
| - zink: stop zeroing local size if current compute doesn't use it |
| - zink: use different key comparison for local_size-using compute pipelines |
| - zink: track whether shader has cube samplers |
| - zink: move compute pipeline cache update to caller |
| - zink: add a param to allow zink_screen_update_pipeline_cache to run directly |
| - zink: move compute descriptor binds down in execution |
| - zink: add precompilation for compute shaders |
| - zink: add partial async shader compile support |
| - zink: add locking for descriptor layout caches |
| - zink: add a context pointer to zink_program struct |
| - zink: init fbfetch/bindless ctx stuff on shader create |
| - zink: generate sha1 for compute programs |
| - zink: add function to check whether a shader has cube samplers |
| - zink: defer all compute shader creation |
| - zink: don't emit entrypoints for function temp variables |
| - zink: check the variable mode before taking samplemask path in ntv |
| - zink: explicitly use unsigned types for bit shifts |
| - zink: store VkImageViewUsageCreateInfo for surface creation |
| - zink: only add srgb mutable for images with modifiers |
| - zink: avoid needing mutable for z/s sampling |
| - zink: don't set MUTABLE by default |
| - zink: don't emit illegal interpolation |
| - vulkan: initialize a variable |
| - llvmpipe: don't assume pipe_context is always available in flush_frontbuffer |
| - mesa: add GLSL_SOURCE |
| - zink: add spirv builder function for demote |
| - zink: hook up demote extension |
| - zink: fix sharedmem ops with bit_size!=32 |
| - zink: flag all assigned output slots as mapped |
| - zink: add a ci skip for anv |
| - mesa/st: split out builtin shader finish |
| - st_pbo/compute: use split shader finish funcs |
| - mesa/st: move compute pbo shutdown to compute pbo file |
| - gallium: add pipe_screen::driver_thread_add_job |
| - st_pbo/compute: use new shader interface to perform async shader creation |
| - mesa/st: add specialized pbo download shaders |
| - mesa/st: add MESA_COMPUTE_PBO env var |
| - zink: implement async nir creation |
| - lavapipe: VK_EXT_attachment_feedback_loop_layout |
| - lavapipe: ARM/EXT_rasterization_order_attachment_access |
| - docs: add more features |
| - radv: avoid bottlenecking on sequential sparse buffer binds |
| - zink: handle split acquire/present |
| - zink: uncap ssbo size |
| - zink: remove special-casing for 64bit runtime array emission |
| - zink: always set var used by get_ssbo_size to the 32bit var |
| - zink: simplify ntv shader descriptor emission |
| - zink: emit Aliased decoration for aliased bo descriptors |
| - zink: fix atomic ssbo indexing with non-32bit values |
| - zink: handle 64bit float atomics |
| - zink: export PIPE_CAP_IMAGE_ATOMIC_FLOAT_ADD |
| - zink: export PIPE_CAP_SHADER_ATOMIC_INT64 |
| - zink: don't add void clears if a full clear already exists |
| - zink: split up get_clear_data() |
| - zink: make void clears more robust |
| - zink: rewrite clears on fb bind if only the format has changed |
| - zink: add have_vulkan13 to device info |
| - zink: add functions for using '2' variants of pipeline barriers |
| - zink: add screen interfaces for pipeline barriers |
| - zink: use screen interfaces for pipeline barriers |
| - zink: fix/relax resolve geometry check |
| - zink: defer acquire semaphore destruction |
| - zink: fix atomic ssbo fadd offsets |
| - st/bitmap: use normalized coords in samplers with lowered rects |
| - st/drawpixels: use normalized coords in samplers with lowered rects |
| - zink: disable GPL when sample shading is in use |
| - anv: force inline more pipe flush functions |
| - ci: add a panfrost flake |
| - ci: add nouveau flakes |
| - egl/glx: add fallback for zink loading |
| - zink: don't call CmdBindVertexBuffers2EXT with no attributes |
| - zink: don't always set VK_DYNAMIC_STATE_VERTEX_INPUT_BINDING_STRIDE_EXT with gpl |
| - zink: clamp z/s formats to aspected format for border colors |
| - zink: move color channel clamping to zink_format.c |
| - zink: add border color clamping when customBorderColorWithoutFormat isn't supported |
| - zink: add srgb border color clamping |
| - zink: add stencil border color clamping |
| - zink: check format compatibility up front when creating framebuffer surface |
| - zink: rework flush_frontbuffer to always flush |
| - zink: defer more semaphore destruction |
| - zink: rework batch tracking for resources |
| - zink bump batch state limit a bit |
| - zink: split out bind_sampler_states hook for drivers without EXT_non_seamless_cube_map |
| - zink: remove batch tracking/usage from view types |
| - zink: remove batch usage from sampler states |
| - zink: delete more sw conditional render handling |
| - zink: delete some descriptor caching cruft |
| - glsl: fix gl_CullDistance xfb linking |
| - zink: handle culldistance xfb outputs like clipdistance |
| - zink: track a mask of gfx stages remaining in gfx programs |
| - zink: track the owner context for gfx programs |
| - zink: use screen params for program reference functions |
| - zink: don't try to delete programs from the cache if stages have been modified |
| - zink: add locking for program caches |
| - zink: use program's context for cache deletion |
| - zink: use screen param for zink_shader_free() |
| - zink: PIPE_CAP_SHAREABLE_SHADERS |
| - radv: ALWAYS_INLINE radv_after_draw |
| - radv: ALWAYS_INLINE radv_flush_descriptors |
| - radv: ALWAYS_INLINE radv_is_streamout_enabled() |
| - radv: store pointer to sgprs for last vertex stage |
| - radv: use direct access to last_vgt_api_stage_locs for sgpr emission |
| - zink: add a bool flag for decomposed vertex elements |
| - zink: check on init whether decomposed attrs are needed |
| - zink: wrap zink_shader_key_base access for nonseamless cubemap use |
| - zink: remove a bunch of casts from shader key inlines |
| - zink: handle inline uniforms during first-time shader generation |
| - zink: copy the non-dirty shader stages when updating gfx program |
| - zink: move to AoS for gfx program shader modules |
| - zink: ALWAYS_INLINE bind_gfx_stage |
| - zink: add some padding to small shader keys for byte-alignment |
| - zink: add 'optimal_keys' handling for shader keys |
| - zink: use optimal_keys to massively reduce size of pipeline cache comparisons |
| - zink: split dirty_shader_stages for gfx and compute |
| - zink: simplify bit tracking in zink_gfx_program_update() |
| - zink: be even more granular with optimal_key program updates |
| - zink: unspaghettify some program update code |
| - lavapipe: set writemask for compute shader buffers |
| - lavapipe: propagate shader access info across libraries |
| - zink: copy optimal key always |
| - zink: add asserts to verify optimal key state |
| - zink: fix some batch tracking leaks |
| - zink: free vertex_state_cache on shutdown |
| - zink: always unset var->data.explicit_xfb_buffer |
| - zink: fix sparse queue creation |
| - zink: fix handling for ssbos that are just runtime arrays |
| - zink: name bo variables using bitsize |
| - zink: avoid overflow when clamping bufferviews |
| - zink: allow reordered clear_buffer calls |
| - zink: remove color_write_missing driver workaround |
| - zink: only emit runtime array for ssbos if present |
| - mesa: set normalized_coords for bindless texture buffer samplers |
| - zink: enable VK_ATTACHMENT_LOAD_OP_DONT_CARE for zs in renderpasses |
| - zink: remove zink_context::new_swapchain |
| - zink: rename fb_binds -> fb_bind_count |
| - zink: reorder zink_resource a little |
| - zink: add a mask of fb attachment idx for resources |
| - zink: use feedback loop layout to correctly handle implicit feedback loops |
| - zink: always set VK_PIPELINE_CREATE_COLOR_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT sometimes |
| - nir/opt_undef: add a pass to clean up 64bit undefs |
| - zink: don't flatten 64bit arrays during rewrite |
| - zink: always run optimize_nir after lower_64bit_vars |
| - zink: split up lower_64bit_vars pass |
| - zink: also lower 64bit function temps |
| - aux/trace: dump line_rectangular member of rasterizer state |
| - aux/trace: dump depth_clamp for rasterizer state |
| - zink: always unflag unordered_write when binding image descriptors |
| - zink: set layouts before possibly reordering image copies |
| - zink: move zink_screen_get_pipeline_cache() calls out to callers |
| - zink: rework ZINK_PIPELINE_LIBRARY_FORCE into ZINK_DEBUG flag |
| - zink: move create_pipeline_lib to zink_program.c |
| - zink: hook up VK_EXT_depth_clamp_zero_one |
| - zink: pass actual screen->threaded to zink_tc_context_unwrap() |
| - vulkan: Update the XML and headers to 1.3.230 |
| - lavapipe: dynamic state3 |
| - zink: add a draw barrier when unbinding a sampler-bound fb surface |
| - zink: replace mixed_zs with zs feedback loops |
| - glthread: use signed vertex buffer offsets when available, don't require them |
| - glthread: handle DeleteBuffers(n=-1) gracefully |
| - zink: fix/improve handling for multi-component bitfield ops |
| - zink: export PIPE_CAP_ALLOW_MAPPED_BUFFERS_DURING_EXECUTION |
| - zink: export PIPE_CAP_MAP_UNSYNCHRONIZED_THREAD_SAFE |
| - pipe-loader: fix zink driinfo header path |
| - zink: enable glthread by default |
| - zink: call flush_resource when presenting garbage |
| - lavapipe: improve blend handling on pipeline binds |
| - zink: use dynamic state3 rasterization states when available |
| - zink: use more dynamic state3 when available |
| - zink: add tgl ci skip |
| - zink: don't access prog->shaders array during pipeline compile |
| - zink: compress zink_gfx_output_key members |
| - zink: further compress zink_gfx_output_key lookups |
| - zink: set VK_PIPELINE_CREATE_RETAIN_LINK_TIME_OPTIMIZATION_INFO_BIT_EXT |
| - zink: require optimal_keys for GPL |
| - zink: add param to disable optimization when combining pipeline libraries |
| - zink: move binding_map to hw_state part of vertex state |
| - zink: reorder zink_destroy_gfx_program() |
| - zink: don't deref prog->shaders during pipeline construction |
| - zink: use an extra pipeline state bit to track coherent fbfetch usage for gpl outputs |
| - zink: reenable GPL |
| - zink: define zink_pipeline_dynamic_state3 as zink_rasterizer_hw_state |
| - zink: set depth clamp |
| - remove xvmc |
| - delete rbug |
| - delete graw tests |
| - vk/graphics_state: don't set default sample locations if rast samples is dynamic |
| - zink: allow creating full pipelines unoptimized |
| - zink: gfx_pipeline_cache_entry -> zink_gfx_pipeline_cache_entry |
| - zink: change zink_gfx_program::last_pipeline -> struct zink_gfx_pipeline_cache_entry |
| - zink: implement async gfx precompile |
| - zink: add ZINK_DEBUG=shaderdb |
| - zink: move descriptor layout/pool stuff to screen object |
| - zink: add a safety assert in descriptor updates |
| - zink: add some bda handling |
| - zink: handle global and scratch vars |
| - zink: move descriptor function up in file |
| - zink: handle oom better during descriptor updating |
| - zink: fix pool overflow handling on batch reset |
| - zink: rework/improve descriptor pool overflow handling on batch reset |
| - zink: delete view objects when unsetting resource usage in batch reset |
| - zink: prevent ballooning of view object memory |
| - lavapipe: store compiler options to physical device |
| - zink: check core feature for pipeline cache control |
| - zink: don't add other usage bits for transient images |
| - zink: clamp line_stipple_factor to 1 if stipple is disabled |
| - zink: unset rp_changed after initializing renderpass attachments |
| - zink: disable fbfetch when flushing clears |
| - glthread: fix buffer allocation size with non-signed buffer offset path |
| - zink: enable tc cpu storage on some buffers |
| - zink: clamp feedback loop layout to GENERAL if the ext isn't supported |
| - zink: delete some big struct members from zink_context |
| - zink: reorder context init to handle COMPUTE_ONLY |
| - zink: clamp shader bindings for compute descriptors |
| - zink: add ci flake |
| - zink: move spec constant emission to the types/consts block |
| - zink: set default spec constant value to 1 |
| - zink: add docs for zink_batch.c |
| - zink: delete/cleanup some descriptor #defines |
| - zink: delete unused descriptor struct field |
| - zink: start on some type docs |
| - zink: fix duplicate VK_DYNAMIC_STATE_LINE_STIPPLE_EXT setting |
| - zink: fix ds3 line stipple enable handling |
| - driconf: add mesa_glthread=false for gfxbench |
| - lavapipe: store binding offset for buffers |
| - lavapipe: fix BufferDeviceAddress usage |
| - zink: fix gpl compile optimization |
| - zink: decrement ZINK_FBFETCH_BINDING |
| - zink: don't flag ubo0 as used in shaders with indirect ubo access |
| - zink: allow direct memory mapping for any COHERENT+CACHED buffer |
| - zink: allocate/place memory using memoryTypeIndex directly |
| - zink: support multiple heaps per memory type |
| - zink: delete unused zink descriptor mode |
| - zink: rename some variables in descriptor code |
| - zink: ZINK_DESCRIPTOR_TYPES -> ZINK_DESCRIPTOR_BASE_TYPES |
| - zink: add more zink_descriptor_type enum values to improve readability |
| - zink: delete unused descriptor function |
| - zink: stop passing pointer to changed_sets to populate_sets() |
| - zink: add some descriptor docs |
| - zink: add some docs for c++ pipeline state functions |
| - zink: simplify some program descriptor helpers |
| - zink: split out uniform decriptor bindings in shader data |
| - zink: flag push state changed in batch changed conditional |
| - zink: rescope/simplify some push descriptor code |
| - zink: elide a buffer samplerview update conditional |
| - zink: handle broken resource mapping deadlocks |
| - zink: simplify swapchain imageview handling |
| - zink: delete stale comment for zink_surface |
| - zink: simplify conditional for surface rebind no-ops |
| - zink: use zink_resource_object::views to defer deferred storage view deletion |
| - zink: delete unused zink_surface member |
| - zink: add some breadcrumbs for VK_EXT_multisampled_render_to_single_sampled |
| - zink: add docs for zink_surface |
| - zink: match bitsizes in bo rewriting |
| - zink: pass KERNEL shaders through successfully |
| - zink: rework sampler emission |
| - zink: pass image type to image emission |
| - zink: add a nir pass for CL image typing and sampler tracking |
| - zink: simplify image deref handling |
| - zink: add some spirv builder handling for sampled image ops |
| - zink: emit explicit samplers in ntv based on sampler_mask |
| - zink: always take the compact descriptor size path with kernel shaders |
| - zink: add handling on the vk side for discrete sampler descriptors |
| - zink: add handling for CL-style discrete shader samplers |
| - zink: rip out unused kernel push constant |
| - zink: fix weird formatting |
| - zink: don't force attachment usage for images when unsupported |
| - zink: always add MUTABLE with EXTENDED_USAGE for image creation |
| - zink: don't double-deref bindless texture arrays in shaders |
| - util/tc: split out flush and deferred flush calls |
| - util/tc: split out dsa and fs state cso handling |
| - util/tc: implement renderpass tracking |
| - zink: add a context flag to indicate when blitter is running |
| - zink: use tc renderpass tracking to optimize renderpasses |
| - zink: enable renderpass optimizing for turnip jobs |
| |
| Ming Li (1): |
| |
| - docs: Add d3d10umd and lavapipe to the doc for the gallium frontends. |
| |
| Mykhailo Skorokhodov (8): |
| |
| - util: Add glsl2spirv.py script |
| - nir: Make lower_double_ops recognize SPIR-V mangling |
| - nir: Add assert in nir_lower_doubles |
| - glsl/meson: Add variable to export float64.glsl |
| - anv/meson: Add float64_spv_h custom target |
| - anv: Add softtp64 workaround |
| - drirc: Add fp64_workaround_enabled option |
| - drirc: Apply fp64_workaround_enabled to DOOM Eternal |
| |
| Nanley Chery (8): |
| |
| - mesa/st: Set PIPE_MAP_DISCARD_RANGE in fallback_copy_image |
| - mesa/st: Add and use st_texture_image_insert_transfer |
| - mesa/st: Add and use st_texture_image_transfer::box |
| - mesa/st: Don't map compressed texture fallbacks for reads |
| - mesa/st: Fix indentation in st_MapTextureImage |
| - mesa/st: Drop st_texture_image_transfer::map |
| - iris: Enable INTEL_MEASURE for compute dispatches on XeHP |
| - iris: Reduce use of RHWO optimization (Wa_1508744258) |
| |
| Nicholas Bishop (2): |
| |
| - crocus: Advertise PIPE_CAP_NATIVE_FENCE_FD |
| - crocus: fix off-by-one error when clearing stale syncobjs |
| |
| Oleksii Bozhenko (2): |
| |
| - vulkan/wsi: Pass wsi_image_create_info into anv_GetPhysicalDeviceImageFormatProperties2 |
| - anv: Allow aliasing with modifiers for WSI images |
| |
| Omar Akkila (5): |
| |
| - ci: Merge common builds in debian test jobs |
| - ci: Move crosvm to the base test image |
| - ci: Build crosvm with opaque fd mapping support |
| - ci: Enable virtio-experimental in testing |
| - ci: Add vulkan dEQP tests for venus |
| |
| Paul Gofman (1): |
| |
| - vbo/dlist: keep buffers used in loopback_vertex_list() mapped. |
| |
| Pavel Ondračka (22): |
| |
| - r300: fix variables detection for paired ALU and TEX instructions in different branches |
| - r300: allow constant swizzles with inline constant |
| - r300: fix negate mask computation when merging movs |
| - r300: check for identical saturate mode when merging MOVs |
| - r300: run dataflow optimizations in separate loops |
| - r300: generalize the merge_movs pass |
| - r300: don't merge w channel in fragment shaders |
| - r300: merge MOVs into ADD using the 0 swizzle |
| - r300: merge MOVs with MULs or ADDs in merge channels |
| - r300: Set more shadow sampler lowering in precompiles. |
| - r300: allow presubtract when both ADD sources are negative |
| - r300: run copy propagate once more after merging channels |
| - r300: merge together MOV and MAD instructions |
| - r300: add special path for merging movs with the same source |
| - r300: add some synchronization for KIL |
| - r300: fix register rewrite when converting rbg instructions to alpha |
| - r300: reduce CPU overhead in IF transformation pass |
| - Revert "r300: be less agresive with copy propagate in loops" |
| - r300: fix reader detection with breaks |
| - r300: be more careful when pair merging with presubtract |
| - r300: fix generation of invalid swizzles in regalloc |
| - r300: respect buffer offset in r300_set_constant_buffer |
| |
| Pedro J. Estébanez (5): |
| |
| - nir2dxil: Lower texture projections |
| - nir2dxil: Use native helper lane intrinsic on SM >= 6.6 |
| - microsoft/spirv2dxil: Refactor to make SM and validator versions public |
| - microsoft/spirv2dxil: Change \`spirv_to_dxil()` to receive SM and validator versions |
| - microsoft/spirv2dxil: Support pack_32_2x16 and pack_32_4x8 |
| |
| Pierre Moreau (11): |
| |
| - nv50,nvc0: Do not resize global residents if unnecessary |
| - nv50: Rename fixups to relocs |
| - nv50: Rename interps to fixups |
| - nv50: Report actual VRAM size |
| - nv50: Disallow allocating more than VRAM size |
| - nv50: Mark RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY as unsupported |
| - nv50/ir: Handle non-32-bit values when cst folding SPLIT |
| - nv50/peephole: Disallow combining sub 4-byte ld/st for now |
| - nv50/ra: Fix the offset computation for compounds |
| - nv50/ir: Avoid generating splits of splits |
| - nv50/nir: A group barrier is CTA-level not global-level |
| |
| Pierre-Eric Pelloux-Prayer (72): |
| |
| - loader: don't return empty string in loader_get_dri_config_device_id |
| - loader: allow DRI_PRIME=vendor_id:device_id syntax |
| - vulkan/device_select: print the dri_prime warning only if needed |
| - vulkan/device_select: allow DRI_PRIME=vendor_id:device_id |
| - docs: document DRI_PRIME |
| - gallium/dri2: make protected buffer checks opt-in |
| - amdgpu/bo: update uses_secure_bos when importing buffers |
| - radeonsi: allocate BIND_RENDER_TARGET as tmz instead of SCANOUT |
| - radeonsi: print radeon_bo_flag when AMD_DEBUG=vm is used |
| - nir: add a nir_opt_if_options enum |
| - radeonsi/tests: remove left-over debug print |
| - radeonsi/tests: add testing of the khr-single tests |
| - radeonsi/tests: update raven expected results |
| - radeonsi/tests: update navi21 expected results |
| - radeonsi/tests: cosmetic changes |
| - radeonsi: prevent u_blitter recursion in si_update_ps_colorbuf0_slot |
| - radeonsi: use LOAD_CONTEXT_REG_INDEX for VGT_STRMOUT_DRAW_OPAQUE |
| - ac/llvm: add a return value to ac_nir_translate |
| - radeonsi: deal with ac_nir_translate failures |
| - radv: deal with ac_nir_translate failures |
| - radeonsi: use nir_opt_large_constants earlier |
| - radeonsi/sqtt: set stable pstate if possible |
| - mesa: avoid reading back textures from VRAM |
| - mesa: simplify _mesa_texstore_z24_s8 and s8_z24 |
| - mesa: remove fallback for GL_DEPTH_STENCIL |
| - radeonsi: invalidate L2 when using dcc stores |
| - egl/wayland: fix glthread crashes |
| - winsys/amdgpu: init 'r' before using it |
| - radeonsi/gfx11: don't set VERTS_PER_SUBGRP to 0 |
| - tc: do a GPU->CPU copy to initialize cpu_storage |
| - tc: don't use CPU storage for glBufferData |
| - ac/llvm: remove unused vars hash table |
| - radeonsi: remove USE_LDS_SYMBOLS |
| - radeonsi: use nir_lower_explicit_io to get rid of mem_shared deref |
| - ac/llvm: drop visit_deref |
| - ac/llvm: simplify get_memory_ptr |
| - ac/llvm: add helpers to get pointer types of ac_arg |
| - ac/llvm: remove LLVMBuildGEP usages |
| - ac/llvm: add a ac_llvm_pointer type |
| - ac/llvm: port functions to use ac_llvm_pointer |
| - ac/llvm: switch ac_build_gep_ptr to LLVMBuildGEP2 |
| - ac/llvm: switch ac_build_pointer_add to LLVMBuildGEP2 |
| - ac/llvm: add type param to ac_build_load_invariant |
| - ac/llvm: add xxxx2 functions to iteratively port to opaque pointers |
| - ac/llvm: store lds as ac_llvm_pointer |
| - ac/llvm: store constant_data and scratch as ac_llvm_pointer |
| - ac/llvm: remove unneeded casts |
| - radv/llvm: use ac_build_gep0_type to get args types |
| - radeonsi: use ac_get_arg_pointee_type to get the right type |
| - radeonsi: use LLVMBuildGEP2 in si_build_gep_i8_var |
| - radeonsi: make ngg_gs_get_vertex_storage return a ac_llvm_pointer |
| - radeonsi: port the remaining code to opaque pointers |
| - ac/llvm: remove gep_2 and others temporary functions |
| - radv/llvm: replace LLVMBuildLoad by LLVMBuildLoad2 |
| - ac/llvm: add ac_get_ptr_arg |
| - ac/llvm: pass ac_llvm_ptr instead of separate type/ptr |
| - Revert "ac: use LLVMContextSetOpaquePointers if available" |
| - radeonsi: zero init using the correct components count |
| - radeonsi/sqtt: notify sqtt when we skip a flush |
| - radeonsi/sqtt: re-export shaders in a single bo |
| - radeonsi/sqtt: allow to disable spm counters |
| - radeonsi/tests: handle flakes file for all test suite |
| - radeonsi/tests: simplify the script |
| - radeonsi/tests: update expected results for navi21 |
| - radeonsi/tests: update expected results for navi10 |
| - radeonsi/tests: update expected results for polaris11 |
| - radeonsi: simplify si_prefetch_shaders |
| - radeonsi/sqtt: simplify condition to determine if sqtt is on |
| - radeonsi: store the shader gpu adress in si_shader |
| - radeonsi: handle sqtt pipeline in shader prefetch |
| - radeonsi/gfx11: enable sdma copy DRI_PRIME |
| - radeonsi: update db_eqaa even if msaa is disabled |
| |
| PixelyIon (1): |
| |
| - tu: Make KGSL wait IOCTLs interrupt-safe |
| |
| Qiang Yu (62): |
| |
| - nir/lower_gs_intrinsics: fix primitive count for points |
| - radeonsi: fix tcs_out_lds_offsets arg alignment |
| - winsys/amdgpu: fix non-page-aligned sparse buffer creation |
| - nir,ac/llvm: add nir_intrinsic_load_half_line_width_amd |
| - nir,ac/llvm: add nir_intrinsic_load_viewport_xy_scale_and_offset |
| - ac/nir/ngg,radv: use nir_load_viewport_xy_scale_and_offset |
| - ac/nir/cull: support caller react when primitive is rejected |
| - ac/nir/ngg: support line culling |
| - ac/nir/ngg: fix and simplify gs store output lower |
| - ac/nir/ngg: use same driver location for gs output |
| - ac/nir/ngg: save and restore output bit size for gs |
| - ac/nir/ngg: save and restore position output base for nogs |
| - ac/nir/ngg: add gs culling |
| - ac/nir/ngg: support component position store |
| - ac/nir/ngg: support clipdist culling |
| - ac/llvm: cast tes_u/v_replaced to float |
| - radeonsi: implement nir_intrinsic_load_half_line_width_amd |
| - radeonsi: implement nir_intrinsic_load_viewport_xy_scale_and_offset |
| - radeonsi: implement cull nir intrinsics |
| - ac/llvm: move culling replaced registers to abi |
| - radeonsi: use replaced values when present |
| - radeonsi: vs load input re-calculate vertex index after culling |
| - radeonsi: add and implement load_user_clip_plane abi |
| - nir: fix nir_xfb_info buffer_to_stream length |
| - nir: add nir_intrinsic_load_num_vertices_per_primitive_amd |
| - nir: add nir_intrinsic_load_streamout_buffer_amd |
| - nir,ac/llvm: add nir_intrinsic_load_ordered_id_amd |
| - nir: add nir_intrinsic_ordered_xfb_counter_add_amd |
| - ac/llvm: implement nir_intrinsic_ordered_xfb_counter_add_amd |
| - ac/nir/ngg: cleanup prim id to prepare for streamout |
| - ac/nir/ngg: nogs support streamout |
| - ac/nir/ngg: ngg_gs_load_out_vtx_primflag support stream |
| - ac/nir/ngg: support multi stream per output slot for gs |
| - ac/nir/ngg: support gs streamout |
| - radeonsi: implement nir_intrinsic_load_num_vertices_per_primitive_amd |
| - radeonsi: implement nir_intrinsic_load_streamout_buffer_amd |
| - radeonsi: print out remove_streamout shader key |
| - nir,ac/llvm,radv: add nir_intrinsic_load_provoking_vtx_in_prim_amd |
| - ac/nir/ngg: use nir_load_provoking_vtx_in_prim_amd in ngg lower |
| - radv: count gen_prims_queries_enabled |
| - radv: split active_pipeline_gds_queries |
| - nir,ac/nir/ngg,radv: split shader_query_enabled_amd |
| - ac/nir/ngg,radv: use different counters for shader queries |
| - nir,ac,radv: add primitive count add intrinsics |
| - nir: add nir_load_prim_xfb_query_enabled_amd |
| - ac/nir/ngg: add query param to ac_nir_lower_ngg_gs |
| - ac/nir/ngg: add streamout emitted primitive query |
| - radeonsi: implement llvm abi atomic_add_prim_count |
| - radeonsi: implement nir shader query enabled intrinsics |
| - ac/llvm: get back intrinsics used by NGG |
| - ac/nir/ngg: support user edge flags for ngg lower |
| - ac/nir/ngg: pass primitive_id_location as param for nogs lower |
| - nir: add two amd ngg lds base load intrinsics |
| - ac/nir/ngg,radv: move LDS layout calculation out of nir ngg lowering |
| - ac/nir/ngg: allow passthrough with vs primitive id output |
| - ac/nir/ngg: save and restore no_varying/no_sysval_output |
| - ac/nir/ngg,ac/llvm,aco: save nogs ngg culling one lds dword |
| - ac/nir/ngg: add one odd dword to nogs culling pervertex lds |
| - ac/nir/ngg: fix nogs culling with nuw add |
| - ac/nir/ngg: remove nuw for negative value add |
| - ac/llvm: fix gfx11 fs input load for 16bit varying |
| - ac/llvm: fix 16bit varying llvm compile error |
| |
| Rajnesh Kanwal (48): |
| |
| - pvr: Implement vkResetDescriptorPool API. |
| - pvr: Fix clang-format issue. |
| - pvr: Implement vkGetPhysicalDeviceImageFormatProperties2 API. |
| - pvr: Implement vkCmdDrawIndirect API. |
| - pvr: Implement vkCmdDrawIndexedIndirect API. |
| - pvr: Fix multiple file descriptor leaks. |
| - pvr: Implement vkCreateEvent and vkDestroyEvent APIs. |
| - pvr: Implement vkCmdSetEvent2 API. |
| - pvr: Implement vkCmdResetEvent2 API. |
| - pvr: Implement vkCmdWaitEvents2 API. |
| - pvr: Implement vkSetEvent API. |
| - pvr: Implement vkResetEvent API. |
| - pvr: General reformatting and renaming some fields. |
| - pvr: Replace pvr_renderpass_surface_initop with VkAttachmentLoadOp. |
| - pvr: Add support for generating render pass hw setup data. |
| - pvr: Use ~0 for masks instead of ~0U. |
| - pvr: Add missing output register write in pvr_CreateRenderPass2. |
| - pvr: Rename userpass_spawn to isp_userpass. |
| - pvr: Add const to structs that are not supposed to be modified. |
| - pvr: Add attachment resolve support. |
| - pvr: Add support to create subpass load ops. |
| - pvr: Pass load-op struct instead of idx to support subpass load-ops. |
| - pvr: Implement vkCmdNextSubpass API. |
| - pvr: implement vkcmddispatchindirect api. |
| - pvr: Handle indirect buffer address in pvr_setup_descriptor_mappings. |
| - pvr: Convert attachment indexes to unsigned. |
| - pvr: Remove redundant instruction buffer check. |
| - pvr: Remove redundant assignment error.instruction. |
| - pvr: Remove framebuffer NULL check from pvr_CmdBeginRenderPass2. |
| - pvr: Remove dead code from pvr_perform_start_of_render_clears. |
| - pvr: Zero init pbe_cs_words to avoid passing on garbage. |
| - pvr: Fix Unintentional integer overflow (OVERFLOW_BEFORE_WIDEN) errors. |
| - pvr: Call VALGRIND_FREELIKE_BLOCK before unmapping to avoid use after free. |
| - pvr: Add assert to check for non-NULL binding_desc. |
| - pvr: Implement vkGetEventStatus API. |
| - pvr: Add pvr_csb_emit_link function to support secondary buffer links. |
| - pvr: Fix check in pvr_cmd_uses_deferred_cs_cmds. |
| - pvr: Add support for PVR_CMD_STREAM_TYPE_GRAPHICS_DEFERRED stream. |
| - pvr: Create deferred control stream for secondary command buffers. |
| - pvr: Reserve space for vdm state for deferred secondary buffer. |
| - pvr: Add support to copy a control stream to another control stream. |
| - pvr: Implement vkCmdExecuteCommands API. |
| - pvr: Update buffer type macro names for consistency. |
| - pvr: Fix heap type of availability_buffer allocation. |
| - pvr: Fix allocation size passed in pvr_cmd_buffer_alloc_mem. |
| - pvr: Remove double error reporting. |
| - pvr: Split pds compute shader create and upload code for reuse. |
| - pvr: Add vulkan shader factory headers for Query and clear APIs. |
| |
| Renato Pereyra (1): |
| |
| - venus: Increase vn_relax sleep time |
| |
| Rhys Perry (117): |
| |
| - aco: fix LdsBranchVmemWARHazard with 2+ branch chains |
| - aco: set has_VMEM,has_DS=false after a branch |
| - aco: only add vscnt wait when visiting VMEM/DS |
| - aco: improve VcmpxPermlaneHazard workaround |
| - aco: fix hash statistic |
| - docs/radv: advertise VK_KHR_global_priority |
| - aco: fix consecutive exec writes when finding exec_copy instruction |
| - aco: rename is_cmp to is_fp_cmp |
| - aco: fix assembly of vopc_sdwa writing exec |
| - aco: fix re-write of uses of exec_val's lo/hi half |
| - aco: test branch opcode if removing it in try_optimize_branching_sequence |
| - aco: remove val_and_copy_adjacent |
| - aco: improve vcc check for instructions between exec_val and exec_copy |
| - aco: test for one and_savexec opcode in try_optimize_branching_sequence |
| - aco: fix long-jump version of discard early exit |
| - radv: remove claimed support for sRGB vertex buffer formats |
| - radv: fix 16-bit support in radv_lower_vs_input |
| - aco: fix 16-bit VS inputs |
| - radv/llvm: fix packed VS inputs on GFX6/GFX10+ |
| - aco: don't expand vec3 VS input load to vec4 on GFX6 |
| - aco: allow direct_fetch=true for vec4 VS input loads |
| - nir/print: support nir_texop_descriptor_amd |
| - aco: add SCC clobber in build_cube_select |
| - nir/lower_tex: ignore width of cube textures |
| - radv: enable ac_nir_lower_resinfo for ACO |
| - aco: remove dead code for querying image size/samples/levels |
| - util/ralloc: add HEADER_ALIGN macro |
| - util: add freelist allocator with mark/sweep |
| - nir/serialize: remove unused parameter from read_src() |
| - Revert "nir: Drop the unused instr arg for src/dest copy functions." |
| - nir: adjust nir_src_copy signature to take a nir_instr * |
| - nir: use a GC context for instructions |
| - util/format,vulkan/format: update p_format for Vulkan vertex buffers |
| - ac: add ac_vtx_format_info |
| - radv: add radv_is_vertex_buffer_format_supported |
| - radv,aco: use pipe_format for static vertex input state |
| - radv,aco: use pipe_format for dynamic vertex input state |
| - radv: remove radv_translate_vertex_format |
| - radv: enable R8G8B8, B8G8R8 and R16G16B16 vertex formats |
| - radv,aco: implement 64-bit vertex inputs |
| - radv: enable 64-bit vertex formats |
| - aco/ra: handle empty def_reg interval in get_regs_for_copies |
| - aco/ra: remove bounds parameter from get_regs_for_copies() |
| - aco/ra: rework fixed operands |
| - nir/algebraic: optimize bits=umin(bits, 32-(offset&0x1f)) |
| - nir/algebraic: optimize fabs(bcsel(b, fneg(a), a)) |
| - radv: disable EXT_vertex_input_dynamic_state when using DGC |
| - radv: shrink zero-initialization in vkCmdSetVertexInputEXT |
| - radv: use nir_ubfe_imm |
| - radv: remove unnecessary .align_mul=4 |
| - radv: fix dynamic RT stack size with VGPR spilling |
| - aco: DCE ra_ctx::defs_done |
| - aco: rename Interp_instruction to VINTRP_instruction |
| - aco: add reg() helper to assembler |
| - aco: fix assembly of MUBUF-to-LDS loads |
| - aco: add GFX11 opcode numbers |
| - aco/gfx11: don't use more than 1 NSA dword |
| - aco: update assembler for GFX11 |
| - aco: limit GFX11 to 128 VGPRs for now |
| - aco: add LDSDIR instruction format |
| - aco: add VINTERP instruction format |
| - aco: omit read-only memory_sync_info when printing |
| - aco/tests: add GFX11 assembly tests |
| - aco: mostly implement FS input loads on GFX11 |
| - aco: fix VMEMtoScalarWriteHazard s_waitcnt mitigation |
| - aco: improve VMEMtoScalarWriteHazard s_waitcnt mitigation |
| - aco: use some helpers in GFX10 hazard workarounds |
| - aco: improve printing of sgpr_null |
| - aco: improve printing of s_waitcnt_depctr |
| - aco: add VMEMtoScalarWriteHazard tests |
| - aco/gfx11: swap ds_cmpst_* data operands |
| - aco: improve wait_imm unpack |
| - aco/gfx11: fix s_waitcnt printing |
| - aco: update sendmsg enum from LLVM |
| - aco/gfx11: deallocate VGPRs at the end of the shader |
| - aco/gfx11: update form_hard_clauses |
| - aco: limit hard clauses to 63 instructions |
| - aco: fix assembler.gfx11.vinterp test |
| - aco: add search_backwards helper |
| - aco/gfx11: workaround VcmpxPermlaneHazard |
| - aco/gfx11: workaround LdsDirectVALUHazard |
| - aco/gfx11: workaround LdsDirectVMEMHazard |
| - aco/gfx11: workaround VALUTransUseHazard |
| - aco/gfx11: workaround VALUPartialForwardingHazard |
| - aco/gfx11: workaround VALUMaskWriteHazard |
| - aco: add ACO_DEBUG=force-waitdeps |
| - nir/algebraic: optimize b<<a + c<<a |
| - ac/llvm: fix mixing non-uniform/uniform sampler/texture descriptors |
| - ac/llvm: ignore g16 if the image opcode doesn't use derivatives |
| - aco/gfx11: optimize LS/HS load_local_invocation_index |
| - aco: swap v_perm_b32 operands |
| - radv,nir: add intrinsics for streamout and GS copy shaders |
| - ac/nir: add legacy streamout and GS copy shader helpers |
| - ac/nir: micro-optimize boolean expression |
| - ac/nir: add ac_nir_lower_ngg_options |
| - aco: add storage_gds |
| - aco: insert waitcnt before/after ds_ordered_count |
| - nir,ac/nir,aco,radv: replace has_input_*_amd with more general intrinsics |
| - aco: don't split swizzled store_buffer_amd on GFX9+ |
| - ac/nir: lower gfx11 vertex parameter exports |
| - radv/gfx11: don't create fmask copy/expand pipelines |
| - radv/gfx11: don't create texop_samples_identical |
| - radv,aco: don't use lower_to_fragment_fetch_amd on GFX11+ |
| - aco: fix typo in branch lowering |
| - aco/gfx11: perform FS input loads in WQM |
| - aco/gfx11: fix FS input loads in quad-divergent control flow |
| - radv/llvm: fix GS shaders on GFX8/9 |
| - docs: update new_features.txt for non-extension additions to RADV |
| - radv: increase gfx1100/gfx1101 physical vgprs |
| - ac/gpu_info: increase gfx1100/gfx1101 physical vgprs |
| - aco/gfx11: increase gfx1100/gfx1101 physical vgprs |
| - radeonsi: increase gfx1100/gfx1101 physical vgprs |
| - nir/lower_bit_size: lower uadd_carry |
| - radv: lower 8/16-bit uadd_carry/usub_borrow |
| - radv: disable MRT compaction with dual-source blending |
| - aco: ensure MRT0 is written with dual source blending |
| - ac/nir: mask shift operands |
| |
| Riteo (1): |
| |
| - vulkan/device_select_wayland: fix a memory leak with DRM device handling |
| |
| Rob Clark (106): |
| |
| - gbm: Add USE_FRONT_RENDERING flag |
| - freedreno: Disable UBWC for frontbuffer rendering |
| - freedreno/drm: Fix potential bo cache vs export crash |
| - freedreno/gmem: Fix col0 calc |
| - freedreno: Drop fixed upper bound on # of tiles |
| - freedreno/drm: Add FD_VA_SIZE param |
| - freedreno: Cap reported video memory by VA size |
| - freedreno/drm: Avoid lock in fd_pipe_del() |
| - egl: atrace support |
| - freedreno/drm/virtio: Add some atrace |
| - util: Fix vs2019 build error |
| - egl: Drop _eglSetFuncName() return |
| - egl: Drop unused _EGL_FUNC_START() arg |
| - egl: Remove redundant fxn param |
| - egl: Convert to simple_mtx_t |
| - egl: Add some locking asserts |
| - egl/dri2: Indenting fix |
| - egl/dri2: Error path cleanups |
| - llvmpipe: Add some missing locking |
| - freedreno: Extract helper to check for batch dependency |
| - freedreno: Simplify add_dep logic |
| - freedreno: Avoid deferred-flush dependency loops |
| - Revert "ci: disable the freedreno farm." |
| - freedreno/a6xx: Random indent fixes |
| - freedreno/drm: Let backend hint about upload vs map |
| - freedreno/drm: Add offset param for fd_bo_upload() |
| - freedreno: Add support for upload transfers |
| - Revert "ci: disable the freedreno farm." |
| - egl/wgl: Make ref_count atomic |
| - egl/dri2: Make ref_count atomic |
| - egl/dri2: Add display lock |
| - egl: Make RefCount atomic |
| - egl: Introduce rwlock to protect eglTerminate() |
| - egl: Relax locking |
| - freedreno: Misc indent fix |
| - freedreno: nr_rings -> nr_priorities |
| - freedreno: Remap high/norm/low priorities |
| - freedreno/virtio: Don't upload if we have valid range |
| - freedreno/drm/virtio: Don't prefer upload for mapped buffers |
| - freedreno/drm/virtio: Handle read after upload |
| - freedreno: We really don't need aligned vbo's |
| - freedreno: Update github wiki links |
| - freedreno/ir3: GC unused macro |
| - freedreno/drm: Inline fd_bo_get_iova() |
| - freedreno/drm: Add fd_ringbuffer_attach_bo() |
| - freedreno/a6xx: Drop "hardpin" support |
| - freedreno/drm: Simplify emit_reloc_common |
| - freedreno/a6xx: Skip IBO state when unused |
| - freedreno/a6xx: Pre-calculate user const state size |
| - freedreno: Drop unused arg |
| - freedreno/a6xx: Move user const upload to bind |
| - freedreno/a6xx: Simplify fd6_build_user_consts() |
| - freedreno: Handle xfb invalidation |
| - freedreno: Use TC cpu-storage to shadow buffers |
| - freedreno/drm: Don't call kernel with no ops |
| - freedreno: Add perf-debug trace |
| - isaspec: Fix out of date comment |
| - gallium/u_threaded: Add some atrace/perfetto |
| - gallium/u_threaded: Add optional call-id tracing |
| - freedreno: Fix fence unref race |
| - docs: Update freedreno features |
| - mesa/st: ARB_vertex_attrib_64bit depend on glsl>=410 |
| - freedreno/a6xx: Remove unused param |
| - freedreno/ir3: Fix clipvertex with GS+tess |
| - freedreno/ir3: Lower all the 64b |
| - freedreno/a6xx: Fix MAX_GEOMETRY_OUTPUT_VERTICES cap |
| - freedreno/a6xx: Fix primitives-generated query |
| - freedreno/a6xx: Update caps |
| - freedreno/a6xx: Drop max_scissor tracking |
| - Revert "egl: Factor some common terminate cleanup up to common code" |
| - nir: Add helper to create passthrough TCS shader |
| - radeonsi: Use nir passthrough TCS helper |
| - freedreno: Dirty the TCS state if patch_vertices change |
| - freedreno/ir3: Passthrough TCS support |
| - freedreno: Add tess_state support |
| - freedreno: Passthrough TCS support |
| - v3d/ci: Add a flake |
| - Revert "mesa/st: ARB_vertex_attrib_64bit depend on glsl>=410" |
| - freedreno: Don't advertise TGSI support for GS/tess |
| - freedreno/a6xx: Move layer_zero handling to interp state |
| - freedreno/ir3: Drop unused view_zero/layer_zero lowering |
| - freedreno/a6xx: Fix buffer size clamping |
| - freedreno/a6xx: Support AMD_vertex_shader_layer |
| - freedreno: support multiple viewports |
| - freedreno: Massage scissor state at bind time |
| - freedreno: Move guardband calc to bind time |
| - freedreno/a6xx: Support GL_ARB_viewport_array + gl43 |
| - freedreno/a6xx: Enable GL_ARB_texture_mirror_clamp_to_edge |
| - freedreno: Enable GL_ARB_enhanced_layouts |
| - freedreno/ci: Small bit of xfails cleanup |
| - docs: Update features.txt |
| - nir: Add way to create passthrough TCS without VS nir |
| - freedreno: Add ARB_gl_spirv support |
| - freedreno/a6xx: Use box to pass 2d clear params |
| - freedreno/a6xx: Implement ARB_clear_texture |
| - freedreno/a3xx+: Enable ARB_shader_texture_image_samples |
| - freedreno/a3xx+: Enable ARB_derivative_control |
| - freedreno/ir3: Unconditionally lower subgroup ops |
| - freedreno/a6xx: Enable ARB_shader_group_vote |
| - freedreno/a6xx: Fix occlusion queries |
| - freedreno/a6xx: Remove unused field |
| - freedreno/a5xx+a6xx: Add base class for query samples |
| - freedreno/batch: Add a global epilogue |
| - freedreno: Core ARB_query_buffer_object support |
| - freedreno/a6xx: ARB_query_buffer_object support |
| - freedreno/a6xx: Mark gl45 supported |
| |
| Rohan Garg (3): |
| |
| - anv, iris: Disable pre fetching the binding table entries on DG2 |
| - intel/compiler: Support 16 bit float ops |
| - anv: Enable 16 bit float ops on devices that have a LSC |
| |
| Roland Scheidegger (4): |
| |
| - lavapipe: initialize index_bias to zero for non-indexed draws |
| - gallivm: always do per-pixel LOD for cube maps |
| - llvmpipe: fix front/back face system value in the fs |
| - gallivm: remove dead cube map lod calculation code |
| |
| Roman Gilg (2): |
| |
| - loader/dri3: Simplify buffer age query |
| - loader/dri3: Add DRI performance option to wait for next buffer on swap |
| |
| Roman Stratiienko (6): |
| |
| - v3dv: Enable sync_fd importing/exporting on Android |
| - v3dv: Limit API version to v1.0 for Android |
| - v3dv: Avoid swapchain_info handling on Android |
| - lima: Ensure jobs initialized before calling lima_job_fini() |
| - Android: Use libgbm_mesa name for SDK30+ |
| - meson: Enable system_has_kms_drm for android |
| |
| Ruijing Dong (30): |
| |
| - frontends/va: improve enc quality interface change |
| - radeonsi/vcn: add enc quality bits interface |
| - radeonsi/vcn: dpb change for supporting pre-encoding |
| - radeonsi/vcn: support encoding preset modes |
| - radeonsi/vcn: support VBAQ modes |
| - radeonsi/vcn: remove rate control double begin IBs |
| - radeonsi/vcn: add AUD syntax to h264 encoding |
| - frontend/va: remove some unnecessary code |
| - frontends/va: change getEncParamPreset location |
| - frontends/va: add max_frame_size into rate control |
| - frontends/va: add HRD, filler data enable and etc |
| - frontends/va: enable sao in hevc encoding |
| - radeon/vcn: process invalid frame rate in encoding |
| - radeonsi/vcn: support async |
| - radeonsi/vce: using fixed value for vce 52 |
| - gallium/pipe/video: group avc encoder seq structure |
| - radeonsi/vce: apply avc seq parameters |
| - frontends/omx: apply avc seq parameters |
| - frontends/va: apply avc seq parameters |
| - radeonsi/vcn: use avc seq paramters |
| - frontends/va: add aspect ratio and vui timing info |
| - radeonsi/vcn: support aspect ratio and vui timing info |
| - d3d12: Updating refactored fields in pipe_h264_enc_picture_desc |
| - frontends/va: fix av1 decoding image distortion issue |
| - gallium/pipe/video: add data struct for dbk in avc encoding. |
| - frontends/va: add h264 dbk enc parameters |
| - radeonsi/vcn: enable dbk in avc encoding |
| - radeonsi/vcn: enable multi-slice encoding |
| - frontends/va: fixed an av1 decoding image corruption issue. |
| - frontends/va: fixed an av1 dec image corruption. |
| |
| Ryan Houdek (1): |
| |
| - vulkan/wsi: Add dep_libudev to idep dependencies |
| |
| Sagar Ghuge (5): |
| |
| - intel/isl: Setting L1 caching policy to Write-back mode |
| - anv: Specify Untyped L1 cache policy for stateless accesses |
| - iris: Specify Untyped L1 cache policy for stateless accesses |
| - anv: Handle bits to flush data-port's Untyped L1 data cache |
| - iris: Handle new untyped dataport cache flush PIPE_CONTROL field |
| |
| Sajeesh Sidharthan (1): |
| |
| - radeonsi/vcn: set current pic index correctly |
| |
| Sami Kyöstilä (1): |
| |
| - intel/ds: Update to Perfetto API v28.0 |
| |
| Samuel Pitoiset (329): |
| |
| - vulkan: add support for VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT |
| - radv: implement VK_EXT_attachment_feedback_loop_layout |
| - radv: ignore out-of-order rasterization if stencil write mask is dynamic |
| - radv: rework shaders ref counting |
| - radv: use ref counting for VS prologs and PS epilogs |
| - radv: remove RADV_META_SAVE_SAMPLE_LOCATIONS |
| - radv: simplify saving/restoring all dynamic states |
| - radv: remove unused states parameter from some radv_emit_XXX() helpers |
| - radv: simplify radv_bind_dynamic_state() slightly |
| - radv: fix gathering XFB info if there is dead outputs |
| - radv: respect the render area for depth/stencil resolves |
| - radv: fix cleaning the meta query state if an error occured |
| - radv: properly ignore PSO states that are dynamic |
| - radv: remove one useless check about line width and dynamic state |
| - radv: simplify dynamic states uses when determining NGG culling settings |
| - radv: do not set registers for PSO states that are dynamic |
| - radv: remove radv_invariant_geom workarounds for DX games |
| - radv: explain why radv_invariant_geom is still needed for SotTR DX |
| - radv: remove unecessary radv_pipeline::uses_dynamic_stride |
| - radv: copy line stipple factor/pattern only if the state isn't dynamic |
| - radv: copy the sample locations only if the state isn't dynamic |
| - zink: stop skipping bunch of GLESx tests |
| - vulkan: initialize discardRectangleCount even if the state is dynamic |
| - vulkan: fix initializing the primitive topology when it's dynamic |
| - vulkan: add attachment sample count info AMD support |
| - radv: store the provoking vertex mode in the graphics pipeline info |
| - radv: simplify the depth clip enable/disable logic |
| - radv: do not translate polygon mode during gfx info initialization |
| - radv: do not translate logic op during gfx info initialization |
| - radv: do not translate blend op/factor during gfx info initialization |
| - radv: do not translate primitive topology during gfx info initialization |
| - radv: slightly change the color write enable mask |
| - radv: call vk_graphics_pipeline_state_fill() when initializing gfx pipelines |
| - radv: convert radv_tessellation_info to vk_tessellation_state |
| - radv: convert radv_viewport_info to vk_viewpoert_state |
| - radv: convert radv_discard_rectangle_info to vk_discard_rectangles_state |
| - radv: convert radv_rasterization_info to vk_rasterization_state |
| - radv: convert radv_depth_stencil_info to vk_depth_stencil_state |
| - radv: convert radv_fragment_shading_rate_info to vk_fragment_shading_rate_state |
| - radv: convert radv_input_assembly_info to vk_input_assembly_state |
| - radv: convert radv_multisample_info to vk_multisample_state |
| - radv: convert radv_color_blend_info to vk_color_blend_state |
| - radv: convert radv_rendering_info to vk_render_pass_state |
| - radv: convert radv_vertex_input_info to vk_vertex_input_state |
| - radv: remove radv_graphics_pipeline_info completely |
| - radv: advertise VK_KHR_global_priority |
| - radv: remove unused pCreateInfo from radv_pipeline_init_blend_state() |
| - radv: add radv_graphics_pipeline_import_info() helper |
| - radv: introduce a new pipeline type for graphics libs |
| - radv: store vk_graphics_pipeline_state to radv_graphics_pipeline |
| - radv: prevent accessing rasterization state when it's NULL |
| - radv: add a very initial implementation of VK_EXT_graphics_pipeline_library |
| - radv: do not compress DCC in presence of render loops on GFX10+ |
| - radv: remove useless check when creating the noop FS |
| - radv: copy linked I/O info for merged stages in radv_fill_shader_info |
| - radv: copy the whole radv_shader_info struct for merged stages |
| - radv: add a radv_postprocess_nir() helper |
| - radv: move lowering FS intrinsics to radv_postprocess_nir() |
| - radv: move dumping NIR shaders to the previous similar loop |
| - radv: remove unused parameter in radv_lower_io_to_mem() |
| - radv: use NIR_PASS with radv_export_implicit_primitive_id |
| - radv: remove useless parameter from radv_link_shaders() |
| - radv: add radv_remove_point_size() linking helper |
| - radv: rework linking shaders in NIR |
| - radv: move assigning driver locations to the new linking helpers |
| - radv: move lowering patch vertices to radv_pipeline_link_tcs() |
| - radv: move lowering the view index to radv_pipeline_link_shaders() |
| - radv: remove redundant assignment of tcs.tcs_vertices_out |
| - radv: remove unused tcs_vertices_out assignment for VS |
| - radv: remove unused num_tess_patches assignment for VS |
| - radv: merge gather_tess_info() with radv_fill_shader_info() |
| - radv: stop setting CB_COLOR_CONTROL.ROP3 from the pipeline |
| - radv: stop clearing bitfields for registers that are emitted dynamically |
| - radv: cleanup dynamic states in radv_emit_graphics_pipeline() |
| - radv: stop emitting RMW context registers for updating sample locations |
| - radv: re-emit viewports if negative one to one or depth clamp mode changed |
| - radv: remove bogus assertion about independent set layouts with GPL |
| - radv: fix missing initialization of the pipeline layout when creating a lib |
| - radv: destroy the pipeline layout if creating a library failed |
| - aco: fix wrong size for 1D images and A16 on GFX9 |
| - ac: constify ac_compute_cs_workgroup_size() |
| - radv: stop duplicating radv_es_output_info |
| - radv: use esgs_itemsize when calling ac_nir_lower_es_outputs_to_mem |
| - radv: compute the ESGS itemsize outside of radv_nir_shader_info_pass() |
| - radv: remove dead code about task ring when binding a compute pipeline |
| - radv: replace cs.uses_task_rings by ms.has_task |
| - radv/llvm: remove unused parameter in handle_vs_outputs_post() |
| - radv: stop duplicating radv_vs_output_info |
| - radv: fill radv_vs_output_info unconditionally for vertex related stages |
| - radv: remove redundant VS output parameter assignments |
| - radv: add a helper that links shader info between stages |
| - radv: link primitive ID/clip distance shader info from the new helper |
| - radv: stop gathering info for FS before other stages |
| - radv: move filling cs.block_size |
| - radv: add gather_shader_info_fs() helper |
| - radv: add gather_shader_info_mesh() helper |
| - radv: add gather_shader_info_gs() helper |
| - radv: add gather_shader_info_tes() helper |
| - radv: add gather_shader_info_tcs() helper |
| - radv: add gather_shader_info_vs() helper |
| - radv: add gather_shader_info_cs() helper |
| - radv: add gather_shader_info_task() helper |
| - radv: move more CS info to gather_shader_info_cs() |
| - radv: move more MS info to gather_shader_info_ms() |
| - radv: move determining NGG shader info to radv_fill_shader_info() |
| - radv: store binning settings into the physical device |
| - radv: make sure to emit BREAK_BATCH when color write enable is dynamic |
| - radv: remove useless parameter in si_write_scissors() |
| - radv: split emitting guardband into a separate helper |
| - radv: rework dirtying scissors when the line width changes |
| - radv: emit the guardband state separately from the scissor state |
| - radv: allow null descriptor set layouts in pipeline layouts |
| - radv: determine if the topology uses adjacency from the shaders on GFX9+ |
| - radv: fix bogus assertion with RADV_FORCE_VRS |
| - aco: remove unused isel_context::tcs_num_patches |
| - radv: stop setting as_ls for the TCS stage |
| - radv: stop setting as_es for the GS stage |
| - radv: stop setting num_tess_patches for the GS stage |
| - radv: stop setting esgs_itemsize to the GS stage |
| - radv: cleanup helpers that compute NGG info and GS info on GFX9+ |
| - radv: move computing wave_size/ballot_bit_size to the shader info pass |
| - radv: move computing NGG info and GS info to radv_nir_shader_info_link() |
| - radv: refactor radv_nir_shader_info_link() |
| - radv: move determining other NGG settings in radv_link_shaders_info() |
| - radv: simplify determining if NGG passthrough can be enabled |
| - radv: cleanup computing the workgroup size for all stages |
| - radv: add and use a helper that merges shader info for merged stages on GFX9+ |
| - radv: remove unused parameters in radv_fill_shader_info() |
| - radv: determine as_ls/as_es in radv_link_shaders_info() |
| - radv: constify radv_shader_info where it's missing |
| - radeonsi/ci: trigger radeonsi-raven-skqp for any RADV changes |
| - radv: gather MRTs that are written by the fragment shader |
| - radv: compact MRTs to save PS export memory space |
| - radv: only expose sparseResidencyImage3D on GFX9+ |
| - radv: emit SQTT markers for RT related commands |
| - radv: capture RT pipelines from the SQTT layer |
| - radv: fix reporting RT shaders in RGP |
| - radv: do not lower color exports for FS that need an epilog |
| - radv: add radv_remove_color_exports() helper |
| - radv: do not try to remove color exports for FS that need an epilog |
| - radv: keep track of the code size for VS prologs and PS epilogs |
| - radv: create a PS epilog from a library without the main FS |
| - radv: add support for emitting and prefetching PS epilogs |
| - radv: import PS epilog from libraries if present |
| - radv: fix hw remapping of MRT holes with color attachments without export |
| - radv: fix pipelineStageCreationFeedbackCount when it's 0 |
| - radv: add a pointer to radv_shader_binary in radv_shader |
| - radv/ci: re-enable dEQP-VK.renderpass2.depth_stencil_resolve.*_samplemask |
| - radv: store the VS prologs/PS epilogs VA at upload time |
| - radv: split upload_shader_part() in two parts |
| - radv: store the binary to radv_shader_part |
| - radv: upload the PS epilog in the existing pipeline BO |
| - radv: reword a comment about dynamic states when rasterization is disabled |
| - radv: emit the LDS size for TCS directly from the pipeline on GFX9+ |
| - radv: add radv_pipeline_key::dynamic_patch_control_points |
| - radv: skip computing some tess info when patch control points is dynamic |
| - radv: set workgroup_size to 256 when patch control points is dynamic |
| - aco: prevent a division by zero when patch control points is dynamic |
| - radv: add shader arguments for dynamic patch control points |
| - radv: add ABI lowering support for dynamic patch control points |
| - radv: pass the number of patch control points to si_get_ia_multi_vgt_param() |
| - radv: move emitting PRIMGROUP_SIZE for <= GFX9 from the cmdbuf |
| - radv: move emitting GE_CNTL for non-NGG pipelines from the cmdbuf |
| - radv: implement dynamic patch control points |
| - radv: advertise extendedDynamicState2PatchControlPoints |
| - radv: stop setting redundant viewport/scissor for internal operations |
| - radv: do not bind NULL graphics pipeline when restoring the meta state |
| - radv: reset the compute pipeline when the saved one was NULL |
| - radv: stop dirtying the graphics pipeline when restoring it |
| - radv: stop checking for NULL pipelines in radv_CmdBindPipeline() |
| - radv/ci: cleanup lists of failures/flakes |
| - radv: disable VK_EXT_graphics_pipeline_library with LLVM |
| - radv: rename radv_pipeline_key::vs::dynamic_vs_input to has_prolog |
| - radv: prepare the VS input state for prologs created with GPL |
| - radv: bind the VS input state for prologs created with GPL |
| - radv: enable the VS prologs cache if graphicsPipelineLibrary is enabled |
| - radv: do not remove PSIZ for VS when the topology is unknown |
| - Revert "radv: upload the PS epilog in the existing pipeline BO" |
| - nir: add nir_intrinsic_load_sample_positions_amd |
| - radv: add nir_intrinsic_load_sample_positions_amd in the ABI |
| - radv: run nir_opt_cse before lowering FS intrinsics |
| - radv,aco: lower barycentric_at_sample in NIR |
| - radv: do not link shaders when the next stage is unknown |
| - radv: allow to build the main FS in a graphics pipeline library |
| - radv,aco: do not compact MRTs if the pipeline uses a PS epilog |
| - radv: rename radv_thread_trace_set_pstate() to radv_device_set_pstate() |
| - radv: acquire pstate on-demand when capturing with RGP |
| - radv: constify radv_lookup_user_sgpr() |
| - radv: add radv_pipeline_key::dynamic_rasterization_samples |
| - radv: declare shader arguments for the number of samples for FS |
| - nir: add nir_load_rasterization_samples_amd |
| - radv: lower nir_load_rasterization_samples_amd in ABI |
| - nir,radv: pass the number of samples to load_sample_positions_amd |
| - radv: add barycentric_at_sample lowering when the number of samples is dynamic |
| - radv: emit the rasterization samples through an user SGPR if needed |
| - radv: make sure to initialize wd_switch_on_eop before checking its value |
| - radv: pass a VkSampler to write_sampler_descriptor() |
| - radv: remove unnecessary radv_buffer_view::vk_format |
| - radv: rework and rename radv_make_buffer_descriptor() |
| - zink/ci: skip ext_external_objects from Piglit with RADV |
| - zink/ci: skip arb_shader_clock with RADV |
| - zink/ci: skip GLX tests with RADV |
| - zink/ci: update list of expected failures with RADV on NAVI10 |
| - zink/ci: update list of flakes for Piglit with RADV on NAVI10 |
| - radv/ci: add piglit testing with Zink on NAVI10 |
| - radv: do not free the GS copy shader binary if created from a library |
| - radv: import the GS copy shader from a library if present |
| - radv: determine the last VGT api stage from the active_stages bitfield |
| - radv: allow to build the pre-rasterization stages in a library |
| - radv: remove useless gfx10_ngg_info::enable_vertex_grouping |
| - radv: determine the last VGT api stage also for GPL |
| - radv: use the maximum number of vertices per primitives for NGG with GPL |
| - radv: enable NGG culling unconditionally for GPL but disable it dynamically |
| - radv: rework multiview lowering in NIR slightly |
| - radv: fix handling ViewportIndex with GPL |
| - radv: fix handling primitive ID and clip/cull distances with GPL |
| - radv: fix handling multiview with GPL |
| - radv: fix emitting RBPLUS state when MRTs are compacted |
| - radv: remove NULL checks when binding a graphics pipeline |
| - radv: fix initializing the attribute mask for VS input state and GPL |
| - radv: fix initializing the blend state for NULL render pass with GPL |
| - radv: ignore shader stages that don't need to be imported with GPL |
| - radv: add a new drirc option to enable a unified heap on APUs |
| - radv: enable the unified heap on APUs for Red Dead Redemption 2 |
| - radv: fix and rework shaders upload with GPL |
| - radv: always re-emit patch control points when a pipeline with tess is bound |
| - radv: move the disk cache to the vk_physical_device struct |
| - radv: use SHA1_DIGEST_LENGTH in the pipeline cache |
| - radv: do not create a noop FS when the FS is imported from a library |
| - radv: discard the PS epilog when the pipeline doesn't use a fragment shader |
| - ac/llvm: add missing nir_intrinsic_bindless_image_atomic_{fmin,fmax} |
| - radv: set SAMPLE_MASK_TRACKER_WATERMARK to 15 on GFX11 |
| - radv: use correct VGT_TESS_DISTRIBUTION settings on GFX11 |
| - radv: do not use memory for waiting for cache flushes on GFX11 |
| - radv: drop the ES vertex count requirement on GFX11 |
| - radv: add RADEON_FLAG_DISCARDABLE |
| - radv: create and configure the ring BO for attributes |
| - radv: declare shader arguments for attributes via memory |
| - nir: add nir_intrinsic_load_ring_attr_{offset}_amd |
| - radv: add lowering for nir_intrinsic_load_ring_attr_{offset}_amd |
| - radv: allocate more space for pipeline statistics query on GFX11 |
| - ac: add radeon_info::has_vrs_ds_export_bug |
| - radv: advertise fragmentShadingRateWithShaderDepthStencilWrites on NAVI23+ |
| - aco: fix tcs_wave_id unpacking on GFX11 |
| - radv: declare new dynamic states |
| - radv: add support for dynamic polygon mode |
| - radv: add support for dynamic tessellation domain origin |
| - radv: add support for dynamic logic op enable |
| - radv: add support for dynamic line stipple enable |
| - radv: add support for dynamic alpha to coverage enable |
| - radv: add support for dynamic sample mask |
| - radv: add support for dynamic depth clip enable |
| - radv: add support for dynamic conservative rasterization mode |
| - radv: add support for dynamic depth clip negative one to one |
| - radv: add support for dynamic provoking vertex mode |
| - radv: add support for dynamic depth clamp enable |
| - radv: advertise VK_EXT_extended_dynamic_state3 |
| - radv: disable VK_AMD_texture_gather_bias_lod on GFX11 |
| - radv: disable AMD_shader_explicit_vertex_parameter on GFX11 |
| - radv/llvm: do not set COMPR for exports on GFX11 |
| - ac/nir,radv/llvm: add support for the attribute ring |
| - aco,radv/llvm: do not export parameters on GFX11 |
| - ac: fix has_vrs_ds_export_bug for VanGogh |
| - radv: disable {shader,sparse}ImageFloat32AtomicMinMax on GFX11 |
| - radv: disable shaderBufferFloat64AtomicMinMax on GFX11 |
| - radv: disable dual source blending in more situations |
| - ac/llvm: add support for device clock on GFX11 |
| - aco: add support for s_sendmsg_rtn_b{32,64} |
| - aco: split the sendmsg enumeration into sendmsg_rtn |
| - aco: add support for device clock on GFX11 |
| - ac/nir/ngg: fix emitting streamout output by using packed location |
| - radv: fix suspending/resuming pipeline statistics queries with GDS |
| - radv: fix VRS limit when attachmentFragmentShadingRate is disabled |
| - radv/llvm: fix dual source blending on GFX11 |
| - radv: move nir_opt_idiv_const/nir_lower_idiv after NGG lowering |
| - ac/nir/ngg: fix emitting streamout output by using packed location |
| - radv: implement transform feedback queries with NGG streamout |
| - Revert "radv: add a pointer to radv_shader_binary in radv_shader" |
| - aco: create a new builder variant for ds_add_rtn |
| - aco: implement NIR intrinsics for NGG streamout |
| - aco: remove invalid assertions for NGG streamout |
| - radv/llvm: prevent emitting streamout outputs for NGG |
| - radv: call nir_io_add_intrinsic_xfb_info() after IO lowering |
| - radv: lower NIR intrinsics for NGG streamout |
| - radv: do not unconditionally disable NGG streamout lowering in NIR |
| - radv: reword an incorrect comment about primitives generated query |
| - radv: use 32-bit GDS counters for primitives generated query with NGG |
| - radv: suspend/resume primitives generated query separately for legacy/NGG |
| - radv: fix primitives generated query with NGG only |
| - radv: set the correct buffer size for NGG streamout |
| - radv: enable all possible features with NGG streamout |
| - radv: allow to enable NGG streamout with RADV_PERFTEST=ngg_streamout |
| - radv: enable VK_EXT_transform_feedback with NGG streamout on GFX11 |
| - ac/nir: stop using nir_gather_xfb_info_from_intrinsics |
| - radv: assign IO var locations for VS/TES with NGG streamout |
| - radv/amdgpu: add a kernel GDS management workaround for -ENOMEM |
| - radv: use defines instead of magic values for GDS counters offset |
| - radv: move GDS counters after reserved GDS offsets for streamout |
| - radv: fix fallback for extreme geometry with tessellation on GFX11 |
| - radv: fix setting MIN_LOD for texture descriptors on GFX11 |
| - radv: make the GDS/GDS OA buffer objects resident |
| - radv: invalidate L2 instead of only writeback L2 when using DCC stores |
| - radv: re-emit the guardband state when restoring meta operations |
| - ac/nir,radv: rework and fix NGG queries enables for VS/TES |
| - ac/nir: do not convert GS outputs to the expected variable size on GFX11 |
| - radv: re-emit NGG culling settings when conservative rast mode is dynamic |
| - radv: enable lowering of subgroup shuffle in NIR on GFX11+ |
| - aco: fix p_interp_gfx11 to not overwrite SCC |
| - aco: fix missing SCC for p_interp_gfx11 in emit_interp_mov_instr() |
| - aco: add p_dual_src_export_gfx11 for dual source blending on GFX11 |
| - aco: fix dual source blending on GFX11 |
| - aco: fix FS inputs loads in WQM with 16-bit |
| - radv: suspend/resume XFB queries with NGG for meta operations |
| - zink: require extendedDynamicState3ColorBlendEquation for full ds3 |
| - radv: stop overallocating LDS for VS/TES when NGG streamout is enabled |
| - radv: fix computing the pervertex LDS size with NGG streamout |
| - radv: enable NGG XFB queries only if streamout is enabled |
| - radv: re-mit streamout buffers to unbind them when NGG streamout is disabled |
| - radv: fix NGG streamout when it's never enabled in runtime |
| - radv: fix possible hangs with NGG streamout and secondary cmdbuf |
| - aco: fix emitting DEALLOC_VGPRS in the discard block |
| - radv: re-emit dynamic depth clamp enable if depth clip enable changed |
| - radv: make sure to mark DCC as compressed on GFX11 |
| - radv: introduce RADV_DEBUG=nofmask |
| - radv: set missing SPI_SHADER_PGM_xxx registers on GFX11 |
| - radv: set INTERPOLATE_COMP_Z to 0 on GFX11 |
| - radv: do not enable DCC for MSAA images without FMASK |
| - radv: do not enable NGG culling on GFX11 |
| - radv: disable VRS entirely on GFX11 |
| |
| Sarah Walker (3): |
| |
| - pvr: Update FWIF compute register structure |
| - pvr: Remove STRIP_RENDERING members from FWIF |
| - pvr: Update FWIF 3d and compute register structures |
| |
| Sathishkumar S (2): |
| |
| - frontends/va: support yuv 400/444 rt_formats in vaconfig |
| - frontends/va: reallocate surface for yuv400/yuv444 picture |
| |
| Sergei Chernyadyev (1): |
| |
| - radv: fix setting results for initialization failures in thread trace and trap handler |
| |
| Shuicheng Lin (1): |
| |
| - iris: return failure if iris_resource_configure_main fail |
| |
| Sil Vilerino (76): |
| |
| - gallium/va: vaDeriveImage to check PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP |
| - d3d12: Implement cap PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP |
| - frontends/va: Add HEVC decode slice descriptors |
| - frontends/va: Add HEVC decode args: IntraPicFlag, no_pic_reordering_flag, no_bipred_flag |
| - frontends/vdpau: Fill HEVC Decode param IntraPicFlag |
| - frontends/omx: Fill HEVC Decode param IntraPicFlag |
| - frontends/va: Mark IsLongTerm in HEVC decode args |
| - frontends/va: Add HEVC Encode support multi slice and extend pipe args |
| - frontends/va: Extend single to multiple L0-L1 references for HEVC Encode |
| - frontends/va: Support HEVC caps regarding features, block sizes, prediction direction |
| - d3d12/va: Name convention rename PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP to PIPE_VIDEO_CAP_SUPPORTS_CONTIGUOUS_PLANES_MAP |
| - gallium/vl: Allow vl_zscan.h to be included from C++ |
| - gallium/vl: Rename s_addr variable in vl_idct.c as it conflicts with windows existing inaddr.h keyword definition |
| - d3d12: Add HEVC Decode/Encode |
| - d3d12: Avoid extra allocation, copies when generating DXVA_Slice_Hxxx_Short arrays |
| - d3d12: Avoid local allocations for D3D12_RESOURCE_BARRIER on hot paths |
| - d3d12: Avoid heap allocations on hot path d3d12_video_decoder_dxva_picparams_from_pipe_picparams_hevc |
| - d3d12: Fill feedback in d3d12_video_encoder_encode_bitstream so vaSyncSurface properly populates buf->coded_size |
| - d3d12: Fix leak in d3d12_video_proc when re-creating ID3D12VideoProcessor |
| - d3d12: Fix winsys displaytarget leak in d3d12_resource |
| - d3d12: Fix leak in d3d12_resource_from_resource and usage in d3d12 video dec, enc |
| - d3d12: Add support for importing d3d12_video_buffer from handle |
| - d3d12: Allow formats other than NV12 in d3d12_video_buffer |
| - d3d12: Allow video processing for formats other than NV12 |
| - d3d12: Add VPBlit processor check for D3D12_FEATURE_VIDEO_PROCESS_MAX_INPUT_STREAMS |
| - d3d12: d3d12_video_buffer_create_impl make resident after checking for resource creation |
| - frontends/va: Implement vaSyncBuffer |
| - d3d12: Update HEVC Encode GOP on I frames too |
| - d3d12: Video process - Remove unnecessary batches flush |
| - d3d12: Add initialization values for d3d12_video_encoder |
| - d3d12: Report PIPE_VIDEO_CAP_REQUIRES_FLUSH_ON_END_FRAME |
| - d3d12: Add support for PIPE_VIDEO_CAP_ENC_SUPPORTS_ASYNC_OPERATION |
| - d3d12: Change type of m_FenceValue to uint64_t in d3d12_video_encoder |
| - d3d12: Export some util functions from d3d12_fence for d3d12 video |
| - d3d12: Keep track of feedbacks from previous encode executions |
| - d3d12: Change d3d12_video_encoder to not automatically flush |
| - d3d12: Move encode configuration and capabilities into separate structs |
| - d3d12: Encoder to use independent command allocators per in flight resource set |
| - d3d12: d3d12_video_encoder add reset() to prevent leaks on re-allocation |
| - d3d12: Have d3d12_video_encoder keep separate references for enc, heap and dpb allocations for in flight resources |
| - d3d12: Make d3d12 encode operations async and do not block waiting |
| - d3d12: Add output buffer to inflight resourceset |
| - d3d12: Fix redundant/inconsistent initializations for d3d12_video_encoder |
| - d3d12: Enhance get_feedback not enough buffers message |
| - d3d12: Extract into common variable for metadata slot calculation |
| - d3d12: Make get_feedback return correctly on error when writing \*size=0 |
| - d3d12: Relax slice caps requirements for video encode |
| - d3d12: Fix out of bounds index check for texture array DPB |
| - d3d12: Add debug verbosity for output reconstructed picture |
| - d3d12: Fix texture array DPB encode path |
| - d3d12: Fix usage of %ld with PRIu64 instead for cross-platform build |
| - d3d12: Replace size_t with uint64_t in d3d12_video_encoder definitions for cross-plat compatibility |
| - frontends/va: Add more params from VADecPictureParameterBufferAV1 and VASliceParameterBufferAV1 to pipe_av1_picture_desc |
| - frontends/va: Support AV1 Decode with multiple tiles (num_elements > 1) in a single VASliceParameterBufferAV1 buffer |
| - frontends/va: Use is_video_format_supported in vlVaVidEngineBlit instead of format list |
| - frontends/va: Fix AV1 decode qm_* parameters reading from VADecPictureParameterBufferAV1 |
| - d3d12: Add support for AV1 Video Decode |
| - d3d12: Support more video formats |
| - d3d12: Fix aliased usage of m_SliceControlBuffer in d3d12_video_decoder_prepare_dxva_slices_control_XXXX |
| - d3d12: Style variable fix in d3d12_video_decoder_prepare_dxva_slices_control_av1 |
| - frontends/va: Add VP9 decode multi slice information |
| - frontends/va: Add VP9 decode ref/mode_deltas information |
| - d3d12: Add VP9 Decode support |
| - d3d12: Fix get_index7bits - Reuse previously freed indices |
| - d3d12: HEVC Encode workaround for edge case in caps reporting not contemplated by upper layer interface |
| - d3d12: Fix HEVC wrong caps detection due to bad parenthesis in condition |
| - d3d12: HEVC Set D3D12_VIDEO_ENCODER_CODEC_CONFIGURATION_HEVC_FLAG_USE_ASYMETRIC_MOTION_PARTITION when required by caps |
| - gallium/vl: Add vl_winsys_win32 support |
| - frontends/va: Add windows VA frontend support via vl_winsys_win32 and libva-win32 |
| - d3d12: resource_from_handle to validate importing resource ptr from same d3d12 device |
| - CI: Add gallium-va and video-codecs in windows-vs2019 and debian-mingw32-x86_64 |
| - d3d12: Create d3d12_resource objects for video usage keeping their DXGI\_..._UNORM formats |
| - d3d12: HEVC Decode - Fix slice size and offsets translation from VA to DXVA |
| - frontend/va: Fix WIN32 VA_DRIVER_INIT_FUNC declaration. Remove declspec as it uses .def file |
| - ci: Add va frontend to windows-build-rules |
| - d3d12: Video Screen - Do not crash if HEVC not supported, return no support instead |
| |
| Simon Fels (1): |
| |
| - venus: drop duplicated YCBCR_2_PLANE_444_FORMATS_FEATURES_EXT |
| |
| Simon Ser (2): |
| |
| - vulkan/wsi/x11: detect Xwayland via the XWAYLAND extension |
| - egl: fix error string for dri2_create_image_dma_buf() |
| |
| Simon Zeni (10): |
| |
| - mesa: fix EXT_EGL_image_storage target validation |
| - gallium: track if st_egl_image was created by a dmabuf |
| - mesa: simplify st_egl_image binding process for texture storage |
| - mesa/st: check egl image and texture target match before binding |
| - mesa: fix GL_INVALID_OPERATION in glEGLImageTargetTexStorageEXT |
| - mesa: unlock texture on error path in glEGLImageTargetTexStorageEXT |
| - egl: fix Xlib include in egldisplay.h |
| - egl: set API symbols public |
| - EGL: sync headers with Khronos |
| - meson: remove unused EGL_NO_X11 definition |
| |
| Sonny Jiang (1): |
| |
| - radeonsi/vcn: add decode support for gfx1101 and gfx1103 |
| |
| SoroushIMG (22): |
| |
| - zink: Fix incorrect emission of SPIR-V shift ops |
| - zink: track min_samples state for per sample shading |
| - zink: lower 64bit vars for drivers without float64 |
| - zink: optimise lower_64bit_vars for when int64 is supported |
| - zink: lower 64bit pack when float64 not supported |
| - zink: fix nir_op_unpack_64_2x32 emission |
| - zink: fix spirv temp func variable class emission |
| - nir: track whether a loop contains soft fp64 ops |
| - nir: allow to fine tune unrolling for loops with soft fp64 ops |
| - zink: enable loop unrolling for loops that have soft fp64 |
| - zink: enable native ETC2 if vk driver supports it |
| - zink: fix invalid Offset set for variables which do not need an offset |
| - zink: stop enabling minmax filtering when not supported |
| - zink: fix isNan mismatch between NIR and SPIR-V |
| - lavapipe: stop allocating 0 size const buffer |
| - zink: cleanup pushconst interface between driver/compiler |
| - zink: use unified pushconst layour for passthorugh tcs |
| - zink: cleanup zink_pipeline_layout_create |
| - zink: add pushconst only pipeline layout |
| - zink: add new framebuffer_is_layered state |
| - zink: clamp gl_Layer output to 0, if framebuffer is not layered |
| - zink: limit gl_Layer clamping to drivers that need it |
| |
| Sunil Khatri (1): |
| |
| - winsys/amdgpu: clamp up the alignment if zero |
| |
| Sviatoslav Peleshko (2): |
| |
| - iris: Always initialize shader compilation queue ready fence |
| - driconf/intel: Use fake vendor to WA bad detection in Source engine games |
| |
| Tapani Pälli (40): |
| |
| - glsl: allow image*Shadow keywords on ES and GLSL >= 420 |
| - glsl: add check for too large atomic counter buffer offset |
| - iris: implement Wa_14015946265 for DG2 |
| - anv: implement Wa_14015946265 for DG2 |
| - intel/compiler: implement Wa_14014595444 for DG2 |
| - intel/compiler: fix assert from ver to verx10 |
| - intel/genxml: add VFG_PREEMPTION_CHICKEN_BITS register |
| - anv: disable preemption on VFG, Wa_14015207028 for DG2 |
| - iris: disable preemption on VFG, Wa_14015207028 for DG2 |
| - anv: remove vk_sample_locations_state from emit_multisample |
| - intel/common: clamp sample location coordinate range |
| - anv: remove primitive_topology from 3DPRIMITIVE calls |
| - mesa/st: fix a set_sampler_views call parameter order |
| - anv: implement Wa_14016118574 |
| - iris: implement Wa_14016118574 |
| - anv: limit to default fast clear color when image used for transfer |
| - intel: revert preemption disable via VFG changes |
| - intel/fs: mark debug variables with ASSERTED |
| - anv: mark debug variables with ASSERTED |
| - mesa: mark debug variables with ASSERTED |
| - anv: dynamic state for polygon mode |
| - anv: dynamic state for tessellation domain origin |
| - anv: dynamic rasterization stream |
| - anv: dynamic line stipple enable |
| - anv: dynamic line rasterization mode |
| - anv: dynamic state for logic op enable |
| - anv: dynamic alpha to one enable |
| - anv: dynamic states for depth clip and clamp |
| - anv: dynamic provoking vertex mode |
| - anv: dynamic color write mask |
| - anv: dynamic color blend equation |
| - anv: dynamic multisample sample mask |
| - anv: toggle on EXT_extended_dynamic_state3 |
| - anv: move emit_shading_rate to gfx8_cmd_buffer |
| - anv: fill AlphaToCoverageEnable lazily from state |
| - intel/compiler: run nir_opt_idiv_const before nir_lower_idiv |
| - anv: remove some unused functions |
| - hasvk: remove some unused functions |
| - iris: setup stage bitmask for Wa_22011440098 |
| - anv: setup stage bitmask for Wa_22011440098 |
| |
| Tatsuyuki Ishi (4): |
| |
| - radv: Implement radv_flush_before_query_copy to workaround UE Vulkan bugs. |
| - radv: Remove unused functions. |
| - radv: Remove unused radv_deferred_queue_submission. |
| - radv,lvp,anv: Delete pre-common dispatch leftover. |
| |
| Thomas Debesse (7): |
| |
| - r600: set clear_buffer = u_default_clear_buffer |
| - r600: info.stage MESA_SHADER_KERNEL as MESA_SHADER_COMPUTE |
| - nv50: call nir_lower_flrp |
| - clover: implement CLOVER_DEVICE_TYPE like RUSTICL_DEVICE_TYPE |
| - Revert "gallium/clover: pass -no-opaque-pointers to Clang", opaque pointers are now implemented |
| - gallium/clover: pass -opaque-pointers to Clang on LLVM 15 and 16 |
| - crocus: set clear_buffer = u_default_clear_buffer |
| |
| Thomas H.P. Andersen (15): |
| |
| - nvc0: fix a warning -Wconstant-conversion |
| - radeonsi: avoid a use-after-free |
| - d3d12: fix warnings for missing-braces on clang |
| - r600: fix warnings for missing-braces on clang |
| - ci: remove missing-braces from no-error |
| - util: avoid deprecated builtin has_trivial_destructor |
| - spirv: avoid allocating memory twice |
| - hasvk: Switch to the common descriptor update template struct |
| - panvk: Implement VK_KHR_descriptor_update_template |
| - spirv: mark function unused |
| - llvmpipe: mark function unused |
| - llvmpipe: drop unused function |
| - panfrost: avoid warning about unused function |
| - vulkan: avoid warning about unused function |
| - vulkan: drop const for descriptor set |
| |
| Timothy Arceri (24): |
| |
| - Revert "nir: Preserve offsets in lower_io_to_scalar_early" |
| - glsl: dont lower precision for textureGatherOffsets |
| - glsl: make packed varying helper needs_lowering() external |
| - glsl: fix location for array subscript |
| - nir: update nir_is_supported_terminator_condition() |
| - nir: support loop unrolling with inot conditions |
| - glsl: remove GLSL IR inverse comparison optimisations |
| - nir/loop_analyze: delay instruction cost calculation |
| - nir/loop_analyze: remove cost of redundant selects |
| - util/conf: enable init to zero workaround for Exanima |
| - util/radeonsi: enable zerovram workaround for Exanima |
| - radv: add radv_zero_vram workarounds for OpenGL games |
| - glthread: fix matrix stack depth tracking |
| - mesa: add EXT_debug_label support |
| - glthread: leave dlist dispatch in place for Begin/End |
| - glsl/glsl_to_nir: remove unreachable code |
| - glsl: move rule inside lower_packing_builtins() |
| - glsl: drop sub to add neg lowering in GLSL IR |
| - glsl: always do {CARRY,BORROW}_TO_ARITH lowering |
| - glsl: move lower instructions logic inside that pass |
| - st/glsl_to_ir: remove stale comment |
| - st/glsl_to_ir: remove stale include |
| - mesa: fix typo from adding glGetObjectLabelEXT |
| - nir: fix typo in lower_double options handling |
| |
| Timur Kristóf (64): |
| |
| - ac/nir/cull: Move some code from cull_bbox into helper functions. |
| - ac/nir/cull: Move the contents of cull_bbox into ac_nir_cull_triangle. |
| - ac/nir/cull: Change if condition for bounding box culling. |
| - ac/nir/ngg: Move LDS store of accepted flag into the inner branch. |
| - ac/nir/cull: Make cull functions more consistent. |
| - aco: Optimize branching sequence during SSA elimination. |
| - aco: Remove branch instruction when exec is constant non-zero. |
| - ac/nir/cull: Tweak phi for cull_small_primitive branch. |
| - ac/nir/cull: Fix typo in bounding box culling. |
| - aco: Add faster code path to store_lds for consecutive write mask. |
| - aco: Fix invalidated reference in branching sequence optimization. |
| - aco: Check for instructions that inhibit the branching sequence optimization. |
| - aco/optimizer_postRA: Don't try to optimize dead instructions. |
| - aco: Support s_cselect_b64 in SCC no-compare optimization. |
| - aco: Improve SCC nocompare optimization when SCC is clobbered. |
| - aco: Fix p_init_scratch for task shaders. |
| - vulkan, spirv: Update to Vulkan 1.3.226 and latest SPIR-V headers. |
| - spirv: Support EXT_mesh_shader indices and mark them per-primitive. |
| - spirv: Add mesh_shading capability for EXT_mesh_shader. |
| - spirv: Support EXT_mesh_shader mesh/task stages. |
| - spirv: Support EXT_mesh_shader SetMeshOutputsEXT. |
| - spirv: Support the CullPrimitiveEXT mesh shader built-in. |
| - spirv: Support TaskPayloadWorkgroupEXT storage class. |
| - spirv, nir: Handle EmitMeshTasksEXT opcode. |
| - ac/nir/ngg: Add EXT_mesh_shader primitive indices. |
| - ac/nir/ngg: Add EXT_mesh_shader CullPrimitiveEXT output. |
| - ac/nir/ngg: Add EXT_mesh_shader vertex/primitive count. |
| - radv: Implement EXT_mesh_shader draw calls. |
| - radv: Enable EXT_mesh_shader on RDNA2 with RADV_PERFTEST=ext_ms |
| - nir/lower_system_values: Add shortcut for 1D workgroups. |
| - nir/gather_info: Clear cross-invocation output mask. |
| - radv: Replace NV_mesh_shader macros with EXT. |
| - radv: Enable subgroup feature bits for mesh and task shaders. |
| - radv: Change max preferred task workgroup invocations to 64. |
| - radv: Remove dead shader temps after linking. |
| - ac/nir/ngg: Don't use LDS for same-invocation indices and cull outputs. |
| - ac/nir/ngg: Don't initialize same-invocation mesh shader outputs. |
| - aco/tests: Add post-RA optimizer testcase for partially overwritten VCC. |
| - aco/tests: Add post-RA DPP test cases with control flow. |
| - aco/tests: Add post-RA SCC no-compare tests cases with control flow. |
| - aco/optimizer_postRA: Mark a register overwritten when predecessors disagree. |
| - aco/optimizer_postRA: Don't assume all operand registers were written by same instr. |
| - aco/optimizer_postRA: Fix logical control flow handling. |
| - aco/optimizer_postRA: Clarify terminology. |
| - aco: Change inverse-comparison optimization to work with s_not |
| - spirv: Treat EmitMeshTasksEXT as a terminating instruction. |
| - radv: Use a fallback for marketing name when libdrm doesn't know it. |
| - ac/nir/ngg: Fix cross-invocation indices and cull outputs. |
| - aco: Fix build error with std::max on GCC 12 |
| - nir: Document the flatten/dont_flatten selection control options. |
| - nir: Add selection control enum for always taken divergent branches. |
| - aco: Allow explicitly removing jumps on GFX10+ when beneficial. |
| - ac/nir/ngg: Remove jumps from some branches where we know LGKMCNT==0. |
| - ac/nir/tess: Remove jump from tess factor writes. |
| - nir, ac, aco: Add ACCESS intrinsic index to load/store_buffer_amd. |
| - aco: Cleanup load_vmem_mubuf and store_vmem_mubuf functions. |
| - nir, ac, aco: Add index src to load_buffer_amd/store_buffer_amd. |
| - aco: Optimize MUBUF 0 offset when idxen is also being used. |
| - aco/optimizer_postRA: Use unique_ptr + array for instruction indices. |
| - aco/optimizer_postRA: Speed up reset_block() with predecessors. |
| - aco/optimizer_postRA: Properly handle vccz/execz/scc in reset_block. |
| - aco/optimizer_postRA: Delete dead instructions more efficiently. |
| - aco: Move is_dead to aco_ir.h to allow it to get inlined. |
| - aco: Add ACO_DEBUG=novalidateir option. |
| |
| Tomeu Vizoso (16): |
| |
| - radeonsi/ci: Move libva and piglit jobs to Raven Ridge boards |
| - radv/ci: Put one more board to run the CTS on Stoney Ridge |
| - iris/ci: Test on Jasper Lake Chromebooks |
| - iris/ci: Set FDO_CI_CONCURRENT for all Chromebook jobs |
| - ci: Report failures to GitLab for LAVA jobs |
| - ci: Make sure that the Crosvm control socket has been removed |
| - Revert "ci: set venus on lavapipe to manual due to flakes" |
| - ci: Crosvm won't remove the control socket file on stop |
| - Revert "Revert "Revert "ci: set venus on lavapipe to manual due to flakes""" |
| - ci: Uprev ci-templates |
| - ci: Use --no-install-recommends to avoid problems with --no-remove |
| - ci: Install sysvinit-core without --no-remove |
| - ci: Stop explicitly passing env vars to FDO_DISTRIBUTION_EXEC command |
| - gallium: Add PIPE_CAP_QUERY_TIMESTAMP_BITS |
| - ci: Run RadeonSI jobs on changes to src/amd/ci |
| - radeonsi,radv/ci: Increase coverage |
| |
| Victor Hermann Chiletto (1): |
| |
| - radv: refactor descriptor set layout tracking |
| |
| Vinson Lee (10): |
| |
| - bin/perf-annotate-jit.py: Update to Python 3. |
| - zink: Remove duplicate variable zero. |
| - pan/bi: Fix memory leaks. |
| - egl/dri2: Fix missing return with dri2_egl_error_unlock. |
| - radv: Use count_tes_user_sgprs return value. |
| - d3d12: Initialize d3d12_video_encoder_references_manager_hevc::m_current_frame_idx. |
| - radv: Fix file descriptor leak. |
| - zink: Remove duplicate variable zm. |
| - r600/sfn: Initialize TestInstrFromString member m_instr_factory. |
| - v3d: Fix initializer-overrides warning. |
| |
| Väinö Mäkelä (1): |
| |
| - hasvk: Correctly set NonPerspectiveBarycentricEnable on gfx7 |
| |
| Xin LI (1): |
| |
| - meson: Improve detection of qsort_r(). |
| |
| Yiwei Zhang (52): |
| |
| - venus: increase busy wait order in vn_relax |
| - venus: double the abort timeout to allow long shader compiles |
| - venus: fix external memory ext filtering |
| - venus: require necessary extensions for common wsi support |
| - venus: avoid feedback for external fence |
| - venus: use a separate sync fence for Android wsi |
| - venus: lazily create queue wait fence and make it non-external |
| - venus: avoid pre-allocating the feedback pool |
| - venus: sync to latest venus protocol headers for syncFdFencing |
| - venus: stub out renderer sync_fd fencing features |
| - venus: put android wsi on the sub-optimal path |
| - venus: fix vn_GetFenceFdKHR |
| - venus: fix vn_GetSemaphoreFdKHR |
| - venus: query renderer sync_fd props to fill the feature stubs |
| - venus: re-implement sync_fd external fence |
| - venus: re-implement sync_fd external sempahore |
| - venus: avoid scrubing wsi/external sempahores |
| - venus: allow no external memory handle when renderers lacks support |
| - venus/ci: mark a test as flake |
| - venus: some clang format fixes |
| - venus: enable zink required extensions on Android |
| - venus: enable KHR_driver_properties on Android |
| - venus: double the abort timeout |
| - loader: use os_get_option for driver override |
| - zink: fix core support on Android |
| - zink: fix zink_create_fence_fd to properly import |
| - zink: fix in-fence lifecycle |
| - zink: implement fence_get_fd required by EGL android platform |
| - venus: clean up vn_QueueSubmit |
| - venus: force synchronous submission for external signal semaphore |
| - venus: fix an oob in descriptor pool state restoration |
| - venus: vn_ResetDescriptorPool to reset mutable type states |
| - venus: zero out the pipeline handles |
| - venus: avoid fixing pipeline if not derivative |
| - venus: ignore pInheritanceInfo if not secondary command buffer |
| - venus: ensure to clear VkPipelineCreationFeedback::flags if needed |
| - mesa/st: emit string marker based on PIPE_CAP_STRING_MARKER |
| - meson: libelf is optional |
| - venus/ci: remove redundant crosvm patch |
| - venus: move vendor ext to bottom and sort the ext list |
| - docs: update venus extension support |
| - docs: move VK_EXT_extended_dynamic_state3 out from 1.3 |
| - ci/venus: enable render server for venus testing |
| - venus: avoid accessing local var in VN_ADD_EXT_TO_PNEXT_OF |
| - venus: remove redundant codes |
| - venus: handle VkAndroidHardwareBufferFormatProperties2ANDROID |
| - venus: sync to latest venus protocol headers |
| - venus: add VK_EXT_primitives_generated_query support |
| - venus: add VK_EXT_depth_clip_control support |
| - docs: update to latest venus driver support |
| - venus: handle VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT |
| - venus: fix android wsi with global fencing disabled |
| |
| Yonggang Luo (218): |
| |
| - meson: Place all pre_args about HAVE_DRI* into single place |
| - meson: Cleanup usage dri_drivers as it's a dead option |
| - d3d12: Fixes compile error with mingw/gcc-x64 when static linkage to runtime library |
| - meson: clc compiler test only works on win32 |
| - microsoft/clc: Fixes compiling errors with clang/mingw64 in clc/clc_compiler_test.cpp |
| - util: open_memstream also support on MacOS |
| - c11: Fixes the declaration of timespec_get in time.h for macOS |
| - egl: Remove eglextchromium.h and import eglext_angle.h |
| - egl: wire up ANGLE_sync_control_rate |
| - egl/x11: Split declaration of dri2_format_for_depth into header platform_x11.h |
| - egl/x11: implement ANGLE_sync_control_rate |
| - ci/windows: Setup proper MESA_IMAGE_TAG for Windows build/test image |
| - c11: It's should detect PTHREAD_DESTRUCTOR_ITERATIONS when define TSS_DTOR_ITERATIONS |
| - c11: call impl_tss_dtor_invoke with tls callback for win32 |
| - c11: Implement and use of thrd_current properly on win32 |
| - util: Implement util_thread_get_time_nano on win32 |
| - util: Fixes typo in function os_get_total_physical_memory |
| - util: Implement os_get_available_system_memory on Win32 |
| - util: Use \`DETECT_OS_WINDOWS` instead \`WIN32` in u_process.c |
| - util: Enable tests that disabled for msvc/mingw |
| - zink: Use DETECT_OS_WINDOWS instead of 'WIN32' |
| - meson: Enable graw tests on mingw |
| - util: Add os_create_anonymous_file support on win32 |
| - nine: Fixes compile of nine_pipe.h with clang ../../src/gallium/frontends/nine/nine_pipe.h:377:9: error: unannotated fall-through between switch labels [-Werror,-Wimplicit-fallthrough] default: ^ ../../src/gallium/frontends/nine/nine_pipe.h:377:9: note: insert 'break;' to avoid fall-through default: |
| - panfrost: Do no use designated initializer for union |
| - intel: Fixes compile error of aubinator_viewer.cpp with gcc |
| - util: Fixes memory leak in create_cache_item_header_and_blob |
| - util: Fixes memory leak in u_printf_test.cpp |
| - util: Fixes invalid assumption that return non null by function util_format_fetch_rgba_func |
| - util: Disable usage of __attribute__((__const__)) when the compiler is clang |
| - ci: Improve MacOS github ci |
| - ci: Enable llvmpipe on MacOS |
| - c11: Remove mtx_try from mtx enums as it's not a part of c11 standard |
| - c11: #include <threads.h> when the os/platform provide it |
| - anv: Fixes struct anv_device::info is not initialized with struct anv_physical_device |
| - meson: Use different STRACEDIR folder for intel_devinfo_override_test |
| - meson: add enable-glcpp-tests option |
| - ci: disable glcpp tests for now |
| - Revert "ci: disable unit tests" |
| - util: Use literal instead call to list_inithead in u_queue.c |
| - mapi: Remove unused ENTRY_CURRENT_TABLE_GET |
| - mapi: Remove usage of _glapi_check_multithread and _glapi_destroy_multithread |
| - mapi: Remove reference to unused _gl_DispatchTSD |
| - mapi: Move shared _glapi_set_context and _glapi_set_dispatch into u_current.c |
| - mapi: Move shared stub into u_current.c |
| - mapi: Access _glapi_tls_Dispatch directly without need of macro |
| - docs: Remove document about USE_ELF_TLS |
| - mapi: Improve comment about _glapi_tls_Dispatch and _glapi_tls_Context |
| - aco: Use unreachable instead assert(false) |
| - amd/compiler: Fixes warning [-Wunused-variable] in test_to_hw_instr.cpp |
| - amd/compiler: Fixes warning [-Wunused-variable] in test_optimizer_postRA.cpp |
| - nir: Fixes [-Wdeprecated-declarations] in serialize_tests.cpp |
| - util: Fix [-Wdeprecated-declarations] in mesa-sha1_test.cpp |
| - glsl: Fixes [-Wdeprecated-declarations] in list_iterators.cpp |
| - intel/compiler: Fixes [-Wdeprecated-declarations] in test_eu_compact.cpp |
| - intel/compiler: Fixes [-Wdeprecated-declarations] in test_eu_validate.cpp |
| - virgl: Fixes [-Wdeprecated-declarations] in virgl_staging_mgr_test.cpp |
| - osmesa: Fixes [-Wdeprecated-declarations] in test-render.cpp |
| - ci: Remove -Wno-error=deprecated-declarations cpp option from debian-clang |
| - util: Remove unused rtasm_cpu_has_sse2 |
| - util: Rename util_cpu_detect to _util_cpu_detect_local |
| - util: Improve has_sse has_sse2 has_sse3 has_sse4_1 has_avx disable handling |
| - util: Handling GALLIUM_NOSSE in u_cpu_detect.c |
| - util: Handling LP_FORCE_SSE2 in u_cpu_detect.c |
| - llvmpipe: Remove the hack for LLVM4 in lp_bld_init.c |
| - gallium: Fixes memory leak of translate_test.c |
| - util: Use environment variable GALLIUM_OVERRIDE_CPU_CAPS to remove usage of util_cpu_caps in translate_test.c |
| - util: Move detect_done out of struct util_cpu_caps_t |
| - util: Add api util_call_once_with_context |
| - util: Add extern "C" guard in simple_mtx.h |
| - util: Remove the need of _MTX_INITIALIZER_NP in simple_mtx.h |
| - meson/panfrost: Add dep_valgrind for libpanfrost_pixel_format to fixes the compiling error: |
| - llvmpipe: Fixes warning [-Wimplicit-fallthrough] in lp_bld_swizzle.c |
| - llvmpipe: Fixes warning [-Wimplicit-fallthrough] in lp_bld_sample_soa.c |
| - radeonsi: Fixes warning [-Wimplicit-fallthrough] in si_descriptors.c |
| - freedreno: Fixes warning [-Wimplicit-fallthrough] in freedreno_util.h |
| - freedreno: Fixes warning [-Wimplicit-fallthrough] in fd4_texture.h |
| - freedreno: Fixes warning [-Wimplicit-fallthrough] in fd2_texture.c |
| - freedreno: Fixes warning [-Wimplicit-fallthrough] in fd3_texture.c |
| - freedreno: Fixes warning [-Wimplicit-fallthrough] in fd5_texture.h |
| - freedreno: Fixes warning [-Wimplicit-fallthrough] in fd5_emit.c |
| - freedreno: Fixes warning [-Wimplicit-fallthrough] in fd6_emit.c |
| - vdpau: Fixes warning [-Wimplicit-fallthrough] in vdpau_private.h |
| - v3dv: Fixes warning [-Wimplicit-fallthrough] in v3dvx_pipeline.c |
| - v3dv: Remove redundant unreachable in v3dvx_pipeline.c |
| - st: Fixes warning [-Wimplicit-fallthrough] in st_texture.c |
| - llvmpipe: Remove llvm version guard LLVM_VERSION_MAJOR <=> 4 |
| - meson: bump the minimal required llvm version to 5.0.0 |
| - util: Refactoring check_cpu_caps_override out |
| - util: Add ssse3 and avx cpu caps override option |
| - util: Move the detect of max_vector_bits into u_cpu_detect.c |
| - util: add support for detecting avx512 vector bit size |
| - llvmpipe: Always using util_get_cpu_caps to get cpu caps for llvm on x86 |
| - llvmpipe: Enable avx512 support in llvm |
| - docs: document LP_NATIVE_VECTOR_WIDTH GALLIUM_OVERRIDE_CPU_CAPS GALLIUM_NOSSE LP_FORCE_SSE2 |
| - ci: EXTRA_OPTIONS -> EXTRA_OPTION for debian-clang |
| - ci: Enable GALLIUM_DUMP_CPU=true only in the clang job |
| - ci: Enable all possible meson build options for clang |
| - ci: Add debian-clang-release build pipeline |
| - ci/fedora: Re-enable vulkan-layers=device-select,overlay |
| - gallium/hud: Fixes compile error in hud/hud_context.c: |
| - util: Fixes comment about debug_get_option |
| - mesa: direct use of mtx_recursive is not c11 conformance |
| - radeonsi: direct use of mtx_recursive is not c11 conformance |
| - svga: direct use of mtx_recursive is not c11 conformance |
| - c11: Update the values of enum mtx_plain, mtx_recursive, mtx_timed to make sure mtx_recursive != mtx_plain | mtx_recursive |
| - drm-shim: drop gnu99 override |
| - clover: Fixes use of designated initializers requires in c++ that doesn't support by MSVC |
| - radv: Remove the redundant #include <gelf.h> and #include <libelf.h> in ac_binary.c |
| - aco: Convert to use u8 literal for Unicode character to fixes msvc warning |
| - util: Add util_call_once for optimize call to util_call_once_with_context out for hot path |
| - util: getting get_tracefile in u_trace.c to be thread safe |
| - util: Remove the macro DEBUG_GET_ONCE_FILE_OPTION |
| - radv/r600/clover: Getting libelf to be optional |
| - radv: Getting radeon_icd to be generated properly on win32 |
| - dzn: rename module_dir to icd_lib_path for consistence |
| - lavapipe/meson: rename module_dir to icd_lib_path and swap two lines for consistence |
| - Update 22.2.0.rst |
| - ci: debian-clang: drop -Wno-error=unused-function |
| - r600/sfn: Fixes 'numeric_limits' is not a member of 'std' error in sfn/sfn_instr.cpp |
| - aco: Fixes compiling error about char8_t with c++20 |
| - ci/docs: Stick to specific version of python packages |
| - util: Remove the include of windows.h when compiling with MSVC |
| - util: Implement atomic operations consistently across compilers and testing for it |
| - util/mesa/wide: Rename _SIMPLE_MTX_INITIALIZER_NP to SIMPLE_MTX_INITIALIZER |
| - util: Turn -DWINDOWS_NO_FUTEX to be pre_args |
| - ci/windows: Enable gles1 for msvc |
| - ci/windows: Remove -Dlibelf:warning_level=1 as libelf subproject are already removed |
| - ci/windows: Getting the default supported windows version to be 7 when using mingw |
| - gallium: delete graw |
| - nir: No need redefine snprintf anymore in nir.h |
| - d3d10ump: No need define snprintf anymore in InputAssembly.cpp |
| - util: Move the implementation of futex_wake and futex_wait from futex.h to futex.c |
| - docs: Remove graw related words |
| - docs: Remove rbug related documents |
| - nouveau: Remove XVMC_VL environment variable usage |
| - docs: Remove document about XvMC |
| - meson: add dep_valgrind dependency for 'dxil_compiler' |
| - util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_stack.c |
| - util: Call function mtx_init on exit_mutex to replace the usage of _MTX_INITIALIZER_NP in u_queue.c |
| - util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_symbol.c |
| - util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_stack_android.cpp |
| - util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_memory.c |
| - util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_M in u_debug_refcnt.c |
| - mapi: Use util_call_once to init exec_mem and mutex instead _MTX_INITIALIZER_NP |
| - mapi: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in stub.c |
| - gallium/auxiliary: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER |
| - egl/wgl: Use util_call_once_data to replace usage of mtx_t glFlushMutex |
| - egl/dri2: Use util_call_once_data to replace usage of mtx_t glFlushMutex |
| - mesa: Use util_call_once_data instead of mtx_t init_once_lock |
| - util: #include "c11/threads.h" when necessary in simple_mtx.h |
| - util: #include "util/macros.h" in u_cpu_detect.h for ATTRIBUTE_CONST |
| - c11: Remove the code for Windows XP |
| - meson/mesa/wide: define WIN32_LEAN_AND_MEAN with pre_args |
| - util: Remove the guard WIN32_WINNT >= 0x0600 in u_thread.h |
| - c11: Getting the macro guard of HAVE_PTHREAD in c11/threads.h to be consistence with util/u_uthread.h |
| - util: #include <windows.h> in disk_cache_os.c |
| - llvmpipe: #include <windows.h> in lp_rast.c |
| - util: Remove include of windows.h in memstream.h |
| - util: Remove include of windows.h in rwlock.h |
| - util: Move the function implementation from u_thread.h to u_thread.c |
| - glx: Guard usage of infer_zink && !explicit_zink in glxext.c |
| - glx: Trim trailing spaces in in glx/glxext.c |
| - glx: Remove redundant #ifndef GLX_USE_APPLEGL in glx/glxext.c |
| - glx: Adopt guard of glx_direct and glx_accel in #if defined(GLX_DIRECT_RENDERING) && !defined(GLX_USE_APPLEGL) |
| - glx: Adot #ifdef GLX_USE_WINDOWSGL in #if defined(GLX_DIRECT_RENDERING) && !defined(GLX_USE_APPLEGL) |
| - st: Remove st_api_type because ST_API_OPENVG is not an option anymore |
| - ci: libxvmc-dev is not a dependence anymore |
| - meson: upgrade zlib wrap |
| - ci: upgrade zlib to 1.2.13 in mesa_deps_build.ps1 |
| - ci: Add Get-Date in each package build in mesa_deps_build.ps1 |
| - ci: Move building libva before clone llvm-project in mesa_deps_build.ps1 |
| - ci: Install the clang-cl compiler within mesa_deps_vs2019.ps1 |
| - ci: Trigger building visual studio docker image |
| - meson: with_glx never assigned to 'gallium-xlib', so need add with_glx == 'xlib' for dep_glproto |
| - meson: with_glx never assigned to 'gallium-xlib', remove 'gallium-xlib' |
| - xlib: Remove macro guard with GLX_INDIRECT_RENDERING in xlib.c |
| - ci/macos: Building both dri and xlib version of glx |
| - hgl: Remove non-nused hgl_create_st_api |
| - st: move function st_visual_have_buffers into st_manager.c |
| - st: OpenVG are removed, remove st_api::profile_mask as it's constant |
| - st: OpenVG are removed, remove st_api::feature_mask as it's constant |
| - st: Remove st_api::destroy |
| - st: Turn st_api::destroy_drawable to be direct accessed function st_api_destroy_drawable |
| - st: Turn st_api::get_current to be direct accessed function st_api_get_current |
| - st: Turn st_api::make_current to be direct accessed function st_api_make_current |
| - st: Turn st_api::create_context to be direct accessed function st_api_create_context |
| - st: Turn st_api::query_versions to be direct accessed function st_api_query_versions |
| - xlib: Implement xmesa_get_name by return literal "Mesa " PACKAGE_VERSION directly |
| - st: Remove struct st_api, st_gl_api_create and st_gl_api.h |
| - aco: Do not define NOMINMAX as it's already defined in pre_args now |
| - util: Remove non used UTIL_MAX_L3_CACHES in u_thread.h |
| - util: Merge duplicated function prototype in u_thread.h that named with util_barrier* |
| - util: Move UTIL_MAX_CPUS from u_thread.h to u_cpu_detect.h |
| - util: #include "c11/threads.h" instead of #include "util/u_cpu_detect.h" u_format.c |
| - xlib: Fixes compiling on linux |
| - xlib: Remove the linkage hack in lib.c by use meson link_whole option |
| - gallium: Remove mgl* prefix in linker script osmesa.sym and libgl-xlib.sym |
| - xlib: Indent with space and trim trailing spaces of xlib files |
| - ci/macOS: Getting the installed binary to be artifacts |
| - ci: Getting debian-clang-release to building with glx=xlib |
| - xlib: Getting xmesa_strict_invalidate to be function |
| - xlib: Remove the usage of global constructor in xlib.c, so the xm_public.h are removed |
| - docs: -Ddri-drivers= option is deprecated, do not use it anymore |
| - docs: There is only glx=xlib option now |
| - docs: There is no more swrast driver |
| - android: -Ddri-drivers= is not needed anymore |
| - glx: Fixes apple/apple_visual.c that include non-exist util/debug.h |
| - ci: Fixes macos.yml |
| - radv: Fixes prototypes |
| - meson: Refactoring shared gen_vs_module_defs_normal_command out |
| - meson: fixes mingw-clang32 building |
| - clover: Rename \*OpenCL.def to \*OpenCL.def.in |
| - clover: Fixes building with mingw-x86 |
| - meson: Fixes name_prefix for clover on mingw |
| - clover: empty soversion when on win32 |
| - util: Test __PPC64__ for getting PIPE_ARCH_PPC_64 respond to __PPC64__ take effect |
| - gallivm: use valid LLVM_VERSION_MAJOR instead of LLVM_VERSION |
| |
| Yusuf Khan (7): |
| |
| - nouveau: implement and enable PIPE_CAP_QUERY_MEMORY_INFO |
| - tu: remove tu_syncobj_to_fd |
| - nouveau: get timestamp from os_time_get_nano instead of os_time_get |
| - nouveau: put nv04 push macros in nouveau_winsys |
| - nv50/ir: nir_op_b2i8 and nir_op_b2i16 |
| - gallium/util: add a helper for get_timestamp |
| - nouveau: enable PIPE_CAP_UMA when appropriate |
| |
| jenneron (1): |
| |
| - docs/panfrost: Document Mali-T620 support |
| |
| noasakurajin (1): |
| |
| - disable zinks shader cache when the needed functions do not exist |
| |
| nyanmisaka (2): |
| |
| - gallium: Do not include offscreen winsys if glx is disabled |
| - radeonsi: Inherit the old modifier when reallocating the texture. |
| |
| semjon00 (4): |
| |
| - hasvk: don't export flush_compute_state |
| - hasvk: don't export gfx state flushing helper |
| - hasvk: combine flushes in Draw/DrawIndexed/DrawIndirectByteCountEXT |
| - hasvk: force inline more pipe flush functions |
| |
| sjfricke (2): |
| |
| - isl: fix bug where sb.MOCS is not being set |
| - anv: fix assert to build with shader cache disabled |