| Mesa 23.3.0 Release Notes / 2023-11-29 |
| ====================================== |
| |
| Mesa 23.3.0 is a new development release. People who are concerned |
| with stability and reliability should stick with a previous release or |
| wait for Mesa 23.3.1. |
| |
| Mesa 23.3.0 implements the OpenGL 4.6 API, but the version reported by |
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / |
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. |
| Some drivers don't support all the features required in OpenGL 4.6. OpenGL |
| 4.6 is **only** available if requested at context creation. |
| Compatibility contexts may report a lower version depending on each driver. |
| |
| Mesa 23.3.0 implements the Vulkan 1.3 API, but the version reported by |
| the apiVersion property of the VkPhysicalDeviceProperties struct |
| depends on the particular driver being used. |
| |
| SHA256 checksum |
| --------------- |
| |
| :: |
| |
| 50f729dd60ed6335b989095baad81ef5edf7cfdd4b4b48b9b955917cb07d69c5 mesa-23.3.0.tar.xz |
| |
| |
| New drivers |
| ----------- |
| - NVK: A Vulkan driver for Nvidia hardware |
| |
| New features |
| ------------ |
| - VK_EXT_pipeline_robustness on ANV |
| - VK_KHR_maintenance5 on RADV |
| - OpenGL ES 3.1 on Asahi |
| - GL_ARB_compute_shader on Asahi |
| - GL_ARB_shader_atomic_counters on Asahi |
| - GL_ARB_shader_image_load_store on Asahi |
| - GL_ARB_shader_image_size on Asahi |
| - GL_ARB_shader_storage_buffer_object on Asahi |
| - GL_ARB_sample_shading on Asahi |
| - GL_OES_sample_variables on Asahi |
| - GL_OES_shader_multisample_interpolation on Asahi |
| - GL_OES_gpu_shader5 on Asahi |
| - EGL_ANDROID_blob_cache works when disk caching is disabled |
| - VK_KHR_cooperative_matrix on RADV/GFX11+ |
| |
| |
| Bug fixes |
| --------- |
| |
| - crash in si_update_tess_io_layout_state during _mesa_ReadPixels (radeonsi_dri, mesa 23.2.1) |
| - mesa: vertex attrib regression |
| - [RADV] War Thunder has some grass flickering. |
| - radv: satisfactory broken shader |
| - RADV problem with R7 M440 in some games |
| - gpu driver crashes when opening ingame map playing dead space 2023 |
| - [anv] Valheim water misrendering |
| - EGL/v3d: EGL applications under a X compositor doesn't work |
| - RADV: trunc_coord breaks ambient occlusion in Dirt Rally and other games |
| - radv: Mass Effect Legendary Edition: a line going across the screen is visible in some areas with Ambient Occlusion enabled |
| - anv: DIRT5 gfx11_generated_draws_spv_source triggers "assert(!copy_value_is_divergent(src) || copy_value_is_divergent(dest));" |
| - panfrost: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12 buffers |
| - [RADV][TONGA] - BeamNG.drive (284160) - Artifacts are present when looking at the skybox. |
| - LEGO Star Wars: The Skywalker Saga graphical glitches (DXVK) on R9 380 |
| - [radv] Crypt not rendering properly |
| - Leaks of DescriptorSet debug names |
| - [Tracing flake] Missing geometry in trace\@freedreno-a630\@freedoom\@freedoom-phase2-gl-high.trace |
| - Unreal Engine 5.2 virtual shadow maps have glitchy/lazy tile updates |
| - RADV: Visual glitches in Unreal Engine 5.2.1 when using material with anisotropy and light channel 2 |
| - radv: Regression with UE5 test |
| - SIGSEGV with MESA_VK_TRACE=rgp and compute only queue |
| - [ANV] Corruptions in Battlefield 4 |
| - anv regression w/ commit e488773b29d97 ("anv: Fast clear depth/stencil surface in vkCmdClearAttachments") |
| - ir3: dEQP-GLES31.functional.synchronization.inter_invocation.image_atomic_read_write crash on a6xx gen4 |
| - Zink + Venus: driver can't handle INVALID<->LINEAR! |
| - Anv: Particles have black square artifacts on Counter Strike 2 on Skylake |
| - Lords of the Fallen 2023 Red Eye mode crashing game and desktop |
| - [radeonsi] [vulkan] [23.3-rc1 regression] Video output corrupted in QMplay2 with Vulkan renderer |
| - [BISECTED] ac/radeon commit somehow breaks nv12 surface from HEVC decode |
| - Parsec displays completely green screen with hardware decoder selected while using Mesa 23.3 and Mesa 24 |
| - H264 to H264 transcode output corruption with gst-vaapi |
| - opencl-jpeg-encoder does not work with nouveau/rusticl, works with nouveau/clover |
| - [R600] X-plane 11 demo (Linux Native) crashes upon launch on HD5870 and HD6970 |
| - Ubuntu 23.10 build error with rusticl_opencl_bindings.rs |
| - Rusticl fails to build |
| - ANV not handling VkMutableDescriptorTypeCreateInfoEXT::pMutableDescriptorTypeLists[i] being out of range |
| - tu: Wolfenstein: The New Order misrenders on a740 |
| - DRI_PRIME fails with ACO only radeonsi |
| - nir_to_tgsi: Incorrect handling of indirect array access |
| - ANV gen9 32 bit vulkan asserts on many cts tests |
| - GPU hang observed while launching 3DMark Wildlife Unlimited on MTL |
| - ac/gpu_info: Query maximum submitted IBs from the kernel |
| - RADV: regression in 23.2.1 causing GPU hang with RDNA1 in various UE5 games |
| - GPU page faults reported while playing Talos Principle 2 (demo) |
| - No CCS_E scanout on tgl+ with ANV |
| - anv: Modifier tests assert-fail on TGL+ |
| - ci: zink-tu jobs no longer included in manual pipelines |
| - [ANV][A770] GravityMark segfaults and buffer allocation errors |
| - etnaviv: gc2000 gles2 regression |
| - ci_run_n_monitor: pipeline finding unreliable |
| - nvk: Implement VK_EXT_dynamic_rendering_unused_attachments |
| - anv: jsl timeline semaphores flaky |
| - anv: OOB access in vkDestroyDevice? |
| - nvk: Implement VK_EXT_primitive_topology_list_restart |
| - nvk: Implement VK_EXT_image_sliced_view_of_3d |
| - nvk: Implement VK_KHR_workgroup_memory_explicit_layout |
| - util/macros: BITFIELD64_RANGE raises an error with mesa-clang if we try to set last bit |
| - r300/r400 regression; can't compile \`if/then` in shaders |
| - iris: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12/P010 buffers |
| - nvk: Implement VK_EXT_depth_bias_control |
| - ICL/zink: gpu hang on 'piglit.object namespace pollution.framebuffer with gldrawpixels' |
| - [R600] Wolfenstein: The New Order text glitch on menu |
| - need extension to request image/texture not use data dependent compression |
| - rusticl: segfault in clCreateKernel on AMD Instinct MI100 |
| - !25587 broke xserver |
| - GPU Hang in Deep Rock Galactic on DG2 |
| - intel: Wrong length for 3DSTATE_3D_MODE on gfx125 |
| - [radeonsi] Wargame: Red Dragon /w OpenGL stopped working with ACO |
| - traces job reference images missing again sometimes |
| - Vulkan Texture/Polygon Glitches in Games |
| - freedreno: dmabuf modify query ignores format |
| - virgl: removing PIPE_CAP_CLEAR_TEXTURE completely breaks virglrenderer |
| - Turnip build error on termux |
| - failiure in amd llvm helper |
| - failiure in amd llvm helper |
| - radv_amdgpu_cs_submit: Assertion \`chunk_data[request->number_of_ibs - 1].ib_data.ip_type == request->ip_type' failed. |
| - hasvk: subgroups regression |
| - radeonsi: broken hardware decoding (vaapi/vulkan) on RDNA2 gpu (bisected) |
| - aco: SwizzleInvocationsMaskedAMD behavior is not correct for reads from inactive lanes |
| - anv: dEQP-VK.ssbo.phys.layout.random.16bit.scalar.13 slow |
| - [RDNA3] CS:GO - excessive power consumption and lower performance in Vulkan while MSAA is set to 4x or 8x |
| - [ICL] piglit.spec.arb_gl_spirv.execution.ssbo.unsized-array regression |
| - radv: Counter Strike 2 has multiple bugs while rendering smoke grenade effect |
| - Doom Eternal freezing on NAVI31 with current git |
| - iris CTS blend test fail with MSAA config on DG2 |
| - anv: 32bit mesa asserts |
| - RADV: Randomly dissapearing objects in Starfield with RX 5xx and Vega graphics |
| - anv: missing barrier handling on video engines |
| - radv: Star Wars The Old Republic hang when DCC is enabled |
| - radv: Resident Evil 6 hangs 7900XTX GPU when DCC is enabled if in Options go to Display settings |
| - radv: Resident Evil 6 Benchmark Tool hangs 7900 XTX GPU when DCC is enabled immediately after splash screen |
| - ANV: fp64 shader leaked |
| - v3d: noop drm-shim raises some warnings |
| - freedreno: crashdec/etc chip_id support |
| - intel: compute dispatches with variable workgroup size have ralloc_asprintf CPU overhead |
| - ci build issues with builtin types |
| - freedreno: running angle perf traces with GALLIUM_THREAD=0 crashes |
| - RadeonSI: glClear() causes clear texture for some frames on RX580 |
| - radeonsi: corruption when seeking video decoded with vaapi in mpv |
| - Zink/HasVK regression bisected to "gallium: move vertex stride to CSO" |
| - [radv] [Path Of Exile] - one setting in the workaround file breaks shadows/lighting rendering. Other workaround settings seems obsolete. |
| - radv: images don't always have extents in RGP |
| - shader_test causing a crash in compiler |
| - D3D12: Video decoding requirements are too restrictive. ID3D12VideoDevice3 should not be required. |
| - Crash in st_ReadPixels |
| - [regression] intel build issue on i386 |
| - [ANV] [DG2/A770] The Spirit and The Mouse, miscellaneous issues with Mesa Git |
| - zink on hasvk regression: Assertion \`(dyn)->vi_binding_strides[first_binding + i] == (strides[i])' failed. |
| - Penumbra: Overture hangs on new game loading screen |
| - [r300, RV516] Some deqp-gles2\@performance\@shader\@control_statement vertex tests cause hard lockup & reboot in mesa 22.3.1 (regression over 22.1.7) on a Radeon X1550 |
| - v3dv: Add a feature that implicitly copies the linear image to the tiled image prior to sampling from it |
| - radv: Regression from 266b2cfe5bf3feda16747c50c1638fb5a0426958 |
| - h264 encoding picture showed randomly repeated frames. |
| - Mesa CI: NAVI10 hangs when running VKCTS on Linux 6.1 |
| - zink: no uniform buffer objects support for v3dv? |
| - v3dv: Request for VkImageDrmFormatModifierExplicitCreateInfoEXT::pPlaneLayouts support |
| - [ANV] [DG2/A770] The Spirit and The Mouse, occasional flickering geometry |
| - [Google][Rex][anv] GLES dEQP test fails in anv when run via ANGLE-on-Venus on ChromeOS ARCVM. |
| - VAAPI on VCN: bad stream may crash whole gfx system |
| - Crash after GPU reset |
| - Bifrost PanVK should not be in CI |
| - [Intel][Vulkan][Gen12] vkCmdCopyImage() generates garbage data when the destination texture is bound to a piece of used device memory |
| - mesa: new glcts fails |
| - tu: GPL support is broken |
| - lavapipe: ycbcr regression |
| - aco: Assertion when compiling CP2077 shader |
| - anv: flakiness on tgl+ with samplemask handling |
| - [RADV] Dead by Daylight memory leak (shader-related?) on 23.1.6 |
| - r300: optionally convert MULs into output modifier for the following MUL or DOT instructions |
| - r300: better 1-x presubtract pattern matching |
| - gpu hang on DG2 when running KHR-GLES31.core.texture_cube_map_array.image_op_tess* |
| - KHR-GLES31.core.texture_cube_map_array.image_op_tessellation_evaluation_sh fail on GFX12+ |
| - wsi: deadlocks when DISPLAY is changed |
| - hasvk: Incompatible with minigbm/gralloc4 on Android |
| - VAAPI: AMDGPU crash on RX 6900 XT on corrupted video |
| - lavapipe/llvmpipe: shader unregister crash |
| - [ANV] [DG2/A380] Corruption in Borderlands 3 |
| - blorp regression on dg2 |
| - decouple -Dshader-cache= from EGL_ANDROID_blob_cache |
| - radv: commit 81641b01555faa4dd1dfc7de2513ad8d63e77ab7 leaded to artifacts in Quake II RTX |
| - [radv] Colors are distorted in Cyberpunk 2077 with ray tracing enabled |
| - Forza Horizon 5 stuttering since mesa 23.1.4 / 9b008673 revert as a FIX |
| - ubsan + gtest build fails |
| - glCopyTexSubImage2D is very slow on Intel |
| - NVE4 (GeForce 710) fails to get vdpau in mesa git |
| - [RADV] red and pink tinted shadows in Overwatch 2 on 7900 XTX |
| - nouveau prevents hardware acceleration with Chromium (Wayland) |
| - Corrupt text rendering in Blender |
| - DRI2 gallium frontend is using bad format type |
| - regression - MR 23089 - Hellblade RT crashing |
| - Incorrect vlVaCreateBuffer/vlVaMapBuffer behavior for buffer type VAEncCodedBufferType in Gallium |
| - Issue with clang-format |
| - Follow-up from "Draft: intel: Disable color fast-clears for blorp_copy" |
| - nightly VA-API build: new timeout |
| - r600: retire the SB optimizer |
| - ci: do not download perfetto on-fly in build jobs |
| - Shared Memory Leak With Qt OpenGL Applications |
| - OpenGL, SIGSEGV when program pipeline objects has separated vertex shader progam and separated fragment shader progam with in/out |
| - vaDeriveImage returns VA_STATUS_ERROR_OPERATION_FAILED |
| - 975a8ecc881873744d851ab0ef45ad7698eaa0ef "frontends/va: use resources instead of views" cause radeonsi can't play video. |
| - zink: reduce pipeline hash size |
| - Rusticl,radeonsi: ac_rtld error(2): too much LDS |
| - aco, radv Rage 2 menu corruption - bisected |
| - radv, aco: World War Z character texture regression on 7900xtx |
| - android: De-stage drm_gralloc support from mesa3d |
| - Cyberpunk screen goes black at game launch on integrated Gfx |
| - lavapipe/llvmpipe: regressions since descriptor rewrite |
| - intel: State cache invalidation after BLORP binding table setup ought to be unnecessary on ICL. |
| - ci: HW job logs have spam at the end |
| - kernel crash seen on AMD Raven device |
| - crocus: regression crashing in doubles/ubo tests |
| - turnip: object management CTS crashes |
| - a618: multiple assertions with different kernel config on u_vector_add |
| - [anv] Death Stranding crashes |
| - Can no longer build Clover without llvmspirvlib |
| - [radeonsi][vaapi] segfault in vl_video_buffer_sampler_view_components() when using vaapisink receiving I420 format |
| - Baldurs Gate 3 (DX11) - Graphical corruption on RDNA3 (ACO regression) |
| - [AMDGPU] Compiling large Blender Eevee shader node trees is unusably slow |
| - Building llvmpipe with LP_USE_TEXTURE_CACHE set fails since 23.2.0-rc1: error C2039: dynamic_state is not member of lp_build_sampler_soa in lp_tex_sample.c |
| - r300: calculate some cycles estimate for shader-db |
| - intel: Deathloop and other DX12 games fail assert(validated) with invalid SEL instruction |
| - GTF-GL46.gtf21.GL.build.CorrectFull_vert regressed on intel platforms |
| - error message when encoding via VAAPI AMD |
| - gpu hangs on dg2 with mesh shading enabled on vkcts |
| - radeonsi: Deadlock when creating a new GL context in parallel with linking a shader on another GL context |
| - robustness2 raygen tests intermittently fail in Intel Mesa CI |
| - ci/ci_run_n_monitor.py: KeyError: 'clang-format' |
| - glthread: huge performance regression |
| - DirectX games do not launch on Intel HD Graphics 4000 (IVB GT2) [bisected] |
| - rusticl: fails to build for iris + radeonsi |
| |
| |
| Changes |
| ------- |
| |
| Adam Jackson (3): |
| |
| - egl: Implement EGL_EXT_explicit_device |
| - mesa: Implement and advertise GL_MESA_sampler_objects |
| - docs: Mention 'meson devenv' in the pre-install test instructions |
| |
| Aditya Swarup (6): |
| |
| - isl: enable Tile64 for 3D images |
| - intel/isl: Unittest for linear to Ytile conversion |
| - intel/isl: Convert linear texture to Tile4 format |
| - intel/isl: Convert Tile4 texture to linear format |
| - intel/isl: Linear to Tile-4 conversion unittest |
| - Revert "iris: Disable tiled memcpy for Tile4" |
| |
| Alba Mendez (1): |
| |
| - meson: support installation tags |
| |
| Alejandro Piñeiro (61): |
| |
| - v3dv: re-enable sync_fd import/export on the simulator |
| - broadcom(cle,clif,common,simulator): add 7.1 version on the list of versions to build |
| - broadcom/cle: update the packet definitions for new generation v71 |
| - broadcom/common: add some common v71 helpers |
| - broadcom/qpu: add comments on waddr not used on V3D 7.x |
| - broadcom/qpu: set V3D 7.x names for some waddr aliasing |
| - broadcom/compiler: rename small_imm to small_imm_b |
| - broadcom/compiler: add small_imm a/c/d on v3d_qpu_sig |
| - broadcom/qpu: add v71 signal map |
| - broadcom/qpu: define v3d_qpu_input, use on v3d_qpu_alu_instr |
| - broadcom/qpu: add raddr on v3d_qpu_input |
| - broadcom/qpu: defining shift/mask for raddr_c/d |
| - broadcom/commmon: add has_accumulators field on v3d_device_info |
| - broadcom/qpu: add qpu_writes_rf0_implicitly helper |
| - broadcom/qpu: add pack/unpack support for v71 |
| - broadcom/compiler: phys index depends on hw version |
| - broadcom/compiler: don't favor/select accum registers for hw not supporting it |
| - broadcom/vir: implement is_no_op_mov for v71 |
| - broadcom/compiler: update vir_to_qpu::set_src for v71 |
| - broadcom/qpu_schedule: add process_raddr_deps |
| - broadcom/qpu: update disasm_raddr for v71 |
| - broadcom/qpu: return false on qpu_writes_accumulatorXX helpers for v71 |
| - broadcom/compiler: add support for varyings on nir to vir generation for v71 |
| - broadcom/compiler: payload_w is loaded on rf3 for v71 |
| - broadcom/qpu_schedule: update write deps for v71 |
| - broadcom/compiler: update register classes to not include accumulators on v71 |
| - broadcom/qpu: implement switch rules for fmin/fmax fadd/faddnf for v71 |
| - broadcom/compiler: update one TMUWT restriction for v71 |
| - broadcom/compiler: update ldunif/ldvary comment for v71 |
| - broadcom/compiler: update payload registers handling when computing live intervals |
| - broadcom/qpu: new packing/conversion v71 instructions |
| - v3dv/meson: add v71 hw generation |
| - v3dv: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71 |
| - v3dv/cmd_buffer: emit TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1 for v71 |
| - v3dvx/cmd_buffer: emit CLEAR_RENDER_TARGETS for v71 |
| - v3dv/cmd_buffer: emit CLIPPER_XY_SCALING for v71 |
| - v3dv/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71 |
| - v3dv/cmd_buffer: just don't fill up early-z fields for CFG_BITS for v71 |
| - v3dv: default vertex attribute values are gen dependant |
| - v3dv/pipeline: default vertex attributes values are not needed for v71 |
| - v3dv/pipeline: handle GL_SHADER_STATE_RECORD changed size on v71 |
| - v3dv: no specific separate_segments flag for V3D 7.1 |
| - v3dv: add support for TFU jobs in v71 |
| - v3d: add v71 hw generation |
| - v3d: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71 |
| - v3d: TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1 |
| - v3d: emit CLEAR_RENDER_TARGETS for v71 |
| - v3d: just don't fill up early-z fields for CFG_BITS for v71 |
| - v3d: emit CLIPPER_XY_SCALING for v71 |
| - v3d: no specific separate_segments flag for V3D 7.1 |
| - v3d: default vertex attributes values are not needed for v71 |
| - v3d/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71 |
| - v3d: handle new texture state transfer functions in v71 |
| - v3d: handle new TEXTURE_SHADER_STATE v71 YCbCr fields |
| - v3d: setup render pass color clears for any format bpp in v71 |
| - v3d: GFX-1461 does not affect V3D 7.x |
| - v3d: don't convert floating point border colors in v71 |
| - v3d: handle Z clipping in v71 |
| - v3d: add support for TFU blit in v71 |
| - v3dv: implement depthBounds support for v71 |
| - doc/features: update after last v3d changes |
| |
| Alex Denes (1): |
| |
| - virgl: link VA driver with build-id |
| |
| Alexander Orzechowski (1): |
| |
| - radeonsi: Set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET for auxiliary contexts |
| |
| Alyssa Rosenzweig (431): |
| |
| - zink: Switch to register intrinsics |
| - gallium/trace: Collect enums from multiple files |
| - gallium,util: Move blend enums to util/ |
| - gallium,util: Move util_blend_dst_alpha_to_one |
| - util/blend: Add helpers for normalizing inverts |
| - vulkan: Add helpers for blend enum translation |
| - lvp: Use common blend/logicop translation |
| - nir/lower_blend: Use util enums |
| - panfrost: Convert to PIPE_BLEND enums internally |
| - gallium: Remove pipe->compiler BLEND enum translation |
| - compiler: Remove blend enums duplicating util |
| - nir/legacy: Fix fneg(load_reg) case |
| - nir/legacy: Fix handling of fsat(fabs) |
| - ntt: Switch to new-style registers and modifiers |
| - ir3: Convert to register intrinsics |
| - nir: Add fence_{pbe,mem}_to_tex(_pixel)_agx intrinsics |
| - nir: Devendor load_sample_mask |
| - nir: Promote tess_coord_r600 to tess_coord_xy |
| - nir: Add nir_lower_tess_coord_z pass |
| - r600: Use nir_lower_tess_coord_xy |
| - ir3: Use nir_lower_tess_coord_z |
| - nir: Initialize workgroup_size in builder_init_simple_shader |
| - v3dv: Rely on nir_builder setting workgroup size |
| - radv: Rely on workgroup_size initialization |
| - panfrost: Fix transform feedback on v9 |
| - r600/sfn: Remove nir_register unit tests |
| - panfrost: Lower vertex_id for XFB |
| - panfrost: Fix transform feedback on v9 harder |
| - asahi: Augment fake drm_asahi_params_global |
| - asahi: Use nir_builder_at more |
| - asahi: Remove unused #define |
| - asahi: Refactor PBE upload routine |
| - asahi: Extract shader_initialize helper |
| - asahi: Serialize NIR in memory |
| - asahi: Identify background/EOT counts |
| - asahi,agx: Set coherency bit for clustered targets |
| - ail: Page-align layers for writable images |
| - asahi: Mark writeable images as such |
| - asahi: Reallocate to set the writeable image flag |
| - asahi: Add agx_batch_track_image helper |
| - asahi: Add texture/image indexing lowering pass |
| - asahi: Upload at most the max texture state registers |
| - asahi: Upload image descriptors |
| - asahi: Make clear the non-sRGBness of EOT images |
| - asahi: Don't restrict sampler views |
| - asahi: Forbid 2D Linear with images |
| - agx: Add try_coalesce_with helper |
| - agx: Try to allocate phis compatibly with sources |
| - agx: Try to allocate phi sources with phis |
| - agx: Try to allocate phi sources with loop phis |
| - agx: Vectorize 16-bit parallel copies |
| - agx: Reduce un/packs with mem access lowering |
| - agx: Fix bogus assert |
| - asahi: Augment PBE descriptor for software access |
| - asahi: Extend PBE packing for image support |
| - asahi: Use nir_lower_robust_access |
| - agx: Legalize image LODs to be 16-bit |
| - agx: Lower image size to txs |
| - agx: Generalize texture/PBE packing |
| - agx: Add image write instruction |
| - agx: Model texture bindless base |
| - agx: Handle bindless properly for txs lowering |
| - agx: Pack bindless textures |
| - agx: Translate texture bindless handles |
| - agx: Translate image_store from NIR |
| - agx: Handle frag side effects without render targets |
| - agx: Wait for outstanding stores before barriers |
| - agx: Implement image barriers |
| - agx: Handle early_fragment_tests |
| - agx: Add interleave opcode |
| - agx: Extract coords_for_buffer_texture helper |
| - agx: Extract texture_descriptor_ptr_for_* helpers |
| - agx: Lower image atomics |
| - agx: Lower buffer images |
| - asahi,agx: Fix txf sampler |
| - agx: Add image_load opcode |
| - agx: Extract texture write mask handling |
| - agx: Implement image_load |
| - agx: Emit global memory barriers for images |
| - agx: Don't emit silly barriers |
| - agx: Implement fence_*_to_tex_agx intrinsics |
| - agx: Add simple image fencing pass |
| - agx: Require tag writes with side effects |
| - agx: Plumb in coverage mask |
| - asahi: Extract sampler_view_for_surface |
| - asahi: Introduce concept of spilled render targets |
| - asahi: Add agx_tilebuffer_spills query |
| - asahi: Do not support masking with spilled RTs |
| - asahi: Ignore spilled render targets in EOT shaders |
| - asahi: Ignore spilled render targets with partial renders |
| - asahi: Extract some tilebuffer lowering code |
| - asahi: Lower tilebuffer access for spilled RTs |
| - asahi: Lower multisample image stores |
| - asahi: Permit meta shaders to use preambles |
| - asahi: Ignore spilled render targets for background load |
| - asahi: Offset clear colour uniform by 4 |
| - asahi: Execute preambles for background programs |
| - asahi: Advertise Z16_UNORM |
| - ir2: Switch to nir_legacy |
| - intel/fs: Don't read reg.base_offset |
| - panfrost: Remove unused helpers |
| - nir: Remove nir_lower_locals_to_regs |
| - nir: Rename lower_locals_to_reg_intrinsics back |
| - nir: Remove register arrays |
| - asahi: Don't depend on glibc to decode |
| - pan/bi: Remove leftover include |
| - nir/trivialize: Handle more RaW hazards |
| - panfrost: Disable blending for no-op logic ops |
| - nir/lower_blend: Fix 32-bit logicops |
| - nir/lower_blend: Optimize out PIPE_LOGICOP_NOOP |
| - clang-format: Ignore original panfrost commit |
| - nir/schedule: Assume no old-style registers |
| - gallium/u_simple_shaders: Optimize out ffloors |
| - gallium/u_transfer_helper: Remove dead forward decl |
| - nir/loop_analyze: Drop unused inverse_comparison |
| - nir/passthrough_gs: Drop unused array_size_for_prim |
| - panfrost: Add missing static inline annotation |
| - pan/decode: Drop unused debug function |
| - pan/mdg: Add missing static inline annotation |
| - panfrost: Drop unused decode_position for samples |
| - panfrost: Only define pan_blitter_get_blend_shaders for midgard |
| - panfrost: Add missing inline |
| - panfrost: Gate overdraw_alpha on Bifrost+ |
| - nir: Rename scoped_barrier -> barrier |
| - nir: Remove lower_to_source_mods |
| - nir: Remove lower_vec_to_movs |
| - nir: Remove reg_intrinsics parameter to convert_from_ssa |
| - nir: Remove register load/store builders |
| - r600/sfn: Stop referencing legacy functionality |
| - r600/sfn: Ignore instruction write masks |
| - nouveau/codegen: Drop writemask check |
| - vc4,broadcom/compiler: Drop write_mask handling |
| - zink: Collapse is_ssa check |
| - nir: Add {...} before case |
| - nir/from_ssa: Drop legacy reg support |
| - nir/schedule: Drop nir_schedule_dest_pressure |
| - nir: Drop NIR reg create/destroy |
| - nir: Remove nir_index_local_regs and callers |
| - nir/schedule: Drop more nir_register handling |
| - nir: Remove nir_foreach_register |
| - nir: remove nir_{src,dest}_for_reg |
| - ntt: Drop nir_register reference |
| - nir/print: Assume SSA |
| - nir/clone: Assume SSA |
| - nir/serialize: Drop legacy NIR |
| - nir/validate: Assume SSA |
| - nir: Remove impl->{registers,reg_alloc} |
| - nir: Remove nir_alu_dest::saturate |
| - treewide: Drop is_ssa asserts |
| - nir: Collapse some SSA checks |
| - treewide: Remove more is_ssa asserts |
| - nir: Remove reg-only dest manipulation |
| - nir: Remove stale todo |
| - nir/print: Drop legacy NIR |
| - nir: Drop nir_alu_src::{negate,abs} |
| - treewide: sed out more is_ssa |
| - pan/mdg: Assume SSA |
| - treewide: Drop some is_ssa if's |
| - nir: Drop trivial reg handling |
| - aco: Remove is_ssa check |
| - intel: Collapse is_ssa checks |
| - llvmpipe: Assume SSA |
| - ir3: Collapse is_ssa checks |
| - lima: Collapse is_ssa checks |
| - radeonsi: Collapse SSA check |
| - nir/gather_ssa_types: Collapse SSA checks |
| - nir/worklist: Assume SSA |
| - nir/range_analysis: Assume SSA |
| - treewide: Collapse more SSA checks |
| - nir/instr_set: Assume SSA |
| - nir: Collapse more SSA checks |
| - nir: Remove def_is_register |
| - nir: Do not init dests |
| - nir: Initialize source as a NULL SSA def |
| - nir: Collapse more SSA checks |
| - nir: Remove nir_{src,dest}::is_ssa |
| - nir: Drop nir_register |
| - nir/from_ssa: Remove pointless union |
| - ir3: Drop write_mask handling |
| - rogue: Stop reading write masks |
| - etnaviv: Don't use alu->dest.write_mask |
| - etnaviv: What if we just didn't have a compiler? |
| - intel/vec4: Don't use legacy write mask |
| - ntt: Evaluate write_mask check |
| - nir: Remove nir_alu_dest::write_mask |
| - nir: Remove nir_foreach_def |
| - lima: Clean up after deleting asserts |
| - nir: Remove no-op remove_def_cb |
| - nir: Drop no-op all_srcs_are_ssa |
| - nir: Simplify alu_instr_is_copy |
| - nir: Add load_coefficients_agx intrinsic |
| - agx: Implement nir_intrinsic_load_coefficients_agx |
| - agx: Allow more varying slots |
| - agx: Set lower_fisnormal |
| - agx: Forcibly vectorize pointcoord coeffs |
| - agx: Add interpolateAtOffset lowering pass |
| - agx: Lower flat shading in NIR |
| - asahi: Stub num_dies |
| - asahi: Move a bunch of helpers to common |
| - agx: Lower 8-bit ALU |
| - agx: Handle 8-bit vecs |
| - asahi,agx: Respect no16 even for I/O |
| - agx: Don't lower load_local_invocation_index |
| - agx/dce: Use the helper |
| - agx: Fix atomics with no destination |
| - agx: Fix shader info with sample mask writes |
| - agx: Do not move bindless handles |
| - agx: Put else instructions in the right block |
| - agx: Use unconditional else instruction |
| - agx: Optimize out pointless else instructions |
| - agx: Fix length bit confusion |
| - agx: Require an immediate for \`nest` |
| - agx: Use compressed fadd/fmul encodings |
| - agx: Optimize swaps of 2x16 channels |
| - agx: Optimize logical_end removal |
| - agx: Fix AGX_MESA_DEBUG=demand |
| - agx: Maintain ctx->max_reg while assigning regs |
| - agx: Allow 64-bit memory regs |
| - agx: Fix accounting for phis |
| - agx: Set phi sources in predecessors |
| - agx: Stop setting registers after the shader |
| - agx: Use agx_replace_src |
| - agx: Assert invariant stated in the comment |
| - agx: Don't use ssa_to_reg across blocks |
| - agx: Don't reuse ssa_to_reg across blocks |
| - agx: Remove unused allocation |
| - agx: Stop setting forwarding bit |
| - agx: Handle blocks with no predecessors |
| - agx: Lower f2u8/f2i8 |
| - agx: Handle conversions to 8-bit |
| - agx: Fix uadd_sat packing |
| - agx: Fix 64-bit immediate moves |
| - agx: Lower f2f16_rtz |
| - agx: Handle f2f16_rtne like f2f16 |
| - agx: Handle <32-bit local memory access |
| - agx: Do not allow creating vec8 |
| - asahi: Legalize compression before blitting |
| - nir: Drop "SSA" from NIR language |
| - agx: Stop passing nir_dest around |
| - agx: Remove agx_nir_ssa_index |
| - pan/mdg: Don't reference nir_dest |
| - pan/bi: Don't reference nir_dest |
| - asahi: Do not reference nir_dest |
| - panfrost: Do not reference nir_dest |
| - zink: Do not reference nir_dest |
| - ir3: Do not reference nir_dest |
| - dxil: Do not reference nir_dest |
| - nir: Drop nir_dest_init |
| - panfrost: Pack stride at CSO create time on v9 |
| - lvp,nir/lower_input_attachments: Use nir_trim_vector |
| - broadcom/compiler: Use nir_trim_vector explicitly |
| - nir: Assert that nir_ssa_for_src components matches |
| - nir: Add nir_shader_intrinsics_pass |
| - nir: Lower fquantize2f16 |
| - agx: Lower fquantize2f16 |
| - nir/lower_helper_writes: Consider bindless images |
| - nir/passthrough_gs: Correctly set vertices_in |
| - nir/passthrough_gs: Fix array size |
| - nir/print: Print access qualifiers for intrinsics |
| - nir/lower_gs_intrinsics: Remove end primitive for points |
| - panfrost/ci: Disable T720 |
| - nir: Add load_sysval_agx intrinsic |
| - agx: Fix extraneous bits with b2b32 |
| - agx: Use more barriers |
| - asahi: Copy CSO stride |
| - agx: Assert vertex_id, instance_id are VS-only |
| - asahi: Keep drawoverhead from OOMing itself |
| - agx: Don't blow up when lowering textures twice |
| - agx/lower_vbo: Handle nonzero component |
| - agx: Allow loop headers without later preds |
| - agx: Handle b2i8 |
| - agx: Convert 8-bit comparisons |
| - agx: Implement imul_high |
| - asahi: Advertise OpenGL ES 3.1! |
| - asahi/decode: Turn assert into error |
| - asahi: Report local_size from compiler |
| - asahi: Use local_size from compiler directly |
| - asahi: Pass layer stride in pixels, not elements |
| - agx: Clear sample count after lowering MSAA |
| - agx: Clear image_array after lowering |
| - asahi: Preserve atomic ops when rewriting image to bindless |
| - agx: Use 16-bit reg for pixel_coord |
| - asahi: Generalize query logic |
| - asahi: Simplify occlusion query batch tracking |
| - asahi: Refactor agx_get_query_result |
| - asahi: Only touch batch->occlusion_queries for occlusion |
| - asahi: Sync when beginning a query |
| - asahi: Add non-occlusion query tracking |
| - asahi: Add get_query_address helper |
| - agx/fence_images: Use intrinsics_pass |
| - agx: Do not fence write-only images |
| - asahi: Add missing LOD source for agx_meta's txfs |
| - agx: Do some texture lowering early |
| - agx: Add helper returning if a descriptor crawl is needed |
| - nir,asahi: Remove texture_base_agx |
| - asahi: Move UBO lowering into GL driver |
| - asahi: Add sysval tables for each shader stage |
| - asahi: Split out per-stage sysvals |
| - asahi: Collapse grid_info |
| - asahi: Extract agx_upload_textures |
| - asahi: Upload a single draw_uniforms per draw |
| - asahi: Add real per-stage dirty flags |
| - asahi: Extract sampler upload |
| - asahi: Put unuploaded uniforms on the batch |
| - asahi: Decouple sysval lowering from uniform assignment |
| - asahi: Use finer dirty tracking for blend constant |
| - asahi: Use proper dirty tracking for VBOs |
| - asahi: Dirty track VBOs + blend const separately |
| - asahi: Dirty the shader stage when the shader changes |
| - asahi: Fix shader stage dirtying |
| - treewide: Use nir_shader_intrinsic_pass sometimes |
| - treewide: Also handle struct nir_builder form |
| - nir/lower_shader_calls: Fix warning with clang |
| - nir: Add nir_before/after_impl cursors |
| - treewide: Use nir_before/after_impl in easy cases |
| - treewide: Use nir_before/after_impl for more elaborate cases |
| - radv: Use before/after_cf_list for entrypoints |
| - ci: Disable known broken Bifrost Vulkan job |
| - ci: Disable WHL jobs |
| - nir/opt_if: Simplify if's with general conditions |
| - asahi: Fixes for clang-warnings |
| - agx: Fix jmp_exec_none encoding |
| - agx/validate: Print to stderr |
| - agx: Annotate opcodes with a scheduling class |
| - agx: Add schedule-specialized get_sr variants |
| - agx: Include schedule class in the opcode info |
| - agx: Schedule for register pressure |
| - agx: Lower pack_32_4x8_split |
| - asahi: Force translucency for ignored render targets |
| - agx: Remove logical_end instructions |
| - agx: Lower pseudo-ops later |
| - agx: Expand nest |
| - agx: Lower nest later |
| - agx: Split nest instruction into begin_cf + break |
| - agx: Add break_if_*cmp instructions |
| - agx: Add agx_first/last_instr helpers |
| - agx: Use agx_first_instr |
| - agx: Detect conditional breaks |
| - agx: Omit push_exec at top level |
| - agx: Omit while_icmp without continue |
| - agx: Add helper to determine if a NIR loop uses continue |
| - agx: Only use nest by 1 for loops w/o continue |
| - agx: Add pseudo-instructions for icmp/fcmp |
| - agx: Generate unfused comparison pseudo ops |
| - agx: Fuse conditions into if's |
| - agx: Fuse compares into selects |
| - agx: Add unit test for if_cmp fusing |
| - agx: Add unit test for cmp+sel fusing |
| - asahi: Translate cube array dimension |
| - ail: Force page-alignment for layered attachments |
| - agx: Handle cube arrays when clamping arrays |
| - agx: Lower coordinates for cube map array images |
| - agx: Run opt_idiv_const after lowering texture |
| - asahi: Forbid linear 1D Array images |
| - asahi: Handle linear 1D Arrays |
| - asahi: Conditionally expose cube arrays |
| - gallium,mesa/st: Add PIPE_CONTEXT_NO_LOD_BIAS flag |
| - asahi: Skip LOD bias lowering for GLES |
| - nir: Add nir_function_instructions_pass helper |
| - nir: Add NIR_OP_IS_DERIVATIVE property |
| - nir: Hoist nir_op_is_derivative |
| - nir/opt_preamble: Use nir_op_is_derivative |
| - nir/opt_gcm: Use nir_op_is_derivative more |
| - nir/gather_info: Use nir_op_is_derivative |
| - nir/opt_sink: Sink load_constant_agx |
| - nir/opt_sink: Sink load_local_pixel_agx |
| - nir/opt_sink: Sink frag coord instructions |
| - nir/opt_sink: Do not move derivatives |
| - nir/opt_sink: Move ALU with constant sources |
| - nir/opt_sink: Also consider load_preamble as const |
| - agx: Enable sinking ALU |
| - treewide: Drop nir_ssa_for_src users |
| - treewide: Remove remaining nir_ssa_for_src |
| - nir: Remove nir_ssa_for_src |
| - asahi: Clamp index buffer extent to what's read |
| - agx: Align the reg file for 256-bit vectors |
| - agx: Hoist sample_mask/zs_emit |
| - agx: Set PIPE_SHADER_CAP_CONT_SUPPORTED |
| - agx: Augment if/else/while_cmp with a target |
| - agx: Add jumps to block ends |
| - agx: Add agx_prev_block helper |
| - agx: Insert jmp_exec_none instructions |
| - nir: Add layer_id_written_agx sysval |
| - nir: Support arrays in block_image_store_agx |
| - agx/nir_lower_texture: Allow disabling layer clamping |
| - agx: Pack block image store dim correctly |
| - agx: Handle layered block image stores |
| - agx: Add pass to lower layer ID writes |
| - asahi: Add helper to get layer id in internal program |
| - asahi,agx: Select layered rendering outputs |
| - agx: Support packed layered rendering writes |
| - agx/tilebuffer: Support layered layouts |
| - agx/lower_tilebuffer: Support spilled layered RTs |
| - asahi: Use layered layouts |
| - asahi: Expose VS_LAYER_VIEWPORT behind a flag |
| - asahi: Account for layering for attachment views |
| - asahi: Assume LAYER is flat-shaded |
| - asahi: Add pass to predicate layer ID reads |
| - asahi: Predicate layer ID reads |
| - asahi: Write to cubes/etc attachments as 2D array |
| - asahi: Use a 2D Array texture for array render targets |
| - asahi: Generate layered EOT programs |
| - asahi: Handle layered background programs |
| - lima/pp: Do not use union undefined behaviour |
| - nir: Add trivial nir_src_* getters |
| - nir: Use set_parent_instr internally |
| - nir: Use getters for nir_src::parent_* |
| - nir: Assert the nir_src union is used safely |
| - nir: Use a tagged pointer for nir_src parents |
| - nir: Add ACCESS_CAN_SPECULATE |
| - ir3: Set CAN_SPECULATE before opt_preamble |
| - ir3: Model cost of phi nodes for opt_preamble |
| - nir/opt_preamble: Walk cf_list manually |
| - nir/opt_preamble: Preserve IR when replacing phis |
| - nir/opt_preamble: Unify foreach_use logic |
| - nir/opt_preamble: Move phis for movable if's |
| - nir/opt_preamble: Respect ACCESS_CAN_SPECULATE |
| - freedreno/ci: Minetest |
| - r600/sfn: Handle load_global_constant |
| - nir/opt_phi_precision: Work with libraries |
| - nir/legalize_16bit_sampler_srcs: Use instr_pass |
| - nir/print: Handle KERNEL |
| - nir/lower_io: Use load_global_constant for OpenCL |
| - nir/opt_algebraic: Reduce int64 |
| - nir/opt_algebraic: Optimize LLVM booleans |
| - nir/trivialize_registers: Handle obscure load hazard |
| - hasvk: Support builiding on non-Intel |
| - crocus: Support building on non-Intel |
| - meson: Add vulkan-drivers=all option |
| - meson: Add gallium-drivers=all option |
| - agx: Fix fragment side effects scheduling |
| |
| Amber (7): |
| |
| - ir3: make wave_granularity configurable |
| - turnip: Add support for devices not supporting double thread size. |
| - turnip: make sampler_minmax support configurable. |
| - freedreno, turnip: set correct reg_size_vec4 for a6xx_gen1_low |
| - ir3: handle non-uniform case for atomic image/ssbo intrinsics |
| - freedreno: Add support for devices not supporting double thread size. |
| - turnip: Add debug option to allow non-conforming features. |
| |
| Andrew Randrianasulu (1): |
| |
| - nv50/ir: Remove few nvc0 specific defines from nv50-specific header. |
| |
| Antonio Gomes (9): |
| |
| - rusticl/kernel: Removing unnecessary clone in kernel launch |
| - rusticl/kernel: Add CsoWrapper |
| - rusticl/compiler: Add NirPrintfInfo |
| - rusticl: Move Cso to Program |
| - rusticl/compiler: Remove unnecessary functions |
| - rusticl: Move NirKernelBuild to ProgramDevBuild |
| - rusticl/program: New helper functions to NirKernelBuild |
| - rusticl/core: Delete KernelDevState and KernelDevStateInner |
| - rusticl/core: Make convert_spirv_to_nir output pair (KernelInfo, NirShader) |
| |
| Asahi Lina (29): |
| |
| - docs/tgsi: Specify that depth texture fetches are replicated |
| - asahi: Add synctvb debug flag |
| - asahi: Add smalltile debug option |
| - asahi: Add nomsaa debug flag |
| - asahi: decode: Add a params argument to pass through |
| - asahi: Add extra CDM header block for G14X |
| - asahi: wrap: Handle freeing shmems |
| - asahi: decode: Refactor to always copy GPU mem to local buffers |
| - asahi: decode: Add a function to construct decode_params from a chip_id |
| - asahi: Add a shared library interface for decode |
| - asahi: Add a noshadow debug flag |
| - asahi: Do not overallocate BOs by more than 2x |
| - asahi: Fix race in BO stats accounting |
| - asahi: Always use resource size, not BO size |
| - asahi: Print info about shadowed resources |
| - asahi: Impose limits on resource shadowing |
| - asahi: Force linear for SHARED buffers with no/implicit modifier |
| - asahi: Enable explicit coherency for G14D (multi-die) |
| - asahi: Handle non-written RTs correctly |
| - asahi: Fix incorrect BO bitmap reallocations |
| - asahi: Allocate staging resources as staging |
| - asahi: cmdbuf: Identify call/ret bits |
| - asahi: decode: Implement VDM call/ret |
| - asahi: decode: Do not assert on buffer overruns |
| - asahi: Fix VDM pipeline field width |
| - asahi: Add scaffolding for supporting driconf options |
| - asahi: Add and support the no_fp16 driconf flag |
| - driconf: Disable fp16 for browsers |
| - asahi: Allow no16 flag for disk cache |
| |
| Bas Nieuwenhuizen (16): |
| |
| - aco: fix nir_op_vec8/16 with 16-bit elements. |
| - aco: Fix some constant patterns in 16-bit vec4 construction with s_pack. |
| - nir: Fix 16-component nir_replicate. |
| - radv: Expose VK_EXT_external_memory_acquire_unmodified. |
| - util/perf: Add gpuvis integration. |
| - egl,venus,vulkan,turnip,freedreno: Update CPU trace init to init more than perfetto. |
| - vulkan: Add CPU tracing for vkWaitForFences. |
| - docs: Add documentation for gpuvis. |
| - vulkan: Add trace points for more Vulkan waiting functions. |
| - radv: Use a double jump to limit nops in DGC for dynamic sequence count. |
| - nir: Add AMD cooperative matrix intrinsics. |
| - aco: Add WMMA instructions. |
| - aco: Make RA understand WMMA instructions. |
| - radv: Don't transparently use wave32 with cooperative matrices. |
| - radv: Add cooperative matrix lowering. |
| - radv: Expose VK_KHR_cooperative_matrix. |
| |
| Benjamin Cheng (10): |
| |
| - radv/video: use app provided hevc scaling list order |
| - radv/video: copy from correct H264 scaling lists |
| - anv/video: copy from correct H264 scaling lists |
| - vulkan/video: add helper to derive H264 scaling lists |
| - radv/video: use vk_video_derive_h264_scaling_list |
| - anv/video: use vk_video_derive_h264_scaling_list |
| - util/vl: extract gallium vl scanning data to shared code |
| - radv/video: send h264 scaling list in raster order |
| - anv/video: send h264 scaling list in raster order |
| - radv/video: find SPS with pps_seq_parameter_set_id |
| |
| Benjamin Lee (1): |
| |
| - nvk: Fix segfault when opening DRI device file returns error |
| |
| Biswapriyo Nath (1): |
| |
| - radv/video: Match function definitions to declarations |
| |
| Boris Brezillon (1): |
| |
| - panfrost: Flag the right shader when updating images |
| |
| Boyuan Zhang (3): |
| |
| - virgl: Add vp9 picture desc |
| - virgl: Implement vp9 hardware decode |
| - radeonsi/vcn: disable tmz ctx buffer for VCN_2_2_0 |
| |
| Caio Oliveira (134): |
| |
| - nir: Use instructions_pass() for nir_fixup_deref_modes() |
| - meson: Ensure that LLVMSPIRVLib is not required for Clover |
| - nir: Let nir_fixup_deref_modes() fix deref_casts when possible |
| - nir: Add nir_opt_reuse_constants() |
| - radv: Use nir_opt_reuse_constants() |
| - compiler/types: Use ralloc for the key in array_types |
| - compiler/types: Use smaller keys for array_types table |
| - compiler/types: Extract get_explicit_matrix_instance() function |
| - compiler/types: Use smaller keys for explicit_matrix_types table |
| - anv/tests: Refactor state_pool_test_helper to not use macros for parametrization |
| - anv/tests: Link a single anv_tests binary using gtest |
| - anv/tests: Propagate failures to gtest |
| - hasvk/tests: Refactor state_pool_test_helper to not use macros for parametrization |
| - hasvk/tests: Link a single hasvk_tests binary using gtest |
| - hasvk/tests: Propagate failures to gtest |
| - util: Add convenience macros for linear allocator |
| - compiler/types: Use right hash for function types |
| - compiler/types: Don't duplicate empty string |
| - compiler/types: Constify a couple of pointers in glsl_type |
| - compiler/types: Remove unused GLSL_TYPE_FUNCTION and related functions |
| - compiler/types: Move GLSL specific builtin structs into glsl/ |
| - glsl: Add missing glsl_types initialization to test_optpass |
| - glsl: Don't create struct type builtins |
| - compiler/types: Add extra level of macro to builtin_macros |
| - compiler/types: Use designated initializer syntax to specify builtins |
| - compiler/types: Move local cache details to implementation file |
| - compiler/types: Add a mem_ctx for the glsl_type_cache |
| - compiler/types: Use type cache mem_ctx for hash tables |
| - compiler/types: Don't store a mem_ctx per type |
| - compiler/types: Simplify clearing the glsl_type_cache |
| - compiler/types: Move static asserts about glsl_type to a central place |
| - compiler/types: Store builtin types directly as data |
| - compiler/types: Use a linear (arena) allocator for glsl_types |
| - compiler/types: Make struct glsl_type visible to C code |
| - compiler/types: Add workaround to use builtin_type_macros.h in C |
| - compiler/types: Move builtin type initialization to C |
| - glsl: Annotate _mesa_glsl_error() with PRINTFLIKE |
| - compiler/types: Fix array name dimension flipping for unsized arrays |
| - compiler/types: Use Python to generate code for builtin types |
| - compiler/types: Use glsl_get_type_name() to access the type name |
| - compiler/types: Change glsl_type::name to be an uintptr_t |
| - compiler/types: Use a string table for builtin type names |
| - intel/compiler/xe2: Account for reg_unit() in TCS intrinsics |
| - intel/compiler/xe2: Account for reg_unit() in TES intrinsics |
| - intel/fs/xe2+: Update BS payload setup for Xe2 reg size. |
| - intel/fs/xe2+: Update TASK/MESH payload setup for Xe2 reg size. |
| - compiler: Use a meson dependency for libcompiler |
| - meson: Remove unnecessary inc_compiler mentions |
| - rusticl: Ensure NIR generated headers will be available |
| - clover: Hide SPIR-V related code behind HAVE_CLOVER_SPIRV |
| - clover: Only compile/depend libclspirv and libclnir when using SPIR-V support |
| - compiler: Only enable mesaclc helper if we have OpenCL SPIR-V support |
| - intel/compiler: Don't allocate memory for SIMD select error handling |
| - microsoft/compiler: Fix printf formatting string issues |
| - util: Add more PRINTFLIKE and MALLOCLIKE annotations |
| - util: Remove ralloc_parent from linear_header |
| - util: Use linear parent to (r)allocated extra nodes |
| - util: Remove size from linear_parent creation |
| - util: Make DECLARE_LINEAR_ALLOC_* macros assume no destructors |
| - util: Use an opaque type for linear context |
| - util: Remove usages of linear_realloc() |
| - util: Remove linear_realloc() |
| - util: Remove size information from child allocations |
| - util: Remove per-buffer header in linear alloc for release mode |
| - util: Add a few basic tests for linear_alloc |
| - util: Fix bookkeeping of linear node sizes |
| - intel/compiler: Don't store stage name and abbrev |
| - intel/compiler/xe2: URB fence uses LSC now |
| - intel/compiler/xe2: Fix URB writes in TCS |
| - intel/compiler/xe2: Update TCS ICP handle code to support SIMD16 |
| - compiler/types: Add support for Cooperative Matrix types |
| - nir: Add new intrinsics for Cooperative Matrix |
| - nir: Handle cooperative matrix in various passes |
| - spirv: Expose some memory related functions in vtn_private.h |
| - spirv: Let vtn_ssa_value hold references to variables |
| - spirv: Implement SPV_KHR_cooperative_matrix |
| - compiler/types: Remove private related declarations |
| - compiler/types: Remove use of new/delete |
| - compiler/types: Remove use of references |
| - compiler/types: Remove use of auto |
| - compiler/types: Use C compatible cast syntax |
| - compiler/types: Spell struct and enum in type names |
| - compiler/types: Add void parameter to ensure these are valid C prototypes |
| - intel/fs: Tweak default case of fs_inst::size_read() |
| - compiler/types: Move the C++ inline functions in glsl_type out of the struct body |
| - compiler/types: Move C declarations into glsl_types.h |
| - compiler/types: Flip wrapping of base_type checks |
| - compiler/types: Flip wrapping of various type identification checks |
| - compiler/types: Flip wrapping of convenience accessors for vector types |
| - compiler/types: Flip wrapping of basic "get type" functions |
| - rusticl: Add Rust bindings for inline glsl_types functions |
| - util: Add size to ralloc_header in debug mode |
| - util: Add a canary to identify gc_ctx in debug mode |
| - util: Add function print information about a ralloc tree |
| - util: Avoid waste space when linear alloc'ing large sizes |
| - spirv: Expose stage enum conversion in vtn_private.h |
| - spirv: Change spirv2nir to use the shorter shader name abbreviations |
| - spirv: List entry-points in spirv2nir when unsure what to use |
| - spirv: Let spirv2nir find out the shader to use |
| - intel/compiler: Don't emit calls to validate() in release build |
| - compiler/types: Flip wrapping of "type contains?" predicate functions |
| - compiler/types: Flip wrapping of array related functions |
| - compiler/types: Flip wrapping of cmat related functions |
| - compiler/types: Flip wrapping of CL related functions |
| - compiler/types: Flip wrapping of size related functions |
| - compiler/types: Flip wrapping of struct related functions |
| - compiler/types: Flip wrapping of interface related functions |
| - compiler/types: Flip wrapping of layout related functions |
| - compiler/types: Flip wrapping of record_compare |
| - compiler/types: Flip wrapping of get_instance() |
| - compiler/types: Flip wrapping of texture/sampler/image get instance functions |
| - compiler/types: Flip wrapping of various get instance functions |
| - compiler/types: Flip wrapping of get row/column type helpers |
| - compiler/types: Flip wrapping of remaining non-trivial type getters |
| - compiler/types: Flip wrapping of remaining small data getters |
| - compiler/types: Flip wrapping of numeric type conversion functions |
| - compiler/types: Move remaining code from nir_types to glsl_types |
| - rusticl: Add bindings for glsl_vector_type() |
| - compiler/types: Add more glsl_contains_*() functions and use them in C++ |
| - compiler/types: Add glsl_get_mul_type() and use it in C++ |
| - compiler/types: Add glsl_type_compare_no_precision() and use it in C++ |
| - compiler/types: Add glsl_type_uniform_locations() and use it in C++ |
| - compiler/types: Add glsl_get_std430_array_stride() and use it in C++ |
| - compiler/types: Add glsl_get_explicit_*() functions and use them in C++ |
| - compiler/types: Implement glsl_type::field_type() in terms of existing functions |
| - compiler/types: Add glsl_simple_explicit_type() and simplify glsl_simple_type() |
| - compiler/types: Add remaining type extraction functions and use them in C++ |
| - compiler/types: Use C instead of C++ constants for builtin types |
| - compiler/types: Remove usages of C++ members in glsl_types.cpp |
| - compiler/types: Annotate extern "C" only once in glsl_types.cpp |
| - compiler/types: Rename glsl_types.cpp to glsl_types.c |
| - compiler/types: Remove warnings about potential fallthrough |
| - compiler/types: Move comments and reorganize declarations |
| - anv: Fix leak when compiling internal kernels |
| |
| Carsten Haitzler (2): |
| |
| - kmsro: Add hdlcd DPU |
| - panfrost: Add GPU variant of G57 to the set of known ids |
| |
| Charles Giessen (1): |
| |
| - panvk: Use 1.0 in ICD Manifest json |
| |
| Charmaine Lee (8): |
| |
| - svga: set clear_texture to NULL for vgpu9 |
| - svga: fix stride used in vertex declaration |
| - svga: fix persistent mapped surface update to constant buffer |
| - svga: restrict use of rawbuf for constant buffer access to GL43 device |
| - svga: fix immediates used in rawbuf for constant buffer |
| - svga: use srv raw buffer for accessing readonly shader buffer |
| - svga: sync resource content from backing resource before image upload |
| - svga: ignore sampler view resource if not used by shaders |
| |
| Chia-I Wu (38): |
| |
| - radv: fix separate depth/stencil layouts in fb state |
| - radv: fix separate depth/stencil layouts in resolve meta |
| - radv: refactor depth clear in clear meta |
| - radv: fix separate depth/stencil layouts in clear meta |
| - amd/ci: update radv-stoney-aco-fails.txt for depth/stencil clear |
| - radv: disable tc-compat htile for layered images on gfx8 |
| - amd/ci: update radv-stoney-aco-fails.txt for depth/stencil resolve |
| - winsys/amdgpu: fix a race between import and destroy |
| - ac/surface: limit RADEON_SURF_NO_TEXTURE to color surfaces |
| - winsys/radeon: fix a race between bo import and destroy |
| - vulkan/runtime: add a helper for ETC2 emulation |
| - radv: use vk_tecompress_etc2 from the runtime |
| - vulkan/runtime: fix image type check for ETC2 emulation |
| - vulkan/runtime: fix a harmless typo for ETC2 emulation |
| - vulkan/runtime, radv: remove 1D support from ETC2 emulation |
| - radv: add radv_is_format_emulated |
| - radv: simplify view format override for emulated formats |
| - radv: hard code format features for emulated formats |
| - mesa: make astc_decoder.glsl vk-compatible |
| - radv, drirc: rename radv_require_{etc2,astc} |
| - anv: remove unused field from anv_image_view |
| - anv: add anv_image_view_{init,finish} |
| - anv: support image views with surface state stream |
| - anv: add anv_push_descriptor_set_{init,finish} |
| - anv: support alternative push descriptor sets |
| - anv: add anv_descriptor_set_write |
| - anv: add anv_cmd_buffer_{save,restore}_state |
| - anv: add anv_is_format_emulated |
| - anv: add a hidden plane for emulated formats |
| - anv: decompress on upload for emulated formats |
| - anv: fix up image views for emulated formats |
| - anv: fix up blit src for emulated formats |
| - anv: advertise emulated formats |
| - anv: add support for vk_require_astc driconf |
| - util: improve BITFIELD_MASK and BITFIELD64_MASK on clang |
| - anv: prep for gen9 astc workaround |
| - anv: add gen9 astc workaround |
| - radv: fix image view extent override for astc |
| |
| Chris Spencer (9): |
| |
| - radv: initialize result when pipeline cache creation fails |
| - anv/android: Fix importing hardware buffers with planar formats |
| - anv/android: Add support for AHARDWAREBUFFER_FORMAT_YV12 |
| - anv: Advertise Vulkan 1.3 on Android 13 |
| - anv: Don't reject Android image format if external props not supplied |
| - android: Add explanatory comment to u_gralloc |
| - anv/android: Enable shared presentable image support |
| - anv/video: use correct enum value for max level IDC |
| - radv/video: use correct enum value for max level IDC |
| |
| Christian Gmeiner (41): |
| |
| - nir/print: print instr pass_flags |
| - etnaviv: move nir texture lowerings into one pass |
| - nir: add enta specific intrinsic used for txs lowering |
| - etnaviv: nir: support intrinsic used for txs lowering |
| - etnaviv: nir: lower nir_texop_txs |
| - ci/etnaviv: update ci expectations |
| - etnaviv: make use of BITFIELD_BIT(..) macro |
| - etnaviv: name the enum used for pass_flags |
| - etnaviv: add is_dead_instruction(..) helper |
| - etnaviv: extend etna_pass_flags with source modifiers |
| - etnaviv: do not clear all pass_flags before RA |
| - etnaviv: nir: look at parent instr in lower_alu(..) |
| - etnaviv: nir: add etna_nir_lower_to_source_mods(..) |
| - etnaviv: nir: switch to etna_nir_lower_to_source_mods(..) |
| - etnaviv: nir: convert to new-style NIR registers |
| - freedreno/regs: remove double assignment of self.current_domain |
| - freedreno/regs: remove not used variable |
| - freedreno/regs: remove dead code |
| - freedreno/regs: python does not need ';' |
| - etnaviv: switch to log2f(..) |
| - etnaviv: switch to U_FIXED(..) macro |
| - etnaviv: switch to S_FIXED(..) macro |
| - etnaviv: fix null pointer dereference |
| - etnaviv: switch to float_to_ubyte(..) |
| - ci/etnaviv: update ci expectation |
| - etnaviv: unbreak cmdline compiler |
| - agx/lower_address: Use intrinsics_pass |
| - agx/lower_address: Remove not used has_offset |
| - isaspec: python does not need ';' |
| - docs: Move isaspec out of drivers/freedreno |
| - isaspec: Add support for templates |
| - isaspec: encode: Correct used regex |
| - isaspec: Add method to get all instrustions |
| - isaspec: Add support for custom meta information |
| - isaspec: Add BitSetEnumValue object |
| - spirv: Don't use libclc for rotate |
| - docs: update etnaviv extensions |
| - etnaviv: drm: Be able to mark end of context init |
| - etnaviv: Skip 'empty' cmd streams |
| - ci: Bump PyYAML to 6.0.1 |
| - etnaviv: Don't leak disk_cache |
| |
| Collabora's Gfx CI Team (2): |
| |
| - Uprev Piglit to ed58dfbd12be34fa3dab97a7a2987b890e0637f1 |
| - Uprev Piglit to f7db20b03de6896d013826c0a731bc4417c1a5a0 |
| |
| Cong Liu (2): |
| |
| - r300: Fix out-of-bounds access in ntr_emit_store_output() |
| - virgl:Fix ITEM_CPY macro pointer copy bug |
| |
| Connor Abbott (83): |
| |
| - afuc: Rework and significantly expand README.rst |
| - tu: Fix vk2tu_*_stage flag type |
| - tu: Fix and simplify execution dependency handling |
| - tu, freedreno/a6xx: Remove has_ccu_flush_bug |
| - ir3: Handle GS stream "mixing" with non-point output primitives |
| - tu: Disable transformFeedbackPreservesProvokingVertex |
| - isaspec: Add "displayname" for altering {NAME} when decoding |
| - isaspec: Add support for "absolute" branches |
| - isaspec: Add support for function and entrypoint labels |
| - isaspec: Add "custom" field type |
| - isaspec: Add callback after decoding an instruction |
| - isaspec: Rename isa_decode() to isa_disasm() |
| - isaspec: Add initial decoding support |
| - afuc: Fix xmov lexer typo |
| - afuc: Convert to isaspec |
| - afuc: Add setbit/clrbit |
| - afuc: Fix writing $00 |
| - freedreno/afuc: Initial a7xx support |
| - ir3: Parse (eq) flag |
| - ir3, freedreno, tu: Plumb through SP_FS_PREFETCH_CNTL::ENDOFQUAD |
| - tu: Add missing last_baryf statistic |
| - freedreno, tu, ir3: Add last_helper statistic |
| - ir3: Gather pixlod status earlier |
| - ir3: Implement helper invocation optimization |
| - vk/graphic_state, tu: Use dynamic blend count from subpass |
| - freedreno/a7xx: Add CP_RESET_CONTEXT_STATE |
| - vk/graphics_state: Fix copying MS locations pipeline state |
| - tu: Remove MSAA draw state |
| - tu: Merge SAMPLE_LOCATIONS and SAMPLE_LOCATIONS_ENABLE draw states |
| - tu: Merge PC_RASTER_CNTL into RAST draw state |
| - tu: Stop reusing base Vulkan dynamic state enums |
| - tu: Merge depth/stencil draw states |
| - tu: Rename PrimID-related registers |
| - tu, freedreno/a6xx: Don't use VS for PrimID passthru state |
| - tu: Pull entangled shader state into program config |
| - ir3: Add ir3_find_input_loc() helper |
| - tu: Split up tu6_emit_vpc() |
| - freedreno, ir3, tu: Constify various uses of ir3_shader_variant |
| - ir3: Add helper to determine when variant exceeds safe constlen |
| - tu: Split program draw state into per-shader states |
| - tu: Fix per-view viewport state propagation |
| - tu: Fix tu6_emit_*_fdm size call |
| - tu: Fix assert in FDM state emission |
| - tu: Actually emit patchpoint for viewports with FDM |
| - nir/lower_subgroups: Don't do multiple lowerings at once |
| - nir/spirv: Add inverse_ballot intrinsic |
| - amd: Use inverse ballot intrinsic if available |
| - tu: Create singleton "empty" shaders |
| - tu: Start tracking shaders independently of pipeline |
| - tu: Move FS-specific pipeline information to the shader |
| - tu: Use shader directly for VS/TCS output size and patch size |
| - tu: Rewrite tessellation modes handling |
| - tu: Rework passing shared consts |
| - tu: Decouple program state from the pipeline |
| - tu: Use pipeline feedback loop flag indirectly |
| - tu: Rewrite remaining pipeline LRZ handling |
| - tu: Don't reference pipeline for some draw states |
| - tu: Make compute dispatch use the shader |
| - tu: Don't use pipeline for dynamic draw states |
| - tu: Don't use pipeline for bandwidth validity |
| - tu: Don't use pipeline for per_view_viewport |
| - tu: Don't use pipeline for active stages |
| - tu: Remove pipeline from state |
| - zink: Rework color clamping and conversion |
| - freedreno/fdl: Use A8_UNORM HW format for sampling |
| - tu: Support clearing A8_UNORM |
| - freedreno/fdl: Support PIPE_FORMAT_R5G5B5A1_UNORM on a6xx |
| - tu/clear_blit: Fix staging image view layer count |
| - tu/clear_blit: Allow VK_REMAINING_ARRAY_LAYERS as layerCount |
| - tu: Allow VK_WHOLE_SIZE in tu_CmdBindVertexBuffers2EXT pSizes |
| - tu: Implement vkCmdBindIndexBuffer2KHR |
| - tu: Implement vkGetImageSubresourceLayout2KHR and vkGetDeviceImageSubresourceLayoutKHR |
| - tu: Implement vkGetRenderingAreaGranularityKHR |
| - tu: Use new buffer usage flags |
| - tu: Support VkPipelineCreateFlags2CreateInfoKHR |
| - tu: Check for DEVICE_LOST in vkGetEventStatus() |
| - tu: Add maintenance5 properties |
| - freedreno/ci: Skip dEQP-VK.info.device_extensions |
| - tu: Expose VK_KHR_maintenance5 |
| - freedreno/ci: Remove minetest trace |
| - v3d/ci: Remove minetest trace |
| - ir3/ra: Don't swap killed sources for early-clobber destination |
| - tu: Fix re-emitting VS param state after it is re-enabled |
| |
| Corentin Noël (16): |
| |
| - ci: Add locked flag to bindgen-cli installation |
| - virgl: Do not expose EXT_texture_mirror_clamp when using a GLES host |
| - ci: disable Collabora's LAVA lab for maintenance |
| - llvmpipe: make sure to initialize the lp_setup_context slots with the default values |
| - virgl: Cover all the formats defined in the virgl definition |
| - mesa: Ensure that the baselevel will never exceed the maximal supported number |
| - ci: Uprev virglrenderer |
| - freedreno/drm/virtio: Use MESA_TRACE_SCOPE instead of _BEGIN/_END |
| - tu: Use MESA_TRACE_SCOPE instead of _BEGIN/_END |
| - aux/tc: Use MESA_TRACE_SCOPE instead of _BEGIN/_END |
| - venus: Change the only occurrence of VN_TRACE_BEGIN/END to VN_TRACE_SCOPE |
| - util: Avoid the use of MESA_TRACE_BEGIN/END |
| - util/perf: Remove the tracing categories |
| - util: Remove MESA_TRACE_BEGIN/END |
| - mesa/bufferobj: ensure that very large width+offset are always rejected |
| - frontends/va: Remove wrong use of ProfileToPipe |
| |
| Daniel Schürmann (9): |
| |
| - nir/opt_move: fix handling of if-condition |
| - aco: append p_logical_end after monolithic RT shaders |
| - aco/insert_exec_mask: set Exact mode after p_discard_if when necessary |
| - aco: don't optimize cross-lane instructions across p_wqm |
| - aco: make p_wqm a marker instruction without Operands/Definitions |
| - aco: don't insert a copy when emitting p_wqm |
| - aco: insert a single p_end_wqm after the last derivative calculation |
| - aco/insert_exec_mask: Simplify WQM handling (1/2) |
| - aco/insert_exec_mask: Simplify WQM handling (2/2) |
| |
| Daniel Stone (23): |
| |
| - dri: Support 1555/4444 formats |
| - egl/dri2: Don't look up image extension twice |
| - egl/wayland: Always initialise fd_display_gpu |
| - egl/wayland: Add image loader extension for swrast |
| - egl/wayland: Never use DRI2_LOADER extension |
| - egl/wayland: Assume modern DRI interface versions |
| - egl/drm: Use IMAGE_DRIVER instead of DRI2_LOADER |
| - egl/drm: Assume modern DRI interface versions |
| - ci: Disable nouveau CI |
| - panfrost/vk: Use correct sampler dimensions for MSAA |
| - ci: Declare stages before jobs |
| - ci/radeonsi: Add new flake |
| - ci/d3d12: Add new flake |
| - ci/intel: Add new skqp flake |
| - ci/zink: Add new zink-lvp flakes |
| - ci/radeonsi: Skip more really slow tests |
| - ci/zink: Add another conversion fail on a618 |
| - ci: Move farm-disable rules before anything else |
| - ci: Always set user container jobs to manual |
| - ci: Use container rules for containers |
| - ci: Only look at file changes for MRs |
| - ci: Fix pre-merge pipelines with no code changes |
| - ci: Try really hard to print final result string |
| |
| Daniel van Vugt (1): |
| |
| - glx: Increment dpy->request before issuing an error that had no request |
| |
| Danylo Piliaiev (71): |
| |
| - freedreno/cffdec: Decode CP_DRAW_AUTO |
| - freedreno, turnip: Clarify some RB_CCU_CNTL fields |
| - freedreno,turnip: Make number of VSC pipes configurable |
| - freedreno,turnip: Make CS shared memory size configurable |
| - freedreno,turnip: Make VS input attr/binding count configurable |
| - freedreno: Add A605, A608, A610, A612 GPUs definition |
| - turnip: Make multiview support configurable per generation |
| - ir3: Make FS tex prefetch optimization optional |
| - ir3: Use NIR info to enable per sample shading |
| - freedreno/regs: Rename SP_FS_CTRL_REG0.DIFF_FINE into LODPIXMASK |
| - ir3: Fix FS quad ops returning wrong values from helper invocations |
| - tu,freedreno: Forbid blit event for R8G8_SRGB due to gpu faults |
| - radv: fix unused non-xfb shader outputs not being removed |
| - vulkan/nir: Add common helper to check if output is XFB |
| - radv: Use common nir_vk_is_not_xfb_output |
| - turnip: Use common nir_vk_is_not_xfb_output |
| - freedreno/regs: Define unknown SP_FS_PREFETCH_CNTL fields |
| - freedreno/registers: Refactor gen_header.py to allow more options |
| - freedreno/registers: Generate python files with reg offsets |
| - freedreno: Add a list of raw magic regs |
| - freedreno: Fully define a730 and a740 device properties |
| - ir3/tests: Use fd_dev_info to infer GPU generation |
| - freedreno/computerator: Fix remaining issues with A7XX |
| - isaspec: Make possible to obtain gpu_id in <expr> blocks |
| - ir3/a7xx: cat5 mode1 has swapped tex/samp ids |
| - ir3/a7xx: Don't multiply global mem instruction's offset by 4 |
| - ir3/a7xx: insert lock/unlock at the end of every compute shader |
| - ir3/a7xx: Add ccinv instruction |
| - ir3/a7xx: Use ccinv for data synchronization |
| - ir3/a7xx: Disable shared consts for a7xx |
| - tu/common: Generalize TU_GENX macro |
| - tu: Basic a7xx support |
| - freedreno/fdl: Set LOSSLESSCOMPEN for image when ubwc is enabled on a7xx |
| - tu/a7xx: Fix geometry shaders |
| - tu/a7xx: Fix tesselation shaders |
| - tu/a7xx: Fix multiview |
| - tu/a7xx: Fix flat shading |
| - tu/a7xx: Fix occlusion query |
| - tu/a7xx: Fix 3d blits after multiview usage |
| - tu/a7xx: Fix CmdDrawIndirectByteCountEXT |
| - tu/a7xx: Disable LRZ |
| - ir3/lower_tex_prefetch: Fix crash with lowered load_barycentric_at_offset |
| - tu: Exclude SP_UNKNOWN_AE73 from reg stomping |
| - tu: Call tu_cs_dbg_stomp_regs with appropriate GPU gen |
| - freedreno/replay: Add limited support for KGSL |
| - freedreno/rddecompiler: Update to handle a7xx |
| - freedreno/replay: Add "print" instr to ir3 asm to be used in replay |
| - freedreno/replay: Add "gpu_print" function for command streams |
| - tu/perfetto: Remove now unnecessary tu_perfetto_util |
| - tu/perfetto: Allow gpu time to be passed into tu_perfetto_submit |
| - tu/kgsl: Fix memory leak of tmp allocations during submissions |
| - tu/kgsl: Support u_trace and perfetto |
| - tu/a7xx: Correctly record timestamps for u_trace |
| - tu/virtio: Fix incorrect call to tu_perfetto_submit |
| - ci: Compile Turnip's virtio kmd in debian-arm64 |
| - freedreno/registers: Refine a7xx push consts registers |
| - ir3,tu: Refactor push consts info plumbing |
| - freedreno: Make possible to specify A7XX feature flags |
| - turnip,ir3: Implement A7XX push consts load via preamble |
| - tu: Add push_consts_per_stage debug option |
| - tu: Fix VK_FORMAT_A8_UNORM_KHR using UBWC when !has_8bpp_ubwc |
| - tu/kgsl: Fix field order in kgsl_command_object init |
| - tu: Fix stale tu_render_pass_attachment::store_stencil with dyn rendering |
| - tu: Zero init tu_render_pass and tu_subpass for dynamic rendering |
| - tu: Disable preamble push consts when they are not used |
| - ir3: Fix values of #wrmask not being compatible with ir3 parser |
| - tu: Count a whole push consts range in constlen for PREAMBLE push consts |
| - freedreno/rddecompiler: Use fd_dev_gen to pass gpu_id to ir3 disasm |
| - freedreno/rddecompiler: Decompile repeated IBs |
| - freedreno: Fix field size of A6XX_TEX_CONST[3].ARRAY_PITCH |
| - tu: Fix reading of stale (V)PC_PRIMITIVE_CNTL_0 |
| |
| Dave Airlie (163): |
| |
| - ci: remove binding model from the asan skips for lavapipe. |
| - gallivm: fix atomic global temporary storage. |
| - llvmpipe: fix fragdata/lastfragdata heuristic a bit more. |
| - nvk: add missing finish calls |
| - nvk: add some initial wsi framework. |
| - nvk: fix header guards to be less generic. |
| - nvk: add bind buffer memory |
| - nvk: Add initial queue |
| - nvk: add cmd buffer framework |
| - nvk: Reset pushbufs on command buffer reset |
| - nvk: reindent descriptor sets to mesa std. |
| - nvk: add initial descriptor pool framework. |
| - nvk: some boilerplate for descriptor sets |
| - nvk: add descriptor set bo allocation. |
| - nvk: implement buffer address. |
| - nvk: descriptor set freeing fix |
| - nvk: move to new command stream generator. |
| - nvk: port the blit and copy code to new command submission. |
| - nouveau/ws: drop the old push generators. |
| - nvk: link in codegen without gallium bits. |
| - nvk: Initial wiring in of the compiler |
| - nvk: Basic descriptor binding |
| - nouveau/vk: add support for compute classes to generator. |
| - nvk: retrieve gpc/mp counts from kernel. |
| - nvk: add support for preamble and tls allocation. |
| - nvk: add record result to cmd_buffer. |
| - nvk: add command stream upload buffer. |
| - nouveau/winsys: Add m2mf/compute objects |
| - nvk: add some basic format wrapping framework |
| - nvk: add some compute limits |
| - nvk: add basic nve4+ compute support. |
| - nvk: fix empty cmd submission. |
| - nouveau/ws: add a push reset just for references. |
| - nouveau/classes: add 906f header support. |
| - nvk: add initial 8/16 byte clears. |
| - nvk: fix pipeline pushbuf sizing |
| - nvk: increase graphics cpu push buffer |
| - nvk: fix depth emission ordering. |
| - nvk: add some limits/features from binary driver. |
| - nvk: add indexed draw support. |
| - nvk: assign vertex locations according to input attrib index |
| - nvk: lower io to temps to avoid output reads in vertex shaders |
| - nvk: handle NULL to destroy descriptor pool |
| - nvk: add basic primitive restart |
| - nvk: fix copy lower address extraction |
| - nvk: fix multiple pipelines failure allocation case. |
| - nvk: init dev->physical_device earlier. |
| - nvk/winsys: store device ptr into bo instead of ptr |
| - nvk: set the device fd |
| - nil: Fix image align and size constraints |
| - nvk: Report image alignments from NIL |
| - nouveau/winsys: allocate unique object handles across channels. |
| - nvk/nil: don't ask for compressed image kind |
| - nvk/barrier: handle host bit. |
| - nvk: add compute support for ampere |
| - nvk: add min_lod to spirv caps. |
| - nvk: fix r32_sint format support |
| - nvk: expose EXT_sampler_filter_minmax |
| - nvk: fix transform feedback crash when optimiser removes things. |
| - nvk: merge tess info between tcs/tes. |
| - nvk: introduce an optimisation loop. |
| - nvk: add support for D32_SFLOAT_S8_UINT |
| - nvk/query: fix push buffer size for copy pool results. |
| - nvk: init image fields for requirements |
| - nvk: handle alignments in device memory |
| - nvk/tess: don't emit patch control points in pipeline |
| - nvk: align geometry clip setting with nvc0 |
| - nvk: fix independent color write masks. |
| - nvk: enable rgb32 texel buffer support |
| - nvk: enable EXT_depth_clip_control |
| - nvk: enable EXT_depth_clip_enable |
| - nvk: always sync internal cmd bufs for vma lifetimes. |
| - nouveau/winsys: add support for the vma bind interfaces |
| - nvk: Add support for sparse buffers |
| - nvk: Add support for sparse images |
| - nvk/queue: add support for syncobjs and sparse binds |
| - nvk: Handle pre-turing indirect buffers with sparse |
| - nvk: enable sparse features |
| - nvk: enable a bunch of external fence/semaphore bits |
| - nvk: enable sparse residency buffer on maxwell+ |
| - nvk: add new internal bo allocation flag. |
| - docs: add two nvk exts to features.txt |
| - zink: use fprintf instead of printf to align the requirements warnings |
| - nvk: align sampler allocation counts with nvidia. |
| - zink: turn off threaded cpu access if not visible. |
| - nvk: add gart forced cmd pool side buffer. |
| - nvk: add cond render upload buffer. |
| - nvk: enable KHR_shader_clock. |
| - nvk: NOUVEAU_WS_BO_LOCAL is a trap. |
| - gallivm: drop unused info parameter |
| - llvmpipe/fs: drop cbuf 0 since it's lowered now. |
| - gallivm/nir: avoid using params->info |
| - llvmpipe/fs: move some tgsi checks in nir path to nir code. |
| - llvmpipe/cs: convert to using tgsi->nir |
| - llvmpipe/cs: drop tgsi for compute/mesh/task shader internals. |
| - lavapipe: use vk_buffer common code. |
| - lavapipe: use vk_buffer_range common code. |
| - llvmpipe/fs: switch to using tgsi->nir instead of handling tgsi |
| - llvmpipe/analyse: drop TGSI path. |
| - llvmpipe/fs: start using nir info in some places. |
| - llvmpipe/fs: drop the simple shader logic |
| - llvmpipe/fs: rewrite output finding using nir. |
| - nvk: add build_id linker argument. |
| - nir/gather: add support for fbfetch and bindless image loads. |
| - llvmpipe/cs: further cleanups after tgsi removal. |
| - llvmpipe: move to nir lowering for fquantize2f16 |
| - rusticl: don't store ptrs to nir_variables across opt passes. |
| - llvmpipe: enable f16 paths on aarch64. |
| - clover/llvm: move to modern pass manager. |
| - nir: use a _clone so users calling their variable clone don't get a warning |
| - nir: rename nir_inline_functions.c to nir_functions.c |
| - nir: use nir_function_instructions_pass in the inliner. |
| - nir: move the libclc lowering over to functions file. |
| - nir/functions: use helper to get function for a name. |
| - nir/functions: put link state into a struct |
| - nir/functions: move linker pass to new helper |
| - nir: add nir function clone |
| - nir: don't inline linked functions |
| - gallivm/nir: split prepasses out to make per-function work easier. |
| - gallivm: rework translator to allow per-impl work. |
| - spirv/nir: parse function control and store in nir. |
| - nir: add driver_functions option to avoid inlining. |
| - nir: add a function usage tracker |
| - rusticl: use cleanup funcs |
| - gallivm: add support for function calling |
| - llvmpipe/cs: add support for function calls. |
| - llvmpipe: enable driver functions. |
| - radv: don't emit event code on video queues. |
| - spirv: use a pointer sized int type for opencl event_t |
| - clover: fix parameter arguments since recent translator changes. |
| - radv/video: take db alignment into account when allocating images. |
| - ac,radeonsi: move vcn enc structs to common |
| - ac,radeonsi: move vcn enc av1 default cdf file to common |
| - nir: add a deref slot counter that handles compact |
| - llvmpipe/linear: drop tgsi path. |
| - gallivm: drop tgsi aos paths. |
| - llvmpipe/nir: call gather info to update inputs read properly |
| - llvmpipe/fs: start converting interp/input paths to nir. |
| - llvmpipe/fs: start converting dervied state to nir based. |
| - llvmpipe/linear: convert to using nir for output. |
| - llvmpipe/linear: move to nir inputs |
| - draw/mesh: reset some user state values on mesh draws. |
| - llvmpipe/fs: fix regression in sample mask handling from tgsi removal. |
| - llvmpipe: reset viewport_index_slot in fb bind |
| - llvmpipe/cs: migrate to generic jit texture from pipe code. |
| - llvmpipe/cs: migrate cs image handle to common jit code. |
| - lavapipe: fix some whitespace in advance of other changes. |
| - lavapipe: fix subresource layers asserts |
| - lavapipe: support host image copying on compressed texture formats |
| - llvmpipe: don't create texture functions for planar textures. |
| - lavapipe: don't emit blit src/dst for subsampled formats. |
| - llvmpipe: don't support planar formats for buffers. |
| - lavapipe: convert sampler to use vk base class. |
| - lavapipe: cleanup copy code to use a local region variable. |
| - lavapipe: start introducing planes structure. |
| - lavapipe: allocate image and image view planes. |
| - lavapipe: handle planes in copies |
| - lavapipe: handle planes in get image sub resource |
| - lavapipe: add descriptor sets bindings for planar images |
| - lavapipe: handle planes in texture lowering. |
| - lavapipe: expose planar ycbcr formats and new ycbcr features |
| - lavapipe + docs: update ycbcr extension enables. |
| - intel-clc: avoid using spirv-linker. |
| |
| David Heidelberg (82): |
| |
| - ci/freedreno: update a530 flakes |
| - ci: build kernel in gfx-ci/linux and just use binaries in Mesa3D CI |
| - ci: update kernel to 6.3.13 |
| - ci/freedreno: add fails introduced by upreving to 6.3.13 |
| - Revert "lima/ci: temporarily disable deqp-egl tests due to timeouts" |
| - ci/radeonsi: stoney arb_timer_query got fixed between kernel 6.3.1..13 |
| - ci/lima: EGL testing was disabled when fp16 fail was removed |
| - ci/freedreno: fix unexpectedpass flake on a630 |
| - ci/freedreno: add another a530 flakes |
| - ci: add quirk for GitLab assuming changes is always true for scheduled runs |
| - ci/microsoft: when re-enabling Windows Farm, always run the container |
| - ci/freedreno: add a530 flakes, remove one fail which recently started passing |
| - ci/panfrost: introduce OpenGL testing with Mali-G57 MP5 on Asurada chromebook |
| - ci/freedreno: cover all texture gather flakes |
| - ci/freedreno: add a530 flake vs-lessthanequal-uvec4-uvec4 |
| - ci/farms: always compare the code against main repository |
| - Revert "ci/farms: always compare the code against main repository" |
| - ci/kernel: add amd patch to prevent crashes when starting X |
| - ci/kdl: remove extra-verbose ls command |
| - ci/nouveau: add 20 minutes timeout to gk20a and align gm20b |
| - ci/freedreno: document another mapbuffer flake on a530 |
| - ci/amd: fix timeouting radeonsi-raven-va-full job |
| - docs/ci: default to port 80 for the caching proxy |
| - docs/ci: update to systemd and used version of the trace for testing |
| - docs/ci: remove default nginx config, which we don't need for proxy |
| - bin/ci: handle errors more gracefully in update_traces_checksum script |
| - ci/freedreno: document another flakes on Adreno 530 |
| - ci: add perfetto into mesa git-cache |
| - ci/panfrost: re-enable t760 and t860 traces as a nightly job |
| - CI: Re-enable G52 Vulkan testing |
| - ci/panfrost: t760-gles is nightly job, test also GLES 3 and 3.1 |
| - ci/zink: Add flake seen in the wild |
| - ci/build: limit debian-build-testing to 30 minutes |
| - ci/amd: add glx\@glx-visuals-depth flake to raven |
| - ci/freedreno: document vs-nested-return-sibling-loop2 flake on Adreno 530 |
| - ci/farms: enabled Microsoft job only when conditions are met |
| - ci/deqp: really remove the uncompressed results.csv file |
| - ci/baremetal: do not install curl, it's already there |
| - ci/baremetal: shorten BM_KERNEL to filename and BM_DTB to name only |
| - ci/freedreno: document another a530 flake batch |
| - ci: remove LAVA prefix from variables which can be used also elsewhere |
| - ci/zink: drop a630, which we currently have very low amount available |
| - ci/freedreno: the tag belongs to the apq8016 only |
| - ci/freedreno: switch references, the farm-rules takes care about this |
| - ci/freedreno: handle disabling farm properly for each FD/Collabora farm |
| - ci/freedreno: another batch of Adreno 530 flakes |
| - gtest: backport ansi color fix |
| - ci: disable Material Testers.x86_64_2020.04.08_13.38_frame799.rdc trace |
| - panfrost/ci: revert Disable T720 |
| - ci/piglit: add extra space on top to prevent single quote getting into URL |
| - ci/freedreno: There is only one King of Town. |
| - ci: switch to 6.4 kernel, improving Adreno 660 reliability |
| - ci/iris: add GL46.arrays_of_arrays_gl.SizedDeclarationsPrimitive timeout |
| - ci/panfrost: add G52 flakes |
| - ci/panfrost: we have enough device, parallelize Vulkan tests |
| - ci/virgl: flakes in functional.draw_buffers_indexed group |
| - ci/freedreno: add another a530 flake |
| - ci/panfrost: add G52 simple_tests.partial_image_pot_same_format_noclear flake |
| - panvk: architecture isn't invalid, just unsupported |
| - panvk: catch unsupported arch in the panvk_physical_device_init |
| - Revert "ci: disable a660 jobs" |
| - docs: add LAVA farm informations |
| - ci: disable Google Freedreno farm, currently timeouting on all jobs |
| - Revert "ci: disable Google Freedreno farm, currently timeouting on all jobs" |
| - ci/farms: no need to check RUNNER_TAG for Collabora farm |
| - ci/traces: extend no-output timeout by 5 minutes |
| - ci/venus: add fragment.32B_in_memory_with_vec4_s32 flake |
| - iris: do not mention specifically clover for OpenCL support |
| - ci/freedreno: disable broke cheza (Adreno 630) runners |
| - ci/bare-metal: correct workaround for R8152 issue while retrieving TFTP data |
| - ci/bare-metal: drop unused imports, sort, use SPDX license |
| - ci/lima: farm is down, disable for now |
| - ci: do not report failed job when flakes reporting fails |
| - ci/freedreno: re-enable Cheza (Adreno 630) runners |
| - ci/traces: upload only missing trace images |
| - ci/traces: keep images for every job except the performance testing |
| - ci/traces: rename upload function to reflect it works with S3 |
| - ci/traces: always export piglit EXTRA_ARGS |
| - ci: ci_marge_queue.py |
| - ci/freedreno: fix copy paste causing a618_gl being run only in manual pipeline |
| - ci/freedreno: disable Adreno 660 Vulkan pre-merge |
| - ci/traces: drop the freedoom-phase2-gl-high.trace |
| |
| David Rosca (70): |
| |
| - radeonsi: Use DIV_ROUND_UP instead of ALIGN_POT |
| - frontends/va: Skip processing buffers already converted with EFC |
| - frontends/va: Don't use EFC with scaling or filtering enabled |
| - radeonsi/vcn: Don't use chroma in AV1 encode with RGB input |
| - frontends/va: Parse H264 SPS for video signal parameters |
| - frontends/va: Parse HEVC SPS for video signal parameters |
| - frontends/va: Add postproc support for converting to full range |
| - radeonsi/vcn: Set H264 video signal parameters in bitstream |
| - radeonsi/vcn: Set HEVC video signal parameters in bitstream |
| - radeonsi/vcn: Enable full/limited range support for H264/HEVC/AV1 |
| - radeonsi/vcn: Fix setting color range in AV1 bitstream |
| - gallium/auxiliary/vl: Fix RGB->YCbCr full range matrix |
| - gallium/auxiliary/vl: Handle UV subsampling in compute_shader_yuv |
| - gallium/auxiliary/vl: Fix blurry output of compute_shader_yuv |
| - frontends/va: Add YUV420 to NV12 postproc conversion |
| - gallium/auxiliary/vl: Fix chroma and blurry output of cs video_buffer |
| - gallium/auxiliary/vl: Fix chroma offset of compute_shader_weave |
| - frontends/va: Also map VAImageBufferType for reading |
| - frontends/va: Alloc interlaced surface for interlaced pics |
| - frontends/vdpau: Alloc interlaced surface for interlaced pics |
| - radeonsi: Don't prefer interlaced for video decode |
| - ci/amd: Skip VAAPI CreateSurfacesWithConfigAttribs/1121 test |
| - frontends/va: Don't allow multi-plane derive without driver support |
| - frontends/va: Init view_resources array in vlVaPut/GetImage |
| - radeonsi: Copy all planes with multi-plane staging textures |
| - radeonsi: Enable PIPE_VIDEO_CAP_SUPPORTS_CONTIGUOUS_PLANES_MAP |
| - ci/amd: Skip all VAAPI tests that creates too many huge surfaces |
| - radeonsi/vcn: Update rate control when framerate changes with HEVC |
| - frontends/va: Ignore requested size when creating VAEncCodedBufferType |
| - gallium/auxiliary/vl: Set correct csc matrix in set_buffer_layer |
| - radeonsi/vcn: Fix leaking fences in decode |
| - gallium/auxiliary/vl: Add BT.709 full csc matrix |
| - frontends/va: Set csc matrix in postproc |
| - gallium/auxiliary/vl: Don't set csc matrix in video_buffer/rgb_to_yuv_layer |
| - frontends/va: Add BT.709 as supported postproc color standard |
| - Revert "radeonsi/vcn: add an exception of field case for h264 decoding" |
| - gallium/auxiliary/vl: Set vertex element src_stride in vl_deint_filter |
| - gallium/auxiliary: Fix util_compute_blit half texel offset with scaling |
| - gallium/auxiliary/vl: Map range when updating constants |
| - gallium/auxiliary/vl: Clamp coordinates in compute shaders |
| - gallium/auxiliary/vl: Support chroma sample location in compute shaders |
| - frontends/va: Support chroma sample location in postproc |
| - frontends/va: Flush after unmapping VAImageBufferType |
| - frontends/va: Parse chroma sample location in H264/HEVC SPS |
| - radeonsi/vcn: Set H264/HEVC chroma sample location in bitstream |
| - radeonsi/vcn: Don't hang GPU when using DCC surface as encoder input |
| - frontends/va: Track surfaces in context |
| - frontends/va: Destroy fences when destroying surface or context |
| - radeonsi/vcn: Implement destroy_fence vfunc |
| - frontends/va: Process VAEncSequenceParameterBufferType first in vaRenderPicture |
| - frontends/va: Set default rate control values once when creating encoder |
| - gallium/auxiliary/vl: Add RGB to YUV compute shader |
| - gallium/auxiliary/vl: Use chroma offset in YUV to RGB weave compute shader |
| - gallium/auxiliary/vl: Fix YUV to RGB bob compute shader deinterlacing |
| - gallium/auxiliary/vl: Only map the shader constants buffer in render |
| - frontends/va: Add High Quality preset mode |
| - radeonsi/vcn: Add High Quality encoding preset for AV1 |
| - radeonsi: Fix plane size in si_copy_multi_plane_texture |
| - frontends/va: Implement vaMapBuffer2 |
| - frontends/va: Fix locking in vlVaBeginPicture |
| - frontends/va: Parse H264 SPS for max_num_reorder_frames |
| - util/vl: Fix vl_rbsp parser with bitstreams without emulation bytes |
| - frontends/va: Fix parsing packed headers without emulation bytes |
| - radeonsi/vcn: Add encode support for H264 B-frames |
| - frontends/va: Map decoder and postproc surfaces for reading |
| - radeonsi: Fix offset for linear surfaces on GFX < 9 |
| - gallium/auxiliary/vl: Fix coordinates clamp in compute shaders |
| - gallium/auxiliary: Fix coordinates clamp in util_compute_blit |
| - gallium/auxiliary/vl: Scale dst_rect x0/y0 when rendering chroma plane |
| - util/rbsp: Fill bits twice if reading more than 16 bits |
| |
| Derek Foreman (2): |
| |
| - vulkan/wsi: Allow binding presentation_timing when software rendering |
| - vulkan/wsi: warn about unset present_mode in PresentModeCompatibilityExt |
| |
| Dmitry Baryshkov (3): |
| |
| - gallium: move kmsro definition to the bottom of the file |
| - gallium: unbreak kmsro/freedreno case |
| - tu: Pass real size of prime buffers to allocator |
| |
| Dmitry Osipenko (3): |
| |
| - util/cache_test: Re-add test for disabled cache |
| - util/cache_test: Fix disabled cache test using SHADER_CACHE_DISABLE_BY_DEFAULT |
| - util/cache_test: Add test for get/put() with disabled cache |
| |
| Dor Askayo (1): |
| |
| - nouveau: add exported GEM handles to the global list |
| |
| Dr. David Alan Gilbert (6): |
| |
| - rusticl/core: Add profiling time storage (queued) to event |
| - rusticl: Wire the 'queued' profiling time up |
| - rusticl: Wire the 'submit' profiling time up |
| - rusticl: Wrap pipe queries |
| - rusticl: Wrap pipe query reads |
| - rusticl: Wire the 'start' and 'end' profilng times up |
| |
| Dylan Baker (4): |
| |
| - VERSION: bump to 23.3.0-devel |
| - docs: Update release calendar for 23.2.0-rc1 |
| - docs: truncate feature list for 23.3-devel |
| - meson: use a single dependency call for lua |
| |
| Echo J (5): |
| |
| - nvk: Fix some cast defines |
| - nvk: Add A8B8G8R8_*_PACK32 format support |
| - nvk: Add bufferImageGranularity limit |
| - nvk: Reset offset value in ResetDescriptorPool |
| - nil: Add A4B4G4R4_UNORM format support |
| |
| Emma Anholt (111): |
| |
| - ci/radv: Clarify when the ANGLE GS failures started happening. |
| - ci: Uprev ANGLE to 0518a3ff4d4e ("Android: Simplify power metrics collection") |
| - ci/tgl: Improve the info for ANGLE's MSAA regression on TGL. |
| - ci/tu: Add more crash cases for the multithreading bugs caught on a630. |
| - ci/tu: Mark descriptor_buffer.basic.limits as failing in gmem too. |
| - ci/tu: Drop some xfails for !24086 |
| - tu: Fix data race in userspace VMA management. |
| - ci/a5xx: Add another GPU hanging piglit test to the skips. |
| - Revert "ci: Disable nouveau CI" |
| - nvk: Avoid strict aliasing warning in the pushbuffer encoding. |
| - nvk: Fix uninitialized result usage in NVK_DEBUG_ZERO_MEMORY. |
| - nvk: Fix unused result warnings in pushbuf resets. |
| - nvk: Remove duplicate (disabled) point sprite setup. |
| - nvk: Fix missing init of the stages to sync against. |
| - nvk: Use depth_clamp_enable to select PIXEL_*_Z_CLAMP. |
| - nouveau/winsys: Fix an undefined use in the error path. |
| - nvk: Quiet a compiler warning. |
| - nvk: Clean up redundant vendor checking for physical device creation. |
| - nvk: Add support for probing as a platform device. |
| - nvk: Disable shaderStorageImageReadWithoutFormat pre-Maxwell. |
| - freedreno/a5xx: Fix border color structure size. |
| - freedreno/a5xx: Skip emitting unused texture descriptors for images. |
| - freedreno/ir3: Move pvtmem per-fiber size alignment to the compiler. |
| - ci/freedreno: Drop a bunch of stale a530 xfails. |
| - ci/freedreno: Sort another a530 xfail with its friends. |
| - ci/freedreno: Update comments for some a530 xfails. |
| - ci/freedreno: Add some more db820c xfails. |
| - freedreno/devices: Move fibers_per_sp to the common info struct. |
| - freedreno/devices: Set num_sp_cores explicitly for pre-gen6. |
| - freedreno/a6xx: Move pvtmem allocation to ir3_gallium. |
| - freedreno/a3xx: Add the shift for MEMSIZEPERITEM according to db410c docs. |
| - freedreno/a5xx: Refactor SHADER_OBJ emit to a helper function. |
| - freedreno/a5xx: Set num_sp_cores and set PC/VFD_POWER_CNTL accordingly. |
| - freedreno/a5xx: Add private mem support. |
| - freedreno/cffdec: Fix decode on pixel 2 blob's COMPUTE_CHECKPOINT |
| - ci/freedreno: Add a regression test for decoding a540 blob's compute shaders. |
| - freedreno: Fix crashdec pre-a6xx. |
| - freedreno/a5xx: Skip SSBO emit when none are enabled. |
| - vulkan/util: Make multialloc succeed with 0 allocations. |
| - turnip: Track the first/last subpass an attachment is used in. |
| - turnip: Skip emitting empty CP_COND_REG_EXEC. |
| - turnip: Save the renderpass's clear values in the cmdbuf state. |
| - turnip: Move gmem clears and loads to the first subpass that uses them. |
| - turnip: Move sysmem clears to the first subpass that uses them. |
| - ci/freedreno: Skip some tests on a5xx that destabilize other tests. |
| - freedreno/a3-5xx: Don't try to emit ISAM for SSBO loads. |
| - ci/turnip: Add a660 VK coverage. |
| - disk_cache: Disable the "List" test for RO disk cache. |
| - blorp: Disable unaligned partial HIZ fast clears for HIZ_CCS too. |
| - intel/fs: Move defin/defout setup to the start of the loop. |
| - intel/fs: Move the defin[]/defout[] screening up to livein[]/liveout[] setup. |
| - intel/fs: Simplify compute_start_end(). |
| - ci/freedreno: Add another excessive-constlen UBO skip. |
| - ci/anv: Drop DEQP_VER:vk setting. |
| - ci/anv: Drop "-vk" from the job name. |
| - ci/anv: Add a manual full VK run for TGL. |
| - ci/anv: Add testing on JSL. |
| - freedreno: Build drm subdir before perfcntrs, which uses it. |
| - ci/intel: Add various updates from our nightly runs. |
| - ci/virgl: Disable virgl-iris-traces. |
| - ci/zink: Add a few updates for anv/tgl from the nightly runs. |
| - ci/fastboot: Use a case insensitive match for a fastboot line. |
| - ci/etnaviv: Skip some tests that hang the GPU and knock out other tests. |
| - ci/etnaviv: Drop some gc2k flakes that I think are resolved. |
| - ci/anv: Drop incorrect xfail addition for TGL |
| - ci/anv: Drop the 16bit.scalar.13 skip. |
| - ci/etnaviv: Minor xfail/flake polishing. |
| - ci/etnaviv: Skip a GLES2 test that times out the asan job. |
| - ci/zink: Skip more doubles tests on anv that flake at 3 minute timeouts. |
| - ci/docker: Clear the results file before starting a new deqp test run. |
| - ci/crocus: Add a related flake to a known one. |
| - ci/etnaviv: return gl-1.4-tex1d-2dborder as a known flake |
| - ci/crocus: Add known piglit flakes |
| - ci/hasvk: Add a bunch of new CTS border color fails. |
| - i915: Re-clang-format and enforce it in CI. |
| - i915: Print the relevant counts vs limits when throwing errors. |
| - i915: Don't log I915_DEBUG=fs output for blit shaders. |
| - i915: Save fragment program compile error messages in the fragment shader. |
| - i915: Do a test compile at glLinkShader() time. |
| - i915: Make exceeding tex indirect count fatal. |
| - i915: Use nir_group_loads() to reduce texture indirection phases. |
| - ci/crocus: Generalize the drawarrays-vertex-count flakes. |
| - ci/zink: Skip 3-minute-long glx-visuals timeouts. |
| - ci/zink: Skip dmat[34] op tests in general, as well |
| - ci/crocus: Disable flaky unvanquished-ultra trace |
| - nir/print: Decode system values in the variable declarations. |
| - ci/zink: Add a TGL flake that's showed up in nightlies recently. |
| - ci/radeonsi: Drop an xfail for vangogh. |
| - i915: Make I915_DEBUG=fs log shaders that fail to link due to CF. |
| - nir: Flatten ifs with discards in nir_opt_peephole_select for HW without CF. |
| - glsl: Remove lower_discard(). |
| - ci/zink: Only test half of piglit pre-merge on anv. |
| - ci: Stop doing internal retries in bare-metal. |
| - ci/bare-metal: Drop the 2 vs 1 exit code from poe_run. |
| - ci/bare-metal: Default our boards to a 20-minute timeout for the whole job. |
| - ci/iris: Drop parallel on kbl piglit to 2. |
| - ci/freedreno: Fold a630_egl into a630_gl. |
| - ci/freedreno: Move skqp testing to a618. |
| - ci/zink: Cut zink-lvp coverage in half. |
| - ci/freedreno: Generalize the implicit_unmap timeouts. |
| - ci_run_n_monitor: Poll mesa/mesa and user/mesa for pipelines at the same time. |
| - glx: Delete support for GLX_OML_swap_method. |
| - ci: drop skip for glx-swap-copy. |
| - dri: Drop a duplicate mesa vs pipe format table. |
| - docs/ci: Drop old instructions for farm disabling |
| - docs/ci: Add some links in the CI docs to how to track job flakes |
| - glsl: Remove int64 div/mod lowering. |
| - llvmpipe: Set nir_lower_dround_even. |
| - nir: Add nir_lower_dsign as 64-bit fsign lowering. |
| - glsl: Retire dround lowering. |
| - ci_run_n_monitor: Always resolve --rev arguments for looking up pipelines. |
| |
| Eric Engestrom (194): |
| |
| - ci: avoid running hardware jobs if lint fails - now on LAVA too! |
| - ci: avoid running hardware jobs if lint fails - now on Windows too! |
| - ci: replace copy of nouveau rules with reference |
| - ci: drop leftover kernel configs |
| - ci: use !reference for scheduled_pipeline retry rule |
| - ci: add .llvmpipe-manual-rules and use it |
| - ci: add .gallium-core-rules and use it instead of gallium_core_file_list anchor |
| - ci: replace llvmpipe_file_list anchor with reference |
| - ci: replace softpipe_file_list anchor with reference |
| - ci: replace lavapipe_file_list anchor with reference |
| - ci: replace iris_file_list anchor with reference |
| - ci: replace radv_file_list anchor with reference |
| - ci: replace radeonsi_file_list anchor with reference |
| - ci: replace virgl_file_list anchor with reference |
| - ci: move etnaviv files rules to src/etnaviv/ci/gitlab-ci.yml |
| - ci: move freedreno files rules to src/freedreno/ci/gitlab-ci.yml |
| - ci: move nouveau files rules to src/gallium/drivers/nouveau/ci/gitlab-ci.yml |
| - ci: move panfrost files rules to src/panfrost/ci/gitlab-ci.yml |
| - ci: move broadcom files rules to src/broadcom/ci/gitlab-ci.yml |
| - ci: move lima files rules to src/gallium/drivers/lima/ci/gitlab-ci.yml |
| - ci: move amd files rules to src/amd/ci/gitlab-ci.yml |
| - ci: move microsoft files rules to src/microsoft/ci/gitlab-ci.yml |
| - ci: move zink files rules to src/gallium/drivers/zink/ci/gitlab-ci.yml |
| - ci: move virtio files rules to src/virtio/ci/gitlab-ci.yml |
| - ci: move intel files rules to src/intel/ci/gitlab-ci.yml |
| - ci: move virgl files rules to src/gallium/drivers/virgl/ci/gitlab-ci.yml |
| - ci: move llvmpipe files rules to src/gallium/drivers/llvmpipe/ci/gitlab-ci.yml |
| - ci: move softpipe files rules to src/gallium/drivers/softpipe/ci/gitlab-ci.yml |
| - ci: move lavapipe files rules to src/gallium/drivers/lavapipe/ci/gitlab-ci.yml |
| - ci: delete install.tar after extracting it to avoid re-uploading it |
| - docs: add release notes for 23.1.4 |
| - docs: add sha256sum for 23.1.4 |
| - docs: update calendar for 23.1.4 |
| - asahi: drop unused include paths |
| - ci/lint: deduplicate formatting check jobs |
| - ci/lint: also print a diff for rust format issues |
| - ci: allow hw jobs even if lint jobs fail for non-Marge pipelines |
| - ci: print rustfmt's version |
| - ci: print clang-format's version |
| - bin/ci_run_n_monitor: get git sha from pipeline if specified, instead of requiring --rev to match |
| - lavapipe/ci: use tighter changes: rules |
| - ci: add a 10min job timeout to formatting checks |
| - ci: reduce bare-metal retries of poe_run to only 3 attempts |
| - broadcom/ci: reduce vc4-rpi3-gl timeout to 30min (instead of 1h) |
| - broadcom/ci: reduce v3d-rpi4-gl timeout to 30min (instead of 1h) |
| - broadcom/ci: reduce v3d-rpi4-traces timeout to 30min (instead of 1h) |
| - broadcom/ci: reduce v3dv-rpi4-vk timeout to 30min (instead of 1h) |
| - ci: add .core-rules to .gallium-core-rules |
| - ci: drop rule for non-existent src/include/ |
| - docs: add release notes for 23.1.5 |
| - docs: add sha256sum for 23.1.5 |
| - docs: update calendar for 23.1.5 |
| - ci: include some timing information in the git cache download script |
| - docs/ci: stop trying to enumerate drivers that are tested using VK-GL-CTS |
| - docs/ci: in paragraph about the CI being overwhelmed, mention our tool to help with that |
| - docs/ci: drop mention of build systems variants in the CI |
| - docs/ci: expand the description of test suites |
| - bin: add wrapper to run scripts in a python venv |
| - bin/ci/ci_run_n_monitor: use venv wrapper |
| - bin/ci/gitlab_gql: use venv wrapper |
| - bin/ci/update_traces_checksum: use venv wrapper |
| - bin/pick-ui: use venv wrapper |
| - ci: include mold in x86_64_test-base & rootfs images |
| - ci: use mold to build deqp |
| - zink/ci: set the default timeout for zink jobs to 30min instead of 1h |
| - egl: make _eglFilterConfigArray static |
| - egl: fixup _eglFilterConfigArray() params and drop _eglFallbackMatch() wrapper |
| - ci: build nvk |
| - ci: document max image tag length |
| - docs/radv: mark VK_EXT_tooling_info as implemented |
| - docs/radv: mark VK_INTEL_shader_integer_functions2 as implemented |
| - git-blame-ignore-revs: repeat instruction on how to enable to avoid having to look for it |
| - git-blame-ignore-revs: add radv formatting commit |
| - git-blame-ignore-revs: add pvr formatting commit |
| - meson: fix indentation |
| - docs/v3dv: mark direct display extensions as implemented |
| - ci: reorder vk drivers alphabetically in debian-vulkan job |
| - ci: build hasvk in debian-vulkan job |
| - ci/zink+radv: set a timeout of 2x the normal runtime |
| - amd/ci: drop duplicate test expectations |
| - panfrost: upcast uint8/uint16 before shifting them beyond their range |
| - ci/a530: document piglit flake |
| - docs: add release notes for 23.1.6 |
| - docs: add sha256sum for 23.1.6 |
| - docs: update calendar for 23.1.6 |
| - docs: add one more 23.1.x release |
| - ci: rename \*.log to \*.txt to work around gitlab bug |
| - ci/freedreno: reuse freedreno_gl_file_list instead of re-definining it |
| - egl: bump extension string length |
| - vc4: drop duplicate .lower_ldexp |
| - zink: fix format in zink_make_{image,texture}_handle_resident() |
| - v3dv: fix VK_PIPELINE_ROBUSTNESS_{BUFFER,IMAGE}_BEHAVIOR_DEVICE_DEFAULT_EXT copy/paste typo |
| - v3dv: fix copy/pasted type of \`sample` |
| - v3dv: fix shader stage name in error message |
| - v3d/qpu: fix type of function argument |
| - ci/deqp: backport fix for dEQP-EGL.functional.wide_color.*_888_colorspace_* |
| - ci/farm-rules: fix missing valve-infra jobs in scheduled pipelines |
| - bin/ci_run_n_monitor: error out if both --project and --pipeline-url are passed |
| - ci: document farm rules |
| - ci/b2c: skip install.tar extraction if the tarball is not present |
| - ci/b2c: don't allow failures in test script preparation |
| - ci/b2c: assert that install folder is present whether or not the tarball was extracted |
| - ci/amd: split the polaris10 rules into one for each farm |
| - ci: skip containers & build jobs when disabling a farm |
| - docs: add release notes for 23.1.7 |
| - docs: add sha256sum for 23.1.7 |
| - docs: update calendar for 23.1.7 |
| - docs: add one more 23.1.x release |
| - ci: taking igalia farm offline |
| - ci/b2c: drop logic to remove install.tar |
| - ci: drop clover leftover |
| - Revert "ci: taking igalia farm offline" |
| - bin/ci_run_n_monitor: print in which repo we're looking for the pipeline |
| - bin/ci_run_n_monitor: automatically pick MR pipelines when they exist |
| - ci: remove duplicate fork pipeline in MRs |
| - ci_run_n_monitor: add comment to explain "MR > fork" logic |
| - ci: don't run everything just because a farm gets re-enabled |
| - ci/windows: centralize definition of windows runners tags |
| - ci/windows: add windows docker runner tags to .windows-docker-vs2019 |
| - ci/windows: drop build rules from test jobs |
| - ci: document which image tags need to be bumped when updating piglit |
| - ci: document which image tags need to be bumped when updating {alpine,debian,fedora}/x86_64 |
| - ci/farm-rules: rename .disable-farm-mr-rules to make it clear it's only about MRs |
| - ci/farm-rules: re-add "run every container and build job when a farm gets re-enabled" |
| - ci/zink: drop redundant \`MESA_LOADER_DRIVER_OVERRIDE: zink` |
| - docs: add release notes for 23.1.8 |
| - docs: add sha256sum for 23.1.8 |
| - docs: update calendar for 23.1.8 |
| - docs: add another 23.1.x |
| - ci: limit build jobs to 30min so that they can retry when they go wrong |
| - docs: drop outdated and redundant note about the minimum meson version |
| - ci/zink+radv: specify that zink-radv-navi10-valve should run in the mupuf farm |
| - ci/zink+radv: bump the timeout of zink-radv-navi10-valve by 10 minutes |
| - docs: add calendar for 23.3 |
| - ci: unify container and build jobs rules |
| - docs/meson: drop mention that our meson is ready |
| - ci/docs: drop extra overwritten rules |
| - ci/zink+radv: document flake |
| - docs: document the merging process and what is allowed or not |
| - ci: drop unused shader-db clone + build from alpine image |
| - ci: drop unused shader-db clone + build from fedora image |
| - ci: move shader-db clone/build into its own script |
| - ci/deqp-runner: fix indentation |
| - ci/deqp-runner: restore exit-on-error after getting deqp-runner's exit code |
| - ci: fix shebang in build-deqp-runner.sh |
| - docs: add release notes for 23.1.9 |
| - docs: add sha256sum for 23.1.9 |
| - docs: update calendar for 23.1.9 |
| - ci: drop unused ephemeral packages in alpine image |
| - docs/ci: rewrite the "farm maintenance ^ other change" rule to mean what we actually meant |
| - ci: skip dEQP-VK.api.driver_properties.conformance_version for everyone |
| - pick-ui: use assignment expressions |
| - pick-ui: use more expressive variable names |
| - pick-ui: add \`Backport-to: XX.Y` nomination |
| - v3d/ci: move traces job to wayland |
| - ci: print deqp version in the job log |
| - ci/b2c: move to the shiny new \`gfx-ci/ci-tron` repo |
| - ci/b2c: use latest mesa-trigger image |
| - include/dri_interface.h: restore define mistakenly removed in !25587 |
| - ci_run_n_monitor: dependency jobs must always be started |
| - util/xmlconfig: drop driInjectDataDir() now that DRIRC_CONFIGDIR is always supported |
| - util/xmlconfig: inline datadir |
| - ci/b2c: change artifacts path to match baremetal and LAVA |
| - VERSION: bump for rc1 |
| - .pick_status.json: Update to e64a97694ac9dc97f65e1a8e91a5c9789109fd2c |
| - .pick_status.json: Update to 4cdd094ae1e97d857a6b9dbc291d7bbe6ea266ac |
| - .pick_status.json: Update to e4a1bc70dd739ca8addddc940af08312b038e288 |
| - .pick_status.json: Update to faed5d647f2416bb0ce3a9d33a3955169c70dc52 |
| - VERSION: bump for 23.3.0-rc2 |
| - .pick_status.json: Update to 1f1ec1c6bcc2a32a3c1df8c2cc7a2f4e7139b7ec |
| - .pick_status.json: Mark 8dda860f83ac30d042dc6beb4438cc925d1fd130 as denominated |
| - .pick_status.json: Update to 7d6f9ccfbeab050c26775d5e03578a01526cbfcb |
| - .pick_status.json: Update to aa33ca0a52591961f8ae01dc253354462ed17c18 |
| - .pick_status.json: Update to a77ea9555aa00cc12f3d1c440252e940ff552500 |
| - .pick_status.json: Mark 227300345ed38377190b0eaf08694d5c42ee7e60 as denominated |
| - VERSION: bump for 23.3.0-rc3 |
| - .pick_status.json: Update to 56451ce773c11094a8c08fdc6b500bb8bdcf37e1 |
| - .pick_status.json: Mark fa7ec4226bdf48bf63438e303af83ecd58ec95f2 as denominated |
| - .pick_status.json: Update to 08f851f4361cfbdb211dc70d03cf3ebff331c3ee |
| - .pick_status.json: Update to 03a7cb261828b350dd9b56bd74850197ca9eba33 |
| - .pick_status.json: Mark fcfa68a632e5711cc657b103c9a0384928e9bf49 as denominated |
| - VERSION: bump for 23.3.0-rc4 |
| - .pick_status.json: Update to f05688aa3299a27430119b27e45181a6f415bff8 |
| - egl/dri2: increase NUM_ATTRIBS to fit all the attributes |
| - .pick_status.json: Update to f39ed0063b4cd3e5a71efad2d43ce31f574c698d |
| - .pick_status.json: Update to b07a58157d0b110dbc09a42cffe7046c3200dd3b |
| - VERSION: bump for 23.3.0-rc5 |
| - .pick_status.json: Update to f843b14c171299e1696ca6d971ccaa496f60c3ab |
| - intel/perf: fix regex escaping |
| - intel/ci: fix .hasvk-manual-rules |
| - VERSION: bump for 23.3.0 |
| - Revert "VERSION: bump for 23.3.0" |
| - docs: add release notes for 23.3.0 |
| - Revert "docs: add release notes for 23.3.0" |
| |
| Erico Nunes (10): |
| |
| - lima/ppir: don't optimize loads with different block successors |
| - lima/ppir: convert to nir_legacy |
| - lima/gpir: switch to register intrinsics |
| - egl/drm: fix EGL_EXT_buffer_age with gbm contexts |
| - lima: fix plbu block stride calculation |
| - ci: disable lima LAVA lab for maintance |
| - Revert "ci: disable lima LAVA lab for maintance" |
| - v3dv: allow headless device without display device |
| - Revert "ci/lima: farm is down, disable for now" |
| - v3dv: Rework to remove drm authentication for wsi |
| |
| Erik Faye-Lund (30): |
| |
| - meson: report with_glvnd in summary |
| - docs: upgrade bootstrap to 5.3.1 |
| - docs: expand mobile-menu without js |
| - panfrost: delete stale editorconfig file |
| - docs/panfrost: link to lima |
| - docs/panfrost: use code-blocks with wrapping for long blocks |
| - docs/panfrost: use math-role to denote powers of two |
| - docs: fix linkcheck |
| - docs: update a few links to https |
| - docs: update anchor for link |
| - docs: update link to git-wiki |
| - docs: link to upstream etnaviv |
| - docs: apply some trivial redirects |
| - docs: use doc-role when linking to lists article |
| - docs: keep up with intels ever-moving documentation |
| - docs: mark some redirects as allowed |
| - docs: only link to old docs from html |
| - docs: use html_static_path for static files |
| - ci/etnaviv: update ci expectation |
| - ci/etnaviv: allow failure on failing test |
| - zink: fix wording of warning |
| - ci/etnaviv: move failure to flake |
| - meson: add wayland-protocols from meson wrapdb |
| - util/xmlconfig: add an env-var for overriding drirc search dir |
| - meson: add src/util to the drirc search path |
| - docs/relnotes: remove cruft from end of lines |
| - docs/ci: escape at-symbols |
| - docs/relnotes: escape some at-symbols |
| - bin/gen_release_notes: escape at-symbols |
| - panfrost: use perf_debug instead of open-coding |
| |
| Faith Ekstrand (809): |
| |
| - nv50/ir: Convert to new-style NIR registers |
| - nv50/ir: Support vector movs |
| - intel/fs: Add support for new-style registers |
| - intel/vec4: Assume get_nir_dest() provides a sane write-mask |
| - intel/vec4: Add support for new-style registers |
| - intel: Switch to intrinsic-based registers |
| - intel/fs: Drop support for nir_register |
| - intel/vec4: Drop support for nir_register |
| - anv,hasvk,iris: sampler_prog_key::swizzles is only used on crocus |
| - nir: Properly handle divergence for load_reg |
| - nir/trivialize: Maintain divergence information |
| - nir/trivialize: Trivialize cross-block loads |
| - vc4: Convert to new-style NIR registers |
| - nir/schedule: Support load/store_reg |
| - broadcom/compiler: Convert to new-style NIR registers |
| - intel/fs: Use write masks from store_reg intrinsics |
| - intel/fs: Rework the overlapping mov/vec case |
| - intel/fs: Assume NIR is in SSA form |
| - nir: Add a backend_flags field to nir_tex_instr |
| - intel/fs: Add a parameter to speed up register spilling |
| - nir/builder: Allow tex helpers on image types |
| - nir/builder: Add a nir_txs_deref() helper |
| - vulkan: Add a core vk_buffer_view struct |
| - vulkan: Add a more direct way to use a NIR shader |
| - vulkan: Add a vk_query_pool base object |
| - vulkan: Add common vkCmdBegin/EndQuery wrappers |
| - vulkan/format: Add the remaining 1-plane YCbCr formats |
| - vulkan: Add a core vk_sampler struct |
| - nv50/nir: Lower to scratch AFTER optimization |
| - nouveau: Allow GLSL_SAMPLER_DIM_SUBPASS* |
| - nouveau/nir: Implement support for compact arrays |
| - nouveau/codegen: Handle/indirect goes before sample index |
| - nouveau/codegen: Use a NULL format for PIPE_FORMAT_NONE for images |
| - nouveau/codegen: Don't convertSurfaceFormat for unknown formats |
| - nv50/ir: Run nir_divergence_analysis before out-of-SSA |
| - anv: Use vk_sampler |
| - anv: Use vk_buffer_view |
| - vulkan: Add init/finish helpers for vk_query_pool |
| - anv: Use vk_query_pool |
| - anv: Use the common versions of vkBegin/EndQuery() |
| - nir/builder: Don't assume we have compiler options |
| - Revert "mesa, compiler: Move gl_texture_index to glsl_types.h" |
| - Revert "compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h" |
| - vulkan: Use VkBufferUsageFlags2 in vk_buffer |
| - clang-format: Set ColumnLimit to 78 |
| - nvk: Implement EnumerateInstanceVersion |
| - nvk: Add stub implementations of VkImage and VkImageView |
| - nvk: Add stub implementation of VkSampler |
| - nvk: Add a stub implementation of VkBuffer |
| - nvk: Implement VkDescriptorSetLayout |
| - nvk: Implement VkPipelineLayout |
| - nvk: Add initial descriptor set lowering |
| - nvk: Implement vkUpdateDescriptorSets |
| - nvk: Expose nvk_descriptor_stride_align_for_type |
| - nvk: Re-format descriptor set layouts |
| - nvk: Re-format pipeline layouts |
| - nvk: Re-format descriptor sets some more |
| - nvk/buffer: Take an offset in nvk_buffer_address |
| - nvk/buffer: Add a push_buffer_ref helper |
| - nvk/copy: Use nvk_buffer_address in CmdCopyBuffer |
| - nvk/image: Add image address helpers |
| - nvk/copy: Use nvk_image_base_address() |
| - nvk: Add an nvk_device_physical helper |
| - nvk: Add a skeleton for pipelines |
| - nvk: Re-arrange nvk_descriptor_set.h a bit |
| - nvk: Reformat nvk_nir_lower_descriptors |
| - nvk: Add a couple descriptor set address helpers |
| - nvk: Move nvk_cmd_pool cast definitions |
| - nvk: Rework whitespace in nvk_cmd_buffer.c |
| - nvk: Add a root descriptor table |
| - nvk: Fetch descriptor set addresses from the root table |
| - nvk: Re-arrange nir_lower_explicit_io a bit |
| - nvk: Lower load_global_constant_offset |
| - nvk: Drop image_view_init |
| - nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable |
| - nvk: Allow R32_UINT |
| - nvk: Mark nvk_push_descriptor_set_ref() inline |
| - nvk: Add a descriptor table data structure |
| - nvk: Copy in the nouveau TIC format table |
| - nvk/image_view: Reformat and fix Create/DestroyImageView |
| - nvk: Add an image descriptor table to the device |
| - nvk: Fill out TIC table entries for image views |
| - nvk: Set b->cursor when lowering image intrinsics |
| - nvk: Unify descriptor loading in lower_descriptors |
| - nvk: Re-format nvk_image_view.h a bit |
| - nvk: Re-format nvk_buffer.c a bit |
| - nvk: Add a stub implementation of buffer views |
| - nvk: Make texture descriptors a bit more acceptable to codegen |
| - nvk: GART os host-cache-coherent |
| - nvk: Reserve a null image descriptor |
| - nvk: Rework descriptor writes |
| - nouveau: Add stubs for an image layout library called NIL |
| - nil: Create images |
| - nil: Add the TIC format table from nouveau |
| - nil: Add a nil_view and code to fill out TIC entries |
| - nvk: Add an nvk_get_format helper |
| - nvk: Use helpers for push_ref |
| - nvk: Align arguments consistently in copy/blit code |
| - nvk: Move Fill/UpdateBuffer to nvk_cmd_copy |
| - Revert "nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable" |
| - nvk: Manually offset for array layers in copy/blit |
| - nvk: Convert to using NIL for image layout |
| - nvk: Re-indent image entrypoints |
| - nvk: Implement VkGetImageSubresourceLyout |
| - nvk: Reset and properly clean up command buffer upload areas |
| - nvk: Rework format features queries |
| - nvk: Add a more competent GetPhysicalDeviceImageFormatProperties |
| - nvk: Support compressed images in copy commands |
| - nvk: Drop vk_sync BO refs after push_submit |
| - nil: Drop miptail support for now |
| - nil: Don't minify image dimensions when setting up TIC |
| - nil: Refactor TIC image extent setup |
| - nil: Fix image array layer alignments |
| - nvk: Teture pool sizes are maximums not sizes |
| - nvk: Re-format nvk_sampler.c |
| - nvk: Implement samplers |
| - nil: Add a helper for filling out buffer TIC entries |
| - nvk: Move is_storage_image_format to nvk_format.c |
| - nvk: Implement buffer views |
| - nvk: Advertise KHR_dedicated_allocation |
| - nvk: Use the correct root descriptor table size for CmdDispatch |
| - nvk: Add support for dynamic buffers |
| - nvk: Better advertise image format features |
| - nvk: Advertise descriptor array indexing |
| - nvk: Advertise non-zero descriptor set limits |
| - nvk: Use a descriptor type instead of a hand-rolled thing |
| - nvk: Handle cube storage images properly |
| - nvk: Load the requested descriptor size |
| - nvk: Implement push constants |
| - nvk: Properly indent a comment |
| - nvk: Fix descriptor offset alignment |
| - nvk: Use a switch for descriptor types in load_descriptor |
| - nvk: Support inline uniform blocks |
| - nvk: Delete the storage TIC in nvk_image_view_destroy |
| - nvk: Assert that we don't double-free descriptors |
| - nvk: Initial vkCmdClearImage support |
| - nvk: Unconditionally zero image format properties |
| - nvk: No-op sparse image format properties |
| - nvk: Advertise minUniformBufferOffsetAlignment |
| - nvk: Rework OOM handling for descriptor pools |
| - nvk: Bind immutable samplers on descriptor set creation |
| - nvk: Padd shader BOs by 4K to avoid I-cache overflow |
| - nvk: Include nvk_private.h in everything |
| - nvk: Make image/buffer address helpers const |
| - nouveau/push: Add a P_INLINE_FLOAT helper |
| - nvk: Init WSI after setting up supported_sync_types |
| - nouveau/parser: Fix an integer overflow and a typo |
| - nouveau/parser: Properly dump most arrays used by 3D |
| - nouveau/parser: Better dump float data |
| - nouveau/parser: Handle arrays properly in P_IMMD() |
| - nouveau/push: Make P_IMMD more versatile |
| - nouveau: Null terminate the debug flag list |
| - nouveau: Generate 3D headers |
| - nvk: Add graphics state to command buffers |
| - nvk: Split pipeline binding into helpers |
| - nvk: Switch to vk_pipeline_shader_stage_to_nir |
| - nvk: Don't free the NIR in nvk_compile_nir |
| - nvk: Add an nvk_shader_address helper |
| - nvk: Free pipeline shader BOs |
| - nvk: Expose pipeline alloc/free functions |
| - nvk: Make shader_upload take an nvk_device |
| - nvk/shader: Assign I/O locations and gather info |
| - nvk/shader: Populate headers for vertex and fragment shaders |
| - nvk: Add a nvk_cmd_buffer_device() helper |
| - nvk: Import 3D context init code from nouveau |
| - nil/format: Add helpers for render formats |
| - nvk: Add boilerplate for Begin/EndRendering |
| - nvk: Misc. additional state setup |
| - nvk: Emit dynamic graphics state |
| - nvk: Implement push constants and descriptors for graphics |
| - nouveau: Add CPU push buffers |
| - nvk: Graphics pipelines |
| - nvk: Implement vkCmdDraw() |
| - nvk: Color attachments clears via image clears |
| - vulkan/meta: Add the start of a meta framework |
| - vulkan/meta: Add an object tracking list |
| - vulkan/meta: Add a concept of rect pipelines |
| - vulkan/meta: Implement attachment clears |
| - vulkan/meta: Implement start-of-rendering clears |
| - vulkan/meta: Add implementations of Clear*Image |
| - nvk: Add an attachment format even for secondaries |
| - nvk: Add an addr field to nvk_buffer |
| - nvk: Expose a bind_vertex_buffer helper |
| - nvk: Use vk_meta for CmdClearAttachments |
| - nvk: Stop using vk_cmd_set_dynamic_graphics_state in meta_end() |
| - nvk: Enable all the dynamic state features |
| - nouveau: Fix pushbuf ref reset for user command buffers |
| - nvk: add linear image creation support. |
| - nvk: Use max alignment for descriptor pool sizes |
| - nil: Switch to using the new headers for TIC entries |
| - nvk: Use meta for CmdClear*Image |
| - nvk: Zero client memory objects |
| - nvk: Bind texture and sampler header pools for 3D |
| - nvk: Use the new headers for samplers |
| - nvk: Implement nir_intrinsic_load_frag_coord |
| - vulkan/meta_clear: Populate VkRenderingInfo::renderArea |
| - nvk: Don't assert when there are no attachments |
| - nvk: Track and reference all device memory objects |
| - vulkan: Allow scissors or viewports to be set without counts |
| - nvk/copy: Mape bpp part of nouveau_copy_buffer |
| - nvk: Implement copies for D24_UNORM_S8_UINT images |
| - nvk: Drop sample locations structs |
| - nvk/meta: Save and restore VI state |
| - nvk: Re-initialize dynamic_graphics_state.vi when recycling |
| - nvk: Move the vertex format table into nvk_format.h |
| - nvk: Advertise vertex buffer format featues |
| - nvk: Clean up try_create_physical_device error handling |
| - nouveau/parser: Dump more fields as float |
| - nvk: Depth bounds need fui() |
| - nouveau: Add class information to nouveau_ws_device |
| - nil: Properly depend on nouveau winsys and nvidia-headers |
| - nil: Use nvidia headers for texture format enums |
| - nil: Use the nvidia headers for render target format enums |
| - nil: Use nvidia headers for ZS format enums |
| - nil: Rename rt to czt in the format info struct |
| - nil: Rename rendering to color_target |
| - nil: Re-introduce the format capabilities |
| - nil: Add more format support helpers |
| - nvk: Advertise more format features |
| - nvk: Clear dynamic state dirty after flushing it all |
| - vulkan/meta: Make stencil reference dynamic for clears |
| - nvk: Depth buffers don't allow Z-tiling |
| - nvk: Disable sparse Z on Maxwell+ |
| - nil: Compute PTE kinds and tile modes for images |
| - nouveau: Add a function to allocate a tiled buffer |
| - nvk: Add internal helpers for device memory allocation |
| - nvk: Do internal dedicated allocations for ZS images |
| - nvk: Fix depth/stencil render pass clears |
| - nvk: Fix viewport Z scale |
| - nvk: Enable two-sided stencil |
| - nvk: Flip the front-face setting |
| - nvk: Advertise depth/stencil support |
| - nvk: Don't destroy NULL descriptor pool BOs |
| - nvk: Call nir_lower_input_attachments |
| - nvk: Set GEOMETRY_SHADER_SELECTS_LAYER properly |
| - nvk: Return OUT_OF_DEVICE_MEMORY if bo_new fails |
| - nil: Add a PTE kind for Z32_FLOAT |
| - nvk: Add nvk_queue_init/finish() helpers |
| - nvk: Align descriptor buffers to NVK_MIN_UBO_ALIGNMENT |
| - nvk: Re-flow a couple function prototypes |
| - nvk: Assert samples == 1 |
| - nvk: Allocate descriptors for input attachments |
| - nvk: Wire up early z and post depth coverage |
| - nvk: Save/restore push constants around meta ops |
| - nouveau/parser: Add array and float tags for clear values |
| - nvk: Use hardware clears for attachment clears |
| - nvk: Add image_view_init/finish functions |
| - nvk: Implement vkCmdClear*Image directly |
| - nvk: Use a UINT format to clear non-renderable images |
| - nvk: Don't advertise tiling on non-power-of-two formats |
| - nvk: Fix max anisotropy |
| - nvk: Assert on CmdExecuteCommands |
| - nvk: VkSamplerCreateInfo::mipLodBias is signed |
| - nvk: Fix border color alpha |
| - nil/format: Depth/stencil formats appear as red |
| - nil: Fix max mip level |
| - nil: Fix nonnormalized coordinates |
| - nvk: Set up clip and cull distances |
| - nvk: Fix dynamic buffer descriptor copies |
| - nvk: Inline nouveau_copy_linear |
| - nvk/copy: Rename push to p |
| - nvk/blit: Rename push to p |
| - nvk/dispatch: Rename push to p |
| - nvk: Drop most buffer tracking |
| - nvk: Rework TLS/SLM and image/sampler table handling |
| - nvk: Invalidate texture header and sampler caches each submit |
| - nvk/sampler: Free descriptor table entries |
| - nvk: Rework nvk_descriptor_table_add/remove |
| - nvk: Implement descriptor table growing |
| - nvk: Zero unused descriptors |
| - nvk: Add some asserts for nv50 compiler image restrictions |
| - nvk: Update to the new command buffer infrastructure |
| - nvk: Split nvk_queue into its own file |
| - nvk: Start every command buffer with a nop |
| - nvk: Initialize fixed draw/default state once |
| - nouveau/parser: Convert to mako |
| - nouveau/parser: Use more idiomatic python |
| - nouveau/parser: Put the dump helpers in C files |
| - nvk: Use f for extension features |
| - nvk: Drop a TODO |
| - nvk: Use VK_IMAGE_USAGE_*_ATTACHMENT_BIT for image clears |
| - nvk: Increase the graphics pipeline push space |
| - nil: Don't claim texture support for 2-bit SNORM |
| - nouveau/push: Fix a void pointer arithmetic bug |
| - nouveau/parser: Add more arrays |
| - nouveau/mme: Add basic structures for the Turing+ MME |
| - nouveau/mme: Add isaspec XML for the Turing+ MME |
| - nouveau/mme: Add an assembler and disassembler for the Turring+ MME |
| - nouveau/mme: Add a builder for the Turing+ MME |
| - nouveau/mme: Add a tiny simulator for the Turing+ MME |
| - nouveau/mme: Add an isaspec-based dumper |
| - nouveau/mme: Make the winsys headers C++ safe |
| - nouveau/mme: Add unit tests for the Turing+ MME simulator |
| - nvk: Add MME infrastructure |
| - nvk: Use MME for clears |
| - nouveau/mme: Add helper macros for setting fields |
| - nvk: Use MME for vkCmdDraw[Indexed]() |
| - nvk: Implement vkCmdDraw[Indexed]Indirect() |
| - nvk: Use p for the nouveau_ws_push_buffer in zero_vram |
| - nouveau: Add an nv_push struct |
| - nouveau: Rename the fields of vk_push |
| - nouveau: Move nv_push and helpers to their own header |
| - nouveau/parser: Take a FILE* in DUMP_*_MTHD_DATA |
| - nouveau: Move push validate to nv_push.c |
| - nouveau: Move push dumping to nv_push.c |
| - nvk: Use nv_push directly for graphics pipelines |
| - nouveau: Add a nouveau_ws_bo_new_mapped helper |
| - nvk: Use bo_new_mapped for the zero page |
| - nvk: Always allocate empty_push |
| - nvk: Move queue_sumbit to nvk_queue_drm_nouveau.c |
| - nvk: Submit pushbufs directly |
| - nvk: Use a regular BO for the empty push |
| - nvk: Use a regular BO for the queue state push |
| - nvk: Add an nvk_queue_submit_simple helper |
| - nvk: Initialize the queue later in device setup |
| - nvk: Use submit_simple for draw state init |
| - nvk: Use queue_submit_simple for zero_vram |
| - nvk: Break nvk_cmd_pool into its own file |
| - nvk: Use cmd instead of cmd_buffer |
| - nvk: Add BO recycling to the command pool |
| - nvk: Return VkResult from nvk_cmd_buffer_upload_alloc |
| - nvk: memcpy root descriptors for compute instead of doing a DMA |
| - nvk: Fully populate QMDs before uploading |
| - nvk: Constant buffer alignment is actually 64B |
| - nvk: Rework side-band data upload |
| - nvk: Add an nvk_cmd_buffer_push helper |
| - nvk: Add an nvk_cmd_buffer_ref_bo helper |
| - nvk: Allocate upload buffers from the command pool |
| - nvk: Use nvk_cmd_bo for push bufs |
| - nvk: Implement vkCmdExecuteCommands() |
| - nvk: Remove remaining references to nouveau_push.h |
| - nouveau: Use DRM interfaces directly in MME tests |
| - nouveau: Drop nouveau_ws_push |
| - nvk: Re-indent vk_instance.c |
| - nvk: Use vk_object_zalloc/free for descriptor pools/sets |
| - nvk: Fix up whitespace in nvk_descriptor_set.c |
| - nvk: Implement VK_KHR_push_descriptor |
| - nvk: Reference descriptor set layouts in the sets themselves |
| - nvk: Embed a nv_device_info in nvk_physical_device |
| - nvk: Add an nvk_queue_submit wrapper |
| - nvk: Also store the push BO map in nvk_queue_state |
| - nvk: Bring back push sync and dumping |
| - nvk: drop nvk_nir.h |
| - nvk: Add lowering for load_global_constant_bounded |
| - nvk: Properly implement robustBufferAccess |
| - vulkan/meta: Add key types |
| - vulkan/meta: Add a helper for image view types |
| - vulkan/meta: Add a create_sampler helper |
| - vulkan/meta: Fixes for clear |
| - vulkan/meta: Implement vkCmdBlitImage() |
| - nvk: Support load_layer_id |
| - nvk/meta: Save/restore descriptor set 0 |
| - nvk: Use meta for doing blits with the 3D hardware |
| - nvk: WFI in pipeline barriers |
| - util/vma: Allow initializing zero-size heaps |
| - nvk: Rework nvk_queue_submit_simple() |
| - nvk: Add a heap data structure |
| - nvk: Return a VkResult from nvk_shader_upload() |
| - nvk: Add a shader heap to nvk_device |
| - nvk: Allocate shaders from a heap |
| - nvk: Rework whitespace in nvk_device_memory.c |
| - nvk: Style fixes in nvk_physical_device.c |
| - nvk: Reset semaphore syncs on wait |
| - nvk/wsi: Style fixes |
| - nvk/wsi: Use the common present implementation |
| - nouveau/parser: Parse all fields in each method |
| - nvk: Add a query pool object |
| - nvk: Implement timestamp queries |
| - nvk: Implement pipeline statistics and occlusion queries |
| - nouveau/mme: Allow ZERO as the destinatio nof mme_load_to |
| - nouveau/mme: Assert on OOB registers |
| - nouveau/mme: Add support for freeing registers |
| - nouveau/mme: Add a couple helpers for working 64-bit addresses |
| - nouveau/mme: Add a helper for MME_DMA_READ_FIFOED |
| - nvk: Use mme_tu104_read_fifoed() |
| - nvk: Implement vkCmdCopyQueryPoolResults() |
| - nvk: Handle large command buffer uploads better |
| - nvk: Use a normal DMA for CmdUpdateBuffer |
| - nouveau/parser: Handle 6F methods |
| - nvk: Use mme_load_addr64() |
| - nvk: Use poll for BO waits |
| - nvk: Events |
| - nvk: Don't crash if we fail to allocate a push BO |
| - nvk: Stop leaking command pool BOs |
| - nvk: Enable VK_KHR_create_renderpass2 |
| - nvk: Advertise VK_KHR_imageless_framebuffer |
| - nvk: Flush the current pushbuf before allocating a new one |
| - nvk: Advertise VK_KHR_separate_depth_stencil_layout |
| - nvk: Tell WSI we don't support legacy scanout |
| - nouveau: Add PCI information to nv_device_info |
| - nvk: Implement VK_EXT_pci_bus_info |
| - nvk: Bind 3D images as 3D for clears |
| - nvk: Support copies between 3D and 2D images |
| - nil: Add a helper for getting 2D views of 3D images |
| - nvk: Support 2D views of 3D images |
| - nvk: Advertise VK_KHR_maintenance1 |
| - nvk: Use 2D array views for 3D storage images |
| - nil: Fix include guards in nil_image.h |
| - nvk: Advertise custom border color features |
| - vulkan: Add a helper for swizzling color values |
| - nvk: Implement VK_EXT_border_color_swizzle |
| - nvk: Advertise VK_EXT_extended_dynamic_state3 |
| - nvk: Move more states to dynamic |
| - nvk: Advertise VK_KHR_storage_buffer_storage_class |
| - nvk: Add a helper for pushing descriptors |
| - nouveau/headers: Add generated headers to dependencies |
| - nvk: Implement VK_EXT/KHR_buffer_device_address |
| - nvk: Break the guts of CmdDispatch into a helper |
| - nvk: Implement DispatchIndirect |
| - nouveau/mme: Add a mul64 helper |
| - nvk: Implement CS invocations statistics queries |
| - nil: Use ONE for the anixotropic coarse spread function |
| - nil: Properly support MSAA |
| - nil: Add an offset4d struct and some helpers |
| - nouveau/parser: Sort METHOD_ARRAY_SIZES |
| - nouveau/parser: Handle SET_ANTI_ALIAS_SAMPLE_POSITIONS |
| - nvk: Stop asserting on MSAA |
| - nvk: Handle zero color attachments better |
| - nvk: Handle multisampled render targets properly |
| - nvk: Support copies of MSAA images |
| - nvk: Use the right view format for stencil texturing |
| - nvk: Pass through a shader key for fragment shaders and MSAA |
| - nvk: Set correct multisample regs for graphics pipelines |
| - nvk: Stop creating a new upload BO every time |
| - nvk: Fill out sample locations on Maxwell B+ |
| - vulkan/meta: Bind whole LODs of 3D blit destinations |
| - vulkan/meta: Add a helper for building texture ops |
| - vulkan/meta: Break the guts of blit into a helper |
| - vulkan/meta: Support writing stencil as iterative discard |
| - vulkan/meta: Rename vk_meta_blit.c to vk_meta_blit_resolve.c |
| - vulkan/meta: Add support for MSAA resolves |
| - nvk/meta: Fix restore for descriptor set 0 |
| - nvk: Use meta for MSAA resolves |
| - nvk: Replace gl_SamplePosition with fract(gl_FragCoord.xy) |
| - nvk: Stop advertising higher framebufferNoAttachmentsSampleCounts |
| - nvk: Advertise MSAA via image format properties |
| - nvk: Advertise VK_KHR_depth_stencil_resolve |
| - nvk: Assert that descriptor buffer access stays in-bounds |
| - nvk: Add a bo size to nvk_descriptor_set |
| - nvk/format: Style fix for VkFormatProperties3KHR |
| - nvk: Support VK_FORMAT_B10G11R11_UFLOAT_PACK32 for vertex buffers |
| - nvk: Add a devenv ICD json file |
| - nvk: Advertise EXT_vertex_attribute_divisor |
| - nvk: Lower image_size to txs |
| - nvk: Fix a comment |
| - nvk: Add an nvk_buffer_addr_range helper |
| - nvk: Use nvk_buffer_addr_range for buffer descriptors |
| - nvk: Re-order Vulkan 1.0 feature bits |
| - nvk: Enable inheritedQueries |
| - nvk: Enable VK_EXT_provoking_vertex |
| - nvk: Advertise samplerMirrorClampToEdge via 1.2 features |
| - nvk: Advertise VK_KHR_bind_memory2 |
| - nvk: Enable KHR_dynamic_rendering |
| - nvk: Advertise KHR_uniform_buffer_standard_layout |
| - nvk: Advertise EXT_index_type_uint8 |
| - nvk: Advertise VK_EXT_separate_stencil_usage |
| - nvk: Capitalize NVK in user exposed strings |
| - nvk: Rename grid_size to group_count |
| - nvk: Lower load_num_workgroups ourselves |
| - nvk: Drop block_size from the root descriptor table |
| - nvk: Add a helper for loading resource_index-based descriptors |
| - nvk: Set maxMemoryAllocationCount |
| - nouveau/winsys: Take a drmDevicePtr in nouveau_ws_device_new() |
| - nouveau/winsys: Add an info to nouveau_ws_device |
| - nouveau/winsys: Move device type into nv_device_info |
| - nouveau/nil: Take an nv_device_info for image functions |
| - nouveau/nil: Use nv_device_info for format queries |
| - nouveau/mme: Invoke SET_OBJECT in the tests |
| - nouveau/mme: Make alu_op_to_str static |
| - nouveau/mme: Move mme_value into its own header |
| - nouveau/mme: Add a mme_reg_alloc struct |
| - nouveau/mme: Add an intermediate MME_ALU_OP enum |
| - nouveau/mme: Add an intermediate MME_CMP_OP enum |
| - nouveau/mme: Use mme_mov() for temp copies of register IMM32 sources |
| - nouveau/mme: Make helpers less Turing specific |
| - nouveau/mme: Break the Turing builder guts into a separate header |
| - nouveau/mme: Move the guts of mme_merge_to() into mme_tu104_builder.c |
| - nouveau/mme: Move the guts of mme_state_arr_to() into mme_tu104_builder.c |
| - nouveau/mme: Drop the implicit_imm parameter from mme_alu_to() |
| - nouveau/mme: Move the cf_stack struct to mme_builder.h |
| - nouveau/mme: Prepare the builder for multiple GPU generations |
| - nouveau/mme: Take an nv_device_info in mme_builder_init |
| - Support immediates in MERGE |
| - Add add immediate optimizations |
| - nvk: Add support for contiguous heaps to nvk_heap |
| - nvk: Use a contiguous shader heap pre-Volta |
| - nvk: Disable indirect draw/dispatch and query copy MMEs for now |
| - nvk: Free a couple regs in nvk_mme_build_draw_*() |
| - nvk: Properly align root descriptor tables for pre-Pascal |
| - nvk: Compile all NIR before running codegen |
| - vulkan/meta: Insert a geometry shader when needed |
| - nvk: Use a GS for layerered rendering pre-MaxwellB |
| - nvk: Handle zero-size index and vertex buffers pre-Turing |
| - nvk: Cosmetic clean-ups to Create/DestroyDevice |
| - nil: Only choose a PTE kind for tiled images |
| - nouveau/mme: Fix is_int18 for negative numbers |
| - nouveau/mme: Don't swap x and y in mme_fermi_merge_to() |
| - nouveau/mme: Take a const nv_device_info in mme_builder_init |
| - nouveau/mme: Unify some of the test framework |
| - nouveau/mme: Add some generic builder tests |
| - nouveau/mme: Add builder tests for SUB |
| - nouveau/mme: Use a uint32_t for size in mme_fermi_bfe() |
| - nouveau/mme: nouveau/mme: Add builder tests for SLL and SRL |
| - nvk/drm: Take a byte offset/range in push_add_push |
| - nvk: Rework nvk_cmd_push a bit |
| - nvk: Add a helper for pushing indirect data |
| - nvk: Make some MME builder names more consistent |
| - nouveau/mme: Don't allow WaW dependencies in the same Turing instruction |
| - nvk: Reduce register pressure in nvk_mme_build_draw*() |
| - nouveau/push: Add an NV_PUSH_MAX_COUNT #define |
| - nvk: Implement Draw*Indirect on pre-Turing |
| - vulkan/meta: Use the new NIR texture helpers |
| - nvk: Add a build test for MMEs |
| - nvk: Don't over-size push descriptor sets |
| - nvk: Return VK_ERROR_INCOMPATIBLE_DRIVER if the PCI vendor isn't NVIDIA |
| - nvk: Bump init context batch size |
| - nouveau/mme: Fix nested while instructions on Turing+ |
| - nouveau/mme: Add a helper to dump instructions |
| - nvk: Rework extension enables |
| - nvk: Rework features enables |
| - nvk: Advertise shaderImageGatherExtended |
| - nouveau/mme: Add a bfe helper |
| - nouveau/mme: Ensure that zero-initizlied mme_value is ZERO |
| - nvk: De-duplicate MME code for setting draw params |
| - nvk: Clamp viewport clip to max range |
| - nvk: Use the same lock for the submit and the memory objects list |
| - nvk: Advertise ICD/loader interface version 4 |
| - nvk: Add instace WSI entrypoints |
| - nouveau/mme: Use ADD for ine with an immediate |
| - nouveau/mme: Fix while loops pre-Turing |
| - nvk: Add begin to mme_scratch |
| - nvk: Use the new load/store_scratch helpers for DRAW_PAD_DW |
| - nouveau/mme: Add a helper for re-allocating registers |
| - nvk: Rework spill helpers and DRAW_COUNT spilling |
| - nvk: Spill DRAW_IDX pre-Turing |
| - nvk: Break the inner MME draw loop into a helper |
| - nvk: Increase the push runout to 512 dwords |
| - nil: Add a nil_image_for_level helper |
| - nil: Add an image_level_as_uncompressed helper |
| - nvk: Implement uncompressed views of compressed images |
| - nvk: Set pointClippingBehavior |
| - nvk: Expose VK_KHR_maintenance2 |
| - nvk: Add a separate #define for SSBO alignment |
| - nvk: Set spirv_to_nir_options::min_*_alignment |
| - nvk: Use vk_device_memory |
| - nvk: Implement VK_KHR_map_memory2 |
| - nvk: Sort SPIR-V caps |
| - nvk: Advertise EXT_shader_viewport_index_layer on MaxwellB+ |
| - nvk: Only use view_id for layer in multiview |
| - nvk/heap: Set the right pitch for heap resize copies |
| - nvk: Advertise shaderStorageImageReadWithoutFormat |
| - nvk: Fix the NO_PREFETCH assert for CmdDrawIndirect |
| - nvk: Advertise KHR_spirv_1_4 |
| - nvk: s/device/dev in nvk_image.c |
| - nvk: Add helpers for binding image planes |
| - nvk: Take an nvk_image_plane in nouveau_copy_rect_image |
| - nvk: Use the max descriptor alignemtn in GetDescriptorSetLayoutSupport |
| - nvk: Use NVIDIA_VENDOR_ID in pdev try_create() |
| - nvk: Use abbreviated names in nvk_device_memory.c |
| - nvk: Add device and driver UUIDs |
| - nvk: Add external memory queries |
| - nvk: Dedicated allocations override internal |
| - nvk: Require dedicated allocations for external images |
| - nouveau/winsys: Add dma-buf import support |
| - nvk: Support dma-buf import |
| - nvk: Support dma-buf export |
| - nvk: Enable external memory extensions |
| - nvk: Reformat nvk_buffer.c |
| - nvk: Add a buffer alignment helper |
| - nvk: Add an addr field to nvk_image_plane |
| - nvk: Use canonical variable names in nvk_physical_device.c |
| - nvk: Use canonical variable names in nvk_shader.c |
| - nvk: Use canonical variable names in nvk_bo_sync.c |
| - nvk: Use canonical variable names in nvk_sampler.c |
| - nvk: Drop nvk_physical_device::instance |
| - nvk: Only advertise EXT_pci_bus_info on discrete GPUs |
| - nouveau: Put PCI info in a pci substruct in nv_device_info |
| - nouveau: Stop using hex for SM numbers |
| - nvk: Set deviceType based on nv_device_info::type |
| - nouveau: Move more stuff into nv_device_info |
| - nouveau: Move gart_size to nv_device_info |
| - nvk: Use nv_device_info for class checks |
| - nvk: Rename nvk_device::ctx to ws_ctx |
| - nvk: Add a ws_dev to nvk_device and use it |
| - nvk: Move the winsys device to nvk_device |
| - nvk: Don't enumerate pre-Kepler GPUs |
| - nvk: Implement VK_EXT_physical_device_drm |
| - nvk: Require an environment variable for poorly tested hardware |
| - nvk: Use the new core vk_sampler struct |
| - Revert "vulkan: Allow scissors or viewports to be set without counts" |
| - vulkan/meta: Add a get_pipeline_layout helper |
| - vulkan/meta: Use vk_meta_get_pipeline_layout in blit/resolve |
| - nvk: Bind 3D depth/stencil images as 2D arrays |
| - nvk: Flush more state on VI_BINDINGS_VALID dirty |
| - nvk: Don't skip zero-size bindings in GetDescriptorSetLayoutSupport |
| - docs: Add a docs page for NVK |
| - docs: Add NVK to features.txt |
| - docs/relnotes: Stick something about NVK in new_features.txt |
| - nouveau: Drop GART size from nv_device_info |
| - nil: Add a nil_image_level_extent_px() helper |
| - nvk: Use the new NIL helper for image level extents for copies |
| - nvk: Improve image format properties and limits |
| - nvk: Rework multi-plane format features a bit |
| - nvk: Use nvk_root_descriptor_offset for drawInfoBase |
| - nvk: Add a root_desc_addr to the root descriptor table |
| - nvk: Add support for variable pointers |
| - nvk: Enable the SPIR-V DeviceGroup capability |
| - nvk: Separate the MME query copy code out a bit |
| - nvk: Implement CopyQueryPoolResults with a compute shader |
| - nvk: Misc. style nits |
| - nvk: Rework memory requirements to handle aspects correctly |
| - nvk: Implement the maintenance5 image layout queries |
| - nvk: Use VkBufferUsageFlags2 |
| - nvk: Implement CmdBindIndexBuffer2KHR |
| - nvk: Implement GetRenderingAreaGranularityKHR |
| - nvk: Decorate CmdBegin/EndRendering entrypoints |
| - nouveau: Move shader topology info to nv_device_info |
| - drm-uapi: Import nouveau_drm.h |
| - nouveau/winsys: Use the imported nouveau_drm.h headers |
| - nvk: Use the imported nouveau_drm.h headers |
| - nouveau/shim: Use the imported nouveau_drm.h headers |
| - nouveau/mme: Support the new UAPI |
| - nvk: Use an empty EXEC for the empty submit case |
| - nouveau/winsys: Allow nouveau_ws_device_new() without VM_BIND |
| - nvk: Print an error message if VM_BIND support is missing |
| - nvk: Enable the new UAPI |
| - nvk: Use more consistent device variable names |
| - nvk: Call nir_lower_int64 |
| - nir/gl: Move glsl_type::sampler_target() into a helper in its one caller |
| - nvk: Remove plane sources from tex instructions |
| - nvk: Use common physical device properties |
| - nv50/ir: Rework conversions for texture array indices |
| - clang-format: Add nir_foreach_reg_* |
| - clang-format: nir_foreach_src is not a foreach macro |
| - clang-format: Set the default ColumnLimit to 0 |
| - nir: Re-align a couple enums and add clang-format comments |
| - nir: Don't clang-format const_value helpers |
| - nir: Don't clang-format a couple typedefs |
| - nir: Don't clang-format debug print setup |
| - nir: More manual formatting |
| - nir: Pretty format type mapping helpers |
| - nir: Wrap pass macros in braces |
| - nir: Add a do to the do/while in nir_const_value_t_array() |
| - nir: Add a .clang-format file |
| - nir: clang-format src/compiler/nir/\*.[ch] |
| - nvk: Don't use nir_ssa_for_src() |
| - nir: Drop most instances of nir_ssa_dest_init() |
| - nir: Drop more instances of nir_ssa_dest_init() |
| - nir/clone: Clone nir_def nor nir_dest |
| - nir/serialize: [De]serialize nir_def nor nir_dest |
| - nir: Drop nir_ssa_dest_init() |
| - nir: Drop nir_ssa_dest_init_for_type() |
| - nir: nir_foreach_ssa_def() -> nir_foreach_def() |
| - st,zink,sfn: Use nir_foreach_def instead of nir_foreach_dest |
| - dxil: Use nir_foreach_def() instead of nir_foreach_dest() |
| - nir/from_ssa: Use nir_foreach_def() instead of nir_foreach_dest() |
| - nir: Drop nir_foreach_dest() |
| - intel/vec4: Stop passing around nir_dest |
| - intel/fs: Stop passing around nir_dest and nir_alu_dest |
| - broadcom: Stop using nir_dest directly |
| - vc4: Stop passing around nir_dest |
| - nir,ntt,a2xx,lima: Stop using nir_dest directly |
| - lima: Stop using nir_dest directly |
| - etnaviv: Stop passing around nir_dest |
| - r600/sfn: Stop passing around nir_dest and nir_alu_dest |
| - nv50/ir: Stop passing around nir_dest and nir_alu_dest |
| - nir/gather_types: Stop passing around nir_dest |
| - nir/dce: Stop passing around nir_dest |
| - nir/propagate_invariant: Stop passing around nir_dest |
| - nir/validate: Replace all dest validation with validate_def |
| - nir/print: Replace all dest printing with print_def |
| - nir: Get rid of nir_dest_bit_size() |
| - nir: Get rid of nir_dest_num_components() |
| - nir: Get rid of nir_dest_is_divergent() |
| - nir: Drop nir_alu_dest |
| - nir: Drop nir_dest |
| - util/format: 8-bit interleaved YUV formats are UNORM |
| - gallivm: Support G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM |
| - blorp: Use R8G8_UINT for YCRCB_* formats with CCS |
| - anv: Disable CCS_E for ISL_FORMAT_YCRCB_* |
| - vulkan/format: Use correct swizzle for 1-plane YCbCr formats |
| - gallivm: Drop the Vulkan YUV format hacks |
| - nir: Rename nir_instr_type_ssa_undef to nir_instr_type_undef |
| - nir s/nir_get_ssa_scalar/nir_get_scalar/ |
| - nir: s/live_ssa_def/live_def/ |
| - nir: s/nir_instr_ssa_def/nir_instr_def/ |
| - nir: Rework nir_scalar_chase_movs a bit |
| - nir: Fix nir_op_mov handling in nir_collect_src_uniforms |
| - nir: Handle nir_op_mov properly in opt_shrink_vectors |
| - nir: Don't handle nir_op_mov in get_undef_mask in opt_undef |
| - nir: Clean up nir_op_is_vec() and its callers |
| - nir/large_constants: Use nir_component_mask_t |
| - nir/large_constants: Add read/write_const_values helpers |
| - nir/opt_large_constants: Add Small constant handling |
| - spirv: Re-emit constants at their uses |
| - nir: Take a nir_def * in nir_tex_instr_add_src() |
| - nir: Take a nir_def * in nir_phi_instr_add_src() |
| - nir/opt_undef: Don't rewrite a bcsel to mov |
| - nir: Add a nir_instr_clear_src() helper and use it |
| - nir: Add and use a nir_instr_init_src() helper |
| - nir: Drop nir_if_rewrite_condition() |
| - nir: Drop most uses of nir_instr_rewrite_src_ssa() |
| - nir: Drop nir_instr_rewrite_src_ssa() |
| - nir: Drop most uses if nir_instr_rewrite_src() |
| - nir: Drop nir_instr_rewrite_src() |
| - nir: Drop nir_push_if_src() |
| - nir: Fix metadata in nir_lower_is_helper_invocation |
| - nir: Use nir_shader_intrinsic_pass() a few places |
| - drm-uapi: Sync nouveau_drm.h |
| - nvk: Plumb no_prefetch through to the DRM back-end |
| - nouveau/mme: Fix a compile warning |
| - intel/isl: Rename ISL_TILING_Yf/s to ISL_TILING_SKL_Yf/s |
| - intel/isl: Add ICL variants of Yf and Ys tiling |
| - intel/isl: Implement correct tile size calculations for Ys/Yf |
| - intel/isl: Use the depth field of phys_level0_sa for GFX4_2D 3D surfaces |
| - intel/isl: Fill out the correct phys_total_extent for Ys/Yf/Tile64 |
| - intel/isl: Indent uncompressed surface code |
| - intel/isl: Support Ys, Yf & Tile64 in isl_surf_get_uncompressed_surf |
| - intel/isl: Support Yf/Ys tiling in surf_fill_state |
| - intel/isl: Support Yf/Ys tiling in emit_depth_stencil_hiz |
| - intel/isl: Add initial data-structure support for miptails |
| - intel/isl: Add support for computing offsets with miptails |
| - intel/isl: Support miptails in isl_surf_get_uncompressed_surf |
| - intel/isl: Start using miptails |
| - intel/isl: Disallow CCS on 3D surfaces with miptails |
| - intel/isl: Allow Ys tiling |
| - anv: Align memory VA to support for Ys, Tile64 tiled images |
| - nvk: Clean up includes |
| - nvk: Add include guards to nvk_bo_sync.h |
| - nvk: SPDX everything |
| - nouveau/nil: SPDX everything |
| - nouveau/mme: SPDX everything |
| - nvk: Don't add a dummy attachment when gl_SampleMask is written |
| - nvk: Set the discard bit for Z/S self-deps |
| - nvk: Invalidate the texture cache in PipelineBarrier |
| - nvk: Lower interp_at_sample to interp_at_offset |
| - nvk: Disable statistics around meta ops |
| - nvk: Clean up viewport math |
| - nvk: Fix depth clipping parameters |
| - nvk: Enable dynamic clip/clamp enable |
| - nvk: Set GUARDBAND_Z_SCALE_1 when Z-clipping |
| - r600: Use more auto-generated nir_builder helpers |
| - r600: Use nir_builder helpers for load/store_shared_r600 |
| - nvk: Re-order physical device limits |
| - nvk: Advertise maxMemoryAllocationCount = 4096 |
| - nvk: Advertise discreteQueuePriorities = 2 |
| - nvk: Rip out old UAPI support |
| - nvk/drm: Drop the push_add_push_bo() helper |
| - nvk/drm: Drop the push_add_bo() helper |
| - nvk: Drop command buffer BO tracking |
| - nvk: Drop memory object tracking |
| - nvk: Drop the device-level mutex |
| - nvk: Get rid of the tiled memory allocation helpers |
| - nvk/drm: Restructure nvk_queue_submit_drm_nouveau() |
| - nvk/drm: Split exec as needed for large command buffers |
| - nvk: Don't store the descriptor pool BO in the set |
| - nvk: Store a 20-bit driver_build_sha in nvk_instance |
| - nvk: Hook up the disk cache |
| - nvk: Re-structure early shader compilation a bit |
| - nvk: Add a default pipeline cache |
| - nvk: Cache NIR shaders |
| - nvk: Init pipelineCacheUUID |
| - drm-uapi: Sync nouveau_drm.h |
| - nvk: Take GETPARAM_EXEC_PUSH_MAX into account |
| - nvk: Handle zero-sized sparse buffers |
| - nvk: Use align() and align64() instead of ALIGN_POT |
| - nouveau: Generate headers for Maxwell B compute |
| - nvk: Add a nvk_cmd_buffer_compute_cls() helper |
| - nvk: Invalidate sampler/texture header caches in BeginCommandBuffer() |
| - nvk: Invalidate SKED caches at the top of command buffers |
| - nvk: Advertise more inline uniform block limits |
| - nvk: Emit MME_DMA_SYSMEMBAR before indirect draw/dispatch |
| - nvk: Set max descriptors to 2^20 for most descriptor types |
| - nvk: Reset descriptor pool allocator when all sets are destroyed |
| - nil/format: Use A for alpha blend |
| - nil/format: Advertise R10G10B10A2_UINT texture buffer support |
| - nvk: Disable depth or stencil tests when unbound |
| - nvk: Always emit at least one color attachment |
| - nvk: Improve address space and buffer size limits |
| - nvk: Always set pixel_min/max_Z to CLAMP |
| - nvk: Use nouveau_ws_bo_unmap() instead of munmap() |
| - nvk: Free the disk cache |
| - nvk: Add an nvk_shader_finish() helper |
| - nvk: Handle unbinding images and buffers |
| - nvk: Clean up the disk cache on physical device create fail path |
| - vulkan/wsi: Allow for larger linear images |
| - nvk: Add a nvk_cmd_buffer_dirty_render_pass() helper |
| - nvk: Re-sort device features |
| - nvk: Implement VK_EXT_depth_bias_control |
| - nvk: Advertise VK_KHR_workgroup_memory_explicit_layout |
| - nvk: Implement VK_EXT_image_sliced_view_of_3d |
| - nvk: Advertise VK_EXT_primitive_topology_list_restart |
| - nvk: Advertise VK_EXT_attachment_feedback_loop_layout |
| - features: Mark VK_EXT_attachment_feedback_loop_layout done for NVK |
| - nvk: Re-arrange Vulkan 1.2 features to match the header |
| - nvk: Advertise shaderOutputLayer and shaderOutputViewportIndex |
| - nvk: Enable descriptorIndexing |
| - nvk: Implement VK_EXT_dynamic_rendering_unused_attachments |
| - nir: Add a nir_ssa_def_all_uses_are_fsat() helper |
| - nir: Add convert_alu_types to divergence analysis |
| - nir/lower_tex: Add a lower_txd_clamp option |
| - nir: Add a load_sysval_nv intrinsic |
| - nir: Add NV-specific texture opcodes |
| - nir: Add an load_barycentric_at_offset_nv intrinsic |
| - nir: Add a range to most I/O intrinsics |
| - nir: Add NVIDIA-specific I/O intrinsics |
| - nir/lower_bit_size: Fix subgroup lowering for floats |
| - nir: add deref follower builder for casts. |
| - nir: Handle wildcards with casts in copy_prop_vars |
| |
| Felix DeGrood (12): |
| |
| - anv: save a shader source uint32_t hash in gfx/compute pipelines |
| - anv: Add Source hash field to VkPipelineExecutableStatisticKHR |
| - iris: save shader source sha1 in ish |
| - mesa: propagate shader source sha1 from gl_shader to nir_shader |
| - intel: use shader source hash in INTEL_MEASURE |
| - intel/compiler: use shader source hash in shader dump code |
| - anv: add fake sparse support |
| - anv: enable fake sparse for Elden Ring |
| - anv: debug messaging for sparse texture usage |
| - anv: fix frame count reporting in INTEL_MEASURE |
| - anv: set ComputeMode.PixelAsyncComputeThreadLimit = 4 |
| - anv: remove CS_FLUSH from query regression |
| |
| Feng Jiang (9): |
| |
| - virgl: Only PIPE_BUFFER with VIRGL_BIND_CUSTOM flag is considered busy during creation |
| - meson: Export winsys function symbols for target va |
| - frontends/va: Add slice_count to AV1 slice_parameter |
| - virgl/video: Add definition of virgl_av1_picture_desc |
| - virgl/video: Add support for AV1 decoding |
| - virgl/video: Enable AV1 decoding |
| - meson: Rename dri-vdpau.dyn to dri.dyn |
| - CODEOWNERS: Add \@flynnjiang for VirGL video |
| - meson: Move video to separate section in meson configuration summary |
| |
| Filip Gawin (1): |
| |
| - crocus: Avoid fast-clear with incompatible view |
| |
| Flora Cui (1): |
| |
| - radeonsi: limit CP DMA to skip holes in sparse bo |
| |
| Francisco Jerez (29): |
| |
| - intel/fs/ra: Define REG_CLASS_COUNT constant specifying the number of register classes. |
| - intel/vec4/ra: Define REG_CLASS_COUNT constant specifying the number of register classes. |
| - intel/compiler: Make MAX_VGRF_SIZE macro depend on devinfo and update it for Xe2. |
| - intel/fs/ra/xe2: Scale up register allocation granularity by 2x on Xe2+ platforms. |
| - intel/eu/xe2+: Fix encoding of various message descriptors for change in register size. |
| - intel/fs: Fix signedness of payload_node_count argument of calculate_payload_ranges(). |
| - intel/fs/xe2+: Fix payload node live range calculations for change in register size. |
| - intel/fs/xe2+: Fix grf_count in post-RA scheduling for updated register file size. |
| - intel/fs/xe2+: Fixes for increased accumulator register width. |
| - intel/fs/xe2+: Scale MAX_SAMPLER_MESSAGE_SIZE by native register size. |
| - intel/eu/xe2+: Update validation of GRF region size to account for Xe2 reg size |
| - intel/fs/xe2+: Allow increased SIMD width for various get_fpu_lowered_simd_width() restrictions. |
| - intel/compiler/xe2+: Represent dispatch_grf_start_reg in native GRF units. |
| - intel/fs/xe2+: Update encoding of FB write message payload. |
| - intel/fs/xe2+: Round up fs_builder::vgrf() size calculation to HW register unit. |
| - intel/fs/xe2+: Scale BRW_MAX_MSG_LENGTH by native register size. |
| - intel/fs/xe2+: Fix payload layout of sampler messages for Xe2 reg size |
| - intel/fs/xe2+: Update GS payload setup for Xe2 reg size. |
| - intel/fs/xe2+: Update TCS payload setup for Xe2 reg size. |
| - intel/fs/xe2+: Update TES payload setup for Xe2 reg size. |
| - intel/fs: Lower unsupported regioning with non-trivial 2D regions on FIXED_GRFs. |
| - intel/fs/xe2+: Update regioning lowering offset alignment checks for Xe2 regs. |
| - intel/fs/xe2+: Fix execution width of SHADER_OPCODE_GET_BUFFER_SIZE for SIMD16 EU. |
| - intel/fs/xe2+: Fix calculation of spill message width for Xe2 regs. |
| - intel/xe2+: Round up size to reg_unit() in fs_reg_alloc::alloc_spill_reg(). |
| - intel/fs/xe2+: Fix URB writes with 0 data components. |
| - intel/fs: Specify number of data components of logical URB writes via control immediate. |
| - intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB writes. |
| - intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB reads. |
| |
| Frank Binns (10): |
| |
| - pvr: clang-format fixes |
| - pvr: skip setting up SPM consts buffer when no const shared regs are used |
| - pvr: cleanup SPM EOT dynarray after upload |
| - pvr: treat VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT as not supported |
| - pvr: remove pvr_pbe_get_src_pos() |
| - pvr: fix attachments segfault in pvr_is_stencil_store_load_needed() |
| - pvr: fix allocation size of clear colour consts shared regs buffer |
| - pvr: change a few places to use PVR_DW_TO_BYTES() |
| - pvr: fix setup of load op unresolved msaa mask |
| - pvr: emit PPP state when vis_test dirty bit is set |
| |
| Friedrich Vock (19): |
| |
| - radv/ci: Set DRIVER_NAME in LAVA raven vkcts jobs |
| - radv: Handle VK_SUBOPTIMAL_KHR in trace layers |
| - ac/msgpack: make fixstrs a const char |
| - ac/sqtt,radv: Split internal and API hash in PSO correlations |
| - ac/rgp: Write lds_size metadata |
| - ac/rgp: Add metadata for separate-compiled RT stages |
| - radv/sqtt: Move record filling to helper function |
| - radv/sqtt: Unregister records based on hash |
| - radv/sqtt: Write LDS size metadata in code objects |
| - radv/sqtt: Handle separately-compiled RT pipelines |
| - ac/sqtt,radv/sqtt: Add and use marker for separate RT compilation |
| - nir/load_store_vectorize: Handle intrinsics with constant base |
| - radv/rt: Pre-initialize instance address |
| - radv: Initialize shader freelist on allocation |
| - radv: Fix check in insert_block |
| - radv/rt: Reject hits within 10ULP of previous hits in emulated RT |
| - radv/rra: Recognize LPDDR memory |
| - radv/rmv: Recognize LPDDR memory |
| - vulkan: Don't use set_foreach_remove when destroying pipeline caches |
| |
| Ganesh Belgur Ramachandra (5): |
| |
| - radeonsi: stores bottom_edge_rule option in the rasterizer state |
| - radeonsi: sets OPTIMAL_BIN_SELECTION to 0 if using bottom_edge_rule |
| - radeonsi: "clear_render_target" shader in nir |
| - radeonsi: "clear_render_target_1d_array" shader in nir |
| - radeonsi: "clear_12bytes_buffer" shader in nir |
| |
| Georg Lehmann (39): |
| |
| - aco/gfx11: fix get_gfx11_true16_mask with v_cmp_class_f16 |
| - aco: improve get_gfx11_true16_mask description |
| - aco: combine a & ~b to bfi(b, 0, a) |
| - aco/gfx11: use v_cmp_class_f16 with opsel for bitnz/bitz |
| - aco: fix non constant 16bit bitnz/bitz |
| - ac/nir: handle more special cases in ac_nir_unpack_arg |
| - aco: use s_bitreplicate_b64_b32 to set exec to 0xffff0000ffff0000 |
| - nir/opt_intrinsics: optimize (exclusive_scan(op, a) op a) to inclusive scan |
| - aco: always use rtne for fquantize2f16 |
| - nir/opt_if: also rewrite uniform uses for read_invocation |
| - nir: unify lower_bitfield_insert with has_{bfm,bfi,bitfield_select} |
| - nir: unify lower_bitfield_extract with has_bfe |
| - nir: unify lower_find_msb with has_{find_msb_rev,uclz} |
| - aco: fix u2f16 with 32bit input |
| - aco: combine a | ~b to bfi(b, a, -1) |
| - aco: use v_cvt_f32_ubyte for signed casts too |
| - nir: add nir_scalar intrinsic helpers |
| - nir: add nir_scalar_equal |
| - aco: implement some exclusive scans with inclusive scans |
| - aco/gfx11: don't use bfe for local_invocation_id if the others are always 0 |
| - nir/opt_algebraic: remove broken fddx/fddy patterns |
| - aco: simplify masked swizzle dpp selection by removing or_mask first |
| - aco: fix p_extract with v1 dst and s1 operand |
| - aco: implement 64bit div find_lsb |
| - nir: scalarize masked_swizzle_amd created from shuffle_xor |
| - aco/optimizer: check if we can use omod before labeling it |
| - aco/optimizer: copy propagate to output modifier instructions |
| - aco: remove -0.0 for 32 bit fsign with mul_legacy/omod when denorms are flushed |
| - nir: make quad intrinsic dst bit size match src0 |
| - nir/lower_subgroups: use intrinsic builder more |
| - aco: assume new generations are unsupported by clrx |
| - aco: assume newer generation will use GFX11 wait_imm packing |
| - aco: print final ir instead if printing asm is unsupported |
| - aco/gfx11: optimize dual source export |
| - aco/gfx11: apply clamp/omod to vinterp |
| - aco: support v_fma_f32_dpp as fma_mix |
| - aco/gfx11: support vinterp as fma_mix |
| - aco: add missing scc def for SALU quad broadcast |
| - aco/sched: treat p_dual_src_export_gfx11 like export |
| |
| George Ouzounoudis (38): |
| |
| - nouveau/codegen: Support compact clip distances with arrayed_io |
| - nouveau/codegen: Handle nir op amul |
| - nouveau/codegen: Fix compact patch varyings in case of NIR |
| - nouveau/codegen: Add capability to pre-specify tessellation domain |
| - nvk: Do not increment instance id across draws |
| - nvk: Add a macro for root descriptor table byte offsets |
| - nvk: Set base vertex state in sequential mme draw |
| - nvk: Support base instance in instanced draw calls |
| - nvk: Switch point rasterization to point sprites |
| - nvk: Support large points |
| - nvk: Compile geometry shaders |
| - nouveau/mme: Keep device info in mme_builder |
| - nvk: Simplify mme build function argument |
| - nvk: Support VK_KHR_shader_draw_parameters |
| - nvk: Support for vertex shader transform feedback |
| - nvk: Support transform feedback indirect draws |
| - nvk: Support transform feedback geometry streams |
| - nvk: Support transform feedback queries |
| - nvk: Support vertex shader transform feedback on Fermi |
| - nvk: Disable PRIMITIVE_RESTART_VERTEX_ARRAY by default |
| - nvk: Fix geometry shader active stream mask |
| - nvk: Support geometry shaders |
| - nvk: Basic tessellation shader support |
| - nvk: Assign locations correctly for arrayed IO |
| - nvk: Enable multiview with tessellation shader |
| - nvk: Fix cases where execution mode is specified in the tesc shader. |
| - nvk: Respect tessellation domain origin state |
| - nvk: Lower io to temporaries for tessellation evaluation nir |
| - nvk: Support VkDescriptorSetVariableDescriptorCountLayoutSupport |
| - nvk: Handle cases of descriptor bindings with variable counts |
| - nvk: Add nir non-uniform optimization pass |
| - nvk: Enable descriptor indexing |
| - nvk: Do not keep redundant info for tessellation domain |
| - nouveau/codegen: Do not keep redundant info for tessellation domain |
| - nvk: Enable dynamic line rasterization mode state |
| - nvk: Fix support for VK_EXT_sample_locations |
| - nvk: Support dynamic state for enabling sample locations |
| - nouveau/codegen: Add a 4th optimization level for MemoryOpts |
| |
| Gert Wollny (63): |
| |
| - r600/sfn: Switch to register intrinsics |
| - r600/sfn/tests: add simple copy-prop test with register source |
| - r600/sfn: Allow for larger ALU CF's |
| - r600/sfn: Handle indirect array load/store dependencies better |
| - r600/sfn: Increase LDS fetch schedule priority |
| - r600/sfn: Add peephole optimization to move a dest to the previous op |
| - r600/sfn: reorder the value factory class member declaration a bit |
| - r600/sfn: Add some tests for proper register access |
| - r600/sfn: Print more info if scheduling fails |
| - r600/sfn: remove debug output leftovers |
| - r600/sfn: Fix use of multiple IDX with kcache |
| - r600/sfn: Always check arrays writes before allowing copy propagation |
| - r600/sfn: set block sizes based on chip class |
| - r600/sfn: Fix typo with block type |
| - r600/sfn: override slot count for IfInstr |
| - r600/sfn: Add method to convert to AluGroup directly |
| - r600/sfn: Add flags to check whether a group starts CF and can do that |
| - r600/sfn: make remaining slots a signed value |
| - r600/sfn: on Cayman loading an index register needs only one slot |
| - r600/sfn: Splizt ALU blocks in scheduler to fit into 128 slots |
| - r600/sfn: rework checks for ALU CF emission |
| - r600/sfn: Schedule AR uses befor possible groups |
| - r600: Explicitly force new CF in gs copy shader |
| - r600: Assert when backend wants to create a new ALU CF |
| - r600: don't check possible size of ALU CF |
| - r600: don't use sb disasm to disassamble copy shader |
| - r600: Force CF when emitting a NOP on R600 in gs copy shader |
| - r600/sfn: Don't try to propagate to vec4 with more than one use |
| - r600/sfn: Only switch to other CF if no AR uses are pending |
| - r600/sfn: AR loads should depend on all previous non ALU instructions |
| - r600/sfn: Renumber shader blocks in scheduler |
| - r600/sfn: Track whether a register is ALU clause local |
| - r600/sfn: Use clause local registers in RA |
| - r600/sfn: Take source uses into account when switching channels |
| - r600/sfn: take number of dest values into account |
| - r600: retire SB optimizer |
| - r600/sfn: work around injecting extra CF's to handle hardware bugs |
| - r600: use correct cso pointer for fetch shader |
| - r600/sfn: Make use of four clause local registers |
| - r600/sfn: drop unused ControlFlowInstr type enum |
| - r600/sfn: factor out resource as extra class |
| - r600/sfn: Simplify dependency chain for index loads on EG |
| - r600: print texture resource index mode separately |
| - r600/sfn: Make address split pass obligatory |
| - r600/sfn: rename method resource_base to resource_id |
| - r600/sfn: Add old address to update_indirect_addr |
| - r600/sfn: Sepeate resource and sampler in texture instructions |
| - r600/sfn: get rid of the method to get the index mode |
| - r600/sfn: sort the uniforms of the right shader |
| - r600/sfn: Fix use of scheduled_shader vs shader |
| - virgl: report MIRROR_CLAMP features better |
| - ci: Upref virglrenderer |
| - copyimage: check requested slice early when cube maps are involved |
| - mesa: check numlevels and numlayers when creating a texture view |
| - virgl: Use common clear_texture if host doesn't support the feature |
| - r600/sfn: don't remove texture sources by using the enum value |
| - r600: drop egcm_load_index_reg |
| - r600/sfn: Don't override a chgr pinning during copy propagation |
| - r600/sfn: When simplifying src vec4 pinnings, also check all uses |
| - virgl: Fix logic for reporting PIPE_MIRROR_CLAMP |
| - r600: Add callbacks for get_driver_uuid and get_device_uuid |
| - r600: Link with libgalliumvl, when enabling rusticl this is needed |
| - r600/sfn: Fixup component count only if intrinsic has it |
| |
| Guilherme Gallo (5): |
| |
| - bin/ci: Ensure that all jobs have nodes in DAG |
| - ci/radeonsi: Update flake list |
| - ci/freedreno: Add a new flake |
| - ci/zink: Found some flakes |
| - ci/anv: Catch some flakes |
| |
| Hannes Mann (1): |
| |
| - vulkan/wsi/wayland: Fix detection of tearing control protocol |
| |
| Hans-Kristian Arntzen (2): |
| |
| - wsi/x11: Fix potential deadlock in present ID. |
| - wsi/x11: Don't allow signal_present_id to rewind. |
| |
| Helen Koike (21): |
| |
| - ci: re-add EXTRA_LOCAL_PACKAGES to rootfs |
| - ci: add EXTRA_LOCAL_PACKAGES to apt-get install |
| - docs/ci: Add docs for EXTRA_LOCAL_PACKAGES |
| - ci: disable duplicated pipelines triggered by marge |
| - ci: add --project option to ci_run_n_monitor.py |
| - ci/android: remove strace output from cuttlefish-runner.sh |
| - ci: add locked flag to bindgen-cli on x86_64_build.sh |
| - ci: separate hiden jobs to -inc.yml files |
| - ci/ci_run_n_monitor: add docs for multiple targets |
| - ci/ci_run_n_monitor: print stress test results per job |
| - ci/ci_run_n_monitor: simplify with defaultdict |
| - ci/ci_run_n_monitor: merge print_job_status_change with print_job_status |
| - ci/ci_run_n_monitor: make --target mandatory |
| - ci/ci_run_n_monitor: merge enable_job with retry_job |
| - ci/ci_run_n_monitor: simplify enable/cancel logic in monitor_pipeline() |
| - ci/ci_run_n_monitor: allow <user>/<project> in --project |
| - ci/ci_run_n_monitor: limit repetitions on --stress |
| - ci/marge_queue: add missing python-dateutils to requirements.txt |
| - ci/ci_run_n_monitor: keep monitoring if a job is still running |
| - ci/marge_queue: add pretty_dutation() |
| - ci/ci_run_n_monitor: print job duration time |
| |
| Honglei Huang (7): |
| |
| - virgl/video: Add support for mpeg12 decoding |
| - virgl/video: Add support for vc1 decoding |
| - virgl/video: Add support for jpeg decoding |
| - virgl/video: Add support for hevc10bit decoding. |
| - virgl/video: Add more pipe type in virgl formats convert table |
| - virgl/video: Add jpeg buf start code check |
| - virgl: Enable vp9 hardware decode |
| |
| Hyunjun Ko (3): |
| |
| - anv: use ycbcr_info for P010 format |
| - anv: don't use cmd_buffer after destroyed. |
| - anv: don't flush_llc on gen9 |
| |
| Iago Toral Quiroga (100): |
| |
| - nir/trivialize: Move decl_reg to the start of the block |
| - v3dv: stop incrementing UBO indices by one |
| - nir/lower_robustness: drop skip_ubo_0 option |
| - v3dv: fix incorrect key setup |
| - broadcom/compiler: stop asserting on Vulkan environment |
| - broadcom/compiler: use NIR's lowering for dispatch base |
| - broadcom/compiler: move uniform offset lowering from compiler to GL driver |
| - broadcom/compiler: move vulkan's point coord lowering to the driver |
| - v3dv: don't set lower_wpos_pntc for Vulkan |
| - broadcom/compiler: always clamp results from logic ops |
| - broadcom/compiler: drop execution environment from the shader key |
| - v3dv: drop cpu path for buffer to image copies |
| - v3dv: remove unused code |
| - nir/lower_tex: copy backend_flags field when copying a tex instruction |
| - nir/lower_tex: use a callback to check sampler return size packing |
| - squash! v3dv,broadcom/compiler: don't abuse sampler index |
| - v3dv: assert that only tex instructions with sampler state have a sampler src |
| - v3d: fix texture packing lowering |
| - v3d,v3dv: use fquantize2f16 lowering in NIR |
| - v3dv: be more precise in vkGetImageSubresourceLayout |
| - v3dv: handle pPlaneLayouts in VkImageDrmFormatModifierExplicitCreateInfoEXT |
| - v3dv: bump up MAX_UNIFORM_BUFFERS to 16 |
| - v3dv: add support for sampling simple 2D linear textures |
| - v3dv: expand sampling from linear image hack to support multi-planar images |
| - v3dv: don't assume that bound descriptors have been written |
| - v3dv: only handle Android Hardware Buffer on Android |
| - v3dv: we can sample from 1D array too |
| - broadcom/compiler: add a couple of shader key helpers |
| - v3d: compute nir sha1 for uncompiled shader state |
| - v3d: use pre-computed shader sha1 for disk cache |
| - v3d: fix RAM shader cache |
| - v3d: get rid of shader_state pointer in v3d_key |
| - broadcom/simulator: reset CFG7 for compute dispatch in v71 |
| - broadcom/common: retrieve V3D revision number |
| - broadcom/compiler: update node/temp translation for v71 |
| - broadcom/compiler: implement "reads/writes too soon" checks for v71 |
| - broadcom/compiler: implement read stall check for v71 |
| - broadcom/compiler: add a v3d71_qpu_writes_waddr_explicitly helper |
| - broadcom/compiler: prevent rf2-3 usage in thread end delay slots for v71 |
| - broadcom/qpu: add new ADD opcodes for FMOV/MOV in v71 |
| - broadcom/qpu: fix packing/unpacking of fmov variants for v71 |
| - broadcom/compiler: make vir_write_rX return false on platforms without accums |
| - broadcom/compiler: rename vir_writes_rX to vir_writes_rX_implicitly |
| - broadcom/compiler: only handle accumulator classes if present |
| - broadcom/compiler: don't assign rf0 to temps across implicit rf0 writes |
| - broadcom/compiler: CS payload registers have changed in v71 |
| - broadcom/compiler: don't schedule rf0 writes right after ldvary |
| - broadcom/compiler: allow instruction merges in v71 |
| - broadcom/qpu: add MOV integer packing/unpacking variants |
| - broadcom/qpu: fail packing on unhandled mul pack/unpack |
| - broadcom/compiler: generalize check for shaders using pixel center W |
| - broadcom/compiler: v71 isn't affected by double-rounding of viewport X,Y coords |
| - broadcom/compiler: update peripheral access restrictions for v71 |
| - broadcom/qpu: add packing for fmov on ADD alu |
| - broadcom/compiler: handle rf0 flops storage restriction in v71 |
| - broadcom/compiler: enable ldvary pipelining on v71 |
| - broadcom/compiler: try to use ldunif(a) instead of ldunif(a)rf in v71 |
| - broadcom/compiler: don't assign rf0 to temps that conflict with ldvary |
| - broadcom/compiler: convert mul to add when needed to allow merge |
| - broadcom/compiler: implement small immediates for v71 |
| - broadcom/compiler: update thread end restrictions for v7.x |
| - broadcom/compiler: update ldvary thread switch delay slot restriction for v7.x |
| - broadcom/compiler: lift restriction for branch + msfign after setmsf for v7.x |
| - broadcom/compiler: start allocating from RF 4 in V7.x |
| - broadcom/compiler: validate restrictions after TLB Z write |
| - broadcom/compiler: lift restriction on vpmwt in last instruction for V3D 7.x |
| - broadcom/compiler: fix up copy propagation for v71 |
| - broadcom/compiler: don't allocate spill base to rf0 in V3D 7.x |
| - broadcom/compiler: improve allocation for final program instructions |
| - broadcom/compiler: don't assign registers to unused nodes/temps |
| - broadcom/compiler: only assign rf0 as last resort in V3D 7.x |
| - v3dv: expose V3D revision number in device name |
| - v3dv/device: handle new rpi5 device (bcm2712) |
| - v3dv: setup render pass color clears for any format bpp in v71 |
| - v3dv: setup TLB clear color for meta operations in v71 |
| - v3dv: fix up texture shader state for v71 |
| - v3dv: handle new texture state transfer functions in v71 |
| - v3dv: implement noop job for v71 |
| - v3dv: handle render pass global clear for v71 |
| - v3dv: GFX-1461 does not affect V3D 7.x |
| - broadcom/compiler: update thread end restrictions validation for v71 |
| - v3dv: handle early Z/S clears for v71 |
| - v3dv: handle RTs with no color targets in v71 |
| - v3dv: don't convert floating point border colors in v71 |
| - v3dv: handle Z clipping in v71 |
| - v3dv: make v3dv_viewport_compute_xform depend on the V3D version |
| - v3dv: fix depth clipping then Z scale is too small in V3D 7.x |
| - v3d/v3dv: fix texture state array stride packing for V3D 7.1.5 |
| - v3d,v3dv: support up to 8 render targets in v7.1+ |
| - v3d,v3dv: don't use max internal bpp for tile sizing in V3D 7.x |
| - v3d,v3dv: propagate NaNs bits in shader state records are reserved in v7.x |
| - v3dv: use new texture shader state rb_swap and reverse fields in v3d 7.x |
| - v3dv: fix color write mask for v3d 7.x |
| - v3d,v3dv: fix depth bias for v3d 7.x |
| - v3d,v3dv: fix compute for V3D 7.1.6+ |
| - v3dv: expose fullDrawIndexUint32 in V3D 7.x |
| - v3dv: expose depthClamp in V3D 7.x |
| - v3dv: expose scalarBlockLayout on V3D 7.x |
| - v3dv: fix confusing nomenclature about DRM nodes |
| - v3d,v3dv: fix MMU error from hardware prefetch after ldunifa |
| |
| Ian Douglas Scott (1): |
| |
| - egl/wayland: Don't segfault if \`create_wl_buffer` returns \`NULL` |
| |
| Ian Romanick (38): |
| |
| - intel/fs: Always do opt_algebraic after opt_copy_propagation makes progress |
| - intel/fs: Constant fold SHL |
| - intel/fs: Constant fold OR and AND |
| - util/rb-tree: Return the actual first node from rb_tree_search |
| - util/rb-tree: Fix typo in comment |
| - nir/builder: Add nir_extract_i8_imm and nir_extract_u8_imm helpers |
| - nir/algebraic: Remove redundant pack / unpack lowering patterns |
| - intel/fs: Completely re-write the combine constants pass |
| - intel/fs: Combine constants for SEL instructions too |
| - intel/fs: Combine constants for integer instructions too |
| - intel/fs: New VGRF packing scheme for constant combining |
| - intel/compiler: Combine control barriers with identical memory semantics |
| - intel/compiler: Don't evict for workgroup-scope fences |
| - glsl/list: Clean up an inappropriate comment |
| - util/rb-tree: Work around C++'s dislike of offsetof |
| - util/rb-tree: Inline rb_tree_init |
| - intel/fs: Don't continue fixed point iteration just because liveout changes |
| - intel/fs: Don't try to copy propagate into a source again after progress is made |
| - intel/fs: Make try_constant_propagate and try_copy_propagate file private |
| - intel/fs: Move src.file checks out of try_constant_propagate and try_copy_propagate |
| - intel/fs: Don't loop in try_constant_propagate |
| - intel/fs: Simplify check in can_propagate_from |
| - intel/fs: Make opt_copy_propagation_local file private |
| - intel/fs: Encapsulate per-block ACP in a structure |
| - intel/fs: Use rb_tree to store ACP entries by source |
| - intel/fs: Use rb_tree to store ACP entries by destination |
| - intel/fs: Use rb_tree for copy prop dataflow |
| - intel/fs: Merge copy prop dataflow loops |
| - intel/compiler/xe2: Update fs_visitor::setup_vs_payload to account for Xe2 reg size |
| - intel/compiler/xe2: Use SIMD16 for nir_intrinsic_image_size |
| - intel/compiler/xe2: TXD is lowered to SIMD16 in SIMD32 mode |
| - nir/rematerialize: Rematerialize ALUs used only by compares with zero |
| - intel/compiler/xe2: Handle new URB read messages |
| - intel/compiler/xe2: Handle new URB write messages |
| - intel/compiler/xe2: Update fs_visitor::emit_urb_writes to not assume SIMD8 |
| - spirv: Track when a shader has a cooperative matrix |
| - intel/fs: Add DP4A to get_lowered_simd_width |
| - nir/split_vars: Don't split arrays of cooperative matrix types |
| |
| Igor Torrente (4): |
| |
| - zink: Fix enumerate devices when running compositor |
| - zink: Removes \`disable_xcb_surface` |
| - zink: Fix one addicional case when running a compositor |
| - zink: fix for startup crash of weston running on top of zink + venus |
| |
| Illia Abernikhin (2): |
| |
| - state_tracker: moving initialisation of whandle out from if statement whandle initialization inside if statement but used also outside |
| - i915: change format in dbg string Actually, uintptr_t is of type unsigned long, but the debug line uses the %d format specifier, which expects an int. |
| |
| Illia Polishchuk (7): |
| |
| - iris: remove NULL check for already dereferenced pointer earlier |
| - s/Intel: fix/anv: fix: potentially overflowing expression in genX |
| - glx: fix dead code when gc var cannot be null due to earlier check |
| - state_tracker: fix dereference before null check |
| - anv, drirc: Add workaround to speed up Cyberpunk 2077 reg allocation |
| - zink: move find_sampler_var from zink to nir core |
| - nir: fix invalid sampler search by texture id |
| |
| Italo Nicola (24): |
| |
| - mesa/main: account for RTT samples when updating framebuffer |
| - mesa/main: allow readpix/teximage to read from implicitly multisampled fbos |
| - panfrost/genxml: fix Surface With Stride descriptor alignment |
| - panfrost/genxml: add Multiplanar Surface descriptor |
| - panfrost: refactor (un)packing of surface descriptors |
| - pan/decode: decode Multiplanar Surface descriptors |
| - panfrost: prepare pan_image_view for multiplanar formats |
| - panfrost: prepare the driver to support YUYV and variants |
| - panfrost: advertise support for YUYV and variants |
| - panfrost: mandate proper alignment requirement depending format and arch |
| - panfrost: add PAN_MESA_DEBUG=yuv for debugging yuv sampler |
| - gallium/st: add non-CSC lowering of I420 as PIPE_FORMAT_R8_G8_B8_420 |
| - gallium/st: add non-CSC lowering of YV12 as PIPE_FORMAT_R8_B8_G8_420 |
| - pan/bi: add support for I420 and YV12 sampling |
| - gallium/st: lower NV21 to R8_B8G8 instead of G8_B8R8 |
| - panfrost: fix invalid memory access in get_equation_str() |
| - pan/decode: handle more than one panfrost_device |
| - panfrost/ci: updated CI expectations |
| - egl: reenable partial redraw with a warning when using gallium hud |
| - pan/genxml: add Width/Height fields to v9+ Plane descriptor |
| - panfrost: rename _needs_multiplanar_descriptor to _is_yuv |
| - panfrost: prepare v9+ to support YUV sampling |
| - panfrost: use centered YUV chroma siting |
| - panfrost: advertise YUV formats for valhall |
| |
| Iván Briano (23): |
| |
| - anv: ensure CFE_STATE is emitted for ray tracing pipelines |
| - iris: ensure mesh is disabled on context init |
| - anv: ensure mesh is disabled on context init |
| - anv: implement Wa_14019750404 |
| - intel/compiler: call brw_nir_adjust_payload from brw_postprocess_nir |
| - anv,hasvk: respect provoking vertex setting on geometry shaders |
| - anv: fix missing 3DSTATE_SBE_CLIP emission |
| - anv: ensure pipelines have all state |
| - anv: tell blorp to do mesh stuff only if it's enabled |
| - blorp: fix hangs with mesh enabled |
| - anv: use a simpler MUE layout for fast linked libraries |
| - anv: track what kind of pipeline a fragment shader may be used with |
| - intel/fs: read viewport and layer from the FS payload |
| - intel/fs: handle URB setup for fast linked mesh pipelines |
| - anv: enable VK_EXT_mesh_shader where supported |
| - intel/fs: use ffsll so we don't explode on 32 bits |
| - vulkan/runtime: add internal parameter to vk_spirv_to_nir |
| - nir/lower_int64: respect rounding mode when casting to float |
| - intel/compiler: round f2f16 correctly for RTNE case |
| - util: add double_to_float16 helpers |
| - nir: round f2f16{_rtne/_rtz} correctly for constant expressions |
| - anv: advertise VK_KHR_global_priority_queue |
| - anv: use the right vertexOffset on CmdDrawMultiIndexed |
| |
| Jani Nikula (1): |
| |
| - docs/vulkan: fixup some typos |
| |
| Janne Grunau (4): |
| |
| - asahi: toggle more barrier bits after transform feedback |
| - asahi,agx: Fix stack buffer overflow in agx_link_varyings_vs_fs |
| - asahi,agx: Upload constant buffers immediately |
| - asahi: decode: Fix uint64_t format modifiers in agxdecode_stateful() |
| |
| Jesse Natalie (2): |
| |
| - nir_lower_mem_access_bit_sizes: Fix write-mask-constrained 3-byte stores as atomics |
| - d3d12: Fix multidimensional array ordering |
| |
| Jianxun Zhang (1): |
| |
| - intel/common: Only set op mask on instructions in decoder |
| |
| Jonathan Marek (2): |
| |
| - freedreno: move redump.h to common code + cleanup |
| - tu: add a TU_DEBUG=rd option for cmdstream dumping |
| |
| Jordan Justen (73): |
| |
| - isl: Add ISL_SURF_USAGE_STREAM_OUT_BIT |
| - anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS |
| - genxml/hsw: Add additional MOCS field enumerations |
| - genxml/chv: Add MEMORY_OBJECT_CONTROL_STATE_CHV to document compared to BDW |
| - isl/dev: Add uncached MOCS value |
| - isl: Set MOCS to uncached for MTL stream-out |
| - intel/isl: Use intel_needs_workaround() for MTL CCS WA |
| - intel/compiler: Use nir SUBGROUP_INVOCATION for RT TOPOLOGY_ID |
| - intel/dev: Add LNL platform enum |
| - intel/dev: Support xe2 device init (for intel_device_info_test) |
| - intel/tools: Use 'env bash' to find bash executable |
| - intel/decoder: Fix xml filename when verx10 % 10 is not 0 |
| - intel/decoder: Add intel_spec_load_common() |
| - intel/decoder: Make intel_spec_load_filename() have separate dir and name strings |
| - intel/genxml: Align "Texture Coordinate Mode" naming |
| - intel/genxml: Split some genxml sorting code into a intel_genxml module |
| - intel/genxml: Convert gen_bits_header to use ElementTree |
| - intel/genxml: Convert gen_pack_header to use ElementTree |
| - intel/genxml: Add GenXml class into intel_genxml module |
| - intel/genxml: Add filter_engines() to GenXml class |
| - intel/genxml: Move sorting & writing into GenXml class |
| - intel/genxml: Don't rewrite sorted xml if the contents didn't change |
| - intel/genxml: Add final newline to output when saving xml |
| - intel/genxml: Update xml with gen_sort_tags.py output |
| - intel/dev: Use RPL-U name on RPL-U devices |
| - intel/dev: Add more RPL PCI IDs |
| - anvil,hasvk: Rename need_clflush to need_flush |
| - intel/common: Move intel_clflush.h to intel_mem.h/intel_mem.c |
| - anvil,hasvk: Replace intel_clflush_range with intel_flush_range |
| - intel/common: Add intel_flush_range_no_fence |
| - anvil,hasvk: Use intel_flush_range_no_fence to flush command buffers |
| - util/u_cpu_detect: Drop unused has_tsc |
| - util/u_cpu_detect: Detect clflushopt support |
| - meson: Check for the __builtin_ia32_clflushopt function |
| - intel/clflush: Add support for clflushopt instruction |
| - intel/dev/xe: Move placeholder subslice info into XEHP_FEATURES |
| - intel/genxml: Ignore tail leading/trailing whitespace in node_validator() |
| - intel/genxml: Fix comparing xml when node counts differ |
| - intel/dev: Update device string for MTL PCI ID 0x7d55 |
| - intel/genxml: Support importing from another genxml file |
| - intel/genxml: Add support for excluding items when importing |
| - intel/genxml: Add all xml files as pack dependencies |
| - intel/genxml: Add GenXml.optimize_xml_import() |
| - intel/genxml: Drop assertion to allow for importing |
| - intel/genxml: Add GenXml.add_xml_imports method |
| - intel/genxml: Add GenXml.flatten_xml() method |
| - intel/genxml: Add genxml_import.py script |
| - intel/decoder: ralloc_steal() values from spec context for fields and enums |
| - intel/decoder: Implement support for importing genxml |
| - intel/genxml: Start Xe2 support |
| - intel/genxml: Auto-import genxml files using genxml_import.py |
| - intel/common: Add sse2_args for 32-bit build when -Dsse2=false was set |
| - intel/compiler/fs: Support Xe2 reg size in assign_curb_setup |
| - intel/compiler: Update opt_split_sends() for Xe2 reg size |
| - intel/compiler: Update emit_rt_lsc_fence() for Xe2 |
| - intel/compiler: Update lower_trace_ray_logical_send() for Xe2 |
| - intel/compiler: Update ray-tracing intrinsic lowering for Xe2 |
| - intel/compiler: Update RT stack_id access for Xe2 |
| - intel/fs: Update SSBO & shared uniform block loads for Xe2 |
| - intel/genxml: Build with gen20.xml |
| - intel/isl: Build for Xe2 |
| - iris: Build for Xe2 |
| - anv/blorp: Use anv_genX to set device->blorp.exec |
| - anv: Disable Ray Tracing on xe2 until our compiler supports Xe2 RT |
| - anv: Build for Xe2 |
| - anv: Print warning that Xe2 is not supported rather than failing |
| - intel/compiler: Add enum xe2_lsc_cache_store |
| - intel/compiler: Use enum xe2_lsc_cache_store on xe2 |
| - intel/compiler: Add enum xe2_lsc_cache_load |
| - intel/compiler: Use enum xe2_lsc_cache_load on xe2 |
| - anv/batch: Check if batch already has an error in anv_queue_submit_simple_batch() |
| - anv/batch: Assert that extend_cb is non-NULL if the batch is out of space |
| - intel/dev: Add 0x56ba-0x56bd DG2 PCI IDs |
| |
| Jose Maria Casanova Crespo (2): |
| |
| - vc4: mark buffers as initialized at vc4_texture_subdata |
| - vc4: Fix mask RGBA validation at YUV blit |
| |
| José Expósito (3): |
| |
| - zink: Fix crash on zink_create_screen error path |
| - zink: fix dereference before NULL check |
| - zink: allow software rendering only if selected |
| |
| José Roberto de Souza (51): |
| |
| - anv: Use workaround framework to Wa_14016118574 |
| - intel/aux_map: Nuke format_enum |
| - intel/aux_map: Use get_aux_entry() in remove_mapping() |
| - intel/aux_map: Replace magic number by INTEL_AUX_MAP_ENTRY_VALID_BIT |
| - intel/aux_map: Rename some variables to improve readability |
| - intel/aux_map: Mask out bits above index 47 in intel_aux_get_meta_address_mask() |
| - intel/aux_map: Convert l1_entry_addr_out to canonical |
| - intel/aux_map: Drop magic sub table size number |
| - intel/aux_map: Add function and macro to return l2 and l1 table masks |
| - anv: Add gem_create_userptr() to KMD backend |
| - anv: Replace handle by anv_bo in the gem_close() |
| - anv: Add support for userptr in Xe KMD |
| - intel: Sync xe_drm.h |
| - intel/dev/xe: Add support for small-bar setups |
| - anv: Request Xe KMD to place BOs to CPU visible VRAM when required |
| - iris: Request Xe KMD to place BOs to CPU visible VRAM when required |
| - iris/xe: Call iris_lost_context_state() when batch engine is replaced |
| - intel/dev: Port intel_dev_info tool to Xe KMD |
| - iris: Replace I915_EXEC_FENCE_SIGNAL by IRIS_BATCH_FENCE_SIGNAL in common code |
| - intel: Move i915_drm.h specific code from common/intel_gem.h to common/i915/intel_gem.h |
| - intel/common: Move functions inside of C++ ifdef |
| - intel: Rename intel_gem_add_ext() to intel_i915_gem_add_ext() |
| - iris: Move i915_gem_set_domain() call to i915 backend |
| - iris: Move iris_bufmgr_bo_close() to kmd backend |
| - iris: Add gem_create_userptr() to KMD backend |
| - iris: Add support for userptr in Xe KMD |
| - intel/genxml/gen125: Add missing fields in MI_MATH |
| - iris: Set MI_MATH MOCS field |
| - anv: Set MI_MATH MOCS field |
| - intel/tests/mi_builder: Set MI_MATH MOCS field |
| - intel/genxml/gen125: Set MI_MATH MOCS field as non-zero |
| - anv: Nuke unused READ_ONCE() from anv_batch_chain.c |
| - anv: Remove VkAllocationCallbacks parameter from reloc functions |
| - anv: Return earlier in anv_reloc_list functions |
| - intel: Sync xe_drm.h and rename engine to exec_queue |
| - anv: Override vendorID for Hogwarts Legacy |
| - intel/isl: Remove unknown workaround |
| - intel/isl: Remove Wa_22011186057 |
| - anv: Update Wa_16014390852 for MTL |
| - intel: Sync xe_drm.h |
| - anv: Move i915 specific gem_set_caching to backend |
| - anv: Move i915 specific code from common anv_gem.c |
| - anv: Move bo_alloc_flags_to_bo_flags() to backend |
| - anv: Move i915 handling of imported bos bo_flags |
| - anv: Remove i915_drm.h include from common code |
| - iris: Lock bufmgr->lock before call vma_free() in error path |
| - iris: Nuke useless flags from iris_fine_fence_new() |
| - intel: Prepare implementation of Wa_18019816803 and Wa_16013994831 for future platforms |
| - intel: Sync xe_drm.h |
| - anv: Switch Xe KMD vm bind to sync |
| - anv: Add missing ANV_BO_ALLOC_EXTERNAL flags when calling anv_device_import_bo() |
| |
| Juan A. Suarez Romero (7): |
| |
| - broadcom/ci: update expected results |
| - vc4/ci: update expected results |
| - v3d/shim: include new ioctl parameters |
| - v3dv/ci: update expected list |
| - broadcom: add performance counters for V3D 7.x |
| - broadcom/simulator: add per-hw version calls |
| - v3d/vc4/ci: add new fails/timeout |
| |
| Julia Tatz (10): |
| |
| - gallium/dri: fix dri2_from_names |
| - aux/trace: skip multi-line comments in enums2names |
| - aux/trace: deduplicate enum dump macro work |
| - aux/trace: move trace_sample_view logic |
| - aux/trace: fix set_hw_atomic_buffers method name |
| - aux/trace: add screen video methods |
| - aux/trace: add context video methods |
| - aux/trace: wrap video_codec & video_buffer |
| - aux/trace: unwrap refrence frames in picture_desc |
| - aux/trace: trace video_buffer method return vals |
| |
| Julia Zhang (1): |
| |
| - radeonsi: modify algorithm of skipping holes of sparse bo |
| |
| Julian Hagemeister (1): |
| |
| - Gallium: Fix shared memory segment leak |
| |
| Juston Li (10): |
| |
| - zink: remove venus from renderpass optimizations |
| - venus: sync protocol for VK_EXT_vertex_input_dynamic_state |
| - venus: implement VK_EXT_vertex_input_dynamic_state |
| - venus: set lvp queries as saturate on overflow |
| - venus: add helper function to get cmd handle |
| - venus: refactor out common cmd feedback functions |
| - venus: support deferred query feedback recording |
| - venus: track/recycle appended query feedback cmds |
| - venus: append query feedback at submission time |
| - venus: switch to unconditionally deferred query feedback |
| |
| Kai Wasserbäch (3): |
| |
| - fix: clover: LLVM 18 renamed/moved CGFT_*, update compat layer |
| - fix: clover: LLVM 18: s/CodeGenOpt::/CodeGenOptLevel::/ |
| - fix: clover: warning: ignoring return value of ‘int posix_memalign(…)’ [-Wunused-result] |
| |
| Karmjit Mahil (29): |
| |
| - pvr: Remove mrt setup from SPM EOT |
| - pvr: Compile SPM EOT shader |
| - pvr: Use the SPM EOT on barrier stores |
| - pvr: Remove some magic numbers and increments from km stream |
| - pvr: Restructure \`rogue_kmd_stream.xml` |
| - pvr: Submit PR commands |
| - pvr: Use the correct size for the unified store allocation |
| - pvr: Allow query stage for barrier sub cmds |
| - pvr: Fix occlusion query unaccounted for user fences |
| - pvr: Fix writing query availability write out |
| - pvr: Fix packing issue with max_{x,y}_clip |
| - pvr: Fix csb relocation status assert on \`pvr_csb_finish()` |
| - pvr: Fix \`for` loop itarator usage |
| - pvr: Fix dynamic desc offset storage |
| - pvr: Fix cubemap layer stride |
| - pvr: Use the render passes' attachments array to setup ISP state |
| - pvr: Adjust EOT PBE state to account for the iview's base array layer |
| - pvr: Fix MRT index in PBE state |
| - pvr: Fix pbe_emit assert |
| - pvr: Fix OOB access of pbe_{cs,reg}_words |
| - pvr: Order tile buffer EOT emits to be last |
| - pvr: Fix subpass sample count on ds attachment only |
| - pvr: Refactor subpass ds and sample count setup |
| - pvr: Fix SPM load shader sample rate |
| - pvr: Fix PPP_SCREEN sizes |
| - vulkan: Add \`vk_subpass_dependency_is_fb_local()` helper |
| - tu: Use common \`vk_subpass_dependency_is_fb_local()` |
| - pvr: Don't merge subpasses on framebuffer-global dependancy |
| - pvr: Only setup the bgobj to load if we have a load_op |
| |
| Karol Herbst (213): |
| |
| - nvc0: initial Ada enablement |
| - rusticl/mesa: make svm_migrate optional |
| - llvmpipe: enable system SVM |
| - nvc0: fix num_gprs for Volta+ |
| - rusticl: fix warnings with newer rustc |
| - gm107/ir: fix SULDP for loads without a known format |
| - nv50/ir/nir: fix txq emission on MS textures |
| - nv50/ir/nir: Fix zero source handling of tex instructions. |
| - rusticl/kernel: only handle function_temp memory before lowering printf |
| - meson,ci: bump meson req for rusticl to 1.2 |
| - rusticl/nir: add helper functions we need for a NIR_PASS macro |
| - rusticl/nir: add a nir_pass macro |
| - rusticl/nir: use the new nir_pass macro |
| - rusticl/kernel: rename res to internal_args inside lower_and_optimize_nir_late |
| - rusticl/kernel: merge lower_and_optimize_nir_pre_inputs and lower_and_optimize_nir_late |
| - rusticl/kernel: move things around in lower_and_optimize_nir |
| - rusticl/kernel: get rid of initial function_temp type lowering |
| - rusticl/kernel: mark can_remove_var as unsafe and document it |
| - n50/compute: submit initial compute state in nv50_screen_create |
| - nvk: add vulkan skeleton |
| - nouveau/winsys: add the new winsys implementation |
| - nvk: use winsys lib |
| - nvk: fix nvk_buffer include guards |
| - nouveau/headers: add script to sync in-tree headers with open-gpu-doc |
| - nouveau/headers: initial sync of headers |
| - nvk: implement GetPhysicalDeviceQueueFamilyProperties2 to make the CTS happy |
| - nvk: advertize memory heaps and types |
| - nouveau/ws: reorganize a little |
| - nouveau/ws: dup the fd |
| - nouveau/ws: add a field for the SM version |
| - nvk: set nonCoherentAtomSize as the CTS divides with this value |
| - nouveau/ws: add bo API |
| - nvk: add basic device memory support |
| - nouveau/headers: add nvtypes.h |
| - nouveau/headers: typedef Nv void types |
| - nouveau/headers: add host classes |
| - nouveau/ws: add context support |
| - nouveau/ws: add a cmd buffer |
| - novueau/bo: refcount it |
| - novueau/bo: add nouveau_ws_bo_wait |
| - nvk: allocate a GPU context for each VkDevice |
| - nvk: add nvk_bo_sync |
| - nvk: add nvk_CmdPipelineBarrier2 stub |
| - nvk: impl nvk_CmdCopyBuffer |
| - nouveau/ws: fix setting push bo domains |
| - nouveau/ws: PUSH_IMMD only works with 16 bit values |
| - nouveau/ws: set GPU object class |
| - nouveau/ws: bind 2D class |
| - nvk: use fermi class definitions |
| - nvk: add basic support for images |
| - nvk: simple format table |
| - nvk: add support for blits |
| - nvk: report maxMipLevels as 1 |
| - nvk: optimize blit command buffer gen |
| - nvk: implement CmdFillBuffer |
| - nvk: implement CmdUpdateBuffer |
| - nvk: implement CmdCopyBuffer2 |
| - nvk: advertise VK_KHR_copy_commands2 |
| - nvk: implicitly reset the command buffer |
| - nouveau/ws: handle 0inc inside nvk_push_val as well |
| - nvk: reduce pitch even further in CmdFillBuffer |
| - nvk: support multiple miplevels |
| - nvk: support array blits over multiple layers |
| - nvk: tiling prep work for VK_EXT_image_2d_view_of_3d |
| - nouveau/ws: make sure we don't submit nonsense |
| - nouveau/ws: assert on broken channel |
| - nvk/blit: assert that formats are supported |
| - nouveau/headers: Generate parser functions |
| - nouveau/ws: initial debugging options for command submissions |
| - nouveau/ws: depend on generated class header files |
| - nouveau/ws: get rid of libdrm |
| - nouveau/ws: use new NVIF interface to query oclasses |
| - nvk: set deviceName |
| - nouveau/headers: add path for 3D headers |
| - nouveau/headers: initial 3D headers import |
| - nouveau/ws: allocate 3D subchan |
| - nouveau/ws: allocate copy subchan as well |
| - nouveau/ws: add API to query if the context was killed |
| - nouveau/ws: add a bo unmap helper function |
| - nvk: clean up bo mappings |
| - nouveau/ws: bound check nouveau_ws_push_append |
| - nouveau/ws: rework refing push buffer bos |
| - nouveau/ws: push chaining |
| - nvk: fix OOB read inside nvk_get_va_format |
| - nvk: alloc a zero page and use it for vertex runouts |
| - nvk: fix zero page refing |
| - nvk: support exporting buffers |
| - nvk: fix some class version checks |
| - nvk: properly align shaders pre Turing |
| - nvk: rework QMD handling to support pre Turing |
| - nvk: align desc root table |
| - nvk: Use SET_PIPELINE_PROGRAM pre-Volta |
| - nvk: properly align slm size |
| - nvk: use remaps for image copies |
| - nvk: reduce pitch for FillBuffer |
| - nvk: bind more subchans in init_context_state |
| - nvk: support pre Maxwell Texture Headers |
| - nvk/device: fix order of error handling |
| - nvk: allocate VAB memory area |
| - nvk: wire up M2MF for Fermi |
| - nouveau/mme: add test for BEQ with magic exit offset |
| - nouveau/mme: add a macro exit helper |
| - nvk: Add a macro to set MMIO registers via falcons |
| - nouveau/winsys: fix SM value for Ada |
| - nvk: fix num_gprs for Volta+ |
| - nvk: replace mp with tpc |
| - nvk: properly calculate SLM region by taking per arch limits into account |
| - nouveau: fix max_warps_per_mp_for_sm for builds with asserts disabled |
| - nvk: enable fp helper invocations loads on more gens |
| - nv50/ir: use own info struct for sys vals |
| - nv50/ir: convert system values to gl_system_value |
| - nouveau/mme: fix OOB access inside while_ine builder test |
| - nouveau/mme: fix OOB inside tu104 simulator |
| - clc: use CLANG_RESOURCE_DIR for clang's resource path |
| - nv50: fix code uploads bigger than 0x10000 bytes |
| - nouveau: take glsl_type ref unconditionally |
| - rusticl/kernel: optimize nir between lowering io and explicit types |
| - nv50: limit max code uploads to 0x8000 |
| - zink: fix source type in load/store scratch |
| - zink: fix global stores |
| - zink: update some compute caps |
| - rusticl: add debug option to sync every event |
| - rusticl/device: _MAX_CONST_BUFFER0_SIZE is unsigned |
| - ci: disable a660 jobs |
| - nir: make workgroup_id 32 bit only |
| - nir: make num_workgroups 32 bit only |
| - ac: drop 64 bit handling for cl workgroup intrinsics |
| - gallivm/nir: drop 64 bit handling for cl workgroup intrinsics |
| - intel/compiler: drop 64 bit handling for cl workgroup intrinsics |
| - panfrost: drop 64 bit handling for cl workgroup intrinsics |
| - rusticl: reduce global_invocation_id_zero_base to 32 bit |
| - panfrost: drop pan_nir_lower_64bit_intrin |
| - rusticl/disk_cache: fix stack corruption |
| - rusticl/query: fix use-after-free, but also fix incorrect usage of unsafe |
| - rusticl/event: disable profiling for devices without timestamps |
| - rusticl/queue: properly implement clCreateCommandQueueWithProperties |
| - rusticl/memory: do not verify pitch for IMAGE1D_BUFFER |
| - rusticl/memory: only specify PIPE_BIND_SHADER_IMAGE where supported |
| - asahi: fetch available system memory |
| - asahi: lower hadd |
| - asahi: handle kernels |
| - asahi: handle load_workgroup_size |
| - asahi: handle load_global_invocation_id_zero_base |
| - asahi: implement get_compute_state_info |
| - asahi: implement set_global_binding |
| - asahi: implement clear_buffer |
| - asahi: gracefully handle allocating linear images |
| - asahi: handle images in is_format_supported |
| - rusticl/memory: fallback if allocating linear images fails |
| - rusticl: enable asahi |
| - rusticl/mesa: create contexts with PIPE_CONTEXT_NO_LOD_BIAS |
| - docs/features: cl_khr_3d_image_writes needs driver support |
| - rusticl/mesa: fix \`set_constant_buffer` when passing an empty buffer |
| - rusticl/kernel: skip adding global id offsets if not used |
| - meson/rusticl: add sha1_h |
| - rusticl/mesa/context: fix clear_sampler_views |
| - nir: add nir_lower_alu_vec8_16_srcs pass |
| - zink: lower vec8/16 |
| - rusticl/mesa: create COMPUTE_ONLY contexts |
| - rusticl: fix clippys bool_to_int_with_if |
| - rusticl/memory: fix potential use-after-free in clEnqueueSVMMemFill |
| - nir/load_libclc: fix libclc memory leak |
| - rusticl/kernel: Fix creation from programs not built for every device |
| - ci: add half-life 2 freedreno flake |
| - zink: implement get_compute_state_info |
| - zink: copy has_variable_shared_mem cs property |
| - zink: pass entire pipe_grid_info into zink_program_update_compute_pipeline_state |
| - zink: refactor spec constant handling |
| - zink: variable shared mem support |
| - zink: support more nir opcodes |
| - zink: make spirv_builder_emit_*op compatible with spec constants |
| - zink: support samplers with unnormalized_coords |
| - zink: implement remaining pack ops via bitcast |
| - zink: fix RA textures |
| - zink: fix load/store scratch offsets |
| - rusticl/mesa/screen,device: add driver_name |
| - rusticl: enable zink |
| - pipe-loader: allow to load multiple zink devices |
| - rusticl: bump rustc version to 1.66 |
| - rusticl/mesa/nir: mark more methods as mut |
| - rusticl/mesa/nir: Mark NirShader and NirPrintfInfo as Send and Sync |
| - rusticl/mesa: mark PipeResource as Send and Sync |
| - rusticl/mesa: mark PipeTransfer as Send |
| - rusticl/cl: mark _cl_image_desc as Send and Sync |
| - rusticl/queue: get rid of pointless Option around our worker thread handle |
| - rusticl/queue: make it Sync |
| - rusticl/kernel: get rid of Arcs in KernelDevStateVariant |
| - rusticl/memory: use get_mut instead of lock in drop |
| - zink: implement PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS |
| - rusticl/api: remove cl_closure macro |
| - zink: implement load_global_constant |
| - zink: properly emit PhysicalStorageBufferAddresses cap |
| - nir/lower_mem_access_bit_sizes: fix invalid shift bit_size |
| - rusticl/device: restrict 1Dbuffer images for RGB and RGBx |
| - rusticl/memory: use PIPE_BUFFER for IMAGE1D_BUFFER images |
| - rusticl/format: disable all sRGB formats |
| - asahi: flush denorms on exact fmin/fmax |
| - zink: wrap shared memory blocks in a struct |
| - zink: properly alias shared memory |
| - zink: fix zink_destroy_screen for early screen creation fails |
| - docs/features: remove empty lines confusing mesamatrix |
| - rusticl/device: restrict image_buffer_size |
| - rusticl/device: restrict param_max_size further |
| - rusticl/mem: properly set pipe_image_view::access |
| - zink: lower fisnormal as it requires the Kernel Cap |
| - radv: fix buffers in vkGetDescriptorEXT with size not aligned to 4 |
| - rusticl/queue: Only take a weak ref to the last Event |
| - rusticl/mesa: pass PIPE_BIND_LINEAR in resource_create_texture_from_user |
| - zink: deallocate global_bindings array |
| - rusticl/mesa/screen: do not derefence the entire pipe_screen struct |
| - nvc0: implement PIPE_CAP_TIMER_RESOLUTION |
| - rusticl/queue: do not send empty lists of event to worker queue |
| - rusticl/queue: fix implicit flushing of queue dependencies |
| |
| Kenneth Graunke (21): |
| |
| - iris: Re-emit 3DSTATE_DS for each primitive (workaround 14019750404) |
| - intel/compiler: Fix sparse cube map array coordinate lowering |
| - intel/compiler: Respect NIR_DEBUG_PRINT_INTERNAL for DEBUG_OPTIMIZER |
| - intel/fs: Account for payload GRFs when calculating register pressure |
| - intel/compiler: Move SCHEDULE_NONE handling into schedule_instructions() |
| - intel/fs: Index scheduler mode string table by mode enum |
| - intel/fs: Make helpers for saving/restoring instruction order |
| - intel/fs: Pick the lowest register pressure schedule when spilling |
| - intel/fs: Dump IR for pre-RA scheduler modes in DEBUG_OPTIMIZER |
| - iris: Check prog[] instead of uncompiled[] for BLORP state skipping |
| - nir: Fix function parameter indentation in nir_opt_barriers.c |
| - nir: Add an optimization pass to reduce barrier modes |
| - nir: Reduce the scope of shared memory barriers |
| - lavapipe: Don't delete control barriers |
| - virgl, nir_to_tgsi: Add a hack for promoting partial memory barriers |
| - dxil: Set UAV_FENCE_THREAD_GROUP any time global isn't required |
| - glsl: Use nir_opt_barrier_modes() to drop unnecessary barriers |
| - anv: Use nir_opt_barrier_modes() to drop unnecessary barriers |
| - mesa: Fix zeroing of new ParameterValues array entries when growing |
| - intel/fs: Fix Xe2 URB read/lowering with per-slot offsets |
| - anv: Add support for a transfer queue on Alchemist |
| |
| Kevron Rees (1): |
| |
| - Force vk vendor for spider-man remastered |
| |
| Konrad Dybcio (5): |
| |
| - freedreno: Set magic writes per-GPU, using existing data |
| - freedreno: Include speedbin fallback in 740 chipid to fix probing |
| - freedreno: Include speedbin fallback in 730 chipid to fix probing |
| - freedreno: Include speedbin fallback in 690 chipid to fix probing |
| - freedreno: Add Adreno 643 |
| |
| Konstantin Seurer (95): |
| |
| - radv: Stop using the misleading round_up_u* functions |
| - radv/meta_buffer: Stop setting RADV_META_SAVE_DESCRIPTORS |
| - radv/meta_buffer: Rename size_minus16 to max_offset |
| - llvmpipe: Fix compiling with LP_USE_TEXTURE_CACHE |
| - nir/tests: Refactor boilerplate into a common header |
| - nir/tests: Use a single binary |
| - draw: Do not restart the primitive_id at 0 |
| - gallivm: Fix subsampled format sampling under Vulkan |
| - gallivm: Ignore nir_tex_src_plane |
| - lavapipe: Remove dummy sampler ycbcr conversion |
| - lavapipe: Store immutable_samplers as lvp_sampler array |
| - lavapipe: Fix binding immutable samplers with desc buffers |
| - lavapipe: Implement samplerYcbcrConversion |
| - lavapipe: Advertise samplerYcbcrConversion |
| - llvmpipe: Zero extend vectors in widen_to_simd_width |
| - vulkan: Add a generated vk_properties struct |
| - radv: Use common physical device properties |
| - clang-format: Disable formatting by default |
| - lavapipe: Use common physical device properties |
| - nir/from_ssa: Don't insert store_reg instructions before phis |
| - gallivm: Run nir_convert_to_lcssa before nir_convert_from_ssa |
| - lavapipe/ci: Remove descriptor_indexing fails |
| - radv/rt: Rename shader_pc and next_shader |
| - radv/rt: Rename traversal_shader to traversal_shader_addr |
| - nir/opt_large_constants: Handle small float arrays |
| - bin: Update spirv sources |
| - vulkan: Allow beta extensions for physical device features |
| - vulkan: Allow beta extensions for physical device properties |
| - vulkan Add enqueue entrypoint for CmdDispatchGraphAMDX |
| - nir: Add shader enqueue data structures and handling |
| - spirv: Update headers and grammer JSON |
| - spirv: Implement SPV_AMDX_shader_enqueue |
| - lavapipe: Add lvp_pipeline_type |
| - lavapipe: Implement exec graph pipelines |
| - lavapipe: Implement AMDX_shader_enqueue commands |
| - lavapipe: Advertise AMDX_shader_enqueue |
| - radv: Add internal_nodes_offset to scratch_layout |
| - radv: Remove leaf_args::dst_offset |
| - radv/rt: Remove some dead code |
| - radv/rt: Do not apply stack_ptr for non-recursive stages |
| - radv/rt: Add and use radv_build_traversal |
| - radv/rt: Insert rt_return_amd before lowering shader calls |
| - radv/rt: Split stage initialization and hashing |
| - aco: Do not fixup registers if there are no shader calls |
| - radv: Stop updating the stack_size in insert_rt_case |
| - lavapipe: Lock around CSO destroys |
| - vulkan/wsi/x11: Implement capture hotkey using the keymap |
| - venus: Use the common GetPhysicalDeviceFeatures2 implementation |
| - nir/lower_shader_calls: Limit the remat chain length |
| - lavapipe: Avoid lowering shaders twice |
| - lavapipe: Fix the locking around cso destruction |
| - aco/validate: Handle p_wqm like p_parallelcopy |
| - aco: Use bytes() instead of size() in emit_wqm |
| - aco: Unify demote and demote_if selection |
| - radv: Only generate debug info if required |
| - aco/lower_to_cssa: Fix typo |
| - radv: Don't use the depth image view for depth bias emission |
| - radv/rt: Store NIR shaders separately |
| - radv/rt: Add monolithic raygen lowering |
| - radv/rt: Enable monolithic pipelines |
| - radv/ci: Document new flake |
| - vulkan/properties: Handle unsized arrays properly |
| - radv: Remove dead radix_sort_vk_get_memory_requirements call |
| - radv/radix_sort: Vendor the radix sort dispatch code |
| - radv: Perform multiple sorts in parallel |
| - radv/ci: Improve ray tracing skips |
| - ac/llvm: Fix typed loads with 16bit formats |
| - ac/llvm: Use the correct return type for uadd_carry and usub_borrow |
| - ac/llvm: Use float types for float atomics |
| - radv: Don't advertise features requiring PS epilogs with LLVM |
| - radv: Update navi21 llvm fails |
| - radv/rt: Handle stages without nir properly |
| - radv: Remove ray tracing shader module identifier skips |
| - radv/bvh: Treat instances with mask == 0 as inactive |
| - radv/ray_queries: Skip cull_mask handling if it is FF |
| - radv/rt: Skip cull_mask handling if it is FF |
| - aco/spill: Make sure that offset stays in bounds |
| - nir: Add nir_cf_node_cf_tree_prev |
| - nir: Add nir_foreach_block_in_cf_node_reverse |
| - nir: Add nir_rematerialize_deref_in_use_blocks |
| - nir/lcssa: Fix rematerializing derefs |
| - nir/deref: Layer rematerialization helpers |
| - lavapipe/ci: Fix asan expectations |
| - hasvk: Use the common GetPhysicalDeviceFeatures2 implementation |
| - vulkan: Remove vk_get_physical_device_core_1_*_feature_ext |
| - radv/bvh/ploc: Load child bounds from LDS |
| - radv: Merge the sync_data and header initialization |
| - radv: Do not sync after radv_update_buffer_cp |
| - zink: Initialize primitive types to an invalid value |
| - nir/passthrough_gs: Support edge flags with points |
| - zink: Enable edge flags with points |
| - mesa: Fix glBegin/End when LINE_LOOP is not supported |
| - llvmpipe: Compile a nop texture function for unsupported configurations |
| - radv/rt: Use nir_shader_instructions_pass for lower_rt_instructions |
| - radv/sqtt: Fix tracing acceleration structure commands |
| |
| Lang Yu (5): |
| |
| - amd/common: add AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 property |
| - radeonsi: use AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 to determine wave size |
| - radeonsi: use wave size to determine index stride |
| - amd/common: add missing stuff for gfx11.5 |
| - amd/radeonsi: add missing stuff for gfx11.5 |
| |
| Leandro Ribeiro (13): |
| |
| - egl: rewrite outdated comment in _eglFindDevice() |
| - egl: remove unused parameter from _eglAddDRMDevice() |
| - egl: simplify _eglAddDRMDevice() |
| - egl: make explicit that we don't support render nodes for software EGLDevice |
| - egl: move is_render_node flag to platform_wayland |
| - loader: rename loader_open_render_node() to loader_open_render_node_platform_device() |
| - loader: add driver list as parameter in loader_open_render_node_platform_device() |
| - pipe-loader: add pipe_loader_get_compatible_render_capable_device_fd() |
| - dri: add queryCompatibleRenderOnlyDeviceFd() to __DRI_MESA extension |
| - kmsro: try to use only compatible render-capable devices |
| - loader: add loader_is_device_render_capable() |
| - egl/drm: get compatible render-only device fd for kms-only device |
| - egl: error out if we can't find an EGLDevice in _eglFindDevice() |
| |
| Leo Liu (4): |
| |
| - radeonsi: add AV1 profile to supported profile |
| - radeonsi/vcn: fix the incorrect dt_size |
| - Revert "frontends/va: Also map VAImageBufferType for reading" |
| - ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG |
| |
| Lina Versace (14): |
| |
| - docs: Add row for VK_KHR_maintenance5 |
| - intel/pci_ids: Consistently use lowercase |
| - venus: Sync protocol for VK_EXT_graphics_pipeline_library |
| - venus: Erase pViewports and pScissors in fewer cases |
| - venus: Fix crash when VkGraphicsPipelineCreateInfo::layout is missing |
| - venus: Fix subpass attachments |
| - venus: Drop incorrectly-used always-true pipeline vars |
| - venus: Use VkImageAspectFlags in vn_subpass |
| - venus: Add enum vn_pipeline_type |
| - venus: Renames for VkGraphicsPipelineCreateInfo fixes |
| - venus: Refactor pipeline fixup into two stages |
| - venus: Do pipeline fixes for VK_EXT_graphics_pipeline_library |
| - venus: Enable VK_EXT_graphics_pipeline_library behind debug flag |
| - venus: Fix -Wmaybe-uninitialized |
| |
| LingMan (22): |
| |
| - rusticl/memory: fix potential use-after-free in clEnqueueSVMFree |
| - rusticl: Rename XyzCB aliases to FuncXyzCB |
| - rusticl: add structs to hold the C callbacks |
| - rusticl: use CreateContextCB |
| - rusticl: use DeleteContextCB |
| - rusticl: use EventCB |
| - rusticl: use MemCB |
| - rusticl: use ProgramCB |
| - rusticl: use SVMFreeCb |
| - rusticl: Make EventSig take ownership of its environment |
| - rusticl: add a safe abstraction to execute a DeleteContextCB |
| - rusticl: add a safe abstraction to execute an EventCB |
| - rusticl: add a safe abstraction to execute a MemCB |
| - rusticl: add a safe abstraction to execute an SVMFreeCb |
| - rusticl: add a safe abstraction to execute a CreateContextCB |
| - rusticl: add a safe abstraction to execute a ProgramCB |
| - rusticl/api: drop a few include paths |
| - rusticl: mark the fields of callback structs private |
| - rusticl: drop an \`#[allow(dead_code)]` marker |
| - rusticl/core: don't take a lock while dropping \`Context` |
| - rusticl: Show an error message if the build is attempted with an outdated bindgen version |
| - rusticl: Show an error message if the version of bindgen can't be detected |
| |
| Lionel Landwerlin (169): |
| |
| - anv: hide exec_flags selection inside the i915 backend |
| - isl: add a tool to query surface parameters |
| - intel/fs: fix missing predicate on SEL instruction |
| - intel/compiler: rework input parameters |
| - ci/a530: switch a few tests to flakes to unblock CI |
| - vulkan: bump header register to 1.3.258 |
| - intel/fs: don't try to rebuild sequences of non ssa values |
| - intel/vec4: fix log_data pointer |
| - intel/fs: consider UNDEF as non-partial write |
| - intel/fs: add more UNDEFs around SEND messages |
| - isl: add ability to store buffer size in unused RENDER_SURFACE_STATE fields |
| - anv: simplify buffer address+size loads from descriptor buffer |
| - intel/fs: add support for sparse accesses |
| - intel/nir: handle image_sparse_load in storage format lowering |
| - intel/nir: add lower for sparse images & textures |
| - anv: wire image sparse loads |
| - blorp: switch blorp_update_clear_color to early return |
| - blorp: update and move fast clear PIPE_CONTROLs to drivers |
| - anv: fix 3DSTATE_RASTER::APIMode field setting |
| - anv: enable EDS3 ConservativeRasterizationMode |
| - vulkan: skip non required extension structures |
| - vulkan/runtime: add a layered implementation of vkCmdBindIndexBuffer |
| - anv: enable INTEL_DEBUG=nofc |
| - anv: fake non intel vendorID for Death Stranding |
| - hasvk: fix null descriptor handling with A64 messages |
| - anv: remove descriptor array bounds checking |
| - hasvk: remove descriptor array bounds checking |
| - anv/hasvk: track robustness per pipeline stage |
| - anv: implement VK_EXT_pipeline_robustness |
| - intel/fs: track more steps with INTEL_DEBUG=optimizer |
| - intel/fs: add variable for output of debug backend optimizer |
| - intel/decoder: constify some input parameters |
| - blorp: drop programming of 3DSTATE_(MESH|TASK)_SHADER |
| - anv: emit 3DSTATE_GS only once per pipeline |
| - intel/decoder: add options to decode surfaces/samplers |
| - anv: get rid of genX(emit_multisample) |
| - anv: move genX(rasterization_mode) to gfx8_cmd_buffer.c |
| - anv: don't try to access dynamic buffers from surface states |
| - iris: ensure stalling pipe control before fast clear |
| - intel/compiler: disable per-sample interpolation modes with non-per-sample dispatch |
| - intel/compiler: fix dynamic alpha-to-coverage handling |
| - intel/fs: implement dynamic interpolation mode for dynamic persample shaders |
| - intel/fs: move lower of non-uniform at_sample barycentric to NIR |
| - zink+anv: add regression testing with pipeline libraries |
| - anv: implement vkCmdBindIndexBuffer2KHR |
| - anv: handle new VkBufferViewUsageCreateInfoKHR |
| - anv: add vkGetRenderingAreaGranularityKHR() |
| - anv: implement GetDeviceImageSubresourceLayoutKHR/GetImageSubresourceLayout2KHR |
| - anv: add maintenance5 A8_UNORM/A1B5G5R5_UNORM support |
| - anv: deal with new pipeline flags |
| - anv: enable KHR_maintenance5 |
| - anv: add missing ISL storage usage |
| - genxml/gfx11: remove Tiled Resource Mode field from HIER_DEPTH_BUFFER |
| - genxml/gfx12: rename Tiled Resource Mode |
| - isl: program 3DSTATE_HIER_DEPTH_BUFFER_BODY::TiledMode as documented |
| - intel/isl: Disallow Yf, Ys and Tile64 for 3D depth/stencil surfaces |
| - isl: disable Yf/Ys/Tile64 tilings for 1D images |
| - isl: add a usage flag to request 2D/3D compatible views |
| - isl: disallow TileYs/Yf on 3D storage images on Gfx9/11 |
| - intel/isl: Add a max_miptail_levels field to isl_tile_info |
| - isl: make isl_surf_get_uncompressed_surf robust to argument accesses |
| - isl: add Gfx12/12.5 restriction on 3D surfaces & compression |
| - isl: disallow miptails on planar formats |
| - isl: disable miptails on gfx12 with yuv formats |
| - isl: disable CCS on Ys/Yf |
| - blorp: allow 3D blits/copies on Ys/Yf/Tile64 tiling |
| - intel/aux_map: correctly program tiling mode for Ys |
| - isl: reorder tiling selection |
| - anv: enable standard Y tiles |
| - isl/tilememcpy_test: add multiple tile testing |
| - anv: rename total_batch_size |
| - anv: reuse cmd_buffer::total_batch_size |
| - intel/measure: track batch buffer sizes |
| - intel/nir: rerun lower_tex if it lowers something |
| - intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL |
| - hasvk: add state cache invalidation back before fast clears |
| - blorp: remove unused variable |
| - anv: remove ReorderMode from pipeline 3DSTATE_GS emission |
| - anv: change anv_batch_emit_merge to also do packing |
| - intel/anv: batch stats util |
| - intel/decoder: implement accumulated prints |
| - anv: move all dynamic state emission to cmd_buffer_flush_dynamic_state |
| - anv: rename files to represent their usage |
| - anv: categorize partial/final pipeline instruction |
| - anv: split 3DSTATE_TE packing between static & dynamic parts |
| - anv: split 3DSTATE_VFG emission |
| - anv: add a flag tracking occlusion query count change |
| - anv: split pipeline programming into instructions |
| - vulkan/runtime: add helper to name dirty states |
| - anv: add new low level emission & dirty state tracking |
| - anv: remove unused state emission |
| - anv: split BLEND_STATE packing from BLEND_STATE_POINTERS emit |
| - docs: update Anv documentation about dynamic state emission |
| - anv: create individual logical engines on i915 when possible |
| - anv: Copy/Clear MSAA images over companion RCS while we are on compute |
| - pps-producer: add ability to select device with DRI_PRIME |
| - anv: remove aux checking asserts |
| - anv: bound image usages to the associated queue family |
| - anv: fix 3DSTATE_VFG emission |
| - anv: emit 3DSTATE_URB_ALLOC_(MESH|TASK) only when mesh shaders are enabled |
| - anv: ensure mesh pipeline have all pre-rasterization stages disabled |
| - anv: ensure partially packed instructions are emitted in the pipeline |
| - anv: fix missing 3DSTATE_SBE_MESH emission |
| - anv: fix utrace timestamp buffer copies |
| - anv: add a memcpy compute internal kernel |
| - anv: add simple shader support without a command buffer |
| - anv: move simple shaders code to its own object |
| - anv: move utrace flush out of backends |
| - anv: enable utrace timestamp buffer copies on compute engine |
| - intel: don't assume Linux minor dev node |
| - intel/ds: lock submissions to u_trace_context |
| - util/u_trace: count number of tracepoints |
| - intel/ds: track number of tracepoint timestamp copies |
| - anv/utrace: trace CPU on timestamp buffer readiness |
| - intel/ds: avoid dropping traces when running out of shared memory |
| - anv/iris: widen Wa_14015946265 to Gfx11+ |
| - anv: add missing workaround for 3DSTATE_LINE_STIPPLE |
| - iris: add missing workaround for 3DSTATE_LINE_STIPPLE |
| - intel/fs: handle ishl in surface/sampler rematerialization |
| - intel/fs: handle add3 in surface/sampler rematerialization |
| - intel/fs: switch from SIMD 1 to 8 instructions surface/sampler rematerialization |
| - anv: fix internal compute copy shader build |
| - anv: reduce working temporary memory for BVH builds |
| - anv: move bo_pool allocation flags to init caller |
| - anv: use buffer pools for BVH build buffers |
| - intel/ds: track acceleration RT commands |
| - anv: fix index buffer size programming |
| - anv: implement INTEL_DEBUG=reemit |
| - anv: add missing workaround handling in simple shader |
| - anv: fix a couple of missing input for 3DSTATE_RASTER programming |
| - anv: flag 3DSTATE_RASTER as dirty after simple shader primitive |
| - vulkan: bump headers/registry to 1.3.267 |
| - anv: rename primary in container in ExecuteCommands() |
| - anv: add support for VK_EXT_nested_command_buffer |
| - anv: simplify push descriptors |
| - anv: fixup spirv cap for ImageReadWithoutFormat on Gfx12.5 |
| - Revert "intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL" |
| - anv: update batch chaining to Gfx9 commands |
| - anv: workaround Gfx11 with optimized state emission |
| - u_trace: generate tracepoint index parameter in perfetto callbacks |
| - u_trace: generate tracepoint name array in perfetto header |
| - intel/ds: provide names for different events of a timeline's row |
| - anv: reuse local variable for gfx state |
| - anv: track render targets & render area changes separately |
| - anv: don't uninitialize bvh_bo_pool is not initialized |
| - anv: uninitialize queues before utrace |
| - anv: move generation shader return instruction to last draw lane |
| - anv: fix generated draws gl_DrawID with more than 8192 indirect draws |
| - anv: extract out draw call generation |
| - anv: identify internal shader in NIR |
| - anv: avoid MI commands to copy draw indirect count |
| - anv: move generation batch fields to a sub-struct |
| - util/glsl2spirv: add ability to pass defines |
| - anv: factor out host/gpu internal shaders interfaces |
| - anv: index indirect data buffer with absolute offset |
| - anv: add ring buffer mode to generated draw optimization |
| - anv: merge gfx9/11 indirect draw generation shaders |
| - anv: document the draw indirect optimization ring mode |
| - anv: fixup 32bit build of internal shaders |
| - anv: fix uninitialized use of compute initialization batch |
| - intel/fs: fix dynamic interpolation mode selection |
| - anv/meson: add missing dependency on the interface header |
| - anv: fix corner case of mutable descriptor pool creation |
| - isl: disable MCS compression on R9G9B9E5 |
| - intel/fs: rerun divergence analysis prior to convert_from_ssa |
| - intel/nir/rt: fix reportIntersection() hitT handling |
| - anv: fix CC_VIEWPORT pointer dirty after blorp/simple-shaders |
| - anv: fix dirty state tracking for 3DSTATE_PUSH_CONSTANT_ALLOC |
| - intel/perf: fix querying of configurations |
| |
| Louis-Francis Ratté-Boulianne (15): |
| |
| - panfrost: Fix error in comment |
| - panfrost: Add methods to determine slice and body alignment |
| - panfrost: Add method to get size of AFBC subblocks |
| - panfrost: Precalculate stride and nr of blocks for AFBC layouts |
| - panfrost: Add panfrost_batch_write_bo |
| - panfrost: Make panfrost_resource_create_with_modifier public |
| - panfrost: Split out internal of \`panfrost_launch_grid` |
| - panfrost: Add infrastructure for internal AFBC compute shaders |
| - panfrost: Add method to get size of AFBC superblocks valid data |
| - panfrost: Add support for AFBC packing |
| - panfrost: Legalize resource when attaching to a batch |
| - panfrost: Don't force constant modifier after converting |
| - panfrost: Add debug flag to force packing of AFBC textures on upload |
| - panfrost: Add some debug utility methods for resources |
| - panfrost: Add env variable for max AFBC packing ratio |
| |
| Lucas Stach (33): |
| |
| - ci/etnaviv: update ci expectation |
| - etnaviv: move resource seqnos to level |
| - etnaviv: flush destination before executing blit |
| - etnaviv: optimize resource copies by skipping clean levels |
| - etnaviv: add helper to mark resource level as flushed |
| - etnaviv: add helper to mark resource level as changed |
| - etnaviv: add helper to transfer resource level age to another |
| - etnaviv: add helper to get TS validity |
| - etnaviv: add helper to set TS validity |
| - etnaviv: move TS meta into etna_resource_level |
| - etnaviv: add tile status buffer status into TS metadata |
| - etnaviv: optimize sampler source update |
| - etnaviv: allow sampler TS even if the resource is flushed |
| - etnaviv: keep blit destination tile status valid if possible |
| - etnaviv: optimize render resource update |
| - etnaviv: optimize transfers when whole resource level is discarded |
| - etnaviv: split etna_copy_resource_box levels parameter in src/dst |
| - etnaviv: don't allocate full resource as transfer staging |
| - etnaviv: check for valid TS as condition to create the staging resource |
| - etnaviv: reword comment about staging resource usage |
| - etnaviv: remove huge outdated comment |
| - etnaviv: move buffer range tracking into the PIPE_MAP_WRITE clause |
| - etnaviv: remove superfluous braces |
| - etnaviv: remove always true assert in etna_transfer_unmap |
| - etnaviv: remove bogus comment about replacing resource storage |
| - etnaviv: initialize VIVS_GL_BUG_FIXES |
| - etnaviv: fix read staging buffer leak |
| - Revert "ci/etnaviv: allow failure on failing test" |
| - mesa: enable NV_texture_barrier in GLES2+ (again) |
| - etnaviv: use correct blit box sizes when copying resource |
| - etnaviv: zero shared TS metadata block |
| - Revert "etnaviv: use correct blit box sizes when copying resource" |
| - mesa: add GL_APPLE_sync support |
| |
| Luigi Santivetti (1): |
| |
| - pvr: do not claim support for ASTC texture compression |
| |
| M Henning (31): |
| |
| - nv50/ir: Drop nir_jump_return handling |
| - nv50/ir: Remove ArgumentMovesPass |
| - nv50/ir: Remove Function.stackPtr |
| - nv50/ir: Remove dead loop from assignSlot |
| - nv50/ir: Remove SpillSlot |
| - nvc0: Keep nir directly in nvc0_program |
| - nv50: Keep nir directly in nv50_program |
| - nouveau: Delete nv50_ir_from_tgsi.cpp |
| - nouveau: Drop tgsi support from nv50_ir_prog_info |
| - nouveau: Drop ConverterCommon::Subroutine |
| - nouveau: Drop BuildUtil::DataArray |
| - nouveau: Drop BuildUtil::Location |
| - nouveau: Delete the nouveau_compiler tool |
| - nv/codegen: Call nir_shader_gather_info |
| - nv/codegen: Implement nir_op_fquantize2f16 |
| - nvk: Remove reference to genUserClip |
| - nv/codegen: Use nir_lower_clip |
| - nv50_ir_from_nir: Use nir's lower_fpow |
| - nv/codegen: Delete OP_POW |
| - nv/codegen: Fix an uninitialized variable warning |
| - nv/codegen: Delete OP_WRSV |
| - nv/codegen: Delete OP_EXP, OP_LOG |
| - nv/codegen: Remove fragCoord variable. |
| - nv/codegen: Merge from_common into from_nir |
| - nv/codegen: Remove unused clipVertexOutput var |
| - nv50_ir_ra: Delete unused functions |
| - nv/codegen: Delete unused OP_CONSTRAINT |
| - nv/codegen: Delete periodicMask32 |
| - nv/codegen: Remove Function::buildDefSets |
| - nv/codegen: Change copy-constructor call to assign |
| - nv/codegen: Delete copy and assign |
| |
| Maaz Mombasawala (2): |
| |
| - svga: Make surfaces shareable at creation. |
| - svga: Unify gmr and mob surface pool managers |
| |
| Marcin Ślusarz (16): |
| |
| - iris: avoid duplicating validation entries |
| - hasvk: remove dead code & comments related to mesh shading |
| - anv: drop support for VK_NV_mesh_shader |
| - intel/compiler: remove NV_mesh_shader support |
| - intel/compiler: remove redundant code |
| - anv: drop unused function |
| - anv: merge cases leading to the same code |
| - intel/compiler/mesh: compactify MUE layout |
| - intel/compiler,anv: put some vertex and primitive data in headers |
| - intel/compiler: load debug mesh compaction options once |
| - intel/compiler/test: fix crashes when TEST_DEBUG is set |
| - intel/compiler: add lsc_msg_desc_wcmask |
| - intel/compiler: add initial support for URB_LOGICAL_SRC_CHANNEL_MASK to lower_urb_write_logical_send_xe2 |
| - intel/compiler/mesh: fix position of output URB handle for xe2 |
| - intel/compiler/mesh: implement IO for xe2 |
| - intel/compiler: mask GS URB handles at thread payload construction |
| |
| Marek Olšák (125): |
| |
| - Revert "ac/nir/ngg: Follow intrinsic sources when analyzing before culling." |
| - glthread: determine global locking once every 64 batches to fix get_time perf |
| - mesa: fix 38% decrease in display list performance of Viewperf2020/NX8_StudioAA |
| - freedreno,lima,zink: update CI fixes and flakes |
| - util/u_queue: fix util_queue_finish deadlock by merging lock and finish_lock |
| - util/u_queue: always enable UTIL_QUEUE_INIT_SCALE_THREADS, remove the flag |
| - radeonsi: fix a CDNA regression breaking compute |
| - glthread: sync for VDPAU sync functions |
| - radeonsi: turn sh_base[PIPE_SHADER_VERTEX] into a constant in emit_draw_packets |
| - radeonsi: restructure the loop for non-indexed multi draws |
| - radeonsi: cosmetic changes to radeon_opt_* macros |
| - radeonsi: handle draw user SGPRs as tracked registers |
| - radeonsi: update obsolete comments about compiler queues |
| - radeonsi: remove si_compute.h, move the contents into si_pipe.h |
| - radeonsi: move si_update/emit_tess_io_layout_state into si_state_shaders.cpp |
| - radeonsi: move si_emit_spi_map into si_state_shaders.cpp |
| - radeonsi: move si_emit_rasterizer_prim_state out of si_emit_all_states |
| - radeonsi: remove splitting IBs that use too much memory |
| - radeonsi: add padding to si_resource to fix Viewperf2020/catiav5test1 perf |
| - radeonsi: remove unused check_mem parameter from si_sampler_view_add_buffer |
| - radeonsi: remove the draw counter with primitive restart from the HUD |
| - radeonsi: always inline si_prefetch_shaders |
| - radeonsi: specialize si_draw_rectangle using a C++ template |
| - radeonsi: add index parameter into si_atom::emit |
| - radeonsi: split direct pm4 emission from si_pm4_emit |
| - radeonsi: move code around si_pm4_emit_state into si_pm4_emit_state |
| - radeonsi: merge pm4 state and atom emit loops into one |
| - radeonsi: add a simple version of si_pm4_emit_state for non-shader states |
| - radeonsi: handle deferred cache flushes as a state (si_atom) |
| - radeonsi: remove render condition logic from si_draw by reordering atoms |
| - radeonsi: abort when failing to upload descriptors instead of skipping draws |
| - radeonsi: rename shader_pointers state -> gfx_shader_pointers |
| - radeonsi: merge si_upload_*_descriptors into si_emit_*_shader_pointers |
| - radeonsi: convert si_gfx_resources_add_all_to_bo_list to a state atom |
| - radeonsi/ci: update gfx11 failures |
| - radeonsi: move GE_CNTL emission from si_draw into si_emit_vgt_pipeline_state |
| - radeonsi: use num_patches_per_workgroup directly in si_get_ia_multi_vgt_param |
| - radeonsi: enable shader culling by default because it helps Viewperf |
| - radeonsi: rewrite how occlusion query precision is determined for performance |
| - radeonsi: set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET on aux_context explicitly |
| - radeon_winsys: move allow_context_lost from cs_create to ctx_create |
| - winsys/amdgpu: rework how SW reset status is generated and reported |
| - radeon_winsys: add a ctx_set_sw_reset_status callback |
| - radeonsi: don't abort for descriptor failures, let the winsys handle it |
| - radeonsi: don't use threadID.yz/blockID.yz for copy_image if those are always 0 |
| - radeonsi: don't use threadID.yz/blockID.yz for compute_blit if they're always 0 |
| - nir: fix constant evaluation of fddx/fddy sourcing Inf & NaN constant |
| - nir/algebraic: collapse ALU opcodes sourcing NaN |
| - ac/gpu_info: add the /dev/dri/ filename into radeon_info |
| - Revert "ac: don't call ac_query_pci_bus_info from ac_query_gpu_info" |
| - ac: implement AMD_FORCE_FAMILY properly, remove SI_FORCE_FAMILY |
| - ac: document ac_shader_args::gs_vtx_offset |
| - ac: minor updates to packet documentation and definitions |
| - ac: change offsets of DMA_DATA dwords to prevent reg offset conflicts |
| - ac: improve the IB parser |
| - ac: update gfx11 shadowed register tables |
| - ac: add a standalone IB parser program |
| - ac/surface: trivial non-functional changes |
| - ac/surface: add radeon_surf::u::gfx9::uses_custom_pitch |
| - radeonsi: allow setting any index in radeon_set_sh_reg_idx |
| - radeonsi: rename uses_subgroup_info to uses_tg_size |
| - radeonsi: improve the heuristic when to use Wave32 for compute shaders |
| - radeonsi: simplify/merge emit_shader_ngg functions |
| - radeonsi: don't pass gl_Layer to PS for blit shaders |
| - radeonsi/gfx11: pass attribute ring addr via SGPR instead of memory for blits |
| - radeonsi: fix templated si_draw_rectangle callback for Navi14 |
| - nir: replace undef only used by ALU opcodes with 0 or NaN |
| - nir: remove nir_op_unpack_64 handling from nir_opt_undef |
| - ac/llvm: don't convert undef to 0 because nir_opt_undef does it now |
| - meson: use llvm-config instead of cmake to fix linking errors with meson 1.2.1 |
| - gallivm: fix build with LLVM 18 |
| - amd/llvm: fix build with LLVM 18 |
| - radeonsi: fix compute-only contexts |
| - ac/llvm: replace removed amdgcn.ldexp for LLVM 18 |
| - ac/perfcounter: remove a bogus assert to fix an assertion failure on gfx11 |
| - ac/llvm: set !fpmath 3.0 for llvm.sqrt |
| - ac/gpu_info: don't align IBs to the GL2 cache line size |
| - ac/llvm: fix flat PS input corruption |
| - amd: rename GFX110x to NAVI31-33 |
| - ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments |
| - ac/gpu_info: pad IBs according to ib_size_alignment |
| - winsys/amdgpu: pad gfx and compute IBs with a single NOP packet |
| - Revert "radeonsi: specialize si_draw_rectangle using a C++ template" |
| - radeonsi/ci: update navi10 results |
| - gallium/util: fix GALLIUM_TESTS=1 by using cso_set_vertex_buffers_and_elements |
| - gallium/util: add more tests for compute-only contexts |
| - radeonsi: add another aux context for uploading shaders |
| - radeonsi: upload shaders via a staging buffer so as not to map VRAM directly |
| - ac/surface: don't require exact pitch for gfx6-8 tiled imports |
| - Revert "ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG" |
| - Revert "radv/amdgpu: fix alignment of command buffers" |
| - Revert "radv: fix alignment of DGC command buffers" |
| - Revert "winsys/amdgpu: pad gfx and compute IBs with a single NOP packet" |
| - Revert "ac/gpu_info: pad IBs according to ib_size_alignment" |
| - Revert "ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments" |
| - nir: sort variables by location in nir_lower_io_passes to work around a bug |
| - nir: recompute IO bases after DCE in nir_lower_io_passes |
| - nir: add dual-slot input information into load_input intrinsics |
| - nir: take dual slot input info into account when computing IO driver locations |
| - nir: gather dual slot input information |
| - nir: expose reusable linking helpers for cloning uniform loads |
| - nir: handle nir_var_mem_ubo in nir_clone_uniform_variable |
| - ac/gpu_info: split ib_alignment as ip[type].ib_alignment |
| - ac/gpu_info: move ib_pad_dw_mask into ip[] |
| - ac/gpu_info: drop the hack unifying all IB alignments |
| - ac/gpu_info: conservatively decrease IB alignment and padding to 256B |
| - ac/gpu_info: set gfx and compute IB padding to only 8 dwords |
| - winsys/amdgpu: properly pad the IB in amdgpu_submit_gfx_nop |
| - winsys/amdgpu: correctly pad noop IBs for RADEON_NOOP=1 |
| - winsys/amdgpu: pad gfx and compute IBs with only 1 NOP |
| - ac/gpu_info: don't allow register shadowing with SR-IOV due to bad performance |
| - radeonsi: disable register shadowing without SR-IOV to fix bad performance |
| - winsys/amdgpu: don't send CP_GFX_SHADOW chunk if shadow address is not set |
| - radeonsi/ci: update gfx1100 results |
| - nir: split FLOAT_CONTROLS_SIGNED_ZERO_INF_NAN_PRESERVE_FP* flags |
| - nir/algebraic: use only signed_zero_preserve_* for addition by 0 patterns, etc. |
| - mesa: don't pass Infs to the shader via gl_Fog.scale |
| - radeonsi/ci: update the runner for new build scripts |
| - radeonsi/ci: enable GTF tests in the runner |
| - radeonsi/ci: enable GLES CTS in the runner |
| - radeonsi/ci: update failures and flakes |
| - amd/common: update DCC for gfx11.5 |
| - radeonsi: initialize perfetto in the right place |
| - radeonsi/gfx11: don't set OREO_MODE to fix rare corruption |
| - nir: fix gathering TESS_LEVEL_INNER/OUTER usage with lowered IO |
| |
| Marek Vasut (1): |
| |
| - etnaviv: Fully replicate back stencil config |
| |
| Mark Collins (10): |
| |
| - tu/a7xx: Adapt r3d blits for A7xx |
| - freedreno/rnn: Remove %n usage in fprintf |
| - freedreno: Only add drm/computerator when system_has_kms_drm |
| - freedreno/decode: Support building replay for multiple KMDs |
| - freedreno+meson: Add lua+libarchive+libxml from Meson WrapDB |
| - meson: Warn about side-effects from DRM for FD KMDs |
| - meson: Update libarchive to v3.7.2-2 |
| - freedreno/common: Add max_sets property to A6xxGPUInfo |
| - tu: Support higher descriptor set count for A7XX |
| - tu,util/driconf: Add option to not reserve descriptor set |
| |
| Mark Janes (1): |
| |
| - intel: allow reduced memory usage for INTEL_MEASURE |
| |
| Martin Roukala (né Peres) (22): |
| |
| - radv/ci: drop the auto-reboot-on-hang for vkcts-navi10 |
| - radv/ci: use the default kernel on vkcts-navi10 |
| - zink/ci: automatically reboot when hitting a kernel BUG on vangogh |
| - zink/ci: document more flakes seen on vangogh |
| - radv/ci: move vkcts-navi10 testing to KWS |
| - radv/ci: add more tests to the navi10 vkcts flake list |
| - radv/ci: increase the parallelism of the vkcts-navi21 job |
| - radv/ci: add more tests to the navi21 vkcts flake list |
| - radv/ci/vkcts-navi21: catch all the line_stipple_(enable|params) flakes |
| - radv/ci/vkcts-navi21: document more flakes |
| - radv/ci/vkcts-navi10: catch all the line-related flakes |
| - radv/ci: update the vkcts gfx1100 flake/fail lists |
| - radv/ci: add a manual job to run vkcts on navi31 |
| - radv/ci: add a manual job for vkd3d-proton on navi31 |
| - ci/vkcts-vangogh: mark dEQP-VK.dynamic_rendering.primary_cmd_buff.basic.* as flake |
| - ci/vkcts-navi21: mark more of the RT handles checks as flakes |
| - ci: make B2C_JOB_VOLUME_EXCLUSIONS to all .b2c-test jobs |
| - zink/ci: remove 19 tests from the zink-radv-polaris10-fails list |
| - ci/b2c: switch containers to a back-up ahead of valve-infra renaming |
| - zink/ci: remove 42 tests from the zink-radv-polaris10-fails list |
| - radv/ci: tighten the vkcts-navi21 timeouts |
| - zink/ci: tighten the zink-radv-vangogh timeouts |
| |
| Martin Stransky (1): |
| |
| - llvmpipe: fix UAF in lp_scene_is_resource_referenced. |
| |
| Mary (6): |
| |
| - nouveau/mme: Add initial Fermi definition |
| - nouveau/mme: Add Fermi builder |
| - nouveau/mme: Add Fermi simulator |
| - nouveau/mme: Add Fermi hardware tests |
| - agx: Move nir_lower_fragcolor out of agx_preprocess_nir |
| - agx: Ensure to lower 1D image load/store to 2D |
| |
| Mary Guillemard (4): |
| |
| - nir: Add NVIDIA-specific geometry shader opcodes |
| - venus: skip bind sparse info when checking for feedback query |
| - zink: Check for VK_EXT_extended_dynamic_state3 before setting A2C |
| - venus: Do not submit batch manually when no feedback is required |
| |
| Matt Coster (21): |
| |
| - pvr: Pad rogue_regarray_cache_key union members to avoid UB |
| - pvr: Clean up extension tables |
| - pvr: Refactor pvr_GetPhysicalDeviceProperties2() |
| - docs: Fixup imagination/pvr extension support |
| - pvr: Add VK_KHR_get_display_properties2 |
| - pvr: Add VK_KHR_get_memory_requirements2 |
| - pvr: Add VK_KHR_get_surface_capabilities2 |
| - pvr: Print VkStructureType name on pvr_debug_ignored_stype() |
| - pvr: Add VK_KHR_copy_commands2 |
| - pvr: Don't override commands copied to new buffer when extending cs |
| - pvr: Do not require TA_STATE_HEADER.pres_ispctl_dbsc for {db,sc}enable |
| - pvr: Zero tail of cs buffers after linking when dumping cs |
| - pvr: Cleanup comments in pvr_physical_device_get_supported_*() |
| - pvr: Don't rely on GNU void pointer arithmetic |
| - pvr: Force compile error on GNU void pointer arithmetic |
| - pvr: Switch to common pipeline cache implementation |
| - pvr: Use vk_sampler base |
| - pvr: Clean up & fix sampler border color support |
| - pvr: Don't pass pvr_physical_device when only device info is needed |
| - pvr: Minor refactor of pvr_device.c |
| - pvr: Use common physical device properties |
| |
| Matt Turner (10): |
| |
| - Revert "intel/fs: only avoid SIMD32 if strictly inferior in throughput" |
| - intel: Rearrange for next commit |
| - intel: Consider with_intel_clc in with_any_intel |
| - intel: Only build blorp if drivers are enabled |
| - intel: Only build ds if drivers are enabled |
| - intel: Only build perf if drivers or tools are enabled |
| - intel: Allow using intel_clc from the system |
| - intel: Limit Intel Vulkan RT to x86_64 |
| - r600: Add missing dep on git_sha1.h |
| - util: Include stdint.h in libdrm.h |
| |
| Mauro Rossi (7): |
| |
| - nouveau/ws: fix building error in nouveau_ws_push_dump() |
| - vulkan/meta: fix gnu-empty-initializer build error |
| - nouveau/mme: fix print inst for case MME_FERMI_OP_MERGE |
| - anv/android: remove numFds check |
| - hasvk/android: remove numFds check |
| - Android.mk: filter out cflags to build with Android 14 bundled clang |
| - Android.mk: disable android-libbacktrace to build with Android 14 |
| |
| Mike Blumenkrantz (293): |
| |
| - ci: bump VVL to 1.3.257 |
| - zink: set pipeline dynamic state count after all dynamic states are set |
| - zink: set feedback attachments on batch init |
| - zink: be even dumber about buffer refs when replacing storage |
| - zink: emit SpvCapabilitySampleMaskPostDepthCoverage with SpvExecutionModePostDepthCoverage |
| - zink: fix the fix for separate shader program refcounting |
| - kopper: handle pixmap creation failure more gracefully |
| - glxsw: check geometry of drawables on creation |
| - kopper: move pixmap param for drawable creation to info struct |
| - glx/dri3: split out modifier check |
| - glx/sw: check for modifier support in the kopper path |
| - kopper: pass modifier availability to drawable creation |
| - kopper: determine modifier support per-drawable |
| - zink: don't clobber descriptor mode on multiple screen creation |
| - nir: fix slot calculations for compact variables with location_frac |
| - lavapipe: use the component offset directly for xfb |
| - nir: add a helper for calculating variable slots |
| - radv: bump max xfb output to 128 |
| - ir3: bump max xfb output to 128 |
| - gallium: bump PIPE_MAX_SO_OUTPUTS to 128 |
| - zink: add feedback loop exts to optimal profile |
| - glsl: only explicitly check GS components in PSIZ injection with output variables |
| - lavapipe: statically allocate fb attachment array |
| - lavapipe: zero fb attachment array at rp start |
| - lavapipe: don't check geometry for fb attachments |
| - lavapipe: be slightly more permissive for bad apps (and cts) with dynrender |
| - lavapipe: VK_EXT_host_image_copy |
| - zink: better handle separate shader dsl creation when no bindings exist |
| - zink: force image barriers after dmabuf import |
| - ci: bump VVL to 1.3.261 |
| - zink: use VK_WHOLE_SIZE when binding null db buffer descriptors |
| - zink: unset line stipple ds3 state flags when stipple not available |
| - nir/lower_io_to_scalar: fix 64bit io splitting |
| - nir/linking_helpers: force type matching in does_varying_match |
| - nir/print: print location names for (some) tess slots |
| - nir/print: always group variables by type when printing |
| - zink: add batch refs for transient images |
| - zink: fix zs resolve attachment indexing |
| - zink: don't add VK_IMAGE_USAGE_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT for transient images |
| - zink: don't append msrtss to dynamic render if not supported |
| - zink: set msrtss depth resolve mode when enabled |
| - zink: hook up VK_KHR_workgroup_memory_explicit_layout |
| - zink: propagate have_workgroup_memory_explicit_layout to ntv |
| - zink: use SPV_KHR_workgroup_memory_explicit_layout when available |
| - zink: add more locking for pipeline cache |
| - zink: add VK_PIPELINE_CACHE_CREATE_EXTERNALLY_SYNCHRONIZED_BIT_EXT |
| - aux/trace: fix winsys handle dumping |
| - zink: generated tcs is on the tes, not the vs |
| - zink: apply ZINK_DEBUG=noopt to linked separate shaders |
| - gallivm: handle A8_UNORM image stores |
| - llvmpipe: enable A8_UNORM for shader images |
| - llvmpipe: export PIPE_CAP_IMAGE_LOAD_FORMATTED |
| - lavapipe: GetRenderingAreaGranularityKHR |
| - llvmpipe: block weird uses of subsampled formats in buffers |
| - llvmpipe: fix early depth + alpha2coverage + occlusion query interaction |
| - lavapipe: fix BindVertexBuffers2 buffer size handling |
| - lavapipe: fix resolves where src image has a layer offset |
| - lavapipe: block yuv formats from getting blit feature flags |
| - lavapipe: BindIndexBuffer2 |
| - lavapipe: GetDeviceImageSubresourceLayoutKHR |
| - lavapipe: VK_REMAINING_ARRAY_LAYERS for copy ops |
| - lavapipe: maintenance5 |
| - zink: fix xfb buffer array sizing to use buffer limit, not output |
| - zink: move ZINK_DEBUG=nir printing to just before compile |
| - draw: fix so debug offset printing |
| - zink: reindex ssa defs before dumping debug shaders |
| - lavapipe: zero-init pipe_sampler_state |
| - zink: explicitly set non-optimal last_vertex_stage shader key on ctx create |
| - zink: fix big tcs output io |
| - zink: don't try to replace separate shader prog in noopt mode |
| - zink: pre-convert mode in fixup_io_locations |
| - zink: add a special separate shader i/o mode for legacy variables |
| - nir: minor fixes for io_to_scalar |
| - nir/lower_io: add a new doubles-only 64bit lowering option |
| - nir: add a filter cb to lower_io_to_scalar |
| - d3d10umd: use cso_context to set vertex buffers and elements |
| - virgl: move virgl_vertex_elements_state to header |
| - virgl: fix some indentation |
| - nouveau: calloc vertex csos |
| - gallium: move vertex stride to CSO |
| - zink: fix null config screen creation |
| - zink: fix crash in lower_pv_mode_gs_store |
| - u/draw: skip zero-sized indirect draws |
| - lavapipe: handle VkPipelineCreateFlagBits2KHR |
| - lavapipe: handle VkBufferUsageFlags2KHR |
| - zink: ci updates |
| - zink: track start/stop of a couple query types |
| - zink: require EDS1 for CWE usage |
| - zink: unset primgen suspended flag when ending a primgen query |
| - zink: rework rast-discard for primgen queries |
| - zink: rip out some awkward parts of the old non-cwe path |
| - zink: drop CWE requirement for renderpass tracking with primgen queries |
| - nir/zink: fix gs emulation xfb_info sizing |
| - zink: move fragcolor lowering further along the compile process |
| - zink: add a mode param to find_var_with_location_frac |
| - zink: use lowered io (kinda) for i/o vars |
| - zink: stop lowering indirect derefs |
| - ntt: handle interp intrinsics as derefs |
| - zink: delete split_blocks pass |
| - zink: delete lower_64bit_vertex_attribs pass |
| - zink: fix clip/cull dist xfb inlining |
| - zink: delete all the extra gross xfb handling |
| - zink: stop using pipe_stream_output |
| - zink: remove pipe_stream_output from function params |
| - zink: ci updates |
| - aux/trace: print bindless handles as pointers |
| - zink: remove unused param from create_ici |
| - zink: split create_ici to init and eval |
| - zink: add maintenance extensions to profile |
| - zink: use maintenance5 |
| - zink: use real A8_UNORM when possible |
| - vk/graphics: fix CWE handling with DS3 |
| - Revert "vk/wsi/x11: handle geometry updating more asynchronously" |
| - r600: store the mask of buffers used by a vertex state |
| - r600: better tracking for vertex buffer emission |
| - zink: wait on async fence during ctx program removal |
| - zink: handle patch variable locations for separate shaders better |
| - zink: don't start multiple cache jobs for the same program |
| - zink: use the "set" optimal key for prog last_variant_hash for consistency |
| - zink: sanitize optimal keys |
| - zink: copy some cs shader properties to the program struct |
| - zink: handle global atomic intrinsics |
| - zink: use Aligned with global load/store ops |
| - zink: fix rewrite_read_as_0 filtering |
| - rusticl: fixes for zink shader images |
| - zink: pass KERNEL shaders through successfully |
| - zink: add a618 flake |
| - zink: break out ds3 state resetting |
| - zink: be consistent with ds3 state resetting for blits |
| - zink: fix optimal_keys warning message |
| - zink: force-reset unordered flags for buffer barriers on non-matching batch access |
| - zink: reset unordered flags for image barriers on non-matching batch access |
| - zink: make image barrier init functions void return |
| - zink: simplify some image barrier conditionals |
| - zink: remove sync TODO |
| - zink: add lavapipe flake |
| - ci: disable nouveau shaderdb |
| - egl/dri3: only set driver_name if not already set |
| - egl: call dri3_x11_connect() for zink |
| - egl: bind dri2_set_WL_bind_wayland_display for zink when necessary |
| - zink: be more precise about flagging rp changes around unordered u_blitter |
| - zink: don't block reordering during ref updates in unordered blits |
| - lavapipe: update vbo indices before propagating stride |
| - lavapipe: fix pipeline stride propagation |
| - zink: fix linear modifier dmabuf imports |
| - zink: polaris ci updates |
| - aux/tc: handle stride mismatch during rp-optimized subdata |
| - zink: always add a per-prog ref for gpl libs |
| - zink: use a pointer to simplify submit struct mechanics |
| - zink: make zink_resource_image_barrier2_init public |
| - zink: add a third submitinfo (unused for now) |
| - zink: make submitinfo handling easier to manage with enum |
| - zink: add another submitinfo for fd semaphore waits |
| - zink: add a screen cache for fd semaphores |
| - zink: add a util for getting cached fd semaphores |
| - zink: hook up cached fd semaphore usage for batch signal/waits |
| - zink: handle implicit sync for dmabufs |
| - zink: handle multi-plane implicit sync |
| - zink: ci updates |
| - zink: set is_xfb=false for all i/o variables |
| - zink: reorder bindless io lowering |
| - zink: fix typing on bindless io lowering |
| - zink: delete some bindless io lowering code |
| - zink: use nir_io_semantics::num_slots for indirect var creation |
| - zink: simplify an arrayed io check during variable creation |
| - zink: use explicit stride from types instead of copying old_var stride |
| - zink: use MAX_PATCH_VERTICES directly for arrayed io var sizing |
| - zink: use explicit sizing for builtins when creating variables |
| - zink: create new vars without copying existing ones |
| - zink: add a new linker pass to handle mismatched i/o components |
| - zink: use right function to get src_type in eliminate_io_wrmasks |
| - zink: re-rework i/o variable handling to make having variables entirely optional |
| - ci: bump VVL to 1.3.263 |
| - zink: simplify redundant is_buffer check |
| - zink: use VkFormatProperties3 |
| - lavapipe: handle VkHostImageCopyDevicePerformanceQueryEXT |
| - lavapipe: don't advertise UNDEFINED layout for HIC |
| - zink: hook up VK_EXT_host_image_copy |
| - zink: move mem type detection up in file |
| - zink: disable HIC without resizable BAR |
| - zink: add a fixup method for extra driver props |
| - zink: fix some off-by-one indentation |
| - zink: use some return codes for check_ici errors |
| - zink: check/use suboptimal HIC during ici init |
| - zink: use HIC for image subdata when possible |
| - zink: slightly refactor psiz deletion during linking |
| - zink: delete all psiz=1.0 stores if maintenance5 is present |
| - nir/inline_uniforms: fix oob access with nir_find_inlinable_uniforms |
| - zink: add ZINK_DEBUG=quiet |
| - zink: imply ZINK_DEBUG=quiet if ZINK_DEBUG=optimal_keys is set on turnip |
| - zink: set optimal_keys for turnip jobs |
| - aux/tc: fix staging buffer sizing for texture_subdata |
| - aux/tc: fix address calc for segmented texture subdata |
| - zink: ci updates |
| - lavapipe: KHR_map_memory2 |
| - zink: slightly refactor pipeline compile selection |
| - zink: add a flag for combined pipeline compile for doing FAIL_ON_PIPELINE_COMPILE_REQUIRED |
| - zink: remove an intermediate variable in pipeline compile selection |
| - zink: use FAIL_ON_PIPELINE_COMPILE_REQUIRED for GPL path |
| - zink: pass a stage mask to pipeline create functions |
| - glsl: check for xfb setting xfb info |
| - zink: don't warn about missing scalarBlockLayout on v3dv |
| - aux/tc: fix renderpass tracking fb state clobber scenario |
| - vk/enum2str: add more max enum vendors |
| - aux/tc: fix rp info handling around tc_sync calls |
| - aux/tc: don't use pipe_buffer_create_with_data() for rp-optimized subdata |
| - zink: flag db maps as unsynchronized |
| - lavapipe: clamp cache uuid size |
| - lavapipe: EXT_load_store_op_none |
| - tu: handle unused color attachments without crashing |
| - zink: use much bigger dummy surfaces |
| - zink: propagate rp_tc_info_updated across unordered blits |
| - zink: use null attachments for null attachments with dynamic render |
| - egl/swrast: expose EXT_swap_buffers_with_damage and EXT_present_opaque |
| - egl/wayland: split out wl drm extension init |
| - egl/wayland: use more registry listeners to better handle device init |
| - egl/wayland: enable WL_bind_wayland_display for zink |
| - zink: delete injected pointsize during shader creation |
| - zink: require maintenance5 for shobj |
| - zink: delete a non-maintenance5 workaround for shobj use |
| - lavapipe: set separate_shaders for shader objects |
| - zink: set workgroup_memory_explicit_layout for shader validation |
| - zink: add a ZINK_DEBUG=validation alias |
| - zink: fix semaphore signal ordering |
| - zink: move swapchain fence to swapchain object |
| - zink: avoid UAF on wayland async present with to-be-retired swapchain |
| - zink: always trace_screen_unwrap in acquire |
| - lavapipe: fix variable descriptor count support handling |
| - lavapipe: always set independent blend |
| - lavapipe: more vertex stride fixups |
| - lavapipe: set default viewport and scissor count for cmdbufs |
| - lavapipe: set default min sample shading to 1 |
| - glx: XFree visual info |
| - radv: fix external handle type queries for dmabuf/fd |
| - zink: fix crashing in image rebinds |
| - zink: move push descriptor disable to driver workarounds |
| - zink: move v3dv scalarBlockLayout workaround |
| - zink: fix end-of-batch barrier pipeline stages |
| - zink: guarantee egl syncobj lifetime |
| - aux/trace: dump enum names for map usage |
| - gallium: add PIPE_MAP_NONE |
| - Revert "egl/wayland: Add image loader extension for swrast" |
| - egl/wayland: don't block in swrast when updating buffers for zink |
| - egl/wayland: return sooner from swrast_update_buffers() if zink |
| - zink: don't check submit count for unflushed usage |
| - egl: don't set ForceSoftware for all zink loading |
| - zink: error at handle export on missing EXT_image_drm_format_modifier |
| - gbm: delete some zink handling |
| - zink: apply ZINK_DEBUG=quiet to all missing feature warnings |
| - zink: set ZINK_DEBUG=quiet for polaris jobs |
| - lavapipe: don't block begin/end cmdbuf pipeline barriers |
| - ci: add a630 trace flakes |
| - zink: shrink vectors during optimization |
| - zink: always clamp shader stage in descriptor handling |
| - zink: add set_global_binding |
| - zink: eliminate samplers from no-sampler CL texops |
| - zink: add some checks to determine whether queue is init on screen destroy |
| - zink: don't destroy any simple_mtx_t objects during screen destroy |
| - zink: don't destroy uninitialized disk cache thread |
| - zink: reorder glsl_type_singleton_init_or_ref call |
| - zink: use screen destructor for creation fails |
| - zink: fix readback_present locking |
| - zink: add automatic swapchain readback using heuristics |
| - lavapipe: VK_EXT_nested_command_buffer |
| - zink: ignore unacquired swapchain images during end-of-frame flush |
| - nir/lower_fragcolor: preserve location_frac |
| - zink: update pointer for GPL pipeline cache entry formats |
| - zink: fix legacy depth texture rewriting for single component reads |
| - egl: unify dri2_egl_display creation |
| - egl: init dri3 version info during screen creation |
| - egl/glx: don't load non-sw zink without dri3 support |
| - egl: add automatic zink fallback loading between hw and sw drivers |
| - glx: add automatic zink fallback loading between hw and sw drivers |
| - ci: don't set GALLIUM_DRIVER for zink |
| - egl/wayland: only add more registry listeners for hardware devices |
| - zink: only increment image_rebind_counter on image export if binds exist |
| - zink: check for sampler view existence during zink_rebind_all_images() |
| - zink: use weston for anv ci |
| - zink: blow up broken xservers more reliably |
| - zink: delete some dead modifier handling |
| - ci: skip implicit modifier piglits for zink |
| - zink: don't block large vram allocations |
| - zink: add copy box locking |
| - zink: emit SpvCapabilitySampleRateShading with SampleId |
| - zink: always set VK_EXTERNAL_MEMORY_HANDLE_TYPE_HOST_ALLOCATION_BIT_EXT for usermem |
| - zink: clamp resolve extents to src/dst geometry |
| - zink: only emit xfb execution mode for last vertex stage |
| - aux/u_transfer_helper: set rendertarget bind for msaa staging resource |
| - zink: unset explicit_xfb_buffer for non-xfb shaders |
| - mesa/st/texture: match width+height for texture downloads of cube textures |
| - zink: add more locking for compute pipelines |
| - radv: correctly return oom from the device when failing to create a cs |
| - zink: check for cbuf0 writes before setting A2C |
| |
| Mohamed Ahmed (19): |
| |
| - vulkan/util: Support 10-bit and 12-bit color formats in ycbcr_info in vk_format.c |
| - vulkan/util: Support VK_EXT_ycbcr_2plane_444_formats color formats in vk_format.c |
| - vulkan/util: Use ycbcr_info for multiplane helpers in vk_format.c |
| - nvk: implement vkGetDeviceImageMemoryRequirementsKHR() |
| - nvk: add stub for vkGetDeviceImageSparseMemoryRequirementsKHR() |
| - nvk: implement vkGetDeviceBufferMemoryRequirementsKHR() |
| - nvk: advertise VK_KHR_maintenance4 |
| - nvk: advertise DemoteToHelperInvocation |
| - nvk: Enable multiplane images and image views |
| - nouveau/nvk: Add YCbCr sampler NIR lowering pass |
| - nouveau/nvk: Support multi-plane descriptors in nvk_nir_lower_descriptors.c |
| - nouveau/nvk: Create helper function for sampler creation |
| - nouveau/nvk: Add multiple sampler planes for CONVERSION_SEPARATE_RECONSTRUCTION_FILTER_BIT |
| - nouveau/nvk: Enable VK_KHR_sampler_ycbcr |
| - util/format: Add G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM formats |
| - vulkan/format: Translate G8B8G8R8_422_UNORM and B8G8R8G8_422_UNORM properly |
| - nvk: Enable SEPARATE_RECONSTRUCTION_FILTER_BIT for multi-planar formats only |
| - nvk: Enable MIDPOINT_CHROMA_SAMPLES_BIT for multi-planar formats only |
| - nil: Add support for G8B8_G8R8_UNORM and B8G8_R8G8_UNORM |
| |
| Nanley Chery (33): |
| |
| - iris: Remap DRM_FORMAT_MOD_INVALID more often during import |
| - anv: Don't support ASTC images with modifiers |
| - intel: Add and use isl_drm_modifier_get_plane_count |
| - anv: Handle explicit surface layout of DG2_RC_CCS |
| - anv: Reduce accesses of isl_mod_info->aux_usage |
| - iris: Reduce accesses of mod_info->aux_usage |
| - crocus: Delete modifier with aux code |
| - hasvk: Delete modifier with aux code |
| - iris: Swap stencil and modifier aux assignment order |
| - intel: Describe modifier compression with booleans |
| - intel/isl: Move the Tile4 modifier score case down |
| - intel/isl: Add a score for DG2_RC_CCS |
| - intel/blorp: Ambiguate after CCS resolves on gfx7-8 |
| - iris: Reorder render_aux_usage parameters |
| - iris: Pass the render format to prepare_render |
| - iris: Create BLORP surfaces after resource preparation |
| - iris: Handle clear color compatibility in prepare_render |
| - iris: Sample more texture view fast-clears on gfx11+ |
| - iris: Fix aux usage tracking in prepare_render |
| - iris: Fix iris_copy_region calls involving FCV_CCS_E |
| - iris: Drop get_copy_region_aux_settings |
| - iris: Inline iris_can_sample_mcs_with_clear |
| - anv: Initialize the clear color more often for FCV |
| - intel: Return a bool from intel_aux_map_add_mapping |
| - anv: Move scope of CCS binding determination |
| - anv: Allocate space for aux-map CCS in image bindings |
| - anv: Wrap aux surface image binding queries |
| - anv: Refactor CCS disabling at image bind time |
| - anv: Place images into the aux-map when safe to do so |
| - anv: Loosen anv_bo_allows_aux_map |
| - anv: Meet CCS alignment reqs with dedicated allocs |
| - anv: Delete implicit CCS code |
| - intel/isl: Add scores for GEN12_RC_CCS and MTL_RC_CCS |
| |
| Neal Gompa (1): |
| |
| - asahi: Fix 32-bit x86 build with correct data type for overflow error message |
| |
| Neha Bhende (1): |
| |
| - ntt: lower indirect tesslevels in ntt |
| |
| Paul Gofman (2): |
| |
| - driconf: add a workaround for Captain Lycop: Invasion of the Heters |
| - driconf: add a workaround for Rainbow Six Extraction |
| |
| Paulo Zanoni (15): |
| |
| - anv: rename the vm_bind vfuncs |
| - anv: add a new vm_bind vfunc |
| - anv/xe: make vm_binds async |
| - anv/xe: return failure in case waiting for the vm_bind syncobj fails |
| - anv: remove misleading comment about batch_len |
| - iris: assert bufmgr->bo_deps_lock is held |
| - iris: avoid stack overflow in iris_bo_wait_syncobj() |
| - iris: assert(bo->deps) after realloc() |
| - intel/isl: add ISL_SURF_USAGE_SPARSE_BIT |
| - intel/isl: simplify the check for maximum surface size |
| - anv/sparse: add the initial code for Sparse Resources |
| - anv/sparse: get ready to issue a single vm_bind ioctl per non-opaque bind |
| - anv/sparse: add INTEL_DEBUG=sparse |
| - anv: enable sparse resources by default |
| - vulkan: fix potential memory leak in create_rect_list_pipeline() |
| |
| Pavel Ondračka (44): |
| |
| - r300: update RV370 failures |
| - r300: check for index overflow when translating from TGSI |
| - r300: source register index is always unsigned |
| - r300: bump the RC_MAX_INDEX_BITS |
| - r300: normal instruction can't have presubtract op |
| - r300: add a helper for checking number of temporary sources |
| - r300: cycles estimate for shader-db |
| - r300: fix cycles calculation |
| - r300: don't abort on flow control when using draw for vs |
| - r300: add dEQP baseline for RV370 with forced swtcl |
| - r300: copy ntt to r300 compiler |
| - r300: add lower_sqrt to nir option |
| - r300: remove unused intrinsics in ntr |
| - r300: remove irrelevant opcodes in ntr |
| - r300: remove unused integer support in ntr |
| - r300: remove ntr_tgsi_usage_mask |
| - r300: remove more unused 64-bit pieces from ntr |
| - r300: simplify vectorization rules |
| - r300: remove more ntr unused helpers |
| - r300: remove the unneeded ntr_lower_vec_to_reg callback |
| - r300: remove unneeded 64bit and atomic lowering passes |
| - r300: remove unused ntr default settings |
| - r300: remove ntr default options |
| - r300: simplify ntr_emit_load_ubo |
| - r300: simplify ntr_emit_load_input |
| - r300: remove some virglrenderer specifics from ntr |
| - r300: simplify ntr_setup_uniforms |
| - r300: simplify ntr_output_decl |
| - r300: simplify ntr_try_store_in_tgsi_output |
| - r300: remove some unsupported texture opcodes |
| - r300: remove unused barrier code from ntr |
| - r300: simplify ntr_get_gl_varying_semantic |
| - r300: remove the nrt main optimization loop |
| - r300: reorder for easier presubtract 1-x pattern recognition |
| - r300: exit early in presubtract is not supported |
| - r300: implement bias presubtract |
| - r300: convert x * 2 into x + x for presubtract |
| - r300: move power of two multipliers down |
| - r300: there is no limitation on presubtract source file |
| - r300: use w channel for scalar opcodes if possible |
| - r300: reduce number of iterations for vertex shader loops |
| - r300: enable nir_move_vec_src_uses_to_dest |
| - nir/move_vec_src_uses_to_dest: skip reuse if vec is used only once in store_output |
| - nir/move_vec_src_uses_to_dest: allow to skip reuse of constant sources |
| |
| Philipp Zabel (1): |
| |
| - etnaviv: fix segfault after compile failure |
| |
| Pierre-Eric Pelloux-Prayer (18): |
| |
| - radeonsi/sdma: use multiple commands if required |
| - radv/sdma: use multiple commands if required |
| - radv/sdma: use correct limits for gfx10.3 |
| - glx: drop the 'libGL' log prefix |
| - loader: refactor DRI_PRIME handling code |
| - loader: extend DRI_PRIME to support =N |
| - loader: add DRI_PRIME_DEBUG env var |
| - device_select_layer: support DRI_PRIME=n |
| - docs: update DRI_PRIME documentation |
| - device_select: add shortcut for MESA_VK_DEVICE_SELECT_FORCE_DEFAULT_DEVICE |
| - st/mesa: check renderbuffer before using it |
| - radeonsi: emit framebuffer state after allocating cmask |
| - amd/common: update addrlib for gfx11.5 |
| - amd/common: add registers for gfx11.5 |
| - ac/nir: extract must_wait_attr_ring helper |
| - amd, radeonsi: Add code to enable gfx11.5 |
| - mesa: restore call to _mesa_set_varying_vp_inputs from set_vertex_processing_mode |
| - radeonsi: check sctx->tess_rings is valid before using it |
| |
| Piotr Kocia (2): |
| |
| - nir: Remove dead nir_const_value variables |
| - glsl: ir_function_param_visitor::visit_enter always true condition |
| |
| Qiang Yu (77): |
| |
| - aco,radv: replace tess_input_vertices shader info param |
| - radeonsi: aco does not pass LS outputs to HS by arg |
| - radeonsi: extract si_get_prev_stage_nir_shader to be shared with aco |
| - radeonsi: init aco shader info for merged LS/HS |
| - radeonsi: simplify si_build_wrapper_function |
| - radeonsi: move vertex shader vb desc input sgpr args to last |
| - radeonsi: remove param type check in wrapper function |
| - radeonsi: refine si_llvm_ls_build_end |
| - radeonsi: refine si_llvm_es_build_end |
| - radeonsi: aco compile support merged mono shader |
| - radeonsi: calculate lds size for merged shaders |
| - radeonsi: enable aco compile for mono merged LS/HS |
| - radeonsi: enable aco compile for mono merged ES/GS |
| - aco: extract aco_compile_shader_part from aco_compile_ps_epilog |
| - aco: add p_end_with_regs pseudo instruction |
| - aco: move jump to epilog out of ic_merged_wave_info |
| - aco: add tcs end regs for epilog usage |
| - aco: allow tcs with epilog to keep nir store output instruction |
| - aco: add pending_lds_access option for insert waitcnt |
| - aco: add tcs epilog generation for radeonsi |
| - aco: don't emit s_endpgm for tcs with epilog |
| - aco: skip scratch init when no scratch arg provide |
| - aco,radeonsi: save const addr to symbol |
| - ac/nir/tess: move tess factor output out of control flow |
| - aco: use semantic location as io temp index |
| - radeonsi: add exec_size to shader binary |
| - radeonsi: support upload multi part shader binary |
| - radeonsi: share si_get_tcs_out_patch_stride with aco |
| - radeonsi: fill part mode tcs aco shader info |
| - radeonsi: extract si_llvm_build_shader_part |
| - radeonsi: remove separate_prolog arg from prolog/epilog build |
| - radeonsi: add si_get_tcs_epilog_args |
| - radeonsi: change si_fill_aco_options args |
| - radeonsi: add si_aco_build_shader_part |
| - radeonsi: part mode standalone tcs support aco compile |
| - radeonsi: remove unused arg of get_tcs_tes_buffer_address |
| - aco: simplify setup_tcs_info |
| - aco: pass sw_stage when setup_isel_context |
| - aco: prepare fix_ls_vgpr_init_bug to be used by gl vs prolog |
| - aco: add vs prolog instruction selection for radeonsi |
| - aco: add aco compile interface for radeonsi vs prolog |
| - aco: do not fix_exports when program is prolog |
| - radeonsi: fill aco_shader_info->is_monolithic |
| - radeonsi: remove is_monolithic from vs prolog key |
| - radeonsi: extract si_get_vs_prolog_args to be shared with aco |
| - radeonsi: fix aco options has_ls_vgpr_init_bug setup |
| - radeonsi: add vs prolog aco build |
| - radeonsi: set vs has prolog aco shader info |
| - radeonsi: enable aco compile for part mode standalone vs |
| - aco,radv,radeonsi: rename is_monolithic to merged_shader_compiled_separately |
| - ac,radeonsi: move ps arg pos_fixed_pt to ac_shader_args |
| - aco: do not eliminate final exec write when p_end_with_regs block |
| - aco: remove p_end_with_regs from needs_exact() |
| - aco: add ps prolog generation for radeonsi |
| - aco: handle ps outputs from radeonsi |
| - aco: add create_fs_end_for_epilog for radeonsi |
| - aco,radv: remove unused ps epilog info fields |
| - aco,radv: rename ps epilog info inputs to colors |
| - aco: simplify export_fs_mrt_color |
| - aco,radv: add radeonsi spec ps epilog code |
| - aco: compact ps expilog color export for radeonsi |
| - aco,radv,radeonsi: pass spi ps input ena and addr |
| - aco: do not fix_exports when program has epilog |
| - aco: fix assertion fail when program contains empty block |
| - aco: create exit block for p_end_with_regs to branch to |
| - aco: wait memory ops done before go to next shader part |
| - radeonsi: reduce sgpr count for scratch_offset when aco |
| - radeonsi: init spi_ps_input_addr for part mode ps |
| - radeonsi: extract si_prolog_get_internal_binding_slot |
| - radeonsi: extract si_get_ps_prolog_args to be shared with aco |
| - ac,radeonsi: remove unused ps prolog key fields |
| - radeonsi: add ps prolog shader part build |
| - radeonsi: extract si_get_ps_epilog_args to be shared with aco |
| - radeonsi: fill aco shader info for ps part |
| - radeonsi: add ps epilog shader part build |
| - radeonsi: enable aco compile for part mode ps |
| - radeonsi: disable disk cache when use aco |
| |
| Rebecca Mckeever (32): |
| |
| - vulkan/runtime: Add helper functions for VK_EXT_host_image_copy |
| - nouveau/codegen: Support nir_intrinsic_load_workgroup_id_zero_base |
| - nouveau/codegen: Set lower_device_index_to_zero |
| - nvk: Convert system values for gl_PointCoord and PointCoord into inputs |
| - nvk: Add base_group to root descriptor table |
| - nvk: Lower base_workgroup_id |
| - nvk: Implement nvk_CmdDispatchBase and delete nvk_CmdDispatch |
| - nvk: Advertise KHR_device_group |
| - nvk: Add VK_FORMAT_B4G4R4A4_UNORM_PACK16 format to nil_format_info table |
| - nvk: Add A4B4G4R4 formats to nil_format_info table |
| - nvk: Advertise EXT_4444_formats |
| - nvk: Enable shadow sampling |
| - nvk: Implement VK_EXT_non_seamless_cube_map |
| - nouveau/nil: Add macros for ufixed |
| - nvk: Implement VK_EXT_image_view_min_lod |
| - nvk: Update mutable descriptor struct type |
| - nvk: Replace asserts with conditional that sets type_list = NULL |
| - nvk: Implement nvk_GetDescriptorSetLayoutSupport |
| - nvk: Enable VK_KHR_maintenance3 |
| - nvk: Advertise VK_EXT_mutable_descriptor_type |
| - nvk: Set image index to zero for NULL nvk_buffer_view |
| - nvk: Advertise VK_EXT_image_robustness |
| - nvk: Advertise VK_EXT_robustness2 |
| - nvk: Add view_index to root descriptor table |
| - nvk: Lower nir_intrinsic_load_view_index |
| - nvk: Add draw support for multiview |
| - nvk: Add query support for multiview |
| - nvk: Add input attachments support for multiview |
| - nvk: Advertise VK_KHR_multiview |
| - nvk: Load view_mask to shadow scratch in nvk_CmdBeginRendering |
| - nvk: Combine CLEAR_VIEWS and CLEAR_LAYERS MME macros |
| - nvk: Move code inside view mask loops to a helper function |
| |
| Rhys Perry (89): |
| |
| - ac/llvm: fix AC_TM_CHECK_IR |
| - radv: fix radv_get_ballot_bit_size with CS |
| - ac/llvm: fix wave32 ac_build_mbcnt_add with 64-bit mask |
| - ac/llvm: skip ballot zext for 32-bit dest with wave32-as-wave64 |
| - radv: add conformant_trunc_coord to cache UUID |
| - radv: don't unset TRUNC_COORD if conformant_trunc_coord=true |
| - ac/nir: always round cube array layers |
| - nir/unsigned_upper_bound: fix phi(bcsel) |
| - nir/tests: add test for unsigned_upper_bound with loop header phis |
| - nir/opt_dead_cf: remove nodes after a jump earlier |
| - nir/tests: add nir_opt_dead_cf_test.jump_before_constant_if |
| - aco: insert s_nop before VGPR deallocation |
| - nir/lower_shader_calls: vectorize stack access for all shaders |
| - radv: workaround WWZ exporting index=1 through location=1 |
| - radv: correctly skip MRT output NaN fixup for meta shaders |
| - radv: don't set vertex_attribute_strides on GFX8+ |
| - radv/ci: skip some mesh shader tests on GFX1100 |
| - aco: summarize register demand after handling branches |
| - aco: don't create sendmsg(dealloc_vgprs) if scratch is used |
| - radv: disable 64-bit color attachments |
| - radv: fix 128bpp comp-to-single clears |
| - radv: support 128bpp comp-to-single with all colors |
| - radv/gfx11: re-enable 0001/1110 clear values |
| - nir/lower_shader_calls: fix align_offset |
| - nir/opt_load_store_vectorize: support scratch access |
| - radv: vectorize RT stack access |
| - radv: vectorize scratch access |
| - aco: fix p_bpermute_gfx6 with input at non-zero byte |
| - aco: fix p_bpermute_gfx6's exec save/restore with wave32 |
| - aco: clarify bpermute pseudo opcode names |
| - aco: add adjust_bpermute_dst helper |
| - aco/spill: skip p_branch in process_block |
| - aco/spill: add all live-in to merge block spill candidates |
| - nir/lower_system_values change num_workgroups to uint32_t |
| - radv: optimize mesh workgroup ID using ts_mesh_dispatch_dimensions |
| - radv: use shortcut_1d_workgroup_id |
| - aco: remove fast path in insert_exec_mask's process_instructions |
| - aco/optimizer_postRA: check overwritten_subdword in is_overwritten_since() |
| - aco: check logical_phi_info at p_logical_end when eliminating exec writes |
| - aco: remove unused p_logical_end check when optimizing branching sequence |
| - radv: disable mesh dispatch XYZ_DIM when possible |
| - nir/deref: remove rematerialize_deref_in_block cache |
| - aco: reset prefetch in the correct block after removing the exit |
| - aco/waitcnt: replace wait_cnt::\*_cnt with booleans |
| - aco/waitcnt: add print helpers |
| - nir/lower_int64: fix find_lsb(0) |
| - nir/algebraic: optimize u2u32(a >> 32) |
| - aco/optimizer_postRA: don't combine DPP across exec on GFX8/9 |
| - aco: don't combine DPP into v_cmpx |
| - aco: disable zero offset optimization for strict WQM coords |
| - nir/constant_folding: remove zero texel offset |
| - aco: remove zero offset optimization |
| - aco: shrink DPP8_instruction |
| - aco: add fetch_inactive field to DPP instructions |
| - nir: add fetch inactive index to quad_swizzle_amd/masked_swizzle_amd |
| - aco: disable FI for quad/masked swizzle |
| - aco: fix LdsDirectVMEMHazard WaW with the wrong waitcnt |
| - aco: only mitigate VcmpxExecWARHazard when necessary |
| - aco: fix s_setreg hazards |
| - aco: consider exec_hi in reads_exec() |
| - aco: resolve all possible hazards at the end of shader parts |
| - aco/tests: test that hazards are resolved at the end of shader parts |
| - radv: skip zero-sized memcpy |
| - ac/nir: fix out-of-bounds access in ac_nir_export_position |
| - radv: fix signed integer overflow |
| - Revert "radv: pre-init surface info" |
| - nir: improve ms_cross_invocation_output_access with local_invocation_id |
| - aco,nir: add export_row_amd intrinsic |
| - ac/nir: add row parameter to helpers |
| - ac/nir: remove dead code |
| - ac/nir: refactor mesh vertex/primitive export |
| - ac/nir: implement mesh shader gs_fast_launch=2 |
| - ac/nir: optimize mesh shader local_invocation_index |
| - radv: implement mesh shader gs_fast_launch=2 |
| - ac/nir: add emit_ms_outputs helper |
| - ac/nir,radv: pass workgroup size to ac_nir_lower_ngg_ms |
| - ac/nir: implement mesh shader multi-row export |
| - radv: implement mesh shader multi-row export |
| - radv: enable mesh shader gs_fast_launch=2 and multi-row export |
| - nir/serialize: fix signed integer overflow |
| - nir/lower_shader_calls: skip zero-sized qsort |
| - util: skip zero-sized SHA1Update |
| - radv: call lower_array_deref_of_vec before lower_io_arrays_to_elements |
| - radv: skip radv_remove_varyings for mesh shaders |
| - radv: disable gs_fast_launch=2 by default |
| - docs: fix RADV_THREAD_TRACE_CACHE_COUNTERS default |
| - radv: add radv_disable_trunc_coord option |
| - radv: enable radv_disable_trunc_coord for vkd3d-proton/DXVK |
| - ac/nir: fix partial mesh shader output writes on GFX11 |
| |
| Rob Clark (60): |
| |
| - freedreno: move virtgpu msm_proto.h to common |
| - freedreno/drm/virtio: Remove unused header |
| - tu/msm: staticify a couple things |
| - tu/knl: Remove some random const'ness |
| - drm-uapi: Update virtgpu header |
| - freedreno: Update virtgpu proto |
| - freedreno/drm/virtio: Use global_faults |
| - tu: close submitqueues before device_finish() |
| - tu/drm: Factor out shared helpers |
| - tu/drm: Add missing error path cleanup |
| - tu/drm: Split out helper for iova alloc |
| - tu: Add virtgpu support |
| - util: Decouple disk cache from EGL_ANDROID_blob_cache |
| - docs: Followup to !24636 |
| - tu: Workaround bionic _SC_LEVEL1_DCACHE_LINESIZE |
| - ir3+tu: Simplify ir3_find_sysval_regid callers |
| - freedreno/a6xx: Drop unused screen args |
| - freedreno/a6xx: Re-work fd6_emit_shader |
| - freedreno/a6xx: Re-write the function-of-doom |
| - freedreno: Implement ATI_meminfo |
| - freedreno/a6xx: ARB_post_depth_coverage |
| - freedreno/a6xx: ARB_sample_locations |
| - freedreno/a6xx: ARB_texture_filter_minmax |
| - freedreno/a6xx: EXT_demote_to_helper_invocation |
| - freedreno/a6xx: EXT_shader_image_load_formatted |
| - freedreno/a6xx: EXT_depth_bounds_test |
| - freedreno/a6xx: Use pipe_blit_info::sample0_only |
| - freedreno/a6xx: Handle PIPE_BIND_BLENDABLE |
| - freedreno/a6xx: ARB_shader_viewport_layer_array |
| - tu: Fix heap size |
| - freedreno: Fix crash with debug msgs enabled |
| - freedreno/layout: Handle 565/etc MSAA special case |
| - freedreno/decode: Fix printing chip-id |
| - freedreno/a6xx: Add L8_SRGB |
| - freedreno: Add reformatting commits to .git-blame-ignore-revs |
| - freedreno/fence: Hold a strong ref to batch |
| - freedreno/decode: Lookup device info |
| - freedreno/decode: Use info->chip to decode |
| - freedreno/decode: Remove gpu_id |
| - freedreno: Indentation fix |
| - freedreno: Use explicit QCOM_TILED3 modifier |
| - freedreno/a6xx: Remove dummy packet for globals |
| - freedreno: Fix streamout offset_buf dirtiness |
| - freedreno: Fix user const buffer dirtiness |
| - freedreno/batch: Move query_buf allocation |
| - freedreno: Add private-BO tracking |
| - freedreno: Add missing indirect_draw_count tracking |
| - freedreno: Move/add some attach_bo() |
| - freedreno: Add attach-bo debugging |
| - freedreno: Rework supported-modifiers handling |
| - mesa: Introduce MESA_texture_const_bandwidth |
| - mesa: Implement MESA_texture_const_bandwidth |
| - freedreno: Add PIPE_CAP_HAS_CONST_BW support |
| - panfrost: Add PIPE_CAP_HAS_CONST_BW support |
| - iris: Add PIPE_CAP_HAS_CONST_BW support |
| - radeonsi: Add PIPE_CAP_HAS_CONST_BW support |
| - tu/msm: Fix timeline semaphore support |
| - tu/virtio: Fix timeline semaphore support |
| - freedreno/drm: Fix race in zombie import |
| - freedreno: Always attach bo to submit |
| |
| Robert Foss (9): |
| |
| - egl: Expose access to DeviceList |
| - egl: Rename _eglRefreshDeviceList() to _eglDeviceRefreshList() |
| - egl: Refresh DeviceList during eglInitialize() |
| - egl/surfaceless: Use EGL DeviceList instead of drmGetDevices2() |
| - egl/android: Use EGL DeviceList instead drmGetDevices2() |
| - egl: Rename _eglAddDevice() to _eglFindDevice() |
| - egl: Rename _eglAddDevice() to _eglFindDevice() |
| - egl: Fix attrib_list[0] == EGL_NONE check |
| - egl: Always set _EGLDisplay->Device during eglGetPlatformDisplay() |
| |
| Robert Mader (6): |
| |
| - egl/wayland: wait for compositor to release shm buffers |
| - iris: Support parameter queries for main planes |
| - util: Add new helpers for pipe resources |
| - panfrost: Support parameter queries for main planes |
| - vc4/resource: Support offset query for multi-planar planes |
| - v3d/resource: Support offset query for multi-planar planes |
| |
| Rohan Garg (33): |
| |
| - iris: migrate WA 14013910100 to use the WA framework |
| - iris: migrate WA 14016118574 to use the WA framework |
| - iris: fix iris for WA 16013000631 |
| - intel/perf: add perf query support for Intel Raptorlake |
| - intel/genxml: set a default value for "Pixel Position Offset Enable" in genxml |
| - anv: use the WA infrastructure where possible when generating state |
| - anv: use the correct GFX_VERx10 macro for WA |
| - anv,iris: program the maximum number of threads on compute queue init |
| - anv: drop CFE state validation checks |
| - iris: track reset signalling instead of replacing the context |
| - iris: allow for a unsynchronized device reset query |
| - anv: partially revert 2e8b1f6d |
| - anv: emitting 3DSTATE_PRIMITIVE_REPLICATION is required on Gen12+ |
| - anv: use the pre defined _3DPRIMITIVE_DIRECT macro |
| - anv: drop dead ifdef |
| - iris: use the correct WA macros and lineage numbers |
| - anv: use the lineage number for WA |
| - crocus: add a __gen_get_batch_address declaration |
| - crocus: fix GFX_VERx10 macro |
| - blorp: drop undefined macro |
| - iris: migrate preemption streamwout wa to WA infra |
| - intel/genxml: update PIPE_CONTROL instruction for dg2 |
| - anv: define clear color localy within can_fast_clear_color_att |
| - intel/compiler: Adjust CS payload registers for new register width on Xe2+ |
| - intel/compiler: Adjust fence message lengths for new register width on Xe2+ |
| - intel/compiler: Adjust barrier emission for Xe2+ |
| - intel/genxml: fix 3DSTATE_3D_MODE length to align with BSpec |
| - anv: ensure that FCV_CCS_E fast clears are properly tracked |
| - anv: enable FCV for Gen12.5 |
| - anv: fix debug string for PC flush |
| - anv: cleanup includes |
| - anv: turn off non zero fast clears for CCS_E |
| - anv: selectively enable FCV optimization for DG2 |
| |
| Roland Scheidegger (1): |
| |
| - lavapipe: further limit accurate_a0 hack |
| |
| Roman Stratiienko (22): |
| |
| - egl: android: Remove legacy name-based shared buffers support |
| - util: Add NONNULL macro |
| - android: Introduce the Android buffer info abstraction |
| - android: Fix num_planes assignment in u_gralloc_fallback |
| - v3dv/android: Use u_gralloc code |
| - v3dv/android: Enable shared presentable image support |
| - v3dv: Migrate to vk_device_memory |
| - v3dv/android: Skip swapchain binding |
| - v3dv: Rely on the internal tiled flag instead of the common vk structure |
| - v3dv/android: Add a helper function to support explicit layouts |
| - v3dv/android: Rework Android native buffer importing logic |
| - v3dv: Use format stored in vk_image and vk_image_view after init |
| - v3dv: Split v3dv_image_init to use layout setting logic separately |
| - v3dv/android: Add AHardwareBuffer support |
| - v3dv: Enable VK API v1.2 for Android |
| - panvk: Add Android ICD loader entry point |
| - u_gralloc: Remove inline modifiers from the functions |
| - u_gralloc: Remove usage of NONNULL macro |
| - Revert "util: Add NONNULL macro" |
| - u_gralloc: Add a function that returns gralloc type |
| - dri: Remove __driDriverExtensions leftovers |
| - v3d: Don't implicitly clear the content of the imported buffer |
| |
| Ruijing Dong (2): |
| |
| - frontends/va: checking va version for av1enc support |
| - radeonsi/vcn: change max_poc to fixed value for hevc encoder. |
| |
| Ryan Neph (1): |
| |
| - vulkan/android: add missed STACK_ARRAY_FINISH() |
| |
| Sagar Ghuge (34): |
| |
| - intel/compiler: Look at 2 register worth of data instead of 4 |
| - isl: Disable MCS compression just on ACM platform |
| - intel: Add env variable to add break point on/before draw |
| - anv: Add GPU breakpoint before/after specific draw call |
| - iris: Add GPU breakpoint before/after draw call |
| - blorp: Implement blorp hooks to emit breakpoint |
| - docs: Add INTEL_DEBUG_BKP_BEFORE/AFTER_DRAW_COUNT |
| - intel/isl: Enable INTEL_DEBUG=noccs/nohiz in ISL helpers |
| - anv,hasvk: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks |
| - iris,crocus: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks |
| - blorp: Drop unnecessary assertions in blorp_can_hiz_clear_depth |
| - anv: Add helper to create companion RCS command buffer |
| - anv: Split out End/Destroy/Reset cmd buffer code into helper |
| - anv: Handle companion RCS in end/destory/reset code path |
| - intel: Add helper to create/destroy i915 VM |
| - intel: Pass virtual memory address space ID while creating context |
| - anv: Create companion RCS engine |
| - anv: Move compute specfic bits under compute queue init |
| - anv: Execute RCS init batch on companion RCS context/engine |
| - anv: Setup companion RCS command buffer submission |
| - anv: Execute an empty batch to sync main and companion RCS batch |
| - anv: Add secondary companion RCS cmd buffer to primary |
| - anv: Skip layout transition on the compute queue |
| - anv: Extract batch print code to anv_print_batch helper |
| - iris: Enable always flush cache with DEBUG_STALL option |
| - intel/genxml: Add STATE_COMPUTE_MODE instruction |
| - anv: Program and emit STATE_COMPUTE_MODE |
| - anv: Enable barrier handling on video engines |
| - isl: Use 16-bit instead of 8-bits for surface format info fields |
| - anv: Handle end of pipe with MI_FLUSH_DW on transfer queue |
| - anv: Enable transfer queue only on ACM+ platforms |
| - blorp: Use the correct miptail start LOD for surfaces |
| - anv: Write timestamp using MI_FLUSH_DW on blitter |
| - anv: Flush data cache while clearing depth using HIZ_CCS_WT |
| |
| Saleemkhan Jamadar (1): |
| |
| - radeonsi/vcn: set jpeg reg version for gfx 1150 |
| |
| Samuel Holland (3): |
| |
| - Android.mk: Allow building only Vulkan drivers |
| - Android.mk: Explicitly enable/disable LLVM support |
| - Android.mk: Only link LLVM for radeonsi, not amd_vk |
| |
| Samuel Pitoiset (299): |
| |
| - radv: remove support for VK_INDIRECT_COMMANDS_TOKEN_TYPE_STATE_FLAGS_NV |
| - radv: make radv_get_pa_su_sc_mode_cntl() static |
| - zink/ci: update list of expected failures for NAVI10 |
| - radv: stop using a pipeline for emitting VGT_VERTEX_REUSE_BLOCK_CNTL |
| - radv: remove unused param in radv_pipeline_emit_vgt_gs_out() |
| - radv: pass a shaders array for computing ia_multi_vgt_param |
| - radv: bind the pre-compiled PS epilog to the cmdbuf state |
| - radv: stop using an array of binaries when compiling a compute shader |
| - radv: add radv_compile_cs() to compile a compute shader |
| - radv: remove the pipeline dependency for creating a GS copy shader |
| - radv: add a helper to compute the ESGS itemsize |
| - radv: use the number of GS linked inputs to compute the ESGS itemsize |
| - radv: determine ES info for VS/TES with GS earlier |
| - radv: determine as_ls earlier by using the next stage |
| - radv: simplify getting next VS stage for VS prologs |
| - radv: use next_stage for determining the stage to lower NGG |
| - radv/amdgpu: fix dumping CS with the chained IBs path |
| - radv/amdgpu: rename old_ib to ib in radv_amdgpu_winsys_cs_dump() |
| - radv: pass submit info to radv_check_gpu_hangs() |
| - radv: initialize stage/next_stage earlier |
| - radv: set next_stage to MESA_SHADER_NONE if there is no FS |
| - radv: rework considering force VRS without relying on graphics pipeline |
| - radv: stop passing radv_graphics_pipeline to radv_fill_shader_info() |
| - radv: move removing all varyings when the FS is a noop |
| - radv: rename graphics pipeline linking helpers |
| - radv: simplify lowering NGG GS intrinsics |
| - radv: rework determining the NGG stage without a graphics pipeline |
| - radv: cleanup pipeline compute emit helpers |
| - radv: rename radv_pipeline_stage to radv_shader_stage |
| - radv: rename NGG query state to be more generic |
| - radv: declare the shader query user SGPR for emulating GS counters |
| - radv: enable pipelinestat query emulation for legacy GS |
| - radv: simplify the NGG vs legacy pipelinestat query path |
| - radv: rename RADV_SHADER_QUERY_PIPELINE_STAT_OFFSET |
| - radv: implement nir_intrinsic_atomic_add_gs_invocation_count_amd |
| - radv: emulate GEOMETRY_SHADER_INVOCATIONS query on RDNA1-2 |
| - radv: track whether inputs/outputs are linked per shader stage |
| - radv: add support for VS/TES as ES without shaders IO linking |
| - radv: use next_stage to determine if the layer should be exported |
| - radv: use next stage to determine if primID/clip dist should be exported |
| - radv: compute the legacy GS info earlier |
| - radv: stop copying some NIR info fields from TES to TCS |
| - radv: stop lowering patch vertices for TES |
| - radv: do not always copy the number of tess patches to TES |
| - radv: initialize tcs.tes_{patch}_inputs_read to a default value |
| - radv: prevent linking TCS<->TES when TES is NULL |
| - radv: use a packed user SGPR for the TES state |
| - radv: stop checking if patch control points is dynamic everywhere |
| - radv: copy the number of TCS vertices out to TES shader info |
| - radv: add support for dynamic TCS vertices out for TES |
| - radv: remove radv_shader_info::tes::num_linked_patch_inputs |
| - amd,radeonsi: move si_shader_io_get_unique_index_patch() to common code |
| - radv: allow to use fixed IO locations for VS<->TCS<->TES without linking |
| - aco: add aco_shader_info::tcs::has_epilog |
| - aco: add infra for compiling TCS epilogs |
| - radv,aco: move has_epilog to radv_shader_info |
| - radv: assume a TCS needs an epilog unless it's linked with a TES |
| - radv: do not write tess factors in main TCS when it has an epilog |
| - radv: track if TES reads tess factors differently |
| - radv: declare new argument for the TCS epilog PC |
| - radv: add radv_tcs_epilog_key |
| - radv: add infra for creating TCS epilogs |
| - radv: add support for a TCS epilogs cache in the device |
| - radv: add support for emitting TCS epilogs in cmdbuf |
| - radv: remove unnecessary check in radv_pipeline_nir_to_asm() |
| - radv: stop passing a graphics pipeline to radv_pipeline_nir_to_asm() |
| - radv: inline radv_pipeline_get_nir() in radv_graphics_pipeline_compile() |
| - radv: add a struct for the retained shaders and GPL |
| - radv: add radv_graphics_shaders_compile() to compile graphics shaders |
| - radv: remove redundant check in radv_cmd_buffer_after_draw() |
| - radv: track if patch control points is dynamic from the cmdbuf state |
| - radv: re-emit binning state if the framebuffer is dirty |
| - radv: track if vertex binding stride is dynamic from the cmdbuf state |
| - vulkan: bump header register to 1.3.261 |
| - vulkan/runtime: add common implementation for GetImageSubresourceLayout() |
| - vulkan/format: add VK_FORMAT_{A8_UNORM,A1B5G5R5_UNORM_PACK16}_KHR |
| - radv: use the RT prolog scratch size directly for tracing rays |
| - radv: add a helper to get the maximum number of scratch waves per shader |
| - radv: update the number of scratch waves for RT prolog at bind time |
| - radv: update cmdbuf scratch size info when shaders are bound |
| - vulkan: add init/finish helpers for vk_buffer_view |
| - radv: use vk_buffer_view |
| - radv: use vk_sampler |
| - radv: use common vkCmdBegin/EndQuery wrappers |
| - radv: use vk_query |
| - zink: fix setting VkShaderCreateInfoEXT::nextStage |
| - radv/rt: fix capture/replay support |
| - vulkan/render_pass: add common vkGetRenderingAreaGranularityKHR() |
| - radv: implement vkCmdBindIndexBuffer2KHR() |
| - radv: allow VK_WHOLE_SIZE for pSizes in vkCmdBindVertexBuffers2() |
| - radv/rmv: remove unused pipeline create flags when logging pipelines |
| - radv: store pipeline create flags to radv_pipeline::create_flags |
| - radv: add support for VkPipelineCreateFlags2CreateInfoKHR |
| - radv: add support for VkBufferUsageFlags2CreateInfoKHR |
| - radv: allow VK_REMAINING_ARRAY_LAYERS with VkImageSubresourceLayers |
| - radv: implement radv_Get{Device}ImageSubresourceLayout2KHR() |
| - radv: advertise VK_KHR_maintenance5 |
| - radv: remove useless NULL for pipeline layout during shader info pass |
| - radv: introduce radv_shader_layout for per-stage descriptor layout |
| - radv: stop passing redundant stage to radv_shader_stage_init() |
| - radv: re-introduce radv_pipeline_stage_init() |
| - radv: add support for loading the LSHS vertex stride from a SGPR |
| - radv: use the number of VS outputs for computing the tessellation info |
| - vulkan: ignore VkPipelineColorWriteCreateInfoEXT if the state is dynamic |
| - radv: reduce TCS_OFFCHIP_LAYOUT_NUM_PATCHES to 6-bits |
| - radv: add missing comment about TCS_OFFCHIP_LAYOUT_LSHS_VERTEX_STRIDE |
| - radv: fix emitting TCS epilogs for GFX6-9 |
| - radv: remove radv_cmd_buffer::cached_vertex_formats |
| - radv: remove unused param from radv_pipeline_init_multisample_state() |
| - radv: simplify declaring VS specific input SGPRs |
| - radv: stop copying if VS or TES uses the InvocationID built-in |
| - Revert "radv/amdgpu: workaround a kernel bug when replacing sparse mappings" |
| - Revert "radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build" |
| - radv/amdgpu: allow to execute external IBs on the compute queue |
| - radv/amdgpu: add support for submitting external IBs with the chained path |
| - zink/ci: update list of expected failures for NAVI10 |
| - radv: use the maximum possible workgroup size for TCS epilogs |
| - radv: stop declaring the scratch offset argument for TCS epilogs |
| - radv: declare shader arguments for TCS epilogs |
| - radv: add tcs_out_patch_fits_subgroup to radv_tcs_epilog_key |
| - aco: fix jumping from main TCS to epilog on GFX9+ |
| - aco: adjust TCS epilogs for RADV |
| - aco: allow SGPRs operands with p_jump_to_epilog |
| - aco: implement create_tcs_jump_to_epilog() |
| - radv: track the pipeline bind point for indirect commands layout |
| - radv: prepare radv_get_sequence_size() for DGC compute |
| - radv: prepare radv_prepare_dgc() for DGC compute |
| - radv: implement NV_device_generated_commands_compute |
| - radv: allow DGC on the compute queue |
| - radv: advertise NV_device_generated_commands_compute |
| - aco: rework printing shader stages |
| - radv: fix the per-patch data offset when TES isn't linked with TCS |
| - radv: stop declaring unused SGPR arguments for PS epilogs |
| - radv: add radv_shader_info::is_monolithic |
| - radv: use info->uses_view_index directly when declaring shader arguments |
| - radv: do not inline push constants for non-monolithic shaders |
| - radv: force indirect descriptor sets for non-monolithic shaders |
| - radv: always declare some arguments for non-monolithic VS/TCS shaders |
| - radv: add a new shader argument for non-monolithic shaders PC |
| - ac: allow to mark shader arguments as preserved |
| - radv: preserve shader arguments for non-monolithic VS/TCS on GFX9+ |
| - aco: disable shared VGPRs for non-monolithic shaders on GFX9+ |
| - aco: ensure to initialize exec manually for VS as LS on GFX9+ |
| - aco: add support for compiling VS+TCS separately on GFX9+ |
| - radv: always declare some arguments for non-monolithic {VS,TES}/GS shaders |
| - radv: preserve shader arguments for non-monolithic {VS,TES}/GS on GFX9+ |
| - aco: ensure to initialize exec manually for non-monolithic {VS,TES}/GS on GFX9+ |
| - aco: add support for compiling {VS,TES}+GS separately on GFX9+ |
| - radv,aco: remove unused clip/cull distances variables |
| - radv: rename tcs_shader to tcs in radv_emit_tcs_epilog_state() |
| - radv: small cleanups in radv_emit_patch_control_points() |
| - radv: fix emitting TCS epilogs if TES and GS are linked on GFX9+ |
| - radv: remove the pipeline dependency for emitting VGT_GS_MODE |
| - aco: fix emitting TCS epilogs end on GFX9+ |
| - radv: re-order IO slot layout for stages that aren't linked |
| - amd/ci: update list of failures/flakes for glcts-vangogh-valve |
| - ci: uprev vkd3d-proton |
| - ci: uprev Fossilize |
| - ci: add comment explaining which image tags to update for Fossilize |
| - radv: preserve shader argument for separate compilation of NGG shaders |
| - aco: flag blocks with long-jump as export_end for separate compilation |
| - aco: adjust fix_exports() for VS/TES as NGG and non-monolithic shaders |
| - aco: allow separate compilation of NGG shaders |
| - zink/ci: add zink-radv-polaris10-valve |
| - radv/ci: re-enable vkcts-polaris10-valve |
| - radv: fix capturing indirect dispatches with SQTT |
| - radv/ci: re-enable vkd3d-polaris10-valve |
| - ci: do not fail vkd3d-proton job when the expectations match |
| - radv/amdgpu: fix executing secondaries without IB2 |
| - radv/amdgpu: do not copy the original chain link for IBs |
| - radv: avoid emitting SQTT markers for DGC calls |
| - radv: add support for DGC with SQTT |
| - zink/ci: merge GLCTS testing with GLESx for RADV |
| - zink/ci: merge piglit testing with deqp-runner for RADV |
| - radv: fix interactions with primitives generated queries and pipeline stats |
| - radv: skip DGC calls when the indirect sequence count is zero with a predicate |
| - radv: avoid emitting THREAD_TRACE_MARKER for predicated draws/dispatches |
| - radv: adjust next stage for VS prologs and merged shaders compiled separately |
| - radv: adjust emitted prolog regs for merged shaders compiled separately |
| - radv: do not use pre-compiled prologs when VS is compiled separately |
| - radv: remove useless PIPELINE_CREATE_2_LIBRARY_BIT check for retained shaders |
| - radv: fix enabling DGCC |
| - radv: fix emitting SQTT userdata when CAM is needed |
| - radv: fix capturing RGP on RDNA3 with more than one Shader Engine |
| - zink/ci: update list of expected failures for POLARIS10/NAVI10 |
| - radv: set THREAD_TRACE_TOKEN_MASK.BOP_EVENTS_TOKEN_INCLUDE on GFX10.3+ |
| - radv: disable unsupported hw shader stages for RGP on GFX11+ |
| - radv: fix instruction timing on GFX11 |
| - ac/rgp: use correct API stage string for mesh/task shaders |
| - radv: set THREAD_TRACE_MARKER_ENABLE for mesh/task draws |
| - radv: emit relocation for mesh/task shaders |
| - issue_templates/Bug Report: fix outdated URL for GFXReconstruct |
| - ac,radv,radeonsi: rework SPM counters configuration and share it |
| - ac/perfcounter: add new SQ_WGP block for GFX11+ |
| - ac/spm: add SPM counters configuration for GFX11 |
| - radv: enable the PKT3 CAM bit for some SPM register writes |
| - radv,radeonsi: use AC_SPM_SEGMENT_TYPE_xxx instead of magic values |
| - ac/spm: remove useless SPM block setting for GFX9 and older GPUs |
| - ac/spm: add SPM block definition for GFX10-GFX10.3 |
| - ac/gpu_info: init num_cu_per_sh from the kernel |
| - ac/perfcounter: set the number of instances of GL1C to 4 |
| - ac/perfcounter: compute the number of global instances of TCP,SQ,GL1C and GL2C |
| - ac/spm: fix checking if the counter instance is valid |
| - ac/spm: rework how segment muxsel RAM are filled |
| - ac/spm: initialize and set instance mapping for counters |
| - radv: reserve more CS space in SQTT/SPM paths |
| - ac/spm: use block flags to initialize instance mapping |
| - ac/spm: select correct segment type for per-SE blocks |
| - radv,radeonsi: make sure to emit GRBM_GFX_INDEX before SQ select registers |
| - ac/spm: fix number of instances of GL2C |
| - ac,radv,radeonsi: prepare support for multi-instance SPM SQ counters |
| - ac,radv,radeonsi: prepare support for multi-instance SPM generic counters |
| - ac/spm: move the counter instance to ac_spm_counter_create_info |
| - ac/spm: enable support for multi-instance counters |
| - radv: fix checking if RGP is enabled with others tracing tools |
| - radv: fix missing ISA with RGP and GPL |
| - ac/perfcounter: add SG_WQP group for GFX11 |
| - ac/perfcounter: add GFX11 groups |
| - drirc: remove Path of Exile workarounds |
| - radv: remove drirc workarounds for Path Of Exile |
| - radv: remove absolute_depth_bias workaround |
| - ac/gpu_info: define AMD_MAX_WGP |
| - ac/spm: add new segment types for GFX11 |
| - ac/spm: add support for GFX11 |
| - radv: add SPM support for GFX11 |
| - radv: enable cache counters for RGP on GFX11 |
| - ci: update to vulkan-cts-1.3.6.3 |
| - radv/ci: skip dEQP-VK.robustness.* on Vangogh due to weird GPU hangs |
| - nir: rename atomic_add_gs_invocation_count_amd to make it more generic |
| - ac/nir: add lowering for mesh shader queries |
| - ac/nir: add lowering for task shader queries |
| - radv: add GDS counters offset for mesh/task queries |
| - radv: adjust lowering of intrinsic queries for mesh/task shaders |
| - radv: enable lowering of mesh/task shader queries when enabled |
| - radv: declare shader_query_state for mesh/task shaders |
| - radv: stop skip emitting CB states when there is no color attachment |
| - radv: re-enable DCC with mipmaps on GFX11 |
| - radv: fix COMPUTE_SHADER_INVOCATIONS query on compute queue |
| - radv: emit missing PA_{SC,SU}_LINE_STIPPLE_xxx regs in gfx preamble |
| - radv: fix alignment of DGC command buffers |
| - radv/ci: update list of expected failures on PITCAIRN |
| - radv/ci: update list of flakes for NAVI10/VEGA10 |
| - radv/amdgpu: fix alignment of command buffers |
| - radv: enable DCC for MSAA images on GFX11 |
| - zink/ci: update list of expectations for zink-anv-tgl |
| - zink/ci: bump zink-anv-tgl-full timeout to 1h45m |
| - radv/ci: rename GFX1100 lists to NAVI31 |
| - radv: fix emulated geometry shader primitives/invocations queries |
| - radv/ci: remove duplicate skipped tests for RAVEN/STONEY |
| - radv/ci: exclude dEQP-VK.texture.explicit_lod.2d.sizes.128x128_* for all jobs |
| - radv: fix synchronization with emulated GS primitives/invocations queries |
| - radv/ci: remove no longer existing test for VANGOGH |
| - radv/ci: cleanup list of expected failures for NAVI10/NAVI21/VEGA10 |
| - radv: always write the sample positions when a new descriptor BO is created |
| - radv: fill the scratch BO in radv_fill_shader_rings() |
| - radv: fix gang submissions with chaining |
| - radv: fix re-emitting streamout descriptors for NGG streamout |
| - radv: fix IB alignment |
| - zink: use warn_missing_feature for missing modifier support |
| - radv: fix destroying GDS/OA BOs |
| - radv: allocate only 1 GDS OA counter for gfx10 NGG streamout |
| - ac/nir: only consider overflow for valid feedback buffers |
| - radv/ci: update list of expected failures on RAVEN |
| - radv/ci: update list of flakes for VANGOGH |
| - radv/ci: update list of flakes for STONEY |
| - radv: disable primitive restart for non-indexed draws on GFX11 |
| - radv: enable radv_disable_aniso_single_level=true for Zink too |
| - amd/llvm,aco,radv: implement NGG streamout with GDS_STRMOUT registers on GFX11 |
| - radv: mark GDS as needed for XFB queries with NGG streamout on GFX11 |
| - radv: skip GDS allocation for NGG streamout on GFX11 |
| - zink/ci: remove expected failures that are skipped for RADV |
| - ci: update CTS to vulkan-cts-1.3.7.0 |
| - ci: bump the number of tests per group from 500 to 5000 for Vulkan drivers |
| - ci: bump DEQP_FRACTION for some jobs |
| - radv: set ENABLE_PING_PONG_BIN_ORDER for GFX11.5 |
| - radv: initialize video decoder for GFX11.5 |
| - ac/gpu_info: query the maximum number of IBs per submit from the kernel |
| - Revert "radv: fix finding shaders by PC" |
| - radv: fix missing predicate bit for WRITE_DATA helper |
| - ac/gpu_info: fix querying the maximum number of IBs per ring |
| - radv: remove outdated RADV_DEBUG=vmfaults support |
| - amd: update amdgpu_drm.h |
| - amd: add has_gpuvm_fault_query |
| - radv/amdgpu: add support quering the last GPUVM fault |
| - radv: query and report the last GPUVM fault with RADV_DEBUG=hang |
| - radv: report the last GPUVM fault when a device lost is detected |
| - ac/gpu_info: remove bogus assertion about number of COMPUTE/SDMA queues |
| - radv: fix a synchronization issue with primitives generated query on RDNA1-2 |
| - radv: bind the non-dynamic graphics state from the pipeline unconditionally |
| - radv: fix compute shader invocations query on compute queue on GFX6 |
| - radv: emit COMPUTE_PIPELINESTAT_ENABLE for CS invocations on ACE |
| - nir: fix inserting the break instruction for partial loop unrolling |
| - radv: fix registering queues for RGP with compute only |
| - radv: set radv_zero_vram=true for Unreal Engine 4/5 |
| - radv: fix a descriptor leak with debug names and host base descriptor set |
| - radv: add a missing async compute workaround for Tonga/Iceland |
| - radv: disable TC-compatible HTILE on Tonga and Iceland |
| - radv: set radv_invariant_geom=true for War Thunder |
| - radv: do not set OREO_MODE to fix rare corruption on GFX11 |
| |
| Saroj Kumar (4): |
| |
| - radeonsi: Add perfetto support in radeonsi |
| - radeonsi: Add u_trace init code in radeonsi |
| - radeonsi: Add tracepoints in radeonsi driver |
| - radeonsi: fixes compilaton error when perfetto is disabled |
| |
| Sathishkumar S (2): |
| |
| - radeonsi/vcn: support variable number of bs_bufs |
| - radeonsi/vcn: num bs_bufs must be proportional to num jpeg engines |
| |
| Semjon Kravtsenko (1): |
| |
| - glx: Assign unique serial number to GLXBadFBConfig error |
| |
| Seppo Yli-Olli (1): |
| |
| - zink: Fix SyntaxWarning in zink_extensions script |
| |
| Sergi Blanch Torne (7): |
| |
| - Introduce ci-kdl builder and launcher. |
| - Integrate ci-kdl in the building process and launch process. |
| - ci: disable Collabora's LAVA lab for maintance |
| - Revert "ci: disable Collabora's LAVA lab for maintance" |
| - Revert "ci: disable Collabora's LAVA lab for maintance" |
| - ci: disable Collabora's LAVA lab for maintance |
| - Revert "ci: disable Collabora's LAVA lab for maintance" |
| |
| Sid Pranjale (1): |
| |
| - nvk: Enable VK_EXT_load_store_op_none |
| |
| Sil Vilerino (20): |
| |
| - util: Blake3 - Identify arm64ec as aarch64 instead of x64 |
| - d3d12: Fix Map/Unmap of YUV resources |
| - d3d12: Fix H264 interlaced decode |
| - d3d12: Video Decode - Remove unnecessary copy for texture array case |
| - util/vl_vlc: Use UINT64_MAX instead of ~0UL with MSVC compiler |
| - d3d12: Extend video screen AV1 encode tile support checking |
| - aux/tc: Add ASSERTED to unreferenced release build variable |
| - d3d12: Video - Relax ID3D12VideoDevice QI version for decode, process |
| - frontends/va: Add profile param when querying PIPE_VIDEO_CAP_ENC_QUALITY_LEVEL |
| - d3d12: Upgrade to D3D12 Agility SDK 1.611 Video interface |
| - d3d12: Fixes AV1 tx_mode_support reporting and unsupported tx_mode overriding |
| - d3d12: Video Decode - Wait for GPU completion before destroying decoder in-flight objects |
| - d3d12: Do not destroy codec when destroying video buffer |
| - d3d12: AV1 encode - Add lower resolution fallback check for uniform tile support |
| - d3d12: AV1 encode - add fallback for app passing unsupported pic_params.InterpolationFilter |
| - d3d12: AV1 Encode - Fix VAConfigAttribEncMaxRefFrames reporting |
| - frontend/va: Add support for VAConfigAttribEncMaxTileRows/Cols |
| - d3d12: Add support for PIPE_VIDEO_CAP_ENC_MAX_TILE_ROWS/COLS |
| - d3d12: Allocate d3d12_video_buffer with higher alignment for compatibility |
| - d3d12: d3d12_video_buffer_create_impl - Fix resource importing |
| |
| Simon Ser (7): |
| |
| - wayland: enable use of wayland-protocols as a subproject |
| - vulkan/wsi/wayland: add support for IMMEDIATE |
| - vulkan/wsi/wayland: fix unset present_mode |
| - radv/winsys: check amdgpu_create_bo_from_user_mem() for EINVAL |
| - egl: extract EGLDevice setup in dedicated function |
| - egl: move dri2_setup_device() after dri2_setup_extensions() |
| - egl: ensure a render node is passed to _eglFindDevice() |
| |
| Simon Zeni (1): |
| |
| - nouveau/winsys: use mmap instead of mmap64 in nouveau_bo |
| |
| SoroushIMG (1): |
| |
| - pvr: fix mipmap size calculation for bc formats |
| |
| Sviatoslav Peleshko (9): |
| |
| - dri: Use RGB internal formats for RGBX formats |
| - intel/isl: Don't over-allocate CLEAR_COLOR size to use whole cache line |
| - anv: Do fast clear color initialization more delicately |
| - zink: Change zink_vertex_elements_hw_state::b.strides to VkDeviceSize |
| - intel/fs: Check if the whole ubo load range is in the push const range |
| - zink: Store zink_vertex_elements_hw_state::b.strides by binding id |
| - intel/fs: Fix "packed word exception" condition for register regioning |
| - intel/eu/validate: Validate "packed word exception" stricter |
| - nir/loop_analyze: Fix inverted condition handling in iterations calculation |
| |
| Sylvain Munaut (9): |
| |
| - egl/dri2: Add a couple of missing mutex release in error path |
| - mesa: Enable ARB_texture_border_clamp in GL Core |
| - include: Fix the PFN declarations to be pointers as they should |
| - glx: Add missing MesaGLInteropGLXFlushObjects |
| - glx: Export the MESA GL Interop functions through glXGetProcAddress |
| - egl: Export the MESA GL Interop functions through eglGetProcAddress |
| - glx: Remove MESA_depth_float_bit from enum |
| - glx: Advertise GLX_MESA_gl_interop extension if support present |
| - egl: Advertise EGL_MESA_gl_interop extension if support present |
| |
| Tapani Pälli (34): |
| |
| - intel/blorp: add a new flag to communicate PSS sync need |
| - anv: implement required PSS sync for Wa_18019816803 |
| - iris: implement required PSS sync for Wa_18019816803 |
| - vulkan/runtime: change assert to match specification needs |
| - anv: remove assert, size is asserted in the runtime |
| - anv: refactor batch_set_preemption to use batch_emit_pipe_control |
| - anv: implement a dummy depth flush for Wa_14016712196 |
| - iris: implement a dummy depth flush for Wa_14016712196 |
| - mesa: fix some TexParameter and SamplerParameter cases |
| - mesa: remove GL_UNSIGNED_BYTE as supported for snorm reads |
| - ci: add a fix for KHR-GLES3.packed_pixels.*snorm tests |
| - anv: implement Wa_14018912822 |
| - iris: implement Wa_14018912822 |
| - driconf: use lower_depth_range_rate for The Spirit and The Mouse |
| - mesa: disable snorm readpix clamping with EXT_render_snorm |
| - iris: modify Wa_14014414195 to use intel_needs_workaround |
| - mesa: some cleanups for texparam extension checks |
| - iris: avoid issues with undefined clip distance |
| - crocus: avoid issues with undefined clip distance |
| - anv: refactor to fix pipe control debugging |
| - anv: fix a leak of fp64_nir shader |
| - iris: use intel_needs_workaround for Wa_14014414195 part 2 |
| - iris: correct dst alpha blend factor in Wa_14018912822 |
| - iris/anv: move Wa_14018912822 as a drirc workaround |
| - iris: flush data cache when flushing HDC on GFX < 12 |
| - anv: HDC flush is available only for GFX_VER 12+ |
| - iris: HDC flush is available only for GFX_VER 12+ |
| - intel/genxml: remove HDC from gen11.xml, it is not available |
| - mesa/st: ignore StencilSampling if stencil not part of the format |
| - intel/dev: expand existing fix for all gfx12 with small EU count |
| - egl: fix leaking drmDevicePtr in _eglFindDevice |
| - iris: add data cache flush for pre hiz op |
| - anv/drirc: add option to disable FCV optimization |
| - drirc: Set limit_trig_input_range option for Valheim |
| |
| Tatsuyuki Ishi (8): |
| |
| - radv/amdgpu: Remove unused bo_list variable from cs_submit. |
| - radv/winsys: Remove unused struct radv_winsys_bo_list. |
| - radv/amdgpu: Do not pass in a BO handle when clearing PRT VA region. |
| - radv: Fix IB size for RADV_DEBUG=hang. |
| - radv: Fix dumping vertex descriptors with RADV_DEBUG=hang. |
| - radv/amdgpu: Use rwlock to protect access to virtual BOs. |
| - zink: Fix missing sparse buffer bind synchronization. |
| - zink: Fix waiting for texture commit semaphores. |
| |
| Thomas H.P. Andersen (65): |
| |
| - tgsi: remove unused tgsi_shader_info.num_tokens |
| - tgsi: remove unused tgsi_shader_info.array_max |
| - tgsi: remove unused tgsi_shader_info.num_memory_instructions |
| - tgsi: remove unused tgsi_shader_info.colors_read |
| - tgsi: remove unused tgsi_shader_info.colors_written |
| - tgsi: remove unused tgsi_shader_info.reads_position |
| - tgsi: remove unused tgsi_shader_info.reads_samplemask |
| - svga: remove unused struct field |
| - tgsi: remove unused tgsi_shader_info.reads_tess_factors |
| - tgsi: remove unused tgsi_shader_info fields |
| - tgsi: remove unused tgsi_shader_info fields |
| - tgsi: remove unused tgsi_shader_info.uses_drawid |
| - tgsi: remove unused tgsi_shader_info fields |
| - tgsi: remove unused tgsi_shader_info.uses_subgroup_info |
| - tgsi: remove unused tgsi_shader_info.writes_primid |
| - tgsi: remove unused tgsi_shader_info.uses_doubles |
| - tgsi: remove unused tgsi_shader_info.uses_derivatives |
| - tgsi: remove unused tgsi_shader_info.uses_bindless_samplers |
| - tgsi: remove unused tgsi_shader_info.uses_bindless_images |
| - tgsi: remove unused tgsi_shader_info.clipdist_writemask |
| - tgsi: remove unused tgsi_shader_info.culldist_writemask |
| - tgsi: remove unused tgsi_shader_info.images_load |
| - tgsi: remove unused tgsi_shader_info.images_store |
| - tgsi: remove unused tgsi_shader_info.images_atomic |
| - tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_load |
| - tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_store |
| - tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_atomic |
| - tgsi: remove unused tgsi_shader_info.uses_bindless_image_load |
| - tgsi: remove unused tgsi_shader_info.uses_bindless_image_store |
| - tgsi: remove unused tgsi_shader_info.uses_bindless_image_atomic |
| - tgsi: remove unused tgsi_shader_info.indirect_files_read |
| - tgsi: remove unused tgsi_shader_info.indirect_files_written |
| - tgsi: remove unused tgsi_shader_info.const_buffers_indirect |
| - tgsi: remove unused tgsi_shader_info.max_depth |
| - tgsi: drop two unused functions |
| - nvk: use common physical device enumeration |
| - nvk: fix implicit-fallthrough warnings with clang |
| - nvk: delete commented code |
| - nvk: fix mem leaks |
| - nvk: use common descriptor set layout code |
| - nvk: use common pipeline layout code |
| - nvk: advertise KHR_shader_non_semantic_info |
| - nvk: advertise KHR_image_format_list |
| - nvk: advertise EXT_private_data |
| - nvk: advertise KHR_sampler_mirror_clamp_to_edge |
| - nvk: KHR_descriptor_update_template |
| - nvk: CmdPushDescriptorSetWithTemplateKHR |
| - nvk: drop dead assignment |
| - nvk: drop dead assignment |
| - nvk: fix initialization override |
| - nvk: sort extensions |
| - nvk: advertize KHR_relaxed_block_layout |
| - nvk: add check for VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT |
| - nvk: advertise EXT_image_2d_view_of_3d |
| - nvk: fix maxPushDescriptors |
| - nvk: call correct macro to clear views |
| - nouveau/mme: use fermi enum in fermi builder |
| - nvk: add warning on non-nouveau drm driver |
| - nvk: Implement VK_KHR_draw_indirect_count on Turing+ |
| - nvk: set device info before use in nvk_get_device_extensions |
| - nvk: simplify code by using new helpers |
| - nvk: remove duplicated device features |
| - nvk: EXT_conditional_rendering |
| - nvk: advertise VK_EXT_tooling_info |
| - nvk: set optimization level to 3 |
| |
| Thong Thai (3): |
| |
| - radeonsi: enable vcn encoder rgb input support |
| - Update radeon_vcn_enc.c |
| - frontends/va/config: report max width and height for encoding/decoding |
| |
| Timothy Arceri (27): |
| |
| - glsl: fix validation of ES vertex attribs |
| - nir/opt_copy_prop_vars: don't clone copies if branch empty |
| - nir/opt_copy_prop_vars: speedup cloning of copy tables |
| - nir/opt_copy_prop_vars: remove var hash entry on kill alias |
| - nir/opt_copy_prop_vars: skip cloning of copies arrays until needed |
| - nir/opt_copy_prop_vars: drop reuse of dynamic arrays |
| - glsl: fix spirv sso validation |
| - glsl: mark structs containing images as bindless |
| - util: add radeonsi workaround for Nowhere Patrol |
| - glsl: fix out params in glsl to nir |
| - glsl_to_nir: add more unhandled function types |
| - nir: replace use of nir_src_copy() |
| - nir: remove unused nir_src_copy() |
| - nir: remove unused param from nir_alu_src_copy() |
| - glsl: remove field from gl_shader_program |
| - glsl: move get_varying_type() declaration earlier |
| - glsl: add nir version of validate_first_and_last_interface_explicit_locations() |
| - glsl: switch to nir validate_first_and_last_interface_explicit_locations() |
| - glsl: remove unused validate_first_and_last_interface_explicit_locations() |
| - nir: fix typo in comment |
| - nir: copy explicit_invariant flag to nir vars |
| - glsl: move interpolation_string() to linker_util |
| - glsl: move is_gl_identifier() to linker_util |
| - nir: add used field to nir variables |
| - glsl: implement cross_validate_outputs_to_inputs() in nir linker |
| - glsl: switch to nir linkers cross_validate_outputs_to_inputs() |
| - glsl: remove now unused varying linker code |
| |
| Timur Kristóf (39): |
| |
| - aco: Fix subgroup_id intrinsic on GFX10.3+. |
| - ac/nir: Simplify arg unpacking when shift is zero. |
| - ac/nir: Add new pass to lower intrinsics to shader args. |
| - radv: Move radv_select_hw_stage to radv_shader_info. |
| - radv: Use ac_nir_lower_intrinsics_to_args. |
| - radeonsi: Move si_select_hw_stage to si_shader_info. |
| - radeonsi: Use ac_nir_lower_intrinsics_to_args. |
| - aco: Remove subgroup_id and num_subgroups intrinsics. |
| - ac/llvm: Remove subgroup_id and num_subgroups intrinsics. |
| - aco: Refactor select_program to smaller functions. |
| - nir/opt_dead_cf: Remove if branches with undef condition. |
| - ac/nir: Add done arg to ac_nir_export_position. |
| - ac/nir: Slightly refactor how pos0 exports are added when missing. |
| - ac/nir/ngg: Wait for attribute stores before VS/TES/GS pos0 export. |
| - ac/nir/ngg: Refactor mesh shader primitive export. |
| - ac/nir/ngg: Wait for attribute ring stores in mesh shaders. |
| - ac/nir/ngg: Extract nogs_export_vertex_params function. |
| - ac/gpu_info: Add some SDMA related information. |
| - ac: Clarify SDMA opcode defines. |
| - ac: Add amd_ip_type argument to ac_parse_ib and ac_parse_ib_chunk. |
| - ac: Rename ac_do_parse_ib to parse_pkt3_ib. |
| - ac: Print IP type for IBs. |
| - ac: Add rudimentary implementation of printing SDMA IBs. |
| - radv: Rename SDMA file to radv_sdma.c |
| - radv: Use const device argument in radv_sdma_copy_buffer. |
| - radv: Use const on vi_alpha_is_on_msb arguments. |
| - radv: Only call si_cp_dma_wait_for_idle on GFX and ACE queues. |
| - radv: Move radv_cp_wait_mem to radv_cs.h and add queue family argument. |
| - radv: Refactor WRITE_DATA helper function. |
| - radv: Use new WRITE_DATA helper in more places. |
| - radv: Add queue family argument to some functions. |
| - radv: Wait for bottom of pipe in ACE gang wait postamble. |
| - radv: Simplify gang CS and semaphore initialization. |
| - radv: Allow gang submit use cases other than task shaders. |
| - radv: Slightly refactor gang semaphore functions. |
| - radv: Add gang follower semaphore functions. |
| - radv: Support SDMA in radv_cs_write_data_head. |
| - radv: Support SDMA in radv_cp_wait_mem. |
| - radv: Support SDMA in si_cs_emit_write_event_eop. |
| |
| Vignesh Raman (4): |
| |
| - ci: add Vignesh Raman into restricted traces access list |
| - Do explicit cast to suppress clang warnings |
| - ci: enforce -Wimplicit-const-int-float-conversion for clang |
| - ci: Uprev crosvm |
| |
| Vinson Lee (8): |
| |
| - nvk: Fix assert |
| - lavapipe: Fix struct initialization |
| - intel/decoder: Fix memory leak on error path |
| - nv50: Remove unused value |
| - vk/wsi/x11: Remove dead code |
| - freedreno/replay: Fix implicit-function-declaration error |
| - anv: Fix transfer type assert |
| - broadcom/qpu: Remove duplicate variable opcode |
| |
| Vitaliy Triang3l Kuzmin (3): |
| |
| - r600/asm: Fix AR force_add_cf setting if a clause is not open |
| - r600/asm: Make sure MOVA and SET_CF_IDX are in the same clause |
| - r600: Replace R600_BIG_ENDIAN with UTIL_ARCH_BIG_ENDIAN |
| |
| Vlad Schiller (15): |
| |
| - pvr: Implement VK_EXT_tooling_info |
| - pvr: Add 'info' PVR_DEBUG flag |
| - pvr: Implement VK_KHR_format_feature_flags2 |
| - pvr: Remove PVR_WINSYS_BO_FLAG_ZERO_ON_ALLOC flag |
| - pvr: Add VK_KHR_driver_properties |
| - pvr: Use correct index when writing query availability data |
| - pvr: Enable VK_EXT_scalar_block_layout |
| - pvr: Enable KHR_image_format_list |
| - pvr: Enable VK_KHR_uniform_buffer_standard_layout |
| - pvr: Implement VK_KHR_external_fence |
| - pvr: Implement VK_KHR_external_semaphore |
| - pvr: Enable VK_KHR_bind_memory2 extension |
| - pvr: Implement VK_EXT_texel_buffer_alignment |
| - pvr: Implement VK_EXT_host_query_reset |
| - pvr: Fix VK_EXT_texel_buffer_alignment |
| |
| WinLinux1028 (1): |
| |
| - radeonsi: prefix function with si\_ to prevent name collision |
| |
| Xaver Hugl (1): |
| |
| - vulkan wsi: add support for PresentOptionAsyncMayTear |
| |
| Yiwei Zhang (46): |
| |
| - venus: handle query feedback creation failure |
| - venus: ensure consistency of query overflow behavior |
| - venus: add a missing barrier before copying query feedback |
| - venus: refactor query feedback cmd record |
| - venus: reduce to use 4K mem suballoc align on platforms known to fit |
| - turnip: flush cache for dstBuffer in vkCmdCopyQueryPoolResults |
| - lvp: avoid reading immutable sampler from desc write info |
| - ci/venus: update venus-lavapipe expectations |
| - venus: fix a cmd builder render_pass state leak across reset |
| - venus: fix cmd state leak across implicit reset |
| - venus: log and doc the broken query feedback in suspended render pass |
| - venus: move transient storage from cmd to pool |
| - venus: remove redundant fb tracking from cmd builder |
| - venus: use tracked queue_family_index from the cmd pool |
| - venus: cleanup vn_cmd_begin_render_pass usage |
| - venus: add helpers to track subpass view mask |
| - venus: avoid redundant tracking of render pass |
| - venus: refactor more cmd states into cmd builder |
| - venus: use in_render_pass to skip present_src counting |
| - ci/venus: remove fixed tests that no longer run |
| - ci/venus: reenable pipeline cts |
| - venus: suppress a false logging |
| - venus: add no_sparse debug option to disable sparse resource support |
| - venus: set deviceMemoryReport feature |
| - venus: expose at least one cached memory type |
| - venus: expose KHR_external_fence/sempahore_fd extensions |
| - venus: fix a device memory report leak |
| - vulkan: remove a dup entry from vk_image_usage_to_ahb_usage |
| - vulkan/android: improve vkQueueSignalReleaseImageANDROID |
| - vulkan/android: add missing AHARDWAREBUFFER_USAGE_GPU_DATA_BUFFER usage |
| - vulkan/android: drop vk_buffer dependency from common AHB impl |
| - venus: use common vk_queue object |
| - venus: use common ANB implementation |
| - venus: use more common vk_queue related implementations |
| - venus: drop device, family, index, flags tracking from vn_queue |
| - venus: fix re-export of imported classic 3d resources |
| - venus: remove redundant bo roundtrip and add more docs |
| - venus: track VkPhysicalDeviceMemoryProperties instead |
| - venus: refactor vn_device_memory to prepare for async alloc |
| - venus: make device memory alloc async |
| - venus: enable Vulkan 1.3 for Android 13 and above |
| - zink: sync queue access for vkQueueWaitIdle |
| - venus: properly expose KHR_external_fence/sempahore_fd |
| - ci/venus: mark more flaky tests after recent cts uprev |
| - venus: fix query feedback batch leak and race upon submission |
| - zink: apply can_do_invalid_linear_modifier to Venus |
| |
| Yogesh Mohan Marimuthu (12): |
| |
| - gallium: remove start_slot parameter from pipe_context::set_vertex_buffers |
| - ac/surface: add astc block size to bpe_to_format() function |
| - util: move ASTCLutHolder from mesa/main to util |
| - vulkan/formats,zink: move vk_format_from_pipe_format() function |
| - vulkan/runtime: add compute astc decoder helper functions |
| - vulkan add 3D texture support for compute astc decoder |
| - radv: integrate meta astc compute decoder to radv |
| - radeonsi: add more documentation for dpbb debug env variable |
| - docs: remove document for unused variable dfsm from AMD_DEBUG |
| - radeonsi: correct old comment in si_emit_framebuffer_state() |
| - radeonsi: In gfx6_init_gfx_preamble_state() use gfx_level only from sctx |
| - radeonsi: add radeonsi to GL_RENDERER string |
| |
| Yonggang Luo (43): |
| |
| - lima: Convert to use nir_foreach_function_impl when possible |
| - freedreno: Switch to use nir_foreach_function_impl in tu_shader.cc |
| - zink: Convert to use nir_foreach_function_impl when possible |
| - lavapipe: Convert to use nir_foreach_function_impl |
| - lavapipe: fixes indent of function lvp_inline_uniforms |
| - microsoft/compiler: convert to use nir_foreach_function_with_impl in function emit_module |
| - microsoft/clc/compiler: Convert to use nir_foreach_function_impl when possible |
| - radeonsi: Convert to use nir_foreach_function_impl |
| - ac: Switch to use nir_foreach_function_impl in function analyze_shader_before_culling |
| - util: Move pipe_swizzle from p_defines.h to u_formats.h |
| - util: Move PIPE_MASK_* from p_defines.h to u_formats.h |
| - util: Move pipe_color_union from p_defines.h into u_formats.h |
| - util: Move u_pack_color.h and dbughelp.h into src/util from/src/gallium/auxiliary/util/ |
| - util: Remove include "pipe/\*.h" in src/util/* files |
| - util:Move only gallium used u_debug_refcnt.* and u_debug_describe.* into src/gallium/auxiliary/util/ |
| - util/meson: Getting mesa util core to be self contained |
| - pvr: decouple vulkan driver and compiler from gallium |
| - freedreno: decouple compiler and vulkan driver from gallium |
| - glx: decouple from gallium |
| - meson: Remove arm_neon_workaround |
| - nouveau/drm-shim: Decouple from gallium |
| - ac/radv: decouple radv vulkan driver and compiler from gallium |
| - etnaviv: decouple drm from gallium |
| - asahi: decouple layout from gallium |
| - compiler: Move WRITEMASK_* from prog_instruction.h into shader_enums.h |
| - intel/blorp: Use float directly to avoid #include "mesa/main/format_utils.h" |
| - intel/blorp: brw_sampler_prog_key_data::swizzles is only and should only accessed in crocus |
| - intel/brw: Define and use BRW_SWIZZLE_* instead of SWIZZLE_* |
| - crocus: #include "program/prog_instruction.h" for SWIZZLE_* |
| - intel/compiler,intel/blorp,intel/vulkan: decouple vulkan driver and compiler from gallium |
| - util/treewide: Use alignas(x) instead __attribute__((aligned(x))) |
| - v3dv: Use alignas(8) over 64 bit atomic value |
| - svga: use alignas over struct MKSGuestStatInfoEntry |
| - radv: Fixes mingw linkage error undefined reference to \`radv_GetCalibratedTimestampsEXT' |
| - v3d: Use DIV_ROUND_UP instead div_round_up |
| - freedreno: Use shared DIV_ROUND_UP instead div_round_up |
| - sfn: Use 4 instead of ATOMIC_COUNTER_SIZE |
| - intel/brw: use 4 instead of MAX_VERTEX_STREAMS to avoid #include "mesa/main/config.h" |
| - d3d12: replace use of MAX_VERTEX_STREAMS with PIPE_MAX_VERTEX_STREAMS |
| - compiler: use 4 instead ATOMIC_COUNTER_SIZE in glsl_types.h to avoid #include "mesa/main/config.h" |
| - compiler/glsl: Move glsl_print_type from glsl_types.* to ir_print_visitor.cpp |
| - util: Deduplicate macros between u_math.h and macros.h |
| - nvk: Should use alignment instead of align |
| |
| Yusuf Khan (4): |
| |
| - nouveau/ws: remove the drm.h header |
| - nvk: implement GetDeviceMemoryCommitment |
| - nvk: support GetImageSparseMemoryRequirements2 |
| - nvk: expose KHR_driver_properties |
| |
| Zhang Ning (1): |
| |
| - Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR" |
| |
| antonino (14): |
| |
| - virgl: add ci flake |
| - freedreno: add ci flake |
| - zink: remove unused indices from \`nir_load_push_constant` calls |
| - zink/nir: add a zink specific intrinsic for push constants |
| - vulkan/wsi: add \`vk_wsi_force_swapchain_to_current_extent` driconf |
| - drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "The Talos Principle" |
| - drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "Serious Sam Fusion" |
| - vulkan: Extend vkGet/SetPrivateDataEXT handling to all platforms |
| - vulkan: Extend vkGet/SetPrivateDataEXT handling to VkSurface |
| - vulkan: Handle vkSetDebugUtilsObjectNameEXT on WSI objects |
| - zink: store bindless var when creating it to avoid creating it again |
| - nir: fix several crashes in \`nir_lower_tex` |
| - nir: don't take the derivative of the array index in \`nir_lower_tex` |
| - vulkan: use instance allocator for \`object_name` in some objects |
| |
| cheyang (1): |
| |
| - isaspec : fix isaspec build error in aosp |
| |
| georgeouzou (1): |
| |
| - nvk: Support VK_EXT_line_rasterization |
| |
| jazzfool (1): |
| |
| - zink: Hash only first 32 bits of zink_gfx_pipeline_state with full DS3 |
| |
| lorn10 (1): |
| |
| - docs: Update Clover's env variable documentation |
| |
| norablackcat (2): |
| |
| - spirv/nir_to_spirv: add expect assume op codes |
| - rusticl: add cl_khr_expect_assume |
| |
| timmac-qmc (1): |
| |
| - glsl: fix potential crash with DisableUniformArrayResize |
| |
| twisted89 (1): |
| |
| - util/driconf: add workarounds for the Chronicles of Riddick |
| |
| wangra (1): |
| |
| - tu/kgsl: Fix bitfield of DITHER_MODE_MRT6 |
| |
| xurui (1): |
| |
| - glx: There is no need to psc++ |