| Mesa 21.0.0 Release Notes / 2021-03-11 |
| ====================================== |
| |
| Mesa 21.0.0 is a new development release. People who are concerned |
| with stability and reliability should stick with a previous release or |
| wait for Mesa 21.0.1. |
| |
| Mesa 21.0.0 implements the OpenGL 4.6 API, but the version reported by |
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / |
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. |
| Some drivers don't support all the features required in OpenGL 4.6. OpenGL |
| 4.6 is **only** available if requested at context creation. |
| Compatibility contexts may report a lower version depending on each driver. |
| |
| Mesa 21.0.0 implements the Vulkan 1.2 API, but the version reported by |
| the apiVersion property of the VkPhysicalDeviceProperties struct |
| depends on the particular driver being used. |
| |
| SHA256 checksum |
| --------------- |
| |
| :: |
| |
| e6204e98e6a8d77cf9dc5d34f99dd8e3ef7144f3601c808ca0dd26ba522e0d84 mesa-21.0.0.tar.xz |
| |
| |
| New features |
| ------------ |
| |
| - GL_EXT_demote_to_helper_invocation on radeonsi |
| |
| - GL_NV_compute_shader_derivatives on radeonsi |
| |
| - EGL_MESA_platform_xcb |
| |
| - Removed GL_NV_point_sprite for classic swrast. |
| |
| - driconf: remove glx_disable_oml_sync_control, glx_disable_sgi_video_sync, and glx_disable_ext_buffer_age |
| |
| - Removed support for loading DRI drivers older than Mesa 8.0, including all DRI1 support |
| |
| - Add support for VK_VALVE_mutable_descriptor_type on RADV |
| |
| - Removed classic OSMesa in favor of the newly improved gallium OSMesa |
| |
| - VK_KHR_fragment_shading_rate on RADV (RDNA2 only) |
| |
| - Freedreno a6xx exposes GL 3.3 |
| |
| - Classic swrast dri driver removed in favor of gallium swrast (llvmpipe or softpipe) |
| |
| - Panfrost g31/g52/g72 exposes ES 3.0 |
| |
| - Panfrost t760+ exposes GL 3.1 (including on Bifrost) |
| |
| - Sparse memory support on RADV |
| |
| - Rapid packed math (16bit-vectorization) on RADV |
| |
| - None |
| |
| |
| Bug fixes |
| --------- |
| |
| - R8 texture upload / corruption bug on Radeon RX 5700 XT |
| - Ambient Occlusion in Two Point Hospital shows black spot artifacts |
| - DXVK is broken in latest master |
| - mesa/st: Uniforms are not updated after lowering alpha test |
| - Regression: Segfault in cso_destroy_context() regression in 20.2 |
| - \[RADV\] Nioh 2 - The Complete Edition: "Bloom" on lights |
| - \[RADV][BISECTED\] The Surge 2 (644830) - In-game assets do not render correctly since 20.3.4. |
| - \[iris][icl,tgl][bisected][regression\] failure on piglit.spec.arb_separate_shader_objects.programuniform coverage |
| - "radeonsi: Check pitch and offset for validity." is a bad commit |
| - RADV: robustBufferAccessUpdateAfterBind is not exposed |
| - \[RADV/DXVK\] Shadow artifacts with different games |
| - glxgears segfaults with classic i915 |
| - ANV: Weird jitter in Witcher 1 |
| - ANV: Weird jitter in Witcher 1 |
| - ANV: Weird jitter in Witcher 1 |
| - meson: meson-built libraries have inconsistent compatability / current versions compared to older autotools-built libraries |
| - RADV: Extreme overhead in vkQueueSubmit |
| - timespec_get used unconditionally / build fails when targeting macOS 10.14 or earlier |
| - Graphical glitch of popupping missing texture on Mesa version \>18.0.5 (Padoka Stable + Unstable/Oibaf/ubuntu-x-swat PPAs) |
| - occasional corruption issue with RADV in multiple games, disappears after using amdvlk |
| - device select layer breaks other layers |
| - OpenGL on GMA4500MHD |
| - Rage 2: Visual corruption on in-game menu with ACO. |
| - GLonD3D12: Crashes and suboptimal fallback |
| - GLonD3D12: Crashes and suboptimal fallback |
| - GLonD3D12: Crashes and suboptimal fallback |
| - \[RADV][REGRESSION][BISECTED\] radv_GetMemoryFdPropertiesKHR returns no valid memory types for vaapi drmbuf |
| - anv: vkQueueSubmit with waitSemaphore value of 0 hangs CPU |
| - ttn: invalid base/range triggering nir_validate assertion |
| - \[RADV][ACO\] Overwatch game crash: amd/compiler/aco_insert_exec_mask.cpp: Failed Assertion |
| - Use out encoding for float immediates |
| - \[RADV\] Severe performance drop when exceeding VRAM compared to AMDVLK |
| - LIBGL_ALWAYS_SOFTWARE=1 picks zink over actual software rasterizers |
| - RADV: Occlusion query hangs Big Navi GPU |
| - "mesa: don't allocate matrices with malloc" cause eglCreateContext problem on android 7. |
| - Metal Gear Solid V: The Phantom Pain: texture issues and vertex stretches |
| - miscompiled compute shader loop on llvmpipe (and Iris) |
| - Graphics glitches after upgrade to mesa 20.3 on Khadas VIM3 Pro (Mali G52 GPU) |
| - glthread crash in \_mesa_glthread_upload |
| - Iris driver causing graphics glitch in QEMU spice egl DMA-BUF |
| - \[RADV/ACO\] Death Stranding cause a GPU hung (\*ERROR\* Waiting for fences timed out!) |
| - \[TGL\] Elder Scrolls Online misrenders |
| - \[ANV\] System hang with GRVK demos |
| - Rendering artifacts in Barn Finders specifically on Radeon Vega |
| - regression in !8152 |
| - \[bdw][icl][iris\] fails new test \`clearbuffer-depth-cs-probe\` |
| - ci: new traces runner needs dashboard links in the job log and junit |
| - zink: car model corruption with game TORCS |
| - Windows: 32-bit build is broken hard |
| - ANV: Not handling separate stencil layouts properly |
| - \[Regression][Intel][OpenGL][Bisected\] Copying whole 2D array texture failed on latest driver |
| - i915 regressions bisected to "vbo/dlist: use a shared index buffer" |
| - radv: dEQP-VK.sparse_resources.\* failures on GFX9 |
| - radv: dEQP-VK.sparse_resources.\* failures on GFX9 |
| - Mesa 20.3.x crashes pidgin on AMD RX480 |
| - libunwind not located / used on macOS |
| - Some games using FNA framework show blank screen |
| - Intel Vulkan regression of angle_end2end_tests |
| - Defer lavapipe warning to queue / command / swapchain buffer creation |
| - aco_tests failure with clang build |
| - BUG: After issues playing World of Warcraft with RADV |
| - Texture views on blits ignore formats |
| - mesa-git hangs weston |
| - radv: Some MSAA tests fail when DCC is forced. |
| - \[RADV/ACO/SIENNA_CICHLID\] Into the game Shadow of the Tomb Raider the flickering artifacts are present on brushes. |
| - Memory leak - alloc_prim_store in vbo_save_NewList |
| - radv/aco: "Failed to allocate registers" in AC:Valhalla |
| - Enable "radeonsi_clamp_div_by_zero" to fix graphical bug in CSGO, "mesa_glthread" for performance |
| - master fails to build with "ac_sqtt.h:139:15: error: expected parameter declarator" |
| - Conditional rendering implementation conflicts with aux-state tracking |
| - regression since !7720 |
| - regression after !8196 |
| - Use up to 4 images for IMMEDIATE flip |
| - piglit gl-1.0-rendermode-feedback TGSI_FILE_NULL assert on Iris |
| - Use LDC and constant buffer state for UBO loads. |
| - DOOM crashes on startup with OpenGL on RX 6800 |
| - Regression with Minecraft/Optifine performance with all VRAM mapped |
| - Space Engineers rendering regression after 5f79e4e6 which triggers incorrect optimizations from 053be9f0 |
| - star conflict crashes on iris, but loads fine on i965, on HD 5500 |
| - radv: blit/copy tests with A2B10G10R10 SNORM fail when DCC is forced on GFX9 |
| - freedreno: regression of gl-3.2-layered-rendering-gl-layer-render after e49748521ec9182e8d2eec823182cc463709123f |
| - \`gl_FragColor' undeclared (AMDGPU) - tested stable Mesa 20.1 and latest git for 20.3 (Game/Wine/Proton) |
| - Mafia III Demo: Artifacts around barrels |
| - android: webview crashes after a2fb87eea6d4 |
| - anv: dEQP-VK.subgroups.ballot_broadcast.compute.subgroupbroadcast_i8vec3_requiredsubgroupsize32 fail |
| - Mesa considers the framebuffer with mixed 3D and 2D array attachments to be incomplete. |
| - Multiple buffer definitions bound to single OpDecorate::Binding break SPIR-V module. |
| - Intel driver segfaults on SPIR-V with OpArrayLength |
| - \[g33][bisected][regression\] multiple piglit failures |
| - \[v3d][bisected][regression\] Piglit failures on gl-1.0-rendermode-feedback and select |
| - Update Mesa CI CTS to latest version |
| - Rendering artifacts in Enter The Gungeon on Both RX 590 and Radeon 7 |
| - No way to turn off "Device" and "Swapchain format" in Vulkan overlay |
| - Frames count doesn't turn off in vulkan overlay with frame=0 |
| - \[bdw][iris][bisected][regression\] failing test on multiple test suites |
| - osmesa classic: build failure with Meson and MinGW-W64 |
| - Crash and slowness in FreeCAD |
| - ci: Missing needs: in radeonsi-stoney-\*? |
| - Triangles appear from the center of the field on PES2021 with Mesa 20.2.x |
| - \[gen9][iris][regression][bisected\] flaky piglit tests |
| - \[Intel][OpenGL\] Fail to get correct value when sampling from a texture in depth formats. |
| - MESA_VK_DEVICE_SELECT only parses 16-bit vendorID, but in Vulkan is uint32_t |
| - lp_test_format test fail on 32-bit mingw builds |
| - RADV: Strange clear behavior with multisample arrays |
| - Mesa 20.3.0 and older ATi/Radeon cards fails |
| - Android building error after commit f08d8c849e |
| - OSMesa SEGV in OSMesaGetDepthBuffer |
| - osmesa gallium state tracker: Leak of screens and buffers on exit/shared library unload |
| - Gallium OSMesa driver is far from being thread-safe |
| - OSMesa UAF in OSMesaDestroyContext |
| - OSMesaGetDepthBuffer flipped vertically |
| - radv,aco: CTS image robustness tests fail to compile |
| - 32-bit mesa failing to build inside a chroot due to f88347cd |
| - Storing pointer to temporary value inside the Iris driver. |
| - \[radeonsi\] DESPERADOS III poor performance when there's lots of animations going on |
| - ci: arm64_test build broken (likely by ci-templates bump) |
| - New build option to specify default value for shader disk cache size |
| - commit f86668f487b32c185388a39e2200c17c298b877a fatal error: util/macros.h: No such file or directory |
| - zink: ubo loading problems |
| - !7138 broke the D3D12 driver |
| - \[icl,tgl][iris][i965][regression][bisected\] piglit failures |
| - 15% perf drop in GfxBench Manhattan 3.1 performance |
| - \[Intel][OpenGL\] Fail to get correct stencil data from the stencil attachment with glReadPixels() |
| - shader-db valgrind error |
| - \[AMDGPU NAVI 5700xt\] Large parts of the Blender viewport does not render correctly if an object with hair is moved. |
| - \[aco\] problem compiling compute pipeline |
| - build failures after simple_mtx helgrind annotations |
| - teach helgrind about simple_mtx |
| - zink: regression after !7606 |
| - Chromium browser with VA-API video acceleration got corruption |
| - glcpp test 084-unbalanced-parentheses fails with bison 3.6.y |
| - \[Intel][OpenGL\] glDepthFunc(GL_EQUAL) doesn't work correctly on Intel Linux Mesa OpenGL drivers |
| - d3d12: GPU based validation issue on fbo-clear-formats piglit |
| - \[tgl,icl,gen9][bisected\] crucible/vulkancts failures on multiple platforms |
| - zink+radv: corruption on pre-game menu in quake3 |
| - Memory leak in minecraft (many dri/renderD128 regions in /proc/[id]/maps) |
| - freedreno: Use nir_opt_large_constants |
| - android: amd/common: building error after 0833dd7d1 |
| - panfrost massive glitches apitrace opengl 2.1 |
| - freedreno/nir: nir_validate failure after nir_lower_tex |
| - \[i965,iris][bisected\] piglit and glcts failures on multiple platforms |
| - \[i965,iris][bisected\] piglit and glcts failures on multiple platforms |
| - db410c ethernet no longer working |
| - Add KHR_display extension to v3dv |
| - \[radeonsi\] After 549ae5f84375dfadb86cfd465f0103acfae3249f commit Firefox Nightly Asan begins crashes |
| |
| |
| Changes |
| ------- |
| |
| Adam Jackson (36): |
| |
| - docs: Update Mesa GL enum allocations for EGL_MESA_platform_xcb |
| - glx, egl: Add LIBGL_DRI2_DISABLE environment variable |
| - glx: Eliminate some stub functions for !GLX_DIRECT_RENDERING |
| - glx: Remove unused \__GLXDRIscreen::createContext |
| - glx: Check share ctx compatibility in ::create_context_attribs |
| - glx: Handle create_context in terms of create_context_attribs |
| - glx: Remove DRI1 |
| - glx: Simplify error handling in glXImportContextEXT |
| - glx: Fix the generated error when indirect contexts are not supported |
| - glx/indirect: Validate the context version in CreateContextAttribs |
| - glx: Claim to support more GL versions in \__glX_send_client_info |
| - meson: Make the glvnd vendor name configurable |
| - zink: factor out GET_PROC_ADDR and friends to zink_screen.h |
| - mesa: Remove silly "dummy_false" extension support |
| - zink: Fix indentation in zink_create_instance |
| - zink: Factor out winsys awareness from zink_internal_create_screen |
| - zink: Factor out zink_get_loader_version() |
| - zink: Factor out zink_create_logical_device |
| - zink: Simplify MoltenVK support a bit |
| - glx/xlib: Build fix |
| - swrast: Remove the classic swrast DRI driver |
| - treewide: Disambiguate various variables named "debug_options" |
| - mesa: Cosmetic cleanups to GL_EXT_texture_sRGB_R8 |
| - mesa: Implement GL_EXT_texture_sRGB_RG8 for softpipe and llvmpipe |
| - zink: Enable GL_EXT_texture_sRGB_R8 |
| - zink: Enable GL_EXT_texture_sRGB_RG8 |
| - virgl: Enable GL_EXT_texture_sRGB_RG8 |
| - drisw: Use debug_screen_wrap like everybody else |
| - tests: Fix memory leaks in DispatchSanity |
| - mesa: Fix array-format-to-format table on big-endian |
| - mesa: Don't make building tests conditional on building DRI drivers |
| - nouveau: pacify gcc on ILP32 |
| - zink: Fix VK_FORMAT_A8B8G8R8_SRGB_PACK32 mapping on big-endian |
| - ci: Add a few more drivers to the cross builds |
| - osmesa: Pacify MSVC in the test code |
| - zink: Fix a thinko in instance setup |
| |
| Alejandro Piñeiro (12): |
| |
| - nir/lower_tex: clarify nir_lower_tex_options indexing |
| - v3dv: cleanup/remove support for pre-generated variants |
| - broadcom/compiler: separate texture/sampler info from v3d_key |
| - v3dv: remove combined_idx support |
| - v3dv/pipeline: take into account precision for the output_type |
| - v3dv: use the common base object type and struct |
| - v3dv: implement VK_EXT_private_data |
| - turnip: minor tu_queue fixes related to vk_base_object |
| - v3dv/cmd_buffer: missing (uint8_t \*) casting when calling memcmp |
| - docs/features: update list of v3dv supported features |
| - v3dv: remove non-conformant warning |
| - v3dv/pipeline: avoid unused warning on release build |
| |
| Alexander Kanavin (1): |
| |
| - anv: fix a build race between generating a header and using it |
| |
| Alexander von Gluck IV (2): |
| |
| - meson: Add \_GNU_SOURCE for Haiku to activate non-posix functions |
| - glsl/builtin_functions: Rename int64 function to int64_avail |
| |
| Alistair Popple (2): |
| |
| - gv100/ir: Make emitATOM consistent with emitRED |
| - gv100/ir: Use system wide atomics |
| |
| Alyssa Rosenzweig (170): |
| |
| - pan/bi: Model writemasks correctly |
| - panfrost: Implement linear Z/S for SFBD |
| - panfrost: Remove panfrost_can_linear |
| - panfrost: Fix out-of-bounds read on SFBD |
| - panfrost: Add PAN_GPU_ID debug option |
| - panfrost: Enable indirect uniform indexing |
| - pan/mdg: Fix shader-db counter |
| - pan/bi: Implement sampler1D |
| - pan/bi: Fix varying writemask handling |
| - pan/bi: Fix off-by-one in RA |
| - pan/bi: Ensure TEXC src0 is not marked SSA |
| - pan/bi: Implement shader-db stats |
| - panfrost: Account for sample count in tib offsets |
| - panfrost: Fix RAW8/16/32 component replication |
| - docs: Add a stub page for Panfrost |
| - docs/panfrost: Fix comment about Lima |
| - docs: Update Panfrost in the source tree |
| - docs/systems: Update Panfrost link |
| - docs/panfrost: Document building Panfrost |
| - docs/panfrost: Mention the IRC channel |
| - pan/bi: Allow toggling disassembly verbosity |
| - pan/bi: Space out disassembly |
| - pan/bi: Remove all-0's termination condition |
| - pan/bi: Minor styling cleanup in disasm |
| - panfrost: Fix LOD mode field on Bifrost |
| - pan/bi: Drop on-board packing tests |
| - pan/bi: Label shader-db shaders |
| - pan/bi: Remove bi_is_live_after |
| - pan/bi: Add unused instruction mechanism |
| - pan/bi: Add pseudo-instruction mechanism |
| - pan/bi: Mark some instructions as unused |
| - pan/bi: Defer newline printing in disassembler |
| - pan/bi: Use consistent negX/absX naming |
| - pan/bi: Use consistent wls naming |
| - pan/bi: Use consistent naming of lane/lane0 |
| - pan/bi: Don't treat extend as per-source |
| - pan/bi: Use canonical names for clamps |
| - pan/bi: Use canonical names for rounding modes |
| - pan/bi: Use canonical varying names |
| - pan/bi: Use canonical sample names |
| - pan/bi: Use canonical update modes |
| - pan/bi: Use canonical min/max semantics |
| - pan/bi: Use canonical name for segments |
| - pan/bi: Use canonical lane ops |
| - pan/bi: Use canonical subgroup size |
| - pan/bi: Use canonical inactive result |
| - pan/bi: Use consistent neg naming |
| - pan/bi: Mark message types in ISA.xml |
| - pan/bi: Fix rounding name for HADD in XML |
| - pan/bi: Add staging register counts to ISA.xml |
| - pan/bi: Add pseudo register formats to XML |
| - pan/bi: Rename isa_parse to bifrost_isa |
| - pan/bi: Add explicit meson dependency on the ISA helpers |
| - pan/bi: Move copyright notice to common code |
| - pan/bi: Add helpers for manipulating the ISA |
| - pan/bi: Remove reference to 64-bit RA |
| - pan/bi: Move modifier prints out of common code |
| - pan/bi: Generate bi_opcodes.h |
| - pan/bi: Use autogenerated modifiers |
| - pan/bi: Generate bi_opcodes.c |
| - pan/bi: Merge BIR_INDEX_FAU and BIR_INDEX_BLEND |
| - pan/bi: Remove BIR_INDEX_UNIFORM |
| - pan/bi: Make BIR_INDEX_ZERO less special |
| - pan/bi: Add bi_swizzle enum |
| - pan/bi: Add bi_index data structure |
| - pan/bi: Add bi_index constructors |
| - pan/bi: Add nullity/equality helpers for bi_index |
| - pan/bi: Add helper to extract a word from an index |
| - pan/bi: Add bi_temp{_reg} for new-style bi_index |
| - pan/bi: Add helpers to generate bi_index from NIR |
| - pan/bi: Add a helper to convert to old-style nodes |
| - pan/bi: Add node_to_index helper |
| - pan/bi: Add bi_half and bi_byte selectors |
| - pan/bi: Add imm_f32 helper |
| - pan/bi: Add bi_imm_u{8, 16} helpers |
| - pan/bi: Add bi_{abs, neg} helpers |
| - pan/bi: Add new bi_instr data structure |
| - pan/bi: Add cursor data structures |
| - pan/bi: Add builder data structure |
| - ci/panfrost: Skip test with 4096 byte shader |
| - pan/bi: Ensure fneg of a constant isn't reached |
| - pan/bi: Rename bi_pack_{fma, add} to free up symbols |
| - pan/bi: Rename bi_load |
| - pan/bi: Add bi_not alias of bi_neg |
| - pan/bi: Generate instruction printer |
| - pan/bi: Generate builder routines |
| - pan/bi: Generate instruction packer for new IR |
| - pan/bi: Add bi_count_staging_registers helper |
| - pan/bi: Add new style read/writemask helpers |
| - pan/bi: Add builder initialization helper |
| - pan/bi: Add bi_is_intr_immediate helper |
| - pan/bi: Add bi_make_vec_to helper |
| - pan/bi: Implement bi_emit_ld_tile via the builder |
| - pan/bi: Implement bi_load_sysval via the builder |
| - pan/bi: Implement bi_emit_load_const via the builder |
| - pan/bi: Implement load_blend_input via the builder |
| - pan/bi: Implement bi_reg_fmt_for_nir helper |
| - pan/bi: Implement load_vary via the builder |
| - pan/bi: Implement BLEND by builder |
| - pan/bi: Implement fragment_out by builder |
| - pan/bi: Implement store_vary with the builder |
| - pan/bi: Implement load_ubo with the builder |
| - pan/bi: Implement frag coord with the builder |
| - pan/bi: Implement load attribute with the builder |
| - pan/bi: Add intrinsic emits for builder |
| - pan/bi: Add bi_alu_src_index helper |
| - pan/bi: Add bi_nir_round helper |
| - pan/bi: Add bi_cmpf_nir helper |
| - pan/bi: Implement ALU with the builder |
| - pan/bi: Implement jumps with the builder |
| - pan/bi: Add TEXS emit with builder |
| - pan/bi: Add builder-using helpers for TEXC structs |
| - pan/bi: Emit TEXC with builder |
| - pan/bi: Fix TEXS/TEXC check prototype |
| - pan/bi: Add emit tex for builder |
| - pan/bi: Add instruction emit for builder |
| - pan/bi: Add bi_message_type_for_instr helper |
| - pan/bi: Schedule new instructions singletons |
| - pan/bi: Add bi_branch, bi_jump helpers |
| - pan/bi: Stub FAU lowering pass |
| - pan/bi: Switch to new IR |
| - pan/bi: Remove combine lowering |
| - pan/bi: Remove old IR packs |
| - pan/bi: Remove packing helpers |
| - pan/bi: Remove old IR prints |
| - pan/bi: Remove old IR spill code |
| - pan/bi: Remove old IR scheduling |
| - pan/bi: Remove NIR->old IR |
| - pan/bi: Remove old IR helpers |
| - pan/bi: Remove old IR opcode table |
| - pan/bi: Remove old IR instruction emit |
| - pan/bi: Use new instruction types |
| - pan/bi: Remove old IR |
| - pan/mdg: Fix bound setting in RA for sources |
| - panfrost: Import render condition check from fd |
| - panfrost: Respect the render condition |
| - docs: Document extensions exposing GL3.0 |
| - pan/bi: Fix TEXS register counts |
| - pan/bi: Workaround BLEND precolour with explicit moves |
| - pan/bi: Pull out bi_dontcare helper |
| - pan/bi: Fix ATEST with pure integers |
| - pan/bi: Don't suppress Inf/NaN |
| - pan/bi: Allow passing thorugh 8-bit scalars |
| - pan/bi: Implement scalar i2i8/u2u8 |
| - pan/bi: Use TEXC for indices \>= 8 |
| - pan/bi: Parametrize intrinsic immediate limits |
| - pan/bi: Assert immediate indices fit |
| - panfrost: Disable AFBC of 3D, 2D arrays |
| - panfrost: Advertise ES3.0 on Bifrost |
| - docs: Add release note for Bifrost GL3.1 |
| - docs/panfrost: Update GL/ES versions for v5+ |
| - docs/features: Mark GL3.1 as done on Panfrost |
| - docs/features: Fix missing close paranthesis |
| - pan/bi: Implement TEXS for cube maps |
| - panfrost: Handle explicit primitive restart |
| - panfrost: Add alpha reference to XML |
| - panfrost: Implement alpha testing natively |
| - pan/bi: Fix assertion |
| - pan/bi: Fix 64-bit SSBO addresses |
| - pan/bi: Fix RA of node 0 |
| - pan/bi: Fix printing of node 0 |
| - pan/bi: Fix M1/M2 decoding in disassembler |
| - pan/bi: Fix FLOG_TABLE modifier handling |
| - pan/bi: Fix empty shader handling |
| - panfrost: Add panfrost_sample_pattern helper |
| - panfrost: Set tiler descriptor sampler pattern |
| - pan/bi: Use explicit move even for RT#0 of MRT |
| - panfrost: Raise TEXTURE_BUFFER_OFFSET_ALIGNMENT |
| - panfrost: Don't advertise OES_copy_image |
| - panfrost/lcra: Fix constraint counting |
| |
| Andres Gomez (23): |
| |
| - ci: update some radv trace checksums |
| - ci: update some radv trace checksums |
| - .mailmap: add and update aliases for Danylo Piliaiev |
| - ci: Bump deqp to current vulkan-cts-1.2.5.0 also in the Lava jobs |
| - ci: specify source and build directories with CMake |
| - ci: use ephemeral packages when building the build-base image |
| - ci: install ci-fairy in the testing images |
| - ci: spread the usage of the FDO_UPSTREAM_REPO variable |
| - ci: update piglit's version so it features replayer |
| - ci: build piglit in the Vulkan testing image |
| - ci: specify MinIO's host URL in a global variable |
| - ci: add piglit replay jobs and remove tracie ones |
| - ci: only modify LD_LIBRARY_PATH when running the piglit cmd |
| - ci: add Vulkan piglit traces jobs and remove tracie ones |
| - ci: move general build commands to their own section |
| - ci: move API specification to driver instead of test suite |
| - ci: build piglit inside baremetal and LAVA's rootfs |
| - ci: add piglit jobs to LAVA and remove tracie ones |
| - ci: refactor arm64 jobs in preparation for piglit addition |
| - ci: add piglit job to baremetal and remove tracie ones |
| - ci: remove all tracie remains |
| - ci: recover tracie dashboard URLs for failing traces |
| - ci: correct the trace image URLs in the piglit summary |
| |
| Andrii Simiklit (6): |
| |
| - glsl: avoid an out-of-bound access while setting up a location for variable |
| - iris: update depth value for stages after fast clear depth |
| - glx: lets compare drawing command sizes using MIN3 |
| - glx: fix spelling issues |
| - st/mesa: don't affect original st_CompressedTexSubImage parameters |
| - st/mesa: fix pbo upload/download for arrays of textures with only 1 layer |
| |
| Anuj Phogat (2): |
| |
| - intel/anv: Fix condition to set MipModeFilter for YUV surface |
| - intel/anv: Fix condition for planar yuv surface |
| |
| Bas Nieuwenhuizen (57): |
| |
| - radv: Do the sample check for tiling earlier. |
| - amd/addrlib: Use signed char for INT_8. |
| - radeonsi: Add displayable DCC flushing without explicit flushes. |
| - drm-uapi: Add AMD modifiers. |
| - amd/common: Add support for modifiers. |
| - amd/common: Add modifier tests. |
| - radeonsi: Check pitch and offset for validity. |
| - radeonsi: Add modifier support. |
| - radeonsi: Do not disable DCC when we have it as a modifier. |
| - radeonsi: Do not try to disable displayable DCC with modifiers. |
| - radeonsi: Add auxiliary plane support. |
| - drm/uapi: Fix modifier field mask for AMD modifiers. |
| - radv: Use internal drm_fourcc.h |
| - gallium/vl: Set modifier field for winsys handle. |
| - radv: Dump BO VA ranges on hang. |
| - radv: Fix RB+ blending for VK_FORMAT_E5B9G9R9_UFLOAT_PACK32. |
| - radv: Fix a hang on CB change by adding flushes. |
| - radv: Deal with unused attachments in mip flush |
| - radv: Don't invalidate the SCACHE for image barriers. |
| - radv: Don't skip layout transitions that only differ in render loop. |
| - radv: Never allow fast clears on DCC images that are not compressed. |
| - radv: Add option to disable DCC in renderpasses without layout. |
| - radv: Disable DCC explicitly for incompatible copies. |
| - radv: Enable DCC in the GENERAL layout on GFX10+. |
| - radv: Use VRAM for upload buffers if entire VRAM is CPU-visible. |
| - radv: Put commandbuffers in VRAM if all VRAM is CPU visible. |
| - radv: Use VRAM for the initial gfx cmdbuffer. |
| - ac/surf: Prepare for 64-bit flags. |
| - ac/surf: Implement PRT layout. |
| - ac/surf: Add sparse texture info to radeon_surf. |
| - ac/surf: Use correct tilemodes on GFX8 for PRT. |
| - radv/winsys: Fix inequality for sparse buffer remapping. |
| - radv/winsys: Fix offset in range merging. |
| - radv: Create sparse images. |
| - radv: Add image sparse memory update implementation. |
| - radv: Add sparse image queries. |
| - radv: Enable sparse buffer and image support. |
| - radv: Add Android module info to linker script. |
| - radeonsi: Only set modifier creation function for GFX9+ & with kernel support. |
| - radv: Remove redundant WB_L2 flush. |
| - radv: Invalidate CB on SHADER_WRITE for meta operations. |
| - radv: Do dst invalidations for write accesses. |
| - radv: Use access helpers for flushing with meta operations. |
| - radv: Use L2 for CP DMA on GFX9+. |
| - radv: Use L2 coherency on GFX9+. |
| - ac/surface: Fix GFX9 sparse mip info. |
| - radv: Do not use a pipe offset for aliased sparse images. |
| - radv: Use stricter HW resolve swizzle compat check. |
| - radv: Do not hash vk_object_base in descriptor set layout. |
| - radv: Improve spilling on discrete GPUs. |
| - radv: Fix vram override with fully visible VRAM. |
| - radv: Ignore WC flags for VRAM. |
| - radv: Do pipe misalignment check per plane. |
| - vulkan/device_select: Stop using device properties 2. |
| - radv: Don't use dedicated memory info to indicate sharing. |
| - radv: Expose robustBufferAccessUpdateAfterBind correctly. |
| - frontends/va: Use correct size for secondary planes. |
| |
| BillKristiansen (1): |
| |
| - microsoft: add resource state manager utility code |
| |
| Boris Brezillon (119): |
| |
| - panfrost: Fix Bifrost blend descriptor emission |
| - panfrost: Fix ->reads_frag_coord assignment |
| - pan/bi: Extract shadowmap comparator |
| - pan/bi: Force BLEND src0 to r0 |
| - panfrost: Fix panfrost_format_to_bifrost_blend() |
| - panfrost: Get rid of the Pixel Format descriptor |
| - pan/bi: Store the architecture in the compiler context |
| - pan/bi: Expose FAU slots |
| - pan/bi: Rename CLPER into CLPER_V7 and add CLPER_V6 |
| - pan/bi: Add support for the CLPER instructions |
| - pan/bi: Add support for derivative instructions |
| - pan/bi: Allow vec16 in bi_print_swizzle() |
| - pan/bi: Allow lane selections on component 4 and above |
| - pan/bi: Add support for tex offsets |
| - pan/bi: Don't use TEXS for tex operations with a src that's not lod or coord |
| - pan/bi: Support txs operations |
| - pan/bi: Support automatic register format |
| - pan/bi: Let the GPU pick the right format based on the varying descriptor |
| - pan/bi: Set roundmode to RTZ for f2u operations |
| - pan/bi: Move LD_VAR packing out of bi_pack_add() |
| - pan/bi: Pass LD_VAR update mode explicitly |
| - pan/bi: Stop passing special varying names through src0 |
| - pan/bi: Fix LD_VAR with non-constant index |
| - pan/bi: Add a varying_index field to bi_texture |
| - pan/bi: Stop extracting the immediate attribute index from src0 |
| - panfrost: Don't expose fp16 support on Bifrost unless explicitly requested |
| - nir: Fix nextafter() for hardware that don't support denorms |
| - compiler/spirv: Handle the LocalSizeHint execution modes |
| - nir: Make nir_build_deref_offset() support ptr_as_array |
| - pan/bi: Emit a combine even if we only pass one staging reg to TEXC |
| - nir: Fix LOD source type for txf_ms instructions |
| - panfrost: Stop forcing depth to nr_samples |
| - panfrost: Get rid of the Sample Count enum |
| - panfrost: Fix decoding of texture payloads |
| - panfrost: Set depth for 3D textures on Bifrost |
| - panfrost: Set sample_count when packing bifrost texture descriptors |
| - pan/bi: Only update LOD mode on TEX operations |
| - pan/bi: Always emit a LOD/CUBE word for FETCH instructions |
| - pan/bi: LOD is a 8.8 fixed point |
| - panfrost: Increase blit shader BO size on Bifrost |
| - panfrost: Add a minus(1) modifier to the Levels field |
| - panfrost: Clarify bit 2:28 meaning in the Midgard texture descriptor |
| - panfrost: Add two helpers to calculate the surface pointer and strides |
| - panfrost: Set the layer stride |
| - panfrost: Unconditionally align strides on 64 bytes for linear resources |
| - panfrost: Enable MSAA on bifrost when deqp debug option is set |
| - panfrost: Expose panfrost_block_dim() |
| - panfrost: Fix panfrost_needs_explicit_stride() for block-based formats |
| - panfrost: Calculate the row stride at resource creation time |
| - panfrost: Fix stride calculation for Z32_S8X24/X32_S8X24 formats |
| - panfrost: Update the resource layout when doing a tile -\> linear conversion |
| - panfrost: Update the resource layout before calling util_copy_rect() |
| - panfrost: Fix texture payload decoding |
| - panfrost: Fix draw descriptor definition |
| - panfrost: Only set varyings and varying_buffers when varying_count \\> 0 |
| - panfrost: Make sure we always add a reader -\> write dependency when needed |
| - panfrost: Fix fencing |
| - pan/mdg: Add support for multi sample iteration writeout |
| - panfrost: Take the number of samples into account in blend shaders |
| - panfrost: Preload SampleID when reloading multisample FBs |
| - panfrost: Fix provoking vertex selection for lines |
| - pan/mdg: Fix texture handling for 2DMS arrays |
| - panfrost: Allow 2DMS arrays |
| - panfost: Fix depth/stencil writeback on Bifrost v7 |
| - panfrost: Force ->s_writeback_base to ->zs_writeback_base for Z24S8 buffers |
| - panfrost: Reload depth/stencil when they are read |
| - gallium/util: Fix depth/stencil blit shaders |
| - panfrost: Fix several depth/stencil format mappings |
| - pan/bi: Fix ATEST emission |
| - panfrost: Move checksum_bo to panfrost_resource |
| - panfrost: Group CRC fields in a struct |
| - panfrost: Pass a device object to panfrost_new_texture() |
| - panfrost: Merge emit_texture_payload() and emit_texture_payload_v7() |
| - panfrost: Pass a dev object to panfrost_needs_explicit_stride() |
| - panfrost: Define AFBC surface flags |
| - panfrost: Adjust the compression tag creation for Bifrost |
| - panfrost: Merge panfrost_new_texture() and panfrost_new_texture_bifrost() |
| - panfrost: s/panfrost_slice.size0/panfrost_slice.surface_stride/ |
| - panfrost: Use PAN_V6_SWIZZLE() in pan_blit.c |
| - panfrost: Stop mixing depth and number of samples |
| - panfrost: Add a pan_image_layout object |
| - panfrost: Move AFBC header_size to a sub-struct |
| - panfrost: Fix AFBC header_size and slice size calculation |
| - panfrost: Add AFBC slice.body_size and slice.{row,surface}_stride fields |
| - panfrost: Adjust surface stride calculation to take AFBC into account |
| - panfrost: Add R5G6B5_UNORM entries to the format tables |
| - panfrost: Pass a pipe-like swizzle to panfrost_new_texture() |
| - panfrost: Adjust the format for AFBC textures on Bifrost v7 |
| - panfrost: Fix ZS block format v7 definition |
| - panfrost: Use proper format for Z16_UNORM |
| - panfrost: Fix AFBC support on Bifrost |
| - panfrost: Enable AFBC support on Bifrost |
| - panfrost: Use panfrost_get_layer_stride() instead of open-coding it |
| - panfrost: Initialize AFBC headers to zero |
| - panfrost: Fix panfrost_should_linear_convert() |
| - panfrost: Allow AFBC on 2D arrays |
| - panfrost: Fix calculation of body/header pointers for 3D AFBC |
| - panfrost: Allow 3D AFBC on Bifrost v7 |
| - panfrost: Fix AFBC on Bifrost v6 |
| - panfrost: Fix UBO count calculation on Bifrost |
| - pan/bi: Fix constant slot selection |
| - panfrost: Set the RT index when emitting a Bifrost blend descriptor |
| - pan/bi: Pass bundle pointers to bi_pack_tuple() |
| - pan/bi: Port bi_collect_blend_ret_addr() to the new compiler infra |
| - pan/bi: Restrict registers to r0-r15 when compiling blend shaders |
| - pan/bi: Use the interference mechanism to describe blend shader reg use |
| - pan/bi: Allow non-terminal BLEND operations |
| - pan/bi: Lower 8bit fragment outputs to 16bit |
| - panfrost: Promote 8b to 16b for blend descriptors |
| - panfrost: Test GLES3 on Bifrost |
| - panfrost: Get layer stride of level 0 on staging resources |
| - panfrost: Pass the resource dimension to panfrost_compression_tag() |
| - panfrost: Fix estimate_texture_payload_size() on Bifrost |
| - panfrost: Re-enable AFBC on 3D, 2D arrays |
| - panfrost: Skip an XFB test that's passing/failing randomly |
| - panfrost: Fix panfrost_afbc_format_needs_fixup() |
| - pan/bi: Fix the !immediate case in bi_emit_store_vary() |
| - panfrost: Fix tiler job injection (again) |
| - panfrost: Fix a polygon list corruption in the multi-context case |
| |
| Boyuan Zhang (2): |
| |
| - radeon: fix license in header |
| - radeon/vcn: use cdw to calculate slice header index |
| |
| Brendan Dougherty (1): |
| |
| - mesa: Fix vertex_format_to_pipe_format index. |
| |
| Caio Marcelo de Oliveira Filho (13): |
| |
| - intel/fs: Add assert on the brw_STAGE_prog_data downcasts |
| - intel/disasm: Don't rely on FALLTHROUGHTs to print unsupported SFID |
| - anv: Avoid a couple of warnings related to vk_error macros |
| - spirv: Implement OpArrayLength for OpenGL |
| - nir: Fix outdated name in comment |
| - nir: Remove unused parameter in remove_dead_var_writes |
| - nir: Consider pointer initializers in nir_remove_dead_variables |
| - spirv: Remove more dead variables |
| - spirv2nir: Add --opengl (-g) argument for OpenGL SPIR-V |
| - spirv: Don't remove variables used by resource indexing intrinsics |
| - nir: Add a data pointer to the callback in nir_remove_dead_variables |
| - compiler: Use util/bitset.h for system_values_read |
| - spirv: Allow variable pointers pointing to an array of blocks |
| |
| Chad Versace (24): |
| |
| - anv/image: Check DISJOINT in vkGetPhysicalDeviceImageFormatProperties2 (v2) |
| - anv/image: Fix isl_surf_usage_flags for stencil images |
| - isl: Define isl_drm_modifier_get_score() \[v3\] |
| - anv/image: Use isl_drm_modifier_get_score() |
| - isl: Add isl_format_layout::uniform_channel_type |
| - anv/image: Teach anv_get_image_format_features() about modifiers (v3) |
| - anv/image: Fill drmFormatModifierTilingFeatures (v2) |
| - isl: Make public the list of modifiers |
| - anv/image: Refactor iteration over modifiers |
| - anv/image: Delete the list of modifier-compatible formats |
| - anv/image: Fix VkExternalMemoryProperties for images (v5) |
| - anv/image: Rename get_wsi_format_modifier_properties_list() |
| - anv/image: Minor refactor of VkImageFormatProperties::sampleCounts |
| - anv/image: Fail earlier in anv_get_image_format_properties |
| - anv/image: Respect VkImageFormatListCreateInfo for VkImageFormatProperties (v2) |
| - anv/image: Drop redundant rejection of YCbCr formats with modifiers |
| - anv/image: Emit error message for non-2D DRM images |
| - anv/image: Move some DRM code in anv_get_image_format_properties() |
| - anv/image: Add more asserts to choose_isl_tiling_flags |
| - anv/image: Define add_all_surfaces() |
| - anv/image: Further split add_*_surface funcs (v2) |
| - anv/image: Rewrite check_surfaces() \[v2\] |
| - anv/image: Check surface offsets after adding each surface |
| - anv/image: Define anv_image_get_aux_addr (v3) |
| |
| Chia-I Wu (1): |
| |
| - virgl: fix modifier truncation |
| |
| Christian Gmeiner (37): |
| |
| - ci: sort packages installed via apt-get |
| - etnaviv: nir: do not run opt loop after nir_lower_bool_xxx(..) |
| - etnaviv: drop nir_print_shader(..) call |
| - etnaviv/drm: fix evil-twin etna_drm_table_lock |
| - etnaviv/drm: convert to simple_mtx |
| - etnaviv/drm: add some locking asserts |
| - etnaviv: update fallthrough comments |
| - nir: change return type to void |
| - etnaviv: rename from immedaite to uniform in some places |
| - etnaviv: remove imm\_ prefix from etna_shader_uniform_info members |
| - ci: build ARM mesa with X11 OpenGL support |
| - ci: build mesa with gbm |
| - ci/bare-metal: build full piglit for baremetal ARM targets. |
| - ci/fastboot: exclude either deqp or piglit |
| - ci/bare-metal: pass thorugh PIGLIT env vars |
| - mesa/prog_to_nir: use intrinsic builders |
| - tgsi_to_nir: use intrinsic builders |
| - nir: use intrinsic builders |
| - v3d: use intrinsic builders |
| - v3dv: use intrinsic builders |
| - ir3: use intrinsic builders |
| - st: use intrinsic builders |
| - zink: use intrinsic builders |
| - tu: use intrinsic builders |
| - d3d12: use intrinsic builders |
| - iris: use intrinsic builders |
| - vc4: use intrinsic builders |
| - intel/blorp: use intrinsic builders |
| - intel/compiler: use intrinsic builders |
| - anv: use intrinsic builders |
| - microsoft/compiler: use intrinsic builders |
| - pan: use intrinsic builders |
| - etnaviv: add set_stream_output_targets(..) stub |
| - v3d: drop not use function parameter |
| - v3d: update fallthrough comments |
| - v3d: mark some variables static const |
| - etnaviv: handle NULL views in set_sampler_views |
| |
| Connor Abbott (17): |
| |
| - freedreno/ci: Strip location from asserts |
| - freedreno/a6xx: Document private memory registers |
| - ir3: Expand cat6 a6xx opcode field |
| - ir3: Add more a6xx-specific cat6 opcodes |
| - ir3: Support assembling & disassembling getspid/getwid |
| - ir3: Fix STP/LDP assembly |
| - ir3/parser: Fix st{l,lw,g,p} and ld{l,lw,g,p} assembly |
| - ir3: Initial support for private memory |
| - ir3: Properly validate cat6 half-ness |
| - freedreno: Add per-device parameters for private memory |
| - tu: Support private memory |
| - freedreno/a6xx: Implement private memory |
| - ir3: Enable nir_lower_vars_to_scratch on a6xx |
| - ir3/ra: Fix array reg liveness in scalar pass |
| - ir3: Rename high registers to shared registers |
| - ir3: Better rules for shared src copy propagation |
| - ir3: Support MOVMSK |
| |
| Daniel Schürmann (53): |
| |
| - nir: add strength reduction pattern for imod/irem with pow2 divisor. |
| - nir: allow for cheap intrinsics in nir_opt_peephole_select() |
| - nir: add nir_phi_get_src_from_block() helper |
| - nir/opt_peephole_select: collapse nested IFs if applicable |
| - nir/opt_peephole_select: respect selection_control when collapsing ifs |
| - nir: don't sink instructions into loops |
| - nir/opt_sink: return early when trying to sink unused instructions |
| - aco/ra: use get_reg_specified() for p_extract_vector |
| - aco: don't create dead exec mask phis on merge blocks |
| - aco: fix DCE of rematerializable phi operands |
| - aco/spill: only prevent rematerializable vars from being DCE'd if they haven't been renamed |
| - aco/ra: fix phi operand renaming |
| - nir/opt_if: split ALU from Phi more aggressively |
| - aco: don't emit parallelcopy when switching to WQM. |
| - aco: make pred_by_exec_mask() accessible in other files |
| - aco: allow to schedule SALU/SMEM through exec changes |
| - aco: fix def-use distance calculation when scheduling. |
| - aco: schedule position exports in the same pass as memory operations |
| - aco: create VMEM clauses slightly more aggressive |
| - nir/opt_vectorize: use a single instruction per hash entry instead of a vector |
| - nir/opt_vectorize: don't hash instructions which are already vectorized |
| - nir/opt_vectorize: don't hash filtered instructions |
| - nir/opt_vectorize: rehash users of vectorized instructions |
| - nir/opt_vectorize: hash whether a swizzle accesses elements beyond the maximum vectorization factor |
| - nir/opt_vectorize: fix call to filter function |
| - nir,vc4: Lower fneg to fmul(x, -1.0) |
| - nir: replace .lower_sub with .has_fsub and .has_isub |
| - nir/divergence_analysis: mark load_push_constant as uniform |
| - radv: optimize idiv_const for small bitsizes |
| - radv: call nir_opt_algebraic_late() after lowering idiv for small bitsizes |
| - radv: don't lower_pack() after load-store-vectorization |
| - radv: enable .lower_ineg |
| - aco: simplify and fix operand/definition sizes |
| - aco/ra: fix infinite recursion in get_reg_simple() with subdword registers |
| - aco: fix VOP3P assembly, VN and validation |
| - aco/RA: fix subdword operands on VOP3P instructions |
| - aco: allow constants/literals on every src position for VOP3P |
| - aco: allow SGPRs on every src position for VOP3P |
| - aco: change usesModifiers() considering opsel_hi on packed instructions |
| - aco: create helpers to emit vop3p instructions |
| - aco: emit packed 16bit instructions |
| - radv: vectorize 16bit instructions |
| - aco: simplify multiply-add combining |
| - aco: optimize packed mul+add to v_pk_fma_f16 |
| - aco: optimize packed clamp |
| - aco: optimize packed fneg |
| - aco: optimize v_pk_fma_f16 -\> v_pk_fmac_f16 on GFX10 |
| - aco: propagate swizzles when optimizing packed clamp & fma |
| - aco: remove divergent branches which only jump over very few instructions |
| - aco/optimizer: don't propagate subdword temps of different size |
| - aco/optimizer: don't copy-prop logical phis |
| - aco: fix nir_intrinsic_ballot with wave32 |
| - aco: fix shared VGPR allocation on RDNA2 |
| |
| Daniel Stone (17): |
| |
| - microsoft/clc: Allow building with Clang git |
| - microsoft/clc: Disable broken f32 -\> i64/u64 test |
| - CI: Add Windows libclc and SPIRV-LLVM-Translator builds |
| - CI: Windows: Use 32 vCPUs for Mesa build |
| - CI: Remove ludicrous Windows container build timeout |
| - CI: Update Windows build for current Meson options |
| - CI: Build d3d12 Gallium driver and CLC framework on MSVC |
| - CI: Re-enable MSVC build |
| - freedreno: Add missing dependency to build |
| - CI: Collapse SCons & meson-misc stages into one |
| - CI: Collapse llvmpipe & softpipe stages into one |
| - CI: Collapse radv & radeonsi stages into one |
| - CI: Collapse virgl & d3d12 stages into one |
| - CI: Collapse lima & panfrost stages into one |
| - CI: Reorder non-hardware stages last |
| - CI: Add llvmpipe- prefix to Piglit jobs |
| - CI: Add Windows source dependency map |
| |
| Danylo Piliaiev (22): |
| |
| - freedreno/a6xx: add support for dual-source blending |
| - freedreno/a6xx: Fix typo in height alignment calculation in a6xx layout |
| - freedreno/a6xx: add support for ARB_shader_stencil_export |
| - tu: Ignore pTessellationState if there is no tesselation shaders |
| - tu: pCounterBuffers can be NULL in vkCmd*TransformFeedbackEXT() |
| - freedreno/a6xx: Fix assert which checks the count of shader outputs |
| - ir3: Allow tesselation to use all 32 varying slots |
| - freedreno/a6xx: Fix SP_HS_UNKNOWN_A831 value and document it |
| - freedreno/a6xx: bump varyings limit |
| - freedreno: Fix FD_MESA_DEBUG=flush debug option |
| - freedreno/ir3: remap FRAG_RESULT_COLOR to \_DATA\* for dual-src blending |
| - nir/lower_fragcolor: handle dual source blending |
| - freedreno/a6xx: fix array pitch for layer-first layouts |
| - freedreno/a6xx: add support for gl_Layer in vertex shader |
| - freedreno/a6xx: support layered framebuffers in blitter_clear |
| - nir: account for point-coord origin when lowering it |
| - nir: fix missing nir_lower_pntc_ytransform.c in the makefile |
| - freedreno/a6xx: fix transform feedback resuming |
| - freedreno/a5xx: implement transform feedback resuming |
| - freedreno: Enable GLSL 3.30, updating us to GL 3.3 contexts |
| - turnip: remove unused IR3_DP_LOCAL_GROUP_SIZE_* from cs params |
| - turnip: implement indirect dispatch |
| |
| Dave Airlie (69): |
| |
| - util: add a env getter for versions |
| - clover/device: store version in device at constructor. |
| - clover: add CL 3.0 CL_DEVICE_NUMERIC_VERSION support |
| - clover/platform: move versioning to core object. |
| - clover: add CL_PLATFORM_NUMERIC_VERSION support |
| - clover: report device CLC versions for 3.0 |
| - clover: add support for versioned device extensions |
| - clover: add platform supported extensions with version |
| - clover: add support for opencl C features |
| - gallium: handle empty cbuf slots in framebuffer samples helper |
| - u_blitter: port radv 3D blit coords logic. |
| - lavapipe: enable alpha to one. |
| - lavapipe: disable SNORM blending for now |
| - llvmpipe: just use draw_regions in draw/line setup. |
| - draw: fix tess eval pipeline statistics. |
| - gallivm: add float to 8/16 int |
| - gallivm/nir: add fsum support |
| - gallivm/nir: lower dot products. |
| - gallivm: lower vector compares |
| - gallivm: fix float atomic exchange. |
| - clover: handle memory object properties properly. |
| - clover: add support command queue properties |
| - clover: add all CL 3.0 API with invalid functions |
| - clover: add cl 3.0 SVM invalid support |
| - clover: add device/platform info for CL 3.0 |
| - clover: add 3.0 program properties |
| - clover: add CL 3.0 event/queue queries |
| - clover/image: handle MEM_KERNEL_READ_AND_WRITE flag. |
| - spirv/cl: add enqueued workgroup size. |
| - lavapipe: fixup device allocate + enable private data |
| - lavapipe: fix wsi acquire fences |
| - llvmpipe/setup: move point stats collection earlier. |
| - llvmpipe: fix multisample point rendering. |
| - llvmpipe: fix multisample lines. |
| - lavapipe: fixup mipmap precsion bits |
| - lavapipe: enable pipeline stats queries |
| - gallium: fix missing bit field in p_state.h |
| - zink: allow the backend to optimise shaders. |
| - lavapipe: enable VK_EXT_shader_stencil_export |
| - lavapipe: enable post depth coverage |
| - lavapipe: add support for VK_KHR_indirect_draw_count |
| - radeonsi: fix regression on gpus using the radeon winsys. |
| - lavapipe: use ralloc for pipeline copies. |
| - lavapipe: split out pipeline struct duplication to a macro. |
| - lavapipe: don't copy pNext |
| - CI: add lavapipe vulkan testing |
| - lavapipe: refactor descriptor set binding to support push later. |
| - lavapipe: add support for VK_KHR_push_descriptor |
| - lavapipe: add support for VK_KHR_descriptor_update_template |
| - zink: add some 64-bit conversion ALUs |
| - gallium: add an api to retrieve pipe offsets |
| - llvmpipe: add support for vulkan streamout offset hook |
| - llvmpipe: handle SO statistics multi value query copy. (v2) |
| - lavapipe: add transform feedback support |
| - gallium: add grid base to dispatch info |
| - llvmpipe: add support for grid base |
| - llvmpipe: enable lower device id to zero |
| - lavapipe: add basic vulkan device group support. |
| - util: add printf specifier shared helper code. |
| - clover/module: add a printf support to module (v5) |
| - clover/nir: hookup printf (v3) |
| - intel/isl: move get_tile dims/masks to common isl header |
| - device-select-layer: update for vulkan 1.2 |
| - lavapipe: fix missing piece of VK_KHR_get_physical_device_properties2 |
| - radv: move queue object to a common base object |
| - zink: don't pick a cpu device ever. |
| - glsl: fix leak in gl_nir_link_uniform_blocks |
| - glx: proposed fix for setSwapInterval |
| - lavapipe: fix pipeline vp/scissor mixup. |
| |
| David McFarland (1): |
| |
| - radv: fix divide by zero with no tesselation params |
| |
| David Stevens (6): |
| |
| - egl/android: don't pass loaderPriv in get_front_bo |
| - dri: add image cleanup callback to loader extensions |
| - frontend/dri: plumb loader image cleanup callback |
| - i965: plumb loader image cleanup callback |
| - egl/android: implement image cleanup callback |
| - egl/dri2: fix image loaderPrivate type mixup |
| |
| Duncan Hopkins (4): |
| |
| - zink: setup version dependent VkPhysicalDeviceVulkan*Features and VkPhysicalDeviceVulkan*Properties. |
| - mesa: Undefine ALIGN macro before it is used as a function name. Issues on MacOS. |
| - zink: moved vkEnumerateInstanceVersion to create_instance |
| - zink. Fixing vkGetPhysicalDeviceProperties2 and vkGetPhysicalDeviceFeatures2 for Vk 1.1 and VK_KHR_get_physical_device_properties2. |
| |
| Dylan Baker (70): |
| |
| - Bump version for 21.0 devel |
| - Reset new features for 21.0 development cycle |
| - meson: Don't add extra values to shader-cache |
| - meson: use a feature option for microsoft-clc |
| - docs: add release notes for 20.2.3 |
| - docs: Add relnotes for 20.2.3 |
| - docs: update calendar and link releases notes for 20.2.3 |
| - release-calender: Update 20.3 |
| - docs: add release notes for 20.3.0 |
| - docs: Add sha256 sums for 20.3.0 |
| - docs: update calendar and link releases notes for 20.3.0 |
| - docs: add release schedule for 20.3 |
| - docs: add release notes for 20.2.4 |
| - relnotes: Add sha256sums for 20.2.4 |
| - docs: update calendar and link releases notes for 20.2.4 |
| - docs: add release notes for 20.2.5 |
| - docs: add sha256 sums for 20.2.5 |
| - docs: update calendar and link releases notes for 20.2.5 |
| - docs: add release notes for 20.3.1 |
| - docs: Add sha256 sums for 20.3.1 |
| - docs: update calendar and link releases notes for 20.3.1 |
| - docs: add release notes for 20.2.6 |
| - docs: Add sha256 sums for 20.2.6 |
| - docs: update calendar and link releases notes for 20.2.6 |
| - docs: add release notes for 20.3.2 |
| - docs: Add sha256 sum for 20.3.2 |
| - docs: update calendar and link releases notes for 20.3.2 |
| - pick-ui: don't handle the mouse |
| - bin/remove get-pick-list.sh files |
| - docs: store the release-calendar information in csv (and fix tests) |
| - bin: Add script for manipulating the release calendar |
| - bin/gen_calendar_entries: Add support for extending a release |
| - bin/gen_calendar_entries: Add support for making a release |
| - docs: Add calendar entries for 21.0 release candidates. |
| - docs/release-calendar.rsv: Remove spaces |
| - VERSION: bump for 21.0.0-rc1 |
| - .pick_status.json: Update to dfe429eb414511170f3dfc960d247c4aa295f924 |
| - .pick_status.json: Update to 184bbef33d1fff3520958c130f2b8e4fce17379c |
| - .pick_status.json: Update to c27347b2e1883a30e023347a36bdcf86cdec4a7c |
| - .pick_status.json: Update to 3e13c1f8dfef4a4c0fd5e79bbc364f9e5f998856 |
| - VERSION: bump for 21.0.0-rc2 |
| - .pick_status.json: Update to af9977a3d5f3378c297965e21389e36491f47e1b |
| - .pick_status.json: Update to c3dbc4df194a15aa1cf09493a3100b59e37e48fe |
| - .pick_status.json: Update to 64f55b82c7f1652e4fae478c0af325fc38b9b53b |
| - .pick_status.json: Update to 3ef89b245e3e1ac4e67fea9c1b13ebeda75769d0 |
| - .pick_status.json: Update to d37124b065c2b6c99c042fb402c6a23ce16b034e |
| - .pick_status.json: Mark 8c7d9716669a74159d2eec86490c756c274f663c as backported |
| - .pick_status.json: Mark 45bebc7a9c73f3add08c2290fa1eac237edf5a34 as backported |
| - .pick_status.json: Update to 9052819ebbff07d82c3eb9adf414144df4868644 |
| - .pick_status.json: Update to f01ea0aef8a50d2732eb0c64153903e52ed2a757 |
| - VERSION: bump for 21.0.0-rc3 |
| - .pick_status.json: Update to 86ff78e8fe55b424c6b853ead6979bcd46820d81 |
| - .pick_status.json: Update to 9003735b9141fb156d3b2e1133b94cdf14f63424 |
| - .pick_status.json: Update to e8707961134daa9b91599840ad5698366a6229b7 |
| - .pick_status.json: Update to b609d4677d3f910c546c1d94d8ddfe4511e2f065 |
| - bump version for 21.0-rc4 |
| - .pick_status.json: Update to 8ed874d73fafcfbcb54730dc5c20e58f24d55f5e |
| - .pick_status.json: Update to 03d3294e35befc2be6ed0ed66ed92fab991c166d |
| - Revert "vulkan: Make vk_debug_report_callback derive from vk_object_base" |
| - VERSION: bump for 21.0.0-rc5 |
| - .pick_status.json: Update to 4ded99f99ddbd1103ffddfd9935638fc12e0ecfd |
| - .pick_status.json: Mark 38ce8d4d00c2b0e567b6dd36876cf171acb1dbc7 as backported |
| - .pick_status.json: Update to 9f8a0b797ed9b8ad9bf49af8269a337b1152a744 |
| - .pick_status.json: Update to 6ceb6b509e64c54812a5f6a208e7d93cc61119f4 |
| - .pick_status.json: Update to ea27f2bf092f462171fe14a44619565d14f43fb8 |
| - .pick_status.json: Update to c22267262ee1b6817df368a51168fa82bd17293c |
| - .pick_status.json: Mark 04df0cb4ae7055b0a4a6dc9875aa5926131fe5f4 as backported |
| - .pick_status.json: Mark 942ba4e34124d1058492f544dc8fd42f4012fd12 as backported |
| - .pick_status.json: Mark ea27f2bf092f462171fe14a44619565d14f43fb8 as backported |
| - .pick_status.json: Mark 5f1b3544729178715a1ed0714bd1029737089824 as backported |
| |
| Ella-0 (1): |
| |
| - v3dv: Wayland WSI support |
| |
| Eric Anholt (156): |
| |
| - util/hash_table: Handle NULL ht in \_mesa_hash_table_clear(). |
| - util/hash_table: Clean up the \_mesa_hash_table_clear() implementation. |
| - util/set: Fix the \_mesa_set_clear function to not leave tombstones. |
| - nir/validate: Size the set of blocks to avoid rehashing. |
| - nir_builder: Return a new builder from nir_builder_init_simple_shader(). |
| - nir/builder_tests: Drop unused lin_ctx. |
| - nir/tests: Simplify the mem_ctx setup in our unit tests. |
| - intel: Drop the last uses of a mem_ctx in nir_builder_init_simple_shader(). |
| - nir/builder: Drop the mem_ctx arg from nir_builder_init_simple_shader(). |
| - nir/builder: Add a name format arg to nir_builder_init_simple_shader(). |
| - ci: Move the rust cleanup in lava_build out of the middle of kernel build. |
| - ci: Only install kernel modules for LAVA devices. |
| - ci/freedreno: Group the short a630 dEQP runs into one test job. |
| - ci/deqp: Allow specifying the caselist fraction separate from CI_NODE_INDEX. |
| - ci: Bump deqp to current vulkan-cts-1.2.4 |
| - ci: Re-enable the clip_three test on non-freedreno ARMs. |
| - ci/db410c: Fix networking so we get artifacts from our jobs. |
| - gallium/draw: Fix rasterizer_discard for wide points/lines. |
| - freedreno: Fix leak of shader binary on disk cache hits. |
| - nir: Add a size_align helper function for aligning elements to 16 bytes. |
| - freedreno/ir3: Include at least 4 NOPs so that cffdump doesn't disasm junk. |
| - freedreno/ir3: Switch emit_const_ptrs() to take BOs instead of prscs. |
| - freedreno/ir3: Fix incorrect optimization of usage of 16-bit constbuf vals. |
| - freedreno+turnip: Upload large shader constants as a UBO. |
| - freedreno: Disable PIPE_CAP_PREFER_IMM_ARRAYS_AS_CONSTBUF. |
| - turnip: Assert about the storage buffer offset alignment. |
| - ci: Enable -Werror in more clover builds. |
| - freedreno: Fix release build warnings for asserted temp vars. |
| - freedreno/a6xx: Fix use of uninitialized img->level in the SSBO/image path. |
| - freedreno: Fix warning about uninit size for the size==0 special case. |
| - freedreno: Fix uninitialized var warning in afuc using unreachable(). |
| - freedreno: Suppress uninit var warnings from shader stage switch. |
| - ci: Bring freedreno into the "warnings clean release build" fold. |
| - freedreno/afuc: Fix up some sprintf format security warnings. |
| - gallium: Fix leak of the merged driconf options. |
| - freedreno: Fix leak of u_transfer_helper. |
| - egl: Skip closing drivers when building with AddressSanitizer. |
| - meson: Remove old todo comment about pthread stubs. |
| - gallium: Fix leak of bound SSBOs at CSO context destruction. |
| - gallivm: Fix max const buffer count. |
| - gallium: Fix leak of currently bound UBOs at CSO context destruction. |
| - freedreno: Break out of "should we free the entry" loop once we've freed. |
| - xmlconfig: Add unit tests for recent bugs in the driconf rewrite. |
| - xmlconfig: Warn if parsing the engine/app versions fails. |
| - gallium/osmesa: Fix flushing and Y-flipping of the depth buffer. |
| - gallium/osmesa: Remove the broken buffer-reuse scheme. |
| - gallium/osmesa: Fix data race on setting up the ST API. |
| - gallium/osmesa: Fix leak of the ST manager/api on library unload. |
| - gallium/osmesa: Return cleanly for OSMesaGetDepthBuffer() with no depth. |
| - ci/freedreno: Detect the cheza power management bus error and restart. |
| - ci/vc4: Skip VS dynamic loops tests that cause GPU hangs. |
| - softpipe: Fix swizzled texture gather of int textures. |
| - osmesa/test: Clear the stencil bits in the depth test. |
| - docs: Fix the documentation of the OSMesa path. |
| - mesa: Retire classic OSMesa. |
| - ci: Make sure that osmesa stays warnings-clean in release builds. |
| - st/mesa: Replace mesa_to_tgsi() with prog_to_nir() and nir_to_tgsi(). |
| - gallium/ntt: Don't manually reindex instrs. |
| - gallium/ntt: Drop reindexing of SSA defs and regs. |
| - nir: Redefine start/end_ip of blocks to fix NIR-to-TGSI liveness bugs. |
| - etnaviv, v3d: Fix valgrind include paths. |
| - util: Fix memory leak in a hash table unit test. |
| - util/vma: Fix leak of the heap in the unit test. |
| - glx/tests: Remove unused teardown function. |
| - glx/tests: Fix leaks in the unit tests. |
| - freedreno/ir3: Free the compiler at the end of the unit tests. |
| - disk_cache: Fix memory leaks in the unit test. |
| - glsl/general_ir_test: Fix leaks. |
| - glsl/uniform_initializer_tests: Fix memory leak |
| - mapi: Fix symbols check with ASan enabled. |
| - glsl/standalone: Fix memory leaks |
| - driconf: Fix memory leak in the unit test. |
| - amd: Fix leak in ac_surface_modifier_test. |
| - ci: Add an ASan build on x86. |
| - ci/freedreno: Treat all freedreno deqp runs as saving results. |
| - ci/freedreno: Stop specifying the number of deqp threads |
| - mesa/st: Finalize the texture before BlitFramebuffer from it. |
| - freedreno/a6xx: Flush depth at the end of bypass rendering, too. |
| - ci/deqp: Make sure that we pull in all board-specific xfail/skip/flake files. |
| - lvp: Fix vtn warnings about unsupported image read/write without format. |
| - softpipe: count CS invocations for pipeline stats queries. |
| - mesa/st: Fix use-after-free of the draw VS. |
| - ci: Disable the now flaky Portals.trace on a630. |
| - ci/deqp: Move .shader_cache artifacts exclusion to the yml. |
| - ci/deqp: Upgrade the runner, enable junit output. |
| - ci/deqp: Move the load reporting to a quiet block. |
| - mesa/st: Update FP state when textures change with an ATI_fs bound. |
| - mesa/prog_to_nir: Factor out the texture-target-to-sampler-dim helper. |
| - mesa/ati_fs: Clean up writemask handling. |
| - st/mesa: Generate NIR for ATI_fragment_shader instead of TGSI. |
| - gallivm: Use the proper enum for the texture target bitfield. |
| - softpipe: Enable GLSL 400 for compat contexts too. |
| - ci/piglit: Include the updated piglit results list in the job results. |
| - ci/softpipe: Include a piglit run. |
| - gallium/ntt: Fix check for "is there anything in the else block?" |
| - ci/deqp: Fix inverted meaning of DEQP_NO_SAVE_RESULTS. |
| - freedreno: Enable GLSL 1.50, updating us to GL 3.2 contexts. |
| - ci/panfrost: Disable the flaky gimark trace. |
| - gallium/draw: Fix intermittent failure to bind new geometry shaders. |
| - ci/softpipe: Re-enable GS tests that had been banned for being flaky. |
| - gallium/tgsi_exec: Fix shared memory atomic ops. |
| - gallium/tgsi_exec: Reuse the atomic helper for SSBO atomics. |
| - gallium/tgsi_exec: Use the new SSBO lookup interface for SSBO loads. |
| - gallium/tgsi_exec: Move the SSBO store path to tgsi_exec, too. |
| - gallium/tgsi_exec: Replace the SSBO RESQ-specific interface with lookup. |
| - softpipe: Sanity check that the SSBO view offset is within the BO. |
| - ci/softpipe: Skip flaky triangle-rasterization-overdraw. |
| - ci/softpipe: Ban glx-multithread-texture, too. |
| - ci/softpipe: Update the comment about the rasterpos flake. |
| - ci/bare-metal: Drop extra DEQP_PARALLEL settings. |
| - ci/bare-metal: Pass through FDO_CI_CONCURRENT on bare-metal runners. |
| - ci: Add a530 and a630 piglit runs. |
| - gallium/tgsi_exec: Simplify GS output vertex count tracking. |
| - gallium/tgsi_exec: Stop doing the weird allocation of the Addrs array. |
| - gallium/tgsi_exec: Drop the unused scratch temp regs. |
| - gallium/tgsi_exec: Clean up storage of the pixel kill mask. |
| - gallium/tgsi_exec: Remove unused MaxGeometryShaderOutputs. |
| - freedreno/ir3: Deduplicate link_stream_out. |
| - freedreno/a5xx: Drop redundant stream output linking check. |
| - freedreno/a5xx: Move link_stream_out after VPC_VAR_DISABLE like on a6xx. |
| - gallium/tgsi_exec: Fix assertion failure about missing constbufs. |
| - gallium/tgsi_exec: Refactor to fix CS local memory overflow checks. |
| - gallium/tgsi_exec: Add support for PIPE_CAP_LOAD_CONSTBUF. |
| - gallium/ntt: Fix emitting UBO declarations. |
| - gallium/ntt: Fix dynamic indirect indexing of per_vertex_input. |
| - gallium/ntt: Fix load_ubo_vec4 buffer index setup. |
| - gallium/ntt: Add support for PIPE_CAP_LOAD_CONSTBUF. |
| - turnip: Move the limited_z24s8 flag to the shared device info. |
| - freedreno/a6xx: Move the IBO pipe2tex down to where it's used. |
| - freedreno/a6xx: Fix z24s8 non-ubwc blits on a630. |
| - freedreno: Disable UBWC on z24s8 on a630. |
| - freedreno: Mark a615/a618 as also lacking Z24_UINT_S8_UINT support. |
| - freedreno: Add missing dep on u_tracepoints. |
| - ci: Disable the freedreno farm, which went down last night. |
| - gallium/ntt: Drop XXX comment about supporting carry opcodes. |
| - gallium/ntt: Emit SSBO buffer declarations. |
| - gallium/ntt: Emit sample index when necessary for image load/store. |
| - gallium/ntt: Add support for emitting TXF_LZ. |
| - gallium/ntt: Drop comment about needing loop label setup. |
| - gallium/ntt: Drop comment about needing array_id for svga tess. |
| - gallium/ntt: Work around virglrenderer UIF handling bug. |
| - nir/lower_locals_to_regs: Use the imul_imm helper instead of forcing it. |
| - gallium/ntt: Fix leak of the per-instr liveness information. |
| - mesa/st: Free the NIR builtins TGSI tokens after passing to the driver. |
| - mesa/st: Free the ARB_vp/fp nir-to-tgsi temporary tokens. |
| - gallium/ntt: Take ownership of the NIR shader we're passed. |
| - Revert "ci: Disable the freedreno farm, which went down last night." |
| - util/format: Fix pack/unpack of A1R5G5B5_UINT. |
| - swr: Don't report support for shader images. |
| - panfrost: Stub out set_shader_images(). |
| - gallium: Fix leak of shader images on context destruction. |
| - mesa/st: Allocate the gl_context with 16-byte alignment. |
| - vc4: Remove vestiges of alpha test lowering. |
| - v3d: Clean up vestiges of alpha test lowering. |
| - freedreno: Add missing dep on freedreno tracepoints. |
| - r300,i915g: Report no shader buffers or images on non-TCL HW. |
| |
| Eric Engestrom (3): |
| |
| - gitlab-ci: drop deprecated platforms that snuck in when nobody was watching |
| - meson: drop deprecated EGL platform build options |
| - docs: use a single cell for the branch number |
| |
| Erico Nunes (6): |
| |
| - lima: define set_clip_state implementation |
| - mesa: allow half float textures based on ARB_half_float_pixel |
| - lima: add support for half float textures |
| - lima: adjust pp and gp max const buffer size |
| - nir/lower_vec_to_movs: don't vectorize unsupports ops |
| - lima: fix max sampler views |
| |
| Erik Faye-Lund (133): |
| |
| - softpipe: correct signature of get_compiler_options |
| - util/slab: allow usage from c++ code |
| - compiler: add SYSTEM_BIT_FRONT_FACE |
| - microsoft/compiler: add dxil-util code |
| - microsoft/compiler: translate nir to dxil |
| - d3d12: introduce d3d12 gallium driver |
| - d3d12: ensure all compoents of clip-distances are written |
| - d3d12: avoid searching twice for bos |
| - util/u_process: implement util_get_process_name for Windows |
| - d3d12: fix code after simple-shader helper changes |
| - microsoft/compiler: remove unused struct |
| - microsoft/compiler: move c++ higher up |
| - microsoft/compiler: inline some struct-declarations |
| - microsoft/compiler: correct typo |
| - meson: verify that d3d12.h exists when building the d3d12 driver |
| - util: fix unknown pragma warning on msvc |
| - mesa/main: add missing include in glformats.h |
| - docs/features: document d3d12 features |
| - zink: mark general layout as transfer-read/write |
| - zink: always insert barriers for general-layout |
| - zink: more accurately track supported blits |
| - mesa/st: Introduce WINSYS_HANDLE_TYPE_D3D12_RES |
| - d3d12: Support WINSYS_HANDLE_TYPE_D3D12_RES |
| - d3d12: also reject GDI-supporting pixel-formats |
| - llvmpipe: fix arith-test build on msvc |
| - d3d12: transition the right planes |
| - docs: add basic docs for d3d12 driver |
| - zink: fix layered resolves |
| - zink: fall back to util_blitter for scaled resolves |
| - Revert "zink: update shader modules in gfx program when flagged dirty" |
| - Revert "zink: put those shader keys to work fixing up fragment shaders" |
| - Revert "zink: fill in params for fs shader keys and flag shader for rebuild" |
| - Revert "zink: move shader key structs into their own header" |
| - Revert "zink: refcount the shader cache" |
| - Revert "zink: initial implementation of shader keys" |
| - Revert "tgsi: Fix helgrind complaint about one-time init" |
| - Revert "gallium/trace: Fix helgrind complaint about one-time init" |
| - Revert "mesa: Fix helgrind complaint about one-time init" |
| - Revert "util: Fix helgrind complaint about one-time init" |
| - Revert "mesa/st: Use do_once for one-time init" |
| - Revert "gallium/hud: Use do_once for one-time init" |
| - Revert "freedreno/ir3: Use get_once() for one-time init" |
| - Revert "nir: Use get_once() helper for one-time init's" |
| - Revert "util: Add helpers for various one-time-init patters" |
| - docs: document new zink-flag |
| - d3d12: lower bitfield_extract to shifts |
| - d3d12: do not inspect NULL samplers |
| - util/slab: do not dereference NULL-pointer |
| - zink: revert to old load_ubo implementation |
| - docs: break project history out of front-page |
| - docs: move major versions history out of front-page |
| - docs: use external link-references |
| - docs: do not explicitly call out es-versions |
| - docs: mention egl in api-list |
| - docs: inline contents.rst into index.rst |
| - gitlab-ci: store build-artifacts from building mesa |
| - gitlab-ci: build zlib statically on windows |
| - gitlab-ci: build piglit in mesa_deps.ps1 |
| - gitlab-ci: run piglit on windows |
| - gitlab-ci: ignore nv_copy_depth_to_color |
| - gitlab-ci: do not clone git-repo for test-job |
| - microsoft/clc: use files-function for source-list |
| - microsoft/clc: add missing dependency |
| - microsoft/clc: increase test-timeout |
| - zink: do not require VK_KHR_external_memory |
| - lavapipe: set some basic usage-flags |
| - gallium/targets/libgl-gdi: prefer d3d12 driver |
| - lavapipe: fix logic-op support |
| - gallium: do not reset buffers for unsupported stages |
| - zink: fix channel ordering in format-mapping |
| - lavapipe: interpret inputRate as an enum-value |
| - lavapipe: implement VK_EXT_vertex_attribute_divisor (v2) |
| - zink: fail if set failed to create |
| - zink: use \_mesa_pointer_set_create for simplicity |
| - gitlab-ci: copy piglit expected results to artifacts |
| - .gitlab-ci: verify that Get-Content worked |
| - mesa: do not allow es2-extension enums for es1 |
| - mesa: check for extension instead of desktop GL |
| - gallium/util: make bitcast-helpers explicitly sized |
| - gallium/util: add bitcast helpers for double and uint |
| - zink: force display-targets to be linear |
| - Revert "st/dri: make sure software color-buffers are linear" |
| - zink: use shader-read-only-optimal for samplers |
| - zink: use emit_bitcast helper |
| - zink: ralloc spirv_shader |
| - zink: fix 8 bit index handling code |
| - zink: convert x8-formats in zink_get_format |
| - zink: make zink_format all about raw format-translation |
| - zink: fix format-mapping |
| - zink: add format test |
| - zink: map some more formats |
| - lavapipe: implement VK_EXT_index_type_uint8 |
| - zink: nir_op_b2f64 implementation |
| - zink: more conversion ALUs |
| - docs/features: update list of zink features |
| - zink: document some more features for higher GL versions |
| - zink: only emit each cap once |
| - zink: do not open-code CALLOC_STRUCT |
| - zink: factor out zink_batch_release-helper |
| - zink: destroy blitter before destroying batches |
| - zink: release batch memory |
| - zink: do not leak vertex element state |
| - zink: dot leak dummy_buffer |
| - zink: free sets and hash-tables in context |
| - zink: destroy transfer-helper |
| - zink: destroy device and instance |
| - zink: do not use reservations for stream-out |
| - zink: do not reserve or pack fragment outputs |
| - zink: use ConstOffset for nir_tex_src_offset |
| - zink: use lower_scmp instead of open-coding |
| - zink: also lower scmp for soft-fp |
| - zink: remove support for fcsel |
| - gallium/util: do not perform n^2 stencil blits |
| - gallium/ntt: lower uniforms to ubo |
| - zink: disable render_condition_enable during blit |
| - microsoft/compiler: correct dxil fma opcode |
| - microsoft/compiler: do not lower away 64-bit ffma |
| - zink: rename zink vs pipe variables |
| - zink: setup compiler options during init |
| - zink: add missing opcodes |
| - zink: add missing 64-bit integer ops |
| - zink: use hardware int64 when supported |
| - mesa/st: fix regression for basic drivers |
| - zink: handle NULL views in zink_set_sampler_views |
| - zink: fix vertex-stride wrangling |
| - zink: respect feature-cap for independent blending |
| - zink: respect feature-cap for sample-shading |
| - zink: respect feature-cap for multi-draw indirect |
| - zink: make all xfb caps depend on extension |
| - zink: require vulkan memory model for tesselation |
| - zink: respect fragment-shader depth-layout |
| - zink: clone shader before lowering clip_halfz |
| - mesa/main: remove leftover bumpmap code |
| |
| Francisco Jerez (1): |
| |
| - intel/gen12: Fix memory corruption issues in fused Gen12 parts. |
| |
| Georg Lehmann (3): |
| |
| - vulkan/device-select: fix vkGetInstanceProcAddr self-resolving |
| - vulkan/overlay: fix vkGetInstanceProcAddr self-resolving |
| - vulkan/device_select: Only call vkGetPhysicalDeviceProperties2 if the device supports it. |
| |
| Gert Wollny (36): |
| |
| - util/format_zs: Add C++ include handling |
| - nir/print: print GS extra info |
| - r600/sfn: lower bool to int32 only after common optimizations |
| - r600/sfn: use a per stream index register in GS |
| - r600/sfn: Correctly lower all int64 |
| - r600/sfn: fix component loading from fixed buffer ID |
| - r600/sfn: Add lowering pass to convert load_interpolated to load for POS |
| - r600/sfn: Add simplified constructors for FS shader inputs. |
| - r600/sfn: lower IO for FS inputs and handle interpolation accordingly |
| - r600/sfn: remove unused FS input deref code |
| - r600/sfn: Fix vertex stage export to accomodate IO lowering |
| - r600/sfn: lower VS output IO |
| - r600/sfn: Lower tess-eval IO |
| - r600/sfn: drop store_deref handling for VS and TES |
| - r600/sfn: lower GS IO |
| - r600/sfn: simplify IO lowering and fix TESS IO lowering |
| - r600/sfn: lower all IO in one pass |
| - r600/sfn: correct error signalling in switch default case |
| - r600/sfn: fix definition of priority queue |
| - r600/sfn: Fix a few warnings in release builds |
| - r600/sfn: remove unused file |
| - r600/sfn: remove leftover debug message |
| - r600/sfn: Fix dest-swizzle for GS vertex loads |
| - r600/sfn: Add support for shader_clock |
| - mesa/st: lower 64 bit ops to scalar before lowering to soft-float |
| - r600/sfn: merge SpecialValue and InlineConstValue |
| - doc: virgl supports ARB_texture_filter_anisotropic already |
| - r600: Support TGSI_OPCODE_I64NEG |
| - r600/sfn: C++ lower-instruct implementation |
| - r600/sfn: Add number for source components for split_y |
| - r600/sfn: add lowering passes to get 64 bit ops lowered to 32 bit vec2 |
| - r600/sfn: tie in 64 lowering code |
| - r600: enable support for 64 bit DIVMOD when NIR is used |
| - r600: enable fp64 lowering to softemu with NIR |
| - r600/nir: use "unreachable" instead of "assert" |
| - r600/sfn: fix use of b32all/and |
| |
| Giovanni Mascellani (2): |
| |
| - disk_cache: Fail creation when cannot inizialize queue. |
| - anv: Allow null handle in DestroyDescriptorUpdateTemplate. |
| |
| Hans-Kristian Arntzen (2): |
| |
| - vulkan: Update to 1.2.164. |
| - radv: Implement VK_VALVE_mutable_descriptor_type. |
| |
| Hoe Hao Cheng (11): |
| |
| - zink: define and use \<%guard\> helper in zink_device_info |
| - zink: decouple features and enabling conditions in zink_device_info.py |
| - zink: move blend_operation_advanced conditions to zink_device_info.py |
| - zink: remove useless import in zink_device_info.py |
| - zink: allow Extension/Version to be shared across files |
| - zink: generate instance creation code with a python script |
| - zink: hook zink_instance to build |
| - zink: replace old code with generated zink_instance |
| - zink: fix property detection |
| - zink: add support for VK_EXT_4444_formats |
| - zink: VK_KHR_draw_indirect_count is a device extension |
| |
| Hyunjun Ko (6): |
| |
| - vulkan: Enable VK_KHR_performance_query on android |
| - turnip: Implement VK_KHR_performance_query |
| - turnip: support multipass for performance query. |
| - turnip: enable VK_KHR_performance_query with new debug flag |
| - turnip/kgsl: support VK_KHR_performance_query |
| - turnip: use ir3_compiler_destroy instead of ralloc_free |
| |
| Iago Toral Quiroga (33): |
| |
| - zink: only add MESA WSI structs for specific devices |
| - v3dv: fix typo |
| - v3dv: move authenticated display fd acquisition to swapchain creation time |
| - v3dv: fix width for buffer view texture state |
| - v3dv: add a buffer to image copy path using a texel buffer |
| - v3dv: initialize pipeline layouts for meta operations at driver initialization |
| - v3dv: blit shader clean-ups |
| - v3dv: rename playout and dslayout fields to use underscores. |
| - v3dv: use VkSurface to retrieve an authenticated display fd |
| - v3dv: remove box check from texel buffer copy fragment shader |
| - v3dv: remove redundant free of default pipeline attributes BO |
| - v3dv: only write new uniforms when needed |
| - v3dv: remove obsolete comment |
| - v3dv: fix allocation size for BO handles |
| - v3dv: fix leak in the buffer to image copy via texel buffer |
| - v3dv: batch buffer to image copies with the texel buffer path if possible |
| - v3dv: extend the list of formats supported by the TFU unit |
| - v3dv: remove obsolete disabled code |
| - v3dv: support compressed formats with TFU unit |
| - v3dv: add a format parameter to emit_tfu_job |
| - v3dv: add a TFU path for image copies |
| - v3dv: fix base layer for 3D blits in the TFU path |
| - v3dv: expand format coverage in TFU path for buffer to image copies |
| - v3dv: check return value of drmGetMagic |
| - v3dv: expand the formats that can be handled in the TFU blit path |
| - v3dv: handle Z mirroring in the TFU blit path |
| - v3dv: add a helper to choose a compatible TFU format |
| - v3dv: ignore filter in TFU blit path |
| - v3dv: move error string definition to debug path |
| - v3dv: don't log out of pool memory errors for internal driver pools |
| - v3dv: fix early return from failed drmGetMagic |
| - v3dv: fix incorrect slice selection for TFU jobs |
| - v3dv: fix BO list for TFU jobs |
| |
| Ian Romanick (23): |
| |
| - intel/compiler: Rotate instructions ROR and ROL cannot have source modifiers |
| - intel/compiler: Delete redundant MAC declaration |
| - intel/fs: Silence unused parameter warning in filter_simd |
| - intel/fs: Add support for printing half-float immediate values |
| - util: Add cnd_monotonic to Makefile.sources |
| - nir: Make some notes about fsign versus NaN |
| - nir/algebraic: Make some notes about comparison rearrangements versus infinity |
| - Revert "nir: Replace an odd comparison involving fmin of -b2f" |
| - nir/algebraic: Don't add reordered version of patterns for commutative instructions |
| - nir: Correctly constant fold fsign(NaN) and fsign(-0) |
| - nir/algebraic: Mark some logic-joined comparison reductions as exact |
| - nir/algebraic: Add some compare-with-zero optimizations that are exact |
| - spir-v: Mark floating point comparisons exact |
| - nir/algebraic: Fix broken NaN and -0.0 behavior |
| - nir/algebraic: Mark comparisons generated from lowered fsign precise |
| - nir/algebraic: Move the flrp -\> bcsel rule earlier |
| - i965: Don't parse driconf again |
| - nir/algebraic: Fix a \>\> \#b \<\< \#b for sizes other than 32-bit |
| - intel/compiler: Properly handle shift count for 8-bit sources |
| - intel/compiler: Enable the ability to emit CMPN instructions |
| - intel/compiler: Make the CMPN builder work like the CMP builder |
| - intel/compiler: Use CMPN for min / max on Gen4 and Gen5 |
| - nir/algebraic: Fix some min/max of b2f replacements |
| |
| Icecream95 (54): |
| |
| - rbug: Forward get_compiler_options to pipe driver |
| - rbug: Handle non-TGSI shaders |
| - panfrost: Fix AFBC blits of resources with faked RGTC |
| - panfrost: Fix stack shift calculation |
| - pan/mdg: Try demoting uniforms instead of spilling to TLS |
| - panfrost: Split up batches with many jobs |
| - pan/gen_pack: Fix signed integer packing |
| - panfrost: Fix negative LOD bias support on Bifrost |
| - pan/decode: Fix "Access to unknown memory" message formatting |
| - panfrost: Fix precise occlusion queries on Bifrost |
| - panfrost: Fix CLAMP wrap mode |
| - panfrost: Fix the Maximum anisotropy field in the XML |
| - panfrost: Set the anisotropy level when cso->max_anisotropy is set |
| - panfrost: Add a gpu_revision argument to panfrost_get_quirks |
| - panfrost: Expose ARB_texture_filter_anisotropic on supported GPUs |
| - panfrost: Fix panfrost_small_padded_vertex_count for 17 vertices |
| - panfrost: Fix discard behaviour on Bifrost |
| - nir: Handle load_kernel_input in nir_get_io_offset_src |
| - pan/mdg: Fix promoted uniform moves with 64-bit types |
| - pan/mdg: Add load_kernel_input support |
| - pan/mdg: Implement load_global_invocation_id |
| - pan/mdg: Set compute lowering options |
| - panfrost: Stop lowering cs derived sysvals in glsl |
| - panfrost: Add a NIR pass to lower 64-bit vec3 intrinsic loads |
| - pan/mdg: Use the pan_nir_lower_64bit_intrin NIR pass |
| - pan/mdg: Support nir_intrinsic_load_global_constant |
| - pan/mdg: Support nir_intrinsic_group_memory_barrier |
| - panfrost: Allow NULL for some binding functions |
| - pan/mdg: Replace zext with a type enum |
| - pan/mdg: Return false instead of asserting in mir_args_ssa |
| - pan/mdg: Add i2i64 to mir_match_offset |
| - pan/mdg: Pass the memory type to mir_set_offset directly |
| - pan/mdg: Invert the type conditional for load intrinsics |
| - pan/mdg: Support loads and stores to scratch memory |
| - panfrost: Stub out panfrost_render_condition |
| - panfrost: Set conditional render cap |
| - gallium: Add new cap PIPE_CAP_TEXTURE_BUFFER_SAMPLER |
| - docs: Mention PIPE_CAP_TEXTURE_BUFFER_SAMPLER |
| - st/mesa: Use samplers for buffer textures if requested |
| - panfrost: Make the width argument to panfrost_new_texture 32 bits |
| - panfrost: Support buffer sampler views |
| - panfrost: Fix textureSize for buffer textures |
| - panfrost: Enable ARB_texture_buffer_object |
| - panfrost: Dual-source blending on Bifrost |
| - pan/bi: Add a define for the Bifrost shader prefetch size |
| - pan/bi: Add some zero bytes after shaders on Bifrost |
| - panfrost: Fix size assertion in bi_alu_src_index |
| - pan/mdg: Fix spilling when scratch memory is used |
| - pan/bi: Iterate from zero when setting RA interference |
| - pan/decode: Free mapped memory objects on BO unreference |
| - panfrost: Use normal malloc/free instead of ralloc for surfaces |
| - panfrost: Add the tiler heap to fragment jobs |
| - pan/bi: Use the correct size for UBO loads |
| - st/mesa: Update constants on alpha test change if it's lowered |
| |
| Ilia Mirkin (18): |
| |
| - nv50: only support 4 components in separate xfb mode |
| - nv50: fake enough resume support pre-nva0 to pass gles3 requirements |
| - mesa/teximage: show internal format when printing verbose api log |
| - nv50/ir: allow a mov to emit directly to a shader output |
| - nv50: fix instancing of client-side vertex buffers |
| - nv50,nvc0: serialize between before/after using a zeta surface as color |
| - nv50: use 2d blit when m2mf doesn't support the copy |
| - nouveau: change fence destruction logic on screen destroy |
| - nouveau: add drm-shim support |
| - ci: include nouveau in shader-db runs |
| - nouveau: trigger the current fence's work on destroy explicitly |
| - glsl: only expose int64 atomics when extension is enabled |
| - cso: set index_bounds_valid = true for arrays draws |
| - nvc0: index_bias is now only set for indexed draws |
| - st/mesa: fix broken moves for u2i64 and related ops |
| - nv50/ir: clear dnz flag when converting mul/mad to simpler ops |
| - nvc0/ir: add fixup to deal with interpolateAtSample with non-MSAA |
| - nouveau: reinstate fencing on screen destroy |
| |
| Indrajit Kumar Das (3): |
| |
| - radeonsi/gfx10: fix overflow and primitive queries |
| - radeonsi/gfx10: added support for gfx10 conditional rendering |
| - radeonsi/gfx10: fix issue with multiple overflow queries on the same context |
| |
| James Jones (4): |
| |
| - gallium: Add pipe_screen::is_dmabuf_modifier_supported |
| - gallium: Add format modifier plane count query |
| - gallium/dri: Factor out DRI extension setup code |
| - gallium/dri: Use per-screen DRI extension list |
| |
| James Park (54): |
| |
| - radv: Fix radv_queue_init failure handling |
| - c11/threads: Fix Win32 timed functions |
| - c11/threads: Remove Win32 null checks |
| - c11/threads: Remove Windows XP support |
| - util/os_time: Safe os_time_get_nano for Windows |
| - util,radv: Cross-platform monotonic condition variable |
| - radv: Const aco_compiler_statistic_info usage |
| - amd: Simplify ac_addrlib_create |
| - amd: Cast to int for %d snprintf argument |
| - amd: Remove bitfield sizes from enum values |
| - amd: Stub sections that don't have \_WIN32 support |
| - amd: Replace vasprintf with vfprintf |
| - amd: Work around MSVC limit for string literals |
| - amd: Fix signature mismatch |
| - amd: Fix declaration mismatch |
| - amd/common: Check with_tests before adding test |
| - vulkan: Remove GCC pragmas by fixing warnings |
| - vulkan: Replace pthread mutex with mtx_t |
| - vulkan: Portable wsi_common_get_current_time() |
| - util: Add os_localtime |
| - vulkan/util: Consolidate typed_memcpy |
| - aco: Define NOMINMAX in Meson build file |
| - aco: Fix warnings about unsafe integer/bool mix |
| - aco: Add missing C++ includes |
| - aco: Remove nonstandard parentheses |
| - aco: Declare num_reduce_ops for array size |
| - aco: Const correct aco_compiler_statistics |
| - aco: Replace indexed array initialization |
| - aco: Use u_memstream instead of POSIX memstream |
| - aco: Initialize union within Operand for MSVC |
| - aco: Fix warnings for bools in bitwise logic |
| - aco: Stub sections that don't have \_WIN32 support |
| - aco: Avoid extra bitfield padding |
| - radv: Exclude amdgpu driver files for Windows |
| - radv: Update build defines for Windows |
| - radv: Replace VLAs with alloca |
| - radv: Wrap pragmas with \__GNUC_\_ to fix MSVC |
| - radv: Use os_localtime instead of localtime_r |
| - radv: Don't return value in void function |
| - radv: Ignore radv_printflike on Windows |
| - radv: Update radv_assert for MSVC |
| - radv: Fix callback signatures |
| - radv: Fix leak in radv_amdgpu_winsys_destroy() |
| - radv: Fix function parameter types |
| - radv: Use standard \__VA_ARGS_\_ macro |
| - radv: Create shader cache if ENABLE_SHADER_CACHE |
| - radv: Use unsigned with u_bit_scan for MSVC |
| - radv: Replace pthread mutex with mtx_t |
| - radv: Replace pthread thread with thrd_t |
| - radv: Use portable ffs and util_bitcount macros |
| - util: Disable \[[fallthrough]\] for C17 |
| - xmlconfig: Disable WITH_XMLCONFIG on Windows |
| - util: Disable memstream for Apple builds |
| - gallium/tessellator: Fix warning suppression |
| |
| Jan Beich (1): |
| |
| - util: unbreak on BSDs after MSVC changes |
| |
| Jason Ekstrand (63): |
| |
| - intel/fs: Fix use of undefined value in fixup_nomask_control_flow |
| - nir/lower_io: Add data OOB asserts to write_constant |
| - nir: Add a more generic helper for gathering constant initializers |
| - nir,clover: Drop nir_lower_mem_constant_vars |
| - nir: Rewrite lower_undef_to_zero |
| - Revert "anv/image: Define anv_image_get_aux_addr (v3)" |
| - vulkan: Update XML and headers to 1.2.162 |
| - spirv: Rename some ray-tracing intrinsics to NV |
| - spirv: Update JSON and headers from Khronos main |
| - spirv: Implement OpTraceRayKHR and OpExecuteCallableKHR |
| - spirv: Call repair SSA for OpTerminateInvocation |
| - spirv: Implement OpTerminateRayKHR and OpIgnoreIntersectionKHR |
| - spirv: Implement SpvOpConvertUToAccelerationStructureKHR |
| - nir: Add a halt instruction type |
| - spirv: Emit nir_jump_halt after TerminateRay or IgnoreIntersection |
| - intel/dev: Add a gen_device_info::has_ray_tracing bit |
| - intel/genxml: Add the BINDLESS_SHADER_RECORD data structure |
| - intel/genxml/pack: Stash the cloned address field |
| - intel/genxml: Support truncated addresses |
| - intel/genxml: Add RT_DISPATCH_GLOBALS and RT_*_SBT_HANDLE structs |
| - intel/genxml: Add BVH data structures |
| - nir: Add a helper to get the live set at a cursor |
| - nir/lower_io: Allow ray_hit_attrib in lower_vars_to_explicit_types |
| - nir/lower_io: Support shader_call_data in vars_to_explicit_types |
| - intel/debug: Add a debug flag for ray-tracing shaders |
| - intel/compiler: Add support for bindless shaders |
| - intel/rt: Add a brw_rt.h header with \#defines for basic RT data structures |
| - intel/fs: Add and implement a load_global_const_block intrinsic |
| - intel/rt: Add builder helpers for accessing RT data structures |
| - intel/rt: Add a pass to lower the new ray-tracing intrinsics |
| - intel/rt: Add lowering functions for each ray-tracing stage |
| - intel/rt: Add support for scratch in ray-tracing shaders |
| - intel/rt: Add return instructions at the end of ray-tracing shaders |
| - intel/rt: Add a pass to lower shader call instructions |
| - intel/rt: Add a helper to create a trivial return shader |
| - intel/rt: Implement support for shader call payloads |
| - intel/fs: Add and implement intel-specific ray-tracing intrinsics |
| - intel/rt: Implement traceRay() |
| - intel/rt: Implement the new ray-tracing system values |
| - intel/rt: Add support for shader buffer record memory |
| - intel/rt: Add lowering for ray-walk intrinsics in any-hit shaders |
| - intel/rt: Add lowering for combined intersection/any-hit shaders |
| - intel/rt: Add a helper to create the raygen trampoline shader |
| - intel/rt: Add support for hit attributes |
| - intel/rt: Implement push constants as global memory reads |
| - nir: Use the right argument order for load_scratch_base_ptr |
| - intel/fs: DISCARD_JUMP does not have side-effects |
| - intel/fs: Rename PLACEHOLDER_HALT to HALT_TARGET |
| - intel/fs: Use BRW_OPCODE_HALT for discards |
| - intel/fs: Remove unnecessary HALT_TARGET in opt_redundant_halt() |
| - intel/fs: Emit HALT_TARGET in emit_nir_code() |
| - intel/fs: Implement nir_jump_halt |
| - nir/lower_non_uniform: Refactor for better code organization |
| - nir/lower_non_uniform: Better handle non-derefs |
| - anv: Bump maxGeometryInputComponents to 128 on Gen8+ |
| - intel/compiler: Return 1 for immediates in regs_read |
| - intel/fs: QUAD_SWIZZLE requires packed data |
| - nir: Drop the lower_mem_constant_vars declaration |
| - vulkan: Make vk_debug_report_callback derive from vk_object_base |
| - nir: Don't optimize bcsel-of-shuffle across blocks |
| - nir: Fix parameter order in the bcsel-of-shuffle optimization |
| - intel/fs: Shuffle can't handle source modifiers |
| - anv/formats: Advertise linear sampling on depth formats |
| |
| Jeremy Huddleston (3): |
| |
| - util: Fix pointer to integer conversion error when using libunwind |
| - Fall back on clock_gettime when timespec_get() is unavailable |
| - Adjust dylib compatibility versions to match what was set by mesa-18.3's autotools-based builds |
| |
| Jesse Natalie (105): |
| |
| - microsoft/compiler: Fix reference to renamed intrinsic getter |
| - panfrost/util: Move nir_undef_to_zero into core nir and add 'lower' |
| - nir: Add nir_alu_type -\> glsl_base_type conversion helper |
| - vtn/opencl: Fix alignment for half vload/vstore |
| - nir_load_libclc: Mark libclc shader as internal |
| - spirv: Allow spirv_to_nir callers to provide a float execution mode |
| - microsoft: Add CLC frontend and kernel/compute support to DXIL converter |
| - d3d12: Add glon12 target which only includes d3d12 driver |
| - d3d12: Pipe adapter LUID from callbacks to D3D12 screen init |
| - wgl: Marshal HDC into screen creation and LUID querying |
| - wgl: Implement get_adapter_luid callback |
| - wgl: Add stw_winsys callback to check which PFD flags should be added |
| - wgl: Add PFD flags based on stw_winsys callback response |
| - wgl: Add winsys framebuffer object |
| - wgl: Use winsys framebuffer interface if present |
| - d3d12: Implement winsys framebuffer |
| - winsys/d3d12: Use MakeWindowAssociation to remove DXGI's alt+enter handling |
| - d3d12: Delete unused local variables |
| - microsoft/compiler: Remove dead code/variables |
| - d3d12: Fix brace-initialization issues |
| - d3d12: Fix signed-unsigned comparison warnings |
| - d3d12: Remove Windows-specific macros |
| - d3d12: Clean up d3d12_compiler.h |
| - d3d12: Fix unhandled switch case warnings |
| - microsoft/compiler: Fix unhandled switch case warnings |
| - d3d12: Misc fixes caught by GCC warnings / code inspection |
| - microsoft/compiler: Misc fixes caught by GCC |
| - d3d12: Fix use of incorrect clear color variable |
| - microsoft/compiler: Add missing 'return' to switch case |
| - d3d12: Fix GCC warnings for missing function prototypes |
| - windows: Always set NOMINMAX to remove min/max macros |
| - util: Add os_get_page_size query |
| - driconf: Avoid empty macro resulting in empty initializer braces |
| - gallium: Include winsock lib as a dependency for Windows |
| - gallium: Remove unnecessary forward declaration of swrast_driver_descriptor |
| - clover: Add opencl-native build flag |
| - clover: Support LLVM coming from CMake instead of config-tool |
| - clover: Add version.lib dependency for Clang on Windows |
| - meson: Adjust Clover's required LLVM modules |
| - clover: Fix property_element::as for MSVC |
| - clover/llvm: Work around MSVC quirks |
| - clover/core: Support MSVC |
| - clover/api: Support MSVC |
| - clover: Use .def files for exports on Windows |
| - clover/core: Fix x86 build |
| - gallium: Add optional pipe_context to flush_frontbuffer |
| - d3d12: Fix incorrect fence timeout calculation |
| - CI: Add repeat-wait to Windows Piglit skip |
| - d3d12: Use DirectX-Headers wrap for d3d12.h |
| - d3d12: Refactor screen to abstract DXGI details |
| - d3d12: Add DXCore screen variation |
| - microsoft/compiler: Pick up new dxcapi.h |
| - winsys_handle: Change D3D12 resource handle type to void\* |
| - d3d12: Include wsl/winadapter.h when not compiling for Windows |
| - d3d12: Include dxguids/dxguids.h in files that need \__uuidof |
| - d3d12: Use IID_PPV_ARGS instead of \__uuidof |
| - d3d12: Scope down wrl includes to just client.h |
| - d3d12: Add forward declaration for LUID |
| - d3d12: Use u_dl instead of Windows DLL APIs |
| - d3d12: Only play DLL path tricks on Windows |
| - d3d12: Only support DXGI and GDI APIs on Windows |
| - d3d12: Support Linux eventfds for fences |
| - d3d12: Don't require DXIL for WSL |
| - gallium/dri: Add D3D12 software driver option |
| - d3d12: Flush and wait in flush_frontbuffer |
| - drisw: Add fallback logic for choosing a driver to use |
| - drisw: Prefer hardware-layered sw-winsys drivers over pure sw |
| - nir: Add intrinsic and string ptrs |
| - nir/vtn: Implement printf opcode in terms of intrinsic (v9) |
| - nir: Add a printf lowering pass (v5) |
| - nir: Add an algebraic optimization for float->double->float |
| - microsoft/clc: Hook up printf |
| - microsoft/compiler: Fix warnings produced by GCC in release mode |
| - microsoft/compiler: Fix incorrect size passed to strncpy |
| - d3d12: Unused variable warning indicated bug in bo_unmap |
| - d3d12: Signed/unsigned comparison warning fixes |
| - d3d12: Fix unused local variable warning in release build |
| - d3d12: Fix implicit fallthrough warnings |
| - microsoft/resoure_state_manager: Silence GCC invalid offsetof warning |
| - d3d12: Fix clang warnings from {0} in C++ code |
| - d3d12: Fix uninitialized variable referenced in error case |
| - d3d12: Remove copy/pasted line of array initialization |
| - microsoft/compile: Fix incorrect enum type in function signature |
| - microsoft/compiler: Fix tautological comparison |
| - microsoft/resource_state_manager: Remove unused private variable |
| - microsoft/compiler: Fix clang fallthrough warnings |
| - microsoft/clc: Fix const violations from ralloc_steal |
| - CI: Install DirectX-Headers package for x86 container |
| - CI: Enable d3d12 driver for Linux CI builds |
| - nir: Update saturated float->int/uint conversion algorithm |
| - d3d12: Add a path for mapping of not-directly-mappable buffers |
| - d3d12: Add a slab bufmgr for readback buffers |
| - d3d12: Use buffer pipe usage to inform allocation |
| - d3d12: Use an appropriate pipe resource usage for map intermediates |
| - d3d12: Don't allocate mappable textures |
| - nir: Work around MSVC x86 internal compiler error |
| - drisw: Disable automatic use of layered drivers with LIBGL_ALWAYS_SOFTWARE |
| - wgl: Refactor screen creation to a function |
| - wgl: Add a loop for screen creation with an ordered list of fallbacks |
| - d3d12: Fail screen creation if a shader validator is needed and can't be created |
| - wgl: Disable automatic use of layered drivers with LIBGL_ALWAYS_SOFTWARE |
| - microsoft/clc: Let lower_vars_to_explicit_types fill kernel input driver_location |
| - microsoft/clc: Fix wrap modes for inline samplers for integer textures |
| - microsoft/clc: Move inline samplers to the end of the variable list |
| - microsoft/clc: Use driver_location for metadata instead of re-computing offsets |
| |
| Jonathan Gray (1): |
| |
| - aco: use UINT64_C on 64 bit constant arguments |
| |
| Jonathan Marek (9): |
| |
| - turnip: implement z-scaling and z-mirroring BlitImage |
| - turnip: no linear_to_srgb for alpha channel for gmem clear value packing |
| - turnip: do not include compute stage in pipeline_builder |
| - turnip: always emit LRZ draw state in DIRTY_DRAW_STATE path |
| - turnip: correctly disable draw states outside of renderpasses |
| - turnip: do not emit draw states in draw_cs outside of renderpass |
| - turnip: move up LRZ invalidate in CmdClearAttachments |
| - turnip: always set LRZ registers to zero for 3d clear/blit |
| - turnip: don't always use 3d ops for blit_image |
| |
| Jordan Justen (10): |
| |
| - intel/dev: Use GEN_GEN if defined for gen_device_info_is_9lp |
| - intel/dev: Add gen_device_info_is_12hp |
| - intel/genxml: Copy gen12.xml to gen125.xml |
| - intel/genxml: Build gen 12.5 |
| - intel/isl: Build gen 12.5 |
| - intel/anv: Build gen 12.5 |
| - intel/iris: Build gen 12.5 |
| - intel/compiler: Add GEN125 to enum gen |
| - intel/common: Build mi_builder_test for gen 12.5 |
| - iris: Fix android build due to missing link to libmesa_iris_gen125 |
| |
| Juan A. Suarez Romero (19): |
| |
| - ci: add testing for VC4 drivers (Raspberry Pi 3) |
| - util: function to check for rgbX format |
| - v3d: force alpha to 1 when rendering RGBX formats |
| - v3d: make set tile buffer size function public |
| - v3d: store number of color buffers in job |
| - v3d: split binning start from draw |
| - v3d: add helper to check if format supports TLB resolve |
| - v3d: implement tile buffer blits |
| - v3d: refactor set tile buffer size function |
| - v3d: implement tile-based blit operation |
| - v3d: remove old tile blit code |
| - v3d: use job's nr_cbufs field |
| - v3d: extend the list of formats supported by the TFU unit |
| - ci: Bump deqp to current vulkan-cts-1.2.5.0 |
| - doc/features: add VC4 driver |
| - v3d: reinterpret stencil data as uint texture in stencil blit path |
| - v3d: check blit mask inside blit subpaths |
| - v3d: add fast-path tile-based blit for depth/stencil buffers |
| - v3d: fix dest offset in TFU setup |
| |
| Karol Herbst (3): |
| |
| - clover/queue: Flush automatically if applications do not flush themselves |
| - tegra/context: fix regression in tegra_draw_vbo |
| - tegra/context: unwrap indirect_draw_count as well |
| |
| Keith Packard (1): |
| |
| - glx: Provide glvnd wrapper for glXSwapIntervalEXT |
| |
| Kenneth Graunke (16): |
| |
| - intel/compiler: Fix passthrough TCS regressions from program rename |
| - prog_to_nir: Revert name initialization change |
| - intel/compiler: Do interpolateAtOffset coordinate scaling in NIR |
| - intel/fs: Fix sampler message headers on Gen11+ when using scratch |
| - nir/algebraic: Avoid creating new fp64 ops when using softfp64 |
| - asm: Fix x86 assembly for inverse matrix operations |
| - asm: Try to fix sparc assembly for inverse matrix operations |
| - nir/lower_non_uniform: Use nir_read_first_invocation helper. |
| - vbo: Don't set node->min_index = max_index = indices_offset when merging |
| - vbo: Only mark merged line strips as lines when actually converting them |
| - tnl: Try not to botch index buffer munging when start \\> 0. |
| - tnl: Respect \`start\` when converting indices to GLuint |
| - tnl: Reset nr_bos to 0 between map/unmap cycles. |
| - Revert "mesa: allow half float textures based on ARB_half_float_pixel" |
| - iris: Consider resolves after changing a resource's aux state |
| - glsl/float64: Bump \#version to 400 |
| |
| Krunal Patel (1): |
| |
| - radeon/vce: Bitrate not updated when changing framerate |
| |
| Leo Liu (17): |
| |
| - vl: add AV1 codec picture support |
| - radeon/vcn: add AV1 codec driver firmware interfaces |
| - radeon/vcn: add AV1 support to the decoder |
| - radeon/vcn: add AV1 dpb buffer size |
| - radeon/vcn: add AV1 default tables for the context |
| - radeon/vcn: add AV1 context buffer |
| - radeon/vcn: fill up the context buffer |
| - radeon/vcn: get AV1 message buffer |
| - radeon/vcn: fill up the probs buffer |
| - radeonsi: cap AV1 codec configuration |
| - radeonsi: cap AV1 support to SIENNA CICHLID |
| - frontends/omx/bellagio: add AV1 initial support to omx dec |
| - frontends/omx/av1: add AV1 OBU header parsers |
| - frontends/omx/av1: add AV1 tasks management |
| - frontends/omx/av1: enable AV1 OMX Bellagio support |
| - mesa/st_vdpau: set surface winsys handle modifier |
| - frontends/omx: fix build warning |
| |
| Lionel Landwerlin (21): |
| |
| - intel/dump_gpu: add support for MMAP_OFFSET ioctl |
| - nir: don't consider txf_ms_mcs a query instruction |
| - st: trigger noop if the default value is not true |
| - mesa: add an environment variable to default enable INTEL_blackhole |
| - anv: fix descriptor pool leak in VMA object |
| - nir: wire shading rate variables |
| - compiler/nir: introduce a new helper to get varying name |
| - spirv: add support for KHR_fragment_shading_rate |
| - isl: Fix android build |
| - vulkan/overlay: don't display frame numbers unless required |
| - vulkan/overlay: add new options to display device/swapchain-format |
| - gallium/dri2: Don't forget protected content flag |
| - anv: add transfer usage for color/depth/stencil attachments |
| - intel/mi_builder: fix self modifying batches |
| - anv: Fix stencil layout in render passes |
| - anv: fix invalid programming of BLEND_STATE |
| - anv: only signal wsi fence BO on last command buffer |
| - anv: discard all timeline wait/signal value=0 |
| - anv: reset binary syncobj to be signaled before submission |
| - anv: don't wait for completion of work on vkQueuePresent() |
| - anv: Fix wait_count missing increment |
| |
| Louis-Francis Ratté-Boulianne (11): |
| |
| - gallium/nir: Wrap tgsi_to_nir header in extern C |
| - gallium/util: Wrap suballoc.h into extern C |
| - gallium: Wrap some header files into "extern C" |
| - d3d12: Add D3D12 WGL winsys |
| - wgl: Flush in-between resolving buffer and presenting |
| - wgl: Call flush_resource() before presenting |
| - wgl: Wait for fence when not using winsys framebuffer |
| - wgl: Create third buffer when drawing to front buffer |
| - wgl: Wrap stw_pixelformat.h into extern C |
| - d3d12: Release swapchain buffers before resizing them |
| - wgl: Don't crash in stw_make_current if current framebuffer is NULL |
| |
| Lucas Stach (2): |
| |
| - etnaviv: fix disabling of INT filter for real |
| - etnaviv: tex_state: fix miplevel selection |
| |
| Marcin Ślusarz (16): |
| |
| - nir: handle float atomics in copy propagation pass |
| - intel/tools/aubinator_error_decode: exit with an error on unknown option |
| - intel/tools/aubinator_error_decode: allow "-" as an input file |
| - intel/tools/aubinator_error_decode: allow 0 arguments |
| - iris: store copy of the border color in the border color hash table |
| - intel/tools/aubinator_error_decode: cleanup path/file handling |
| - intel/tools/aubinator_error_decode: fix small memory leaks |
| - svga: remove duplicated code |
| - iris: remove redundant check |
| - util/list: add list_is_linked |
| - nine: use list_is_linked |
| - gallium: use list_is_linked |
| - iris: use list_is_linked |
| - r600: use list_is_linked |
| - omx: use list_is_linked |
| - util/list: use helper function in list_is_singular |
| |
| Marek Olšák (278): |
| |
| - st/mesa: fix use-after-free when updating shader info in st_link_nir |
| - nir: optionally shuffle local invocation IDs for compute quad derivatives |
| - nir: rename needs_helper_invocations to needs_quad_helper_invocations |
| - nir: gather shader_info::needs_all_helper_invocations |
| - nir: optimize nir_lower_discard_to_demote to lower discard/demote both ways |
| - ac/llvm: fix demote inside conditional branches |
| - radeonsi: enable GL_EXT_demote_to_helper_invocation |
| - amd: add register enums for VRS |
| - radeonsi: add an option to enable 2x2 coarse shading for non-GUI elements |
| - mesa: add Driver.DrawTransformFeedback |
| - gallium: move count_from_stream_output into pipe_draw_indirect_info |
| - gallium: make pipe_draw_indirect_info \\* a draw_vbo parameter |
| - gallium/u_threaded: lift DIV_ROUND_UP to eliminate it for constant expressions |
| - gallium/u_threaded: clean up direct vs indirect draws |
| - gallium: add pipe_draw_info::index_bounds_valid |
| - gallium/u_threaded: improve draw merging by clearing pipe_draw_info fields |
| - gallium: add missing bits of the direct multi draw interface |
| - gallium: extend draw_vbo to support multi draws |
| - gallium/u_threaded: store start/count in min/max_index for better packing |
| - gallium/u_threaded: add support for multi draws |
| - mesa: clean up Driver.Draw parameter types |
| - mesa: clean up GLboolean types in draw.c |
| - mesa: remove constant drawID parameter from \_mesa_draw_arrays |
| - mesa: move primitive restart enablement determination from st/mesa to main |
| - mesa: index \_RestartIndex with index_size_shift |
| - mesa: add primitive restart state to Driver.Draw parameters |
| - mesa: don't FLUSH_VERTICES from primitive restart changes |
| - radeonsi: don't load DrawID for indirect draws if it's unused |
| - radeonsi: swap DrawId and StartInstance SGPR locations |
| - radeonsi: handle pipe_draw_info::increment_draw_id |
| - radeonsi: fix min_direct_count value |
| - radeonsi: do VGT_FLUSH when switching NGG -\> legacy on Sienna Cichlid |
| - radeonsi: only do VGT_FLUSH for fast launch if previous draw was normal launch |
| - radeonsi: determine correctly if switching from normal launch to fast launch |
| - radeonsi: don't subtract max_verts_per_prim from hw_max_esverts on gfx10.3 |
| - radeonsi: read vs_state_bits in vs_prolog correctly |
| - radeonsi: tweak triangle list culling performance for GS fast launch |
| - radeonsi: remove VS input loads when culling with rasterizer discard |
| - radeonsi: add options.inline_uniforms to the shader cache key |
| - ac: add build_alloca with an initializer |
| - ac: fix detection of Pro graphics |
| - ac: fix min/max_good_num_cu_per_sa on gfx10.3 with disabled SEs |
| - ac: rename num_render_backends -\> max_render_backends |
| - ac: rename num_sh_per_se -\> num_sa_per_se |
| - radeonsi: don't do VGT_FLUSH before fast launch on gfx10.3 |
| - radeonsi: don't add num_vbos_in_user_sgprs to the shader cache key for non-VS |
| - radeonsi: fix NGG streamout regression |
| - radeonsi: fix scan_instruction for bindless inc_wrap/dec_wrap atomics |
| - winsys/amdgpu: remove amdgpu_winsys_bo::u::sparse::flags |
| - winsys/amdgpu: remove amdgpu_winsys_bo::sparse |
| - winsys/amdgpu: replace amdgpu_winsys_bo::flags with pb_buffer::usage |
| - winsys/amdgpu: replace amdgpu_winsys_bo::initial_domain with pb_buffer::placement |
| - winsys/amdgpu: move amdgpu_winsys_bo::lock for better packing |
| - mesa: add glInternalSetError for glthread |
| - mesa: make error handling for glGetActiveUniform glthread-safe |
| - glthread: make glGetActiveUniform return without syncing |
| - mesa: lock Shared->BufferObjects only once for a glthread batch |
| - mesa: lock Shared->TexMutex only once for a glthread batch |
| - nir: fix gathering TCS cross invocation access with lowered IO |
| - nir: fix gathering patch IO usage with lowered IO |
| - ac/nir: fix a typo in ac_are_tessfactors_def_in_all_invocs |
| - radeonsi: adjust tess SGPRs to allow fully occupied 3 HS waves of triangles |
| - radeonsi: don't leave more than 8 unoccupied lanes in HS |
| - radeonsi: don't allocate LDS for TCS outputs if they are not read |
| - radeonsi: limit HS LDS usage per workgroup to 16K to allow at least 2 WGs/CU |
| - radeonsi: don't generate a dead conditional in si_write_tess_factors on gfx9+ |
| - radeonsi: merge TCS and TCS epilog conditional blocks |
| - radeonsi: always return void from si_build_wrapper_function |
| - radeonsi: if VS and TCS have the same number of threads, merge the conditonals |
| - radeonsi: remove unnecessary NULL checking in NIR tess functions |
| - ac/llvm: prepare for passing VS->TCS IO via VGPRs |
| - radeonsi: pass VS->TCS IO via VGPRs if VS and TCS have the same thread count |
| - radeonsi: don't insert barrier between VS/TCS if all TCS inputs come from VGPRs |
| - radeonsi: don't allocate LDS for TCS inputs if it's not used |
| - radeonsi: implement GS fast launch for indexed triangle strips |
| - mesa: don't duplicate allocation code in \_mesa_new_parameter_list_sized |
| - mesa: track ParameterValues size separately |
| - mesa: properly disallow param list reallocation |
| - mesa: don't print GL errors in release builds if MESA_DEBUG=silent |
| - mesa: call FLUSH_VERTICES before changing sampler uniforms |
| - mesa: move sampler condition for flushing into mesa_flush_vertices_for_uniforms |
| - mesa: skip redundant uniform updates for glUniform |
| - mesa: skip redundant uniform updates for glUniformMatrix |
| - mesa: skip redundant uniform updates for glUniformHandle |
| - mesa: don't read from destination memory when computing state parameter values |
| - mesa: replace \_mesa_problem with unreachable in fetch_state |
| - util: add a common ALIGN16 macro for m_matrix and u_threaded_context |
| - mesa: don't allocate matrices with malloc |
| - mesa: rework matrix statevar enums to remove excessive branching in fetch_state |
| - mesa: remove redundant \_math_matrix_analyse calls in fetch_state |
| - mesa: fix printing state parameters |
| - mesa: allow multi-slot program parameters |
| - mesa: demystify material_attrib() |
| - mesa: optimize setting gl_Light state parameters |
| - mesa: restructure gl_light vars to match the layout of gl_LightSource uniforms |
| - mesa: put constants before state vars for ffvp |
| - mesa: put constants before state vars for ARB programs |
| - mesa: take advantage of sorted parameters in \_mesa_load_state_parameters |
| - mesa: merge matrix state parameters for faster uploads (disabled) |
| - mesa: merge light state parameters for faster uploads (disabled) |
| - mesa: add helpers for drivers to load state parameters into buffers |
| - gallium: add PIPE_CAP_PREFER_REAL_BUFFER_IN_CONSTBUF0 |
| - st/mesa: add a faster path for uploading state parameters into constant buffers |
| - st/mesa: replace st_context::state::constants with a mask |
| - mesa: fix crashes in the no_error case of invalid glUniform calls |
| - mesa: skip glMultMatrix if the matrix is identity |
| - mesa: consider glPushMatrix a no-op change from the driver perspective |
| - mesa: canonicalize matrix in glPushMatrix to make glPopMatrix possibly a no-op |
| - mesa: memset matrices at initialization to enable memcpy on it |
| - mesa: treat glPopMatrix as a no-op state change if it doesn't change the matrix |
| - mesa: rewrite glPushAttrib/glPopAttrib to get rid of malloc |
| - mesa: add a fast path for restoring fixed-func tex state in glPopAttrib |
| - mesa: add a fast path for restoring light attributes in glPopAttrib |
| - mesa: reorganize gl_texture and sampler structures for glPush/PopAttrib |
| - mesa: optimize saving/restoring bound textures for glPush/PopAttrib |
| - mesa: reduce the size of gl_texture_attrib_node::Texture by about 90% |
| - mesa: skip \_mesa_set_enable in glPopAttrib if there are no changes |
| - mesa: optimize out no-op calls in glPopAttrib |
| - mesa: more optimizations in glPopAttrib (colormask, drawbuffers, coord replace) |
| - mesa: remove gl_texture_object references from glPush/PopAttrib stack |
| - mesa: allocate the attribute stack on demand |
| - st/mesa: fix uninitialized/random clip plane state vars in lower_ucp |
| - compiler: decrease STATE_LENGTH from 5 to 4 |
| - mesa: replace ParameterValueOffset[i\] with Parameters[i].ValueOffset |
| - radeonsi: print more fields in si_dump_shader_key |
| - radeonsi: always use a staging texture for linear 1D textures in VRAM |
| - radeonsi: correct the MAD/FMA support table |
| - radeonsi: use util_logbase2 instead of division by index_size |
| - radeonsi: fix a memory leak in si_create_dcc_retile_cs |
| - radeonsi: fix line stippling with LINES_ADJACENCY without GS |
| - radeonsi: fix max_lds_size warning in release builds |
| - winsys/radeon: don't use debug_get_option_noop in a hot path |
| - winsys/amdgpu: don't use debug_get_option_noop in a hot path |
| - radeonsi: unduplicate code setting MIN_COMPRESSED_BLOCK_SIZE |
| - radeonsi: enable NGG and NGG culling on gfx10.3 APUs by default |
| - radeonsi: add AMD_DEBUG=nofastlaunch for debugging |
| - radeonsi: eliminate shader code for disabled or masked color outputs |
| - radeonsi: fix a nasty bug in si_pm4.c |
| - radeonsi: only mask 1 CU for GS/VS waves on gfx10.3 |
| - ac,radeonsi: fix load_first_vertex |
| - radeonsi: don't update indexed flag in SGPR if it's unused |
| - radeonsi: don't update provoking vertex and outprim states in SGPR if unused |
| - ac: enable late allocation on VanGogh to increase perf |
| - radeonsi: disable WGP mode on gfx10.3 to prevent hangs |
| - radeonsi: don't invalidate emitted NUM_INSTANCES for u_blitter |
| - radeonsi: don't set DrawID and StartInstance if they are unused |
| - radeonsi: don't check for GS fast launch for NOT_EOP in the indexed case |
| - Revert "radeonsi: always return void from si_build_wrapper_function" |
| - vbo: remove gl_context dereferences when we can just subtract the pointer |
| - cso: remove unused code |
| - gallium: inline struct u_suballocator to remove dereferences |
| - cso: inline struct cso_cache to remove dereferences |
| - st/mesa: put pipe_screen \\* into st_context and use it |
| - st/mesa: move cso_context next to the other pointers |
| - r300,r600,radeonsi: inline struct radeon_cmdbuf to remove dereferences |
| - draw: add NIR support to draw_create_vertex_shader |
| - st/mesa: don't generate TGSI for the draw VS because it now supports NIR too |
| - st/mesa: remove less useful debug options in hot paths |
| - gallium: fix the PIPE_SHADER_CAP_SUPPORTED_IRS value for all drivers |
| - glthread: use glthread->used instead of glthread->next_batch->used |
| - glthread: use uint64_t to declare the batch buffer instead of align(8) |
| - glthread: change sizes to unsigned or size_t where needed |
| - glthread: count batch space in units of uint64_t elements |
| - gallium/u_threaded: don't pass index bounds to the driver to decrease overhead |
| - gallium/u_threaded: set has_user_indices = false in the driver thread |
| - gallium/u_threaded: don't copy the indexbuf pointer if we overwrite it |
| - gallium/u_threaded: don't make a local copy of pipe_draw_start_count |
| - gallium/u_threaded: optimize set_constant_buffer |
| - mesa: fix glPopAttrib for GL_COORD_REPLACE for r200 |
| - mesa: remove code for old (mostly unsupported) GL_NV_point_sprite |
| - mesa: remove MAX_3D_TEXTURE_LEVELS, MAX_CUBE_TEXTURE_LEVELS |
| - radeonsi: move si_screen_clear_buffer into si_compute_blit.c w/o SDMA option |
| - radeonsi: rename buffer functions so as not to reference rings |
| - radeonsi: remove SDMA support |
| - radeonsi: rename SI_TEST_DMA to SI_TEST_BLIT |
| - radeonsi: fix the blit test for SW_64KB_R_X |
| - radeonsi: initialize ctx and gfx_cs first, then allocators |
| - ac: add radeon_info::all_vram_visible for Smart Access Memory |
| - radeons: only force staging uploads for VRAM when all VRAM is not visible |
| - radeonsi: only use staging for linear textures when all VRAM is not visible |
| - radeonsi: unify uploaders and upload to VRAM if all VRAM is visible |
| - radeonsi: map PIPE_USAGE_STREAM to VRAM if all VRAM is visible |
| - winsys/amdgpu: use VRAM for command buffers if all VRAM is visible |
| - ac,radeonsi: implement GL_NV_compute_shader_derivatives |
| - st/mesa: enable compute shader derivatives in SPIR-V |
| - radeonsi: fix a crash in si_fence_server_sync |
| - ac: correct ac_shader_args types, remove sgpr_count |
| - ac: add shader return values into ac_shader_args |
| - radeonsi: split ac_shader_args initialization from LLVM code |
| - radeonsi: move si_create_function into si_shader_llvm.c |
| - radeonsi: move si_build_main_function into si_shader_llvm.c |
| - radeonsi: move si_llvm_compiler_shader and deps into si_shader_llvm.c |
| - ac: unify shader arguments that are duplicated |
| - ac/llvm: handle no_(un)signed_wrap NIR flags |
| - compiler: fix glsl_types.h compile failures when including as C++ in drivers |
| - gallium/util: allow including a few files in C++ |
| - amd/llvm: fix C++ compile failures |
| - radeonsi: allow including a few files from C++ |
| - radeonsi: fix future C++ compile failures and warnings |
| - radeonsi: resolve a tricky C++ failure with goto jumping over initializations |
| - radeonsi: rename si_state_draw.c to .cpp |
| - radeonsi: use a C++ template to decrease draw_vbo overhead by 13 % |
| - radeonsi: fix small primitive culling with MSAA force-disabled and smoothing |
| - radeonsi: disable NGG fast launch with indexed triangle strips to fix a hang |
| - radeonsi: improve a comment about an MSAA bug workaround |
| - nir_to_tgsi: fix NIR options instead of asserting |
| - draw: fix incorrect NIR support code |
| - mesa: fix assertion paramList->LastUniformIndex \\< paramList->FirstStateVarIndex |
| - mesa: remove unused LastUniformIndex |
| - mesa: overallocate program parameter values |
| - mesa: don't restore texture state into unbound textures in glPopAttrib |
| - mesa: call Driver.TexParameter in glPopAttrib to fix r100, r200, old nouveau |
| - gallium: pass pipe_stencil_ref by value (it has only 2 bytes) |
| - gallium: inline pipe_alpha_state to enable better DSA bitfield packing |
| - gallium: inline pipe_depth_state to decrease DSA state size by 4 bytes |
| - cso: don't pass blend_color through cso_context |
| - st/mesa: don't make a local copy of blend color |
| - cso: remove context and delete_state pointers from all CSOs |
| - cso: inline cso_construct_key |
| - gallium/util: fix util_can_blit_via_copy_region for conditional rendering |
| - st/mesa: don't do glCopyPixels via blit if depth bounds test is enabled |
| - st/mesa: relax requirements for doing glCopyPixels via blit |
| - st/mesa: skip glDrawPixels if it's totally clipped for all codepaths |
| - mesa: fix an overflow check for MultiDrawElements |
| - vbo: only set count and end when closing \_mesa_prim |
| - vbo: change the parameters of vbo_get_minmax_index to get rid of \_mesa_prim |
| - mesa: add Driver.DrawGallium\* functions to be used by main/draw.c |
| - gallium: add pipe_draw_info::index::gl_bo |
| - mesa: add a fallback for drivers not implementing Driver.DrawGallium\* |
| - vbo: add vbo_get_minmax_indices_gallium |
| - mesa: switch (Multi)DrawArrays to DrawGallium |
| - mesa: switch Draw(Range)Elements(BaseVertex) calls to DrawGallium |
| - mesa: switch MultiDrawElements(BaseVertex) to DrawGallium\* |
| - vbo: remove \_mesa_prim parameter from vbo_try_prim_conversion |
| - vbo: remove \_mesa_prim parameter from vbo_merge_draws |
| - vbo: remove \_mesa_prim parameter from vbo_copy_vertices |
| - vbo: switch immediate Begin/End to DrawGallium |
| - gallium/u_threaded: clear vertices_per_patch if prim type != PATCHES |
| - gallium: remove and emulate PIPE_CAP_MULTI_DRAW |
| - gallium: fix draw info setup in draw and utilities |
| - freedreno: fixes handling draw info |
| - iris: don't use index_bias if not indexed |
| - nouveau: fix handling draw info |
| - panfrost: don't use index_bias if not indexed |
| - r600: fix handling draw info |
| - swr: fix handling draw info |
| - svga: fix handling draw info |
| - vc4: don't use index_bias if indexed |
| - v3d: don't use index_bias if not indexed |
| - virgl: fix handling draw info |
| - st/mesa: implement Driver.DrawGallium callbacks |
| - gallium: remove PIPE_CAP_INFO_START_WITH_USER_INDICES and fix all drivers |
| - util: add AMD CPU family enums and enable L3 cache pinning on Zen3 |
| - ac,radeonsi: limit Smart Access Memory to Zen 3 and GFX10.3 due to perf issues |
| - radeonsi: add driconf options to enable/disable Smart Access Memory |
| - radeonsi: take color interpolation into account for shader variants |
| - util: replace UTIL_MAX_CPUS by util_cpu_caps.num_cpu_mask_bits |
| - st/mesa: simplify checking whether to pin threads to L3 |
| - st/mesa: fix a defect when st_validate_state was invoked for unused states |
| - mesa: add STATIC_ASSERTs to the STATE_LIGHT_ATTRIBS case |
| - mesa: fix a bug in merging light state parameters with unpacked uniforms |
| - mesa: fix a second bug in merging light state parameters with unpacked uniforms |
| - radeonsi: fix hang caused by for loop with exec=0 in LS and ES |
| - radeonsi: remove si_gs_prolog_bits::gfx9_prev_is_vs |
| - gallium: skip draws with count == 0 or instance_count == 0 in drivers |
| - mesa: skip draws w/ count == 0 and instance_count == 0 in draw_gallium_fallback |
| - vbo: fix a index buffer map failure with size = 0 in get_minmax_indices_gallium |
| - gallium/u_threaded: skip draws if user index buffer size has size == 0 |
| - mesa: always set valid index bounds for non-indexed draws for classic drivers |
| - mesa: fix alpha channel of ETC2_SRGB8 decompression for !bgra |
| - radeonsi: fix centroid with VRS coarse shading |
| - glthread: fix interpreting vertex size == GL_BGRA for vertex attribs |
| - mesa: flush glBegin/End before changing GL_DEPTH_STENCIL_TEXTURE_MODE |
| - i915: use align_calloc for the context to fix m32 crashes |
| - radeon,r200: use align_calloc for the context to fix m32 crashes |
| - nouveau_vieux: use align_calloc for the context to fix m32 crashes |
| - Revert "gallium/u_upload_mgr: allow use of FLUSH_EXPLICIT with persistent mappings" |
| - radeonsi: don't crash on NULL images in si_check_needs_implicit_sync |
| |
| Marek Vasut (1): |
| |
| - etnaviv: Fix rework ZSA into a derived state |
| |
| Marijn Suijten (3): |
| |
| - util: Do not insert uninitialized data if Android property is not set |
| - android: util: Add libcutils to Android.mk shared libs |
| - mesa/math: Fix address of array always returning true |
| |
| Mark Janes (1): |
| |
| - meson: add idep_mesautil to components using simple_mtx.h |
| |
| Martin Peres (1): |
| |
| - driconf: remove the redundant glx-extension-disabling options |
| |
| Matt Turner (2): |
| |
| - glcpp: Handle bison-3.6 error message changes |
| - turnip: Remove unused TU_DEBUG_IR3 flag |
| |
| Mauro Rossi (19): |
| |
| - android: gallium/aux: update old generated sources rules |
| - android: gallium/aux: Add GPU tracepoint mechanism |
| - android: freedreno: Add GPU tracepoints |
| - android: freedreno: Remove fd_log() |
| - android: freedreno/ir3: use python3 in gen rules |
| - android: radv: add libcutils shared dependency |
| - android: spirv: fix '::' typo in gen rules |
| - android: pan/bi: Add explicit dependency on the ISA helpers |
| - android: pan/bi: Generate bi_opcodes.{c,h} |
| - android: pan/bi: Generate instruction printer |
| - android: pan/bi: Generate builder routines |
| - android: pan/bi: Generate instruction packer for new IR |
| - android: pan/bi: Remove combine lowering |
| - android: pan/bi: Remove old IR packs |
| - android: pan/bi: Remove NIR->old IR |
| - android: pan/bi: Remove old IR opcode table |
| - android: ac/radv: fix typo in ac_rgp.h listed in Makefile.sources |
| - android: r600/sfn: add sfn_nir_lower_64bit.cpp to Makefile.sources |
| - android: pan/bi: reorder static dependencies in gallium/dri |
| |
| Michael Forney (1): |
| |
| - meson: add missing dependency on generated git_sha1.h |
| |
| Michael Tang (3): |
| |
| - microsoft/compiler: Add dedicated spirv_to_dxil libraries |
| - util: Implement os_read_file for Windows |
| - microsoft/compiler: Add spirv2dxil executable |
| |
| Michel Dänzer (33): |
| |
| - ac: Don't negate strstr return values in ac_query_gpu_info |
| - ci: Drop ci-templates-sha anchor |
| - ci: Update to current ci-templates |
| - ci: Use ci-fairy docker image instead of local git_archive one |
| - ci: Move sanity stage to the beginning of the pipeline |
| - ci: Squash "check mr/commits" jobs into a single sanity job |
| - ci: Make test-docs job depend on sanity job |
| - ci: Go back to previous ci-templates commit for debian.yml |
| - ci: Run git gc before creating Git cache tarball |
| - ci: Define global variable MESA_TEMPLATES_COMMIT for ci-templates commit |
| - ci: Append $MESA_TEMPLATES_COMMIT to image tags |
| - ci: Drop x86_build_old image |
| - ci: sanity job doesn't need the Git tree |
| - ci: Manual test jobs don't need the Git tree |
| - ci: Run sanity job automatically for forked branches as well |
| - ci: Move BASE_TAG expansion to FDO_BASE_IMAGE assignment |
| - ci: Add .use-base-image template |
| - ci: Adapt armhf_test job to MESA_TEMPLATES_COMMIT related changes |
| - docs: Adapt to FDO_DISTRIBUTION_TAG → MESA_IMAGE_TAG rename |
| - ci: .lava-test:amd64 template needs arm_build |
| - ci: Run sanity job only in pre-merge pipelines |
| - ci: Move deploy stage to the end of the pipeline |
| - wsi/x11: Set recognizable name for WSI swapchain queue thread |
| - wsi/x11: Always link against xcb-xrandr |
| - wsi/x11: Detect Xwayland |
| - wsi/x11: Use PresentOptionAsync for MAILBOX present mode with Xwayland |
| - wsi/x11: Treat IMMEDIATE present mode the same as MAILBOX for Xwayland |
| - ci: Rule out scheduled pipelines in .windows-build-rules |
| - ci: Add \*ignore_scheduled_pipelines to mesa/gallium rules templates |
| - wsi/x11: Use wsi_x11_get_connection in x11_present_to_x11_dri3 |
| - wsi/x11: Always free randr_reply in wsi_x11_connection_create |
| - wsi/x11: Make sure wsi_x11_connection::is_xwayland is always initialized |
| - wsi/x11: Use get_screen_resources_current in wsi_x11_detect_xwayland |
| |
| Michel Zou (16): |
| |
| - zink: fix build on windows |
| - util: fix -Wshift-count-overflow warning |
| - zink: fix unused variable warning |
| - libgl-gdi: add zink support |
| - spirv: workaround setjmp/longjmp crash on MinGW |
| - glsl: Drop mingw -O1 workaround for GCC>=7.3 |
| - util: fix mingw format-extra-args warning |
| - glapi: fix unused-function warning |
| - glsl: fix redefinition warning on win32 |
| - wgl: fix maybe-uninitialized warning |
| - softpipe: fix maybe-uninitialized warning |
| - gallium/tests: fix unused-but-set-variable warning |
| - llvmpipe: work around mingw compiler optimization bug |
| - meson: fix multiline string warning |
| - llvmpipe: fix unused variables warnings |
| - drisw: fix unused variables warnings |
| |
| Mike Blumenkrantz (113): |
| |
| - util/threaded_context: use driver's ubo alignment for constant buffer uploads |
| - zink: initial implementation of shader keys |
| - zink: refcount the shader cache |
| - zink: move shader key structs into their own header |
| - zink: fill in params for fs shader keys and flag shader for rebuild |
| - zink: put those shader keys to work fixing up fragment shaders |
| - zink: update shader modules in gfx program when flagged dirty |
| - zink: handle arbitrary border colors using VK_EXT_custom_border_color |
| - zink: track custom border color samplers and verify against device limits |
| - zink: add alternate ubo loader in ntv |
| - zink: assert all index values in ntv OpAccessChain constructor |
| - zink: initial shader key implementation |
| - zink: change a memcmp==0 to !memcmp |
| - zink: use shader keys for samplemask |
| - mesa/st: set reserved storage for params+values to 16 |
| - zink: fix direct image mapping offset |
| - zink: really fix direct image mapping offset (I mean it this time) |
| - st/pbo: fix pbo uploads without PIPE_CAP_TGSI_VS_LAYER_VIEWPORT |
| - st/mesa: set drawpixels swizzle before creating sampler view |
| - glsl/float64: make this compatible with glsl 330 |
| - zink: support frem shader op |
| - zink: add nir pass for splitting 64bit vertex attribs which cross slot boundaries |
| - zink: be more paranoid about array strides in ntv |
| - zink: add get_storage_class() ntv util |
| - zink: handle struct derefs in ntv |
| - zink: ntv formatting |
| - zink: add struct type support for ntv |
| - zink: add handling for 64bit values in spirv_builder |
| - zink: support nir_op_f2f32 |
| - zink: add handlers for some bitfield ops in ntv |
| - zink: set 64bit shader caps in ntv |
| - zink: change function params and asserts to permit 64bit types in ntv |
| - zink: add 64bit glsl basetype handling in ntv |
| - zink: handle 64bit constant loading in ntv |
| - zink: split ubo loading for 64bit types into 2x32bit loads |
| - zink: set nir options for 64bit handling based on feature presence |
| - zink: enable 64bit pipe caps |
| - mesa/st: run nir_lower_point_size_mov on geometry shaders based on cap |
| - mesa/st: do not run lower_psiz_mov on vertex shader if geometry shader is present |
| - mesa/st: tabs -\> spaces in st_program |
| - mesa/st: handle running nir lower passes for ucp and psiz in tess stage |
| - mesa/st: flag ST_NEW_CONSTANTS upon running nir_lower_point_size_mov |
| - mesa/st: set lower_point_size for tes/gs during program update |
| - zink: force stencil format for stencil-only samplers and swizzle the right component |
| - zink: add nir_op_bit_count to ntv |
| - zink: handle nir_op_ibitfield_extract: in ntv |
| - zink: handle nir_op_find_lsb and nir_op_ifind_msb in ntv |
| - zink: move rp hash functions further up in file |
| - zink: fix rp hash table |
| - zink: fix gl_SampleMaskIn handling |
| - zink: don't always run nir_lower_io_arrays_to_elements_no_indirects |
| - zink: add ntv handling for tess shader i/o variables |
| - zink: add handling for tess shader intrinsics |
| - zink: set up ntv init for tess shaders |
| - zink: set scoped barrier flag in nir options |
| - zink: pull xfb info from tess shader when applicable |
| - zink: set tess info in pipeline creation |
| - zink: support PIPE_PRIM_PATCHES |
| - zink: add handling for tcs and tes shader states |
| - zink: only run nir_lower_clip_halfz for last vertex processing stage |
| - zink: add push constant handling to get_storage_class() |
| - zink: add stubs for tess outer/inner level handling |
| - zink: implement passthrough tcs shader injection |
| - zink: handle partial writes to shader outputs |
| - zink: export tess shader pipe caps |
| - doc/features: mark off tessellation for zink |
| - zink: zero VkMemoryRequirements on init |
| - zink: fix debug utils init |
| - zink: handle null ubos |
| - zink: handle 0 as valid pipeline hash value |
| - zink: fix more instance detection stuff |
| - st/pbo: fix pbo uploads without PIPE_CAP_TGSI_VS_LAYER_VIEWPORT and skip gs |
| - zink: avoid replacing valid tcs with injected one |
| - zink: require KHR_maintenance2 for tessellation and set bottom-left origin |
| - zink: fix tess shader i/o variables |
| - zink: add KHR_draw_indirect_count detection |
| - zink: hook up IndirectCount draw commands |
| - zink: enable PIPE_CAP_MULTI_DRAW_INDIRECT(_PARAMS) caps |
| - features: mark off multidraw for zink |
| - radv: avoid oob read during clear |
| - zink: handle dynamic sampler array indexing for arb_gpu_shader5 |
| - zink: run nir_lower_tex for offsets if shaderImageGatherExtended is missing |
| - zink: use Offset param for txf ops |
| - zink: implement ARB_texture_gather |
| - zink: handle textureGather with Shadow-type samplers |
| - zink: enable PIPE_CAP_MAX_TEXTURE_GATHER_COMPONENTS |
| - features: mark off textureGather for zink |
| - zink: handle fs interpolation functions in ntv |
| - zink: set PIPE_CAP_MAX_VIEWPORTS |
| - zink: handle gl_SampleMaskIn loading in ntv |
| - zink: always load (gl_InstanceID - gl_BaseInstance) when loading gl_InstanceID |
| - zink: enable PIPE_CAP_START_INSTANCE |
| - zink: handle vertex streams |
| - zink: run nir_lower_dynamic_bo_access |
| - zink: handle arrays of ubos |
| - zink: GLSL 4.00 |
| - features: mark off GL 4.0 for zink |
| - zink: GLSL 410 |
| - features: mark off GL 4.1 for zink |
| - zink: handle non-const offsets for txf/tg4 ops |
| - nir: preserve explicit_binding in lower_atomics_to_ssbo |
| - zink: clamp shader input/output max values |
| - glcpp: disable 'windows' tests |
| - zink: flag gfx pipeline dirty using newer mechanism |
| - radv: null bo list pointer for null descriptors on update |
| - radv: zero the bo descriptor array when allocating a new set |
| - zink: fix streamout for tess stage |
| - zink: fix slot mapping for legacy gl io with tess stages |
| - zink: handle 1bit undef values in ntv |
| - gallium/trace: add a pipe_screen::get_compiler_options method |
| - mesa/st: clamp scissored clear regions to fb size |
| - zink: unset generated TCS if its parent TESS is unset |
| - zink: fix streamout emission for super-enhanced layouts |
| |
| Nanley Chery (32): |
| |
| - mesa: Add and use \_mesa_has_depth_float_channel |
| - mesa: Clamp some depth values in glClearBufferfv |
| - mesa: Clamp some depth values in glClearBufferfi |
| - iris: Add and use convert_depth_value |
| - iris: Use converted depth in clear_depth_stencil |
| - iris: Disable color fast-clears in iris_copy_region |
| - i965: Disable color fast-clears for miptree copy |
| - intel/blorp: Delete clear color conversions during copies |
| - iris: Stop quantizing the depth clear value |
| - iris: Fix resource ptr in resolve_sampler_views |
| - iris: Drop res variable in resolve_sampler_views |
| - iris: Stop using blorp_hiz_stencil_op |
| - intel/blorp: Drop support for STC_CCS resolves |
| - iris: Move STC case in get_copy_region_aux_settings |
| - iris: Support clears in more GPU-based copies |
| - iris: Don't prepare depth for stencil-aspect blits |
| - iris: Move depth-format assertion out of iris_blit |
| - iris: Use texture preparation helper in iris_blit |
| - iris: Increase use of pipe_resources in iris_blit |
| - iris: Loop through an aspect mask in iris_blit |
| - iris: Blit non-stencil according to aspect_mask |
| - iris: Use single-aspect formats more in iris_blit |
| - iris: Blit stencil according to aspect_mask |
| - iris: Explain how conditional aux accesses work |
| - iris: Make can_fast_clear_depth return constants |
| - iris: Disable conditional fast clears |
| - iris: Delete iris_resolve_conditional_render |
| - iris: Drop fast_clear_color's blorp_flags param |
| - dri: Restrict glthread for CS:GO to radeonsi |
| - gallium: Map \_DRI_IMAGE_FORMAT_NONE to NULL |
| - gallium: Flush GL API resources in eglCreateImage |
| - iris: Disable aux as needed in iris_flush_resource |
| |
| Neha Bhende (3): |
| |
| - meson: Don't build svgadrm on windows |
| - meson.build: Use SSE math for MinGW X86 build as per sse2 option |
| - meson.build: Disable zlib as per -Dzlib option |
| |
| Neil Armstrong (1): |
| |
| - kmsro: sync Android.mk GALLIUM_TARGET_DRIVERS |
| |
| Pavel Asyutchenko (1): |
| |
| - vulkan/overay: fix violation of VUID-VkDeviceCreateInfo-pNext-00373 |
| |
| Pierre Moreau (17): |
| |
| - clover: rename platform/device apis using strings |
| - clover/llvm: don't use strings for version handling. |
| - clover/spirv: avoid strings for version handling |
| - clover/api: Add extended versioning query for built-in kernels |
| - clover/api: Add extended versioning query for OpenCL C |
| - clover/spirv: Add version conversion utilities |
| - clover/spirv: Add function checking whether a binary contains SPIR-V |
| - clover/spirv: Change API to use std::string binaries |
| - clover/spirv: Add function checking the SPIR-V version |
| - clover/spirv: Use cl_version for SPIR-V versions (v2) |
| - clover: List supported ILs versions |
| - clover: Implement clCreateProgramWithILKHR |
| - clover: Handle CL_PROGRAM_IL in clGetProgramInfo |
| - clover/api: Implement CL_DEVICE_IL_VERSION |
| - clover: Advertise cl_khr_il_program |
| - clover: Implement clCreateProgramWithIL from OpenCL 2.1 |
| - clover: Expose cl_khr_extended_versioning |
| |
| Pierre-Eric Pelloux-Prayer (74): |
| |
| - radeonsi: remove unused NO_RB_PLUS flag |
| - radeonsi: remove AMD_DEBUG=zerovram flag |
| - mesa/gallium: add MESA_MAP_ONCE / PIPE_MAP_ONCE |
| - winsys/amdgpu: make RADEON_ALL_BOS a debug only feature |
| - amdgpu_bo: make cache_entry a extensible array |
| - radeonsi/gfx10: flush gfx cs on ngg -\> legacy transition |
| - ac: use bigger storage for ac_arg::arg_index / ac_shader_args::arg_count |
| - util: add a FALLTROUGH macro |
| - nir: update fallthrough comments |
| - gallium: update fallthrough comments |
| - xxhash: update fallthrough comments |
| - src/mesa: update fallthrough comments |
| - compiler/spirv: update fallthrough comments |
| - radeonsi: update fallthrough comments |
| - gallium/winsys: update fallthrough comments |
| - vbo: update fallthrough comments |
| - gallium/util: update fallthrough comments |
| - softpipe: update fallthrough comments |
| - gallium: update fallthrough comments |
| - radeon: update fallthrough comments |
| - llvmpipe: update fallthrough comments |
| - gallivm: update fallthrough comments |
| - nir/ntt: update fallthrough comments |
| - amd/ac: update fallthrough comments |
| - egl: update fallthrough comments |
| - tgsi: update fallthrough comments |
| - glx: update fallthrough comments |
| - Revert "Revert "radeonsi: use staging buffer uploads for most VRAM buffers"" |
| - gallium/u_threaded: fix staging and non-staging conflicts |
| - gallium/u_threaded: disable forced staging upload at runtime |
| - dlist: do not call \_mesa_lookup_list twice |
| - vbo/dlist: create an index buffer in compile_vertex_list |
| - vbo/dlist: convert LINE_STRIPS to LINES |
| - vbo/dlist: implement primitive merging |
| - util/hash_table: add \_mesa_hash_data_with_seed function |
| - mesa: optimize \_mesa_program_resource_location |
| - vbo/dlist: refactor prim_store/vertex_store allocations |
| - vbo/dlist: avoid splitting draw commands in multiple draws |
| - vbo/dlist: only use merged primitives when it's ok to do so |
| - driconf: add allow_incorrect_primitive_id option |
| - radeonsi: fix si_get_draw_start_count count value |
| - gallium/u_threaded: set has_user_indices = false for merged draws |
| - gallium/u_threaded: fix pipe_resource leak for staging transfer |
| - st/mesa: disable line stippling if pattern is all 1's |
| - driconf: add workaround for Enter The Gungeon |
| - egl: fix EGL_EXT_protected_content/surface mixup |
| - vbo/dlist: use a shared index buffer |
| - vdpau: fix -Wabsolute-value warning |
| - vdpau: fix invalid enum usage |
| - amd/addrlib: use cpp.has_argument() to filter compiler arguments |
| - tesselator: remove unused variable |
| - gallium/vl: merge identical h264/h265 enums |
| - radeonsi: fix redundant initializations |
| - mesa/st: fix redundant initialization |
| - radeonsi: pass radeon_cmdbuf to emit_cache_flush |
| - radeonsi: pass radeon_cmdbuf to si_cp_dma_wait_for_idle |
| - ac/sqtt: add ac_thread_trace_data |
| - ac/radv: move sqtt structs and helpers to amd/common |
| - ac/radv: move radv_rgp.c to ac |
| - ac/sqtt: move rgp/sqtt def to ac |
| - ac/sqtt: move ac_is_thread_trace_complete to ac |
| - ac/sqtt: move radv_get_expected_buffer_size to ac |
| - radeonsi: add radeon_set_uconfig_reg_seq_perfctr |
| - radeonsi: implement SQTT support |
| - ac/rgp: add missing include |
| - dri: enable glthread + radeonsi workaround for CS:GO |
| - st/mesa: consider texture view format for fbo blits |
| - mesa/fbo: don't check_end_texture_render on fb read change |
| - st/mesa: use the correct src format in ReadPixels |
| - radeonsi: invalidate compute sgprs in si_rebind_buffer |
| - radeonsi: inhibit clockgating when using SQTT |
| - radeonsi: properly set SPI_SHADER_PGM_HI_ES |
| - radeonsi: fix read from compute / write from draw sync |
| - radeonsi: fix si_check_render_feedback |
| |
| Rhys Perry (148): |
| |
| - radv/winsys: set has_dedicated_vram in the null winsys |
| - aco: don't combine precise max(min()) to med3 |
| - aco: fix combine_constant_comparison_ordering() NaN check with 16/64-bit |
| - aco: disallow various v_add_u32 opts if modifiers are used |
| - aco/tests: initialize debug function |
| - aco/tests: expand optimize.const_comparison_ordering tests |
| - aco/tests: add some more clamp combining tests |
| - nir: add nir_var_mem_ubo to nir_var_read_only_modes |
| - nir: allow reordering of loads from read-only modes |
| - aco: disable omod if the sign of zeros should be preserved |
| - aco: fix fp16 \*0.5 omod |
| - aco/tests: add output modifier tests |
| - aco: don't use SMEM for SSBO stores |
| - aco: create v_mad_u32_u24 |
| - nir: add nir_var_vec_indexable_modes |
| - nir/copy_prop_vars,nir/dead_write_vars: ignore read-only loads |
| - nir/loop_analyze: initialize loop variables on demand |
| - nir/search: check instr type before adding to worklist |
| - nir/search: check for changes before adding uses to worklist |
| - nir/deref: add helpers to lazily create paths |
| - nir/copy_prop_vars: use nir_deref_and_path |
| - nir/copy_prop_vars: avoid a duplicate lookup if src == vec_src |
| - aco: don't create v_mov_b32 in v_mul_imm() |
| - aco: count v_mul_lo_u32 as 16 cycles |
| - aco: create vgpr constant copies using v_bfrev_b32 |
| - aco: copy constant to sgpr in Builder::v_mul_imm() |
| - aco: try harder to not create v_mul_lo_u32 |
| - aco: use v_mul_imm() for some nir_op_imul |
| - aco/tests: add Builder::v_mul_imm() tests |
| - aco: fix v_mul_hi_u32_u24 format |
| - nir/unsigned_upper_bound: fix buffer overflow in search_phi_bcsel |
| - nir/unsigned_upper_bound: decrement num_sources_left before recursing |
| - radv/llvm,aco/ngg: fix large shift exponent in ngg_gs_vertex_lds_addr |
| - aco: fix GS with no outputs |
| - aco/ngg: fix division-by-zero in assertion |
| - nir/lower_non_uniform: improve code with the same texture, sampler indices |
| - nir: fix sampler_lod_parameters_pan indices |
| - nir: use a single canonical list of intrinsic indices |
| - nir: add bit_size_src for when the destination bit size matches a source |
| - nir: add destination bit-size information to more intrinsics |
| - nir: remove useless nir_builder_opcodes.h include |
| - nir: move nir_load_system_value() to nir_builder.h |
| - nir: add generated intrinsic builders |
| - spirv: use intrinsic builders |
| - glsl_to_nir: use intrinsic builders |
| - nir: use intrinsic builders |
| - radv: use intrinsic builders |
| - nir: make intrinsic order in nir_print consistent |
| - nir: fix intrinsic builders on MSVC C++ |
| - nir: fix nir_builder.h on MSVC C++ and GCC7. |
| - d3d12: remove hand-written intrinsic builders |
| - nir: add helpers for chasing resource bindings |
| - nir/opt_load_store_vectorize: use resource binding chasing helpers |
| - ac/nir: use binding chasing helpers |
| - aco: use binding chasing helpers |
| - radv: use FALLTHROUGH macro |
| - aco: use FALLTHROUGH macro |
| - nir/opt_sink: use common instruction removal/insertion helpers |
| - aco: don't assume src=lower when splitting self-intersecting copies |
| - aco: test self-intersecting copies when src=higher |
| - aco: remove sign-extension in constantValue64() |
| - aco: allow 64-bit literals if they can be sign/zero-extended from 32-bit |
| - aco: add get_const/is_constant_representable helpers |
| - aco: use v_lshrrev_b64 for 64-bit VGPR copies on GFX10+ |
| - aco: coalesce constant copies |
| - aco: clear operands in update_renames() |
| - aco: don't fill killed operands in update_renames() |
| - aco: remove rollback code in get_reg_create_vector() |
| - aco: repeat get_reg_create_vector() with increased register demand if fail |
| - aco: use clear() helper instead of writing reg file directly |
| - aco: simplify get_reg_impl() |
| - aco: remove rollback code around parallelcopy creation |
| - aco: remove rollback code for blocked fixed definitions |
| - aco: move update_renames() out of get_reg() |
| - aco: remove rollback code when making an instruction vop3 |
| - nir/lower_non_uniform: remove non_uniform flags after lowering |
| - nir: improve divergence analysis for loads with non-uniform resources |
| - nir/opt_access: don't ignore image arrays in process_variable() |
| - nir/opt_access: ignore barriers and coherent qualifier |
| - nir/opt_access: check restrict before marking a variable as readonly |
| - nir/opt_access: don't check restrict in can_reorder() |
| - nir/opt_access: rename can_reorder() and set ACCESS_NON_WRITEABLE in it |
| - nir/opt_access: add basic Vulkan support |
| - nir/opt_access: handle variable pointers |
| - nir/opt_access: consider global stores |
| - nir/opt_access: infer writeonly |
| - compiler: update gl_access_qualifier comments |
| - aco: fix various s_subb_u32 operands to SCC |
| - aco: rename s_subb_u32 operands to borrow |
| - nir/opt_access: don't ignore infer_non_readable |
| - aco: fix mbcnt_amd with wave32 |
| - aco: allow divergent mbcnt_amd masks |
| - aco: add block to worklist in mark_block_wqm() |
| - ac/llvm: insert phis before demote kill |
| - aco: fix incorrect address calculation for load_barycentric_at_sample |
| - ac/nir: use llvm.readcyclecounter for LLVM9+ |
| - nir/tests: fix callback for load/store vectorizer tests |
| - nir: allow 5 component vectors |
| - nir,spirv: add sparse texture fetches |
| - nir,spirv: add sparse image loads |
| - nir,spirv: implement SpvOpImageSparseTexelsResident |
| - nir: add sparse_residency_code_and |
| - nir/lower_tex: fix lower_tg4_offsets with sparse fetches |
| - vtn: support SpvCapabilitySparseResidency |
| - radv: implement CREATE_REQUIRE_FULL_SUBGROUPS_BIT with cswave32 |
| - nir: gather whether a compute shader uses non-quad subgroup intrinsics |
| - radv: workaround games which assume full subgroups if cswave32 is enabled |
| - nir/load_store_vectorize: don't ignore subgroup memory barriers |
| - nir: add nir_load_store_vectorize_options |
| - nir/load_store_vectorize: add data as callback args |
| - radv: vectorize shader I/O |
| - nir,radv: add and use nir_vectorize_tess_levels() |
| - aco: fix unreachable() for uniform 8/16-bit nir_op_mov from VGPR |
| - aco: fix MIMG_instruction::lwe comment |
| - aco: move MIMG VDATA to its own operand |
| - aco: implement nir_op_vec5 |
| - aco: implement sparse texture fetches |
| - aco: implement sparse image loads |
| - aco: form sparse load clauses |
| - ac/nir: implement nir_op_vec5 |
| - ac/nir: implement sparse image/texture loads |
| - radv: implement is_sparse_texels_resident and sparse_residency_code_and |
| - radv: support SpvCapabilitySparseResidency |
| - radv/winsys: set has_packed_math_16bit in null winsys |
| - nir/opt_vectorize: fix typo in instr_can_rewrite() |
| - nir/opt_vectorize: fix srcs_equal() with two different non-const |
| - aco: try to better align 8+ dword SGPR vectors |
| - aco: remove can_reorder semantic in get_sync_info_with_hack |
| - radv: add RADV_DEBUG=invariantgeom |
| - radv: set invariantgeom for Shadow of the Tomb Raider |
| - aco: improve nir_op_vec with constant operands |
| - aco/tests: don't rely on argument evaluation order |
| - nir/loop_unroll: unroll more aggressively if it can improve load scheduling |
| - aco: fix convert_to_SDWA() check in add_subdword_definition() |
| - radv,aco: don't use MUBUF for multi-channel loads on GFX8 with robustness2 |
| - aco: don't consider a phi trivial if same's register doesn't match the def |
| - radv: round-up num_records division in radv_flush_vertex_descriptors |
| - radv: correctly enable WGP_MODE for NGG and GS |
| - radv: correctly enable WGP_MODE for tessellation control |
| - aco: always set exec_live=false |
| - aco: do not flag all blocks WQM to ensure we enter all nested loops in WQM |
| - aco: add fallback algorithm in get_reg() |
| - aco/lower_phis: fix all_preds_uniform with continue_or_break |
| - aco: add missing usable_read2 check |
| - nir/opt_shrink_vectors: add option to skip shrinking image stores |
| - radv: don't shrink image stores for The Surge 2 |
| - radv: don't set sx_blend_opt_epsilon for V_028C70_COLOR_10_11_11 |
| - aco: calculate all p_as_uniform and v_readfirstlane_b32 sources in WQM |
| |
| Rob Clark (93): |
| |
| - freedreno: Drop fd_context_lock() and friends |
| - freedreno/drm: Convert to simple_mtx |
| - freedreno: debug cleanup |
| - freedreno: Convert to mesa_log*() |
| - freedreno: Fix spurious flush |
| - freedreno: batch-cache locking |
| - freedreno/a6xx: Texture cache locking |
| - freedreno: Use ctx seqno in batch cache key |
| - freedreno/drm: Make ring refcnt atomic again |
| - freedreno/batch: Move fd_batch_get_prologue() |
| - freedreno: Make fd_context_batch() return a reference |
| - freedreno: Add submit lock |
| - freedreno/drm: Drop growable submit_bos table |
| - freedreno/batch: Cleanup submit immediately after flush |
| - freedreno/drm: Rework APPEND() macro |
| - freedreno: Protect gmem_cache ralloc allocations |
| - mesa/fbo: Fix valgrind complaints |
| - mesa/bufferobj: Fix valgrind complaints |
| - nir: Fix nir_validate fail after nir_lower_tex |
| - freedreno/drm: Add some locking asserts |
| - freedreno/ir3: Add pass to deal with load_uniform base offsets |
| - freedreno/ir3: Fix crash in shader compile fail path |
| - freedreno: emit_marker() cleanup |
| - freedreno: Convert one last mtx_t -\> simple_mtx_t |
| - freedreno/a6xx: Clear control mem at context create |
| - freedreno/drm: Quiet timedout error msg |
| - freedreno/ir3: Fix valgrind complaint about streamout state |
| - util: Add helgrind support for simple_mtx |
| - util: Add helpers for various one-time-init patters |
| - nir: Use get_once() helper for one-time init's |
| - freedreno/ir3: Use get_once() for one-time init |
| - gallium/hud: Use do_once for one-time init |
| - mesa/st: Use do_once for one-time init |
| - util: Fix helgrind complaint about one-time init |
| - mesa: Fix helgrind complaint about one-time init |
| - gallium/trace: Fix helgrind complaint about one-time init |
| - tgsi: Fix helgrind complaint about one-time init |
| - mesa: Synchronize get_gl_override() |
| - util: Add property_get() fallback for android |
| - mesa: Use os_get_option() for MESA_*_OVERRIDE |
| - egl/surfaceless: glthread support |
| - egl/dri2: Drop some pointless ifdeffery |
| - util: Add helper to get FILE\* options |
| - gallium/aux: Add GPU tracepoint mechanism |
| - freedreno: Small log-parser.py cleanup |
| - freedreno: Remove unused fxn |
| - freedreno: Don't emit log/trace points in gmem for nondraw |
| - freedreno: Add GPU tracepoints |
| - freedreno: Add trace-parser.py |
| - freedreno: Remove fd_log() |
| - gallium/aux: Avoid creating queue when traces not enabled |
| - gallium/aux: Split u_tracepoints.[ch\] generation |
| - gallium/aux: Update scons build for u_tracepoints.[ch\] |
| - util: Promote \__builtin_types_compatible_p compat |
| - util: Allow STATIC_ASSERT() everywhere |
| - util+treewide: container_of() cleanup |
| - freedreno/ir3: Fix half-immed decoding issues |
| - freedreno/ir3: Fix mova1 disasm |
| - freedreno/ir3: Add some more disasm test vectors |
| - freedreno/ir3: Move assembler error handling |
| - freedreno/ir3/parser: Reset lexer when input changes |
| - freedreno/ir3: Various cat0 updates |
| - freedreno/ir3/parser: Add new cat0 instructions |
| - freedreno/ir3/parser: cat1 instructions can write relative GPR |
| - freedreno/ir3/parser: cat1 updates (mova1, movmsk) |
| - freedreno/ir3/parser: Handle half-immed |
| - freedreno/ir3: Clean up instruction creation |
| - freedreno/ir3: Cleanup cat6 load instructions |
| - freedreno/ir3/parser: Fix cat6 store encoding |
| - freedreno/ir3/parser: Fix dsxpp/dsypp encoding |
| - freedreno/ir3/parser: Fixup cat5 s2en instructions |
| - freedreno/ir3: Don't set bit for dest conversion for p0.c |
| - freedreno/ir3/parser: Add missing (sat) modifier |
| - freedreno/ir3/parser: Relative gpr/const can have modifiers too |
| - freedreno/ir3/parser: Add initial cat6 IBO instructions |
| - freedreno/ir3: Tweak ldib/resinfo encoding |
| - freedreno/ir3: Add parsing and assembler testing |
| - freedreno/ir3: Don't leak disk_cache |
| - freedreno/ir3: Disambiguate a6xx+ "bindless" instructions |
| - freedreno/ir3: Add cat5/cat6 nonuniform flag |
| - freedreno/ir3/parser: Add ldc support |
| - freedreno/ir3/parser: Fix atomic support |
| - freedreno/ir3/parser: Fix pre-a6xx resinfo |
| - freedreno/ir3/parser: Add ldgb support |
| - freedreno/ir3/parser: Add stgb support |
| - freedreno/ir3/parser: Fixup stg parsing and add more tests |
| - freedreno/ir3: Fix ldg decoding/parsing |
| - freedreno/ir3: Explicitly flag disasm test vectors that don't parse |
| - freedreno/ir3: Fix pre-a6xx ldgb/stib parsing |
| - freedreno/ir3/parser: a6xx ldib/stib parsing |
| - freedreno/ir3/parser: Fix pre-a6xx stib parsing |
| - mesa: Remove \_mesa_destroy_context() |
| - util/u_queue: Ensure num_cpu_mask_bits is valid |
| |
| Robin Ole Heinemann (1): |
| |
| - anv: Add DRM_RDWR flag in anv_gem_handle_to_fd |
| |
| Ruijing Dong (4): |
| |
| - radeon/vcn: hevc main10 profile decoding pitch fix |
| - radeon/vcn: add 0x02 to enc emulation prevention |
| - radeon/vcn: support hevc SAO enc for VCN2+ |
| - radeon/vcn: fix hevc 10bit profile error |
| |
| Ryan Neph (2): |
| |
| - virgl: fix BGRA emulation artifacts during window resize |
| - Revert "virgl: fix BGRA emulation artifacts during window resize" |
| |
| Sagar Ghuge (2): |
| |
| - anv: Invalidate the correct AUX-TT entry |
| - anv: Skip CCS ambiguate which preceed fast-clears |
| |
| Samuel Iglesias Gonsálvez (3): |
| |
| - turnip: implement VK_KHR_depth_stencil_resolve support |
| - turnip: pCounterBufferOffsets can be NULL on vkCmd*TransformFeedbackEXT() |
| - turnip: fix cube map array image size calculation |
| |
| Samuel Pitoiset (155): |
| |
| - aco: fix combining add/sub to b2i if a new dest needs to be allocated |
| - nir/algebraic: optimize bitfield_select(a, iand(a, b), c) |
| - aco/tests: add some tests for combining s_add+s_lshl to s_lshl<n>_add |
| - aco: combine more s_add+s_lshl to s_lshl<n>_add by ignoring uses |
| - aco: introduce a generic label for labelling instructions |
| - aco: add a new Operand flag to indicate that is 16-bit |
| - aco: optimize v_mad_u32_u16 with acc=0 to v_mul_u32_u24 |
| - aco: select v_mad_u32_u16 for 16-bit multiplications on GFX9+ |
| - aco: select v_mul_lo_u16 for 16-bit multiplications that can't overflow |
| - aco: optimize v_add_u32(v_mul_lo_u16) -\> v_mad_u32_u16 |
| - aco: optimize v_add(v_bcnt(a, 0), b) to v_bcnt(a, b) |
| - ci: update the list of skipped tests for RAVEN |
| - ci: update the list of expected failures for RADV |
| - aco: remove v_{add,sub,subrev}_u32 on GFX8 |
| - radv: do VGT_FLUSH when switching NGG -\> legacy on Sienna Cichlid |
| - radv: fix applying the NGG minimum vertex count requirement |
| - radv: don't count unusable vertices to the NGG LDS size |
| - radv: don't subtract max_verts_per_prim from hw_max_esverts on gfx10.3 |
| - aco: fix combining max(-min(a, b), c) if a or b uses the neg modifier |
| - radv/winsys: fill real PCIID for Sienna Cichlid and Navy Flounder |
| - radv/winsys: add missing Van Gogh and Dimgrey Cavefish in the null winsys |
| - ci: add list of expected failures for Sienna Cichlid |
| - radv: ignore other blend targets if dual-source blending is enabled |
| - radv: print more debug messages when generating a hang report |
| - radv: append a time string to the hang report dump directory |
| - radv: dump application info in the GPU hang report |
| - radv: add RADV_DEBUG=noumr to disable UMR logs during GPU hang detection |
| - radv: dump BO ranges into bo_ranges.log instead of stderr |
| - ci: fix name of the Sienna Cichlid expected failures file |
| - nir: fix gathering cross invocation info |
| - radv: add new vk_format_is_*() helpers |
| - ac,radv: use better export formats for 8-bit when RB+ isn't allowed |
| - aco/tests: extend the optimize.add_lshl tests to GFX8 |
| - aco: add a new Operand flag to indicate that is 24-bit |
| - aco: allow to use the range analysis UB in emit_{sop2,vop2}_instruction() |
| - aco: optimize v_add+s_lshl to v_mad_u32_u24 on GFX6-8 |
| - aco: optimize v_add+v_lshlrev to v_mad_u32_u24 on GFX6-8 |
| - ac: add gpu_info::has_32bit_predication |
| - radv: use 32-bit predication for conditional rendering on GFX10.3+ |
| - radv: always use 32-bit predication on compute queues |
| - radv: fix missing initialization of the predication value |
| - radv/winsys: fix the sysmem submission path for GFX6 |
| - radv: disable SQTT support for unsupported GPUs |
| - radv: fix using bitfields for debug/perftest options |
| - radv: save and dump vertex descriptors during GPU hang detection |
| - radv: enable NGG on GFX10.3 APUs by default |
| - radv: only disable CU2 & CU3 when NGG is enabled |
| - radv: only mask 1 CU for GS/VS waves on GFX10.3 |
| - radv: disable WGP_MODE for NGG on GFX10.3 |
| - radv/llvm,aco: always split typed vertex buffer loads on GFX6 and GFX10+ |
| - ci: disable check-commits |
| - Revert "radv/llvm,aco: always split typed vertex buffer loads on GFX6 and GFX10+" |
| - vulkan: add missing src_inc to the device select layer |
| - ci: build the Vulkan device select layer |
| - nir: gather if a fragment shader uses sample shading |
| - radv: reduce maxTransformFeedbackBufferDataSize to 512 |
| - radv: mark GFX10.3 as a non-conformant Vulkan implementation |
| - radv: fix exporting multiviews with NGG |
| - radv: set the predication boolean as 32-bit if necessary |
| - radv: use 32-bit predication for skipping FCE on GFX10.3+ |
| - radv: fix using FS sample shading if the linker optimized inputs away |
| - ci: update the list of expected failures for RADV/FIJI |
| - radv: enable using MSAA2x and MSAA4x sample locations on GFX10+ |
| - radv: advertise VK_EXT_sample_locations on GFX10+ |
| - ac/surface: initialize the FMASK slice size for GFX9+ |
| - radv: fix clearing FMASK for layered MSAA images on GFX9+ |
| - radv: disable alphaToOne feature |
| - amd/registers: add missing VRS registers |
| - radv: add VK_KHR_fragment_shading_rate but leave it disabled |
| - radv: implement VK_KHR_fragment_shading_rate |
| - radv/llvm: implement fragment shading rate |
| - aco: implement fragment shading rate |
| - radv: track if VRS is enabled to apply a workaround on GFX10.3 |
| - radv/llvm: implement a workaround for gl_FragCoord.z with VRS on GFX10.3 |
| - aco: implement a workaround for gl_FragCoord.z with VRS on GFX10.3 |
| - radv: advertise VK_KHR_fragment_shading_rate on GFX10.3+ |
| - radv: add support for resolving layered depth/stencil images |
| - radv: add missing DB flush after depth/stencil resolve operations |
| - radv: enable TC-compat HTILE for D32_SFLOAT+MSAA on GFX10+ |
| - radv: adjust the maximum number of coverage samples for VRS |
| - radv: fix maxFragmentShadingRateRasterizationSamples |
| - radv: remove useless push constants data when resolving ds attachments |
| - radv: ignore the mutable bit for TC-compatible HTILE |
| - radv: enable VK_EXT_line_rasterization on GFX9 |
| - radv: sort the extension table like Khronos |
| - radv: add code that checks if the extension table is sorted correctly |
| - radv: make sure FMASK compression is enabled for MSAA copies |
| - Revert "radv: use 32-bit predication for skipping FCE on GFX10.3+" |
| - radv: dump VA ranges history when a GPU hang is detected |
| - radv: add a Python script to check if a VA was ever valid |
| - radv: disable stippledBresenhamLines on GFX9 |
| - nir: fix determining if an addition might overflow for phi sources |
| - radv: disable A2 SNORM/SSCALED/SINT for texel buffers & images on all gens |
| - radv: fix clearing images with vkCmdClear{Color,DepthStencil}Image() |
| - radv: remove unused radv_image::aspects |
| - radv: always clear the SR0/SR1 bits of the HTILE buffer |
| - radv: fix potential HTILE issues for TC-compat images on GFX8 |
| - radv: add radv_htile_get_initial_value() and document the HTILE dword |
| - radv: fix TC-compat HTILE images with DST_OPTIMAL on the compute queue |
| - radv: clean up radv_layout_is_htile_compressed() |
| - radv: only load the DS fast clear values for compressed rendering |
| - radv: enable TC-compat HTILE in GENERAL on GFX10+ |
| - aco: fix creating the dest vector when 16-bit vertex fetches are splitted |
| - radv/llvm,aco: always split typed vertex buffer loads on GFX6 and GFX10+ |
| - radv: configure the texture descriptor for TC-compat CMASK on GFX10+ |
| - radv: fix enabling TC-compat HTILE in GENERAL for writes on GFX10+ |
| - radv: fix performance regression by restoring TC-compat HTILE in GENERAL |
| - radv: determine at creation if an image view can be fast cleared |
| - radv: do not predicate FMASK decompression when DCC+MSAA is used |
| - ci: re-mark some depth/stencil resolve CTS as expected failures |
| - radv: fix crashes when fast-clearing in a secondary command buffer |
| - radv: disable TC-compat HTILE in GENERAL for Detroit: Become Human |
| - radv: re-initialize HTILE properly after depth/stencil compute resolves |
| - radv: only re-initialize HTILE after ds compute resolves if compressed |
| - ac/surface: initialize dcc_slice_size on GFX9+ |
| - radv: add support for fast-clearing DCC layers on GFX9+ |
| - radv: clean up radv_decompress_dcc_compute() |
| - radv: do not use predication when the range doesn't cover the whole image |
| - radv: enable DCC for layered color images on GFX10+ |
| - radv: mark VK_IMAGE_CREATE_SPARSE_RESIDENCY_BIT as unsupported on GFX6-7 |
| - aco: fix inserting expcnt for MIMG on GFX6 |
| - ci: mark some sparse tests as expected failures on Pitcairn (GFX6) |
| - radv: mark some sparse texture CTS as expected failures on GFX9 |
| - radv: set depth to 1 for subpass resolves using the compute path |
| - radv: decompress DCC for partial resolves using the compute path |
| - radv: fixup DCC after color resolves using the compute path |
| - radv: fix color resolves if the dest image has DCC |
| - radv: fix clearing DCC on GFX9 |
| - radv: only use predication if the FCE value is allocated |
| - radv: allocate and initialize the FCE predicate value for CMASK too |
| - radv: update the FCE predicate for fast clears using CMASK |
| - radv: skip fast-clear eliminate for CMASK based on a predicate |
| - ac/surface: store DCC mip info into the surface |
| - radv: prevent fast-clearing uncompressed DCC levels |
| - radv: add support for fast-clearing DCC levels on GFX10+ |
| - radv: do not enable DCC for 3D images with mipmaps on GFX10+ |
| - radv: enable DCC for mipmaps on GFX10+ |
| - radv: disable VK_EXT_sample_locations again on GFX10+ |
| - radv: enable DCC for MSAA on GFX10+ |
| - radv: do not invalidate the L2 metadata cache on compute queues |
| - radv: flush L2 metadata as part of CB/DB flush instead of CS_DONE on GFX9 |
| - radv: restore invalidating the vector cache for internal meta operations |
| - radv: flush L2 for images affected by the pipe misaligned issue on GFX10+ |
| - ci: exclude one CTS test that timeout most of the time for RADV CI |
| - radv: fix a sync issue with geometry shader primitives query on GFX10+ |
| - radv: fix overflow when computing the SQTT buffer size |
| - radv: inhibit clock gating when tracing with SQTT |
| - radv: fix separate depth/stencil layout in render pass |
| - radv,aco: fix shifting input VGPRs for the LS VGPR init bug on GFX9 |
| - nir/algebraic: mark more optimization with fsat(NaN) as inexact |
| - radv: fix centroid with VRS coarse shading |
| - radv: fix waiting on the last enabled RB for occlusion queries |
| - radv: only apply the MRT output NaN fixup to non-meta shaders |
| - radv: set correct value for OFFCHIP_BUFFERING on GFX10+ |
| - radv: do not scale the depth bias for D16_UNORM depth surfaces |
| |
| Serge Martin (1): |
| |
| - clover: add core clover printf support (v12) |
| |
| Simon Ser (11): |
| |
| - amd/common: introduce ac_surface_print_info |
| - radeonsi: use ac_surface_print_info in si_print_texture_info |
| - radv: add img debug flag |
| - egl: fix typo in wl_drm error message |
| - egl/wayland: remove libwayland \\< 1.18 workaround |
| - ci: skip failing test on lavapipe |
| - radv: fix access to uninitialized radeon_bo_metadata |
| - egl/wayland: add a NULL guard for the authenticate callback |
| - radv: only set BO metadata for the first plane |
| - nouveau/nvc0: fix linear buffer alignment for scan-out/cursors |
| - nouveau/nv50: fix linear buffer alignment for scan-out/cursors |
| |
| Steven Houston (1): |
| |
| - v3dv: VK_KHR_display extension support |
| |
| Tapani Pälli (7): |
| |
| - egl/dri2: fix race between image create and egl_image_target_texture |
| - iris: initialize shared screen->vtbl only once |
| - mesa/st: choose S/D format depending on gl_format passed for readpixels |
| - anv: fix calculation of buffer size in case dynamic size is used |
| - mesa: fix layered framebuffer attachment target check |
| - vbo/dlist: free prim_store->prims when vbo_save is destroyed |
| - i965: use aligned malloc for context instead of ralloc |
| |
| Theogen Ratkin (1): |
| |
| - docs: grammar fixes |
| |
| Thong Thai (4): |
| |
| - frontends/va/postproc: Use the actual image height when blitting |
| - frontends/va/postproc: Convert destination when deinterlacing |
| - gallium: Fix VAAPI postproc blit |
| - frontends/va: Return an error if non-interlaced buffer is not supported |
| |
| Timothy Arceri (1): |
| |
| - glsl: default to compat shaders in compat profile |
| |
| Timur Kristóf (16): |
| |
| - nir: Use src_is_invocation_id in get_deref_info. |
| - aco/optimizer: Only set scc_needed when it is actually needed. |
| - aco/optimizer: Propagate scc_needed label through p_wqm. |
| - aco: Fix NGG GS assert failure from the WG scan. |
| - aco: Skip TCS s_barrier when VS outputs are not stored in the LDS. |
| - aco: Use program->num_waves as maximum in scheduler. |
| - aco: Keep live-though variables and constants spilled. |
| - aco: Spill more optimally before loops. |
| - aco: Note if rasterization can start early. |
| - aco: Wait for stores when NGG or legacy VS can finish early. |
| - ci: Add an expected failures list for Oland (GFX6) |
| - radv: Only enable sparse features on Polaris and newer. |
| - tgsi_to_nir: Fix uniform ranges. |
| - radv/llvm: Fix reporting LDS stats of tess control shaders. |
| - aco: Disallow LSHS temp-only I/O when VS output is written indirectly. |
| - aco: Fix LDS statistics of tess control shaders. |
| |
| Tomeu Vizoso (3): |
| |
| - ci: Temporarily disable jobs on the Collabora lab |
| - Revert "ci: Temporarily disable jobs on the Collabora lab" |
| - ci: Only run the sanity job if there's a MR |
| |
| Tony Wasserka (22): |
| |
| - glsl: Fix -Wshadow warning |
| - util: Fix/silence variable shadowing warnings |
| - meson: Treat LLVM headers as a system dependency |
| - aco: Fix -Wshadow warnings |
| - aco/tests: Fix -Wshadow warnings |
| - aco/tests: Fix -Wunused warnings in release mode |
| - radv: Fix -Wshadow warnings |
| - radv,aco: Compile with -Wshadow when available |
| - radv/query: Avoid hardcoding array size constants |
| - radv/winsys: Fix use of nonexisting struct type in sizeof |
| - aco: Annotate switch fallthroughs |
| - radv,aco: Compile with -Wimplicit-fallthrough when available |
| - gitlab: add RADV bug report template |
| - aco/ra: Add policy parameter to select implementation details for testing |
| - aco/tests: Fix GFX10_3 being printed as gfx11 |
| - aco/tests: Allow specifiying the test subvariant in setup_cs |
| - aco/tests: Fix deadlock for too large test lists |
| - aco: Add tests for subdword register allocation |
| - aco/ra: Add some documentation |
| - aco/ra: Fix register allocation for subdword operands |
| - aco/ra: Avoid redundant RegisterFile copies in get_reg_impl |
| - aco: Fix vector::reserve() being called with the wrong size |
| |
| Trevor Woerner (1): |
| |
| - docs/egl.rst: switch true→enabled |
| |
| Vinson Lee (55): |
| |
| - swr: Initialize FetchJit member mpFetchInfo in constructor. |
| - turnip: Remove pipeline NULL check. |
| - draw: Clean up single-use goto statements. |
| - glsl: Initialize ir_variable member field data.is_xfb. |
| - glsl: Fix typos in comments. |
| - microsoft/compiler: Add dxil_nir_lower_16bit_conv prototype. |
| - turnip: Fix file descriptor return. |
| - nvir/gm107: Initialize SchedDataCalculatorGM107 member score. |
| - vdpau: Add missing printf format specifier. |
| - v3dv: Remove unsigned comparison to zero. |
| - frontends/va: Fix \*num_entrypoints check. |
| - clover/spirv: Add missing break for SpvOpExecutionMode case. |
| - turnip: Close sync_fd only if it is a valid file descriptor. |
| - nv50/ir: Initialize GCRA members in constructor. |
| - microsoft/compiler: Add struct dxil_features forward declaration. |
| - microsoft/compiler: Add struct glsl_type forward declaration. |
| - microsoft/compiler: Add scope for declaration in case statement. |
| - r600/sfn: Fix typos. |
| - r600/sfn: Initialize ShaderFromNir members in constructor. |
| - r600/sb: Initialize sb_context members in constructor. |
| - clover: Initialize command_queue member \_props. |
| - nv50/ir: Initialize Program members in constructor. |
| - clover: Fix typo in comment. |
| - scons: Fix build with llvm-12. |
| - amd/addrlib: Initialize Lib members in constructors. |
| - util: Add os_get_page_size support for macOS. |
| - meson: Fix Clang microsoft-enum-value detection. |
| - meson: Fix build with llvm-12. |
| - r600/sfn: Initialize ShaderInputVarying members in constructors. |
| - mesa: Remove extra texObj. |
| - intel/genxml: Avoid generating identical 12.5 and 12 branches. |
| - mesa: Remove cmd_size \\< 0 check. |
| - zink: Fix typos. |
| - glsl: Fix typos in comments. |
| - glsl: Initialize glsl_type member name. |
| - vc4: Fix typos. |
| - d3d12: Fix memory leak if create_gfx_pipeline_state failed. |
| - d3d12: Fix memory leak if create_root_signature failed. |
| - v3d: Fix typos. |
| - nir/tests: Initialize nir_serialize_test member dup. |
| - d3d12: Fix memory leak if state is NULL. |
| - d3d12: Initialize TransitionableResourceState m_SupportsSimultaneousAccess. |
| - turnip: Remove unsigned nonnegative check. |
| - svga: Fix typos in comments. |
| - d3d12: Initialize local_resource member mapped in constructor. |
| - swr: Fix typos. |
| - virgl: Fix typos. |
| - softpipe: Fix typos. |
| - radeonsi: Fix typos. |
| - freedreno/afuc: Replace readfile with os_read_file. |
| - r300: Fix typos. |
| - clover: Add constructor for clover::module. |
| - nv50/ir: Initialize CodeEmitterGM107 members in constructor. |
| - etnaviv: Fix memory leak in etna_vertex_elements_state_create. |
| - aco: Initialize ds_state.front.writeMask. |
| |
| Víctor Manuel Jáquez Leal (1): |
| |
| - frontends/va/context: don't set max_references with num_render_targets |
| |
| Witold Baryluk (3): |
| |
| - zink: Cap PIPE_SHADER_CAP_MAX_CONST_BUFFERS to 32 |
| - vulkan/device_select: Store Vulkan vendorID and deviceID as uint32_t |
| - lavapipe: Defer lavapipe warning to CreateDevice |
| |
| X512 (13): |
| |
| - util: implement GET_PROGRAM_NAME for Haiku |
| - util/meson: Add libnetwork dependency for Haiku |
| - targets/haiku-softpipe/meson: add libswpipe.so to install directory |
| - hgl/meson: add version to libGL.so |
| - meson: fix Haiku EGL build; no dri requirement |
| - include: fix export in Haiku OpenGL kit headers |
| - hgl: use local headers instead of system header |
| - frontends/hgl: set state_manager |
| - frontends/hgl: set framebuffer id |
| - aux/driver_ddebug: Normalize pid type from Haiku |
| - targets/haiku-softpipe: Restore GalliumContext |
| - hgl: Major refactor and cleanup |
| - util/u_thread: Disable pthread_barrier_t on Haiku |
| |
| Yevhenii Kharchenko (2): |
| |
| - meson: Add build option to specify default shader disk cache max-size |
| - st/mesa: fix PBO download for TEXTURE_1D_ARRAY textures |
| |
| Yevhenii Kolesnikov (3): |
| |
| - intel/fs: don't spill a register, set by undef |
| - iris: only set point sprite overrides if actually using points |
| - nir/from_ssa: consider defs in sibling blocks |
| |
| Yogesh mohan marimuthu (1): |
| |
| - radeonsi: enable vrs2x2 coarse shading if flat shading (v9) |
| |
| Yuxuan Shui (1): |
| |
| - Add EGL xcb platform |
| |
| Zack Rusin (1): |
| |
| - meson.build: Order the flex/bison by odds of them working |
| |
| cheyang (5): |
| |
| - android: fix build failure with libbacktrace |
| - symbol_table:fix mesa symbol table return scope error |
| - glsl: remove unused state variable |
| - virgl: next_handle variable modify to atomic inc in virgl_object_assign_handle |
| - mesa: glProgramBinary add resource_hash |
| |
| jzielins (5): |
| |
| - swr: Pass draw start information to state update mechanism |
| - swr: fix crashes caused by incorrectly reporting SSBO support |
| - gallium/swr: Fix Windows build |
| - swr: Fix building with LLVM12 |
| - swr: Fix crashes on Windows |
| |
| nia (1): |
| |
| - util: Avoid pthread_setaffinity_np on NetBSD |
| |
| yshi18 (1): |
| |
| - iris: fix memleak for query_buffer_uploader |