Changelog in Linux kernel 6.6.92

ACPI: PPTT: Fix processor subtable walk [+ + +]

Author: Jeremy Linton <jeremy.linton@arm.com>
Date:   Wed May 7 21:30:25 2025 -0500

    ACPI: PPTT: Fix processor subtable walk
    
    commit adfab6b39202481bb43286fff94def4953793fdb upstream.
    
    The original PPTT code had a bug where the processor subtable length
    was not correctly validated when encountering a truncated
    acpi_pptt_processor node.
    
    Commit 7ab4f0e37a0f4 ("ACPI PPTT: Fix coding mistakes in a couple of
    sizeof() calls") attempted to fix this by validating the size is as
    large as the acpi_pptt_processor node structure. This introduced a
    regression where the last processor node in the PPTT table is ignored
    if it doesn't contain any private resources. That results errors like:
    
      ACPI PPTT: PPTT table found, but unable to locate core XX (XX)
      ACPI: SPE must be homogeneous
    
    Furthermore, it fails in a common case where the node length isn't
    equal to the acpi_pptt_processor structure size, leaving the original
    bug in a modified form.
    
    Correct the regression by adjusting the loop termination conditions as
    suggested by the bug reporters. An additional check performed after
    the subtable node type is detected, validates the acpi_pptt_processor
    node is fully contained in the PPTT table. Repeating the check in
    acpi_pptt_leaf_node() is largely redundant as the node is already
    known to be fully contained in the table.
    
    The case where a final truncated node's parent property is accepted,
    but the node itself is rejected should not be considered a bug.
    
    Fixes: 7ab4f0e37a0f4 ("ACPI PPTT: Fix coding mistakes in a couple of sizeof() calls")
    Reported-by: Maximilian Heyne <mheyne@amazon.de>
    Closes: https://lore.kernel.org/linux-acpi/20250506-draco-taped-15f475cd@mheyne-amazon/
    Reported-by: Yicong Yang <yangyicong@hisilicon.com>
    Closes: https://lore.kernel.org/linux-acpi/20250507035124.28071-1-yangyicong@huawei.com/
    Signed-off-by: Jeremy Linton <jeremy.linton@arm.com>
    Tested-by: Yicong Yang <yangyicong@hisilicon.com>
    Reviewed-by: Sudeep Holla <sudeep.holla@arm.com>
    Tested-by: Maximilian Heyne <mheyne@amazon.de>
    Cc: All applicable <stable@vger.kernel.org> # 7ab4f0e37a0f4: ACPI PPTT: Fix coding mistakes ...
    Link: https://patch.msgid.link/20250508023025.1301030-1-jeremy.linton@arm.com
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ALSA: es1968: Add error handling for snd_pcm_hw_constraint_pow2() [+ + +]

Author: Wentao Liang <vulab@iscas.ac.cn>
Date:   Wed May 14 17:24:44 2025 +0800

    ALSA: es1968: Add error handling for snd_pcm_hw_constraint_pow2()
    
    commit 9e000f1b7f31684cc5927e034360b87ac7919593 upstream.
    
    The function snd_es1968_capture_open() calls the function
    snd_pcm_hw_constraint_pow2(), but does not check its return
    value. A proper implementation can be found in snd_cx25821_pcm_open().
    
    Add error handling for snd_pcm_hw_constraint_pow2() and propagate its
    error code.
    
    Fixes: b942cf815b57 ("[ALSA] es1968 - Fix stuttering capture")
    Cc: stable@vger.kernel.org # v2.6.22
    Signed-off-by: Wentao Liang <vulab@iscas.ac.cn>
    Link: https://patch.msgid.link/20250514092444.331-1-vulab@iscas.ac.cn
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ALSA: seq: Fix delivery of UMP events to group ports [+ + +]

Author: Takashi Iwai <tiwai@suse.de>
Date:   Sun May 11 15:45:27 2025 +0200

    ALSA: seq: Fix delivery of UMP events to group ports
    
    [ Upstream commit ff7b190aef6cccdb6f14d20c5753081fe6420e0b ]
    
    When an event with UMP message is sent to a UMP client, the EP port
    receives always no matter where the event is sent to, as it's a
    catch-all port.  OTOH, if an event is sent to EP port, and if the
    event has a certain UMP Group, it should have been delivered to the
    associated UMP Group port, too, but this was ignored, so far.
    
    This patch addresses the behavior.  Now a UMP event sent to the
    Endpoint port will be delivered to the subscribers of the UMP group
    port the event is associated with.
    
    The patch also does a bit of refactoring to simplify the code about
    __deliver_to_subscribers().
    
    Fixes: 177ccf811df4 ("ALSA: seq: Support MIDI 2.0 UMP Endpoint port")
    Link: https://patch.msgid.link/20250511134528.6314-1-tiwai@suse.de
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: sh: SND_AICA should depend on SH_DMA_API [+ + +]

Author: Geert Uytterhoeven <geert+renesas@glider.be>
Date:   Tue May 13 09:31:04 2025 +0200

    ALSA: sh: SND_AICA should depend on SH_DMA_API
    
    [ Upstream commit 66e48ef6ef506c89ec1b3851c6f9f5f80b5835ff ]
    
    If CONFIG_SH_DMA_API=n:
    
        WARNING: unmet direct dependencies detected for G2_DMA
          Depends on [n]: SH_DREAMCAST [=y] && SH_DMA_API [=n]
          Selected by [y]:
          - SND_AICA [=y] && SOUND [=y] && SND [=y] && SND_SUPERH [=y] && SH_DREAMCAST [=y]
    
    SND_AICA selects G2_DMA.  As the latter depends on SH_DMA_API, the
    former should depend on SH_DMA_API, too.
    
    Fixes: f477a538c14d07f8 ("sh: dma: fix kconfig dependency for G2_DMA")
    Reported-by: kernel test robot <lkp@intel.com>
    Closes: https://lore.kernel.org/oe-kbuild-all/202505131320.PzgTtl9H-lkp@intel.com/
    Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://patch.msgid.link/b90625f8a9078d0d304bafe862cbe3a3fab40082.1747121335.git.geert+renesas@glider.be
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: ump: Fix a typo of snd_ump_stream_msg_device_info [+ + +]

Author: Takashi Iwai <tiwai@suse.de>
Date:   Sun May 11 16:11:45 2025 +0200

    ALSA: ump: Fix a typo of snd_ump_stream_msg_device_info
    
    [ Upstream commit dd33993a9721ab1dae38bd37c9f665987d554239 ]
    
    s/devince/device/
    
    It's used only internally, so no any behavior changes.
    
    Fixes: 37e0e14128e0 ("ALSA: ump: Support UMP Endpoint and Function Block parsing")
    Acked-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Link: https://patch.msgid.link/20250511141147.10246-1-tiwai@suse.de
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: usb-audio: Add sample rate quirk for Audioengine D1 [+ + +]

Author: Christian Heusel <christian@heusel.eu>
Date:   Mon May 12 22:23:37 2025 +0200

    ALSA: usb-audio: Add sample rate quirk for Audioengine D1
    
    commit 2b24eb060c2bb9ef79e1d3bcf633ba1bc95215d6 upstream.
    
    A user reported on the Arch Linux Forums that their device is emitting
    the following message in the kernel journal, which is fixed by adding
    the quirk as submitted in this patch:
    
        > kernel: usb 1-2: current rate 8436480 is different from the runtime rate 48000
    
    There also is an entry for this product line added long time ago.
    Their specific device has the following ID:
    
        $ lsusb | grep Audio
        Bus 001 Device 002: ID 1101:0003 EasyPass Industrial Co., Ltd Audioengine D1
    
    Link: https://bbs.archlinux.org/viewtopic.php?id=305494
    Fixes: 93f9d1a4ac593 ("ALSA: usb-audio: Apply sample rate quirk for Audioengine D1")
    Cc: stable@vger.kernel.org
    Signed-off-by: Christian Heusel <christian@heusel.eu>
    Link: https://patch.msgid.link/20250512-audioengine-quirk-addition-v1-1-4c370af6eff7@heusel.eu
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ALSA: usb-audio: Add sample rate quirk for Microdia JP001 USB Camera [+ + +]

Author: Nicolas Chauvet <kwizart@gmail.com>
Date:   Thu May 15 12:21:32 2025 +0200

    ALSA: usb-audio: Add sample rate quirk for Microdia JP001 USB Camera
    
    commit 7b9938a14460e8ec7649ca2e80ac0aae9815bf02 upstream.
    
    Microdia JP001 does not support reading the sample rate which leads to
    many lines of "cannot get freq at ep 0x84".
    This patch adds the USB ID to quirks.c and avoids those error messages.
    
    usb 7-4: New USB device found, idVendor=0c45, idProduct=636b, bcdDevice= 1.00
    usb 7-4: New USB device strings: Mfr=2, Product=1, SerialNumber=3
    usb 7-4: Product: JP001
    usb 7-4: Manufacturer: JP001
    usb 7-4: SerialNumber: JP001
    usb 7-4: 3:1: cannot get freq at ep 0x84
    
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Nicolas Chauvet <kwizart@gmail.com>
    Link: https://patch.msgid.link/20250515102132.73062-1-kwizart@gmail.com
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

binfmt_elf: Calculate total_size earlier [+ + +]

Author: Kees Cook <kees@kernel.org>
Date:   Wed May 8 10:31:47 2024 -0700

    binfmt_elf: Calculate total_size earlier
    
    [ Upstream commit 2d4cf7b190bbfadd4986bf5c34da17c1a88adf8e ]
    
    In preparation to support PT_LOAD with large p_align values on
    non-PT_INTERP ET_DYN executables (i.e. "static pie"), we'll need to use
    the total_size details earlier. Move this separately now to make the
    next patch more readable. As total_size and load_bias are currently
    calculated separately, this has no behavioral impact.
    
    Link: https://lore.kernel.org/r/20240508173149.677910-2-keescook@chromium.org
    Signed-off-by: Kees Cook <kees@kernel.org>
    Stable-dep-of: 11854fe263eb ("binfmt_elf: Move brk for static PIE even if ASLR disabled")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

binfmt_elf: elf_bss no longer used by load_elf_binary() [+ + +]

Author: Kees Cook <kees@kernel.org>
Date:   Thu Sep 28 20:24:30 2023 -0700

    binfmt_elf: elf_bss no longer used by load_elf_binary()
    
    [ Upstream commit 8ed2ef21ff564cf4a25c098ace510ee6513c9836 ]
    
    With the BSS handled generically via the new filesz/memsz mismatch
    handling logic in elf_load(), elf_bss no longer needs to be tracked.
    Drop the variable.
    
    Cc: Eric Biederman <ebiederm@xmission.com>
    Cc: Alexander Viro <viro@zeniv.linux.org.uk>
    Cc: Christian Brauner <brauner@kernel.org>
    Cc: linux-fsdevel@vger.kernel.org
    Cc: linux-mm@kvack.org
    Suggested-by: Eric Biederman <ebiederm@xmission.com>
    Tested-by: Pedro Falcato <pedro.falcato@gmail.com>
    Signed-off-by: Sebastian Ott <sebott@redhat.com>
    Link: https://lore.kernel.org/r/20230929032435.2391507-2-keescook@chromium.org
    Signed-off-by: Kees Cook <keescook@chromium.org>
    Stable-dep-of: 11854fe263eb ("binfmt_elf: Move brk for static PIE even if ASLR disabled")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

binfmt_elf: Honor PT_LOAD alignment for static PIE [+ + +]

Author: Kees Cook <kees@kernel.org>
Date:   Wed May 8 10:31:48 2024 -0700

    binfmt_elf: Honor PT_LOAD alignment for static PIE
    
    [ Upstream commit 3545deff0ec7a37de7ed9632e262598582b140e9 ]
    
    The p_align values in PT_LOAD were ignored for static PIE executables
    (i.e. ET_DYN without PT_INTERP). This is because there is no way to
    request a non-fixed mmap region with a specific alignment. ET_DYN with
    PT_INTERP uses a separate base address (ELF_ET_DYN_BASE) and binfmt_elf
    performs the ASLR itself, which means it can also apply alignment. For
    the mmap region, the address selection happens deep within the vm_mmap()
    implementation (when the requested address is 0).
    
    The earlier attempt to implement this:
    
      commit 9630f0d60fec ("fs/binfmt_elf: use PT_LOAD p_align values for static PIE")
      commit 925346c129da ("fs/binfmt_elf: fix PT_LOAD p_align values for loaders")
    
    did not take into account the different base address origins, and were
    eventually reverted:
    
      aeb7923733d1 ("revert "fs/binfmt_elf: use PT_LOAD p_align values for static PIE"")
    
    In order to get the correct alignment from an mmap base, binfmt_elf must
    perform a 0-address load first, then tear down the mapping and perform
    alignment on the resulting address. Since this is slightly more overhead,
    only do this when it is needed (i.e. the alignment is not the default
    ELF alignment). This does, however, have the benefit of being able to
    use MAP_FIXED_NOREPLACE, to avoid potential collisions.
    
    With this fixed, enable the static PIE self tests again.
    
    Reported-by: H.J. Lu <hjl.tools@gmail.com>
    Closes: https://bugzilla.kernel.org/show_bug.cgi?id=215275
    Link: https://lore.kernel.org/r/20240508173149.677910-3-keescook@chromium.org
    Signed-off-by: Kees Cook <kees@kernel.org>
    Stable-dep-of: 11854fe263eb ("binfmt_elf: Move brk for static PIE even if ASLR disabled")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

binfmt_elf: Leave a gap between .bss and brk [+ + +]

Author: Kees Cook <kees@kernel.org>
Date:   Fri Feb 16 22:25:44 2024 -0800

    binfmt_elf: Leave a gap between .bss and brk
    
    [ Upstream commit 2a5eb9995528441447d33838727f6ec1caf08139 ]
    
    Currently the brk starts its randomization immediately after .bss,
    which means there is a chance that when the random offset is 0, linear
    overflows from .bss can reach into the brk area. Leave at least a single
    page gap between .bss and brk (when it has not already been explicitly
    relocated into the mmap range).
    
    Reported-by: <y0un9n132@gmail.com>
    Closes: https://lore.kernel.org/linux-hardening/CA+2EKTVLvc8hDZc+2Yhwmus=dzOUG5E4gV7ayCbu0MPJTZzWkw@mail.gmail.com/
    Link: https://lore.kernel.org/r/20240217062545.1631668-2-keescook@chromium.org
    Signed-off-by: Kees Cook <keescook@chromium.org>
    Stable-dep-of: 11854fe263eb ("binfmt_elf: Move brk for static PIE even if ASLR disabled")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

binfmt_elf: Move brk for static PIE even if ASLR disabled [+ + +]

Author: Kees Cook <kees@kernel.org>
Date:   Fri Apr 25 15:45:06 2025 -0700

    binfmt_elf: Move brk for static PIE even if ASLR disabled
    
    [ Upstream commit 11854fe263eb1b9a8efa33b0c087add7719ea9b4 ]
    
    In commit bbdc6076d2e5 ("binfmt_elf: move brk out of mmap when doing
    direct loader exec"), the brk was moved out of the mmap region when
    loading static PIE binaries (ET_DYN without INTERP). The common case
    for these binaries was testing new ELF loaders, so the brk needed to
    be away from mmap to avoid colliding with stack, future mmaps (of the
    loader-loaded binary), etc. But this was only done when ASLR was enabled,
    in an attempt to minimize changes to memory layouts.
    
    After adding support to respect alignment requirements for static PIE
    binaries in commit 3545deff0ec7 ("binfmt_elf: Honor PT_LOAD alignment
    for static PIE"), it became possible to have a large gap after the
    final PT_LOAD segment and the top of the mmap region. This means that
    future mmap allocations might go after the last PT_LOAD segment (where
    brk might be if ASLR was disabled) instead of before them (where they
    traditionally ended up).
    
    On arm64, running with ASLR disabled, Ubuntu 22.04's "ldconfig" binary,
    a static PIE, has alignment requirements that leaves a gap large enough
    after the last PT_LOAD segment to fit the vdso and vvar, but still leave
    enough space for the brk (which immediately follows the last PT_LOAD
    segment) to be allocated by the binary.
    
    fffff7f20000-fffff7fde000 r-xp 00000000 fe:02 8110426 /sbin/ldconfig.real
    fffff7fee000-fffff7ff5000 rw-p 000be000 fe:02 8110426 /sbin/ldconfig.real
    fffff7ff5000-fffff7ffa000 rw-p 00000000 00:00 0
    ***[brk will go here at fffff7ffa000]***
    fffff7ffc000-fffff7ffe000 r--p 00000000 00:00 0       [vvar]
    fffff7ffe000-fffff8000000 r-xp 00000000 00:00 0       [vdso]
    fffffffdf000-1000000000000 rw-p 00000000 00:00 0      [stack]
    
    After commit 0b3bc3354eb9 ("arm64: vdso: Switch to generic storage
    implementation"), the arm64 vvar grew slightly, and suddenly the brk
    collided with the allocation.
    
    fffff7f20000-fffff7fde000 r-xp 00000000 fe:02 8110426 /sbin/ldconfig.real
    fffff7fee000-fffff7ff5000 rw-p 000be000 fe:02 8110426 /sbin/ldconfig.real
    fffff7ff5000-fffff7ffa000 rw-p 00000000 00:00 0
    ***[oops, no room any more, vvar is at fffff7ffa000!]***
    fffff7ffa000-fffff7ffe000 r--p 00000000 00:00 0       [vvar]
    fffff7ffe000-fffff8000000 r-xp 00000000 00:00 0       [vdso]
    fffffffdf000-1000000000000 rw-p 00000000 00:00 0      [stack]
    
    The solution is to unconditionally move the brk out of the mmap region
    for static PIE binaries. Whether ASLR is enabled or not does not change if
    there may be future mmap allocation collisions with a growing brk region.
    
    Update memory layout comments (with kernel-doc headings), consolidate
    the setting of mm->brk to later (it isn't needed early), move static PIE
    brk out of mmap unconditionally, and make sure brk(2) knows to base brk
    position off of mm->start_brk not mm->end_data no matter what the cause of
    moving it is (via current->brk_randomized).
    
    For the CONFIG_COMPAT_BRK case, though, leave the logic unchanged, as we
    can never safely move the brk. These systems, however, are not using
    specially aligned static PIE binaries.
    
    Reported-by: Ryan Roberts <ryan.roberts@arm.com>
    Closes: https://lore.kernel.org/lkml/f93db308-4a0e-4806-9faf-98f890f5a5e6@arm.com/
    Fixes: bbdc6076d2e5 ("binfmt_elf: move brk out of mmap when doing direct loader exec")
    Link: https://lore.kernel.org/r/20250425224502.work.520-kees@kernel.org
    Reviewed-by: Ryan Roberts <ryan.roberts@arm.com>
    Tested-by: Ryan Roberts <ryan.roberts@arm.com>
    Signed-off-by: Kees Cook <kees@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

binfmt_elf: Support segments with 0 filesz and misaligned starts [+ + +]

Author: Eric W. Biederman <ebiederm@xmission.com>
Date:   Thu Sep 28 20:24:29 2023 -0700

    binfmt_elf: Support segments with 0 filesz and misaligned starts
    
    [ Upstream commit 585a018627b4d7ed37387211f667916840b5c5ea ]
    
    Implement a helper elf_load() that wraps elf_map() and performs all
    of the necessary work to ensure that when "memsz > filesz" the bytes
    described by "memsz > filesz" are zeroed.
    
    An outstanding issue is if the first segment has filesz 0, and has a
    randomized location. But that is the same as today.
    
    In this change I replaced an open coded padzero() that did not clear
    all of the way to the end of the page, with padzero() that does.
    
    I also stopped checking the return of padzero() as there is at least
    one known case where testing for failure is the wrong thing to do.
    It looks like binfmt_elf_fdpic may have the proper set of tests
    for when error handling can be safely completed.
    
    I found a couple of commits in the old history
    https://git.kernel.org/pub/scm/linux/kernel/git/tglx/history.git,
    that look very interesting in understanding this code.
    
    commit 39b56d902bf3 ("[PATCH] binfmt_elf: clearing bss may fail")
    commit c6e2227e4a3e ("[SPARC64]: Missing user access return value checks in fs/binfmt_elf.c and fs/compat.c")
    commit 5bf3be033f50 ("v2.4.10.1 -> v2.4.10.2")
    
    Looking at commit 39b56d902bf3 ("[PATCH] binfmt_elf: clearing bss may fail"):
    >  commit 39b56d902bf35241e7cba6cc30b828ed937175ad
    >  Author: Pavel Machek <pavel@ucw.cz>
    >  Date:   Wed Feb 9 22:40:30 2005 -0800
    >
    >     [PATCH] binfmt_elf: clearing bss may fail
    >
    >     So we discover that Borland's Kylix application builder emits weird elf
    >     files which describe a non-writeable bss segment.
    >
    >     So remove the clear_user() check at the place where we zero out the bss.  I
    >     don't _think_ there are any security implications here (plus we've never
    >     checked that clear_user() return value, so whoops if it is a problem).
    >
    >     Signed-off-by: Pavel Machek <pavel@suse.cz>
    >     Signed-off-by: Andrew Morton <akpm@osdl.org>
    >     Signed-off-by: Linus Torvalds <torvalds@osdl.org>
    
    It seems pretty clear that binfmt_elf_fdpic with skipping clear_user() for
    non-writable segments and otherwise calling clear_user(), aka padzero(),
    and checking it's return code is the right thing to do.
    
    I just skipped the error checking as that avoids breaking things.
    
    And notably, it looks like Borland's Kylix died in 2005 so it might be
    safe to just consider read-only segments with memsz > filesz an error.
    
    Reported-by: Sebastian Ott <sebott@redhat.com>
    Reported-by: Thomas Weißschuh <linux@weissschuh.net>
    Closes: https://lkml.kernel.org/r/20230914-bss-alloc-v1-1-78de67d2c6dd@weissschuh.net
    Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>
    Link: https://lore.kernel.org/r/87sf71f123.fsf@email.froward.int.ebiederm.org
    Tested-by: Pedro Falcato <pedro.falcato@gmail.com>
    Signed-off-by: Sebastian Ott <sebott@redhat.com>
    Link: https://lore.kernel.org/r/20230929032435.2391507-1-keescook@chromium.org
    Signed-off-by: Kees Cook <keescook@chromium.org>
    Stable-dep-of: 11854fe263eb ("binfmt_elf: Move brk for static PIE even if ASLR disabled")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Bluetooth: btnxpuart: Fix kernel panic during FW release [+ + +]

Author: Neeraj Sanjay Kale <neeraj.sanjaykale@nxp.com>
Date:   Mon Mar 10 17:32:31 2025 +0530

    Bluetooth: btnxpuart: Fix kernel panic during FW release
    
    commit 1f77c05408c96bc0b58ae476a9cadc9e5b9cfd0f upstream.
    
    This fixes a kernel panic seen during release FW in a stress test
    scenario where WLAN and BT FW download occurs simultaneously, and due to
    a HW bug, chip sends out only 1 bootloader signatures.
    
    When driver receives the bootloader signature, it enters FW download
    mode, but since no consequtive bootloader signatures seen, FW file is
    not requested.
    
    After 60 seconds, when FW download times out, release_firmware causes a
    kernel panic.
    
    [ 2601.949184] Unable to handle kernel paging request at virtual address 0000312e6f006573
    [ 2601.992076] user pgtable: 4k pages, 48-bit VAs, pgdp=0000000111802000
    [ 2601.992080] [0000312e6f006573] pgd=0000000000000000, p4d=0000000000000000
    [ 2601.992087] Internal error: Oops: 0000000096000021 [#1] PREEMPT SMP
    [ 2601.992091] Modules linked in: algif_hash algif_skcipher af_alg btnxpuart(O) pciexxx(O) mlan(O) overlay fsl_jr_uio caam_jr caamkeyblob_desc caamhash_desc caamalg_desc crypto_engine authenc libdes crct10dif_ce polyval_ce snd_soc_fsl_easrc snd_soc_fsl_asoc_card imx8_media_dev(C) snd_soc_fsl_micfil polyval_generic snd_soc_fsl_xcvr snd_soc_fsl_sai snd_soc_imx_audmux snd_soc_fsl_asrc snd_soc_imx_card snd_soc_imx_hdmi snd_soc_fsl_aud2htx snd_soc_fsl_utils imx_pcm_dma dw_hdmi_cec flexcan can_dev
    [ 2602.001825] CPU: 2 PID: 20060 Comm: hciconfig Tainted: G         C O       6.6.23-lts-next-06236-gb586a521770e #1
    [ 2602.010182] Hardware name: NXP i.MX8MPlus EVK board (DT)
    [ 2602.010185] pstate: 60000005 (nZCv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
    [ 2602.010191] pc : _raw_spin_lock+0x34/0x68
    [ 2602.010201] lr : free_fw_priv+0x20/0xfc
    [ 2602.020561] sp : ffff800089363b30
    [ 2602.020563] x29: ffff800089363b30 x28: ffff0000d0eb5880 x27: 0000000000000000
    [ 2602.020570] x26: 0000000000000000 x25: ffff0000d728b330 x24: 0000000000000000
    [ 2602.020577] x23: ffff0000dc856f38
    [ 2602.033797] x22: ffff800089363b70 x21: ffff0000dc856000
    [ 2602.033802] x20: ff00312e6f006573 x19: ffff0000d0d9ea80 x18: 0000000000000000
    [ 2602.033809] x17: 0000000000000000 x16: 0000000000000000 x15: 0000aaaad80dd480
    [ 2602.083320] x14: 0000000000000000 x13: 00000000000001b9 x12: 0000000000000002
    [ 2602.083326] x11: 0000000000000000 x10: 0000000000000a60 x9 : ffff800089363a30
    [ 2602.083333] x8 : ffff0001793d75c0 x7 : ffff0000d6dbc400 x6 : 0000000000000000
    [ 2602.083339] x5 : 00000000410fd030 x4 : 0000000000000000 x3 : 0000000000000001
    [ 2602.083346] x2 : 0000000000000000 x1 : 0000000000000001 x0 : ff00312e6f006573
    [ 2602.083354] Call trace:
    [ 2602.083356]  _raw_spin_lock+0x34/0x68
    [ 2602.083364]  release_firmware+0x48/0x6c
    [ 2602.083370]  nxp_setup+0x3c4/0x540 [btnxpuart]
    [ 2602.083383]  hci_dev_open_sync+0xf0/0xa34
    [ 2602.083391]  hci_dev_open+0xd8/0x178
    [ 2602.083399]  hci_sock_ioctl+0x3b0/0x590
    [ 2602.083405]  sock_do_ioctl+0x60/0x118
    [ 2602.083413]  sock_ioctl+0x2f4/0x374
    [ 2602.091430]  __arm64_sys_ioctl+0xac/0xf0
    [ 2602.091437]  invoke_syscall+0x48/0x110
    [ 2602.091445]  el0_svc_common.constprop.0+0xc0/0xe0
    [ 2602.091452]  do_el0_svc+0x1c/0x28
    [ 2602.091457]  el0_svc+0x40/0xe4
    [ 2602.091465]  el0t_64_sync_handler+0x120/0x12c
    [ 2602.091470]  el0t_64_sync+0x190/0x194
    
    Fixes: e3c4891098c8 ("Bluetooth: btnxpuart: Handle FW Download Abort scenario")
    Fixes: 689ca16e5232 ("Bluetooth: NXP: Add protocol support for NXP Bluetooth chipsets")
    Signed-off-by: Neeraj Sanjay Kale <neeraj.sanjaykale@nxp.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Bin Lan <bin.lan.cn@windriver.com>
    Signed-off-by: He Zhe <zhe.he@windriver.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Bluetooth: MGMT: Fix MGMT_OP_ADD_DEVICE invalid device flags [+ + +]

Author: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
Date:   Tue Apr 29 15:05:59 2025 -0400

    Bluetooth: MGMT: Fix MGMT_OP_ADD_DEVICE invalid device flags
    
    [ Upstream commit 1e2e3044c1bc64a64aa0eaf7c17f7832c26c9775 ]
    
    Device flags could be updated in the meantime while MGMT_OP_ADD_DEVICE
    is pending on hci_update_passive_scan_sync so instead of setting the
    current_flags as cmd->user_data just do a lookup using
    hci_conn_params_lookup and use the latest stored flags.
    
    Fixes: a182d9c84f9c ("Bluetooth: MGMT: Fix Add Device to responding before completing")
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

bpf, arm64: Fix address emission with tag-based KASAN enabled [+ + +]

Author: Peter Collingbourne <pcc@google.com>
Date:   Fri Oct 18 15:16:43 2024 -0700

    bpf, arm64: Fix address emission with tag-based KASAN enabled
    
    commit a552e2ef5fd1a6c78267cd4ec5a9b49aa11bbb1c upstream.
    
    When BPF_TRAMP_F_CALL_ORIG is enabled, the address of a bpf_tramp_image
    struct on the stack is passed during the size calculation pass and
    an address on the heap is passed during code generation. This may
    cause a heap buffer overflow if the heap address is tagged because
    emit_a64_mov_i64() will emit longer code than it did during the size
    calculation pass. The same problem could occur without tag-based
    KASAN if one of the 16-bit words of the stack address happened to
    be all-ones during the size calculation pass. Fix the problem by
    assuming the worst case (4 instructions) when calculating the size
    of the bpf_tramp_image address emission.
    
    Fixes: 19d3c179a377 ("bpf, arm64: Fix trampoline for BPF_TRAMP_F_CALL_ORIG")
    Signed-off-by: Peter Collingbourne <pcc@google.com>
    Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
    Acked-by: Xu Kuohai <xukuohai@huawei.com>
    Link: https://linux-review.googlesource.com/id/I1496f2bc24fba7a1d492e16e2b94cf43714f2d3c
    Link: https://lore.kernel.org/bpf/20241018221644.3240898-1-pcc@google.com
    [Minor context change fixed.]
    Signed-off-by: Bin Lan <bin.lan.cn@windriver.com>
    Signed-off-by: He Zhe <zhe.he@windriver.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

bpf, arm64: Fix trampoline for BPF_TRAMP_F_CALL_ORIG [+ + +]

Author: Puranjay Mohan <puranjay@kernel.org>
Date:   Thu Jul 11 15:18:38 2024 +0000

    bpf, arm64: Fix trampoline for BPF_TRAMP_F_CALL_ORIG
    
    commit 19d3c179a37730caf600a97fed3794feac2b197b upstream.
    
    When BPF_TRAMP_F_CALL_ORIG is set, the trampoline calls
    __bpf_tramp_enter() and __bpf_tramp_exit() functions, passing them
    the struct bpf_tramp_image *im pointer as an argument in R0.
    
    The trampoline generation code uses emit_addr_mov_i64() to emit
    instructions for moving the bpf_tramp_image address into R0, but
    emit_addr_mov_i64() assumes the address to be in the vmalloc() space
    and uses only 48 bits. Because bpf_tramp_image is allocated using
    kzalloc(), its address can use more than 48-bits, in this case the
    trampoline will pass an invalid address to __bpf_tramp_enter/exit()
    causing a kernel crash.
    
    Fix this by using emit_a64_mov_i64() in place of emit_addr_mov_i64()
    as it can work with addresses that are greater than 48-bits.
    
    Fixes: efc9909fdce0 ("bpf, arm64: Add bpf trampoline for arm64")
    Signed-off-by: Puranjay Mohan <puranjay@kernel.org>
    Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
    Closes: https://lore.kernel.org/all/SJ0PR15MB461564D3F7E7A763498CA6A8CBDB2@SJ0PR15MB4615.namprd15.prod.outlook.com/
    Link: https://lore.kernel.org/bpf/20240711151838.43469-1-puranjay@kernel.org
    [Minor context change fixed.]
    Signed-off-by: Bin Lan <bin.lan.cn@windriver.com>
    Signed-off-by: He Zhe <zhe.he@windriver.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: don't BUG_ON() when 0 reference count at btrfs_lookup_extent_info() [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Tue Jun 18 12:15:01 2024 +0100

    btrfs: don't BUG_ON() when 0 reference count at btrfs_lookup_extent_info()
    
    commit 28cb13f29faf6290597b24b728dc3100c019356f upstream.
    
    Instead of doing a BUG_ON() handle the error by returning -EUCLEAN,
    aborting the transaction and logging an error message.
    
    Reviewed-by: Qu Wenruo <wqu@suse.com>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    [Minor conflict resolved due to code context change.]
    Signed-off-by: Jianqi Ren <jianqi.ren.cn@windriver.com>
    Signed-off-by: He Zhe <zhe.he@windriver.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

cgroup/cpuset: Extend kthread_is_per_cpu() check to all PF_NO_SETAFFINITY tasks [+ + +]

Author: Waiman Long <longman@redhat.com>
Date:   Thu May 8 15:24:13 2025 -0400

    cgroup/cpuset: Extend kthread_is_per_cpu() check to all PF_NO_SETAFFINITY tasks
    
    [ Upstream commit 39b5ef791d109dd54c7c2e6e87933edfcc0ad1ac ]
    
    Commit ec5fbdfb99d1 ("cgroup/cpuset: Enable update_tasks_cpumask()
    on top_cpuset") enabled us to pull CPUs dedicated to child partitions
    from tasks in top_cpuset by ignoring per cpu kthreads. However, there
    can be other kthreads that are not per cpu but have PF_NO_SETAFFINITY
    flag set to indicate that we shouldn't mess with their CPU affinity.
    For other kthreads, their affinity will be changed to skip CPUs dedicated
    to child partitions whether it is an isolating or a scheduling one.
    
    As all the per cpu kthreads have PF_NO_SETAFFINITY set, the
    PF_NO_SETAFFINITY tasks are essentially a superset of per cpu kthreads.
    Fix this issue by dropping the kthread_is_per_cpu() check and checking
    the PF_NO_SETAFFINITY flag instead.
    
    Fixes: ec5fbdfb99d1 ("cgroup/cpuset: Enable update_tasks_cpumask() on top_cpuset")
    Signed-off-by: Waiman Long <longman@redhat.com>
    Acked-by: Frederic Weisbecker <frederic@kernel.org>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

dma-buf: insert memory barrier before updating num_fences [+ + +]

Author: Hyejeong Choi <hjeong.choi@samsung.com>
Date:   Mon May 12 21:06:38 2025 -0500

    dma-buf: insert memory barrier before updating num_fences
    
    commit 72c7d62583ebce7baeb61acce6057c361f73be4a upstream.
    
    smp_store_mb() inserts memory barrier after storing operation.
    It is different with what the comment is originally aiming so Null
    pointer dereference can be happened if memory update is reordered.
    
    Signed-off-by: Hyejeong Choi <hjeong.choi@samsung.com>
    Fixes: a590d0fdbaa5 ("dma-buf: Update reservation shared_count after adding the new fence")
    CC: stable@vger.kernel.org
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Link: https://lore.kernel.org/r/20250513020638.GA2329653@au1-maretx-p37.eng.sarc.samsung.com
    Signed-off-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: idxd: Add missing cleanup for early error out in idxd_setup_internals [+ + +]

Author: Shuai Xue <xueshuai@linux.alibaba.com>
Date:   Fri Apr 4 20:02:12 2025 +0800

    dmaengine: idxd: Add missing cleanup for early error out in idxd_setup_internals
    
    commit 61259fb96e023f7299c442c48b13e72c441fc0f2 upstream.
    
    The idxd_setup_internals() is missing some cleanup when things fail in
    the middle.
    
    Add the appropriate cleanup routines:
    
    - cleanup groups
    - cleanup enginces
    - cleanup wqs
    
    to make sure it exits gracefully.
    
    Fixes: defe49f96012 ("dmaengine: idxd: fix group conf_dev lifetime")
    Cc: stable@vger.kernel.org
    Suggested-by: Fenghua Yu <fenghuay@nvidia.com>
    Signed-off-by: Shuai Xue <xueshuai@linux.alibaba.com>
    Reviewed-by: Fenghua Yu <fenghuay@nvidia.com>
    Reviewed-by: Dave Jiang <dave.jiang@intel.com>
    Link: https://lore.kernel.org/r/20250404120217.48772-5-xueshuai@linux.alibaba.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: idxd: Add missing cleanups in cleanup internals [+ + +]

Author: Shuai Xue <xueshuai@linux.alibaba.com>
Date:   Fri Apr 4 20:02:13 2025 +0800

    dmaengine: idxd: Add missing cleanups in cleanup internals
    
    commit 61d651572b6c4fe50c7b39a390760f3a910c7ccf upstream.
    
    The idxd_cleanup_internals() function only decreases the reference count
    of groups, engines, and wqs but is missing the step to release memory
    resources.
    
    To fix this, use the cleanup helper to properly release the memory
    resources.
    
    Fixes: ddf742d4f3f1 ("dmaengine: idxd: Add missing cleanup for early error out in probe call")
    Cc: stable@vger.kernel.org
    Signed-off-by: Shuai Xue <xueshuai@linux.alibaba.com>
    Reviewed-by: Fenghua Yu <fenghuay@nvidia.com>
    Reviewed-by: Dave Jiang <dave.jiang@intel.com>
    Link: https://lore.kernel.org/r/20250404120217.48772-6-xueshuai@linux.alibaba.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: idxd: Add missing idxd cleanup to fix memory leak in remove call [+ + +]

Author: Shuai Xue <xueshuai@linux.alibaba.com>
Date:   Fri Apr 4 20:02:16 2025 +0800

    dmaengine: idxd: Add missing idxd cleanup to fix memory leak in remove call
    
    commit d5449ff1b04dfe9ed8e455769aa01e4c2ccf6805 upstream.
    
    The remove call stack is missing idxd cleanup to free bitmap, ida and
    the idxd_device. Call idxd_free() helper routines to make sure we exit
    gracefully.
    
    Fixes: bfe1d56091c1 ("dmaengine: idxd: Init and probe for Intel data accelerators")
    Cc: stable@vger.kernel.org
    Suggested-by: Vinicius Costa Gomes <vinicius.gomes@intel.com>
    Signed-off-by: Shuai Xue <xueshuai@linux.alibaba.com>
    Reviewed-by: Fenghua Yu <fenghuay@nvidia.com>
    Reviewed-by: Dave Jiang <dave.jiang@intel.com>
    Link: https://lore.kernel.org/r/20250404120217.48772-9-xueshuai@linux.alibaba.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: idxd: fix memory leak in error handling path of idxd_alloc [+ + +]

Author: Shuai Xue <xueshuai@linux.alibaba.com>
Date:   Fri Apr 4 20:02:14 2025 +0800

    dmaengine: idxd: fix memory leak in error handling path of idxd_alloc
    
    commit 46a5cca76c76c86063000a12936f8e7875295838 upstream.
    
    Memory allocated for idxd is not freed if an error occurs during
    idxd_alloc(). To fix it, free the allocated memory in the reverse order
    of allocation before exiting the function in case of an error.
    
    Fixes: a8563a33a5e2 ("dmanegine: idxd: reformat opcap output to match bitmap_parse() input")
    Cc: stable@vger.kernel.org
    Signed-off-by: Shuai Xue <xueshuai@linux.alibaba.com>
    Reviewed-by: Dave Jiang <dave.jiang@intel.com>
    Reviewed-by: Fenghua Yu <fenghuay@nvidia.com>
    Link: https://lore.kernel.org/r/20250404120217.48772-7-xueshuai@linux.alibaba.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: idxd: fix memory leak in error handling path of idxd_pci_probe [+ + +]

Author: Shuai Xue <xueshuai@linux.alibaba.com>
Date:   Fri Apr 4 20:02:15 2025 +0800

    dmaengine: idxd: fix memory leak in error handling path of idxd_pci_probe
    
    commit 90022b3a6981ec234902be5dbf0f983a12c759fc upstream.
    
    Memory allocated for idxd is not freed if an error occurs during
    idxd_pci_probe(). To fix it, free the allocated memory in the reverse
    order of allocation before exiting the function in case of an error.
    
    Fixes: bfe1d56091c1 ("dmaengine: idxd: Init and probe for Intel data accelerators")
    Cc: stable@vger.kernel.org
    Signed-off-by: Shuai Xue <xueshuai@linux.alibaba.com>
    Reviewed-by: Dave Jiang <dave.jiang@intel.com>
    Reviewed-by: Fenghua Yu <fenghuay@nvidia.com>
    Link: https://lore.kernel.org/r/20250404120217.48772-8-xueshuai@linux.alibaba.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: idxd: fix memory leak in error handling path of idxd_setup_engines [+ + +]

Author: Shuai Xue <xueshuai@linux.alibaba.com>
Date:   Fri Apr 4 20:02:10 2025 +0800

    dmaengine: idxd: fix memory leak in error handling path of idxd_setup_engines
    
    commit 817bced19d1dbdd0b473580d026dc0983e30e17b upstream.
    
    Memory allocated for engines is not freed if an error occurs during
    idxd_setup_engines(). To fix it, free the allocated memory in the
    reverse order of allocation before exiting the function in case of an
    error.
    
    Fixes: 75b911309060 ("dmaengine: idxd: fix engine conf_dev lifetime")
    Cc: stable@vger.kernel.org
    Signed-off-by: Shuai Xue <xueshuai@linux.alibaba.com>
    Reviewed-by: Dave Jiang <dave.jiang@intel.com>
    Reviewed-by: Fenghua Yu <fenghuay@nvidia.com>
    Link: https://lore.kernel.org/r/20250404120217.48772-3-xueshuai@linux.alibaba.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: idxd: fix memory leak in error handling path of idxd_setup_groups [+ + +]

Author: Shuai Xue <xueshuai@linux.alibaba.com>
Date:   Fri Apr 4 20:02:11 2025 +0800

    dmaengine: idxd: fix memory leak in error handling path of idxd_setup_groups
    
    commit aa6f4f945b10eac57aed46154ae7d6fada7fccc7 upstream.
    
    Memory allocated for groups is not freed if an error occurs during
    idxd_setup_groups(). To fix it, free the allocated memory in the reverse
    order of allocation before exiting the function in case of an error.
    
    Fixes: defe49f96012 ("dmaengine: idxd: fix group conf_dev lifetime")
    Cc: stable@vger.kernel.org
    Signed-off-by: Shuai Xue <xueshuai@linux.alibaba.com>
    Reviewed-by: Dave Jiang <dave.jiang@intel.com>
    Reviewed-by: Fenghua Yu <fenghuay@nvidia.com>
    Link: https://lore.kernel.org/r/20250404120217.48772-4-xueshuai@linux.alibaba.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: idxd: fix memory leak in error handling path of idxd_setup_wqs [+ + +]

Author: Shuai Xue <xueshuai@linux.alibaba.com>
Date:   Fri Apr 4 20:02:09 2025 +0800

    dmaengine: idxd: fix memory leak in error handling path of idxd_setup_wqs
    
    commit 3fd2f4bc010cdfbc07dd21018dc65bd9370eb7a4 upstream.
    
    Memory allocated for wqs is not freed if an error occurs during
    idxd_setup_wqs(). To fix it, free the allocated memory in the reverse
    order of allocation before exiting the function in case of an error.
    
    Fixes: 7c5dd23e57c1 ("dmaengine: idxd: fix wq conf_dev 'struct device' lifetime")
    Fixes: 700af3a0a26c ("dmaengine: idxd: add 'struct idxd_dev' as wrapper for conf_dev")
    Fixes: de5819b99489 ("dmaengine: idxd: track enabled workqueues in bitmap")
    Fixes: b0325aefd398 ("dmaengine: idxd: add WQ operation cap restriction support")
    Cc: stable@vger.kernel.org
    Signed-off-by: Shuai Xue <xueshuai@linux.alibaba.com>
    Reviewed-by: Dave Jiang <dave.jiang@intel.com>
    Reviewed-by: Fenghua Yu <fenghuay@nvidia.com>
    Link: https://lore.kernel.org/r/20250404120217.48772-2-xueshuai@linux.alibaba.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: idxd: Refactor remove call with idxd_cleanup() helper [+ + +]

Author: Shuai Xue <xueshuai@linux.alibaba.com>
Date:   Fri Apr 4 20:02:17 2025 +0800

    dmaengine: idxd: Refactor remove call with idxd_cleanup() helper
    
    commit a409e919ca321cc0e28f8abf96fde299f0072a81 upstream.
    
    The idxd_cleanup() helper cleans up perfmon, interrupts, internals and
    so on. Refactor remove call with the idxd_cleanup() helper to avoid code
    duplication. Note, this also fixes the missing put_device() for idxd
    groups, enginces and wqs.
    
    Fixes: bfe1d56091c1 ("dmaengine: idxd: Init and probe for Intel data accelerators")
    Cc: stable@vger.kernel.org
    Suggested-by: Vinicius Costa Gomes <vinicius.gomes@intel.com>
    Signed-off-by: Shuai Xue <xueshuai@linux.alibaba.com>
    Reviewed-by: Fenghua Yu <fenghuay@nvidia.com>
    Reviewed-by: Dave Jiang <dave.jiang@intel.com>
    Link: https://lore.kernel.org/r/20250404120217.48772-10-xueshuai@linux.alibaba.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: Revert "dmaengine: dmatest: Fix dmatest waiting less when interrupted" [+ + +]

Author: Nathan Lynch <nathan.lynch@amd.com>
Date:   Thu Apr 3 11:24:19 2025 -0500

    dmaengine: Revert "dmaengine: dmatest: Fix dmatest waiting less when interrupted"
    
    commit df180e65305f8c1e020d54bfc2132349fd693de1 upstream.
    
    Several issues with this change:
    
    * The analysis is flawed and it's unclear what problem is being
      fixed. There is no difference between wait_event_freezable_timeout()
      and wait_event_timeout() with respect to device interrupts. And of
      course "the interrupt notifying the finish of an operation happens
      during wait_event_freezable_timeout()" -- that's how it's supposed
      to work.
    
    * The link at the "Closes:" tag appears to be an unrelated
      use-after-free in idxd.
    
    * It introduces a regression: dmatest threads are meant to be
      freezable and this change breaks that.
    
    See discussion here:
    https://lore.kernel.org/dmaengine/878qpa13fe.fsf@AUSNATLYNCH.amd.com/
    
    Fixes: e87ca16e9911 ("dmaengine: dmatest: Fix dmatest waiting less when interrupted")
    Signed-off-by: Nathan Lynch <nathan.lynch@amd.com>
    Link: https://lore.kernel.org/r/20250403-dmaengine-dmatest-revert-waiting-less-v1-1-8227c5a3d7c8@amd.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: ti: k3-udma: Add missing locking [+ + +]

Author: Ronald Wahl <ronald.wahl@legrand.com>
Date:   Mon Apr 14 19:31:13 2025 +0200

    dmaengine: ti: k3-udma: Add missing locking
    
    commit fca280992af8c2fbd511bc43f65abb4a17363f2f upstream.
    
    Recent kernels complain about a missing lock in k3-udma.c when the lock
    validator is enabled:
    
    [    4.128073] WARNING: CPU: 0 PID: 746 at drivers/dma/ti/../virt-dma.h:169 udma_start.isra.0+0x34/0x238
    [    4.137352] CPU: 0 UID: 0 PID: 746 Comm: kworker/0:3 Not tainted 6.12.9-arm64 #28
    [    4.144867] Hardware name: pp-v12 (DT)
    [    4.148648] Workqueue: events udma_check_tx_completion
    [    4.153841] pstate: 60000005 (nZCv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
    [    4.160834] pc : udma_start.isra.0+0x34/0x238
    [    4.165227] lr : udma_start.isra.0+0x30/0x238
    [    4.169618] sp : ffffffc083cabcf0
    [    4.172963] x29: ffffffc083cabcf0 x28: 0000000000000000 x27: ffffff800001b005
    [    4.180167] x26: ffffffc0812f0000 x25: 0000000000000000 x24: 0000000000000000
    [    4.187370] x23: 0000000000000001 x22: 00000000e21eabe9 x21: ffffff8000fa0670
    [    4.194571] x20: ffffff8001b6bf00 x19: ffffff8000fa0430 x18: ffffffc083b95030
    [    4.201773] x17: 0000000000000000 x16: 00000000f0000000 x15: 0000000000000048
    [    4.208976] x14: 0000000000000048 x13: 0000000000000000 x12: 0000000000000001
    [    4.216179] x11: ffffffc08151a240 x10: 0000000000003ea1 x9 : ffffffc08046ab68
    [    4.223381] x8 : ffffffc083cabac0 x7 : ffffffc081df3718 x6 : 0000000000029fc8
    [    4.230583] x5 : ffffffc0817ee6d8 x4 : 0000000000000bc0 x3 : 0000000000000000
    [    4.237784] x2 : 0000000000000000 x1 : 00000000001fffff x0 : 0000000000000000
    [    4.244986] Call trace:
    [    4.247463]  udma_start.isra.0+0x34/0x238
    [    4.251509]  udma_check_tx_completion+0xd0/0xdc
    [    4.256076]  process_one_work+0x244/0x3fc
    [    4.260129]  process_scheduled_works+0x6c/0x74
    [    4.264610]  worker_thread+0x150/0x1dc
    [    4.268398]  kthread+0xd8/0xe8
    [    4.271492]  ret_from_fork+0x10/0x20
    [    4.275107] irq event stamp: 220
    [    4.278363] hardirqs last  enabled at (219): [<ffffffc080a27c7c>] _raw_spin_unlock_irq+0x38/0x50
    [    4.287183] hardirqs last disabled at (220): [<ffffffc080a1c154>] el1_dbg+0x24/0x50
    [    4.294879] softirqs last  enabled at (182): [<ffffffc080037e68>] handle_softirqs+0x1c0/0x3cc
    [    4.303437] softirqs last disabled at (177): [<ffffffc080010170>] __do_softirq+0x1c/0x28
    [    4.311559] ---[ end trace 0000000000000000 ]---
    
    This commit adds the missing locking.
    
    Fixes: 25dcb5dd7b7c ("dmaengine: ti: New driver for K3 UDMA")
    Cc: Peter Ujfalusi <peter.ujfalusi@gmail.com>
    Cc: Vignesh Raghavendra <vigneshr@ti.com>
    Cc: Vinod Koul <vkoul@kernel.org>
    Cc: dmaengine@vger.kernel.org
    Cc: stable@vger.kernel.org
    Signed-off-by: Ronald Wahl <ronald.wahl@legrand.com>
    Acked-by: Peter Ujfalusi <peter.ujfalusi@gmail.com>
    Link: https://lore.kernel.org/r/20250414173113.80677-1-rwahl@gmx.de
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: ti: k3-udma: Use cap_mask directly from dma_device structure instead of a local copy [+ + +]

Author: Yemike Abhilash Chandra <y-abhilashchandra@ti.com>
Date:   Thu Apr 17 13:25:21 2025 +0530

    dmaengine: ti: k3-udma: Use cap_mask directly from dma_device structure instead of a local copy
    
    commit 8ca9590c39b69b55a8de63d2b21b0d44f523b43a upstream.
    
    Currently, a local dma_cap_mask_t variable is used to store device
    cap_mask within udma_of_xlate(). However, the DMA_PRIVATE flag in
    the device cap_mask can get cleared when the last channel is released.
    This can happen right after storing the cap_mask locally in
    udma_of_xlate(), and subsequent dma_request_channel() can fail due to
    mismatch in the cap_mask. Fix this by removing the local dma_cap_mask_t
    variable and directly using the one from the dma_device structure.
    
    Fixes: 25dcb5dd7b7c ("dmaengine: ti: New driver for K3 UDMA")
    Cc: stable@vger.kernel.org
    Signed-off-by: Vaishnav Achath <vaishnav.a@ti.com>
    Acked-by: Peter Ujfalusi <peter.ujfalusi@gmail.com>
    Reviewed-by: Udit Kumar <u-kumar1@ti.com>
    Signed-off-by: Yemike Abhilash Chandra <y-abhilashchandra@ti.com>
    Link: https://lore.kernel.org/r/20250417075521.623651-1-y-abhilashchandra@ti.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Drivers: hv: Allow vmbus_sendpacket_mpb_desc() to create multiple ranges [+ + +]

Author: Michael Kelley <mhklinux@outlook.com>
Date:   Mon May 12 17:06:00 2025 -0700

    Drivers: hv: Allow vmbus_sendpacket_mpb_desc() to create multiple ranges
    
    commit 380b75d3078626aadd0817de61f3143f5db6e393 upstream.
    
    vmbus_sendpacket_mpb_desc() is currently used only by the storvsc driver
    and is hardcoded to create a single GPA range. To allow it to also be
    used by the netvsc driver to create multiple GPA ranges, no longer
    hardcode as having a single GPA range. Allow the calling driver to
    specify the rangecount in the supplied descriptor.
    
    Update the storvsc driver to reflect this new approach.
    
    Cc: <stable@vger.kernel.org> # 6.1.x
    Signed-off-by: Michael Kelley <mhklinux@outlook.com>
    Link: https://patch.msgid.link/20250513000604.1396-2-mhklinux@outlook.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Drivers: hv: vmbus: Remove vmbus_sendpacket_pagebuffer() [+ + +]

Author: Michael Kelley <mhklinux@outlook.com>
Date:   Mon May 12 17:06:04 2025 -0700

    Drivers: hv: vmbus: Remove vmbus_sendpacket_pagebuffer()
    
    commit 45a442fe369e6c4e0b4aa9f63b31c3f2f9e2090e upstream.
    
    With the netvsc driver changed to use vmbus_sendpacket_mpb_desc()
    instead of vmbus_sendpacket_pagebuffer(), the latter has no remaining
    callers. Remove it.
    
    Cc: <stable@vger.kernel.org> # 6.1.x
    Signed-off-by: Michael Kelley <mhklinux@outlook.com>
    Link: https://patch.msgid.link/20250513000604.1396-6-mhklinux@outlook.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd/display: Avoid flooding unnecessary info messages [+ + +]

Author: Wayne Lin <Wayne.Lin@amd.com>
Date:   Tue May 13 11:20:24 2025 +0800

    drm/amd/display: Avoid flooding unnecessary info messages
    
    commit d33724ffb743d3d2698bd969e29253ae0cff9739 upstream.
    
    It's expected that we'll encounter temporary exceptions
    during aux transactions. Adjust logging from drm_info to
    drm_dbg_dp to prevent flooding with unnecessary log messages.
    
    Fixes: 3637e457eb00 ("drm/amd/display: Fix wrong handling for AUX_DEFER case")
    Cc: Mario Limonciello <mario.limonciello@amd.com>
    Cc: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Wayne Lin <Wayne.Lin@amd.com>
    Acked-by: Alex Deucher <alexander.deucher@amd.com>
    Link: https://lore.kernel.org/r/20250513032026.838036-1-Wayne.Lin@amd.com
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit 9a9c3e1fe5256da14a0a307dff0478f90c55fc8c)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd/display: Correct the reply value when AUX write incomplete [+ + +]

Author: Wayne Lin <Wayne.Lin@amd.com>
Date:   Fri Apr 25 14:44:02 2025 +0800

    drm/amd/display: Correct the reply value when AUX write incomplete
    
    commit d433981385c62c72080e26f1c00a961d18b233be upstream.
    
    [Why]
    Now forcing aux->transfer to return 0 when incomplete AUX write is
    inappropriate. It should return bytes have been transferred.
    
    [How]
    aux->transfer is asked not to change original msg except reply field of
    drm_dp_aux_msg structure. Copy the msg->buffer when it's write request,
    and overwrite the first byte when sink reply 1 byte indicating partially
    written byte number. Then we can return the correct value without
    changing the original msg.
    
    Fixes: 3637e457eb00 ("drm/amd/display: Fix wrong handling for AUX_DEFER case")
    Cc: Mario Limonciello <mario.limonciello@amd.com>
    Cc: Alex Deucher <alexander.deucher@amd.com>
    Reviewed-by: Ray Wu <ray.wu@amd.com>
    Signed-off-by: Wayne Lin <Wayne.Lin@amd.com>
    Signed-off-by: Ray Wu <ray.wu@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit 7ac37f0dcd2e0b729fa7b5513908dc8ab802b540)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd: Add Suspend/Hibernate notification callback support [+ + +]

Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Wed Nov 27 21:26:56 2024 -0600

    drm/amd: Add Suspend/Hibernate notification callback support
    
    [ Upstream commit 2965e6355dcdf157b5fafa25a2715f00064da8bf ]
    
    As part of the suspend sequence VRAM needs to be evicted on dGPUs.
    In order to make suspend/resume more reliable we moved this into
    the pmops prepare() callback so that the suspend sequence would fail
    but the system could remain operational under high memory usage suspend.
    
    Another class of issues exist though where due to memory fragementation
    there isn't a large enough contiguous space and swap isn't accessible.
    
    Add support for a suspend/hibernate notification callback that could
    evict VRAM before tasks are frozen. This should allow paging out to swap
    if necessary.
    
    Link: https://github.com/ROCm/ROCK-Kernel-Driver/issues/174
    Link: https://gitlab.freedesktop.org/drm/amd/-/issues/3476
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/2362
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/3781
    Reviewed-by: Lijo Lazar <lijo.lazar@amd.com>
    Link: https://lore.kernel.org/r/20241128032656.2090059-2-superm1@kernel.org
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Stable-dep-of: d0ce1aaa8531 ("Revert "drm/amd: Stop evicting resources on APUs in suspend"")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd: Stop evicting resources on APUs in suspend [+ + +]

Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Wed Feb 7 23:52:55 2024 -0600

    drm/amd: Stop evicting resources on APUs in suspend
    
    [ Upstream commit 226db36032c61d8717dfdd052adac351b22d3e83 ]
    
    commit 5095d5418193 ("drm/amd: Evict resources during PM ops prepare()
    callback") intentionally moved the eviction of resources to earlier in
    the suspend process, but this introduced a subtle change that it occurs
    before adev->in_s0ix or adev->in_s3 are set. This meant that APUs
    actually started to evict resources at suspend time as well.
    
    Explicitly set s0ix or s3 in the prepare() stage, and unset them if the
    prepare() stage failed.
    
    v2: squash in warning fix from Stephen Rothwell
    
    Reported-by: Jürg Billeter <j@bitron.ch>
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/3132#note_2271038
    Fixes: 5095d5418193 ("drm/amd: Evict resources during PM ops prepare() callback")
    Acked-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Stable-dep-of: d0ce1aaa8531 ("Revert "drm/amd: Stop evicting resources on APUs in suspend"")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amdgpu: fix pm notifier handling [+ + +]

Author: Alex Deucher <alexander.deucher@amd.com>
Date:   Thu May 1 13:46:46 2025 -0400

    drm/amdgpu: fix pm notifier handling
    
    commit 4aaffc85751da5722e858e4333e8cf0aa4b6c78f upstream.
    
    Set the s3/s0ix and s4 flags in the pm notifier so that we can skip
    the resource evictions properly in pm prepare based on whether
    we are suspending or hibernating.  Drop the eviction as processes
    are not frozen at this time, we we can end up getting stuck trying
    to evict VRAM while applications continue to submit work which
    causes the buffers to get pulled back into VRAM.
    
    v2: Move suspend flags out of pm notifier (Mario)
    
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4178
    Fixes: 2965e6355dcd ("drm/amd: Add Suspend/Hibernate notification callback support")
    Cc: Mario Limonciello <mario.limonciello@amd.com>
    Reviewed-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit 06f2dcc241e7e5c681f81fbc46cacdf4bfd7d6d7)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amdgpu: Fix the runtime resume failure issue [+ + +]

Author: Ma Jun <Jun.Ma2@amd.com>
Date:   Wed Feb 21 17:16:49 2024 +0800

    drm/amdgpu: Fix the runtime resume failure issue
    
    [ Upstream commit bbfaf2aea7164db59739728d62d9cc91d64ff856 ]
    
    Don't set power state flag when system enter runtime suspend,
    or it may cause runtime resume failure issue.
    
    Fixes: 3a9626c816db ("drm/amd: Stop evicting resources on APUs in suspend")
    Signed-off-by: Ma Jun <Jun.Ma2@amd.com>
    Reviewed-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Cc: stable@vger.kernel.org
    Stable-dep-of: d0ce1aaa8531 ("Revert "drm/amd: Stop evicting resources on APUs in suspend"")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amdgpu: trigger flr_work if reading pf2vf data failed [+ + +]

Author: Zhigang Luo <Zhigang.Luo@amd.com>
Date:   Thu Feb 29 16:04:35 2024 -0500

    drm/amdgpu: trigger flr_work if reading pf2vf data failed
    
    [ Upstream commit ab66c832847fcdffc97d4591ba5547e3990d9d33 ]
    
    if reading pf2vf data failed 30 times continuously, it means something is
    wrong. Need to trigger flr_work to recover the issue.
    
    also use dev_err to print the error message to get which device has
    issue and add warning message if waiting IDH_FLR_NOTIFICATION_CMPL
    timeout.
    
    Signed-off-by: Zhigang Luo <Zhigang.Luo@amd.com>
    Acked-by: Hawking Zhang <Hawking.Zhang@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Stable-dep-of: d0ce1aaa8531 ("Revert "drm/amd: Stop evicting resources on APUs in suspend"")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_scmi: Add helper to trace bad messages [+ + +]

Author: Cristian Marussi <cristian.marussi@arm.com>
Date:   Mon Mar 25 20:46:17 2024 +0000

    firmware: arm_scmi: Add helper to trace bad messages
    
    [ Upstream commit 5dc0e0b1f0ea2b55031f84a365962b9b45869b98 ]
    
    Upon reception of malformed and unexpected timed-out SCMI messages, it is
    not possible to trace those bad messages in their entirety, because usually
    we cannot even retrieve the payload, or it is just not reliable.
    
    Add a helper to trace at least the content of the header of the received
    message while associating a meaningful tag and error code.
    
    Signed-off-by: Cristian Marussi <cristian.marussi@arm.com>
    Link: https://lore.kernel.org/r/20240325204620.1437237-3-cristian.marussi@arm.com
    Signed-off-by: Sudeep Holla <sudeep.holla@arm.com>
    Stable-dep-of: c23c03bf1faa ("firmware: arm_scmi: Fix timeout checks on polling path")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_scmi: Add message dump traces for bad and unexpected replies [+ + +]

Author: Cristian Marussi <cristian.marussi@arm.com>
Date:   Mon Mar 25 20:46:18 2024 +0000

    firmware: arm_scmi: Add message dump traces for bad and unexpected replies
    
    [ Upstream commit 5076ab66db1671a5cd9ecfb857d1949e36a33142 ]
    
    It is useful to have message dump traces for any invalid/bad/unexpected
    replies. Let us add traces for the same as well as late-timed-out,
    out-of-order and unexpected/spurious messages.
    
    Signed-off-by: Cristian Marussi <cristian.marussi@arm.com>
    Link: https://lore.kernel.org/r/20240325204620.1437237-4-cristian.marussi@arm.com
    Signed-off-by: Sudeep Holla <sudeep.holla@arm.com>
    Stable-dep-of: c23c03bf1faa ("firmware: arm_scmi: Fix timeout checks on polling path")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_scmi: Add support for debug metrics at the interface [+ + +]

Author: Luke Parkin <luke.parkin@arm.com>
Date:   Mon Aug 5 14:10:09 2024 +0100

    firmware: arm_scmi: Add support for debug metrics at the interface
    
    [ Upstream commit 1b18d4295f9d1125bc7a799fc12924cd45fc04b1 ]
    
    Since SCMI involves interaction with the entity(software, firmware and/or
    hardware) providing services or features, it is quite useful to track
    certain metrics(for pure debugging purposes) like how many messages were
    sent or received, were there any failures, what kind of failures, ..etc.
    
    Add a new optional config option for the above purpose and the initial
    support for counting such key debug metrics.
    
    Signed-off-by: Luke Parkin <luke.parkin@arm.com>
    Reviewed-by: Cristian Marussi <cristian.marussi@arm.com>
    Tested-by: Cristian Marussi <cristian.marussi@arm.com>
    Message-Id: <20240805131013.587016-3-sudeep.holla@arm.com>
    Signed-off-by: Sudeep Holla <sudeep.holla@arm.com>
    Stable-dep-of: c23c03bf1faa ("firmware: arm_scmi: Fix timeout checks on polling path")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_scmi: Fix timeout checks on polling path [+ + +]

Author: Cristian Marussi <cristian.marussi@arm.com>
Date:   Mon Mar 10 17:58:00 2025 +0000

    firmware: arm_scmi: Fix timeout checks on polling path
    
    [ Upstream commit c23c03bf1faa1e76be1eba35bad6da6a2a7c95ee ]
    
    Polling mode transactions wait for a reply busy-looping without holding a
    spinlock, but currently the timeout checks are based only on elapsed time:
    as a result we could hit a false positive whenever our busy-looping thread
    is pre-empted and scheduled out for a time greater than the polling
    timeout.
    
    Change the checks at the end of the busy-loop to make sure that the polling
    wasn't indeed successful or an out-of-order reply caused the polling to be
    forcibly terminated.
    
    Fixes: 31d2f803c19c ("firmware: arm_scmi: Add sync_cmds_completed_on_ret transport flag")
    Reported-by: Huangjie <huangjie1663@phytium.com.cn>
    Closes: https://lore.kernel.org/arm-scmi/20250123083323.2363749-1-jackhuang021@gmail.com/
    Signed-off-by: Cristian Marussi <cristian.marussi@arm.com>
    Cc: stable@vger.kernel.org # 5.18.x
    Message-Id: <20250310175800.1444293-1-cristian.marussi@arm.com>
    Signed-off-by: Sudeep Holla <sudeep.holla@arm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_scmi: Track basic SCMI communication debug metrics [+ + +]

Author: Luke Parkin <luke.parkin@arm.com>
Date:   Mon Aug 5 14:10:10 2024 +0100

    firmware: arm_scmi: Track basic SCMI communication debug metrics
    
    [ Upstream commit 0b3d48c4726e1b20dffd2ff81a9d94d5d930220b ]
    
    Add the support for counting some of the SCMI communication debug metrics
    like how many were sent successfully or with some errors, responses
    received, notifications and delayed responses, transfer timeouts and
    errors from the firmware/platform.
    
    In many cases, the traces exists. But the traces are not always necessarily
    enabled and getting such cumulative SCMI communication debug metrics helps
    in understanding if there are any possible improvements that can be made
    on either side of SCMI communication.
    
    Signed-off-by: Luke Parkin <luke.parkin@arm.com>
    Reviewed-by: Cristian Marussi <cristian.marussi@arm.com>
    Tested-by: Cristian Marussi <cristian.marussi@arm.com>
    Message-Id: <20240805131013.587016-4-sudeep.holla@arm.com>
    Signed-off-by: Sudeep Holla <sudeep.holla@arm.com>
    Stable-dep-of: c23c03bf1faa ("firmware: arm_scmi: Fix timeout checks on polling path")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fs/xattr.c: fix simple_xattr_list to always include security.* xattrs [+ + +]

Author: Stephen Smalley <stephen.smalley.work@gmail.com>
Date:   Thu Apr 24 11:28:20 2025 -0400

    fs/xattr.c: fix simple_xattr_list to always include security.* xattrs
    
    [ Upstream commit 8b0ba61df5a1c44e2b3cf683831a4fc5e24ea99d ]
    
    The vfs has long had a fallback to obtain the security.* xattrs from the
    LSM when the filesystem does not implement its own listxattr, but
    shmem/tmpfs and kernfs later gained their own xattr handlers to support
    other xattrs. Unfortunately, as a side effect, tmpfs and kernfs-based
    filesystems like sysfs no longer return the synthetic security.* xattr
    names via listxattr unless they are explicitly set by userspace or
    initially set upon inode creation after policy load. coreutils has
    recently switched from unconditionally invoking getxattr for security.*
    for ls -Z via libselinux to only doing so if listxattr returns the xattr
    name, breaking ls -Z of such inodes.
    
    Before:
    $ getfattr -m.* /run/initramfs
    <no output>
    $ getfattr -m.* /sys/kernel/fscaps
    <no output>
    $ setfattr -n user.foo /run/initramfs
    $ getfattr -m.* /run/initramfs
    user.foo
    
    After:
    $ getfattr -m.* /run/initramfs
    security.selinux
    $ getfattr -m.* /sys/kernel/fscaps
    security.selinux
    $ setfattr -n user.foo /run/initramfs
    $ getfattr -m.* /run/initramfs
    security.selinux
    user.foo
    
    Link: https://lore.kernel.org/selinux/CAFqZXNtF8wDyQajPCdGn=iOawX4y77ph0EcfcqcUUj+T87FKyA@mail.gmail.com/
    Link: https://lore.kernel.org/selinux/20250423175728.3185-2-stephen.smalley.work@gmail.com/
    Signed-off-by: Stephen Smalley <stephen.smalley.work@gmail.com>
    Link: https://lore.kernel.org/20250424152822.2719-1-stephen.smalley.work@gmail.com
    Fixes: b09e0fa4b4ea66266058ee ("tmpfs: implement generic xattr support")
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ftrace: Fix preemption accounting for stacktrace filter command [+ + +]

Author: pengdonglin <pengdonglin@xiaomi.com>
Date:   Mon May 12 17:42:46 2025 +0800

    ftrace: Fix preemption accounting for stacktrace filter command
    
    commit 11aff32439df6ca5b3b891b43032faf88f4a6a29 upstream.
    
    The preemption count of the stacktrace filter command to trace ksys_read
    is consistently incorrect:
    
    $ echo ksys_read:stacktrace > set_ftrace_filter
    
       <...>-453     [004] ...1.    38.308956: <stack trace>
    => ksys_read
    => do_syscall_64
    => entry_SYSCALL_64_after_hwframe
    
    The root cause is that the trace framework disables preemption when
    invoking the filter command callback in function_trace_probe_call:
    
       preempt_disable_notrace();
       probe_ops->func(ip, parent_ip, probe_opsbe->tr, probe_ops, probe->data);
       preempt_enable_notrace();
    
    Use tracing_gen_ctx_dec() to account for the preempt_disable_notrace(),
    which will output the correct preemption count:
    
    $ echo ksys_read:stacktrace > set_ftrace_filter
    
       <...>-410     [006] .....    31.420396: <stack trace>
    => ksys_read
    => do_syscall_64
    => entry_SYSCALL_64_after_hwframe
    
    Cc: stable@vger.kernel.org
    Fixes: 36590c50b2d07 ("tracing: Merge irqflags + preempt counter.")
    Link: https://lore.kernel.org/20250512094246.1167956-2-dolinux.peng@gmail.com
    Signed-off-by: pengdonglin <dolinux.peng@gmail.com>
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ftrace: Fix preemption accounting for stacktrace trigger command [+ + +]

Author: pengdonglin <pengdonglin@xiaomi.com>
Date:   Mon May 12 17:42:45 2025 +0800

    ftrace: Fix preemption accounting for stacktrace trigger command
    
    commit e333332657f615ac2b55aa35565c4a882018bbe9 upstream.
    
    When using the stacktrace trigger command to trace syscalls, the
    preemption count was consistently reported as 1 when the system call
    event itself had 0 (".").
    
    For example:
    
    root@ubuntu22-vm:/sys/kernel/tracing/events/syscalls/sys_enter_read
    $ echo stacktrace > trigger
    $ echo 1 > enable
    
        sshd-416     [002] .....   232.864910: sys_read(fd: a, buf: 556b1f3221d0, count: 8000)
        sshd-416     [002] ...1.   232.864913: <stack trace>
     => ftrace_syscall_enter
     => syscall_trace_enter
     => do_syscall_64
     => entry_SYSCALL_64_after_hwframe
    
    The root cause is that the trace framework disables preemption in __DO_TRACE before
    invoking the trigger callback.
    
    Use the tracing_gen_ctx_dec() that will accommodate for the increase of
    the preemption count in __DO_TRACE when calling the callback. The result
    is the accurate reporting of:
    
        sshd-410     [004] .....   210.117660: sys_read(fd: 4, buf: 559b725ba130, count: 40000)
        sshd-410     [004] .....   210.117662: <stack trace>
     => ftrace_syscall_enter
     => syscall_trace_enter
     => do_syscall_64
     => entry_SYSCALL_64_after_hwframe
    
    Cc: stable@vger.kernel.org
    Fixes: ce33c845b030c ("tracing: Dump stacktrace trigger to the corresponding instance")
    Link: https://lore.kernel.org/20250512094246.1167956-1-dolinux.peng@gmail.com
    Signed-off-by: pengdonglin <dolinux.peng@gmail.com>
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

HID: thrustmaster: fix memory leak in thrustmaster_interrupts() [+ + +]

Author: Qasim Ijaz <qasdev00@gmail.com>
Date:   Thu Mar 27 23:11:46 2025 +0000

    HID: thrustmaster: fix memory leak in thrustmaster_interrupts()
    
    [ Upstream commit 09d546303b370113323bfff456c4e8cff8756005 ]
    
    In thrustmaster_interrupts(), the allocated send_buf is not
    freed if the usb_check_int_endpoints() check fails, leading
    to a memory leak.
    
    Fix this by ensuring send_buf is freed before returning in
    the error path.
    
    Fixes: 50420d7c79c3 ("HID: hid-thrustmaster: Fix warning in thrustmaster_probe by adding endpoint check")
    Signed-off-by: Qasim Ijaz <qasdev00@gmail.com>
    Signed-off-by: Jiri Kosina <jkosina@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

HID: uclogic: Add NULL check in uclogic_input_configured() [+ + +]

Author: Henry Martin <bsdhenrymartin@gmail.com>
Date:   Tue Apr 1 17:48:53 2025 +0800

    HID: uclogic: Add NULL check in uclogic_input_configured()
    
    [ Upstream commit bd07f751208ba190f9b0db5e5b7f35d5bb4a8a1e ]
    
    devm_kasprintf() returns NULL when memory allocation fails. Currently,
    uclogic_input_configured() does not check for this case, which results
    in a NULL pointer dereference.
    
    Add NULL check after devm_kasprintf() to prevent this issue.
    
    Fixes: dd613a4e45f8 ("HID: uclogic: Correct devm device reference for hidinput input_dev name")
    Signed-off-by: Henry Martin <bsdhenrymartin@gmail.com>
    Signed-off-by: Jiri Kosina <jkosina@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hv_netvsc: Preserve contiguous PFN grouping in the page buffer array [+ + +]

Author: Michael Kelley <mhklinux@outlook.com>
Date:   Mon May 12 17:06:02 2025 -0700

    hv_netvsc: Preserve contiguous PFN grouping in the page buffer array
    
    commit 41a6328b2c55276f89ea3812069fd7521e348bbf upstream.
    
    Starting with commit dca5161f9bd0 ("hv_netvsc: Check status in
    SEND_RNDIS_PKT completion message") in the 6.3 kernel, the Linux
    driver for Hyper-V synthetic networking (netvsc) occasionally reports
    "nvsp_rndis_pkt_complete error status: 2".[1] This error indicates
    that Hyper-V has rejected a network packet transmit request from the
    guest, and the outgoing network packet is dropped. Higher level
    network protocols presumably recover and resend the packet so there is
    no functional error, but performance is slightly impacted. Commit
    dca5161f9bd0 is not the cause of the error -- it only added reporting
    of an error that was already happening without any notice. The error
    has presumably been present since the netvsc driver was originally
    introduced into Linux.
    
    The root cause of the problem is that the netvsc driver in Linux may
    send an incorrectly formatted VMBus message to Hyper-V when
    transmitting the network packet. The incorrect formatting occurs when
    the rndis header of the VMBus message crosses a page boundary due to
    how the Linux skb head memory is aligned. In such a case, two PFNs are
    required to describe the location of the rndis header, even though
    they are contiguous in guest physical address (GPA) space. Hyper-V
    requires that two rndis header PFNs be in a single "GPA range" data
    struture, but current netvsc code puts each PFN in its own GPA range,
    which Hyper-V rejects as an error.
    
    The incorrect formatting occurs only for larger packets that netvsc
    must transmit via a VMBus "GPA Direct" message. There's no problem
    when netvsc transmits a smaller packet by copying it into a pre-
    allocated send buffer slot because the pre-allocated slots don't have
    page crossing issues.
    
    After commit 14ad6ed30a10 ("net: allow small head cache usage with
    large MAX_SKB_FRAGS values") in the 6.14-rc4 kernel, the error occurs
    much more frequently in VMs with 16 or more vCPUs. It may occur every
    few seconds, or even more frequently, in an ssh session that outputs a
    lot of text. Commit 14ad6ed30a10 subtly changes how skb head memory is
    allocated, making it much more likely that the rndis header will cross
    a page boundary when the vCPU count is 16 or more. The changes in
    commit 14ad6ed30a10 are perfectly valid -- they just had the side
    effect of making the netvsc bug more prominent.
    
    Current code in init_page_array() creates a separate page buffer array
    entry for each PFN required to identify the data to be transmitted.
    Contiguous PFNs get separate entries in the page buffer array, and any
    information about contiguity is lost.
    
    Fix the core issue by having init_page_array() construct the page
    buffer array to represent contiguous ranges rather than individual
    pages. When these ranges are subsequently passed to
    netvsc_build_mpb_array(), it can build GPA ranges that contain
    multiple PFNs, as required to avoid the error "nvsp_rndis_pkt_complete
    error status: 2". If instead the network packet is sent by copying
    into a pre-allocated send buffer slot, the copy proceeds using the
    contiguous ranges rather than individual pages, but the result of the
    copying is the same. Also fix rndis_filter_send_request() to construct
    a contiguous range, since it has its own page buffer array.
    
    This change has a side benefit in CoCo VMs in that netvsc_dma_map()
    calls dma_map_single() on each contiguous range instead of on each
    page. This results in fewer calls to dma_map_single() but on larger
    chunks of memory, which should reduce contention on the swiotlb.
    
    Since the page buffer array now contains one entry for each contiguous
    range instead of for each individual page, the number of entries in
    the array can be reduced, saving 208 bytes of stack space in
    netvsc_xmit() when MAX_SKG_FRAGS has the default value of 17.
    
    [1] https://bugzilla.kernel.org/show_bug.cgi?id=217503
    
    Closes: https://bugzilla.kernel.org/show_bug.cgi?id=217503
    Cc: <stable@vger.kernel.org> # 6.1.x
    Signed-off-by: Michael Kelley <mhklinux@outlook.com>
    Link: https://patch.msgid.link/20250513000604.1396-4-mhklinux@outlook.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hv_netvsc: Remove rmsg_pgcnt [+ + +]

Author: Michael Kelley <mhklinux@outlook.com>
Date:   Mon May 12 17:06:03 2025 -0700

    hv_netvsc: Remove rmsg_pgcnt
    
    commit 5bbc644bbf4e97a05bc0cb052189004588ff8a09 upstream.
    
    init_page_array() now always creates a single page buffer array entry
    for the rndis message, even if the rndis message crosses a page
    boundary. As such, the number of page buffer array entries used for
    the rndis message must no longer be tracked -- it is always just 1.
    Remove the rmsg_pgcnt field and use "1" where the value is needed.
    
    Cc: <stable@vger.kernel.org> # 6.1.x
    Signed-off-by: Michael Kelley <mhklinux@outlook.com>
    Link: https://patch.msgid.link/20250513000604.1396-5-mhklinux@outlook.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hv_netvsc: Use vmbus_sendpacket_mpb_desc() to send VMBus messages [+ + +]

Author: Michael Kelley <mhklinux@outlook.com>
Date:   Mon May 12 17:06:01 2025 -0700

    hv_netvsc: Use vmbus_sendpacket_mpb_desc() to send VMBus messages
    
    commit 4f98616b855cb0e3b5917918bb07b44728eb96ea upstream.
    
    netvsc currently uses vmbus_sendpacket_pagebuffer() to send VMBus
    messages. This function creates a series of GPA ranges, each of which
    contains a single PFN. However, if the rndis header in the VMBus
    message crosses a page boundary, the netvsc protocol with the host
    requires that both PFNs for the rndis header must be in a single "GPA
    range" data structure, which isn't possible with
    vmbus_sendpacket_pagebuffer(). As the first step in fixing this, add a
    new function netvsc_build_mpb_array() to build a VMBus message with
    multiple GPA ranges, each of which may contain multiple PFNs. Use
    vmbus_sendpacket_mpb_desc() to send this VMBus message to the host.
    
    There's no functional change since higher levels of netvsc don't
    maintain or propagate knowledge of contiguous PFNs. Based on its
    input, netvsc_build_mpb_array() still produces a separate GPA range
    for each PFN and the behavior is the same as with
    vmbus_sendpacket_pagebuffer(). But the groundwork is laid for a
    subsequent patch to provide the necessary grouping.
    
    Cc: <stable@vger.kernel.org> # 6.1.x
    Signed-off-by: Michael Kelley <mhklinux@outlook.com>
    Link: https://patch.msgid.link/20250513000604.1396-3-mhklinux@outlook.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hwpoison, memory_hotplug: lock folio before unmap hwpoisoned folio [+ + +]

Author: Ma Wupeng <mawupeng1@huawei.com>
Date:   Mon Feb 17 09:43:29 2025 +0800

    hwpoison, memory_hotplug: lock folio before unmap hwpoisoned folio
    
    commit af288a426c3e3552b62595c6138ec6371a17dbba upstream.
    
    Commit b15c87263a69 ("hwpoison, memory_hotplug: allow hwpoisoned pages to
    be offlined) add page poison checks in do_migrate_range in order to make
    offline hwpoisoned page possible by introducing isolate_lru_page and
    try_to_unmap for hwpoisoned page.  However folio lock must be held before
    calling try_to_unmap.  Add it to fix this problem.
    
    Warning will be produced if folio is not locked during unmap:
    
      ------------[ cut here ]------------
      kernel BUG at ./include/linux/swapops.h:400!
      Internal error: Oops - BUG: 00000000f2000800 [#1] PREEMPT SMP
      Modules linked in:
      CPU: 4 UID: 0 PID: 411 Comm: bash Tainted: G        W          6.13.0-rc1-00016-g3c434c7ee82a-dirty #41
      Tainted: [W]=WARN
      Hardware name: QEMU QEMU Virtual Machine, BIOS 0.0.0 02/06/2015
      pstate: 40400005 (nZcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
      pc : try_to_unmap_one+0xb08/0xd3c
      lr : try_to_unmap_one+0x3dc/0xd3c
      Call trace:
       try_to_unmap_one+0xb08/0xd3c (P)
       try_to_unmap_one+0x3dc/0xd3c (L)
       rmap_walk_anon+0xdc/0x1f8
       rmap_walk+0x3c/0x58
       try_to_unmap+0x88/0x90
       unmap_poisoned_folio+0x30/0xa8
       do_migrate_range+0x4a0/0x568
       offline_pages+0x5a4/0x670
       memory_block_action+0x17c/0x374
       memory_subsys_offline+0x3c/0x78
       device_offline+0xa4/0xd0
       state_store+0x8c/0xf0
       dev_attr_store+0x18/0x2c
       sysfs_kf_write+0x44/0x54
       kernfs_fop_write_iter+0x118/0x1a8
       vfs_write+0x3a8/0x4bc
       ksys_write+0x6c/0xf8
       __arm64_sys_write+0x1c/0x28
       invoke_syscall+0x44/0x100
       el0_svc_common.constprop.0+0x40/0xe0
       do_el0_svc+0x1c/0x28
       el0_svc+0x30/0xd0
       el0t_64_sync_handler+0xc8/0xcc
       el0t_64_sync+0x198/0x19c
      Code: f9407be0 b5fff320 d4210000 17ffff97 (d4210000)
      ---[ end trace 0000000000000000 ]---
    
    Link: https://lkml.kernel.org/r/20250217014329.3610326-4-mawupeng1@huawei.com
    Fixes: b15c87263a69 ("hwpoison, memory_hotplug: allow hwpoisoned pages to be offlined")
    Signed-off-by: Ma Wupeng <mawupeng1@huawei.com>
    Acked-by: David Hildenbrand <david@redhat.com>
    Acked-by: Miaohe Lin <linmiaohe@huawei.com>
    Cc: Michal Hocko <mhocko@suse.com>
    Cc: Naoya Horiguchi <nao.horiguchi@gmail.com>
    Cc: Oscar Salvador <osalvador@suse.de>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Xiangyu Chen <xiangyu.chen@windriver.com>
    Signed-off-by: He Zhe <zhe.he@windriver.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

iio: adc: ad7266: Fix potential timestamp alignment issue. [+ + +]

Author: Jonathan Cameron <Jonathan.Cameron@huawei.com>
Date:   Sun Apr 13 11:34:24 2025 +0100

    iio: adc: ad7266: Fix potential timestamp alignment issue.
    
    [ Upstream commit 52d349884738c346961e153f195f4c7fe186fcf4 ]
    
    On architectures where an s64 is only 32-bit aligned insufficient padding
    would be left between the earlier elements and the timestamp. Use
    aligned_s64 to enforce the correct placement and ensure the storage is
    large enough.
    
    Fixes: 54e018da3141 ("iio:ad7266: Mark transfer buffer as __be16") # aligned_s64 is much newer.
    Reported-by: David Lechner <dlechner@baylibre.com>
    Reviewed-by: Nuno Sá <nuno.sa@analog.com>
    Reviewed-by: David Lechner <dlechner@baylibre.com>
    Link: https://patch.msgid.link/20250413103443.2420727-2-jic23@kernel.org
    Cc: <Stable@vger.kernel.org>
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iio: adc: ad7768-1: Fix insufficient alignment of timestamp. [+ + +]

Author: Jonathan Cameron <Jonathan.Cameron@huawei.com>
Date:   Sun Apr 13 11:34:25 2025 +0100

    iio: adc: ad7768-1: Fix insufficient alignment of timestamp.
    
    [ Upstream commit ffbc26bc91c1f1eb3dcf5d8776e74cbae21ee13a ]
    
    On architectures where an s64 is not 64-bit aligned, this may result
    insufficient alignment of the timestamp and the structure being too small.
    Use aligned_s64 to force the alignment.
    
    Fixes: a1caeebab07e ("iio: adc: ad7768-1: Fix too small buffer passed to iio_push_to_buffers_with_timestamp()") # aligned_s64 newer
    Reported-by: David Lechner <dlechner@baylibre.com>
    Reviewed-by: Nuno Sá <nuno.sa@analog.com>
    Reviewed-by: David Lechner <dlechner@baylibre.com>
    Link: https://patch.msgid.link/20250413103443.2420727-3-jic23@kernel.org
    Cc: <Stable@vger.kernel.org>
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iio: chemical: sps30: use aligned_s64 for timestamp [+ + +]

Author: David Lechner <dlechner@baylibre.com>
Date:   Thu Apr 17 11:52:37 2025 -0500

    iio: chemical: sps30: use aligned_s64 for timestamp
    
    [ Upstream commit bb49d940344bcb8e2b19e69d7ac86f567887ea9a ]
    
    Follow the pattern of other drivers and use aligned_s64 for the
    timestamp. This will ensure that the timestamp is correctly aligned on
    all architectures.
    
    Fixes: a5bf6fdd19c3 ("iio:chemical:sps30: Fix timestamp alignment")
    Signed-off-by: David Lechner <dlechner@baylibre.com>
    Reviewed-by: Nuno Sá <nuno.sa@analog.com>
    Link: https://patch.msgid.link/20250417-iio-more-timestamp-alignment-v1-5-eafac1e22318@baylibre.com
    Cc: <Stable@vger.kernel.org>
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

KVM: SVM: Forcibly leave SMM mode on SHUTDOWN interception [+ + +]

Author: Mikhail Lobanov <m.lobanov@rosa.ru>
Date:   Mon Apr 14 20:12:06 2025 +0300

    KVM: SVM: Forcibly leave SMM mode on SHUTDOWN interception
    
    [ Upstream commit a2620f8932fa9fdabc3d78ed6efb004ca409019f ]
    
    Previously, commit ed129ec9057f ("KVM: x86: forcibly leave nested mode
    on vCPU reset") addressed an issue where a triple fault occurring in
    nested mode could lead to use-after-free scenarios. However, the commit
    did not handle the analogous situation for System Management Mode (SMM).
    
    This omission results in triggering a WARN when KVM forces a vCPU INIT
    after SHUTDOWN interception while the vCPU is in SMM. This situation was
    reprodused using Syzkaller by:
    
      1) Creating a KVM VM and vCPU
      2) Sending a KVM_SMI ioctl to explicitly enter SMM
      3) Executing invalid instructions causing consecutive exceptions and
         eventually a triple fault
    
    The issue manifests as follows:
    
      WARNING: CPU: 0 PID: 25506 at arch/x86/kvm/x86.c:12112
      kvm_vcpu_reset+0x1d2/0x1530 arch/x86/kvm/x86.c:12112
      Modules linked in:
      CPU: 0 PID: 25506 Comm: syz-executor.0 Not tainted
      6.1.130-syzkaller-00157-g164fe5dde9b6 #0
      Hardware name: QEMU Standard PC (i440FX + PIIX, 1996),
      BIOS 1.12.0-1 04/01/2014
      RIP: 0010:kvm_vcpu_reset+0x1d2/0x1530 arch/x86/kvm/x86.c:12112
      Call Trace:
       <TASK>
       shutdown_interception+0x66/0xb0 arch/x86/kvm/svm/svm.c:2136
       svm_invoke_exit_handler+0x110/0x530 arch/x86/kvm/svm/svm.c:3395
       svm_handle_exit+0x424/0x920 arch/x86/kvm/svm/svm.c:3457
       vcpu_enter_guest arch/x86/kvm/x86.c:10959 [inline]
       vcpu_run+0x2c43/0x5a90 arch/x86/kvm/x86.c:11062
       kvm_arch_vcpu_ioctl_run+0x50f/0x1cf0 arch/x86/kvm/x86.c:11283
       kvm_vcpu_ioctl+0x570/0xf00 arch/x86/kvm/../../../virt/kvm/kvm_main.c:4122
       vfs_ioctl fs/ioctl.c:51 [inline]
       __do_sys_ioctl fs/ioctl.c:870 [inline]
       __se_sys_ioctl fs/ioctl.c:856 [inline]
       __x64_sys_ioctl+0x19a/0x210 fs/ioctl.c:856
       do_syscall_x64 arch/x86/entry/common.c:51 [inline]
       do_syscall_64+0x35/0x80 arch/x86/entry/common.c:81
       entry_SYSCALL_64_after_hwframe+0x6e/0xd8
    
    Architecturally, INIT is blocked when the CPU is in SMM, hence KVM's WARN()
    in kvm_vcpu_reset() to guard against KVM bugs, e.g. to detect improper
    emulation of INIT.  SHUTDOWN on SVM is a weird edge case where KVM needs to
    do _something_ sane with the VMCB, since it's technically undefined, and
    INIT is the least awful choice given KVM's ABI.
    
    So, double down on stuffing INIT on SHUTDOWN, and force the vCPU out of
    SMM to avoid any weirdness (and the WARN).
    
    Found by Linux Verification Center (linuxtesting.org) with Syzkaller.
    
    Fixes: ed129ec9057f ("KVM: x86: forcibly leave nested mode on vCPU reset")
    Cc: stable@vger.kernel.org
    Suggested-by: Sean Christopherson <seanjc@google.com>
    Signed-off-by: Mikhail Lobanov <m.lobanov@rosa.ru>
    Link: https://lore.kernel.org/r/20250414171207.155121-1-m.lobanov@rosa.ru
    [sean: massage changelog, make it clear this isn't architectural behavior]
    Signed-off-by: Sean Christopherson <seanjc@google.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

KVM: SVM: Update SEV-ES shutdown intercepts with more metadata [+ + +]

Author: Peter Gonda <pgonda@google.com>
Date:   Thu Sep 7 09:24:49 2023 -0700

    KVM: SVM: Update SEV-ES shutdown intercepts with more metadata
    
    [ Upstream commit bc3d7c5570a03ab45bde4bae83697c80900fb714 ]
    
    Currently if an SEV-ES VM shuts down userspace sees KVM_RUN struct with
    only errno=EINVAL. This is a very limited amount of information to debug
    the situation. Instead return KVM_EXIT_SHUTDOWN to alert userspace the VM
    is shutting down and is not usable any further.
    
    Signed-off-by: Peter Gonda <pgonda@google.com>
    Suggested-by: Sean Christopherson <seanjc@google.com>
    Suggested-by: Tom Lendacky <thomas.lendacky@amd.com>
    Cc: Paolo Bonzini <pbonzini@redhat.com>
    Cc: Sean Christopherson <seanjc@google.com>
    Cc: Tom Lendacky <thomas.lendacky@amd.com>
    Cc: Joerg Roedel <joro@8bytes.org>
    Cc: Borislav Petkov <bp@alien8.de>
    Cc: x86@kernel.org
    Cc: kvm@vger.kernel.org
    Cc: linux-kernel@vger.kernel.org
    Reviewed-by: Tom Lendacky <thomas.lendacky@amd.com>
    Link: https://lore.kernel.org/r/20230907162449.1739785-1-pgonda@google.com
    [sean: tweak changelog]
    Signed-off-by: Sean Christopherson <seanjc@google.com>
    Stable-dep-of: a2620f8932fa ("KVM: SVM: Forcibly leave SMM mode on SHUTDOWN interception")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Linux: Linux 6.6.92 [+ + +]

Author: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date:   Thu May 22 14:12:26 2025 +0200

    Linux 6.6.92
    
    Link: https://lore.kernel.org/r/20250520125803.981048184@linuxfoundation.org
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Tested-by: Miguel Ojeda <ojeda@kernel.org>
    Tested-by: Shuah Khan <skhan@linuxfoundation.org>
    Tested-by: Ron Economos <re@w6rz.net>
    Tested-by: Jon Hunter <jonathanh@nvidia.com>
    Tested-by: Linux Kernel Functional Testing <lkft@linaro.org>
    Tested-by: Mark Brown <broonie@kernel.org>
    Tested-by: Peter Schneider <pschneider1968@googlemail.com>
    Tested-by: Hardik Garg <hargar@linux.microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: Explicitly specify code model in Makefile [+ + +]

Author: Huacai Chen <chenhuacai@kernel.org>
Date:   Fri Nov 22 15:47:47 2024 +0800

    LoongArch: Explicitly specify code model in Makefile
    
    commit e67e0eb6a98b261caf45048f9eb95fd7609289c0 upstream.
    
    LoongArch's toolchain may change the default code model from normal to
    medium. This is unnecessary for kernel, and generates some relocations
    which cannot be handled by the module loader. So explicitly specify the
    code model to normal in Makefile (for Rust 'normal' is 'small').
    
    Cc: stable@vger.kernel.org
    Tested-by: Haiyong Sun <sunhaiyong@loongson.cn>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: Fix MAX_REG_OFFSET calculation [+ + +]

Author: Huacai Chen <chenhuacai@kernel.org>
Date:   Wed May 14 22:17:43 2025 +0800

    LoongArch: Fix MAX_REG_OFFSET calculation
    
    commit 90436d234230e9a950ccd87831108b688b27a234 upstream.
    
    Fix MAX_REG_OFFSET calculation, make it point to the last register
    in 'struct pt_regs' and not to the marker itself, which could allow
    regs_get_register() to return an invalid offset.
    
    Cc: stable@vger.kernel.org
    Fixes: 803b0fc5c3f2baa6e5 ("LoongArch: Add process management")
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: Prevent cond_resched() occurring within kernel-fpu [+ + +]

Author: Tianyang Zhang <zhangtianyang@loongson.cn>
Date:   Wed May 14 22:17:43 2025 +0800

    LoongArch: Prevent cond_resched() occurring within kernel-fpu
    
    commit 2468b0e3d5659dfde77f081f266e1111a981efb8 upstream.
    
    When CONFIG_PREEMPT_COUNT is not configured (i.e. CONFIG_PREEMPT_NONE/
    CONFIG_PREEMPT_VOLUNTARY), preempt_disable() / preempt_enable() merely
    acts as a barrier(). However, in these cases cond_resched() can still
    trigger a context switch and modify the CSR.EUEN, resulting in do_fpu()
    exception being activated within the kernel-fpu critical sections, as
    demonstrated in the following path:
    
    dcn32_calculate_wm_and_dlg()
        DC_FP_START()
            dcn32_calculate_wm_and_dlg_fpu()
                dcn32_find_dummy_latency_index_for_fw_based_mclk_switch()
                    dcn32_internal_validate_bw()
                        dcn32_enable_phantom_stream()
                            dc_create_stream_for_sink()
                               kzalloc(GFP_KERNEL)
                                    __kmem_cache_alloc_node()
                                        __cond_resched()
        DC_FP_END()
    
    This patch is similar to commit d02198550423a0b (x86/fpu: Improve crypto
    performance by making kernel-mode FPU reliably usable in softirqs).  It
    uses local_bh_disable() instead of preempt_disable() for non-RT kernels
    so it can avoid the cond_resched() issue, and also extend the kernel-fpu
    application scenarios to the softirq context.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Tianyang Zhang <zhangtianyang@loongson.cn>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: Save and restore CSR.CNTC for hibernation [+ + +]

Author: Huacai Chen <chenhuacai@kernel.org>
Date:   Wed May 14 22:17:52 2025 +0800

    LoongArch: Save and restore CSR.CNTC for hibernation
    
    commit ceb9155d058a11242aa0572875c44e9713b1a2be upstream.
    
    Save and restore CSR.CNTC for hibernation which is similar to suspend.
    
    For host this is unnecessary because sched clock is ensured continuous,
    but for kvm guest sched clock isn't enough because rdtime.d should also
    be continuous.
    
    Host::rdtime.d = Host::CSR.CNTC + counter
    Guest::rdtime.d = Host::CSR.CNTC + Host::CSR.GCNTC + Guest::CSR.CNTC + counter
    
    so,
    
    Guest::rdtime.d = Host::rdtime.d + Host::CSR.GCNTC + Guest::CSR.CNTC
    
    To ensure Guest::rdtime.d continuous, Host::rdtime.d should be at first
    continuous, while Host::CSR.GCNTC / Guest::CSR.CNTC is maintained by KVM.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Xianglai Li <lixianglai@loongson.cn>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: uprobes: Remove redundant code about resume_era [+ + +]

Author: Tiezhu Yang <yangtiezhu@loongson.cn>
Date:   Wed May 14 22:18:10 2025 +0800

    LoongArch: uprobes: Remove redundant code about resume_era
    
    commit 12614f794274f63fbdfe76771b2b332077d63848 upstream.
    
    arch_uprobe_skip_sstep() returns true if instruction was emulated, that
    is to say, there is no need to single step for the emulated instructions.
    regs->csr_era will point to the destination address directly after the
    exception, so the resume_era related code is redundant, just remove them.
    
    Cc: stable@vger.kernel.org
    Fixes: 19bc6cb64092 ("LoongArch: Add uprobes support")
    Signed-off-by: Tiezhu Yang <yangtiezhu@loongson.cn>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: uprobes: Remove user_{en,dis}able_single_step() [+ + +]

Author: Tiezhu Yang <yangtiezhu@loongson.cn>
Date:   Wed May 14 22:18:10 2025 +0800

    LoongArch: uprobes: Remove user_{en,dis}able_single_step()
    
    commit 0b326b2371f94e798137cc1a3c5c2eef2bc69061 upstream.
    
    When executing the "perf probe" and "perf stat" test cases about some
    cryptographic algorithm, the output shows that "Trace/breakpoint trap".
    This is because it uses the software singlestep breakpoint for uprobes
    on LoongArch, and no need to use the hardware singlestep. So just remove
    the related function call to user_{en,dis}able_single_step() for uprobes
    on LoongArch.
    
    How to reproduce:
    
    Please make sure CONFIG_UPROBE_EVENTS is set and openssl supports sm2
    algorithm, then execute the following command.
    
    cd tools/perf && make
    ./perf probe -x /usr/lib64/libcrypto.so BN_mod_mul_montgomery
    ./perf stat -e probe_libcrypto:BN_mod_mul_montgomery openssl speed sm2
    
    Cc: stable@vger.kernel.org
    Fixes: 19bc6cb64092 ("LoongArch: Add uprobes support")
    Signed-off-by: Tiezhu Yang <yangtiezhu@loongson.cn>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mctp: no longer rely on net->dev_index_head[] [+ + +]

Author: Eric Dumazet <edumazet@google.com>
Date:   Fri Dec 6 22:38:11 2024 +0000

    mctp: no longer rely on net->dev_index_head[]
    
    [ Upstream commit 2d20773aec14996b6cc4db92d885028319be683d ]
    
    mctp_dump_addrinfo() is one of the last users of
    net->dev_index_head[] in the control path.
    
    Switch to for_each_netdev_dump() for better scalability.
    
    Use C99 for mctp_device_rtnl_msg_handlers[] to prepare
    future RTNL removal from mctp_dump_addrinfo()
    
    (mdev->addrs is not yet RCU protected)
    
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Cc: Matt Johnston <matt@codeconstruct.com.au>
    Reviewed-by: Kuniyuki Iwashima <kuniyu@amazon.com>
    Acked-by: Jeremy Kerr <jk@codeconstruct.com.au>
    Link: https://patch.msgid.link/20241206223811.1343076-1-edumazet@google.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Stable-dep-of: f11cf946c0a9 ("net: mctp: Don't access ifa_index when missing")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

memblock: Accept allocated memory before use in memblock_double_array() [+ + +]

Author: Tom Lendacky <thomas.lendacky@amd.com>
Date:   Thu May 8 12:24:10 2025 -0500

    memblock: Accept allocated memory before use in memblock_double_array()
    
    commit da8bf5daa5e55a6af2b285ecda460d6454712ff4 upstream.
    
    When increasing the array size in memblock_double_array() and the slab
    is not yet available, a call to memblock_find_in_range() is used to
    reserve/allocate memory. However, the range returned may not have been
    accepted, which can result in a crash when booting an SNP guest:
    
      RIP: 0010:memcpy_orig+0x68/0x130
      Code: ...
      RSP: 0000:ffffffff9cc03ce8 EFLAGS: 00010006
      RAX: ff11001ff83e5000 RBX: 0000000000000000 RCX: fffffffffffff000
      RDX: 0000000000000bc0 RSI: ffffffff9dba8860 RDI: ff11001ff83e5c00
      RBP: 0000000000002000 R08: 0000000000000000 R09: 0000000000002000
      R10: 000000207fffe000 R11: 0000040000000000 R12: ffffffff9d06ef78
      R13: ff11001ff83e5000 R14: ffffffff9dba7c60 R15: 0000000000000c00
      memblock_double_array+0xff/0x310
      memblock_add_range+0x1fb/0x2f0
      memblock_reserve+0x4f/0xa0
      memblock_alloc_range_nid+0xac/0x130
      memblock_alloc_internal+0x53/0xc0
      memblock_alloc_try_nid+0x3d/0xa0
      swiotlb_init_remap+0x149/0x2f0
      mem_init+0xb/0xb0
      mm_core_init+0x8f/0x350
      start_kernel+0x17e/0x5d0
      x86_64_start_reservations+0x14/0x30
      x86_64_start_kernel+0x92/0xa0
      secondary_startup_64_no_verify+0x194/0x19b
    
    Mitigate this by calling accept_memory() on the memory range returned
    before the slab is available.
    
    Prior to v6.12, the accept_memory() interface used a 'start' and 'end'
    parameter instead of 'start' and 'size', therefore the accept_memory()
    call must be adjusted to specify 'start + size' for 'end' when applying
    to kernels prior to v6.12.
    
    Cc: stable@vger.kernel.org # see patch description, needs adjustments for <= 6.11
    Fixes: dcdfdd40fa82 ("mm: Add support for unaccepted memory")
    Signed-off-by: Tom Lendacky <thomas.lendacky@amd.com>
    Link: https://lore.kernel.org/r/da1ac73bf4ded761e21b4e4bb5178382a580cd73.1746725050.git.thomas.lendacky@amd.com
    Signed-off-by: Mike Rapoport (Microsoft) <rppt@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mlxsw: spectrum_router: Fix use-after-free when deleting GRE net devices [+ + +]

Author: Ido Schimmel <idosch@nvidia.com>
Date:   Wed May 14 14:48:05 2025 +0200

    mlxsw: spectrum_router: Fix use-after-free when deleting GRE net devices
    
    [ Upstream commit 92ec4855034b2c4d13f117558dc73d20581fa9ff ]
    
    The driver only offloads neighbors that are constructed on top of net
    devices registered by it or their uppers (which are all Ethernet). The
    device supports GRE encapsulation and decapsulation of forwarded
    traffic, but the driver will not offload dummy neighbors constructed on
    top of GRE net devices as they are not uppers of its net devices:
    
     # ip link add name gre1 up type gre tos inherit local 192.0.2.1 remote 198.51.100.1
     # ip neigh add 0.0.0.0 lladdr 0.0.0.0 nud noarp dev gre1
     $ ip neigh show dev gre1 nud noarp
     0.0.0.0 lladdr 0.0.0.0 NOARP
    
    (Note that the neighbor is not marked with 'offload')
    
    When the driver is reloaded and the existing configuration is replayed,
    the driver does not perform the same check regarding existing neighbors
    and offloads the previously added one:
    
     # devlink dev reload pci/0000:01:00.0
     $ ip neigh show dev gre1 nud noarp
     0.0.0.0 lladdr 0.0.0.0 offload NOARP
    
    If the neighbor is later deleted, the driver will ignore the
    notification (given the GRE net device is not its upper) and will
    therefore keep referencing freed memory, resulting in a use-after-free
    [1] when the net device is deleted:
    
     # ip neigh del 0.0.0.0 lladdr 0.0.0.0 dev gre1
     # ip link del dev gre1
    
    Fix by skipping neighbor replay if the net device for which the replay
    is performed is not our upper.
    
    [1]
    BUG: KASAN: slab-use-after-free in mlxsw_sp_neigh_entry_update+0x1ea/0x200
    Read of size 8 at addr ffff888155b0e420 by task ip/2282
    [...]
    Call Trace:
     <TASK>
     dump_stack_lvl+0x6f/0xa0
     print_address_description.constprop.0+0x6f/0x350
     print_report+0x108/0x205
     kasan_report+0xdf/0x110
     mlxsw_sp_neigh_entry_update+0x1ea/0x200
     mlxsw_sp_router_rif_gone_sync+0x2a8/0x440
     mlxsw_sp_rif_destroy+0x1e9/0x750
     mlxsw_sp_netdevice_ipip_ol_event+0x3c9/0xdc0
     mlxsw_sp_router_netdevice_event+0x3ac/0x15e0
     notifier_call_chain+0xca/0x150
     call_netdevice_notifiers_info+0x7f/0x100
     unregister_netdevice_many_notify+0xc8c/0x1d90
     rtnl_dellink+0x34e/0xa50
     rtnetlink_rcv_msg+0x6fb/0xb70
     netlink_rcv_skb+0x131/0x360
     netlink_unicast+0x426/0x710
     netlink_sendmsg+0x75a/0xc20
     __sock_sendmsg+0xc1/0x150
     ____sys_sendmsg+0x5aa/0x7b0
     ___sys_sendmsg+0xfc/0x180
     __sys_sendmsg+0x121/0x1b0
     do_syscall_64+0xbb/0x1d0
     entry_SYSCALL_64_after_hwframe+0x4b/0x53
    
    Fixes: 8fdb09a7674c ("mlxsw: spectrum_router: Replay neighbours when RIF is made")
    Signed-off-by: Ido Schimmel <idosch@nvidia.com>
    Reviewed-by: Petr Machata <petrm@nvidia.com>
    Signed-off-by: Petr Machata <petrm@nvidia.com>
    Link: https://patch.msgid.link/c53c02c904fde32dad484657be3b1477884e9ad6.1747225701.git.petrm@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

mm/migrate: correct nr_failed in migrate_pages_sync() [+ + +]

Author: Zi Yan <ziy@nvidia.com>
Date:   Tue Oct 17 12:31:28 2023 -0400

    mm/migrate: correct nr_failed in migrate_pages_sync()
    
    commit a259945efe6ada94087ef666e9b38f8e34ea34ba upstream.
    
    nr_failed was missing the large folio splits from migrate_pages_batch()
    and can cause a mismatch between migrate_pages() return value and the
    number of not migrated pages, i.e., when the return value of
    migrate_pages() is 0, there are still pages left in the from page list.
    It will happen when a non-PMD THP large folio fails to migrate due to
    -ENOMEM and is split successfully but not all the split pages are not
    migrated, migrate_pages_batch() would return non-zero, but
    astats.nr_thp_split = 0.  nr_failed would be 0 and returned to the caller
    of migrate_pages(), but the not migrated pages are left in the from page
    list without being added back to LRU lists.
    
    Fix it by adding a new nr_split counter for large folio splits and adding
    it to nr_failed in migrate_page_sync() after migrate_pages_batch() is
    done.
    
    Link: https://lkml.kernel.org/r/20231017163129.2025214-1-zi.yan@sent.com
    Fixes: 2ef7dbb26990 ("migrate_pages: try migrate in batch asynchronously firstly")
    Signed-off-by: Zi Yan <ziy@nvidia.com>
    Acked-by: Huang Ying <ying.huang@intel.com>
    Reviewed-by: Baolin Wang <baolin.wang@linux.alibaba.com>
    Cc: David Hildenbrand <david@redhat.com>
    Cc: Matthew Wilcox <willy@infradead.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/page_alloc: fix race condition in unaccepted memory handling [+ + +]

Author: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
Date:   Tue May 6 16:32:07 2025 +0300

    mm/page_alloc: fix race condition in unaccepted memory handling
    
    commit fefc075182275057ce607effaa3daa9e6e3bdc73 upstream.
    
    The page allocator tracks the number of zones that have unaccepted memory
    using static_branch_enc/dec() and uses that static branch in hot paths to
    determine if it needs to deal with unaccepted memory.
    
    Borislav and Thomas pointed out that the tracking is racy: operations on
    static_branch are not serialized against adding/removing unaccepted pages
    to/from the zone.
    
    Sanity checks inside static_branch machinery detects it:
    
    WARNING: CPU: 0 PID: 10 at kernel/jump_label.c:276 __static_key_slow_dec_cpuslocked+0x8e/0xa0
    
    The comment around the WARN() explains the problem:
    
            /*
             * Warn about the '-1' case though; since that means a
             * decrement is concurrent with a first (0->1) increment. IOW
             * people are trying to disable something that wasn't yet fully
             * enabled. This suggests an ordering problem on the user side.
             */
    
    The effect of this static_branch optimization is only visible on
    microbenchmark.
    
    Instead of adding more complexity around it, remove it altogether.
    
    Link: https://lkml.kernel.org/r/20250506133207.1009676-1-kirill.shutemov@linux.intel.com
    Signed-off-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
    Fixes: dcdfdd40fa82 ("mm: Add support for unaccepted memory")
    Link: https://lore.kernel.org/all/20250506092445.GBaBnVXXyvnazly6iF@fat_crate.local
    Reported-by: Borislav Petkov <bp@alien8.de>
    Tested-by: Borislav Petkov (AMD) <bp@alien8.de>
    Reported-by: Thomas Gleixner <tglx@linutronix.de>
    Cc: Vlastimil Babka <vbabka@suse.cz>
    Cc: Suren Baghdasaryan <surenb@google.com>
    Cc: Michal Hocko <mhocko@suse.com>
    Cc: Brendan Jackman <jackmanb@google.com>
    Cc: Johannes Weiner <hannes@cmpxchg.org>
    Cc: <stable@vger.kernel.org>    [6.5+]
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net/mlx5e: Disable MACsec offload for uplink representor profile [+ + +]

Author: Carolina Jubran <cjubran@nvidia.com>
Date:   Sun May 11 13:15:52 2025 +0300

    net/mlx5e: Disable MACsec offload for uplink representor profile
    
    [ Upstream commit 588431474eb7572e57a927fa8558c9ba2f8af143 ]
    
    MACsec offload is not supported in switchdev mode for uplink
    representors. When switching to the uplink representor profile, the
    MACsec offload feature must be cleared from the netdevice's features.
    
    If left enabled, attempts to add offloads result in a null pointer
    dereference, as the uplink representor does not support MACsec offload
    even though the feature bit remains set.
    
    Clear NETIF_F_HW_MACSEC in mlx5e_fix_uplink_rep_features().
    
    Kernel log:
    
    Oops: general protection fault, probably for non-canonical address 0xdffffc000000000f: 0000 [#1] SMP KASAN
    KASAN: null-ptr-deref in range [0x0000000000000078-0x000000000000007f]
    CPU: 29 UID: 0 PID: 4714 Comm: ip Not tainted 6.14.0-rc4_for_upstream_debug_2025_03_02_17_35 #1
    Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS rel-1.16.0-0-gd239552ce722-prebuilt.qemu.org 04/01/2014
    RIP: 0010:__mutex_lock+0x128/0x1dd0
    Code: d0 7c 08 84 d2 0f 85 ad 15 00 00 8b 35 91 5c fe 03 85 f6 75 29 49 8d 7e 60 48 b8 00 00 00 00 00 fc ff df 48 89 fa 48 c1 ea 03 <80> 3c 02 00 0f 85 a6 15 00 00 4d 3b 76 60 0f 85 fd 0b 00 00 65 ff
    RSP: 0018:ffff888147a4f160 EFLAGS: 00010206
    RAX: dffffc0000000000 RBX: 0000000000000000 RCX: 0000000000000001
    RDX: 000000000000000f RSI: 0000000000000000 RDI: 0000000000000078
    RBP: ffff888147a4f2e0 R08: ffffffffa05d2c19 R09: 0000000000000000
    R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000000
    R13: dffffc0000000000 R14: 0000000000000018 R15: ffff888152de0000
    FS:  00007f855e27d800(0000) GS:ffff88881ee80000(0000) knlGS:0000000000000000
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: 00000000004e5768 CR3: 000000013ae7c005 CR4: 0000000000372eb0
    DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
    Call Trace:
     <TASK>
     ? die_addr+0x3d/0xa0
     ? exc_general_protection+0x144/0x220
     ? asm_exc_general_protection+0x22/0x30
     ? mlx5e_macsec_add_secy+0xf9/0x700 [mlx5_core]
     ? __mutex_lock+0x128/0x1dd0
     ? lockdep_set_lock_cmp_fn+0x190/0x190
     ? mlx5e_macsec_add_secy+0xf9/0x700 [mlx5_core]
     ? mutex_lock_io_nested+0x1ae0/0x1ae0
     ? lock_acquire+0x1c2/0x530
     ? macsec_upd_offload+0x145/0x380
     ? lockdep_hardirqs_on_prepare+0x400/0x400
     ? kasan_save_stack+0x30/0x40
     ? kasan_save_stack+0x20/0x40
     ? kasan_save_track+0x10/0x30
     ? __kasan_kmalloc+0x77/0x90
     ? __kmalloc_noprof+0x249/0x6b0
     ? genl_family_rcv_msg_attrs_parse.constprop.0+0xb5/0x240
     ? mlx5e_macsec_add_secy+0xf9/0x700 [mlx5_core]
     mlx5e_macsec_add_secy+0xf9/0x700 [mlx5_core]
     ? mlx5e_macsec_add_rxsa+0x11a0/0x11a0 [mlx5_core]
     macsec_update_offload+0x26c/0x820
     ? macsec_set_mac_address+0x4b0/0x4b0
     ? lockdep_hardirqs_on_prepare+0x284/0x400
     ? _raw_spin_unlock_irqrestore+0x47/0x50
     macsec_upd_offload+0x2c8/0x380
     ? macsec_update_offload+0x820/0x820
     ? __nla_parse+0x22/0x30
     ? genl_family_rcv_msg_attrs_parse.constprop.0+0x15e/0x240
     genl_family_rcv_msg_doit+0x1cc/0x2a0
     ? genl_family_rcv_msg_attrs_parse.constprop.0+0x240/0x240
     ? cap_capable+0xd4/0x330
     genl_rcv_msg+0x3ea/0x670
     ? genl_family_rcv_msg_dumpit+0x2a0/0x2a0
     ? lockdep_set_lock_cmp_fn+0x190/0x190
     ? macsec_update_offload+0x820/0x820
     netlink_rcv_skb+0x12b/0x390
     ? genl_family_rcv_msg_dumpit+0x2a0/0x2a0
     ? netlink_ack+0xd80/0xd80
     ? rwsem_down_read_slowpath+0xf90/0xf90
     ? netlink_deliver_tap+0xcd/0xac0
     ? netlink_deliver_tap+0x155/0xac0
     ? _copy_from_iter+0x1bb/0x12c0
     genl_rcv+0x24/0x40
     netlink_unicast+0x440/0x700
     ? netlink_attachskb+0x760/0x760
     ? lock_acquire+0x1c2/0x530
     ? __might_fault+0xbb/0x170
     netlink_sendmsg+0x749/0xc10
     ? netlink_unicast+0x700/0x700
     ? __might_fault+0xbb/0x170
     ? netlink_unicast+0x700/0x700
     __sock_sendmsg+0xc5/0x190
     ____sys_sendmsg+0x53f/0x760
     ? import_iovec+0x7/0x10
     ? kernel_sendmsg+0x30/0x30
     ? __copy_msghdr+0x3c0/0x3c0
     ? filter_irq_stacks+0x90/0x90
     ? stack_depot_save_flags+0x28/0xa30
     ___sys_sendmsg+0xeb/0x170
     ? kasan_save_stack+0x30/0x40
     ? copy_msghdr_from_user+0x110/0x110
     ? do_syscall_64+0x6d/0x140
     ? lock_acquire+0x1c2/0x530
     ? __virt_addr_valid+0x116/0x3b0
     ? __virt_addr_valid+0x1da/0x3b0
     ? lock_downgrade+0x680/0x680
     ? __delete_object+0x21/0x50
     __sys_sendmsg+0xf7/0x180
     ? __sys_sendmsg_sock+0x20/0x20
     ? kmem_cache_free+0x14c/0x4e0
     ? __x64_sys_close+0x78/0xd0
     do_syscall_64+0x6d/0x140
     entry_SYSCALL_64_after_hwframe+0x4b/0x53
    RIP: 0033:0x7f855e113367
    Code: 0e 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b9 0f 1f 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 2e 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 51 c3 48 83 ec 28 89 54 24 1c 48 89 74 24 10
    RSP: 002b:00007ffd15e90c88 EFLAGS: 00000246 ORIG_RAX: 000000000000002e
    RAX: ffffffffffffffda RBX: 0000000000000002 RCX: 00007f855e113367
    RDX: 0000000000000000 RSI: 00007ffd15e90cf0 RDI: 0000000000000004
    RBP: 00007ffd15e90dbc R08: 0000000000000028 R09: 000000000045d100
    R10: 00007f855e011dd8 R11: 0000000000000246 R12: 0000000000000019
    R13: 0000000067c6b785 R14: 00000000004a1e80 R15: 0000000000000000
     </TASK>
    Modules linked in: 8021q garp mrp sch_ingress openvswitch nsh mlx5_ib mlx5_fwctl mlx5_dpll mlx5_core rpcrdma rdma_ucm ib_iser libiscsi scsi_transport_iscsi ib_umad rdma_cm ib_ipoib iw_cm ib_cm ib_uverbs ib_core xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xt_addrtype iptable_nat nf_nat br_netfilter rpcsec_gss_krb5 auth_rpcgss oid_registry overlay zram zsmalloc fuse [last unloaded: mlx5_core]
    ---[ end trace 0000000000000000 ]---
    
    Fixes: 8ff0ac5be144 ("net/mlx5: Add MACsec offload Tx command support")
    Signed-off-by: Carolina Jubran <cjubran@nvidia.com>
    Reviewed-by: Shahar Shitrit <shshitrit@nvidia.com>
    Reviewed-by: Dragos Tatulea <dtatulea@nvidia.com>
    Signed-off-by: Tariq Toukan <tariqt@nvidia.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/1746958552-561295-1-git-send-email-tariqt@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net/tls: fix kernel panic when alloc_page failed [+ + +]

Author: Pengtao He <hept.hept.hept@gmail.com>
Date:   Wed May 14 21:20:13 2025 +0800

    net/tls: fix kernel panic when alloc_page failed
    
    [ Upstream commit 491deb9b8c4ad12fe51d554a69b8165b9ef9429f ]
    
    We cannot set frag_list to NULL pointer when alloc_page failed.
    It will be used in tls_strp_check_queue_ok when the next time
    tls_strp_read_sock is called.
    
    This is because we don't reset full_len in tls_strp_flush_anchor_copy()
    so the recv path will try to continue handling the partial record
    on the next call but we dettached the rcvq from the frag list.
    Alternative fix would be to reset full_len.
    
    Unable to handle kernel NULL pointer dereference
    at virtual address 0000000000000028
     Call trace:
     tls_strp_check_rcv+0x128/0x27c
     tls_strp_data_ready+0x34/0x44
     tls_data_ready+0x3c/0x1f0
     tcp_data_ready+0x9c/0xe4
     tcp_data_queue+0xf6c/0x12d0
     tcp_rcv_established+0x52c/0x798
    
    Fixes: 84c61fe1a75b ("tls: rx: do not use the standard strparser")
    Signed-off-by: Pengtao He <hept.hept.hept@gmail.com>
    Link: https://patch.msgid.link/20250514132013.17274-1-hept.hept.hept@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: cadence: macb: Fix a possible deadlock in macb_halt_tx. [+ + +]

Author: Mathieu Othacehe <othacehe@gnu.org>
Date:   Fri May 9 14:19:35 2025 +0200

    net: cadence: macb: Fix a possible deadlock in macb_halt_tx.
    
    [ Upstream commit c92d6089d8ad7d4d815ebcedee3f3907b539ff1f ]
    
    There is a situation where after THALT is set high, TGO stays high as
    well. Because jiffies are never updated, as we are in a context with
    interrupts disabled, we never exit that loop and have a deadlock.
    
    That deadlock was noticed on a sama5d4 device that stayed locked for days.
    
    Use retries instead of jiffies so that the timeout really works and we do
    not have a deadlock anymore.
    
    Fixes: e86cd53afc590 ("net/macb: better manage tx errors")
    Signed-off-by: Mathieu Othacehe <othacehe@gnu.org>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250509121935.16282-1-othacehe@gnu.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: sja1105: discard incoming frames in BR_STATE_LISTENING [+ + +]

Author: Vladimir Oltean <vladimir.oltean@nxp.com>
Date:   Fri May 9 14:38:16 2025 +0300

    net: dsa: sja1105: discard incoming frames in BR_STATE_LISTENING
    
    [ Upstream commit 498625a8ab2c8e1c9ab5105744310e8d6952cc01 ]
    
    It has been reported that when under a bridge with stp_state=1, the logs
    get spammed with this message:
    
    [  251.734607] fsl_dpaa2_eth dpni.5 eth0: Couldn't decode source port
    
    Further debugging shows the following info associated with packets:
    source_port=-1, switch_id=-1, vid=-1, vbid=1
    
    In other words, they are data plane packets which are supposed to be
    decoded by dsa_tag_8021q_find_port_by_vbid(), but the latter (correctly)
    refuses to do so, because no switch port is currently in
    BR_STATE_LEARNING or BR_STATE_FORWARDING - so the packet is effectively
    unexpected.
    
    The error goes away after the port progresses to BR_STATE_LEARNING in 15
    seconds (the default forward_time of the bridge), because then,
    dsa_tag_8021q_find_port_by_vbid() can correctly associate the data plane
    packets with a plausible bridge port in a plausible STP state.
    
    Re-reading IEEE 802.1D-1990, I see the following:
    
    "4.4.2 Learning: (...) The Forwarding Process shall discard received
    frames."
    
    IEEE 802.1D-2004 further clarifies:
    
    "DISABLED, BLOCKING, LISTENING, and BROKEN all correspond to the
    DISCARDING port state. While those dot1dStpPortStates serve to
    distinguish reasons for discarding frames, the operation of the
    Forwarding and Learning processes is the same for all of them. (...)
    LISTENING represents a port that the spanning tree algorithm has
    selected to be part of the active topology (computing a Root Port or
    Designated Port role) but is temporarily discarding frames to guard
    against loops or incorrect learning."
    
    Well, this is not what the driver does - instead it sets
    mac[port].ingress = true.
    
    To get rid of the log spam, prevent unexpected data plane packets to
    be received by software by discarding them on ingress in the LISTENING
    state.
    
    In terms of blame attribution: the prints only date back to commit
    d7f9787a763f ("net: dsa: tag_8021q: add support for imprecise RX based
    on the VBID"). However, the settings would permit a LISTENING port to
    forward to a FORWARDING port, and the standard suggests that's not OK.
    
    Fixes: 640f763f98c2 ("net: dsa: sja1105: Add support for Spanning Tree Protocol")
    Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com>
    Link: https://patch.msgid.link/20250509113816.2221992-1-vladimir.oltean@nxp.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ethernet: mtk_eth_soc: fix typo for declaration MT7988 ESW capability [+ + +]

Author: Bo-Cun Chen <bc-bocun.chen@mediatek.com>
Date:   Tue May 13 05:27:30 2025 +0100

    net: ethernet: mtk_eth_soc: fix typo for declaration MT7988 ESW capability
    
    [ Upstream commit 1bdea6fad6fb985ff13828373c48e337c4e939f9 ]
    
    Since MTK_ESW_BIT is a bit number rather than a bitmap, it causes
    MTK_HAS_CAPS to produce incorrect results. This leads to the ETH
    driver not declaring MAC capabilities correctly for the MT7988 ESW.
    
    Fixes: 445eb6448ed3 ("net: ethernet: mtk_eth_soc: add basic support for MT7988 SoC")
    Signed-off-by: Bo-Cun Chen <bc-bocun.chen@mediatek.com>
    Signed-off-by: Daniel Golle <daniel@makrotopia.org>
    Reviewed-by: Michal Swiatkowski <michal.swiatkowski@linux.intel.com>
    Link: https://patch.msgid.link/b8b37f409d1280fad9c4d32521e6207f63cd3213.1747110258.git.daniel@makrotopia.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: mctp: Don't access ifa_index when missing [+ + +]

Author: Matt Johnston <matt@codeconstruct.com.au>
Date:   Thu May 8 13:18:32 2025 +0800

    net: mctp: Don't access ifa_index when missing
    
    [ Upstream commit f11cf946c0a92c560a890d68e4775723353599e1 ]
    
    In mctp_dump_addrinfo, ifa_index can be used to filter interfaces, but
    only when the struct ifaddrmsg is provided. Otherwise it will be
    comparing to uninitialised memory - reproducible in the syzkaller case from
    dhcpd, or busybox "ip addr show".
    
    The kernel MCTP implementation has always filtered by ifa_index, so
    existing userspace programs expecting to dump MCTP addresses must
    already be passing a valid ifa_index value (either 0 or a real index).
    
    BUG: KMSAN: uninit-value in mctp_dump_addrinfo+0x208/0xac0 net/mctp/device.c:128
     mctp_dump_addrinfo+0x208/0xac0 net/mctp/device.c:128
     rtnl_dump_all+0x3ec/0x5b0 net/core/rtnetlink.c:4380
     rtnl_dumpit+0xd5/0x2f0 net/core/rtnetlink.c:6824
     netlink_dump+0x97b/0x1690 net/netlink/af_netlink.c:2309
    
    Fixes: 583be982d934 ("mctp: Add device handling and netlink interface")
    Reported-by: syzbot+e76d52dadc089b9d197f@syzkaller.appspotmail.com
    Closes: https://lore.kernel.org/all/68135815.050a0220.3a872c.000e.GAE@google.com/
    Reported-by: syzbot+1065a199625a388fce60@syzkaller.appspotmail.com
    Closes: https://lore.kernel.org/all/681357d6.050a0220.14dd7d.000d.GAE@google.com/
    Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>
    Link: https://patch.msgid.link/20250508-mctp-addr-dump-v2-1-c8a53fd2dd66@codeconstruct.com.au
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: mctp: Ensure keys maintain only one ref to corresponding dev [+ + +]

Author: Andrew Jeffery <andrew@codeconstruct.com.au>
Date:   Thu May 8 14:16:00 2025 +0930

    net: mctp: Ensure keys maintain only one ref to corresponding dev
    
    [ Upstream commit e4f349bd6e58051df698b82f94721f18a02a293d ]
    
    mctp_flow_prepare_output() is called in mctp_route_output(), which
    places outbound packets onto a given interface. The packet may represent
    a message fragment, in which case we provoke an unbalanced reference
    count to the underlying device. This causes trouble if we ever attempt
    to remove the interface:
    
        [   48.702195] usb 1-1: USB disconnect, device number 2
        [   58.883056] unregister_netdevice: waiting for mctpusb0 to become free. Usage count = 2
        [   69.022548] unregister_netdevice: waiting for mctpusb0 to become free. Usage count = 2
        [   79.172568] unregister_netdevice: waiting for mctpusb0 to become free. Usage count = 2
        ...
    
    Predicate the invocation of mctp_dev_set_key() in
    mctp_flow_prepare_output() on not already having associated the device
    with the key. It's not yet realistic to uphold the property that the key
    maintains only one device reference earlier in the transmission sequence
    as the route (and therefore the device) may not be known at the time the
    key is associated with the socket.
    
    Fixes: 67737c457281 ("mctp: Pass flow data & flow release events to drivers")
    Acked-by: Jeremy Kerr <jk@codeconstruct.com.au>
    Signed-off-by: Andrew Jeffery <andrew@codeconstruct.com.au>
    Link: https://patch.msgid.link/20250508-mctp-dev-refcount-v1-1-d4f965c67bb5@codeconstruct.com.au
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: qede: Initialize qede_ll_ops with designated initializer [+ + +]

Author: Nathan Chancellor <nathan@kernel.org>
Date:   Wed May 7 21:47:45 2025 +0100

    net: qede: Initialize qede_ll_ops with designated initializer
    
    commit 6b3ab7f2cbfaeb6580709cd8ef4d72cfd01bfde4 upstream.
    
    After a recent change [1] in clang's randstruct implementation to
    randomize structures that only contain function pointers, there is an
    error because qede_ll_ops get randomized but does not use a designated
    initializer for the first member:
    
      drivers/net/ethernet/qlogic/qede/qede_main.c:206:2: error: a randomized struct can only be initialized with a designated initializer
        206 |         {
            |         ^
    
    Explicitly initialize the common member using a designated initializer
    to fix the build.
    
    Cc: stable@vger.kernel.org
    Fixes: 035f7f87b729 ("randstruct: Enable Clang support")
    Link: https://github.com/llvm/llvm-project/commit/04364fb888eea6db9811510607bed4b200bcb082 [1]
    Signed-off-by: Nathan Chancellor <nathan@kernel.org>
    Link: https://patch.msgid.link/20250507-qede-fix-clang-randstruct-v1-1-5ccc15626fba@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net_sched: Flush gso_skb list too during ->change() [+ + +]

Author: Cong Wang <xiyou.wangcong@gmail.com>
Date:   Tue May 6 21:35:58 2025 -0700

    net_sched: Flush gso_skb list too during ->change()
    
    [ Upstream commit 2d3cbfd6d54a2c39ce3244f33f85c595844bd7b8 ]
    
    Previously, when reducing a qdisc's limit via the ->change() operation, only
    the main skb queue was trimmed, potentially leaving packets in the gso_skb
    list. This could result in NULL pointer dereference when we only check
    sch->limit against sch->q.qlen.
    
    This patch introduces a new helper, qdisc_dequeue_internal(), which ensures
    both the gso_skb list and the main queue are properly flushed when trimming
    excess packets. All relevant qdiscs (codel, fq, fq_codel, fq_pie, hhf, pie)
    are updated to use this helper in their ->change() routines.
    
    Fixes: 76e3cc126bb2 ("codel: Controlled Delay AQM")
    Fixes: 4b549a2ef4be ("fq_codel: Fair Queue Codel AQM")
    Fixes: afe4fd062416 ("pkt_sched: fq: Fair Queue packet scheduler")
    Fixes: ec97ecf1ebe4 ("net: sched: add Flow Queue PIE packet scheduler")
    Fixes: 10239edf86f1 ("net-qdisc-hhf: Heavy-Hitter Filter (HHF) qdisc")
    Fixes: d4b36210c2e6 ("net: pkt_sched: PIE AQM scheme")
    Reported-by: Will <willsroot@protonmail.com>
    Reported-by: Savy <savy@syst3mfailure.io>
    Signed-off-by: Cong Wang <xiyou.wangcong@gmail.com>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

nfs: handle failure of nfs_get_lock_context in unlock path [+ + +]

Author: Li Lingfeng <lilingfeng3@huawei.com>
Date:   Thu Apr 17 15:25:08 2025 +0800

    nfs: handle failure of nfs_get_lock_context in unlock path
    
    [ Upstream commit c457dc1ec770a22636b473ce5d35614adfe97636 ]
    
    When memory is insufficient, the allocation of nfs_lock_context in
    nfs_get_lock_context() fails and returns -ENOMEM. If we mistakenly treat
    an nfs4_unlockdata structure (whose l_ctx member has been set to -ENOMEM)
    as valid and proceed to execute rpc_run_task(), this will trigger a NULL
    pointer dereference in nfs4_locku_prepare. For example:
    
    BUG: kernel NULL pointer dereference, address: 000000000000000c
    PGD 0 P4D 0
    Oops: Oops: 0000 [#1] SMP PTI
    CPU: 15 UID: 0 PID: 12 Comm: kworker/u64:0 Not tainted 6.15.0-rc2-dirty #60
    Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.3-2.fc40
    Workqueue: rpciod rpc_async_schedule
    RIP: 0010:nfs4_locku_prepare+0x35/0xc2
    Code: 89 f2 48 89 fd 48 c7 c7 68 69 ef b5 53 48 8b 8e 90 00 00 00 48 89 f3
    RSP: 0018:ffffbbafc006bdb8 EFLAGS: 00010246
    RAX: 000000000000004b RBX: ffff9b964fc1fa00 RCX: 0000000000000000
    RDX: 0000000000000000 RSI: fffffffffffffff4 RDI: ffff9ba53fddbf40
    RBP: ffff9ba539934000 R08: 0000000000000000 R09: ffffbbafc006bc38
    R10: ffffffffb6b689c8 R11: 0000000000000003 R12: ffff9ba539934030
    R13: 0000000000000001 R14: 0000000004248060 R15: ffffffffb56d1c30
    FS: 0000000000000000(0000) GS:ffff9ba5881f0000(0000) knlGS:00000000
    CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: 000000000000000c CR3: 000000093f244000 CR4: 00000000000006f0
    Call Trace:
     <TASK>
     __rpc_execute+0xbc/0x480
     rpc_async_schedule+0x2f/0x40
     process_one_work+0x232/0x5d0
     worker_thread+0x1da/0x3d0
     ? __pfx_worker_thread+0x10/0x10
     kthread+0x10d/0x240
     ? __pfx_kthread+0x10/0x10
     ret_from_fork+0x34/0x50
     ? __pfx_kthread+0x10/0x10
     ret_from_fork_asm+0x1a/0x30
     </TASK>
    Modules linked in:
    CR2: 000000000000000c
    ---[ end trace 0000000000000000 ]---
    
    Free the allocated nfs4_unlockdata when nfs_get_lock_context() fails and
    return NULL to terminate subsequent rpc_run_task, preventing NULL pointer
    dereference.
    
    Fixes: f30cb757f680 ("NFS: Always wait for I/O completion before unlock")
    Signed-off-by: Li Lingfeng <lilingfeng3@huawei.com>
    Reviewed-by: Jeff Layton <jlayton@kernel.org>
    Link: https://lore.kernel.org/r/20250417072508.3850532-1-lilingfeng3@huawei.com
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

NFSv4/pnfs: Reset the layout state after a layoutreturn [+ + +]

Author: Trond Myklebust <trond.myklebust@hammerspace.com>
Date:   Sat May 10 10:50:13 2025 -0400

    NFSv4/pnfs: Reset the layout state after a layoutreturn
    
    [ Upstream commit 6d6d7f91cc8c111d40416ac9240a3bb9396c5235 ]
    
    If there are still layout segments in the layout plh_return_lsegs list
    after a layout return, we should be resetting the state to ensure they
    eventually get returned as well.
    
    Fixes: 68f744797edd ("pNFS: Do not free layout segments that are marked for return")
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

nvme-pci: acquire cq_poll_lock in nvme_poll_irqdisable [+ + +]

Author: Keith Busch <kbusch@kernel.org>
Date:   Thu May 8 16:57:06 2025 +0200

    nvme-pci: acquire cq_poll_lock in nvme_poll_irqdisable
    
    [ Upstream commit 3d8932133dcecbd9bef1559533c1089601006f45 ]
    
    We need to lock this queue for that condition because the timeout work
    executes per-namespace and can poll the poll CQ.
    
    Reported-by: Hannes Reinecke <hare@kernel.org>
    Closes: https://lore.kernel.org/all/20240902130728.1999-1-hare@kernel.org/
    Fixes: a0fa9647a54e ("NVMe: add blk polling support")
    Signed-off-by: Keith Busch <kbusch@kernel.org>
    Signed-off-by: Daniel Wagner <wagi@kernel.org>
    Signed-off-by: Christoph Hellwig <hch@lst.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

nvme-pci: make nvme_pci_npages_prp() __always_inline [+ + +]

Author: Kees Cook <kees@kernel.org>
Date:   Tue May 6 20:35:40 2025 -0700

    nvme-pci: make nvme_pci_npages_prp() __always_inline
    
    [ Upstream commit 40696426b8c8c4f13cf6ac52f0470eed144be4b2 ]
    
    The only reason nvme_pci_npages_prp() could be used as a compile-time
    known result in BUILD_BUG_ON() is because the compiler was always choosing
    to inline the function. Under special circumstances (sanitizer coverage
    functions disabled for __init functions on ARCH=um), the compiler decided
    to stop inlining it:
    
       drivers/nvme/host/pci.c: In function 'nvme_init':
       include/linux/compiler_types.h:557:45: error: call to '__compiletime_assert_678' declared with attribute error: BUILD_BUG_ON failed: nvme_pci_npages_prp() > NVME_MAX_NR_ALLOCATIONS
         557 |         _compiletime_assert(condition, msg, __compiletime_assert_, __COUNTER__)
             |                                             ^
       include/linux/compiler_types.h:538:25: note: in definition of macro '__compiletime_assert'
         538 |                         prefix ## suffix();                             \
             |                         ^~~~~~
       include/linux/compiler_types.h:557:9: note: in expansion of macro '_compiletime_assert'
         557 |         _compiletime_assert(condition, msg, __compiletime_assert_, __COUNTER__)
             |         ^~~~~~~~~~~~~~~~~~~
       include/linux/build_bug.h:39:37: note: in expansion of macro 'compiletime_assert'
          39 | #define BUILD_BUG_ON_MSG(cond, msg) compiletime_assert(!(cond), msg)
             |                                     ^~~~~~~~~~~~~~~~~~
       include/linux/build_bug.h:50:9: note: in expansion of macro 'BUILD_BUG_ON_MSG'
          50 |         BUILD_BUG_ON_MSG(condition, "BUILD_BUG_ON failed: " #condition)
             |         ^~~~~~~~~~~~~~~~
       drivers/nvme/host/pci.c:3804:9: note: in expansion of macro 'BUILD_BUG_ON'
        3804 |         BUILD_BUG_ON(nvme_pci_npages_prp() > NVME_MAX_NR_ALLOCATIONS);
             |         ^~~~~~~~~~~~
    
    Force it to be __always_inline to make sure it is always available for
    use with BUILD_BUG_ON().
    
    Reported-by: kernel test robot <lkp@intel.com>
    Closes: https://lore.kernel.org/oe-kbuild-all/202505061846.12FMyRjj-lkp@intel.com/
    Fixes: c372cdd1efdf ("nvme-pci: iod npages fits in s8")
    Signed-off-by: Kees Cook <kees@kernel.org>
    Signed-off-by: Christoph Hellwig <hch@lst.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

octeontx2-af: Fix CGX Receive counters [+ + +]

Author: Hariprasad Kelam <hkelam@marvell.com>
Date:   Tue May 13 12:45:54 2025 +0530

    octeontx2-af: Fix CGX Receive counters
    
    [ Upstream commit bf449f35e77fd44017abf991fac1f9ab7705bbe0 ]
    
    Each CGX block supports 4 logical MACs (LMACS). Receive
    counters CGX_CMR_RX_STAT0-8 are per LMAC and CGX_CMR_RX_STAT9-12
    are per CGX.
    
    Due a bug in previous patch, stale Per CGX counters values observed.
    
    Fixes: 66208910e57a ("octeontx2-af: Support to retrieve CGX LMAC stats")
    Signed-off-by: Hariprasad Kelam <hkelam@marvell.com>
    Link: https://patch.msgid.link/20250513071554.728922-1-hkelam@marvell.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

octeontx2-pf: macsec: Fix incorrect max transmit size in TX secy [+ + +]

Author: Subbaraya Sundeep <sbhatta@marvell.com>
Date:   Mon May 12 18:12:36 2025 +0530

    octeontx2-pf: macsec: Fix incorrect max transmit size in TX secy
    
    [ Upstream commit 865ab2461375e3a5a2526f91f9a9f17b8931bc9e ]
    
    MASCEC hardware block has a field called maximum transmit size for
    TX secy. Max packet size going out of MCS block has be programmed
    taking into account full packet size which has L2 header,SecTag
    and ICV. MACSEC offload driver is configuring max transmit size as
    macsec interface MTU which is incorrect. Say with 1500 MTU of real
    device, macsec interface created on top of real device will have MTU of
    1468(1500 - (SecTag + ICV)). This is causing packets from macsec
    interface of size greater than or equal to 1468 are not getting
    transmitted out because driver programmed max transmit size as 1468
    instead of 1514(1500 + ETH_HDR_LEN).
    
    Fixes: c54ffc73601c ("octeontx2-pf: mcs: Introduce MACSEC hardware offloading")
    Signed-off-by: Subbaraya Sundeep <sbhatta@marvell.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/1747053756-4529-1-git-send-email-sbhatta@marvell.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

phy: Fix error handling in tegra_xusb_port_init [+ + +]

Author: Ma Ke <make24@iscas.ac.cn>
Date:   Mon Mar 3 15:27:39 2025 +0800

    phy: Fix error handling in tegra_xusb_port_init
    
    commit b2ea5f49580c0762d17d80d8083cb89bc3acf74f upstream.
    
    If device_add() fails, do not use device_unregister() for error
    handling. device_unregister() consists two functions: device_del() and
    put_device(). device_unregister() should only be called after
    device_add() succeeded because device_del() undoes what device_add()
    does if successful. Change device_unregister() to put_device() call
    before returning from the function.
    
    As comment of device_add() says, 'if device_add() succeeds, you should
    call device_del() when you want to get rid of it. If device_add() has
    not succeeded, use only put_device() to drop the reference count'.
    
    Found by code review.
    
    Cc: stable@vger.kernel.org
    Fixes: 53d2a715c240 ("phy: Add Tegra XUSB pad controller support")
    Signed-off-by: Ma Ke <make24@iscas.ac.cn>
    Acked-by: Thierry Reding <treding@nvidia.com>
    Link: https://lore.kernel.org/r/20250303072739.3874987-1-make24@iscas.ac.cn
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

phy: renesas: rcar-gen3-usb2: Fix role detection on unbind/bind [+ + +]

Author: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
Date:   Wed May 7 15:50:28 2025 +0300

    phy: renesas: rcar-gen3-usb2: Fix role detection on unbind/bind
    
    commit 54c4c58713aaff76c2422ff5750e557ab3b100d7 upstream.
    
    It has been observed on the Renesas RZ/G3S SoC that unbinding and binding
    the PHY driver leads to role autodetection failures. This issue occurs when
    PHY 3 is the first initialized PHY. PHY 3 does not have an interrupt
    associated with the USB2_INT_ENABLE register (as
    rcar_gen3_int_enable[3] = 0). As a result, rcar_gen3_init_otg() is called
    to initialize OTG without enabling PHY interrupts.
    
    To resolve this, add rcar_gen3_is_any_otg_rphy_initialized() and call it in
    role_store(), role_show(), and rcar_gen3_init_otg(). At the same time,
    rcar_gen3_init_otg() is only called when initialization for a PHY with
    interrupt bits is in progress. As a result, the
    struct rcar_gen3_phy::otg_initialized is no longer needed.
    
    Fixes: 549b6b55b005 ("phy: renesas: rcar-gen3-usb2: enable/disable independent irqs")
    Cc: stable@vger.kernel.org
    Reviewed-by: Yoshihiro Shimoda <yoshihiro.shimoda.uh@renesas.com>
    Tested-by: Yoshihiro Shimoda <yoshihiro.shimoda.uh@renesas.com>
    Reviewed-by: Lad Prabhakar <prabhakar.mahadev-lad.rj@bp.renesas.com>
    Signed-off-by: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
    Link: https://lore.kernel.org/r/20250507125032.565017-2-claudiu.beznea.uj@bp.renesas.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

phy: renesas: rcar-gen3-usb2: Set timing registers only once [+ + +]

Author: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
Date:   Wed May 7 15:50:32 2025 +0300

    phy: renesas: rcar-gen3-usb2: Set timing registers only once
    
    commit 86e70849f4b2b4597ac9f7c7931f2a363774be25 upstream.
    
    phy-rcar-gen3-usb2 driver exports 4 PHYs. The timing registers are common
    to all PHYs. There is no need to set them every time a PHY is initialized.
    Set timing register only when the 1st PHY is initialized.
    
    Fixes: f3b5a8d9b50d ("phy: rcar-gen3-usb2: Add R-Car Gen3 USB2 PHY driver")
    Cc: stable@vger.kernel.org
    Reviewed-by: Yoshihiro Shimoda <yoshihiro.shimoda.uh@renesas.com>
    Tested-by: Yoshihiro Shimoda <yoshihiro.shimoda.uh@renesas.com>
    Reviewed-by: Lad Prabhakar <prabhakar.mahadev-lad.rj@bp.renesas.com>
    Signed-off-by: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
    Link: https://lore.kernel.org/r/20250507125032.565017-6-claudiu.beznea.uj@bp.renesas.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

phy: tegra: xusb: remove a stray unlock [+ + +]

Author: Dan Carpenter <dan.carpenter@linaro.org>
Date:   Wed Apr 23 16:08:23 2025 +0300

    phy: tegra: xusb: remove a stray unlock
    
    commit 83c178470e0bf690d34c8c08440f2421b82e881c upstream.
    
    We used to take a lock in tegra186_utmi_bias_pad_power_on() but now we
    have moved the lock into the caller.  Unfortunately, when we moved the
    lock this unlock was left behind and it results in a double unlock.
    Delete it now.
    
    Fixes: b47158fb4295 ("phy: tegra: xusb: Use a bitmask for UTMI pad power state tracking")
    Signed-off-by: Dan Carpenter <dan.carpenter@linaro.org>
    Reviewed-by: Jon Hunter <jonathanh@nvidia.com>
    Link: https://lore.kernel.org/r/aAjmR6To4EnvRl4G@stanley.mountain
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

phy: tegra: xusb: Use a bitmask for UTMI pad power state tracking [+ + +]

Author: Wayne Chang <waynec@nvidia.com>
Date:   Tue Apr 8 11:09:05 2025 +0800

    phy: tegra: xusb: Use a bitmask for UTMI pad power state tracking
    
    commit b47158fb42959c417ff2662075c0d46fb783d5d1 upstream.
    
    The current implementation uses bias_pad_enable as a reference count to
    manage the shared bias pad for all UTMI PHYs. However, during system
    suspension with connected USB devices, multiple power-down requests for
    the UTMI pad result in a mismatch in the reference count, which in turn
    produces warnings such as:
    
    [  237.762967] WARNING: CPU: 10 PID: 1618 at tegra186_utmi_pad_power_down+0x160/0x170
    [  237.763103] Call trace:
    [  237.763104]  tegra186_utmi_pad_power_down+0x160/0x170
    [  237.763107]  tegra186_utmi_phy_power_off+0x10/0x30
    [  237.763110]  phy_power_off+0x48/0x100
    [  237.763113]  tegra_xusb_enter_elpg+0x204/0x500
    [  237.763119]  tegra_xusb_suspend+0x48/0x140
    [  237.763122]  platform_pm_suspend+0x2c/0xb0
    [  237.763125]  dpm_run_callback.isra.0+0x20/0xa0
    [  237.763127]  __device_suspend+0x118/0x330
    [  237.763129]  dpm_suspend+0x10c/0x1f0
    [  237.763130]  dpm_suspend_start+0x88/0xb0
    [  237.763132]  suspend_devices_and_enter+0x120/0x500
    [  237.763135]  pm_suspend+0x1ec/0x270
    
    The root cause was traced back to the dynamic power-down changes
    introduced in commit a30951d31b25 ("xhci: tegra: USB2 pad power controls"),
    where the UTMI pad was being powered down without verifying its current
    state. This unbalanced behavior led to discrepancies in the reference
    count.
    
    To rectify this issue, this patch replaces the single reference counter
    with a bitmask, renamed to utmi_pad_enabled. Each bit in the mask
    corresponds to one of the four USB2 PHYs, allowing us to track each pad's
    enablement status individually.
    
    With this change:
      - The bias pad is powered on only when the mask is clear.
      - Each UTMI pad is powered on or down based on its corresponding bit
        in the mask, preventing redundant operations.
      - The overall power state of the shared bias pad is maintained
        correctly during suspend/resume cycles.
    
    The mutex used to prevent race conditions during UTMI pad enable/disable
    operations has been moved from the tegra186_utmi_bias_pad_power_on/off
    functions to the parent functions tegra186_utmi_pad_power_on/down. This
    change ensures that there are no race conditions when updating the bitmask.
    
    Cc: stable@vger.kernel.org
    Fixes: a30951d31b25 ("xhci: tegra: USB2 pad power controls")
    Signed-off-by: Wayne Chang <waynec@nvidia.com>
    Reviewed-by: Jon Hunter <jonathanh@nvidia.com>
    Tested-by: Jon Hunter <jonathanh@nvidia.com>
    Link: https://lore.kernel.org/r/20250408030905.990474-1-waynec@nvidia.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

platform/x86/amd/pmc: Declare quirk_spurious_8042 for MECHREVO Wujie 14XA (GX4HRXL) [+ + +]

Author: Runhua He <hua@aosc.io>
Date:   Wed May 7 18:01:03 2025 +0800

    platform/x86/amd/pmc: Declare quirk_spurious_8042 for MECHREVO Wujie 14XA (GX4HRXL)
    
    [ Upstream commit 0887817e4953885fbd6a5c1bec2fdd339261eb19 ]
    
    MECHREVO Wujie 14XA (GX4HRXL) wakes up immediately after s2idle entry.
    This happens regardless of whether the laptop is plugged into AC power,
    or whether any peripheral is plugged into the laptop.
    
    Similar to commit a55bdad5dfd1 ("platform/x86/amd/pmc: Disable keyboard
    wakeup on AMD Framework 13"), the MECHREVO Wujie 14XA wakes up almost
    instantly after s2idle suspend entry (IRQ1 is the keyboard):
    
    2025-04-18 17:23:57,588 DEBUG:  PM: Triggering wakeup from IRQ 9
    2025-04-18 17:23:57,588 DEBUG:  PM: Triggering wakeup from IRQ 1
    
    Add this model to the spurious_8042 quirk to workaround this.
    
    This patch does not affect the wake-up function of the built-in keyboard.
    Because the firmware of this machine adds an insurance for keyboard
    wake-up events, as it always triggers an additional IRQ 9 to wake up the
    system.
    
    Suggested-by: Mingcong Bai <jeffbai@aosc.io>
    Suggested-by: Xinhui Yang <cyan@cyano.uk>
    Suggested-by: Rong Zhang <i@rong.moe>
    Fixes: a55bdad5dfd1 ("platform/x86/amd/pmc: Disable keyboard wakeup on AMD Framework 13")
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4166
    Cc: Mario Limonciello <mario.limonciello@amd.com>
    Link: https://zhuanldan.zhihu.com/p/730538041
    Tested-by: Yemu Lu <prcups@krgm.moe>
    Signed-off-by: Runhua He <hua@aosc.io>
    Link: https://lore.kernel.org/r/20250507100103.995395-1-hua@aosc.io
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: asus-wmi: Fix wlan_ctrl_by_user detection [+ + +]

Author: Hans de Goede <hdegoede@redhat.com>
Date:   Thu May 1 15:17:02 2025 +0200

    platform/x86: asus-wmi: Fix wlan_ctrl_by_user detection
    
    [ Upstream commit bfcfe6d335a967f8ea0c1980960e6f0205b5de6e ]
    
    The wlan_ctrl_by_user detection was introduced by commit a50bd128f28c
    ("asus-wmi: record wlan status while controlled by userapp").
    
    Quoting from that commit's commit message:
    
    """
    When you call WMIMethod(DSTS, 0x00010011) to get WLAN status, it may return
    
    (1) 0x00050001 (On)
    (2) 0x00050000 (Off)
    (3) 0x00030001 (On)
    (4) 0x00030000 (Off)
    (5) 0x00000002 (Unknown)
    
    (1), (2) means that the model has hardware GPIO for WLAN, you can call
    WMIMethod(DEVS, 0x00010011, 1 or 0) to turn WLAN on/off.
    (3), (4) means that the model doesn’t have hardware GPIO, you need to use
    API or driver library to turn WLAN on/off, and call
    WMIMethod(DEVS, 0x00010012, 1 or 0) to set WLAN LED status.
    After you set WLAN LED status, you can see the WLAN status is changed with
    WMIMethod(DSTS, 0x00010011). Because the status is recorded lastly
    (ex: Windows), you can use it for synchronization.
    (5) means that the model doesn’t have WLAN device.
    
    WLAN is the ONLY special case with upper rule.
    """
    
    The wlan_ctrl_by_user flag should be set on 0x0003000? ((3), (4) above)
    return values, but the flag mistakenly also gets set on laptops with
    0x0005000? ((1), (2)) return values. This is causing rfkill problems on
    laptops where 0x0005000? is returned.
    
    Fix the check to only set the wlan_ctrl_by_user flag for 0x0003000?
    return values.
    
    Fixes: a50bd128f28c ("asus-wmi: record wlan status while controlled by userapp")
    Link: https://bugzilla.kernel.org/show_bug.cgi?id=219786
    Signed-off-by: Hans de Goede <hdegoede@redhat.com>
    Reviewed-by: Armin Wolf <W_Armin@gmx.de>
    Link: https://lore.kernel.org/r/20250501131702.103360-2-hdegoede@redhat.com
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

qlcnic: fix memory leak in qlcnic_sriov_channel_cfg_cmd() [+ + +]

Author: Abdun Nihaal <abdun.nihaal@gmail.com>
Date:   Mon May 12 10:18:27 2025 +0530

    qlcnic: fix memory leak in qlcnic_sriov_channel_cfg_cmd()
    
    [ Upstream commit 9d8a99c5a7c7f4f7eca2c168a4ec254409670035 ]
    
    In one of the error paths in qlcnic_sriov_channel_cfg_cmd(), the memory
    allocated in qlcnic_sriov_alloc_bc_mbx_args() for mailbox arguments is
    not freed. Fix that by jumping to the error path that frees them, by
    calling qlcnic_free_mbx_args(). This was found using static analysis.
    
    Fixes: f197a7aa6288 ("qlcnic: VF-PF communication channel implementation")
    Signed-off-by: Abdun Nihaal <abdun.nihaal@gmail.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250512044829.36400-1-abdun.nihaal@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

RDMA/rxe: Fix slab-use-after-free Read in rxe_queue_cleanup bug [+ + +]

Author: Zhu Yanjun <yanjun.zhu@linux.dev>
Date:   Sat Apr 12 09:57:14 2025 +0200

    RDMA/rxe: Fix slab-use-after-free Read in rxe_queue_cleanup bug
    
    [ Upstream commit f81b33582f9339d2dc17c69b92040d3650bb4bae ]
    
    Call Trace:
     <TASK>
     __dump_stack lib/dump_stack.c:94 [inline]
     dump_stack_lvl+0x7d/0xa0 lib/dump_stack.c:120
     print_address_description mm/kasan/report.c:378 [inline]
     print_report+0xcf/0x610 mm/kasan/report.c:489
     kasan_report+0xb5/0xe0 mm/kasan/report.c:602
     rxe_queue_cleanup+0xd0/0xe0 drivers/infiniband/sw/rxe/rxe_queue.c:195
     rxe_cq_cleanup+0x3f/0x50 drivers/infiniband/sw/rxe/rxe_cq.c:132
     __rxe_cleanup+0x168/0x300 drivers/infiniband/sw/rxe/rxe_pool.c:232
     rxe_create_cq+0x22e/0x3a0 drivers/infiniband/sw/rxe/rxe_verbs.c:1109
     create_cq+0x658/0xb90 drivers/infiniband/core/uverbs_cmd.c:1052
     ib_uverbs_create_cq+0xc7/0x120 drivers/infiniband/core/uverbs_cmd.c:1095
     ib_uverbs_write+0x969/0xc90 drivers/infiniband/core/uverbs_main.c:679
     vfs_write fs/read_write.c:677 [inline]
     vfs_write+0x26a/0xcc0 fs/read_write.c:659
     ksys_write+0x1b8/0x200 fs/read_write.c:731
     do_syscall_x64 arch/x86/entry/common.c:52 [inline]
     do_syscall_64+0xaa/0x1b0 arch/x86/entry/common.c:83
     entry_SYSCALL_64_after_hwframe+0x77/0x7f
    
    In the function rxe_create_cq, when rxe_cq_from_init fails, the function
    rxe_cleanup will be called to handle the allocated resources. In fact,
    some memory resources have already been freed in the function
    rxe_cq_from_init. Thus, this problem will occur.
    
    The solution is to let rxe_cleanup do all the work.
    
    Fixes: 8700e3e7c485 ("Soft RoCE driver")
    Link: https://paste.ubuntu.com/p/tJgC42wDf6/
    Tested-by: liuyi <liuy22@mails.tsinghua.edu.cn>
    Signed-off-by: Zhu Yanjun <yanjun.zhu@linux.dev>
    Link: https://patch.msgid.link/20250412075714.3257358-1-yanjun.zhu@linux.dev
    Reviewed-by: Daisuke Matsuda <matsuda-daisuke@fujitsu.com>
    Signed-off-by: Leon Romanovsky <leon@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

regulator: max20086: fix invalid memory access [+ + +]

Author: Cosmin Tanislav <demonsingur@gmail.com>
Date:   Thu May 8 09:49:43 2025 +0300

    regulator: max20086: fix invalid memory access
    
    [ Upstream commit 6b0cd72757c69bc2d45da42b41023e288d02e772 ]
    
    max20086_parse_regulators_dt() calls of_regulator_match() using an
    array of struct of_regulator_match allocated on the stack for the
    matches argument.
    
    of_regulator_match() calls devm_of_regulator_put_matches(), which calls
    devres_alloc() to allocate a struct devm_of_regulator_matches which will
    be de-allocated using devm_of_regulator_put_matches().
    
    struct devm_of_regulator_matches is populated with the stack allocated
    matches array.
    
    If the device fails to probe, devm_of_regulator_put_matches() will be
    called and will try to call of_node_put() on that stack pointer,
    generating the following dmesg entries:
    
    max20086 6-0028: Failed to read DEVICE_ID reg: -121
    kobject: '\xc0$\xa5\x03' (000000002cebcb7a): is not initialized, yet
    kobject_put() is being called.
    
    Followed by a stack trace matching the call flow described above.
    
    Switch to allocating the matches array using devm_kcalloc() to
    avoid accessing the stack pointer long after it's out of scope.
    
    This also has the advantage of allowing multiple max20086 to probe
    without overriding the data stored inside the global of_regulator_match.
    
    Fixes: bfff546aae50 ("regulator: Add MAX20086-MAX20089 driver")
    Signed-off-by: Cosmin Tanislav <demonsingur@gmail.com>
    Link: https://patch.msgid.link/20250508064947.2567255-1-demonsingur@gmail.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Revert "drm/amd: Stop evicting resources on APUs in suspend" [+ + +]

Author: Alex Deucher <alexander.deucher@amd.com>
Date:   Thu May 1 13:00:16 2025 -0400

    Revert "drm/amd: Stop evicting resources on APUs in suspend"
    
    [ Upstream commit d0ce1aaa8531a4a4707711cab5721374751c51b0 ]
    
    This reverts commit 3a9626c816db901def438dc2513622e281186d39.
    
    This breaks S4 because we end up setting the s3/s0ix flags
    even when we are entering s4 since prepare is used by both
    flows.  The causes both the S3/s0ix and s4 flags to be set
    which breaks several checks in the driver which assume they
    are mutually exclusive.
    
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/3634
    Cc: Mario Limonciello <mario.limonciello@amd.com>
    Reviewed-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit ce8f7d95899c2869b47ea6ce0b3e5bf304b2fff4)
    Cc: stable@vger.kernel.org
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: sd_zbc: block: Respect bio vector limits for REPORT ZONES buffer [+ + +]

Author: Steve Siwinski <ssiwinski@atto.com>
Date:   Thu May 8 16:01:22 2025 -0400

    scsi: sd_zbc: block: Respect bio vector limits for REPORT ZONES buffer
    
    commit e8007fad5457ea547ca63bb011fdb03213571c7e upstream.
    
    The REPORT ZONES buffer size is currently limited by the HBA's maximum
    segment count to ensure the buffer can be mapped. However, the block
    layer further limits the number of iovec entries to 1024 when allocating
    a bio.
    
    To avoid allocation of buffers too large to be mapped, further restrict
    the maximum buffer size to BIO_MAX_INLINE_VECS.
    
    Replace the UIO_MAXIOV symbolic name with the more contextually
    appropriate BIO_MAX_INLINE_VECS.
    
    Fixes: b091ac616846 ("sd_zbc: Fix report zones buffer allocation")
    Cc: stable@vger.kernel.org
    Signed-off-by: Steve Siwinski <ssiwinski@atto.com>
    Link: https://lore.kernel.org/r/20250508200122.243129-1-ssiwinski@atto.com
    Reviewed-by: Damien Le Moal <dlemoal@kernel.org>
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

sctp: add mutual exclusion in proc_sctp_do_udp_port() [+ + +]

Author: Eric Dumazet <edumazet@google.com>
Date:   Mon Mar 31 09:15:32 2025 +0000

    sctp: add mutual exclusion in proc_sctp_do_udp_port()
    
    commit 10206302af856791fbcc27a33ed3c3eb09b2793d upstream.
    
    We must serialize calls to sctp_udp_sock_stop() and sctp_udp_sock_start()
    or risk a crash as syzbot reported:
    
    Oops: general protection fault, probably for non-canonical address 0xdffffc000000000d: 0000 [#1] SMP KASAN PTI
    KASAN: null-ptr-deref in range [0x0000000000000068-0x000000000000006f]
    CPU: 1 UID: 0 PID: 6551 Comm: syz.1.44 Not tainted 6.14.0-syzkaller-g7f2ff7b62617 #0 PREEMPT(full)
    Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 02/12/2025
     RIP: 0010:kernel_sock_shutdown+0x47/0x70 net/socket.c:3653
    Call Trace:
     <TASK>
      udp_tunnel_sock_release+0x68/0x80 net/ipv4/udp_tunnel_core.c:181
      sctp_udp_sock_stop+0x71/0x160 net/sctp/protocol.c:930
      proc_sctp_do_udp_port+0x264/0x450 net/sctp/sysctl.c:553
      proc_sys_call_handler+0x3d0/0x5b0 fs/proc/proc_sysctl.c:601
      iter_file_splice_write+0x91c/0x1150 fs/splice.c:738
      do_splice_from fs/splice.c:935 [inline]
      direct_splice_actor+0x18f/0x6c0 fs/splice.c:1158
      splice_direct_to_actor+0x342/0xa30 fs/splice.c:1102
      do_splice_direct_actor fs/splice.c:1201 [inline]
      do_splice_direct+0x174/0x240 fs/splice.c:1227
      do_sendfile+0xafd/0xe50 fs/read_write.c:1368
      __do_sys_sendfile64 fs/read_write.c:1429 [inline]
      __se_sys_sendfile64 fs/read_write.c:1415 [inline]
      __x64_sys_sendfile64+0x1d8/0x220 fs/read_write.c:1415
      do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
    
    Fixes: 046c052b475e ("sctp: enable udp tunneling socks")
    Reported-by: syzbot+fae49d997eb56fa7c74d@syzkaller.appspotmail.com
    Closes: https://lore.kernel.org/netdev/67ea5c01.050a0220.1547ec.012b.GAE@google.com/T/#u
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Cc: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com>
    Acked-by: Xin Long <lucien.xin@gmail.com>
    Link: https://patch.msgid.link/20250331091532.224982-1-edumazet@google.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    [Minor conflict resolved due to code context change.]
    Signed-off-by: Jianqi Ren <jianqi.ren.cn@windriver.com>
    Signed-off-by: He Zhe <zhe.he@windriver.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

selftests/exec: Build both static and non-static load_address tests [+ + +]

Author: Kees Cook <kees@kernel.org>
Date:   Wed May 8 10:31:46 2024 -0700

    selftests/exec: Build both static and non-static load_address tests
    
    [ Upstream commit b57a2907c9d96c56494ef25f8ec821cd0b355dd6 ]
    
    After commit 4d1cd3b2c5c1 ("tools/testing/selftests/exec: fix link
    error"), the load address alignment tests tried to build statically.
    This was silently ignored in some cases. However, after attempting to
    further fix the build by switching to "-static-pie", the test started
    failing. This appears to be due to non-PT_INTERP ET_DYN execs ("static
    PIE") not doing alignment correctly, which remains unfixed[1]. See commit
    aeb7923733d1 ("revert "fs/binfmt_elf: use PT_LOAD p_align values for
    static PIE"") for more details.
    
    Provide rules to build both static and non-static PIE binaries, improve
    debug reporting, and perform several test steps instead of a single
    all-or-nothing test. However, do not actually enable static-pie tests;
    alignment specification is only supported for ET_DYN with PT_INTERP
    ("regular PIE").
    
    Link: https://bugzilla.kernel.org/show_bug.cgi?id=215275 [1]
    Link: https://lore.kernel.org/r/20240508173149.677910-1-keescook@chromium.org
    Signed-off-by: Kees Cook <kees@kernel.org>
    Stable-dep-of: 11854fe263eb ("binfmt_elf: Move brk for static PIE even if ASLR disabled")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

selftests/exec: load_address: conform test to TAP format output [+ + +]

Author: Muhammad Usama Anjum <usama.anjum@collabora.com>
Date:   Mon Mar 4 20:59:24 2024 +0500

    selftests/exec: load_address: conform test to TAP format output
    
    [ Upstream commit c4095067736b7ed50316a2bc7c9577941e87ad45 ]
    
    Conform the layout, informational and status messages to TAP. No
    functional change is intended other than the layout of output messages.
    
    Signed-off-by: Muhammad Usama Anjum <usama.anjum@collabora.com>
    Link: https://lore.kernel.org/r/20240304155928.1818928-2-usama.anjum@collabora.com
    Signed-off-by: Kees Cook <keescook@chromium.org>
    Stable-dep-of: 11854fe263eb ("binfmt_elf: Move brk for static PIE even if ASLR disabled")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

selftests/mm: compaction_test: support platform with huge mount of memory [+ + +]

Author: Feng Tang <feng.tang@linux.alibaba.com>
Date:   Wed Apr 23 18:36:45 2025 +0800

    selftests/mm: compaction_test: support platform with huge mount of memory
    
    commit ab00ddd802f80e31fc9639c652d736fe3913feae upstream.
    
    When running mm selftest to verify mm patches, 'compaction_test' case
    failed on an x86 server with 1TB memory.  And the root cause is that it
    has too much free memory than what the test supports.
    
    The test case tries to allocate 100000 huge pages, which is about 200 GB
    for that x86 server, and when it succeeds, it expects it's large than 1/3
    of 80% of the free memory in system.  This logic only works for platform
    with 750 GB ( 200 / (1/3) / 80% ) or less free memory, and may raise false
    alarm for others.
    
    Fix it by changing the fixed page number to self-adjustable number
    according to the real number of free memory.
    
    Link: https://lkml.kernel.org/r/20250423103645.2758-1-feng.tang@linux.alibaba.com
    Fixes: bd67d5c15cc1 ("Test compaction of mlocked memory")
    Signed-off-by: Feng Tang <feng.tang@linux.alibaba.com>
    Acked-by: Dev Jain <dev.jain@arm.com>
    Reviewed-by: Baolin Wang <baolin.wang@linux.alibaba.com>
    Tested-by: Baolin Wang <baolin.wang@inux.alibaba.com>
    Cc: Shuah Khan <shuah@kernel.org>
    Cc: Sri Jayaramappa <sjayaram@akamai.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

smb: client: fix memory leak during error handling for POSIX mkdir [+ + +]

Author: Jethro Donaldson <devel@jro.nz>
Date:   Thu May 15 01:23:23 2025 +1200

    smb: client: fix memory leak during error handling for POSIX mkdir
    
    commit 1fe4a44b7fa3955bcb7b4067c07b778fe90d8ee7 upstream.
    
    The response buffer for the CREATE request handled by smb311_posix_mkdir()
    is leaked on the error path (goto err_free_rsp_buf) because the structure
    pointer *rsp passed to free_rsp_buf() is not assigned until *after* the
    error condition is checked.
    
    As *rsp is initialised to NULL, free_rsp_buf() becomes a no-op and the leak
    is instead reported by __kmem_cache_shutdown() upon subsequent rmmod of
    cifs.ko if (and only if) the error path has been hit.
    
    Pass rsp_iov.iov_base to free_rsp_buf() instead, similar to the code in
    other functions in smb2pdu.c for which *rsp is assigned late.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Jethro Donaldson <devel@jro.nz>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

spi: loopback-test: Do not split 1024-byte hexdumps [+ + +]

Author: Geert Uytterhoeven <geert+renesas@glider.be>
Date:   Fri May 2 13:10:35 2025 +0200

    spi: loopback-test: Do not split 1024-byte hexdumps
    
    [ Upstream commit a73fa3690a1f3014d6677e368dce4e70767a6ba2 ]
    
    spi_test_print_hex_dump() prints buffers holding less than 1024 bytes in
    full.  Larger buffers are truncated: only the first 512 and the last 512
    bytes are printed, separated by a truncation message.  The latter is
    confusing in case the buffer holds exactly 1024 bytes, as all data is
    printed anyway.
    
    Fix this by printing buffers holding up to and including 1024 bytes in
    full.
    
    Fixes: 84e0c4e5e2c4ef42 ("spi: add loopback test driver to allow for spi_master regression tests")
    Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://patch.msgid.link/37ee1bc90c6554c9347040adabf04188c8f704aa.1746184171.git.geert+renesas@glider.be
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

spi: tegra114: Use value to check for invalid delays [+ + +]

Author: Aaron Kling <webgeek1234@gmail.com>
Date:   Tue May 6 13:36:59 2025 -0500

    spi: tegra114: Use value to check for invalid delays
    
    commit e979a7c79fbc706f6dac913af379ef4caa04d3d5 upstream.
    
    A delay unit of 0 is a valid entry, thus it is not valid to check for
    unused delays. Instead, check the value field; if that is zero, the
    given delay is unset.
    
    Fixes: 4426e6b4ecf6 ("spi: tegra114: Don't fail set_cs_timing when delays are zero")
    Cc: stable@vger.kernel.org
    Signed-off-by: Aaron Kling <webgeek1234@gmail.com>
    Reviewed-by: Jon Hunter <jonathanh@nvidia.com>
    Link: https://patch.msgid.link/20250506-spi-tegra114-fixup-v1-1-136dc2f732f3@gmail.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

tools/net/ynl: ethtool: fix crash when Hardware Clock info is missing [+ + +]

Author: Hangbin Liu <liuhangbin@gmail.com>
Date:   Thu May 8 03:54:14 2025 +0000

    tools/net/ynl: ethtool: fix crash when Hardware Clock info is missing
    
    [ Upstream commit 45375814eb3f4245956c0c85092a4eee4441d167 ]
    
    Fix a crash in the ethtool YNL implementation when Hardware Clock information
    is not present in the response. This ensures graceful handling of devices or
    drivers that do not provide this optional field. e.g.
    
      Traceback (most recent call last):
        File "/net/tools/net/ynl/pyynl/./ethtool.py", line 438, in <module>
          main()
          ~~~~^^
        File "/net/tools/net/ynl/pyynl/./ethtool.py", line 341, in main
          print(f'PTP Hardware Clock: {tsinfo["phc-index"]}')
                                       ~~~~~~^^^^^^^^^^^^^
      KeyError: 'phc-index'
    
    Fixes: f3d07b02b2b8 ("tools: ynl: ethtool testing tool")
    Signed-off-by: Hangbin Liu <liuhangbin@gmail.com>
    Acked-by: Stanislav Fomichev <sdf@fomichev.me>
    Link: https://patch.msgid.link/20250508035414.82974-1-liuhangbin@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tools: ynl: ethtool.py: Output timestamping statistics from tsinfo-get operation [+ + +]

Author: Rahul Rameshbabu <rrameshbabu@nvidia.com>
Date:   Wed Apr 3 14:28:45 2024 -0700

    tools: ynl: ethtool.py: Output timestamping statistics from tsinfo-get operation
    
    [ Upstream commit 2e0e148c727061009d3db5f436f51890bbb49a80 ]
    
    Print the nested stats attribute containing timestamping statistics when
    the --show-time-stamping flag is used.
    
      [root@binary-eater-vm-01 linux-ethtool-ts]# ./tools/net/ynl/ethtool.py --show-time-stamping mlx5_1
      Time stamping parameters for mlx5_1:
      Capabilities:
        hardware-transmit
        hardware-receive
        hardware-raw-clock
      PTP Hardware Clock: 0
      Hardware Transmit Timestamp Modes:
        off
        on
      Hardware Receive Filter Modes:
        none
        all
      Statistics:
        tx-pkts: 8
        tx-lost: 0
        tx-err: 0
    
    Signed-off-by: Rahul Rameshbabu <rrameshbabu@nvidia.com>
    Reviewed-by: Dragos Tatulea <dtatulea@nvidia.com>
    Link: https://lore.kernel.org/r/20240403212931.128541-8-rrameshbabu@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Stable-dep-of: 45375814eb3f ("tools/net/ynl: ethtool: fix crash when Hardware Clock info is missing")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tpm: tis: Double the timeout B to 4s [+ + +]

Author: Michal Suchanek <msuchanek@suse.de>
Date:   Fri Apr 4 10:23:14 2025 +0200

    tpm: tis: Double the timeout B to 4s
    
    [ Upstream commit 2f661f71fda1fc0c42b7746ca5b7da529eb6b5be ]
    
    With some Infineon chips the timeouts in tpm_tis_send_data (both B and
    C) can reach up to about 2250 ms.
    
    Timeout C is retried since
    commit de9e33df7762 ("tpm, tpm_tis: Workaround failed command reception on Infineon devices")
    
    Timeout B still needs to be extended.
    
    The problem is most commonly encountered with context related operation
    such as load context/save context. These are issued directly by the
    kernel, and there is no retry logic for them.
    
    When a filesystem is set up to use the TPM for unlocking the boot fails,
    and restarting the userspace service is ineffective. This is likely
    because ignoring a load context/save context result puts the real TPM
    state and the TPM state expected by the kernel out of sync.
    
    Chips known to be affected:
    tpm_tis IFX1522:00: 2.0 TPM (device-id 0x1D, rev-id 54)
    Description: SLB9672
    Firmware Revision: 15.22
    
    tpm_tis MSFT0101:00: 2.0 TPM (device-id 0x1B, rev-id 22)
    Firmware Revision: 7.83
    
    tpm_tis MSFT0101:00: 2.0 TPM (device-id 0x1A, rev-id 16)
    Firmware Revision: 5.63
    
    Link: https://lore.kernel.org/linux-integrity/Z5pI07m0Muapyu9w@kitsune.suse.cz/
    Signed-off-by: Michal Suchanek <msuchanek@suse.de>
    Reviewed-by: Jarkko Sakkinen <jarkko@kernel.org>
    Signed-off-by: Jarkko Sakkinen <jarkko@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tracing: probes: Fix a possible race in trace_probe_log APIs [+ + +]

Author: Masami Hiramatsu (Google) <mhiramat@kernel.org>
Date:   Sat May 10 12:44:41 2025 +0900

    tracing: probes: Fix a possible race in trace_probe_log APIs
    
    [ Upstream commit fd837de3c9cb1a162c69bc1fb1f438467fe7f2f5 ]
    
    Since the shared trace_probe_log variable can be accessed and
    modified via probe event create operation of kprobe_events,
    uprobe_events, and dynamic_events, it should be protected.
    In the dynamic_events, all operations are serialized by
    `dyn_event_ops_mutex`. But kprobe_events and uprobe_events
    interfaces are not serialized.
    
    To solve this issue, introduces dyn_event_create(), which runs
    create() operation under the mutex, for kprobe_events and
    uprobe_events. This also uses lockdep to check the mutex is
    held when using trace_probe_log* APIs.
    
    Link: https://lore.kernel.org/all/174684868120.551552.3068655787654268804.stgit@devnote2/
    
    Reported-by: Paul Cacheux <paulcacheux@gmail.com>
    Closes: https://lore.kernel.org/all/20250510074456.805a16872b591e2971a4d221@kernel.org/
    Fixes: ab105a4fb894 ("tracing: Use tracing error_log with probe events")
    Signed-off-by: Masami Hiramatsu (Google) <mhiramat@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tracing: samples: Initialize trace_array_printk() with the correct function [+ + +]

Author: Steven Rostedt <rostedt@goodmis.org>
Date:   Fri May 9 15:26:57 2025 -0400

    tracing: samples: Initialize trace_array_printk() with the correct function
    
    commit 1b0c192c92ea1fe2dcb178f84adf15fe37c3e7c8 upstream.
    
    When using trace_array_printk() on a created instance, the correct
    function to use to initialize it is:
    
      trace_array_init_printk()
    
    Not
    
      trace_printk_init_buffer()
    
    The former is a proper function to use, the latter is for initializing
    trace_printk() and causes the NOTICE banner to be displayed.
    
    Cc: stable@vger.kernel.org
    Cc: Masami Hiramatsu <mhiramat@kernel.org>
    Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
    Cc: Divya Indi <divya.indi@oracle.com>
    Link: https://lore.kernel.org/20250509152657.0f6744d9@gandalf.local.home
    Fixes: 89ed42495ef4a ("tracing: Sample module to demonstrate kernel access to Ftrace instances.")
    Fixes: 38ce2a9e33db6 ("tracing: Add trace_array_init_printk() to initialize instance trace_printk() buffers")
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

tsnep: fix timestamping with a stacked DSA driver [+ + +]

Author: Gerhard Engleder <gerhard@engleder-embedded.com>
Date:   Wed May 14 21:56:57 2025 +0200

    tsnep: fix timestamping with a stacked DSA driver
    
    [ Upstream commit b3ca9eef6646576ad506a96d941d87a69f66732a ]
    
    This driver is susceptible to a form of the bug explained in commit
    c26a2c2ddc01 ("gianfar: Fix TX timestamping with a stacked DSA driver")
    and in Documentation/networking/timestamping.rst section "Other caveats
    for MAC drivers", specifically it timestamps any skb which has
    SKBTX_HW_TSTAMP, and does not consider if timestamping has been enabled
    in adapter->hwtstamp_config.tx_type.
    
    Evaluate the proper TX timestamping condition only once on the TX
    path (in tsnep_xmit_frame_ring()) and store the result in an additional
    TX entry flag. Evaluate the new TX entry flag in the TX confirmation path
    (in tsnep_tx_poll()).
    
    This way SKBTX_IN_PROGRESS is set by the driver as required, but never
    evaluated. SKBTX_IN_PROGRESS shall not be evaluated as it can be set
    by a stacked DSA driver and evaluating it would lead to unwanted
    timestamps.
    
    Fixes: 403f69bbdbad ("tsnep: Add TSN endpoint Ethernet MAC driver")
    Suggested-by: Vladimir Oltean <vladimir.oltean@nxp.com>
    Signed-off-by: Gerhard Engleder <gerhard@engleder-embedded.com>
    Reviewed-by: Vladimir Oltean <vladimir.oltean@nxp.com>
    Link: https://patch.msgid.link/20250514195657.25874-1-gerhard@engleder-embedded.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tsnep: Inline small fragments within TX descriptor [+ + +]

Author: Gerhard Engleder <gerhard@engleder-embedded.com>
Date:   Wed Oct 11 20:21:54 2023 +0200

    tsnep: Inline small fragments within TX descriptor
    
    [ Upstream commit dccce1d7c04051bc25d3abbe7716d0ae7af9c28a ]
    
    The tsnep network controller is able to extend the descriptor directly
    with data to be transmitted. In this case no TX data DMA address is
    necessary. Instead of the TX data DMA address the TX data buffer is
    placed at the end of the descriptor.
    
    The descriptor is read with a 64 bytes DMA read by the tsnep network
    controller. If the sum of descriptor data and TX data is less than or
    equal to 64 bytes, then no additional DMA read is necessary to read the
    TX data. Therefore, it makes sense to inline small fragments up to this
    limit within the descriptor ring.
    
    Inlined fragments need to be copied to the descriptor ring. On the other
    hand DMA mapping is not necessary. At most 40 bytes are copied, so
    copying should be faster than DMA mapping.
    
    For A53 1.2 GHz copying takes <100ns and DMA mapping takes >200ns. So
    inlining small fragments should result in lower CPU load. Performance
    improvement is small. Thus, comparision of CPU load with and without
    inlining of small fragments did not show any significant difference.
    With this optimization less DMA reads will be done, which decreases the
    load of the interconnect.
    
    Signed-off-by: Gerhard Engleder <gerhard@engleder-embedded.com>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Stable-dep-of: b3ca9eef6646 ("tsnep: fix timestamping with a stacked DSA driver")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

udf: Make sure i_lenExtents is uptodate on inode eviction [+ + +]

Author: Jan Kara <jack@suse.cz>
Date:   Wed May 7 11:49:41 2025 +0200

    udf: Make sure i_lenExtents is uptodate on inode eviction
    
    commit 55dd5b4db3bf04cf077a8d1712f6295d4517c337 upstream.
    
    UDF maintains total length of all extents in i_lenExtents. Generally we
    keep extent lengths (and thus i_lenExtents) block aligned because it
    makes the file appending logic simpler. However the standard mandates
    that the inode size must match the length of all extents and thus we
    trim the last extent when closing the file. To catch possible bugs we
    also verify that i_lenExtents matches i_size when evicting inode from
    memory. Commit b405c1e58b73 ("udf: refactor udf_next_aext() to handle
    error") however broke the code updating i_lenExtents and thus
    udf_evict_inode() ended up spewing lots of errors about incorrectly
    sized extents although the extents were actually sized properly. Fix the
    updating of i_lenExtents to silence the errors.
    
    Fixes: b405c1e58b73 ("udf: refactor udf_next_aext() to handle error")
    CC: stable@vger.kernel.org
    Signed-off-by: Jan Kara <jack@suse.cz>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: typec: ucsi: displayport: Fix deadlock [+ + +]

Author: Andrei Kuchynski <akuchynski@chromium.org>
Date:   Thu Apr 24 08:44:28 2025 +0000

    usb: typec: ucsi: displayport: Fix deadlock
    
    commit 364618c89d4c57c85e5fc51a2446cd939bf57802 upstream.
    
    This patch introduces the ucsi_con_mutex_lock / ucsi_con_mutex_unlock
    functions to the UCSI driver. ucsi_con_mutex_lock ensures the connector
    mutex is only locked if a connection is established and the partner pointer
    is valid. This resolves a deadlock scenario where
    ucsi_displayport_remove_partner holds con->mutex waiting for
    dp_altmode_work to complete while dp_altmode_work attempts to acquire it.
    
    Cc: stable <stable@kernel.org>
    Fixes: af8622f6a585 ("usb: typec: ucsi: Support for DisplayPort alt mode")
    Signed-off-by: Andrei Kuchynski <akuchynski@chromium.org>
    Reviewed-by: Heikki Krogerus <heikki.krogerus@linux.intel.com>
    Link: https://lore.kernel.org/r/20250424084429.3220757-2-akuchynski@chromium.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

wifi: mac80211: Set n_channels after allocating struct cfg80211_scan_request [+ + +]

Author: Kees Cook <kees@kernel.org>
Date:   Fri May 9 11:46:45 2025 -0700

    wifi: mac80211: Set n_channels after allocating struct cfg80211_scan_request
    
    [ Upstream commit 82bbe02b2500ef0a62053fe2eb84773fe31c5a0a ]
    
    Make sure that n_channels is set after allocating the
    struct cfg80211_registered_device::int_scan_req member. Seen with
    syzkaller:
    
    UBSAN: array-index-out-of-bounds in net/mac80211/scan.c:1208:5
    index 0 is out of range for type 'struct ieee80211_channel *[] __counted_by(n_channels)' (aka 'struct ieee80211_channel *[]')
    
    This was missed in the initial conversions because I failed to locate
    the allocation likely due to the "sizeof(void *)" not matching the
    "channels" array type.
    
    Reported-by: syzbot+4bcdddd48bb6f0be0da1@syzkaller.appspotmail.com
    Closes: https://lore.kernel.org/lkml/680fd171.050a0220.2b69d1.045e.GAE@google.com/
    Fixes: e3eac9f32ec0 ("wifi: cfg80211: Annotate struct cfg80211_scan_request with __counted_by")
    Signed-off-by: Kees Cook <kees@kernel.org>
    Reviewed-by: Gustavo A. R. Silva <gustavoars@kernel.org>
    Link: https://patch.msgid.link/20250509184641.work.542-kees@kernel.org
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mt76: disable napi on driver removal [+ + +]

Author: Fedor Pchelkin <pchelkin@ispras.ru>
Date:   Tue May 6 14:55:39 2025 +0300

    wifi: mt76: disable napi on driver removal
    
    commit 78ab4be549533432d97ea8989d2f00b508fa68d8 upstream.
    
    A warning on driver removal started occurring after commit 9dd05df8403b
    ("net: warn if NAPI instance wasn't shut down"). Disable tx napi before
    deleting it in mt76_dma_cleanup().
    
     WARNING: CPU: 4 PID: 18828 at net/core/dev.c:7288 __netif_napi_del_locked+0xf0/0x100
     CPU: 4 UID: 0 PID: 18828 Comm: modprobe Not tainted 6.15.0-rc4 #4 PREEMPT(lazy)
     Hardware name: ASUS System Product Name/PRIME X670E-PRO WIFI, BIOS 3035 09/05/2024
     RIP: 0010:__netif_napi_del_locked+0xf0/0x100
     Call Trace:
     <TASK>
     mt76_dma_cleanup+0x54/0x2f0 [mt76]
     mt7921_pci_remove+0xd5/0x190 [mt7921e]
     pci_device_remove+0x47/0xc0
     device_release_driver_internal+0x19e/0x200
     driver_detach+0x48/0x90
     bus_remove_driver+0x6d/0xf0
     pci_unregister_driver+0x2e/0xb0
     __do_sys_delete_module.isra.0+0x197/0x2e0
     do_syscall_64+0x7b/0x160
     entry_SYSCALL_64_after_hwframe+0x76/0x7e
    
    Tested with mt7921e but the same pattern can be actually applied to other
    mt76 drivers calling mt76_dma_cleanup() during removal. Tx napi is enabled
    in their *_dma_init() functions and only toggled off and on again inside
    their suspend/resume/reset paths. So it should be okay to disable tx
    napi in such a generic way.
    
    Found by Linux Verification Center (linuxtesting.org).
    
    Fixes: 2ac515a5d74f ("mt76: mt76x02: use napi polling for tx cleanup")
    Cc: stable@vger.kernel.org
    Signed-off-by: Fedor Pchelkin <pchelkin@ispras.ru>
    Tested-by: Ming Yen Hsieh <mingyen.hsieh@mediatek.com>
    Link: https://patch.msgid.link/20250506115540.19045-1-pchelkin@ispras.ru
    Signed-off-by: Felix Fietkau <nbd@nbd.name>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/its: Fix build error for its_static_thunk() [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Mon May 19 13:43:42 2025 -0700

    x86/its: Fix build error for its_static_thunk()
    
    Due to a likely merge resolution error of backport commit 772934d9062a
    ("x86/its: FineIBT-paranoid vs ITS"), the function its_static_thunk() was
    placed in the wrong ifdef block, causing a build error when
    CONFIG_MITIGATION_ITS and CONFIG_FINEIBT are both disabled:
    
      /linux-6.6/arch/x86/kernel/alternative.c:1452:5: error: redefinition of 'its_static_thunk'
       1452 | u8 *its_static_thunk(int reg)
            |     ^~~~~~~~~~~~~~~~
    
    Fix it by moving its_static_thunk() under CONFIG_MITIGATION_ITS.
    
    Fixes: e52c1dc7455d ("x86/its: FineIBT-paranoid vs ITS")
    Reported-by: Natanael Copa <ncopa@alpinelinux.org>
    Link: https://lore.kernel.org/all/20250519164717.18738b4e@ncopa-desktop/
    Reported-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>