Stable and/or mainline release #9735

Open
uhthomas opened this issue Nov 16, 2024 · 5 comments

Comments

@uhthomas
Contributor

Feature Request

It would be great if Talos could provide a version based on the stable or mainline kernels instead of just LTS.

Description

New hardware is sometimes impossible to use with Talos because the LTS kernels do not support it yet. LTS kernels can be up to a year behind mainline, which impedes compatibility with new hardware.

Most recently, the GPUs on new 14th gen Intel CPUs (i915 7d55) don't work because support for them was added in kernel 6.8. LTS is currently 6.6.

I recognise it may be tricky to support multiple kernels at once, and may make it harder to focus, but it's something I've really wanted to see from Talos for a long time and it has often come up as a point of friction.

@onedr0p

onedr0p commented Nov 16, 2024

I am running into this exact issue on a 14th Gen Intel CPU...

k8s-0: kern:    info: [2024-11-16T04:09:27.992549659Z]: i915 0000:00:02.0: Your graphics device 7d55 is not properly supported by i915 in this
kernel version. To force driver probe anyway, use i915.force_probe=7d55
module parameter or CONFIG_DRM_I915_FORCE_PROBE=7d55 configuration option,
or (recommended) check for kernel updates.

As per the workaround, I've tried

machine:
  kernel:
    modules:
      - name: i915
        parameters:
          - force_probe=7d55 # also tried force_probe=!7d55

and

machine:
  install:
    extraKernelArgs:
      - i915.force_probe=7d55 # also tried i915.force_probe=!7d55

without any success; I get the same error in dmesg. It looks like I am stuck either building my own Talos image with a 6.8.x kernel or waiting for the next kernel update here.
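To check whether a node is hitting this same i915 message, the device IDs can be pulled out of the kernel log. This is only a minimal sketch: `unsupported_i915_ids` is a hypothetical helper name, and it assumes the message wording matches the log excerpt quoted above.

```shell
# Hypothetical helper: reads kernel log text on stdin and prints each
# device ID that i915 reports as "not properly supported".
# Assumes the message wording shown in the log excerpt above.
unsupported_i915_ids() {
  grep -o 'Your graphics device [0-9a-fA-F]* is not properly supported' \
    | awk '{ print $4 }' \
    | sort -u
}
```

It could be fed from the node's kernel log, for example `talosctl -n k8s-0 dmesg | unsupported_i915_ids`. Empty output would mean the message was not found in the log.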

@frezbo
Member

frezbo commented Nov 16, 2024

> I am running into this exact issue on a 14th Gen Intel CPU...
>
> k8s-0: kern:    info: [2024-11-16T04:09:27.992549659Z]: i915 0000:00:02.0: Your graphics device 7d55 is not properly supported by i915 in this
> kernel version. To force driver probe anyway, use i915.force_probe=7d55
> module parameter or CONFIG_DRM_I915_FORCE_PROBE=7d55 configuration option,
> or (recommended) check for kernel updates.
>
> As per the workaround, I've tried
>
> machine:
>   kernel:
>     modules:
>       - name: i915
>         parameters:
>           - force_probe=7d55 # also tried force_probe=!7d55
>
> and
>
> machine:
>   install:
>     extraKernelArgs:
>       - i915.force_probe=7d55 # also tried i915.force_probe=!7d55
>
> without any success. Looks like I am stuck trying to build my own Talos image with the 6.8.x Kernel or waiting for the next update to the Kernel from here.

You probably need to add a udev blacklist for the driver and then load it explicitly in the machine config with the module parameter. Otherwise, if udevd has already loaded the module, the module parameter in the machine config is a no-op.

@onedr0p

onedr0p commented Nov 16, 2024

@frezbo I am not familiar with blacklisting with udev. Perhaps I could do it via extraKernelArgs, e.g. module_blacklist=i915, or does it really need to be done via udev rules? Apologies if that is what you meant.

@frezbo
Member

frezbo commented Nov 16, 2024

> @frezbo I am not familiar with blacklisting with udev, perhaps I could via the extraKernelArgs, e.g. module_blacklist=i915 or does it really need to be done via udev rules? Apologies if that is what you meant.

I think the kernel arg would work; it just needs an upgrade. Alternatively, add a file like this: https://github.com/siderolabs/extensions/blob/main/nvidia-gpu/nvidia-modules/lts/pkg.yaml#L18 with content like this: https://github.com/siderolabs/extensions/blob/main/nvidia-gpu/nvidia-modules/lts/files/nvidia.conf

@onedr0p

onedr0p commented Nov 16, 2024

Never mind, it looks like I wasn't running talosctl upgrade after changing

machine:
  install:
    extraKernelArgs:
      - i915.force_probe=7d55

I can see it loaded now.

❯ talosctl -n k8s-0 list /dev/dri/by-path
NODE    NAME
k8s-0   .
k8s-0   pci-0000:00:02.0-card
k8s-0   pci-0000:00:02.0-render

I am unsure about stability; time will tell. Thanks for the rubberducking @frezbo !
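Since the extraKernelArgs change only took effect after an upgrade, one way to confirm it landed is to check the node's running kernel command line for the argument. This is a minimal sketch in POSIX shell: `has_karg` is a hypothetical helper name, not part of talosctl, and it only does a whole-word match on the string it is given.

```shell
# Hypothetical helper: succeeds if the kernel command line in $1
# contains the exact space-separated argument given in $2.
has_karg() {
  case " $1 " in
    *" $2 "*) return 0 ;;
    *) return 1 ;;
  esac
}
```

Assuming the node name from this thread, it could be used as `has_karg "$(talosctl -n k8s-0 read /proc/cmdline)" i915.force_probe=7d55 && echo present`. If the argument is missing, the install section change has most likely not been applied by an upgrade yet.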
