Skip to content

Conversation

@docjyJ
Copy link
Collaborator

@docjyJ docjyJ commented Dec 28, 2024

@docjyJ docjyJ force-pushed the ench/noid/vulkan-ai branch from cf639fe to f142de1 Compare December 28, 2024 15:24
@docjyJ docjyJ marked this pull request as draft December 28, 2024 15:28
@szaimen szaimen added 2. developing Work in progress enhancement New feature or request labels Jan 6, 2025
@szaimen
Copy link
Collaborator

szaimen commented Jan 6, 2025

@docjyJ thanks a lot for this PR! :)

@docjyJ docjyJ self-assigned this May 26, 2025
@szaimen
Copy link
Collaborator

szaimen commented Jun 18, 2025

Btw @docjyJ have you had the chance to look a bit further into this? :)

@docjyJ
Copy link
Collaborator Author

docjyJ commented Jun 30, 2025

Not yet, it's quite complicated to have Vulkan on my server. I have a very old version of the Linux kernel so I need to find a solution to update.

@docjyJ
Copy link
Collaborator Author

docjyJ commented Jul 2, 2025

Local AI doesn't have a Vulkan image for arm64...
Since my cloud is on arm, I'm stuck...

I've opened an issue: mudler/LocalAI#5778

@docjyJ docjyJ added the blocked label Jul 2, 2025
@docjyJ docjyJ force-pushed the ench/noid/vulkan-ai branch from f142de1 to 463d825 Compare July 2, 2025 11:43
@docjyJ
Copy link
Collaborator Author

docjyJ commented Jul 2, 2025

I have Vulkan on my PC and am running a local instance of Nextcloud over HTTP, so I opted for simplicity with automatic configuration and explanations for accessing the web interface.

@szaimen
Copy link
Collaborator

szaimen commented Jul 2, 2025

Thanks @docjyJ for continuing the work on this! 😊

I mean you could also try to build the image from source instead of proxying it. Then you could also build for arm64... However of course not sure how feasible this is...

@docjyJ
Copy link
Collaborator Author

docjyJ commented Jul 2, 2025

image
It's using my GPU. But I don't know why it's so slow...

@docjyJ
Copy link
Collaborator Author

docjyJ commented Jul 2, 2025

So WDYT ?

@docjyJ
Copy link
Collaborator Author

docjyJ commented Jul 2, 2025

See that https://www.reddit.com/r/LocalLLaMA/comments/1j1swtj/vulkan_is_getting_really_close_now_lets_ditch/

@szaimen
Copy link
Collaborator

szaimen commented Jul 3, 2025

So WDYT ?

I was wondering if it would be possible to make this container the default LocalAI container in AIO instead of being a variant?

Of course, we would need to resolve the arm64 problem first.

Would it be possible to simply build the project from sources in the Dockerfile? WDYT?

@docjyJ
Copy link
Collaborator Author

docjyJ commented Jul 3, 2025

Yes, I think it could be the main container.

ARM support has an open PR: mudler/LocalAI#5780

@docjyJ docjyJ marked this pull request as ready for review July 3, 2025 10:53
docjyJ added 3 commits July 3, 2025 14:05
Signed-off-by: Jean-Yves <[email protected]>
Signed-off-by: Jean-Yves <[email protected]>
Signed-off-by: Jean-Yves <[email protected]>
docjyJ added 3 commits July 3, 2025 14:05
Signed-off-by: Jean-Yves <[email protected]>
Signed-off-by: Jean-Yves <[email protected]>
@docjyJ docjyJ force-pushed the ench/noid/vulkan-ai branch from d099c84 to 8d013cf Compare July 3, 2025 12:05
@szaimen
Copy link
Collaborator

szaimen commented Jul 4, 2025

Yes, I think it could be the main container.

Cool!

ARM support has an open PR: mudler/LocalAI#5780

I see... Honestly, I would like to wait a few weeks and see if arm64 support evolves upstream so that we can push this forward here...

@szaimen szaimen added 3. to review Waiting for reviews and removed 2. developing Work in progress labels Jul 4, 2025
@docjyJ
Copy link
Collaborator Author

docjyJ commented Jul 4, 2025

yes !

@docjyJ
Copy link
Collaborator Author

docjyJ commented Jul 6, 2025

I added web ui access with basic auth, we can support caddy community container.
Updated doc and use local AI image without preconfigured models (to save space disk, user can install models from local AI web ui)

Signed-off-by: Jean-Yves <[email protected]>
@szaimen
Copy link
Collaborator

szaimen commented Sep 8, 2025

FYI: you can now use string replacement in nextcloud_exec_commands since #6835. See #6835 for example.

@docjyJ
Copy link
Collaborator Author

docjyJ commented Sep 27, 2025

I saw that PR has had the Roadmap label for two weeks. Waiting for news.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

3. to review Waiting for reviews blocked enhancement New feature or request upstream

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants