Update Llama.cpp & Llama 3 Support #55
Conversation
@vishnuravi @philippzagar Feel free to add to this WIP here after lifting llama.cpp. @vishnuravi can you share some of the issues you had with Llama 3 so we can document them here in case this gets stale or someone else wants to pick it up?
Hi @PSchmiedmayer, I'm running into the same issue documented here: ollama/ollama#3759 and here: https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct/discussions/4. The model does not stop at the stop token and continues generating indefinitely.
I was able to resolve this issue by using a different GGUF with the end token correctly configured.
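For reference, a common workaround when a Llama 3 GGUF does not mark `<|eot_id|>` as the end-of-turn token is to treat it as an explicit stop sequence on the client side. The sketch below is a minimal, hypothetical Swift illustration (not the SpeziLLM or llama.cpp API); `generateNextTokenText` stands in for whatever token-streaming callback is in use.

```swift
// Hypothetical sketch: stop generation manually when a known Llama 3
// end token appears, since a misconfigured GGUF may never report EOS.
let llama3StopSequences = ["<|eot_id|>", "<|end_of_text|>"]

func streamCompletion(generateNextTokenText: () -> String?) -> String {
    var output = ""
    while let piece = generateNextTokenText() {
        // Break out of the loop if the streamed piece contains a stop sequence.
        if llama3StopSequences.contains(where: { piece.contains($0) }) {
            break
        }
        output += piece
    }
    return output
}
```

Fixing the GGUF metadata itself (as done here by switching to a correctly configured GGUF) remains the cleaner solution, since it also benefits other runtimes consuming the same model file.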
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## main #55 +/- ##
==========================================
- Coverage 32.03% 31.18% -0.85%
==========================================
Files 67 67
Lines 2932 3012 +80
==========================================
Hits 939 939
- Misses 1993 2073 +80
Continue to review full report in Codecov by Sentry.
Update Llama.cpp & Llama 3 Support
⚙️ Release Notes
📝 Code of Conduct & Contributing Guidelines
By submitting this pull request, you agree to follow our Code of Conduct and Contributing Guidelines: