Skip to content

Commit b87b41e

Browse files
localai-botmudler
andauthored
chore(model gallery): 🤖 add 1 new models via gallery agent (#7017)
chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <[email protected]>
1 parent 424acd6 commit b87b41e

File tree

1 file changed

+29
-0
lines changed

1 file changed

+29
-0
lines changed

gallery/index.yaml

Lines changed: 29 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22994,3 +22994,32 @@
2299422994
- filename: Qwen3-4B-Thinking-2507-GSPO-Easy.Q4_K_M.gguf
2299522995
sha256: f75798ff521ce54c1663fb59d2d119e5889fd38ce76d9e07c3a28ceb13cf2eb2
2299622996
uri: huggingface://mradermacher/Qwen3-4B-Thinking-2507-GSPO-Easy-GGUF/Qwen3-4B-Thinking-2507-GSPO-Easy.Q4_K_M.gguf
22997+
- !!merge <<: *qwen3
22998+
name: "qwen3-yoyo-v4-42b-a3b-thinking-total-recall-pkdick-v-i1"
22999+
urls:
23000+
- https://huggingface.co/mradermacher/Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V-i1-GGUF
23001+
description: |
23002+
### **Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V**
23003+
**Base Model:** Qwen3-Coder-30B-A3B-Instruct (Mixture of Experts)
23004+
**Size:** 42B parameters (finetuned version)
23005+
**Context Length:** 1 million tokens (native), supports up to 256K natively with Yarn extension
23006+
**Architecture:** Mixture of Experts (MoE) — 128 experts, 8 activated per forward pass
23007+
**Fine-tuned For:** Advanced coding, agentic workflows, creative writing, and long-context reasoning
23008+
**Key Features:**
23009+
- Enhanced with **Brainstorm 20x** fine-tuning for deeper reasoning, richer prose, and improved coherence
23010+
- Optimized for **coding in multiple languages**, tool use, and long-form creative tasks
23011+
- Includes optional **"thinking" mode** via system prompt for structured internal reasoning
23012+
- Trained on **PK Dick Dataset** (inspired by Philip K. Dick’s works) for narrative depth and conceptual richness
23013+
- Supports **high-quality GGUF, GPTQ, AWQ, EXL2, and HQQ quantizations** for efficient local inference
23014+
- Recommended settings: 6–10 active experts, temperature 0.3–0.7, repetition penalty 1.05–1.1
23015+
23016+
**Best For:** Developers, creative writers, researchers, and AI researchers seeking a powerful, expressive, and highly customizable model with exceptional long-context and coding performance.
23017+
23018+
> 🌟 *Note: This is a quantization and fine-tune of the original Qwen3-Coder-30B-A3B-Instruct by DavidAU, further enhanced by mradermacher’s GGUF conversion. The base model remains the authoritative version.*
23019+
overrides:
23020+
parameters:
23021+
model: Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V.i1-Q4_K_M.gguf
23022+
files:
23023+
- filename: Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V.i1-Q4_K_M.gguf
23024+
sha256: 6955283520e3618fe349bb75f135eae740f020d9d7f5ba38503482e5d97f6f59
23025+
uri: huggingface://mradermacher/Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V-i1-GGUF/Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V.i1-Q4_K_M.gguf

0 commit comments

Comments
 (0)