|
22994 | 22994 | - filename: Qwen3-4B-Thinking-2507-GSPO-Easy.Q4_K_M.gguf |
22995 | 22995 | sha256: f75798ff521ce54c1663fb59d2d119e5889fd38ce76d9e07c3a28ceb13cf2eb2 |
22996 | 22996 | uri: huggingface://mradermacher/Qwen3-4B-Thinking-2507-GSPO-Easy-GGUF/Qwen3-4B-Thinking-2507-GSPO-Easy.Q4_K_M.gguf |
| 22997 | +- !!merge <<: *qwen3 |
| 22998 | + name: "qwen3-yoyo-v4-42b-a3b-thinking-total-recall-pkdick-v-i1" |
| 22999 | + urls: |
| 23000 | + - https://huggingface.co/mradermacher/Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V-i1-GGUF |
| 23001 | + description: | |
| 23002 | + ### **Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V** |
| 23003 | + **Base Model:** Qwen3-Coder-30B-A3B-Instruct (Mixture of Experts) |
| 23004 | + **Size:** 42B parameters (finetuned version) |
| 23005 | + **Context Length:** 1 million tokens (native), supports up to 256K natively with Yarn extension |
| 23006 | + **Architecture:** Mixture of Experts (MoE) — 128 experts, 8 activated per forward pass |
| 23007 | + **Fine-tuned For:** Advanced coding, agentic workflows, creative writing, and long-context reasoning |
| 23008 | + **Key Features:** |
| 23009 | + - Enhanced with **Brainstorm 20x** fine-tuning for deeper reasoning, richer prose, and improved coherence |
| 23010 | + - Optimized for **coding in multiple languages**, tool use, and long-form creative tasks |
| 23011 | + - Includes optional **"thinking" mode** via system prompt for structured internal reasoning |
| 23012 | + - Trained on **PK Dick Dataset** (inspired by Philip K. Dick’s works) for narrative depth and conceptual richness |
| 23013 | + - Supports **high-quality GGUF, GPTQ, AWQ, EXL2, and HQQ quantizations** for efficient local inference |
| 23014 | + - Recommended settings: 6–10 active experts, temperature 0.3–0.7, repetition penalty 1.05–1.1 |
| 23015 | + |
| 23016 | + **Best For:** Developers, creative writers, researchers, and AI researchers seeking a powerful, expressive, and highly customizable model with exceptional long-context and coding performance. |
| 23017 | + |
| 23018 | + > 🌟 *Note: This is a quantization and fine-tune of the original Qwen3-Coder-30B-A3B-Instruct by DavidAU, further enhanced by mradermacher’s GGUF conversion. The base model remains the authoritative version.* |
| 23019 | + overrides: |
| 23020 | + parameters: |
| 23021 | + model: Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V.i1-Q4_K_M.gguf |
| 23022 | + files: |
| 23023 | + - filename: Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V.i1-Q4_K_M.gguf |
| 23024 | + sha256: 6955283520e3618fe349bb75f135eae740f020d9d7f5ba38503482e5d97f6f59 |
| 23025 | + uri: huggingface://mradermacher/Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V-i1-GGUF/Qwen3-Yoyo-V4-42B-A3B-Thinking-TOTAL-RECALL-PKDick-V.i1-Q4_K_M.gguf |
0 commit comments