From 082ea1f778802446ce131fc8974a0c28c7b63017 Mon Sep 17 00:00:00 2001
From: DefTruth <31974251+DefTruth@users.noreply.github.com>
Date: Sat, 20 Apr 2024 12:27:34 +0800
Subject: [PATCH] =?UTF-8?q?=F0=9F=94=A5=F0=9F=94=A5[Hidden=20Transfer]=20P?=
 =?UTF-8?q?arallel=20Decoding=20via=20Hidden=20Transfer=20for=20Lossless?=
 =?UTF-8?q?=20Large=20Language=20Model=20Acceleration(@pku.edu.cn=20etc)?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 README.md | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index f206c9e..147417d 100644
--- a/README.md
+++ b/README.md
@@ -230,7 +230,8 @@ Awesome-LLM-Inference: A curated list of [📙Awesome LLM Inference Papers with
 |2023.10|[**OSD**] Online Speculative Decoding(@UC Berkeley etc) | [[pdf]](https://arxiv.org/pdf/2310.07177.pdf)| ⚠️ |⭐️⭐️|
 |2023.12|[**Cascade Speculative**] Cascade Speculative Drafting for Even Faster LLM Inference(@illinois.edu) | [[pdf]](https://arxiv.org/pdf/2312.11462.pdf)| ⚠️ |⭐️|
 |2024.02|🔥[LookaheadDecoding] Break the Sequential Dependency of LLM Inference Using LOOKAHEAD DECODING(@UCSD&Google&UC Berkeley)|[[pdf]](https://arxiv.org/pdf/2402.02057.pdf)| [[LookaheadDecoding]](https://github.com/hao-ai-lab/LookaheadDecoding) ![](https://img.shields.io/github/stars/hao-ai-lab/LookaheadDecoding.svg?style=social) |⭐️⭐️ |
-|2024.02| [speculative Decoding] Decoding Speculative Decoding(@cs.wisc.edu)|[[pdf]](https://arxiv.org/pdf/2402.01528.pdf)| ⚠️ |⭐️|
+|2024.02|🔥🔥[**Speculative Decoding**] Decoding Speculative Decoding(@cs.wisc.edu)|[[pdf]](https://arxiv.org/pdf/2402.01528.pdf)| ⚠️ |⭐️|
+|2024.04|🔥🔥[**Hidden Transfer**] Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration(@pku.edu.cn etc)|[[pdf]](https://arxiv.org/pdf/2404.12022.pdf)| ⚠️ |⭐️|
 ### 📖Structured Prune/KD/Weight Sparse ([©️back👆🏻](#paperlist))