list of open-source publicly-available llms for code #49

andre15silva · 2023-05-03T14:37:36Z

	Name	Publication Date	Model Type	Sizes	URL
-	CodeGen	03/22	Decoder	350M, 2B, 6B, 16B	https://huggingface.co/Salesforce/codegen-16B-mono
-	InCoder	04/22	Decoder	1.3B, 6.7B	https://huggingface.co/facebook/incoder-6B
-	CodeGeeX	09/22	Decoder	13B	https://huggingface.co/spaces/THUDM/CodeGeeX
-	santacoder				https://huggingface.co/bigcode/santacoder
-	replit				https://huggingface.co/replit/replit-code-v1_5-3b
-	codet5				https://huggingface.co/Salesforce/codet5-large
-	plbart				https://huggingface.co/models?other=plbart

monperrus · 2023-05-10T05:07:47Z

diff-codegen-350m
diff-codegen-2b
diff-codegen-6b

all fine-tuned from Salesforce’s CodeGen code synthesis models

ref: https://carper.ai/diff-models-a-new-way-to-edit-code/

monperrus · 2023-05-10T05:08:54Z

GPT-J / GPT-J-6B

demo at https://6b.eleuther.ai/
https://github.com/kingoflolz/mesh-transformer-jax/#gpt-j-6b

used by https://arxiv.org/pdf/2208.08289.pdf

andre15silva · 2023-05-10T07:25:05Z

to merge: https://github.com/eugeneyan/open-llms (section "Open LLMs for code")

andre15silva · 2023-05-10T07:31:55Z

codegen2 (also supports infilling)

https://github.com/salesforce/CodeGen2

monperrus · 2023-05-11T12:24:46Z

https://github.com/bigcode-project/starcoder
15.5B parameter model supports code generation and infilling

monperrus · 2023-08-26T06:50:40Z

StableCode by StableAI
https://huggingface.co/stabilityai/stablecode-instruct-alpha-3b

monperrus · 2023-08-26T06:50:54Z

WizardCoder
https://huggingface.co/WizardLM/WizardCoder-15B-V1.0

monperrus · 2023-08-26T06:52:29Z

code-llama by Meta
https://about.fb.com/news/2023/08/code-llama-ai-for-coding/

Code Llama: Open Foundation Models for Code
https://arxiv.org/pdf/2308.12950

monperrus · 2023-10-16T06:12:47Z

The Mistral models https://mistral.ai/

@martinezmatias says they are good.

Mistral 7B
https://arxiv.org/pdf/2310.06825

monperrus · 2023-10-18T15:27:47Z

CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model
https://arxiv.org/pdf/2310.06266

monperrus · 2023-10-21T07:45:39Z

Qwen

CODE-QWEN and CODE-QWEN-CHAT 通义千问 (Alibaba)

QWEN TECHNICAL REPORT
https://arxiv.org/pdf/2309.16609.pdf
https://github.com/QwenLM/Qwen

Qwen2. 5-Coder Technical Report
https://arxiv.org/pdf/2409.12186

Nov 2024:

Qwen 2.5-Coder-32B-Instruct Performance: @Alibaba_Qwen announced Qwen 2.5-Coder-32B-Instruct, which matches or surpasses GPT-4o on multiple coding benchmarks. Early testers reported it as "indistinguishable from o1-preview results" (@hrishioa) and noted its competitive performance in code generation and reasoning.

updated @andre15silva

monperrus · 2023-11-20T07:30:17Z

For the record: CodeTrans

paper CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing
model https://github.com/agemagician/CodeTrans

monperrus · 2023-12-20T13:37:12Z

DeepSeek Coder: Let the Code Write Itself

monperrus · 2024-01-15T14:32:36Z

Magicoder
Magicoder: Source Code Is All You Need
https://arxiv.org/abs/2312.02120
https://huggingface.co/TheBloke/Magicoder-S-DS-6.7B-GGUF

monperrus · 2024-03-04T14:04:05Z

StarCoder2
https://drive.google.com/file/d/17iGn3c-sYNiLyRSY-A85QOzgzGnGiVI3/view

monperrus · 2024-04-02T17:58:46Z

CodeShell Technical Report
https://arxiv.org/pdf/2403.15747

CodeShell-Base, a seven billion-parameter foundation model with 8K context length, showcasing exceptional proficiency in code comprehension, which outperforms CodeLlama in Humaneval after training on just 500 billion tokens (5 epochs).

monperrus · 2024-06-07T08:45:20Z

Mixtral, @FredBonux is able to use it over groq

andre15silva · 2024-06-07T08:49:52Z

mistralai/Codestral-22B-v0.1

https://huggingface.co/mistralai/Codestral-22B-v0.1

monperrus · 2024-08-20T13:23:25Z

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
https://arxiv.org/pdf/2406.11931

monperrus · 2024-10-22T06:00:05Z

aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Completion
https://www.semanticscholar.org/reader/2c5dd0f56eff1caa3edb20354374a9585181ea73

monperrus · 2024-11-08T16:34:41Z

Tencent 's Hunyuan (huggingface, paper)

monperrus · 2024-11-15T08:21:01Z

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
https://arxiv.org/pdf/2411.04905

monperrus changed the title ~~list of llms (for code)~~ list of publicly-available llms for code Aug 26, 2023

monperrus mentioned this issue Oct 17, 2023

api endpoints for LLMs #62

Closed

ASSERT-KTH deleted a comment from bbaudry Oct 21, 2024

monperrus changed the title ~~list of publicly-available llms for code~~ list of open-source publicly-available llms for code Oct 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

list of open-source publicly-available llms for code #49

list of open-source publicly-available llms for code #49

andre15silva commented May 3, 2023 •

edited by monperrus

Loading

monperrus commented May 10, 2023

monperrus commented May 10, 2023 •

edited

Loading

andre15silva commented May 10, 2023 •

edited by monperrus

Loading

andre15silva commented May 10, 2023

monperrus commented May 11, 2023

monperrus commented Aug 26, 2023

monperrus commented Aug 26, 2023

monperrus commented Aug 26, 2023 •

edited

Loading

monperrus commented Oct 16, 2023 •

edited

Loading

monperrus commented Oct 18, 2023

monperrus commented Oct 21, 2023 •

edited

Loading

monperrus commented Nov 20, 2023

monperrus commented Dec 20, 2023

monperrus commented Jan 15, 2024

monperrus commented Mar 4, 2024

monperrus commented Apr 2, 2024

monperrus commented Jun 7, 2024

andre15silva commented Jun 7, 2024 •

edited

Loading

monperrus commented Aug 20, 2024

monperrus commented Oct 22, 2024

monperrus commented Nov 8, 2024

monperrus commented Nov 15, 2024

list of open-source publicly-available llms for code #49

list of open-source publicly-available llms for code #49

Comments

andre15silva commented May 3, 2023 • edited by monperrus Loading

monperrus commented May 10, 2023

monperrus commented May 10, 2023 • edited Loading

andre15silva commented May 10, 2023 • edited by monperrus Loading

andre15silva commented May 10, 2023

monperrus commented May 11, 2023

monperrus commented Aug 26, 2023

monperrus commented Aug 26, 2023

monperrus commented Aug 26, 2023 • edited Loading

monperrus commented Oct 16, 2023 • edited Loading

monperrus commented Oct 18, 2023

monperrus commented Oct 21, 2023 • edited Loading

Qwen

monperrus commented Nov 20, 2023

monperrus commented Dec 20, 2023

monperrus commented Jan 15, 2024

monperrus commented Mar 4, 2024

monperrus commented Apr 2, 2024

monperrus commented Jun 7, 2024

andre15silva commented Jun 7, 2024 • edited Loading

monperrus commented Aug 20, 2024

monperrus commented Oct 22, 2024

monperrus commented Nov 8, 2024

monperrus commented Nov 15, 2024

andre15silva commented May 3, 2023 •

edited by monperrus

Loading

monperrus commented May 10, 2023 •

edited

Loading

andre15silva commented May 10, 2023 •

edited by monperrus

Loading

monperrus commented Aug 26, 2023 •

edited

Loading

monperrus commented Oct 16, 2023 •

edited

Loading

monperrus commented Oct 21, 2023 •

edited

Loading

andre15silva commented Jun 7, 2024 •

edited

Loading