Skip to content

Commit 378b5fe

Browse files
committed
Add support for Pascal and Volta
1 parent 748efa4 commit 378b5fe

File tree

2 files changed

+16
-12
lines changed

2 files changed

+16
-12
lines changed

.github/workflows/build-cuda.yml

Lines changed: 13 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -63,23 +63,23 @@ jobs:
6363
- cuda_version: '12.4.1'
6464
cuda_version_short: '12.4'
6565
cuda_tag: '12.4.1-cudnn-devel-ubuntu22.04'
66-
architectures: '75-virtual;80-virtual;86-virtual;89-virtual;90-virtual'
66+
architectures: '61-virtual;70-virtual;75-virtual;80-virtual;86-virtual;89-virtual;90-virtual'
6767
- cuda_version: '12.6.3'
6868
cuda_version_short: '12.6'
6969
cuda_tag: '12.6.3-cudnn-devel-ubuntu22.04'
70-
architectures: '75-virtual;80-virtual;86-virtual;89-virtual;90-virtual'
70+
architectures: '61-virtual;70-virtual;75-virtual;80-virtual;86-virtual;89-virtual;90-virtual'
7171
- cuda_version: '12.8.1'
7272
cuda_version_short: '12.8'
7373
cuda_tag: '12.8.1-cudnn-devel-ubuntu22.04'
74-
architectures: '75-virtual;80-virtual;86-virtual;89-virtual;90-virtual;100-virtual;120-virtual'
74+
architectures: '61-virtual;70-virtual;75-virtual;80-virtual;86-virtual;89-virtual;90-virtual;100-virtual;120-virtual'
7575
- cuda_version: '12.9.1'
7676
cuda_version_short: '12.9'
7777
cuda_tag: '12.9.1-cudnn-devel-ubuntu22.04'
78-
architectures: '75-virtual;80-virtual;86-virtual;89-virtual;90-virtual;100-virtual;120-virtual'
78+
architectures: '61-virtual;70-virtual;75-virtual;80-virtual;86-virtual;89-virtual;90-virtual;100-virtual;120-virtual'
7979
- cuda_version: '13.0.1'
8080
cuda_version_short: '13.0'
8181
cuda_tag: '13.0.1-cudnn-devel-ubuntu22.04'
82-
architectures: '75-virtual;80-virtual;86-virtual;89-virtual;90-virtual;100-virtual;120-virtual'
82+
architectures: '61-virtual;70-virtual;75-virtual;80-virtual;86-virtual;89-virtual;90-virtual;100-virtual;120-virtual'
8383

8484
steps:
8585
- name: Checkout
@@ -240,14 +240,16 @@ jobs:
240240
**Commit:** ${{ needs.check-release.outputs.release_hash }}
241241
242242
## CUDA Versions
243-
- CUDA 12.4 - Architectures: 7.5, 8.0, 8.6, 8.9, 9.0
244-
- CUDA 12.6 - Architectures: 7.5, 8.0, 8.6, 8.9, 9.0
245-
- CUDA 12.8 - Architectures: 7.5, 8.0, 8.6, 8.9, 9.0, 10.0, 12.0
246-
- CUDA 12.9 - Architectures: 7.5, 8.0, 8.6, 8.9, 9.0, 10.0, 12.0
247-
- CUDA 13.0 - Architectures: 7.5, 8.0, 8.6, 8.9, 9.0, 10.0, 12.0
243+
- CUDA 12.4 - Architectures: 6.1, 7.0, 7.5, 8.0, 8.6, 8.9, 9.0
244+
- CUDA 12.6 - Architectures: 6.1, 7.0, 7.5, 8.0, 8.6, 8.9, 9.0
245+
- CUDA 12.8 - Architectures: 6.1, 7.0, 7.5, 8.0, 8.6, 8.9, 9.0, 10.0, 12.0
246+
- CUDA 12.9 - Architectures: 6.1, 7.0, 7.5, 8.0, 8.6, 8.9, 9.0, 10.0, 12.0
247+
- CUDA 13.0 - Architectures: 6.1, 7.0, 7.5, 8.0, 8.6, 8.9, 9.0, 10.0, 12.0
248248
249249
## Architecture Reference
250-
- 7.5: Tesla T4, RTX 2000 series, Quadro RTX
250+
- 6.1: Titan XP, Tesla P40, GTX 10xx
251+
- 7.0: Tesla V100
252+
- 7.5: Tesla T4, RTX 20xx series, Quadro RTX
251253
- 8.0: A100
252254
- 8.6: RTX 3000 series
253255
- 8.9: RTX 4000 series, L4, L40

README.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,12 +24,14 @@ The official llama.cpp repository does not provide pre-built CUDA binaries. This
2424

2525
| Compute Capability | GPU Examples | CUDA 12.4/12.6 | CUDA 12.8+ |
2626
|-------------------|--------------|----------------|------------|
27+
| 6.1 | Titan XP, Tesla P40, GTX 10xx |||
28+
| 7.0 | Tesla V100 |||
2729
| 7.5 | Tesla T4, RTX 2000 series, Quadro RTX |||
2830
| 8.0 | A100 |||
2931
| 8.6 | RTX 3000 series |||
3032
| 8.9 | RTX 4000 series, L4, L40 |||
3133
| 9.0 | H100, H200 |||
32-
| 10.0 | B100, B200, GB200 |||
34+
| 10.0 | B200 |||
3335
| 12.0 | RTX Pro series, RTX 5000 series |||
3436

3537
**Note:** Blackwell architectures (10.0 & 12.0) require CUDA 12.8 or later.

0 commit comments

Comments
 (0)