Commit d5aef48

Fix for theme settings
1 parent 5b6769a commit d5aef48

2 files changed: +8 -8 lines changed

input/theme-settings.txt

Lines changed: 8 additions & 0 deletions
@@ -0,0 +1,8 @@
+This file contains the original theme settings for leanpub, because I want to keep them around in case I want to go back and restore the original settings.
+
+Spacing between list items
+--------------------------
+Top-level lists: -0.7
+First level nested lists: 0
+Second level nested lists: 0
+Third level nested lists: 0

manuscript/01-understanding-large-language-models.md

Lines changed: 0 additions & 8 deletions
@@ -377,30 +377,22 @@ Having said that, I've had great experiences with these models:
 what Meta says because the instruction-tuned version is also useful for non-chat
 purposes, as long as you have an instruction-based use case.
 
-
-
 - **Mistral:** The Mistral model by the identically named company is a fast open-source
 LLM mostly used for chat purposes. This model is generally less capable than the LLaMA
 model variants, but its speed makes up for that. This model is also hosted on
 [HuggingFace][HUGGINGFACE_MISTRAL] and has many fine-tuned variants.
 
-
-
 - **Gemma2:** Google published this model in February 2024 and trained it using a
 [teacher/student technique][HUGGINGFACE_GEMMA2]. The training technique looks very
 interesting, but Gemma2 isn't quite as good for many of the tasks I worked on as the
 other models in the open-source space.
 
-
-
 - **Phi 4:** Is a new model that was [introduced by Microsoft in December
 2024][PHI4_ANNOUNCEMENT]. It is similar to the Mistral and Gemma2 models, 14 billion
 parameters, but shows higher performance in [the benchmarks][PHI4_BENCHMARKS]. While
 this doesn't tell the whole story, it's worth trying this model for a smaller
 open-source model.
 
-
-
 - **DeepSeek** is another new model that's showing great promise. The R1 variant of
 DeepSeek is great at reasoning and shows similar performance to the OpenAI o1 models.
 The V3 variant of the model is great too, for more general-purpose tasks. You should
