Quality of training results? #1851
Replies: 4 comments 8 replies
-
Which nanoGPT configuration and which data preparation did you use?
-
Here's the output of training-text-from-scratch with the latest default parameters from the examples page. It seems to be an exact chunk of the input text:
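If the sample looks like a verbatim chunk of the training text, that suggests memorization rather than generalization. A quick sanity check is to search the training corpus for the longest substring of the generated sample that appears verbatim in it; a minimal sketch (the corpus and sample strings below are made up for illustration):

```python
def longest_verbatim_overlap(corpus: str, sample: str, min_len: int = 30) -> int:
    """Length of the longest substring of `sample` (at least `min_len`
    characters) that occurs verbatim in `corpus`; 0 if none does."""
    best = 0
    n = len(sample)
    for i in range(n - min_len + 1):
        if sample[i:i + min_len] not in corpus:
            continue
        # grow the verbatim match greedily to the right
        j = i + min_len
        while j < n and sample[i:j + 1] in corpus:
            j += 1
        best = max(best, j - i)
    return best

# illustrative data, not the thread's actual training text
corpus = "the quick brown fox jumps over the lazy dog " * 3
sample = "fox jumps over the lazy dog and then some novel text"
print(longest_verbatim_overlap(corpus, sample, min_len=10))  # → 28
```

A long overlap relative to the sample length is a strong hint that the model is reproducing training chunks; for real corpora a suffix-automaton or n-gram index would scale better than this quadratic substring scan.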
-
1 training epoch output:
perplexity on input text:
2nd iteration of training output:
2nd iteration perplexity:
3rd iteration output:
3rd iteration perplexity:
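For reference, the perplexity figures printed between iterations are just the exponential of the mean negative log-likelihood over the evaluated tokens, so they can be reproduced from per-token probabilities. A minimal sketch (the probability values are made up for illustration):

```python
import math

def perplexity(token_probs):
    """Perplexity = exp(mean negative log-likelihood over the tokens)."""
    nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(nll)

# made-up probabilities the model assigned to each reference token
probs = [0.25, 0.5, 0.125, 0.25]
print(round(perplexity(probs), 4))  # → 4.0
```

Lower is better; a perplexity near the vocabulary size means the model is guessing roughly uniformly, while a value near 1 on the training text itself is another symptom of memorization.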
-
I have trained on 100 MB of medical records from scratch with the aim of text completion. Training took one week on a Mac Pro (48 GB RAM). The same training data gave excellent results when LoRA-finetuning a 7B LLaMA (PyTorch, GPU cluster).
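For context on why LoRA finetuning is so much cheaper than from-scratch training: the base weight W stays frozen and only a low-rank update is learned, so the effective weight is W + (alpha/r)·B·A with B of shape d×r and A of shape r×k for a small rank r. A minimal sketch of the merge step in plain Python (shapes and values are illustrative, not the poster's actual setup):

```python
def matmul(X, Y):
    """Naive matrix multiply for nested-list matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

def merge_lora(W, A, B, alpha, r):
    """Effective weight after merging a LoRA adapter:
    W_eff = W + (alpha / r) * B @ A, with B: d x r and A: r x k."""
    scale = alpha / r
    BA = matmul(B, A)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, BA)]

# toy 2x2 base weight with a rank-1 adapter
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[1.0], [2.0]]    # d x r = 2 x 1
A = [[0.5, 0.5]]      # r x k = 1 x 2
print(merge_lora(W, A, B, alpha=2.0, r=1))  # → [[2.0, 1.0], [2.0, 3.0]]
```

During training only A and B, i.e. r·(d+k) parameters per layer, receive gradients, which is why a 7B model can be finetuned on far less memory than training it from scratch would need; after merging, inference costs the same as the base model.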
-
I've been experimenting with training-text-from-scratch. Compared to training on the same data with nanoGPT, the results seem considerably worse (given roughly the same training time on CPU). I've been using the default parameters, which differ between the two code bases (llama.cpp vs. nanoGPT). I've noticed that the loading of the training data is somewhat different as well. What are your experiences with llama.cpp training?