makemore/lecture4/backprop/dC #57

we89 · 2024-06-26T14:57:44Z

dC = torch.zeros_like(C)
for k in range(Xb.shape[0]):
    dC[Xb[k]] += demb[k]

I don't understand why the version above is wrong.
The correct answer is:

dC = torch.zeros_like(C)
for k in range(Xb.shape[0]):
    for j in range(Xb.shape[1]):
        ix = Xb[k,j]
        dC[ix] += demb[k,j]

Mountagha · 2024-06-27T15:58:58Z

What is your question? Could you elaborate?

junqi-lu · 2024-09-15T19:00:01Z

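This comes down to how PyTorch (and, I believe, NumPy) handles in-place operations through advanced indexing when the index contains repeated values. `dC[Xb[k]] += demb[k]` reads the selected rows, adds to that copy, and writes the result back, so updates to a repeated index are not accumulated. It seems PyTorch ends up adding only one row of the source to that index, apparently the last one.

So if Xb[0] = [1, 1, 1], dC[1] receives only the last row, demb[0][2], instead of the sum demb[0][0] + demb[0][1] + demb[0][2].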
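A small sketch that reproduces the difference. The shapes and values here are made up for illustration (they are not the ones from the lecture), and `demb` is filled with ones so that the result does not depend on which duplicate write happens to win. `index_add_` is shown as one possible loop-free alternative that does accumulate over repeated indices; it is not necessarily the approach used in the lecture.

```python
import torch

# Toy sizes, chosen only for this illustration.
vocab_size, n_embd = 5, 2
C = torch.zeros(vocab_size, n_embd)
Xb = torch.tensor([[1, 1, 1],
                   [0, 2, 2]])            # row 0 repeats index 1 three times
demb = torch.ones(Xb.shape[0], Xb.shape[1], n_embd)

# Reference: explicit double loop, one scalar index at a time.
dC_loop = torch.zeros_like(C)
for k in range(Xb.shape[0]):
    for j in range(Xb.shape[1]):
        dC_loop[Xb[k, j]] += demb[k, j]

# Problematic version: advanced indexing with repeated indices.
# Equivalent to: tmp = dC[Xb[k]]; tmp += demb[k]; dC[Xb[k]] = tmp
dC_bad = torch.zeros_like(C)
for k in range(Xb.shape[0]):
    dC_bad[Xb[k]] += demb[k]

# Loop-free alternative that accumulates over duplicates.
dC_vec = torch.zeros_like(C)
dC_vec.index_add_(0, Xb.view(-1), demb.view(-1, n_embd))

print(dC_loop[1], dC_bad[1], dC_vec[1])
```

Here `dC_loop[1]` and `dC_vec[1]` both hold the sum of three contributions, while `dC_bad[1]` holds only one.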

You can find more in this.
