Inference on CPU or MPS(Arm based Mac) ? #3

Pawandeep-prog · 2023-05-12T03:37:18Z

Is there any workaround for running inference on CPU or my arm based Mac M1.
Currently trying to run on Mac m1 and I am getting the following error

 /Users/pawandeepsingh/Documents/Development/llm/PaLM/inference.py:50 in main 
 ❱ 50 │   model = torch.hub.load("conceptofmind/PaLM", args.model).to(device).to(dtype)  

RuntimeError: Attempting to deserialize object on a CUDA device but torch.cuda.is_available() is False.
If you are running on a CPU-only machine, please use torch.load with map_location=torch.device('cpu') 
to map your storages to the CPU.

Thanks.

The text was updated successfully, but these errors were encountered:

conceptofmind · 2023-05-12T03:40:09Z

I will have to convert it to cpp at some time in the near future.

Pawandeep-prog · 2023-05-12T03:42:09Z

Thanks for quick reply.
Will be waiting and also will be looking to contribute to that.
:)

conceptofmind · 2023-05-16T22:54:55Z

Thanks for quick reply. Will be waiting and also will be looking to contribute to that. :)

You can map the model to the CPU as well by doing:

    device = torch.device("cpu")

    model = PaLM(
        num_tokens=50304, dim=1024, depth=24, dim_head=128, heads=8, flash_attn=False, qk_rmsnorm = False,
    ).to(device).eval()

    checkpoint = torch.load('./palm_410m_8k_v0.pt', map_location=device)
    model.load_state_dict(checkpoint)

I still need to build the .cpp version but this should work for the meantime. I will put a note in the documentation.

tomsib2001 · 2023-09-14T23:24:16Z

Should the parameters in this script be changed for, say, the 1B version?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inference on CPU or MPS(Arm based Mac) ? #3

Inference on CPU or MPS(Arm based Mac) ? #3

Pawandeep-prog commented May 12, 2023

conceptofmind commented May 12, 2023

Pawandeep-prog commented May 12, 2023

conceptofmind commented May 16, 2023 •

edited

Loading

tomsib2001 commented Sep 14, 2023

Inference on CPU or MPS(Arm based Mac) ? #3

Inference on CPU or MPS(Arm based Mac) ? #3

Comments

Pawandeep-prog commented May 12, 2023

conceptofmind commented May 12, 2023

Pawandeep-prog commented May 12, 2023

conceptofmind commented May 16, 2023 • edited Loading

tomsib2001 commented Sep 14, 2023

conceptofmind commented May 16, 2023 •

edited

Loading