
@josefarias commented Sep 28, 2025

What this does

This PR adds Replicate as a provider. It is WIP as I’m still getting familiar with RubyLLM and Replicate’s API, which has quite a few idiosyncrasies. Tests are notably pending.

Replicate may stand out from other providers in several areas:

  • They don’t expose pricing via the API (as far as I can tell). I’ve opted not to include pricing for now.
  • Different users/companies name their Replicate models using different schemes, so it’s hard to come up with a general pattern for identifying models of the same family. I’ve opted not to include model families for now.
  • Replicate mostly wants you to send them an HTTP request and let them ping you back via webhook when the result is ready, which is different from the existing duck type called via RubyLLM.paint.
    • This HTTP request returns a URL for the prediction, which is notably not the image itself, since the image isn’t ready yet.
    • Returning Image.new(url: prediction_url) seems ill-fitting (because the prediction’s URL is not the image URL). I’ve opted to return the full response JSON instead.
  • Replicate models have drastically different input signatures. I’ve opted to wrap these in a generic model_params hash which we forward to {ProviderInstance}#paint. I folded the existing size kwarg into the same model_params concept to keep uniformity across providers (see the sketch after this list).
  • Replicate’s API is quite flexible, offering a sync mode (vs the default async mode with webhooks), a streaming option, and a number of models with drastically different capabilities. To keep things reviewable and completable with reasonable effort, I’ve opted to limit this PR to the default async mode and to text-to-image models, which is selfishly my immediate need.
    • Including other kinds of models should be fairly manageable as follow-up PRs.
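
To make the model_params point concrete, here is a rough sketch of what a call could look like with this change. The model slug and input keys are purely illustrative (each Replicate model defines its own input schema), and in the default async mode the return value is the prediction JSON rather than a finished image:

```ruby
# Illustrative sketch only: the model slug and the keys inside model_params are
# examples, not a fixed schema. model_params is forwarded to the provider as-is.
prediction = RubyLLM.paint(
  "a watercolor fox in a misty forest",
  model: "black-forest-labs/flux-schnell",
  model_params: { aspect_ratio: "1:1", output_format: "png" }
)

# Async mode returns the prediction JSON, not the image, so the caller polls
# or waits for Replicate's webhook before the output is available.
prediction["status"] # => e.g. "starting"
```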

I patched the models refresh logic to append the Replicate models to the existing ones (patch not committed). Without this patch, the whole file would get overwritten and models would go missing, since I don’t have credentials for all supported providers. The result is probably suboptimal, especially as it didn’t seem to work for regenerating docs. Let me know if you’d like me to try something else!
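
Roughly, the uncommitted patch does something along these lines (illustrative only; the path and the fetched_models variable are placeholders, not the actual diff):

```ruby
# Illustrative sketch, not the committed patch: merge freshly fetched Replicate
# models into the existing registry instead of overwriting the whole file.
require "json"

path = "lib/ruby_llm/models.json"   # assumed location of the generated registry
existing = JSON.parse(File.read(path))

# fetched_models is a placeholder for whatever the refresh task just pulled down.
fresh_replicate = fetched_models.select { |model| model["provider"] == "replicate" }

merged = existing.reject { |model| model["provider"] == "replicate" } + fresh_replicate
File.write(path, JSON.pretty_generate(merged))
```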

Please do let me know if any of the assumptions above don’t hold up or if you’d like to do things differently.

Thanks for reading!

Type of change

  • Bug fix
  • New feature
  • Breaking change
  • Documentation
  • Performance improvement

Scope check

  • I read the Contributing Guide
  • This aligns with RubyLLM's focus on LLM communication
  • This isn't application-specific logic that belongs in user code
  • This benefits most users, not just my specific use case

Quality check

  • I ran overcommit --install and all hooks pass
  • I tested my changes thoroughly
    • For provider changes: Re-recorded VCR cassettes with bundle exec rake vcr:record[provider_name]
    • All tests pass: bundle exec rspec
  • I updated documentation if needed
  • I didn't modify auto-generated files manually (models.json, aliases.json)

API changes

  • Breaking change
  • New public methods/classes
  • Changed method signatures
  • No API changes

Related issues

#410

```diff
 config.around do |example|
   cassette_name = example.full_description.parameterize(separator: '_').delete_prefix('rubyllm_')
-  VCR.use_cassette(cassette_name) do
+  VCR.use_cassette(cassette_name, record: :new_episodes) do
```
josefarias (Author):

This is so existing stubs are kept and only new HTTP interactions are recorded onto the existing cassette. We can remove if we’re not feeling it.

> Not all models support size customization. If a size is specified for a model that doesn't support it (like Google Imagen), RubyLLM may log a debug message indicating the size parameter is ignored. Check the provider's documentation or the [Available Models Guide]({% link _reference/available-models.md %}) for supported sizes.
josefarias (Author):

I’ve removed the log because we’d now support passing arbitrary params. It’d be up to the user to make sure the model supports what they’re passing in.

It’s a bit of a sharp knife, but given Replicate opens the door to tons of models with tons of input signatures, allowing arbitrary params is the only way I can think of to implement this. Open to other ideas though.

I’m wrapping up for today but will add this to the docs later.
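
For that docs follow-up, the before/after for the existing size kwarg would look roughly like this (model and size values are just examples):

```ruby
# Before: size as a dedicated kwarg.
RubyLLM.paint("a cozy cabin at dusk", model: "dall-e-3", size: "1792x1024")

# After (this PR, sketch): size folded into the generic model_params hash.
# model_params is forwarded as-is, so it's on the caller to pass parameters
# the chosen model actually supports.
RubyLLM.paint("a cozy cabin at dusk", model: "dall-e-3", model_params: { size: "1792x1024" })
```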
