All predictors compiled for Linux x86_64 on Muna support remote predictions by default.
Remote predictions are an experimental feature, and can be drastically altered or removed on short notice.
Making a Remote Prediction
Use the muna.beta.predictions.remote.create method to request that a prediction be created in the cloud:
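As a minimal sketch of such a request, the helper below forwards a predictor tag and its inputs to the remote `create` method. Only the method path `beta.predictions.remote.create` comes from this page. The keyword names `tag` and `inputs`, the helper name `predict_remotely`, and the shape of the returned object are assumptions that should be checked against the client reference.

```python
from typing import Any, Mapping


def predict_remotely(client: Any, tag: str, inputs: Mapping[str, Any]) -> Any:
    """Ask the cloud, rather than the local device, to create a prediction.

    `client` is expected to be a Muna client instance. The keyword names
    `tag` and `inputs` are assumptions and are not confirmed by this page.
    """
    return client.beta.predictions.remote.create(tag=tag, inputs=dict(inputs))
```

With a real client this would be called along the lines of `predict_remotely(muna, "@username/predictor", {"name": "Ada"})`, where the tag and input names are hypothetical placeholders.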
Leveraging GPU Acceleration
One advantage of remote predictions is access to orders of magnitude more compute than your local device provides. Muna supports specifying a RemoteAcceleration when creating remote predictions:
The supported RemoteAcceleration values are:
Remote Acceleration | Notes |
---|---|
remote_auto | Automatically use the ideal remote acceleration. |
remote_cpu | Predictions are run on AMD CPU servers. |
remote_a40 | Predictions are run on an Nvidia A40 GPU. |
remote_a100 | Predictions are run on an Nvidia A100 GPU. |
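The following sketch shows one way to select an acceleration from the table above when creating a remote prediction. The four accepted strings are taken from the table. The keyword names `tag`, `inputs`, and `acceleration`, along with the helper name, are assumptions rather than confirmed parts of the API.

```python
from typing import Any, Mapping

# The four values listed in the table above.
REMOTE_ACCELERATIONS = ("remote_auto", "remote_cpu", "remote_a40", "remote_a100")


def predict_with_acceleration(
    client: Any,
    tag: str,
    inputs: Mapping[str, Any],
    acceleration: str = "remote_auto",
) -> Any:
    """Create a remote prediction on the requested remote acceleration.

    The keyword name `acceleration` is an assumption; consult the client
    reference for the exact parameter.
    """
    # Reject unknown values locally instead of sending a bad request.
    if acceleration not in REMOTE_ACCELERATIONS:
        raise ValueError(
            f"Unknown remote acceleration {acceleration!r}; "
            f"expected one of {REMOTE_ACCELERATIONS}"
        )
    return client.beta.predictions.remote.create(
        tag=tag,
        inputs=dict(inputs),
        acceleration=acceleration,
    )
```

Validating the value before the call is a design choice of this sketch: a typo such as `remote_a10` fails immediately with a readable error instead of at the server.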
Remote predictions are priced per second of prediction time (i.e. prediction.latency), at a rate that depends on the remote acceleration.
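To make the billing rule concrete, the sketch below multiplies prediction time by a per-second rate. The function name and the rate are hypothetical; actual rates are on the pricing page. The function takes the time in seconds, so if prediction.latency is reported in another unit, convert it first.

```python
def estimate_prediction_cost(latency_seconds: float, rate_per_second: float) -> float:
    """Estimate the cost of one remote prediction.

    `rate_per_second` is a hypothetical per-second price for the chosen
    remote acceleration; it is not a published Muna rate.
    """
    if latency_seconds < 0 or rate_per_second < 0:
        raise ValueError("latency and rate must be non-negative")
    return latency_seconds * rate_per_second


# A 2-second prediction at a made-up rate of 0.5 per second costs 1.0.
example_cost = estimate_prediction_cost(2.0, 0.5)
```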
See our pricing for more information.
If you want to self-host the remote acceleration servers in your VPC or on-prem, reach out to us.