
Machine Learning Inference

The Spice ML runtime currently supports prediction via an HTTP API exposed by the Spice runtime.

GET /v1/models/:name/predict

curl "http://localhost:3000/v1/models/my_model_name/predict"

Where:

  • name: References the model name defined in spicepod.yaml.

Response

{
  "status": "Success",
  "model_name": "my_model_name",
  "model_version": "1.0",
  "lookback": 30,
  "prediction": [0.45, 0.50, 0.55],
  "duration_ms": 123
}
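
For programmatic access, the endpoint can be called from any HTTP client. The following is a minimal Python sketch using the requests library, assuming the runtime is reachable at localhost:3000 as in the example above; the field access follows the sample response.

import requests

# Request a prediction from a single model.
# Assumes the Spice runtime is listening on localhost:3000.
resp = requests.get("http://localhost:3000/v1/models/my_model_name/predict")
resp.raise_for_status()

body = resp.json()
if body["status"] == "Success":
    # "prediction" holds the forecast values; "lookback" is the number
    # of trailing observations the model consumed.
    print(body["model_name"], body["model_version"])
    print(body["prediction"], f'{body["duration_ms"]} ms')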

POST /v1/predict

It's also possible to run multiple prediction models in parallel, useful for ensembling or A/B testing.

curl --request POST \
  --url http://localhost:3000/v1/predict \
  --data '{
    "predictions": [
      { "model_name": "drive_stats_a" },
      { "model_name": "drive_stats_b" }
    ]
  }'

Where:

  • Each model_name references a model defined in the Spicepod.

Response

{
  "duration_ms": 81,
  "predictions": [
    {
      "status": "Success",
      "model_name": "drive_stats_a",
      "model_version": "1.0",
      "lookback": 30,
      "prediction": [0.45, 0.5, 0.55],
      "duration_ms": 42
    },
    {
      "status": "Success",
      "model_name": "drive_stats_b",
      "model_version": "1.0",
      "lookback": 30,
      "prediction": [0.43, 0.51, 0.53],
      "duration_ms": 42
    }
  ]
}
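
Below is a minimal Python sketch of consuming the batch endpoint for an A/B comparison. The model names are the illustrative ones from the example above, and the comparison logic is an assumption for demonstration, not part of the API.

import requests

# Run two models in parallel via the batch endpoint.
# Model names are illustrative; use names defined in your Spicepod.
payload = {
    "predictions": [
        {"model_name": "drive_stats_a"},
        {"model_name": "drive_stats_b"},
    ]
}
resp = requests.post("http://localhost:3000/v1/predict", json=payload)
resp.raise_for_status()

# Per the sample response, each entry carries its own status and
# duration_ms, so the models can be inspected independently.
for result in resp.json()["predictions"]:
    if result["status"] == "Success":
        print(result["model_name"], result["prediction"])
    else:
        print(result["model_name"], "failed")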

Limitations

  • Univariate predictions only.
  • Multiple covariates are supported.
  • Covariates and the output variate must have a fixed time frequency.
  • No support for discrete or exogenous variables.