0
, the system won’t retry the request to this model. In general, there will be up to N+1
LLM calls performed to the same model (the original, and the retries.)
To be used in combination with max_switch_model_retries
(max_same_model_retries+1) * (max_switch_model_retries+1)
0
, the system won’t use any other models. In general, Pulze will try the request with N+1
different LLM models (the best model, and one for each “retry”.)
To be used in combination with max_same_model_retries
(max_same_model_retries+1) * (max_switch_model_retries+1)
prompt
, response
, and all the metadata associated with it (labels, weights, costs…)prompt
and the LLM’s response
will not be logged in any way. The log is still visible, labelled, and searcheable.prompt
, response
or labels are stored.Pulze-Labels
header, the policies will be stored as part of the Labels in this format:
Pulze-Labels
are defined.Send {}
if you don't want to send any particular Labels, but want the Pulze-Weights
and Pulze-Policies
to be auto-saved.
Ignoring the Pulze-Labels
header will not store the other header information as part of it -- essentially, ignoring the Pulze-Labels
header is like saying "I don't want labels".privacy_level
isn’t stored at all