Ephemeral
Ephemeral cache type.
Caches the prompt prefix up to and including the block this is attached to. Cache entries are reused across requests that share the same prefix within the TTL window.
Ephemeral cache type.
Caches the prompt prefix up to and including the block this is attached to. Cache entries are reused across requests that share the same prefix within the TTL window.