Gateway Caching

Use this section to choose the right Gateway caching method for your environment.

Cache Types

Gateway caching has four practical patterns to choose from:

TypeBest forAvoid when
Runtime CachingStandard request acceleration and reduced repeated SaaS fetchesYou need all reads to always bypass cache
Proactive CachingPre-warming and refresh jobs to reduce first-read latencyYour environment is rate-limit constrained
Cluster Cache (Standalone)Shared Redis cache for multi-pod Gateway with low operational overheadYou require cache failover across Redis node or pod failures
Cluster Cache High Availability (HA)Sentinel-based high-availability cache service for resilient shared cacheYou prefer the simplest deployment footprint and can accept standalone Redis risk

Choosing a Caching Method

Use this starting decision flow:

  1. Start with runtime caching for most environments.
  2. Add proactive caching when you need faster first-read performance for frequently used secrets.
  3. Add standalone cluster cache when running multiple Gateway pods that should share cache state.
  4. Use Cluster Cache High Availability (HA) when shared cache availability across failures is a requirement.

For planning guidance and tradeoffs, see Gateway Best Practices: Caching strategy considerations.

For Kubernetes proactive cache sizing guidance, see Gateway Best Practices: Resource planning for Kubernetes proactive cache.

Configure in Gateway UI

To manage cache runtime settings from Gateway Configuration Manager:

  1. Open https://<your-gateway-url>:8000/console.
  2. Go to Gateways > Your Gateway > Manage Gateway > Caching Configuration.
  3. Configure cache and proactive cache options.
  4. Save changes.

For Kubernetes deployment keys (globalConfig.clusterCache, cacheHA, and persistence options), see Helm Values Reference.

ignore-cache Behavior

The ignore-cache flag is intended to bypass cache and fetch directly from SaaS.

akeyless get-secret-value --name /mysecret --ignore-cache true

In disconnected mode (when SaaS is unreachable), runtime still checks cache first even when ignore-cache=true. If the value is not cached, the request fails.


Footer Section