Changelog

AI Gateway Helm chart v0.1.1

March 31, 2025

🆕 Features:

  • Adds readiness configuration and a readiness endpoint for the AI Gateway Core image (see the sketch after this list).

  • Updates AI Gateway Core image to v1.0.2.

  • Updates AI Gateway Processors image to v1.0.1.
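
A minimal sketch of what a readiness endpoint enables, written in Python for illustration only; the host, port, and path below are hypothetical placeholders, not values taken from the chart:

    # Poll the AI Gateway Core readiness endpoint until it reports ready.
    # READINESS_URL is a hypothetical placeholder; use the endpoint exposed
    # by your chart deployment.
    import time
    import urllib.error
    import urllib.request

    READINESS_URL = "http://aigw-core.ai-gateway.svc:8080/readyz"  # hypothetical

    def wait_until_ready(url: str, timeout_s: int = 60) -> bool:
        deadline = time.monotonic() + timeout_s
        while time.monotonic() < deadline:
            try:
                with urllib.request.urlopen(url, timeout=2) as resp:
                    if resp.status == 200:  # core is ready to receive traffic
                        return True
            except (urllib.error.URLError, OSError):
                pass  # not ready yet; keep polling
            time.sleep(2)
        return False

    if __name__ == "__main__":
        print("ready" if wait_until_ready(READINESS_URL) else "not ready")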

✅ Compatibility:

  • Kubernetes version 1.25.0-0 or higher

📦 Helm registry:

  • oci://private-registry.f5.com/aigw/aigw:0.1.1

AI Gateway Core v1.0.2

March 31, 2025

🐛 Bug fixes:

  • Fix the core not rejecting requests and responses in parallel stages: previously, if a stage was parallel, the core would not reject the request or response even when a processor rejected it.

✅ Compatibility:

  • Kubernetes version 1.25.0-0 or higher

📦 Container images:

  • private-registry.nginx.com/aigw/aigw:v1.0.2

AI Gateway Core v1.0.1

March 28, 2025

✨ Highlights:

  • The release includes a number of bug fixes.

🐛 Bug fixes:

  • Improve error messages in logs to make them more descriptive and user-friendly.

  • Fix the readiness probe health check: previously, the probe could succeed even if the core wasn’t ready to receive traffic.

  • Return a 502 Bad Gateway status when the core doesn’t select any service for a request. Previously, the core would return a 404.

  • Ensure the core adds an error to an OTel trace for a span that corresponds to the profile sequential stage.

  • Ensure the core validates the processor parameters (the params field of the processor configuration).

  • Ensure the core returns a 500 Internal Server Error when a processor executed in a parallel stage modifies the client request or LLM response. Such modifications are not allowed. Previously, the core would disregard the modifications but incorrectly continue processing.

  • Ensure the core fails to start if aigw start is invoked without the config file path.

  • Do not print usage information when the core terminates with an error.

  • Ensure the core returns JSON errors when it cannot match a request against any route. Previously, it would return plain text errors.

  • When a service returns an error, ensure the core propagates the HTTP status code to the client. Previously, the core would always return the 500 status code.

  • Ensure the core returns errors to clients in JSON using the OpenAI-like error format. Previously, in some cases, the core would return errors in a different format (see the sketch after this list).

  • Ensure that when the core is shut down, it exports any pending unexported transactions. Note: if the transactions are not exported within 30 seconds (the default terminationGracePeriodSeconds pod parameter), Kubernetes will kill the pod and the unexported transactions will be lost.

  • Improve validation of the path field in routes.

  • Change the minimum acceptable TLS version for connections to processors and services from v1.3 to v1.2.
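
To illustrate the client-facing behavior described above (propagated status codes and OpenAI-like JSON errors), a client might handle gateway errors as sketched below; the gateway URL is a placeholder and the exact error fields are an assumption:

    # Sketch of a client handling AI Gateway error responses. Assumes the
    # gateway returns JSON errors shaped like OpenAI's
    # {"error": {"message": ..., "type": ..., "code": ...}}; fields other
    # than "error" and "message" may differ.
    import json
    import urllib.error
    import urllib.request

    GATEWAY_URL = "https://aigw.example.com/v1/chat/completions"  # placeholder

    def post_chat(payload: dict) -> dict:
        req = urllib.request.Request(
            GATEWAY_URL,
            data=json.dumps(payload).encode(),
            headers={"Content-Type": "application/json"},
        )
        try:
            with urllib.request.urlopen(req) as resp:
                return json.load(resp)
        except urllib.error.HTTPError as e:
            # The core now propagates the upstream status code (for example,
            # 502 when no service is selected) instead of a blanket 500, and
            # the error body is JSON.
            detail = json.load(e)
            message = detail.get("error", {}).get("message", "unknown error")
            raise RuntimeError(f"gateway error {e.code}: {message}") from e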

✅ Compatibility:

  • Kubernetes version 1.25.0-0 or higher

📦 Container images:

  • private-registry.nginx.com/aigw/aigw:v1.0.1

AI Gateway Processors v1.0.1

March 28, 2025

✨ Highlights:

  • Minor optimizations for the language-id and prompt-injection processors

  • Improved validation of model pipeline parameters

🐛 Bug fixes:

  • Fixed an issue with the processors server not utilizing worker processes when WEB_CONCURRENCY is set (see the sketch below)
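
For context, WEB_CONCURRENCY is a common convention for sizing worker processes in Python web servers. The snippet below only illustrates that convention; it is not the processors server’s actual implementation:

    # Illustration of the WEB_CONCURRENCY convention: take the number of
    # worker processes from the environment, falling back to the CPU count.
    import os

    def worker_count() -> int:
        value = os.environ.get("WEB_CONCURRENCY", "")
        if value.isdigit() and int(value) > 0:
            return int(value)
        return os.cpu_count() or 1

    print(f"starting {worker_count()} worker processes")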

✅ Compatibility:

  • Kubernetes version 1.25.0-0 or higher

📦 Container images:

  • private-registry.nginx.com/aigw/aigw-processors-f5:v1.0.1

AI Gateway v1.0.0

February 25, 2025

AI Gateway Core

✨ Highlights:

  • Export OpenTelemetry Metrics

  • Simplify Route schema configuration.

  • Entitlement and Visibility tracking enabled (for usage-based pricing compliance)

🐛 Bug fixes:

  • Fixed an issue with OpenTelemetry tracing that sometimes resulted in duplicate spans.

  • Improve OpenTelemetry Tracing configuration to avoid performance degradation under load

  • Exit on TLS misconfiguration

  • Fix various Selector evaluation and configuration bugs.

  • Support path-style addressing when connecting to AWS S3 for transaction exporting (see the sketch below).
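
For reference, path-style addressing with an S3-compatible endpoint looks like this in boto3; the endpoint URL, bucket, and object key are placeholders, and this is not the gateway’s internal exporter code:

    # Configure an S3 client for path-style addressing
    # (https://endpoint/bucket/key rather than https://bucket.endpoint/key),
    # which many S3-compatible object stores require.
    import boto3
    from botocore.config import Config

    s3 = boto3.client(
        "s3",
        endpoint_url="https://s3.storage.example.com",  # placeholder endpoint
        config=Config(s3={"addressing_style": "path"}),
    )
    s3.upload_file(
        "transactions.ndjson",          # local file (placeholder)
        "aigw-transactions",            # bucket (placeholder)
        "exports/transactions.ndjson",  # object key (placeholder)
    )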

AI Gateway Processors

✨ Highlights:

  • F5 processor set revised to:

    • prompt-injection

    • system-prompt

    • language-id

    • repetition-detect

  • prompt-injection and language-id processors now process the full conversation history

  • system-prompt processor adds the ability to strip existing system prompts; the ordering of inserted prompts has changed to match model expectations

  • language-id processor now allows rejection

  • Processors now accept the common parameters reject, annotate, and modify for more predictable processor actions (see the sketch after this list)

  • GPU support for model-based processors can now be enabled through the Helm chart.
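
A rough sketch of how the common action parameters might appear inside a processor’s params block, written as a Python dict purely for illustration; the surrounding field names are assumptions, and the real configuration format is defined by the product documentation:

    # Hypothetical processor entry showing the common action parameters.
    prompt_injection = {
        "name": "prompt-injection",  # assumed field name
        "params": {
            "reject": True,    # block the transaction when a detection fires
            "annotate": True,  # attach findings as metadata for later routing
            "modify": False,   # do not rewrite request/response content
        },
    }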

✅ Compatibility:

  • Kubernetes version 1.25.0-0 or higher

📦 Container images:

  • private-registry.nginx.com/aigw/aigw:v1.0.0

  • private-registry.nginx.com/aigw/aigw-processors-f5:v1.0.0

AI Gateway Core v0.2.0

January 23, 2025

🆕 Features:

  • Secure communications between core, processors, and upstream LLMs using private TLS certificates.

  • Authenticate processors using mTLS; private certificates are supported here as well (see the sketch after this list).

  • Configure the core application’s logging level, verbosity, and contextual output.

  • Model-based LLM request routing.

  • List models API for OpenAI
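
To illustrate what mTLS with private certificates involves on the client side of a processor connection, here is a generic Python ssl sketch; the certificate file names are placeholders and this is not the core’s implementation:

    # Build a TLS context that (a) trusts a private CA and (b) presents a
    # client certificate, which is what mutual TLS between the core and a
    # processor requires.
    import ssl

    ctx = ssl.create_default_context(ssl.Purpose.SERVER_AUTH, cafile="private-ca.crt")
    ctx.load_cert_chain(certfile="core-client.crt", keyfile="core-client.key")
    ctx.minimum_version = ssl.TLSVersion.TLSv1_2  # explicit floor for the connection

    # The same context can then back any HTTPS client call, for example:
    # urllib.request.urlopen("https://processor.example.com:8443/", context=ctx)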

🐛 Bug fixes:

  • Fixed the transaction exporter “busy looping”, which caused unnecessary CPU utilization

  • Fixed the OTel exporter failing to shut down cleanly when receiving SIGINT

✅ Compatibility:

  • Kubernetes version 1.25.0-0 or higher

📦 Container images:

  • private-registry.nginx.com/aigw/aigw:v0.2.0

AI Gateway v0.1.0

November 13, 2024

✨ Highlights:

F5 AI Gateway routes generative AI traffic to an appropriate Large Language Model (LLM) backend and protects that traffic against common threats. This includes:

  • Inspecting and filtering client requests and LLM responses

  • Preventing malicious inputs from reaching an LLM backend

  • Ensuring that LLM responses are safe to send to clients

  • Protecting against leaking sensitive information

AI Gateway is a standalone product that runs on Kubernetes.

AI Gateway Core

🆕 Features:

  • Support the following LLM APIs: OpenAI, Microsoft Azure OpenAI Service, Anthropic, Ollama (only between AI Gateway and Ollama)

  • Authentication of client requests using JWT and TLS (see the sketch after this list)

  • Authorization of client requests using JWT claims and HTTP headers

  • Executing a chain of processors for client requests and LLM responses.

  • Routing client requests based on request attributes and metadata generated by processors.

  • Conversion between LLM API schemas when the request schema and the LLM backend schema do not match.

  • Recording every request/response on disk, with the ability to export to AWS S3 or S3-compatible storage.

  • Generating and exporting observability data via OpenTelemetry.

  • Providing a configuration interface via an HTTP API and a config file.
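
As an illustration of the client-facing surface (an OpenAI-style API with JWT-based authentication), a request through the gateway could look like the following; the URL, route path, model name, and environment variable are placeholders:

    # Send an OpenAI-style chat completion request through the gateway,
    # authenticating with a JWT bearer token.
    import json
    import os
    import urllib.request

    GATEWAY_URL = "https://aigw.example.com/v1/chat/completions"  # placeholder route

    payload = {
        "model": "gpt-4o-mini",  # placeholder; routing may also use processor metadata
        "messages": [{"role": "user", "content": "Summarize our return policy."}],
    }
    req = urllib.request.Request(
        GATEWAY_URL,
        data=json.dumps(payload).encode(),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ['AIGW_JWT']}",  # placeholder JWT source
        },
    )
    with urllib.request.urlopen(req) as resp:
        print(json.load(resp)["choices"][0]["message"]["content"])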

✅ Compatibility:

  • Kubernetes version 1.25.0-0 or higher

📦 Container images:

  • private-registry.nginx.com/aigw/aigw:v0.1.0

AI Gateway Processors

🆕 Features:

  • Added the following processors:

    • Language ID

    • Prompt enforcement

    • Prompt injection detection (PI Processor)

    • Repetition detection

  • Efficacy tests for the above processors

  • Added GPU support for model-based processors

  • TLS support added to the processor server

  • Added support for system messages in the processors with a new API revision

✅ Compatibility:

  • Kubernetes version 1.25.0-0 or higher

📦 Container images:

  • private-registry.nginx.com/aigw/aigw-processors-f5:v0.1.0