Apache Flink Agents is a streaming Agent OS for enterprise, production-grade scenarios. Three characteristics define it: it is event-driven, distributed, and reliable. Like any other Agent OS, it manages the core building blocks of an agent — orchestration, context, memory, and tool/MCP invocation. But more than that, because it targets production workloads at scale, it also handles distributed coordination, consistency, fault tolerance and recovery.
In one line, Flink Agents brings AI agents into the Flink streaming pipeline — an agent becomes a first-class operator in your real-time datastream. Why would you want that? Because a fast-growing class of workloads needs AI decisions to be made in the flow of live events, not in response to a human prompt:
These production agent scenarios share a common shape:
Years of building Flink have taught us that a reliable, distributed, coordinated event-stream processing system is not something you get by running a single-node agent as a few replicas, adding some retries, and switching to event-triggered invocation. Massive scale, millisecond latency, exactly-once consistency, fault tolerance, and stateful coordination are hard distributed-systems problems — and they are exactly what Flink has spent over a decade solving in production. So instead of reinventing that runtime, Flink Agents lets you run modern AI agents directly on Flink's battle-tested distributed streaming runtime.
Agent Skills are an emerging standard for packaging prompts, tools, and resources into self-contained capabilities that an agent can discover and load on demand. Flink Agents 0.3 adds support for using Agent Skills, available in both the Python and Java APIs. See the documentation for more details.
Long-Term Memory is now backed by Mem0 in both Python and Java, replacing the previous vector-store-based implementation. This delivers more robust semantic retrieval, summarization, and isolation, and comes with a Mem0 vector store integration.
Flink Agents 0.3 introduces a declarative YAML API for describing agents in both Python and Java. Resources such as chat model connections, prompts, tools, and vector stores are declared in YAML, while action logic continues to live in Python or Java code and is referenced from YAML via function pointers. This separation decouples infrastructure configuration from business logic, making it easy to swap model providers or adjust prompts across environments without touching action code. A JSON Schema is published alongside the YAML API to support IDE validation, autocompletion, and LLM-assisted authoring.
Building on the cross-language resource access introduced in 0.2, Flink Agents 0.3 adds Cross-Language Actions: you can author an action in one language and run it inside an agent built in the other — for example, invoking a Java action from a Python agent, or vice versa. Functions, FunctionTools, and events are now unified across the two runtimes.
Durable execution gains a reconciler mechanism: durable blocks can register reconciler callables to reconcile in-flight side effects upon failure recovery, with reconciler exceptions persisted as failures. This is supported in both Python and Java.
Flink Agents 0.3 adds Apache Fluss (Incubating) as a supported backend for the action state store, alongside the existing options Kafka.
Flink Agents 0.3 broadens its built-in integrations across model providers and vector stores:
Chat Models:
Embedding Models:
Vector Stores:
@Action(listenEvents = {InputEvent.class}) becomes @Action(listenEventTypes = {InputEvent.EVENT_TYPE}); in Python, @action(InputEvent) becomes @action(InputEvent.EVENT_TYPE).chat() signature: Prompt arguments and model parameters are now separate. Java chat(messages, params) becomes chat(messages, promptArgs, modelParams) and Python chat(messages, **kwargs) becomes chat(messages, prompt_args, **kwargs); the connection-layer argument is renamed to modelParams.ChatModelSetup requires an explicit model name; configurations that previously relied on a default must specify one.embedding_model is now optional (pass pre-computed embeddings when omitted), and queries support a unified filters DSL.We know that a stable, backward-compatible API matters to anyone building on Flink Agents, and committing to compatibility as early as possible is a priority for us. At the same time, the agent space is evolving fast, and our own experience working alongside early users has surfaced real usability gaps in the current API. We'd rather fix those now than freeze them into a 1.0 we all have to live with.
So our plan is:
If you're building on Flink Agents today, thank you — your feedback is exactly what's shaping these final changes. Expect one more migration across 0.3 → 0.4 → 1.0, and a stable foundation from there.
The Apache Flink community would like to thank each and every one of the contributors that have made this release possible:
Adesh Nalpet Adimurthy, Alan Z., Avichay Marciano, Eugene, Haocong Wang, Howie Wang, JennyChen, Jinkun Liu, Joey Tong, Junbo Wang, Kerui Wang, Leonard Xu, meichuanyi, Nico Duldhardt, Vino1016, WAR10CK, Weiqing Yang, Wenjin Xie, XL Zhao, Xintong Song, Xuannan, Yash Anand, bosiew.tian, daken, hope, twosom, vishnu prakash, wangxinglong, yan.xu, yunfengzhou-hub
207 posts | 58 followers
FollowApache Flink Community China - April 17, 2023
Apache Flink Community - April 16, 2026
Data Geek - May 9, 2023
Apache Flink Community China - November 8, 2023
Apache Flink Community China - April 17, 2023
Apache Flink Community - October 15, 2025
207 posts | 58 followers
Follow
Realtime Compute for Apache Flink
Realtime Compute for Apache Flink offers a highly integrated platform for real-time data processing, which optimizes the computing of Apache Flink.
Learn More
Message Queue for Apache Kafka
A fully-managed Apache Kafka service to help you quickly build data pipelines for your big data analytics.
Learn More
Big Data Consulting for Data Technology Solution
Alibaba Cloud provides big data consulting services to help enterprises leverage advanced data technology.
Learn More
Big Data Consulting Services for Retail Solution
Alibaba Cloud experts provide retailers with a lightweight and customized big data consulting service to help you assess your big data maturity and plan your big data journey.
Learn MoreMore Posts by Apache Flink Community