Python SDK
Stateful functions are the building blocks of applications; they are atomic units of isolation, distribution, and persistence. As objects, they encapsulate the state of a single entity (e.g., a specific user, device, or session) and encode its behavior. Stateful functions can interact with each other, and external systems, through message passing. The Python SDK is supported as a remote module.
To get started, add the Python SDK as a dependency to your application.
apache-flink-statefun==2.0.0
- Defining A Stateful Function
- Type Hints
- Function Types and Messaging
- Sending Delayed Messages
- Persistence
- Exposing Functions
- Context Reference
Defining A Stateful Function
A stateful function is any function that that takes two parameters, a context
and message
. The function is bound to the runtime through the stateful functions decorator. The following is an example of a simple hello world function.
from statefun import StatefulFunctions
functions = StatefulFunctions()
@functions.bind("example/hello")
def hello_function(context, message):
"""A simple hello world function"""
user = User()
message.Unpack(user)
print("Hello " + user.name)
This code declares a function with in the namespace example
and of type hello
and binds it to the hello_function
Python instance.
Messages’s are untyped and passed through the system as google.protobuf.Any
so one function can potentially process multiple types of messages.
The context
provides metadata about the current message and function, and is how you can call other functions or external systems. A full reference of all methods supported by the context object are listed at the bottom of this page.
Type Hints
If the function has a static set of known supported types, they may be specified as type hints. This includes union types for functions that support multiple input message types.
import typing
from statefun import StatefulFunctions
functions = StatefulFunctions()
@functions.bind("example/hello")
def hello_function(context, message: User):
"""A simple hello world function with typing"""
print("Hello " + message.name)
@function.bind("example/goodbye")
def goodbye_function(context, message: typing.Union[User, Admin]):
"""A function that dispatches on types"""
if isinstance(message, User):
print("Goodbye user")
elif isinstance(message, Admin):
print("Goodbye Admin")
Function Types and Messaging
The decorator bind
registers each function with the runtime under a function type. The function type must take the form <namespace>/<name>
. Function types can then be referenced from other functions to create an address and message a particular instance.
from google.protobuf.any_pb2 import Any
from statefun import StatefulFunctions
functions = StatefulFunctions()
@functions.bind("example/caller")
def caller_function(context, message):
"""A simple stateful function that sends a message to the user with id `user1`"""
user = User()
user.user_id = "user1"
user.name = "Seth"
envelope = Any()
envelope.Pack(user)
context.send("example/hello", user.user_id, envelope)
Alternatively, functions can be manually bound to the runtime.
functions.register("example/caller", caller_function)
Sending Delayed Messages
Functions are able to send messages on a delay so that they will arrive after some duration. Functions may even send themselves delayed messages that can serve as a callback. The delayed message is non-blocking so functions will continue to process records between the time a delayed message is sent and received. The delay is specified via a Python timedelta.
from google.protobuf.any_pb2 import Any
from statefun import StatefulFunctions
functions = StatefulFunctions()
@functions.bind("example/caller")
def caller_function(context, message):
"""A simple stateful function that sends a message to the user with id `user1`"""
user = User()
user.user_id = "user1"
user.name = "Seth"
envelope = Any()
envelope.Pack(user)
context.send("example/hello", user.user_id, envelope)
Persistence
Stateful Functions treats state as a first class citizen and so all stateful functions can easily define state that is automatically made fault tolerant by the runtime. All stateful functions may contain state by merely storing values within the context
object. The data is always scoped to a specific function type and identifier. State values could be absent, None
, or a google.protobuf.Any
.
Attention: [Remote modules](//ci.apache.org/projects/flink/flink-statefun-docs-release-2.0/sdk/modules.html#remote-module) require that all state values are eagerly registered at module.yaml.
Below is a stateful function that greets users based on the number of times they have been seen.
from google.protobuf.any_pb2 import Any
from statefun import StatefulFunctions
functions = StatefulFunctions()
@functions.bind("example/count")
def count_greeter(context, message):
"""Function that greets a user based on
the number of times it has been called"""
user = User()
message.Unpack(user)
state = context["count"]
if state is None:
state = Any()
state.Pack(Count(1))
output = generate_message(1, user)
else:
counter = Count()
state.Unpack(counter)
counter.value += 1
output = generate_message(counter.value, user)
state.Pack(counter)
context["count"] = state
print(output)
def generate_message(count, user):
if count == 1:
return "Hello " + user.name
elif count == 2:
return "Hello again!"
elif count == 3:
return "Third time's the charm"
else:
return "Hello for the " + count + "th time"
Additionally, persisted values may be cleared by deleting its value.
del context["count"]
Exposing Functions
The Python SDK ships with a RequestReplyHandler
that automatically dispatches function calls based on RESTful HTTP POSTS
. The RequestReplyHandler
may be exposed using any HTTP framework.
from statefun import RequestReplyHandler
handler RequestReplyHandler(functions)
Serving Functions With Flask
One popular Python web framework is Flask. It can be used to quickly and easily expose a RequestResponseHandler
.
@app.route('/statefun', methods=['POST'])
def handle():
response_data = handler(request.data)
response = make_response(response_data)
response.headers.set('Content-Type', 'application/octet-stream')
return response
if __name__ == "__main__":
app.run()
Context Reference
The context
object passed to each function has the following attributes / methods.
- send(self, typename: str, id: str, message: Any)
- Send a message to any function with the function type of the the form
<namesapce>/<type>
and message of typegoogle.protobuf.Any
- Send a message to any function with the function type of the the form
- pack_and_send(self, typename: str, id: str, message)
- The same as above, but it will pack the protobuf message in an
Any
- The same as above, but it will pack the protobuf message in an
- reply(self, message: Any)
- Sends a message to the invoking function
- pack_and_reply(self, message)
- The same as above, but it will pack the protobuf message in an
Any
- The same as above, but it will pack the protobuf message in an
- send_after(self, delay: timedelta, typename: str, id: str, message: Any)
- Sends a message after a delay
- pack_and_send_after(self, delay: timedelta, typename: str, id: str, message)
- The same as above, but it will pack the protobuf message in an
Any
- The same as above, but it will pack the protobuf message in an
- send_egress(self, typename, message: Any)
- Emits a message to an egress with a typename of the form
<namespace>/<name>
- Emits a message to an egress with a typename of the form
- pack_and_send_egress(self, typename, message)
- The same as above, but it will pack the protobuf message in an
Any
- The same as above, but it will pack the protobuf message in an
- __getitem__(self, name)
- Retrieves the state registered under the name as an
Any
orNone
if no value is set
- Retrieves the state registered under the name as an
- __delitem__(self, name)
- Deletes the state registered under the name
- __setitem__(self, name, value: Any)
- Stores the value under the given name in state.