- Concurrency and Actor Model: An Overview
- Case study: ThingsBoard's Implementation of Actor Model
- Discussions
-
Concurrency means multiple computations are happening at the same time.
-
Processor clock speeds are no longer increasing. Instead, we’re getting more cores with each new generation of chips. In order to get a computation to run faster, we’ll have to split up a computation into concurrent pieces and running these pieces in parallel.
-
Concurrency is not parallelism.
- Concurrency is about dealing with lots of things at once, parallelism is about doing lots of things at once.
- Concurrency is about structure, parallelism is about execution.
-
Implementation: Two common models for concurrent programming
-
Shared memory: In the shared memory model of concurrency, concurrent modules interact by reading and writing shared objects in memory.
- Examples of the shared-memory model:
- A and B might be two processors (or processor cores) in the same computer, sharing the same physical memory.
- A and B might be two programs running on the same computer, sharing a common filesystem with files they can read and write.
- A and B might be two threads in the same Java program, sharing the same Java objects.
- Examples of the shared-memory model:
-
Message passing: In the message-passing model, concurrent modules interact by sending messages to each other through a communication channel. Modules send off messages, and incoming messages to each module are queued up for handling
- Examples of message-passing model:
- A and B might be two computers in a network, communicating by network connections.
- A and B might be a web browser and a web server – A opens a connection to B, asks for a web page, and B sends the web page data back to A.
- A and B might be an instant messaging client and server.
- A and B might be two programs running on the same computer whose input and output have been connected by a pipe, like
ls | grep
typed into a command prompt.
- Examples of message-passing model:
-
-
The actor model in computer science is a mathematical model of concurrent computation that treats actor as the universal primitive of concurrent computation. Actors may modify their own private state, but can only affect each other indirectly through messaging. In actor model, everything is an actor.
-
Actor processing is triggered by receiving a message. The behavior of an actor defines the operations it performs when a message is received. In response to a message, an actor may:
- send a finite number of messages to other actors;
- create a finite number of new actors;
- designate the behavior to be used for the next message it receives. If a new behavior is not specified, the current behavior will be used to process the next message it receives.
-
Messages are sent asynchronously.
- The receiving actor is identified by an actor identity.
- The sender does not wait for the message to be received. There is no indication to the sender when the message was received.
- The receiving actor does not know the identity of the sender, unless the sender explicitly includes their own identity in the message.
- Typically, messages are buffered in an actor's mailbox, which is essentially a queue of messages.
-
Actor concurrency control: each received message is processed serially. The next message cannot be received until the behavior processing the current message determines the behavior for processing subsequent messages.
-
The actor model of computation defines a semantic model. Implementations of actor model must support:
- Fundamental data types: address, behavior, message
- Address: Actors are identified by their addresses. An actor's address represents the capability to send messages to actor.
- Behavior: The behavior of an actor describes the actions it will take when it receives a message.
- Message: A message is an immutable value used to pass data between actors. Messages may include actor addresses, may even be empty.
- Fundamental operations
- Create a new actor
- Send a message from an actor to existing actors
- Change an actor's behavior
Figure 3: An example network of several actors. Each actor has its own mailbox and isolated state. Based on its designated behavior, the actor responds to incoming messages by send new messages, spawn new actors and/or changing its future behavior.
- Fundamental data types: address, behavior, message
- A little bit of history:
- Initially, ThingsBoard used
akka
as their actor system. - ThingsBoard removed
akka
and move to their own implementation on June, 2020 by commit 0fbe40c. - There are 3 reasons for the migration, as explained by Andrew Shvayka - TB architect & co-author
- They suspected a bug in Akka that they were not able to investigate properly due to lack of Scala experience.
- Tired of fixing vulnerabilities in code.
- Simper code, easier to read and improve.
- Initially, ThingsBoard used
- ThingsBoard Architecture
- Each TB node (core / rule engine) can join the cluster and is responsible for certain partitions of the incoming messages.
- Two components using actor system are TB core and TB rule engine.
- TB Core is responsible for handling REST API calls and WebSocket subscriptions. ThingsBoard Core uses Actor System under the hood to implement actors for main entities: tenants and devices.
- TB Rule Engine is the heart of the system and is responsible for processing incoming messages. Rule Engine uses Actor System under the hood to implement actors for main entities: rule chains and rule nodes
-
Code is located in
common/actor
submodule andcommon/message
submodule. -
Fundamental classes
TbActor
: aTbActor
represents an actor.TbActorMsg
: defines messages an actor will receive.TbActorId
: aTbActorId
is the unique identifier of the actor.TbActorRef
: aTbActorRef
is a reference to an actor, its foremost purpose is to provide a way to send messages to the actor it represents.TbActorCreator
: an abstract factory for creating actors, which includes methods for creatingTbActorId
andTbActor
TbActorSystem
: aTbActorSystem
is the entrypoint for creating, looking up, interacting with actors.TbActorCtx
: aTbActorCtx
provides context in which an actor reacts. TheTbActorCtx
helps an actor communicate with other actors in the system (i.e. an actor can send a finite number of messages to other actors it knows; and create a number of child actors). In addition, anTbActorCtx
provides access to the actor's own identity, theTbActorSystem
it is part of, methods for working with the list of child actors it created.Dispatcher
s bind a set of actors to a thread pool. The thread pool is where commands will be executed. Dispatchers are part ofTbActorSystem
.TbActorMailbox
: aTbActorMailbox
provides a reference to the actor, a message queues to store messages the actor received, and context for interaction with other actors.InitFailureStrategy
andProcessFailureStrategy
: strategies for an actor to recover from errors.
-
Fundamental data types
- Address:
TbActorId
provides a unique identifier,TbActorRef
provides a way to send messages to the actor. - Behavior:
TbActor
- combination ofTbActor#process(TbActorMsg msg)
method and state variables of the class. - Message: subclasses of
TbActorMsg
- Address:
-
Actor System
TbActorSystem
(DefaultTbActorSystem
) is the entrypoint for creating, looking up, interacting with actors.TbActorSystem
also manages dispatchers and actor hierarchy.- An actor can interact with the actor system through
TbActorCtx
.
-
Fundamental Operations
-
Create actors
- APIs:
TbActorSystem#createRootActor(...)
to create a root actor andTbActorSystem#createChildActor(...)
to create a child actor. - Internally, these two API methods call the method
TbActorSystem#createActor(dispatcherId, actorCreator, parent)
. This method creates an actor by calling factory methods defined bycreator
. The actor will be run by a dispatcher identified bydispatcherId
. The parent of the actor is defined by theparent
parameter, which is the identifier of the parent actor.
- APIs:
-
Stop actors
- Done by a recursive function:
TbActorSystem#stop(actorId)
- Internally, it removes and destroys the actor identified by the actorId (by destroying the mailbox) and all of its children.
- Done by a recursive function:
-
Send and receive messages between actors
- Messages are sent asynchronously. Messages are placed in a mailbox.
- Two queues: high priority (
TbActorMailbox#highPriorityMsgs
) and normal priority (TbActorMailbox#normalPriorityMsgs
) - Two queue states:
TbActorMailbox#FREE
/TbActorMailbox#BUSY
- Sending a single message from a sender to a receiver.
- The sender calls
ActorRef#tell(TbActorMsg)
for a normal priority message andActorRef#tellWithHighPriority(TbActorMsg)
for a high priority message; - The receiver enqueues the message, tries processing by priority until queues are empty (
TbActorMailbox#processMailbox
).
- The sender calls
- The
dispatcher
is where the actual work is done.
-
Change behavior: by the combination of states and logics by
TbActor#process(TbActorMsg)
method.
-
-
Handle errors
InitFailureStrategy
(TbActor#onInitFailure
): what to do if the system failed to init the actor (mailbox). There are three strategies in used:retryWithDelay
: schedule a retry after a specified delaystop
: stop the actorretryImmediately
: execute a retry immediately
ProcessFailureStrategy
(TbActor#onProcessFailure
): what to do if the actor failed to process a message. There are two strategies:resume()
: resume the actor, keeping its internal state;stop()
: stop the actor permanently.
-
Actor runtime:
Dispatcher
Dispatcher
s bind a set of actors to a thread pool. The thread pool is where command will be executed. Dispatchers are part ofTbActorSystem
.- Internally,
Dispatcher
is just a wrapper of anExecutorService
with identifier. DefaultTbActorSystem#dispatchers
is the registry ofDispatcher
s in the system.
-
Testing actors
- Testing actor behaviors by writing familiar JUnit and Mockito. (
StatsActorTest
) - Testing actor system:
ActorSystemTest
- Testing actor behaviors by writing familiar JUnit and Mockito. (
-
Code is located in
application
module,org.ThingsBoard.server.actors
package. -
Actor system initialization:
DefaultActorService#initActorSystem
-
Usages of TB Actor System in ThingsBoard
- ThingsBoard core: tenants and devices.
- ThingsBoard rule engine: rule chains and rule nodes.
-
- App Actor - responsible for management of tenant actors. An instance of this actor is always present in memory.
- Tenant Actor - responsible for management of tenant device & rule chain actors.
- Device Actor - maintain state of the device: active sessions, subscriptions, pending RPC commands, etc. An actor is created when the first message from the device is processed. The actor is stopped when there is no messages from devices for a certain time.
- Rule Chain Actor - process incoming messages and dispatches them to rule node actors. An instance of this actor (for the root rule chain) is always present in memory.
- Rule Node Actor - process incoming messages, and report results back to rule chain actor. An instance of this actor (for the input node of root rule chain) is always present in memory.
- Stats Actor - process stats messages. An instance of this actor is always present in memory.
-
Usage in ThingsBoard core: (tenants and devices)
- Tenants are created by
AppActor
. TenantActor
,TenantActor.ActorCreator
.DeviceActor
,DeviceActorCreator
,DeviceActorMessageProcessor
- a component defines how to process messages (it is part of aDeviceActor
)- When start? - when a message comes
- When stop? - not immediately, after a specified duration
- Tenant - device communication
- A tenant actor gets/creates device actors by calling:
TenantActor#getOrCreateDeviceActor()
- Messages are subclasses of
DeviceAwareMsg
(which is a subclass ofTbActorMsg
)
- A tenant actor gets/creates device actors by calling:
- Tenants are created by
-
Usage in ThingsBoard rule engine: (rule chains and rule nodes)
RuleChainActor
,RuleChainActor.ActorCreator
,RuleChainActorMessageProcessor
- a component defines how to process messages (it is part ofRuleChainActor
)RuleNodeActor
- rule node behaviors (init, destroy, process),RuleNodeActor.ActorCreator
- actor creation,RuleNodeActorMessageProcessor
- defines how to process messages (it is part ofRuleNodeActor
)- Rule chain to rule node communication
- Rule node to the next rule nodes communication
- The flow: rule node -> rule chain -> next rule nodes.
- Request-reply pattern: includes
actorRef
in the message to get the actor for replying.- Message:
RULE_TO_RULE_CHAIN_TELL_NEXT_MSG
,RuleNodeToRuleChainTellNextMsg
RuleNodeCtx#chainActor
: reference to chain actor
- Message:
- The benefits of actor model
- Learnable by developers and "safer" than lock-based concurrency.
- Structure and manage concurrency applications easier.
- Separation of concerns: business logic and concurrency logic.
- Better concurrency performance: actors are lightweight
-
Actor Model vs. Object Oriented Programming
- Actors look similar to objects? - Calling a method of an object is like passing a message to invoke computation.
- OOP - originally developed without concurrency in mind
- Actor Model - originally developed with concurrency in mind
-
Actor Model vs. Message Queue
- Actors look similar to producers / consumers in MQ? - asynchronous messaging, event-based.
- Actors should be implemented in a single programming language. MQ can connect different systems (e.g. some are implemented in Python, some are in Java)
- Actors are more lightweight.
-
It should be based on the nature of the problem.
- If the problem is to speedup a nested loop / recursion? - parallel loop is likely a better solution.
- Complex system with dependencies and coordination of states - actor model is likely a good solution.
-
Example: Using actor model for an IoT system
- A large number of managed devices, each of which consists of changing internal state.
- Device managers and hierarchical groups of IoT devices.
- The system must support millions of these device actors.
- The system must run on a cluster of nodes (for distributed workload) and should scale elastically (both horizontally across many nodes and vertically on a single node) if more devices come online.
- Wikipedia - Concurrent Computing
- Reading 17: Concurrency
- Actor model - Wikipedia
- Actor model - Wiki C2
- Deconstructing the actor model
- Requirements for an Actor Programming Language
- Actor-based concurrency
- Message Passing and the Actor Model
- Benefits of the Actor Model
- Learn you an actor system for great good: Part 1 and Part 2
- A minimal actor system implementation - Victor Klang https://en.wikipedia.org/wiki/Actor_model_and_process_calculi_history
- SO - When should one use the Actor Model? https://stackoverflow.com/questions/4648280/what-design-decisions-would-favour-scalas-actors-instead-of-jms
- Actor Model for IoT
- Actor Model vs. Object Oriented Programming: SO, Hacker News
-
Demo Accounts
- System Administrator:
[email protected]
/sysadmin
- Tenant Administrator:
[email protected]
/tenant
- Customer User:
[email protected]
/customer
- System Administrator:
-
Update latest telemetry data
ACCESS_TOKEN=<ACCESS-TOKEN-HERE> && curl -v -X POST \ --data "{"temperature":42,"humidity":73}" \ http://localhost:8080/api/v1/$ACCESS_TOKEN/telemetry \ --header "Content-Type:application/json"