ScalaFlow: Continuation-Based Data Flow in Scala
ScalaFlow: Continuation-Based Data Flow in Scala
ScalaFlow: Continuation-Based Data Flow in Scala
Master’s Thesis
ScalaFlow
Continuation Based DataFlow Concurrency in Scala
Submitted on:
Submitted by:
B. Sc. Kai H. Meder
Im Dauenkamp 3
33332 Gütersloh, Germany
Phone: +49 (160) 80 55 0 77
E-mail: kai@meder.info
Supervised by:
Prof. Dr. Ulrich Hoffmann
Fachhochschule Wedel
Feldstraße 143
22880 Wedel, Germany
Phone: +49 (41 03) 80 48-41
E-mail: uh@fh-wedel.de
ScalaFlow
Continuation Based DataFlow Concurrency in Scala
Master’s Thesis by Kai H. Meder
Copyright
c 2010 Kai H. Meder
List of Listings V
1. Introduction 1
1.1. Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2. Scope . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3. Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4. Outline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2. DataFlow Model 7
2.1. Data-Driven Concurrency Model . . . . . . . . . . . . . . . . . . . . 7
2.2. Introducing Nondeterminism . . . . . . . . . . . . . . . . . . . . . . 10
2.3. DataFlow Semantics . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
3. Aspects of Scala 17
3.1. Object Orientation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
3.2. Functional . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
3.3. For Comprehension . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
3.4. Delimited Continuations . . . . . . . . . . . . . . . . . . . . . . . . . 22
5. Networking Capabilities 57
5.1. NIO Dispatcher . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
5.2. Network Sockets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
5.3. Connection Acceptor . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
5.4. HTTP Processing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
III
Contents
5.5. Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
6. ScalaFlow in Action 75
6.1. Explicit Usage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
6.2. Using Variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
6.3. Using Channels . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
6.4. Networking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
7. Conclusion 89
7.1. Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
7.2. Assessment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
B. Installation Guide 95
Bibliography 97
Affidavit 101
IV
List of Listings
V
List of Listings
VI
List of Listings
VII
1
Introduction
Moore proposed his famous law in 1965 [Moo65], stating that the number of transistors
on an integrated circuit would double roughly every two years. This prediction has
been proven correct, allowing engineers to increase the performance of processors
by a comparable factor. Recently though, the raw clock speed of processors has no
longer been increasing at the same pace. Instead, the number of execution cores
in each processor is increasing. This change lead to the coining of the popular
term ”Multicore Crisis” [War07, Far08]. Developers must invest considerable effort
in migrating software from a sequential design to a concurrent design, leveraging
multiple physical and virtual cores. This is a complex task; there are multiple
concurrency models and patterns, each with varying efficiency and implementation
difficulty, depending on the specific problem to be solved.
1.1. Motivation
Creating concurrent programs using the Java programming language relies on threads
and monitors, within a model of shared mutable state [GBB+ 06]. Thread-safety is not
1
1. Introduction
The general solution to scale the number of possible concurrent connections beyond
the JVM imposed thread-limit is to employ the Reactor Pattern [Sch94] to process
asynchronous I/O. However this imposes inversion of control [Fow05] on the program,
increasing the semantic distance between the programmer’s intention and the actual
code.
The goal of this work is to support the development of scalable concurrent programs
without imposing inversion of control. Therefore this thesis defines and focuses on
the DataFlow Model, which is based on the declarative Data-Driven Concurrency
Model. The foundations of the DataFlow Model are single-assignment variables and
communication channels. The control flow is suspended when reading an unassigned
2
1.2. Scope
1.2. Scope
The framework’s basic design is based on Flow s which represent and initialize con-
current computations. Flows can communicate with the main program by returning
a Future [Rep10c]. Variables are the basic building blocks in a DataFlow environ-
ment. They can be bound to a value only once and suspend their readers until
bound. Signal s are based on suspending Variables. They do not contain a value but
allow code to wait for the signal’s invocation by suspending. Channel s are bounded
sequences of variables that can be shared used for communication between concur-
rent computations. Channels have queue semantics; multiple concurrent readers
compete against each other to consume the channel’s head element. Consumers
are transparently suspended when they attempt to read from an empty channel.
Channels provide lazy, eager and bounded buffer flow-control to suspend producers
based on their capacity. Additionally, channels provide higher-order functions like
foreach, map, filter, fold and several others. Pipelines combine multiple channels
3
1. Introduction
This thesis was inspired by Jonas Boner’s dataflow implementation [Bon09] which is
based on Scala Actors and blocking Queues.
1.4. Outline
Prior to the development of the framework, the key concepts of the DataFlow model
are outlined in chapter 2. Selected features of the Scala programming language follow
in chapter 3, including the core feature of this thesis: delimited transformation into
Continuation-Passing style.
4
1.4. Outline
A note regarding listings: all ScalaFlow code excerpts are shortened to a minimum
to focus on key features. All visibility modifiers like private and protected are
omitted for brevity. If irrelevant, variables, methods and functions are only denoted
by a type and not bound to any value or implemented. Lines prefixed with ”scala>”
denote input to the Scala Interpreter shell.
5
DataFlow Model
2
This chapter introduces the declarative Data-Driven Concurrency Model 1 and extends
it with nondeterminism to create the DataFlow Model, specific to this thesis. The first
section 2.1 introduces the basic declarative model of programming and then extends
it with concurrency and dataflow variables to create the Data-Driven Concurrency
Model. The shortcomings of the deterministic model are discussed, and extensions
introduced to address them in section 2.2. The last section 2.3 defines the DataFlow
Model, combining Data-Driven Concurrency Model with non-declarative constructs,
as specified by a Domain Specific Language (DSL) within Scala.
The Declarative Model is based upon the principle that a program’s output is a
mathematical function of its input. A declarative operation is independent of outside
state, stateless and deterministic. Functional and logical programming languages
1
This chapter is based on the book Concepts, Techniques, and Models of Computer Programming
[RH04]
7
2. DataFlow Model
like Haskell (mathematical functions) and Prolog (logical relations) are declarative
programming languages. Declarative programming as a specification can formally
make the implementation dispensable, since the specification is the program. Practi-
cally however, declarative programming still requires an efficient implementation.
Thus a means of transforming specifications into an efficient executable programs is
required, either automatically or manually.
Variables that cause the program to wait until they are bound are called dataflow
variables. It is unreasonable to use such variables in a sequential program since
reading an unbound variable would cause the program to wait forever. In a concurrent
program however, where multiple computations are executed at the same time,
dataflow variables can be used to great effect. The meaning of ”at the same time”
depends on the environment: on a single processor machine the order of concurrent
statements is interleaved (by a nondeterministic scheduler). By contrast on a multi
processor machine, statements from several computations are executed simultaneously
(in parallel). As there is always at least one interleaving that is observationally
equivalent to the parallel execution, the parallel execution can be described in terms
of the interleaved sequence [RH04].
The Data-Driven Concurrent Model extends the declarative model with concurrency
but remains declarative. Therefore the model provides declarative concurrency. A
concurrent declarative program does not need to terminate, instead it computes
incrementally since its input can grow indefinitely: the input might contain unbound
variables, therefore the input is extended when binding the variables. It is declarative
in the sense that the following axiom holds for all possible inputs:
8
2.1. Data-Driven Concurrency Model
– Where partial termination means that the program will eventually stop
executing if its input stops growing.
– Where two results R1 and R2 are logically equivalent if they contain the
same variables and for each variable Vi , values(Vi ,R1 ) = values(Vi ,R2 )
(they define the same set of values).
• Any operation reading a variable has no choice but to wait until the variable is
bound, therefore the nondeterminism does not become visible.
Any deterministic model is also inherently race free, since race conditions result from
observable nondeterministic behaviour. Dataflow variables are order-independent
and replace static dependencies with dynamic dependencies that are determined
by the data itself. The output of one computation can be passed as the input to
another computation, independent of the order in which the two computations are
executed. Furthermore, the reader of a dataflow variable implicitly synchronizes on
the availability of the variable’s value, so no synchronization operations are directly
visible.
9
2. DataFlow Model
10
2.3. DataFlow Semantics
To summarize the preceeding sections, dataflow variables can be used to great effect
in concurrent programs as they provide declarative single-assignment, race-free access,
implicit synchronization and permit data-dependent dynamic dependencies between
computations. Dataflow variables are in the borderland between stateless and stateful
programming. On the one hand, a dataflow variable is stateful because it changes
state from unbound to being bound. On the other hand, it is stateless because binding
is monotonic (see section 2.1). The stateful aspect is required for communication
between concurrent computations and the stateless aspect can be used to gain the
advantages of declarative programming within a more expressive nondeclarative
model. Therefore dataflow variables are a reasonable tool to create concurrent
components with dataflow execution, even within a nondeclarative language such as
Scala.
This section specifies a Domain Specific Language within Scala, implementing the
DataFlow Model. This model provides dataflow execution and combines dataflow vari-
ables from the Data-Driven Concurrency model with nondeterministic communication
channels.
The most basic operation is flow that schedules a computation for concurrent
execution (see listing 2.1). Scheduling of multiple computations is nondeterministic,
but the instructions comprising each computation are executed in sequential order.
By applying flow to a computation, the computation becomes a dataflow execution
in which dataflow variables and channels can be used. Using the operations outside
of a flow yields unspecified behaviour. val denotes an immutable value, which may
be a dataflow variable.
11
2. DataFlow Model
flow {
val x = 30 // value , not a dataflow variable
val y = 12
x + y
}
flow { v () + 2 } // reading
flow { v := 1 } // binding
Communication channels that can be shared among multiple flows are created
by instantiating the Channel class. A channel provides similar suspend/resume
behaviour to a Variable: if empty, readers will be suspended. When an empty
channel is filled with new elements, suspended readers are resumed in FIFO order,
thus implementing the semantics of a Queue.
12
2.3. DataFlow Semantics
c () + c ()
}
flow { // enqueing
c << 1
c << 2
}
Listing 2.4: Channel
The second higher-order function map is executed concurrently and does not sus-
pend the caller. map takes a function of type A => B and returns a newly created
Channel[B]. The passed function is successively applied to each channel element.
The return values are put into the new Channel[B]. In summary, invoking map
concurrently executes the passed function and immediately returns a new channel
that is filled by the output of that function when applied to the values in the passed
channel.
13
2. DataFlow Model
// concurrently executing
val ys = xs . map ( x = > 10* x )
// foreach , suspending
for ( y <- ys )
println ( y )
}
flow { xs << 1; xs << 2; xs << 3; }
Listing 2.6: Channel with For-Comprehensions using both foreach and map
Until now, application of foreach will never return control to the caller, since
it will encounter the empty channel when all elements have been processed. This
suspends foreach indefinitely, resuming when another element is put into the channel.
The same applies for map, with the distinction that map is concurrently executing
and thus its suspension does not affect the caller. To signal an End-of-Stream
to the channel, the channel can be terminated (see listing 2.7). Consumers of
an empty and terminated channel are resumed by raising an TerminatedChannel
Exception. This requires consumers to be termination-aware to catch this exception
appropriately. foreach and map silently catch the exception and return control to
the caller. Therefore, foreach will return control to the caller if a terminated channel
has been fully processed, and is now permanently empty. Terminating an already
terminated channel or putting elements to an already terminated channel will also
raise a TerminatedChannel Exception.
14
2.3. DataFlow Semantics
flow {
xs << 1; xs << 2; xs << 3;
xs <<# ; // terminate channel
}
flow { xs () } // competing with first flow .
// depending on order of execution ,
// this might raise an exception if
// scheduled too late , when the
// terminated channel is already empty !
In conclusion, the Data-Driven Concurrency Model has been extended with non-
determinism to allow for nondeterministic communication channels. This model is
referred to as the DataFlow Model throughout this thesis and has been specified by
a Domain Specific Language within Scala. The DSL provides constructs to create
and process dataflow variables and channels and operations to create concurrent
computations using dataflow execution.
15
Aspects of Scala
3
The ScalaFlow framework presented in this thesis is implemented using the Scala 2.8
release in combination with the Delimited Continuations compiler plug-in. Scala is a
hybrid programming language, fusing object-oriented and functional programming.
As the reader is expected to be familiar with these basic principles, Scala’s most
basic syntax and features will not be described here, but are available at [O+ ].
Section 3.1 describes the object-oriented features of Scala, followed by section 3.2
covering the functional features. Scala is statically typed and features a very pow-
erful type system, including Type Inference and Higher Kinded Types. As Scala
claims to be a ’scalable’ language, prominent features such as for-comprehensions
are ’syntactic sugar’ for arbitrary control structures using a monadic style. The
transformation of for-loops is described in section 3.3. Finally section 3.4 introduces
the concept of Delimited Continuations via a type-directed, selective transformation
into Continuation-Passing Style. This feature is heavily used in this thesis to suspend
and resume computations.
17
3. Aspects of Scala
Scala is based on Java’s core capabilities as it compiles natively into Java bytecode.
Therefore the same basic principles and features of object oriented programming
apply to Scala. Scala also inherits Java’s basic shortcomings, such as type erasure
[Comb]. The foundation of the type system is a class hierarchy, starting with the
Any type which has AnyVal and AnyRef as subtypes. Java’s reference types and the
basic ScalaObject are descendants of AnyRef. There are two bottom types: null
is a subtype of AnyRef and therefore is not assignable to AnyVals, while Nothing is
a subtype of every type. However, there is no value of type Nothing as this type
signals abnormal termination. Value classes (Byte, Int, Double, Boolean, etc.) are
subtypes of AnyVal; they are mapped to Java’s primitive types. Finally, there is a
special value class Unit which corresponds to Java’s void type. Its single instance
value is ().
Scala differentiates (by paradigm) between mutable and immutable classes and fields.
Instances of mutable classes can alter their internal state, whereas the state of
immutable objects can not be modified after construction. Therefore operations
on immutable classes return newly constructed objects to represent the modified
state. Scala’s collection library offers mutable as well as immutable implementations.
Methods may be overloaded as in Java, the resolution is done on the static types of
the arguments.
In contrast to Java, there are no raw or primitive types in Scala. Every type is a class
and all operations are methods on objects, e.g. 1+2 is translated to (1).+(2). Every
method, regardless of its number of arguments, may be written in infix-operator
style. (e.g. str.indexOf(’a’, 13) may be written as str indexOf (’a’, 13)).
Additionally, Scala features prefix and postfix operator notation for unary methods.
Names of methods or operator identifiers may consist of most of the printable
ASCII-characters, including +, :, ?, and #.
18
3.1. Object Orientation
Instead of interfaces, Scala features Traits, which are the fundamental unit of code
reuse. Traits mix the concepts of interfaces and abstract classes: they specify
interface-like contracts by defining methods and fields. But unlike Java’s interfaces,
they also contain pre-implemented code similar to an abstract class. A class is
allowed to inherit (”mix in”) multiple traits. Traits can augment classes by adding
new methods. This allows to create thin interfaces and then extend them to rich
interfaces by mixing in traits containing the auxiliary operations. Apart from adding
new behaviour, traits can modify behaviour by overwriting methods. A class may
mix-in multiple traits which all modify one specific method. The resulting class
contains all these modifications by combining them in a specific order, so traits can
function as ”stackable modifications” (see listing 3.1). The order in which the traits
are mixed-in is significant and the actual linear order of super-calls within the traits
is determined by a linearization-process [Ode10, p.56]. This stackable and linearized
order of super-calls differentiates traits from multiple inheritance.
19
3. Aspects of Scala
3.2. Functional
Scala features currying and partially applied functions. As functions are first-class
value and therefore can be passed as parameters to other functions, it is possible to
define higher-order functions: functions applied to functions (see listing 3.3).
It is possible to define closures 3.4. Closures are functions containing values passed
as arguments (bound variables) and additional free variables. The values of the free
variables are captured in the surrounding scope of the closure definition.
Scala differentiates between for loops and for comprehensions [Ode10, p. 89]. For-
loops of type for (enums) expr execute the expression expr for each binding gen-
erated by the enumerators enums. In contrast, for-comprehensions of type for
20
3.3. For Comprehension
(enums) yield expr execute expr for each binding generated by enums and collect
the results.
The for construct is syntactic sugar for the applications of higher-order func-
tions. For-loops are expanded into applications of foreach(f: A => Unit): Unit
whereas for-comprehensions expand to applications of map(f: A => B): C[B] and
flatMap(f: A => C[B]): C[B]), respectively. Additionally, it is possible to de-
fine guards of type for (enums if guard) ... which expand to applications of
filter(f: A => Boolean). The expansion does not constrain the specific types
of the functions. Therefore, foreach can be defined as foreach(f: A => U): U if
this definition satisfies the type-checker.
The expansion allows to write libraries that can be used in for-loops and for-
comprehension by providing the higher-order functions. Furthermore, arbitrary
control-structures can be used by for-comprehensions, they resemble Haskell’s do-
notation that is used for monadic applications [Coma].
The expansion of the for-loop is demonstrated in listing 3.5, whereas the for-
comprehension is listed in 3.6 and 3.7. The expansion of guards is demonstrated in
listing 3.8.
21
3. Aspects of Scala
There are several techniques for capturing continuations. One method is to save
the stack and restore it upon reification of the continuation. The other method,
used by Scala’s Delimited Continuations compiler plug-in [RMO09], is to selectively
transform the program into Continuation-Passing-Style (CPS). A program written in
imperative style applies functions that return values to the caller. The computation
then continues by invoking other statements and waits for their result. In contrast, a
program written in CPS calls a function and additionally passes the continuation of
the program, which is a first-class value and represented by a function. The function
never returns to the caller, instead the function can apply, transform or even discard
the passed continuation to continue the computation. Therefore the control flow is
explicitly specified by functions that always pass the computation’s continuation. As
a consequence, the state of the program is always stored in closures bypassing the
stack.
22
3.4. Delimited Continuations
Instead of transforming every function into CPS, Scala performs a selective transfor-
mation which only transforms functions that are required to pass along continuations.
The transformation is type-directed by annotations of the form @cps[A] (actually by
fully qualified annotations of the form @cpsParam[A,B]). The annotation @cps[A]
denotes that its function’s continuation returns type A, whereas the annotation
@suspendable is a type-alias for @cps[Unit] and denotes functions which are meant
to only suspend and whose continuations return Unit. As the transformation is
type-directed only functions providing the cps-annotations are transformed. Non-
transformed functions still use the stack, which avoids problems from keeping the
state only in closures on a stack-based architecture like the JVM.
The primitive operations in the CPS context are reset and shift. The operation
reset delimits the range of the continuations, whereas shift captures the current
continuation up to the next enclosing reset as a function. CPS-transformed code
is executed in a continuation-monad of type ControlContext[A,B,C] where type B
and C are denoted by the annotation @cpsParam[A,B]. Listings 3.9, 3.10 and 3.10
show the usage of reset in conjunction with shift.
val r = reset {
shift { k : ( Int = > Int ) = >
// not invoking k
}
// continuation starts here
}
r : Unit = ()
23
3. Aspects of Scala
kArg + 1
} * 2
r : Int = 42
The while loop is rewritten by the CPS plugin to a recursive function application.
This makes it possible to use loops in a CPS context. However a large loop iteration
count will overflow the call stack. This problem can be solved by using trampolining
[Dou09, Ode08] to emulate tail-call optimisation. Another workaround is to throw
exceptions which Scala’s Actor library uses to unwind the call stack [HO07]. Further-
more, it is not possible to use the for-loop on arbitrary collections: the collections’
definitions of foreach and map are not applicable inside the CPS context because
they lack the required @cps[T] annotation.
24
Design and Implementation
4
This chapter describes the implementation of the ScalaFlow framework and discusses
the design decisions made. The ScalaFlow framework provides operations to create
concurrent computations based on the DataFlow model. It is implemented using
Scala 2.8 and uses Scala’s transformation into Continuation-Passing-Style to suspend
computations. The chapter starts by detailing why continuations were chosen as the
mechanism for suspending computations, rather than Java monitors (in section 4.1.
Section 4.2 discusses how computations enter the framework and how they are
processed and scheduled. The foundation of the DataFlow Model is the single-
assignment variable, which is designed and incrementally implemented in section 4.3.
Communication channels introduce nondeterminism into the DataFlow Model. Their
basic implementation is discussed in section 4.4.1. Subsequently channels are extended
with flow control and a termination mechanism in section 4.4.2. Finally, section 4.4.3
splits the channel’s implementation into different traits, defines DSL-aliases and
implements multiple higher-order functions. The chapter ends by combining channels
to form concurrent pipelines in section 4.5.
25
4. Design and Implementation
This section explains the decision to use continuations to suspend and resume
computations, instead of using Java monitors.
Java provides Wait & Notify monitors that can be used to pause computations by
suspending their thread. In the following, operations that suspend threads are called
blocking operations to distinguish the behaviour from other suspending methods.
Invoking wait on an object blocks the calling thread, while invoking notifyAll
on an object unblocks all waiting threads. This can be used to design DataFlow
variables: programs reading a variable first acquire the variable’s monitor. If the
variable is unbound, the accessor calls wait to block the computation’s underlying
thread. Computations that bind a variable to a value also acquire the variable’s
monitor, set the value and then call notifyAll to resume any blocked threads. This
design occupies one thread per concurrent computation. JVM threads are a limited
resource and context switches between threads are expensive. Therefore, this design
has limited scalability.
This problem can be circumvented by using the Task Pattern, where each compu-
tation is represented by an independent task that is submitted to a managed pool
of threads. The tasks are concurrently executed on the pool’s threads. However if
the pool is limited in its size, this can lead to a starvation problem: if all running
tasks block their threads, no further task will be executed until a thread is resumed
and freed. Submitting numerous tasks that read an unbound dataflow variable could
block all of the pool’s threads. The task binding the variable and thus resuming the
waiting threads will not get executed: the blocked threads ”starve”. The starvation
problem can be solved by using a dynamically adapting thread-pool. However this
solution again raises the problem of threads as a limiting resource.
26
4.2. Flows and Scheduling
Since the DataFlow Model is a model of concurrent execution, Java monitors are
still required to synchronize access to critical regions. However the monitor-based
wait mechanism will not be used to suspend computations. To achieve concurrent
execution, computations are executed simultaneously on multiple threads. A managed
thread pool of fixed size will not suffer from starvation if the submitted tasks do
not block their underlying thread. Since the tasks are suspended by capturing their
continuations, the underlying threads will immediately be available to execute other
tasks. A computation is resumed by submitting its continuation to the pool as a
separate task.
This section discusses the way DataFlow computations are created. A solution
to exchange data between concurrent DataFlow computations and the main non-
CPS-transformed program is developed. Subsequently, the scheduling of DataFlow
computations is discussed and multiple scheduler implementations are developed.
A DataFlow computation that enters the ScalaFlow framework is called a flow. Scala
provides ’by-name’ parameters that capture passed expressions without evaluating
them (non-strict evaluation). This allows passing of unevaluated computations with
a terse syntax and evaluation on demand, which is demonstrated in listing 4.1.
27
4. Design and Implementation
trait BlockingAwait {
def await : Boolean
}
trait SuspendingAwait {
def sawait : Boolean @ suspendable
}
The future result returned by flows is implemented by the class FlowResult[A] where
type A denotes the flow’s resulting value. It implements both the BlockingAwait
trait and SuspendingAwait trait to await the flow’s completion. Likewise, it provides
the methods get to obtain the future value by blocking and sget to get the value by
suspending until the flow has completed. This is demonstrated in listing 4.4 and 4.5.
28
4.2. Flows and Scheduling
def get : A
def sget : A @ suspendable
}
29
4. Design and Implementation
trait Scheduler {
def execute ( f : = > Unit ) : Unit
def shutdown : BlockingAwait
}
The second scheduler is a more comfortable variation of the first one: the thread
pool is initialized with Daemon Threads, which are automatically disposed when the
30
4.2. Flows and Scheduling
main program terminates. This allows the programmer to omit an explicit scheduler
shutdown.
Finally, the third scheduler uses the Fork/Join Framework [Lea00] which is targeted
for Java 7. It is designed to support efficient implementation of concurrent divide-
and-conquer algorithms, which fork sub-tasks (divide) and later join them (conquer)
to calculate the overall result. As such the pool is optimized for tasks that spawn sub-
tasks. Furthermore the ForkJoinPool is an optimized thread pool, which employs
a work-stealing mechanism to efficiently distribute submitted tasks among its pool
threads. The pool also provides an asynchronous mode for event-style tasks that
are never joined. Since DataFlow computations are suspended and resumed by
submitting new sub-tasks, and tasks never join forked sub-tasks, the ForkJoinPool
in asynchronous mode is a perfect fit for ScalaFlow.
The scheduler is the central component in ScalaFlow which is accessed by all compo-
nents that submit tasks. DataFlow variables and channels will resume computations
by submitting their captured continuations to the scheduler. Instead of explicitly
passing the scheduler to every component, the scheduler can be implicitly passed.
Scala features Implicit Parameters [Ode10, p.103] which are automatically sup-
plied. The compiler uses variables with a matching type which are in scope and
explicitly marked as implicit e.g. implicit val foo: String = "I’m implic-
itly supplied". This feature is used throughout the following components where
access to the scheduler is required. Since it is still possible to explicitly supply
values to implicit parameters, multiple schedulers can be used and explicitly passed
to components. The final implementation of the flow method defines an implicit
parameter to obtain a reference to the scheduler (see listing 4.8).
object Flow {
// factory method to create a Flow and convert the by - name
computation to a function
def create [ R ]( comp : = > R @ suspendable ) ( implicit scheduler :
Scheduler ) : Flow [ R ] =
new Flow [ R ](() = > comp )
31
4. Design and Implementation
This section will incrementally describe the design of DataFlow variables. DataFlow
variables are single-assignment: they can be bound to a value only once. Attempts to
rebind the variable result in failure and undefined behaviour (practically, an exception
is thrown). Readers attempting to access an unbound variable are suspended until
the variable is bound. Variables may be concurrently accessed by multiple parallel
computations. As a result monitor synchronization or atomic operations are required
to provide thread-safe access. The following design uses a lock-free approach via
atomic references and explicit data-race detection.
32
4.3. Single-Assignment Variables
method, which compares the reference to a comparator and then sets the reference
to a new value if the comparison succeeds. Thus binding the variable to a value is
a matter of AtomRef.compareAndSet(None, Some(value)). If the variable is still
unbound in the exact moment of binding it, the comparison will succeed and the
AtomicReference is set to Some(value) to reflect the bound value. If the variable
has already been bound, the comparison will fail and the implementation must deal
with the failure by throwing an Exception.
To resume the suspended readers, the readers are iteratively polled from the queue.
Each reader is submitted to the scheduler as a task that applies the reader’s continu-
ation with the variable’s new value.
The first implementation of a variable is demonstrated in listing 4.9. The get method
suspends the reader by shifting its continuation if the Option[A] in the Atomi-
cReference is None. Method set tries to set the AtomicReference to Some(value).
If the atomic compareAndSet succeeds, all suspended readers are resumed. A failed
compareAndSet indicates that the single-assignment variable has already been bound:
an exception is thrown to signal failure.
33
4. Design and Implementation
else
throw new Exception ( " variable already bound " )
}
def resumeReaders ( v : A ) : Unit = {
while (! suspendedReaders . isEmpty ) {
val k = suspendedReaders . poll
if ( k != null )
scheduler execute { k ( v ) }
} } }
The implementation presented in (4.9) has one major flaw: vulnerability to data-
races. The interleaving of the concurrent execution of get and set is demonstrated
in the following table; the suspended reader is enqueued after resumeReaders has
completed. Therefore, the suspended reader is enqueued too late and will never be
resumed.
if defined == false
compareAndSet == true
resumeReaders
shift
suspendedReaders offer k
34
4.4. Channels
class Variable [ A ] {
type Reader = ( A = > Unit )
val suspendedReaders = new Co nc u rr en t Li n ke dQ u eu e [ Reader ]
val v = new AtomicReference [ Option [ A ]]( None )
def get : A @ cps [ Unit ] = {
if v . get . isDefined
v . get . get
else shift { k : Reader = >
// handle DataRace # 1
if ( v . get . isDefined )
k ( v . get )
else {
suspendedReaders offer k
// handle DataRace # 2
if ( v . get . isDefined && suspendedReaders remove k )
k ( v . get )
} } } }
Listing 4.10: Data-Race Safe Variable Implementation
4.4. Channels
This section incrementally designs and implements the DataFlow channel, which is
used for inter-flow communication. Starting with an unbound channel, the channel is
35
4. Design and Implementation
A DataFlow channel has the semantics of a queue: elements are enqueued and
dequeued in FIFO order. Multiple concurrent consumers compete for the queue’s
first element which is removed from the channel. Concurrent producers compete for
the order in which elements are enqueued.
class Channel [ A ] {
class Stream [ A ] {
val value = new Variable [ A ]
var next : Stream [ A ] = null
}
var streamTake = new Stream [ A ]
var streamPut = streamTake
}
1. Reference streamTake points to the first element and must not be null
2. Reference streamPut points to the last element and must not be null
36
4.4. Channels
The channel provides two core operations: method take dequeues the first element
from the Stream and returns it. Method put enqueues an element, appending it to
the end of the Stream. When initialized, a new channel is empty; its streamTake and
streamPut references point to the same Stream[A] instance, whose value contains
one unbound variable and next-reference is null. To synchronize concurrent access
the two locks lockTake and lockPut are used.
Method take first acquires lockTake to exclude other consumers. The first element
is referenced by streamTake. Reading its DataFlow variable suspends the consumer
if the variable is unbound. When the consumer is resumed, the element’s next
reference is not null (Invariant4 ). The streamTake reference is advanced to the next
element. lockTake is then released and take returns the value.
Method put, applied with value v, first aquires lockPut to exclude other producers.
The last element is referenced by streamPut. By Invariant3 , its value is unbound
and its next is null. The next reference is set to a newly created Stream instance,
then the DataFlow variable value is bound to v. The order of setting the next
reference and then binding value is crucial to satisfy Invariant4 for suspended
consumers. The method then advances the streamPut reference to the new element
and finally releases lockPut.
37
4. Design and Implementation
However as locks are bound to threads, releasing lockTake in method take will
usually fail. If the channel is empty the consumer will be suspended by the unbound
DataFlow variable. The consumer is resumed by the DataFlow variable as a scheduled
task, which is executed on an arbitrary thread. Thus the resumed consumer can not
release lockTake which may have been acquired from a different thread (prior to
suspension). Instead of using ReentrantLocks, Semaphores with a resource count of
one can be used, since acquiring and releasing a semaphore is thread-independent.
Even with this improvement the method take is defective when multiple consumers
take values from the channel, given the framework’s aim to suspend consumers
instead of blocking their threads. If the channel is empty, the first consumer that
acquires lock lockTake is suspended by the unbound DataFlow variable. The lock is
not released until the consumer resumes, thus other consumers are blocked until the
lock is released.
The following implementation will resolve the problem of consumers being blocked
instead of suspended (see listing 4.14). To suspend each consumer that is accessing
take on an empty channel, the consumer must be suspended by its own DataFlow
variable. Therefore take has to create an empty Stream element for next consumer if
the channel is empty. To synchronize access to the Stream’s next element, take has
38
4.4. Channels
to acquire lockPut. After the lock is acquired, take appends a new Stream element
that will suspend the next consumer.
The following adapted invariants must hold for the Stream of elements:
1. streamTake points to the next element to be taken and must not be null
2. streamPut points to the next element to be bound and must not be null
As a result the put method must check whether empty Stream elements have already
been appended by take. If empty elements already exist put will bind the first
existing element to a value, instead of creating a new element. Since each consumer
is suspended by its own DataFlow variable, lockTake can be released before reading
the variable and potentially getting suspended. In this manner the framework avoids
attempting to acquire and release ReentrantLock from different threads.
To conclude the discussion of locking: the take method acquires both locks if the
channel is empty to exclude producers from concurrently appending. This could
impair channel performance when producers compete with consumers for the lockPut
on an empty channel.
39
4. Design and Implementation
dfvar set v
}
This section extends the unbounded channel to a bounded channel with a specific
capacity. Producers are allowed to append values to the channel until the channel’s
capacity is reached. Producers that call put on a full channel are suspended. They are
resumed in FIFO order when the channel is drained by consumers. The unbounded
channel described in the previous section implicitly operates in eager mode, meaning
that its producers are executing unconstrained. The channel is also extended with a
termination mechanism, which is used to prevent consumers from waiting indefinitely
for new elements. The bounded channel supports multiple types of flow-control which
are determined by the channel’s capacity:
40
4.4. Channels
To suspend producers the CPS primitive shift is used. The operator captures
the producer’s continuation as a function of type Unit => Unit. In contrast to
consumers, the producer does not need any input to resume producing, thus the
function’s parameter is Unit. The captured continuation is enqueued to a concurrent
queue, since consumers concurrently access it to resume producers. To keep track of
the channel’s queue length, an AtomicInteger is used, which allows atomic increment
and decrement operations. The length is incrementing when the put method enqueues
an element and decrementing when the take method dequeues an element.
So far, channels are not able to signal termination to consumers if the produc-
ers finished producing. Consumers are suspended by the empty channel and will
not be resumed until another element is put into the channel. To resume con-
sumers on termination, the unbound DataFlow variables in Stream that suspend
the consumers are bound to the bottom value null, which is expressed by val T =
null.asInstanceOf[A]. This requires the take method to check whether the value
is T (following resumption) and react accordingly.
41
4. Design and Implementation
from consumers by providing higher-order functions such as foreach and map, which
are developed in the next section. To mitigate the performance impact of throwing
exceptions, TerminatedChannel mixes-in the trait ControlThrowable. This avoids
costly stack trace generation by using a singleton instance of TerminatedChannel,
rather than constructing one for every throw.
The implementation of the put method and all relevant fields of the channel class
are listed in 4.16. This is only a slight extension to the previous implementation (see
listing 4.14). After lockTake is acquired, new Stream elements are only appended if
the channel has not yet been terminated. After streamNext is advanced to the next
element, the counter is decremented to reflect the channel’s new size. If the channel
is empty, the counter will be decremented to a negative counter, which represents
waiting consumers. Finally the first suspended producer is resumed (in FIFO order)
by invoking the resumeReaders method, after which the value returned from the
DataFlow variable is checked for T, which signals a resumption due to termination of
the channel.
42
4.4. Channels
lockTake . lock
if (! terminatedFlag && streamTake . next == null ) {
lockPut . lock
if ( streamTake . next == null )
streamTake . next = new Stream [ A ]
lockPut . unlock
}
resumeProducer ()
The implementation of the put method is listed in 4.16. The channel’s fields are
omitted since they are already listed in 4.15. The method begins by checking for
termination after which the counter is incremented. The producer is suspended if
the counter is over capacity. The producer must then acquire the first lockPut lock.
43
4. Design and Implementation
If a consumer takes an element from the channel and thus resumes producer Pa ,
another producer Pb in method put might detect a data race and continue execution.
Both producers will have to acquire the first lockPut lock which synchronizes their
execution. The producer that is able to acquire the lock rechecks the counter and
suspends if the channel is full. When suspending succeeds, the first lock is also
released. The second lock protects the code that binds the streamPut reference and
advances it. Producers that have acquired the first lock, already own the lock and
increment the lock’s hold count instead of acquiring it a second time. Therefore the
method put ends by releasing the lockPut lock until the hold count is zero.
class Channel [ A ]( val capacity : Int ) ( implicit val scheduler :
Scheduler ) {
def isFull = ( counter . get >= capacity )
lockPut . lock
if ( isFull )
suspendProducer ( Some ( lockPut ) )
else
cpsunit
lockPut . lock
val dfvar = streamPut . value
dfvar set v
44
4.4. Channels
To signal termination of the channel and resume suspended consumers, the method
terminate is defined (see listing 4.17). The channel stores its termination state
in the @volatile field terminatedFlag. Additionally a terminatedSignal signals
termination. The method terminate obtains exlusive access to the Stream elements
by acquiring both locks. The next element that would be bound by put is referenced
by streamPut. It is bound to the bottom termination value T to resume its suspended
consumer. Since consumers may have created additional empty DataFlow variables in
method take, all elements reachable from streamPut are also bound to T to resume
their consumers. Finally the locks are released and terminatedSignal is invoked to
signal termination of the channel.
terminatedFlag = true
45
4. Design and Implementation
lockTake . lock
lockPut . lock
streamPut . value := T
lockPut . unlock
lockTake . unlock
terminatedSignal . invoke
}
Listing 4.17: Bounded Channel with Termination - terminate
This section splits the bounded channel into different traits for separation of concerns.
The channel’s interface will be enriched with DSL aliases and multiple higher-order
functions to support Scala’s for-comprehension.
The channel provides three basic operations: putting a value into a channel, taking a
value from a channel and terminating a channel. Therefore, the interface is split into
the traits ChannelPut (4.18), ChannelTake (4.19) and ChannelTerminate (4.20).
Each trait defines the basic abstract methods to deal with its concern and additional
DSL aliases.
46
4.4. Channels
The basic pattern for looping over a channel’s elements is to use an unrestricted
while (true) loop inside a try-catch block to intercept the TerminatedChannel
exception. The loop successively takes values from the channel by calling the take
method. When the channel is terminated and empty, the put method will throw
the TerminatedChannel exception which breaks the while loop. The exception is
intercepted and silently discarded, after which the looping method terminates and
returns control to the caller. Further applications of the method on the terminated
channel instantly return because the channel will throw the TerminatedChannel
exception on the first call to take.
47
4. Design and Implementation
The foreach method takes a function that is successively applied to each element
in the channel (see 4.22). The passed function’s return type is in general Unit,
since the function is meant to perform computations with side effects rather than
producing a result. Generalizing from this behaviour, foreach accepts any function
of type A => U @suspendable, where U implies the Unit type but does not enforce
it. Nevertheless, foreach returns Unit after completion. The method does not run
concurrently: control is only returned to the caller when foreach has completed.
Since the CPS-transformation transforms the while loop into recursive function calls,
code using while(true) such as the above will eventually produce a StackOver-
flowError. This can be avoided by using channels with a restricted capacity to
increase the chance of consumers getting suspended. When resumed the consumers
are submitted as a new task with clean stack, minimizing the chance of producing a
StackOverflowError.
48
4.4. Channels
Based on tailrec a loop operator can be implemented that executes passed ex-
pressions indefinitely. The operator loop takes a by-name computation comp which
returns type U, indicating but not enforcing the Unit type. The computation rep-
resents the loop’s body. Since the computation does not return TailRec, the inner
function f wraps comp and returns a recursive Call thunk after the computation has
been executed (see listing 4.24).
Using for in conjunction with the yield keyword expands the loop to applications
of map and flatMap. The implementation of map and flatMap take a function f that
represents the yield expression. Their implementation is listed in 4.26 and 4.27,
respectively.
The map method, applied on channel Ca , creates a new channel Cb which is imme-
diately returned to the caller. Concurrently, map transforms elements from Ca by
49
4. Design and Implementation
applying the passed function f to each element. The transformed elements are succes-
sively put into the new channel Cb . map does not have a @suspendable annotation
on its return-type because it instantly returns the new channel without suspending.
For-comprehensions of type for (x <- xs; if x > y) are also expanded into ap-
plications of filter or withFilter. The constraint is passed as a predicate of
50
4.4. Channels
type A => Boolean @suspendable. While each application of filter returns a new
Channel containing only elements satisfying the predicate, withFilter is intended
to return a new object that accumulates the predicates and transparently applies
them on successive calls of foreach, map and flatMap. This avoids the need to
create a separate channel for each filter constraint.
The filter method is implemented in a similar manner to map (see listing 4.28).
When applied to channel Ca , it creates a new channel Cb which is immediately
returned. Concurrently, values taken from Ca and satisfying the predicate are put
into Cb .
51
4. Design and Implementation
if ( p ( v ) ) f ( v )
else cpsunit
} catch { case TerminatedChannel = > }
}
def map [ M ]( f : A = > M @ suspendable ) ( implicit scheduler :
Scheduler ) : Channel [ M ] = { ... }
def flatMap [ M ]( f : A = > Channel [ M ] @ suspendable ) : Channel [ M ]
= { ... }
def withFilter ( q : A = > Boolean ) : ChannelFilter =
new ChannelFilter ( x = > p ( x ) && q ( x ) )
} }
52
4.5. Pipeline
4.5. Pipeline
A sequence of concurrent computations, where each computation feeds its output into
the next computation is called a Pipeline. The individual computations represent
the pipeline’s Stages. This section demonstrates the construction of a pipeline by
concatenating channels using the higher-order function map. Finally a fluent interface
is defined, to simplify pipeline construction.
A stage is a concurrent computation that connects two channels. Using map, the
passed transformation function is scheduled for concurrent execution and a new
channel is immediately returned. Thus stages can be defined by applications of map
on the input channel. The returned channel is the stage’s output and the input to
the next stage. A two-staged pipeline can be created by two applications of map, as
demonstrated in listing 4.30.
To create pipelines with a fluent interface [FE], ScalaFlow provides the Pipeline
class, which represents the pipeline’s source. Its implementation is listed in 4.31.
This class contains an internal channel which is the pipeline’s source, with associated
methods to put values into the channel and to terminate it. Thus the internal
channel contains the input values for the first stage. A stage is created by applying
the method stage with a transformation function f: A => B @suspendable. It
returns a PipelineStage[A,B] that uses the Pipeline’s internal channel as its input
channel. The stage applies map on the input channel which creates the stage’s output
channel. The PipelineStage also implements the stage method which creates the
next stage. The next stage is in turn supplied by the previous stage’s output channel.
In contrast to the Pipeline class, PipelineStage does not provide methods to put
values or terminate. Putting values into the stage and propagating termination is
handled by map. The last call to stage creates a stage that represents the pipeline’s
sink. Thus every PipelineStage provides a foreach method to process its output
channel.
53
4. Design and Implementation
4.6. Conclusion
54
4.6. Conclusion
55
Networking Capabilities
5
This chapter extends ScalaFlow with networking capabilities to communicate with
remote nodes and to create distributed systems. As discussed earlier, a major design
goal of the framework is the avoidance of thread blocking operations. This rules out
use of the standard Input/Output (I/O) operations in the java.io package, which
are realized by thread-blocking Stream abstractions. Since Java 1.4.2, the java.nio
package supports a new non-blocking I/O approach (NIO) based on multiplexing
selectors, which is used in the following design for ScalaFlow’s I/O subsystem.
NIO features Channels that represent connections to entities and are capable of
performing I/O operations on files and sockets. Channels implementing the interface
SelectableChannel can be registered with a Selector that monitors the readiness
of any number of I/O operations, on every channel registered with the selector
(multiplexing). The selector is polled to retrieve a set of SelectionKeys, where each
key is bound to a specific channel and represents the readiness of its operations.
Each key also holds an arbitrary ’attachment object’ to allow for maintaining state
or providing context to handlers. Key processing follows the Reactor Pattern [Sch94,
Rot07] and imposes inversion of control on the code: a dispatcher polls the selector and
processes the selected keys. Each SelectionKey’s ready operations are dispatched
57
5. Networking Capabilities
Section 5.1 will describe a lightweight event dispatcher based on Java’s NIO library.
Based on this dispatcher, section 5.2 will describe ScalaFlow network sockets. Their
interface is based on DataFlow channels to read from and write to the socket.
Accepting connections from client is implemented by the acceptor in section 5.3.
Finally section 5.4 describes a HTTP server capable of processing HTTP requests
and generating HTTP responses, based on socket read and write channels.
Listing 5.1 sketches the basic dispatch-loop: the method polls the singleton Selector
instance for SelectionKeys using the blocking select method until the dispatcher
is shutdown, which is marked by the volatile _shutdown-flag. Each SelectionKey is
passed to dispatchKey which pre-processes the key’s ready operations and dispatches
to registered event handlers. To store distinct handlers for each operation, the handler
functions are aggregated in the NIO_Handler class (see listing 5.2) which is stored as
an attachment to every SelectionKey.
class Dispatcher {
val selector = Selector . open
@ volatile var _shutdown = false
def dispatch () : Unit = {
while (! _shutdown ) {
if ( selector . select > 0) {
val it = selector . selectedKeys . iterator
while ( it . hasNext ) {
val key = it . next
it . remove
dispatchKey ( key )
} } } }
def dispatchKey ( key : SelectionKey ) : Unit = {
val handler = key . attachment . asInstanceOf [ NIO_Handler ]
if ( key . isValid && key . isReadable )
...
} }
58
5.1. NIO Dispatcher
class NIO_Handler {
type OpHandler = SelectableChannel = > Unit
type FailureHandler = Throwable = > Unit
NIO provides four distinct operation flags to indicate readiness of a channel opera-
tion:
Each operation flag is wrapped in the sealed class NIO_OP to maintain type safety.
This class enriches the operation with methods to add interest and remove interest
from a specified bitmask (see listing 5.3), which is used to register multiple interests
at the channel’s SelectionKey.
A NIO channel must first be registered at the dispatcher using the register method,
which optionally takes functions to handle closed connections and failures (see listing
5.4). Upon initial registration, an instance of NIO_Handler is created and attached
59
5. Networking Capabilities
to the channel’s key. Each operation of interest has to be specifically registered at the
dispatcher using the registerOpInterest method, which takes a handler function
that will be invoked to handle the operation. The NIO_Handler, which is attached
to the key, is updated with the registered handler function and subsequently used by
dispatchKey during dispatching.
class Dispatcher {
def register ( ch : SelectableChannel ,
onClosed : Option [ NIO_Handler # OpHandler ] = None ,
onFailure : Option [ NIO_Handler # FailureHandler ] = None ) :
Unit = {
selectorGate . synchronized {
selector . wakeup ()
ch . register ( selector , 0 , handler )
}
}
selectorGate . synchronized {
selector . wakeup ()
This basic design directly dispatches readiness events to registered handlers and
imposes Inversion-of-Control on the program. However, it is possible to use con-
tinuations to avoid this. The method waitFor takes a channel and the operation
to wait for in curried syntax (see listing 5.5). The CPS primitive shift is used to
capture the current continuation as a function of type SelectableChannel => Unit,
60
5.1. NIO Dispatcher
which corresponds to the type definition OpHandler from NIO_Handler. This type
allows the continuation to be captured and directly registrated as a handler. When
the handler is invoked by the dispatcher, the continuation is resumed and waitFor
returns the channel for which an operation became ready (see 5.6).
class Dispatcher {
import scala . util . continuations . _
...
flow {
val ch : SelectableChannel
val d : Dispatcher
d . register ( ch , None , None )
// end of continuations
}
61
5. Networking Capabilities
To read from or write to a socket, client code must call the socket’s process method,
to initiate processing of its read and write DataFlow channels (readCh and writeCh,
see listing 5.8). This method returns a suspending future (SuspendingAwait) which
is invoked when both the read channel has been terminated due to an end-of-stream
on the network socket, and the write channel has been terminated by client code
(signalling the end of data to be sent).
62
5.2. Network Sockets
// public interface
def read : ChannelTake [ Array [ Byte ]] = readCh
def write : ChannelPut [ Array [ Byte ]] = writeCh
63
5. Networking Capabilities
64
5.2. Network Sockets
Reading from and writing to the socket is based on low-level byte arrays. To ease
the use of textual protocols such as HTTP, ScalaFlow provides traits that can be
mixed in with the Socket to enrich its operations. Trait SocketReadUtils provides,
among others, the method readStrings. This method concurrently maps all data
from the socket’s read channel into a new channel, which contains the read bytes
decoded into strings (see listing 5.11).
65
5. Networking Capabilities
dispatcher . register (
serverSocketCh ,
Some ( onClosed ) ,
Some ( onFailure )
)
val wait = dispatcher . waitFor ( serverSocketCh ) _
66
5.4. HTTP Processing
doneSignal
}
object HttpResponseStatus {
val OK = HttpResponseStatus (200 , " OK " )
...
67
5. Networking Capabilities
sealed trait H tt p R e s po n s e St a t u s Ty p e
object Informational extends H t t p Re s p o n se S t a tu s T y p e {
def unapply ( status : HttpResponseStatus ) : Boolean = status
. isInformational
}
object Success extends H tt p R e s po n s e S ta t u s T yp e {
def unapply ( status : HttpResponseStatus ) : Boolean = status
. isSuccess
}
object Redirection extends H t t p R es p o n s eS t a t us T y p e {
def unapply ( status : HttpResponseStatus ) : Boolean = status
. isRedirection
}
object ClientError extends H t t p R es p o n s eS t a t us T y p e {
def unapply ( status : HttpResponseStatus ) : Boolean = status
. isClientError
}
object ServerError extends H t t p R es p o n s eS t a t us T y p e {
def unapply ( status : HttpResponseStatus ) : Boolean = status
. isServerError
}
}
A Tokenizer splits request and response payloads into Tokens that contain instances
of the HTTP specification type classes. The implementation is based on a finite
state machine and type classes representing the different states. The tokenizer is
initialized with a clean state, applying its tokenize method with a raw byte array
(Array[Byte]), which tokenizes the bytes into a List[Token], which is returned.
Multiple invocations of tokenize continue processing based on the tokenizer’s internal
state, which can be reset. This design directly feeds the Socket’s read channel
bytes into the tokenizer method.
object Tokenizer {
sealed trait State
object State {
case object Init extends State
68
5.4. HTTP Processing
...
case object InitChunk extends State
case class Chunk ( size : Int ) extends State
case class Re moveChun kBounda ry ( nextState : State ) extends
State
...
case object Success extends State
case class Failure ( cause : String ) extends State
}
class Tokenizer {
var state : State = State . Init
def tokenize ( bs : Array [ Byte ]) : List [ Token ]
}
The processor uses a Pipeline design to concurrently transform the raw bytes from
the socket into requests and responses. The first pipeline stage consumes bytes
from the socket’s read channel, tokenizes them using the Tokenizer and puts the
Tokens into a Channel[Token]. The second stage consumes the Channel[Token]
and transforms the Tokens to HttpRequests or HttpResponses.
69
5. Networking Capabilities
There are two alternative second stages: one produces HttpRequests, the other
HttpResponses. Their implementations only differ in details which are omitted from
this section. To achieve concurrent execution, the second stage also uses a higher-
order function of the Channel class, which concurrently executes the passed function.
The function transforming the tokens has type Token => Option[HttpRequest].
It aggregates Tokens and yields Some[HttpRequest] if the collected tokens form a
valid HttpRequest, or None if further Tokens are required. Thus the transformation
function is passed to the higher-order function collect which accepts functions of
type A => Option[B].
The pipeline stages are created on demand using lazy vals to store the channels. The
first lazy val stores the Channel[Token], the others store a Channel[HttpRequest]
and a Channel[HttpResponse] respectively. The processor provides the meth-
ods requests and responses which return the Channel[HttpRequest] and Chan-
nel[HttpResponse] respectively. When accessing the lazy references to the channels,
the lazy-val mechanism creates their instances and thus starts the pipeline processing.
However client code must not invoke both the requests and responses methods, as
they compete for the single Channel[Token], thus clients can either use the processor
for processing requests or for processing responses.
class HttpProcessor ( val socket : Socket , val forceCharset :
Option [ Charset ] = None ) {
70
5.5. Conclusion
5.5. Conclusion
71
5. Networking Capabilities
protocol case classes. The processor uses the Tokenizer to preprocess the byte
sequences from the Socket’s read channel. The tokens are then aggregated and
enqueued into the requests or responses Channels.
72
5.5. Conclusion
73
ScalaFlow in Action
6
This chapter illustrates the intended use of the ScalaFlow framework. The first
section 6.1 demonstrates explicit usage of ScalaFlow in conjunction with a dataflow
variable, emphasizing on types and avoiding any shortcuts. The next section 6.2
contains examples using Variables with ScalaFlow’s terse DSL notation, followed by
section 6.3 which presents dataflow Channels. The networking components Socket
and Acceptor are demonstrated in section 6.4. Finally a basic web server is developed
and discussed.
Listing 6.1 demonstrates explicit usage of the ScalaFlow framework. This listing avoids
type-inference and the terse syntax through implicit variables and operator-aliases.
DSL operations such as flow are not used in favour of explicit Flow instantiation.
Important behavior is denoted by comments throughout the listing.
object Explicit {
val scheduler : Scheduler = new Thread PoolSche duler
75
6. ScalaFlow in Action
// computations
val c1 : (() = > Int @ dataflow ) = () = > v . get
val c2 : (() = > Unit @ dataflow ) = () = > v . set (42)
// flows
val f1 : FlowResult [ Int ] = new Flow [ Int ]( c1 ) ( scheduler ) .
execute . result
val f2 : FlowResult [ Unit ] = new Flow [ Unit ]( c2 ) ( scheduler ) .
execute . result
// results
val r1 : Int = f1 . get // blocks
val r2 : Unit = f2 . get // blocks
println ( " r1 = " + r1 + " r2 = " + r2 ) // r1 =42 r2 =()
Listing 6.2 demonstrates how to use Variable with ScalaFlow’s terse DSL nota-
tion. In contrast to listing 6.1, the scheduler is created as an implicit reference
and hence is implicitly passed to the Variable and Flow. Instead of creating a
ThreadPoolScheduler, this example uses a DaemonThreadPoolScheduler which is
automatically disposed when the program terminates, avoiding the need to explicitly
shut it down. Furthermore the Variable’s DSL aliases are used: get is replaced by
an application and set is replaced by the := alias.
object Var1 {
implicit val scheduler = new D a e m o n T h r e a d P o o l S c h e d u l e r
def main ( args : Array [ String ]) : Unit = {
val v = new Variable [ Int ]
val f1 = flow { v () }
val f2 = flow { v := 42 }
println ( " r1 = " + f1 . get + " r2 = " + f2 . get )
} }
Listing 6.2: Using a Variable in terse DSL notation
76
6.3. Using Channels
Similiar to the previous section, listing 6.3 uses ScalaFlow’s terse DSL notation to
present the intended usage of channels. Listing 6.3 appends two values to a channel.
The channel is created by ScalaFlow’s default factory which creates bounded channels
with a default capacity of 1000.
object Channel1 {
implicit val scheduler = new D a e m o n T h r e a d P o o l S c h e d u l e r
def main ( args : Array [ String ]) : Unit ={
val ch = Channel . create [ Int ]
// consumer - flow
val f = flow {
val v1 = ch ()
val v2 = ch ()
v1 + v2
}
// producer - flow
flow {
ch << 11
ch << 31
}
println ( " result : " + f . get )
} }
Listing 6.4 introduces an additional consumer that competes with the first one. They
are scheduled nondeterministically, so the ordering of their take operations and hence
the program’s outcome will vary between runs. The channel is again bounded by a
default capacity and is not terminated.
object Channel2 {
implicit val scheduler = new D a e m o n T h r e a d P o o l S c h e d u l e r
// consumer - flow # 1
val f1 = flow { ch () + ch () }
// consumer - flow # 2
val f2 = flow { ch () }
// producer - flow
77
6. ScalaFlow in Action
flow {
ch << 1
ch << 2
ch << 3
}
val r1 = f1 . get
val r2 = f2 . get
println ( r1 + " - " + r2 + " = " + ( r1 - r2 ) )
}
}
object Channel3 {
implicit val scheduler = new D a e m o n T h r e a d P o o l S c h e d u l e r
// consumer - flow
val f = flow {
val v1 = safeTake
val v2 = safeTake // raises TerminatedChannel
v1 + v2
}
// producer - flow
flow {
ch << 1
ch <<# // terminate
}
78
6.3. Using Channels
object Channel4 {
implicit val scheduler = new D a e m o n T h r e a d P o o l S c h e d u l e r
// consumer - flow
val f = flow {
for ( x <- ch )
println ( x )
}
// producer - flow
flow {
ch << 1
ch << 2
ch << 3
ch <<# // terminate
}
f . await
}
}
Listening 6.7 uses the for-comprehension expanded into applications of map and
filter. The second flow concurrently maps values from xs into the new channel ys,
thus returning a FlowResult[Channel[Int]]. The third flow uses sget (suspending
get) on the FlowResult to get the ys (applying get is a thread-blocking operation
which is to be avoided in flow environments). That said it is more idiomatic to use
a Variable[Channel[Int]] to communicate the ys between the flows.
object Channel5 {
implicit val scheduler = new D a e m o n T h r e a d P o o l S c h e d u l e r
// producing xs
79
6. ScalaFlow in Action
flow {
xs << 1
xs << 2
xs << 3
xs <<# // terminate
}
// consuming xs , producing ys
val f1 = flow {
val ys = for ( x <- xs ; x > 1; y = x * x )
yield y
}
// consuming ys
val f2 = flow {
val ys = f1 . sget // suspending get
for ( y <- ys )
println ( y )
}
f2 . await
}
}
Listing 6.7: Concurrent mapping and waiting for another flow’s result
The Channel class provides multiple factory methods, to produce channels with
different types of flow-control behavior (see listing 6.8). The listing also demonstrates
the implicit conversion of Scala’s Iterable to ScalaFlow’s DataFlowIterable, which
is CPS-aware. The conversion is forced by using the identity operation dataflow on
the Iterable.
object Channel6 {
import DataFlowIterable . convert
80
6.3. Using Channels
}}
flow {
// not consuming eager - channel at all
// chEager ()
chLazy ()
chBounded ()
}
The final listing 6.9 demonstrates usage of multiple higher-order functions on channels.
However the selection of demonstrated functions is not exhaustive. The flows
communicate via dataflow variables and all higher-order functions work concurrently.
The termination of the first channel is automatically propagated to all following
channels that are created by higher-order functions, thus terminating the enclosing
program.
object Channel7 {
import DataFlowIterable . convert
81
6. ScalaFlow in Action
82
6.4. Networking
val f = flow {
println ( " square - sum : " + squareSum () )
println ( " sum : " + sum () )
}
f . await
}
}
6.4. Networking
This section demonstrates the intended use of the Dispatcher, Socket, Acceptor
and the HTTP processing components. The first example shows how to communicate
with a host using the Dispatcher and Socket. The HTTP protocol is used to
demonstrate connecting, writing requests as raw bytes and processing response bytes.
The second example creates a simple Echo server [Pos83] that accepts connections
from clients and echos back all incoming data. Finally the third example describes a
simple web server demonstrating ScalaFlow’s HTTP components, which are built
using the Dispatcher, Socket and Acceptor classes.
83
6. ScalaFlow in Action
The second flow simply sawaits the socket’s connected signal, to show the intended
use of the socket’s signals. The third flow writes a raw byte request to the socket’s
write channel. Since the socket’s internal processing of its read and write channels
is synchronized on the connected signal, reading from and writing to the socket
channels is allowed at any time.
The fourth flow concurrently decodes the bytes from the socket’s read channel into
a Channel[Array[Char]]. The flow passes this new channel to the fifth flow via the
chars variable. The fifth and final flow uses the for-comprehension to iterate over
the Channel[Array[Char]], which is bound to variable chars by the fourth flow.
The program awaits the FlowResults of flows one and five, since they are the last
flows to return.
object IO1 {
val f1 = flow {
sock . connect ( heise )
println ( " connected to heise " )
sock . process . sawait
println ( " socket processed , no more data to write & server
disconnected " )
}
val f2 = flow {
sock . connected . sawait
println ( " signal : connected to heise " )
}
val f3 = flow {
sock . write << " GET http :// www . heise . de / HTTP /1.1\ r \ nHost :
www . heise . de \ r \ n \ r \ n " . getBytes ( " ASCII " )
sock . write <<# ;
}
val f4 = flow {
84
6.4. Networking
val f5 = flow {
for ( cs <- chars . get ) {
println ( cs . length + " bytes " )
// print ( cs . mkString ("") )
}
}
The second example in listing 6.11 accepts client connections on a specified IP address
by invoking the Acceptor’s accept method with a local java.net.InetSocketAddress
in the first flow. The method returns a SuspendingAwait and concurrently accepts
connections. The returned SuspendingAwait is not awaited in this example.
The second flow uses the for-comprehension on the acceptor’s connections channel.
The channel contains client sockets that encapsulate the accepted connection requests.
The sockets are already in processing mode since the Acceptor internally invoked
the socket’s process method. For each client socket from the connections channel,
a new embedded flow is created to process the request concurrently. The embedded
flows are executed independently and use the for-comprehension to process their
socket’s read channel. Each Array[Byte] from the read channel is written back to
the socket’s write channel, thus the example is echoing back each client’s input.
object IO2 {
val f1 = flow {
ator accept addr
}
85
6. ScalaFlow in Action
val f2 = flow {
for ( sock <- ator . connections ) { flow {
for ( bytes <- sock . read ) {
sock . write << bytes
}
}}
}
f2 . await
}
}
The final example uses ScalaFlow’s HTTP processing components to create a simple
web server, which accepts HTTP connections and responds with a HTML page
that summarizes protocol information (see listing 6.12). Similiar to the previous
example, the acceptor is bound to a local InetSocketAddress, with the first flow
accepting client connections by invoking its accept method. As before the second
flow creates a new embedded flow for each client connection. Each client flow creates
a HttpProcessor that processes the client socket. A for-comprehension is used to
iterate over the requests, which are concurrently produced by the HTTP processor
based on the socket’s raw input bytes. Each request is an instance of HttpRequest
which is ScalaFlow’s type-safe HTTP representation. createResponse creates a
HttpResponse using Scala’s XML literals. The response is written to the client
socket’s write channel and displayed in the client’s browser.
object IO3 {
implicit val scheduler = new D a e m o n T h r e a d P o o l S c h e d u l e r
val f1 = flow {
ator accept addr
}
val f2 = flow {
for ( socket <- ator . connections ) { flow {
val http = new HttpProcessor ( socket )
for ( request <- http . requests ) {
val resp = createResponse ( request )
86
6.4. Networking
f2 . await
}
87
6. ScalaFlow in Action
response . addBody ( """ <! DOCTYPE html PUBLIC " -// W3C // DTD
XHTML 1.0 Strict // EN " " http :// www . w3 . org / TR / xhtml1 / DTD /
xhtml1 - strict . dtd " > """ )
response . addBody ( res . toString )
response
}
}
88
7
Conclusion
To conclude this thesis, this chapter begins with a summary of the achieved results.
Subsequently the framework is critically assessed and recommendations for future
work are made.
7.1. Summary
This thesis has described the design and implemention of a framework for creating
concurrent programs with DataFlow execution. The framework, named ScalaFlow,
allows creation of embedded DataFlow environments within Scala programs. The
DataFlow Model is based on the Data Driven Concurrency Model, extended with
nondeterminism. ScalaFlow features dataflow Variables, Signals, communication
Channels, Pipelines and networking components.
89
7. Conclusion
Dataflow Variables are single-assignment and suspend their readers until the variable
is bound. Signal s suspend their listeners until the signal is invoked. In contrast to
variables, signals do not communicate any value. Channel s are streams of dataflow
variables used to communicate between concurrent dataflow computations. Con-
sumers of empty channels are suspended until the channel is filled. Additionally,
channels suspend and resume producers based on the channel’s capacity; an eager
channel does not suspend producers. A lazy channel suspends producers by default
and resumes them based on consumer demand. Any capacity in between constitutes
a bounded buffer that suspends producers when the channel is filled up to its capac-
ity. A bounded channel resumes producers in first-in, first-out (FIFO) order when
consumers drain the channel. The Pipeline is a lightweight interface that makes use
of the channel’s concurrent higher-order function map to connect channels to form a
pipeline, where each mapping function represents a concurrent pipeline stage.
90
7.2. Assessment
7.2. Assessment
The foundation of the dataflow model is the dataflow Variable, which uses a lock-free
implemented based on atomic references, allowing it to perform well under consumer
contention. By contrast Channel s uses heavy locking to manage their internal
streams and to provide flow-control to both consumers and producers. Future work
should attempt to design a more efficient locking scheme or even a lock-free Channel
implementation. Although performance profiling and evaluation of the framework
was not undertaken in this thesis, tests and microbenchmarks indicate reasonable
performance. However further work should profile the channel’s behavior under heavy
load with fixed capacity and multiple consumers and producers.
91
7. Conclusion
ScalaFlow and the projects listed in the appendix are public, BSD licensed projects
on github. It is expected that some of the stated problems will be improved or solved
in future development cycles. Support from the Scala community is appreciated.
92
Appendix Source Code and Projects
A
The full source code of ScalaFlow can be found at http://github.com/hotzen/-
ScalaFlow and is available under the BSD license. ScalaFlow includes the pack-
ages dataflow, dataflow.io, dataflow.io.http and dataflow.util. Additionally,
package dataflow.tests contains many more examples than presented in the exam-
ples chapter.
Building on ScalaFlow, there are two projects hosted at github. The first, called
MetaFlow is available at http://github.com/hotzen/MetaFlow and combines heuris-
tics with ScalaFlow’s Sockets and HTTP processing to create a web crawler that
extracts informations about movies. It is still work in progress and far from completion
but can be studied for a real-word example on using ScalaFlow.
93
Installation Guide
B
A CD is attached to this thesis, containing a PDF version of the thesis as well as the
source code of the developed ScalaFlow framework. Additionally, the framework is
provided as a self-contained JAR-archive. ScalaFlow is also publicly available under
www.github.com/hotzen/ScalaFlow.
The source code contains a project definition for the simple-build-tool (sbt) [H+ a]
and can be built by invoking sbt compile inside the ScalaFlow directory. The
directory-structure follows the recommendations of the SBT-project [H+ b]. A project
definition for the Eclipse IDE can be created by invoking sbt eclipse.
95
Bibliography
97
Bibliography
[GBB+ 06] Goetz, Brian ; Bloch, Joshua ; Bowbeer, Joseph ; Lea, Doug
; Holmes, David ; Peierls, Tim: Java Concurrency in Practice.
Addison-Wesley, 2006
[H+ a] Harrah, Mark et al.: simple-build-tool - A build tool for Scala. http:
//code.google.com/p/simple-build-tool/, last checked: 2010-09-01
[HO07] Haller, Philipp ; Odersky, Martin: Actors that Unify Threads and
Events / Programming Methods Laboratory, Ecole Polytechnique Fed-
erale de Lausanne. Version: 2007. http://citeseerx.ist.psu.edu/
viewdoc/download?doi=10.1.1.121.5844&rep=rep1&type=pdf, last
checked: 2010-08-04. 2007. – Forschungsbericht
[Lea00] Lea, Doug: A Java fork/join framework. In: JAVA ’00: Proceedings of
the ACM 2000 conference on Java Grande, Department of Computer
Science, University of Rochester, 2000 http://gee.cs.oswego.edu/
dl/papers/fj.pdf
98
Bibliography
[Moo65] Moore, Gordon E.: Cramming more components onto integrated cir-
cuits. ftp://download.intel.com/research/silicon/moorespaper.
pdf. Version: 1965, last checked: 2010-09-01
[MS96] Michael, Maged M. ; Scott, Michael L.: Simple, Fast, and Practical
Non-Blocking and Blocking Concurrent Queue Algorithms. In: Annual
ACM Symposium on Principles of Distributed Computing, Department
of Computer Science, University of Rochester, 1996 http://www.cs.
rochester.edu/u/scott/papers/1996_PODC_queues.pdf
[Ode08] Odersky, Martin: Tail calls via trampolining and an explicit instruction.
http://scala-programming-language.1934581.n4.nabble.com/
Tail-calls-via-trampolining-and-an-explicit-instruction-td2007474.
html#a2007485. Version: 2008, last checked: 2010-09-01
99
Bibliography
[RH04] Roy, Peter van ; Haridi, Seif: Concepts, Techniques, and Models of
Computer Programming. MIT Press, 2004
[Sch94] Schmidt, Douglas C.: Reactor - An Object Behavioral Pattern for De-
multiplexing and Dispatching Handles for Synchronous Events. http://
www.cs.wustl.edu/~schmidt/PDF/PLoP-94.pdf. Version: 1994, last
checked: 2010-09-01
100
Affidavit
I hereby declare that this thesis has been written independently by me, solely based
on the specified literature and resources. All ideas that have been adopted directly
or indirectly from other works are denoted appropriately. This thesis has not been
submitted to any other board of examiners in its present or a similar form and was
not yet published in any other way.
Kai H. Meder
101