0% found this document useful (0 votes)

84 views16 pages

A Fortran-Keras Deep Learning Bridge For Scientific Computing

The document describes a software library called the Fortran-Keras Bridge (FKB) that connects deep learning models built in Keras with programs written in Fortran. FKB allows deep learning methods developed in Python frameworks to be integrated into large-scale scientific computing projects written in Fortran. As an example, FKB is used to search over 100 neural network climate models initially built in Keras and transfer them for use in a Fortran-based global climate simulation code. This reveals relationships between how models perform during offline validation and online simulation when coupled to fluid dynamics that leads to improved climate model stability.

Uploaded by

Logador

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

84 views16 pages

A Fortran-Keras Deep Learning Bridge For Scientific Computing

Uploaded by

Logador

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

A Fortran-Keras Deep Learning Bridge for Scientific Computing

Jordan Ott1 , Mike Pritchard2 , Natalie Best3 , Erik Linstead3 , Milan Curcic4 , and Pierre
Baldi1
1
Department of Computer Science
University of California, Irvine
2
Department of Earth System Science
University of California, Irvine
3
Fowler School of Engineering
Chapman University
4
Department of Ocean Sciences
University of Miami
arXiv:2004.10652v2 [cs.LG] 4 Aug 2020

Abstract
Implementing artificial neural networks is commonly achieved via high-level programming lan-
guages like Python and easy-to-use deep learning libraries like Keras. These software libraries come
pre-loaded with a variety of network architectures, provide autodifferentiation, and support GPUs
for fast and efficient computation. As a result, a deep learning practitioner will favor training a
neural network model in Python, where these tools are readily available. However, many large-scale
scientific computation projects are written in Fortran, making it difficult to integrate with modern
deep learning methods. To alleviate this problem, we introduce a software library, the Fortran-
Keras Bridge (FKB). This two-way bridge connects environments where deep learning resources are
plentiful, with those where they are scarce. The paper describes several unique features offered by
FKB, such as customizable layers, loss functions, and network ensembles.
The paper concludes with a case study that applies FKB to address open questions about the
robustness of an experimental approach to global climate simulation, in which subgrid physics are
outsourced to deep neural network emulators. In this context, FKB enables a hyperparameter search
of one hundred plus candidate models of subgrid cloud and radiation physics, initially implemented
in Keras, to be transferred and used in Fortran. Such a process allows the model’s emergent behavior
to be assessed, i.e. when fit imperfections are coupled to explicit planetary-scale fluid dynamics.
The results reveal a previously unrecognized strong relationship between offline validation error and
online performance, in which the choice of optimizer proves unexpectedly critical. This in turn
reveals many new neural network architectures that produce considerable improvements in climate
model stability including some with reduced error, for an especially challenging training dataset.

1 Introduction
The Fortran programming language was originally developed in the 1950s and published in 1957. It was
created to help programmers implement solutions for scientific and engineering problems on the IBM
704 computer, which at the time needed to be written in machine or assembly language. Fortran has
been regarded as revolutionary and possibly one of the most influential software products in history [20].
Having evolved many times since its creation, with the most recent release in 2018, each version adds
new features and capabilities. Fortran initially gained popularity and remains a widely used language
due to its fast and efficient computational ability. Additionally, Fortran’s strength is its backward
compatibility, which allows modern compilers to build code written in the 60s and 70s.
Though not as popular as it once was, Fortran is still used in specialized fields, including oceanogra-
phy, solid mechanics, computational physics, earthquake simulation, climate modeling, and aerospace.
Because of Fortran’s continued use, a great deal of legacy code and new code exists. Unfortunately, it
is difficult to rewrite all existing code bases in more mainstream languages, due to their size and com-
plexity. Therefore, when algorithms and extensive libraries are created in modern languages, backwards
compatible methods must be developed to make them available in older legacy code, like Fortran.
In recent years, the rise of machine learning and deep learning has led to successful applications
in various domains. Substantial improvements in the size of the training sets and available computing

1
power have led to a new wave of implementations [37, 55]. In turn, this success has increased the
usage and dissemination of deep learning. These methods have been applied to a variety of domains,
e.g., ranging from remote sensing [67, 38] to computer vision [45, 61, 46, 44, 63], and to games [2, 56].
Specifically, within scientific computing, many advancements have been achieved through the application
of neural networks. Neural networks have been augmented with physically informed capabilities [51,
9], better suiting them for conservation restrictions. Learning partial differential equations [5, 54] has
proved valuable in multiple scientific domains.
The success and popularity of deep learning have inspired the creation of powerful software libraries
written in several modern programming languages. However, Fortran is not among the modern languages
that benefit from these deep learning libraries. This absence leaves Fortran programmers with few
options to implement deep neural networks.
The implementation of deep neural networks, in Fortran, may be achieved via two primary pathways.
One solution is to rewrite all existing deep learning libraries in Fortran. The second solution is to leverage
existing frameworks and bridge available functionalities to Fortran. The former is extremely arduous
and time consuming, considering the size and scope of existing deep learning packages and the dizzying
pace of their evolution [16, 1, 47]. The latter approach, which this paper describes, is to allow users
to leverage the power of existing frameworks while providing a bridge between paradigms where deep
learning resources are plentiful and those where they are scarce. In this way, we can leverage aspects
of currently available deep learning software libraries, like Keras [16], and bring them to large-scale
scientific computing packages written in Fortran. To this end, we propose the Fortran-Keras Bridge
(FKB) – A two-way bridge connecting models in Keras with ones available in Fortran. The source code
is publicly available and can be found here: https://github.com/scientific-computing/FKB. We
begin by reviewing existing Fortran projects that would benefit from the integration of FKB.

2 Fortran Projects
FKB can be integrated with many existing large-scale and computationally intensive projects written
in Fortran. These projects will benefit from the easy integration of neural network models, which FKB
makes possible.
For example, Fortran is used to do a great deal of work in climate and ocean modeling. For instance,
the US-produced Community Earth System Model [29] is written in object-oriented Fortran-90; this is
the most widely used climate model in the world. So are the other climate simulation codes used by
the US Department of Energy [25] and the National Oceanographic and Atmospheric Administration’s
Geophysical Fluid Dynamics Laboratory [27]. Meanwhile, the Nucleus for European Modelling of the
Ocean (NEMO) engine is used for studying ocean circulation problems on regional and global scales
[59] and making future predictions, is also written in Fortran. The Hybrid Coordinate Ocean Model
(HYCOM) [66], also used for ocean modeling, extends traditional ocean models to allow for a smooth
transition from the deep ocean to coastal regimes. Researchers have also developed models for the mod-
eling of waves and wind stress [18]. The Weather Research and Forecasting Model (WRF), is arguably
the most widely used numerical weather prediction models for regional decision support [49]. Since
its release in 2000, the number of WRF registrations has grown to over 36,000. WRF produces atmo-
spheric simulations with support for special applications, including air chemistry, hydrology, wildland
fires, hurricanes, and regional climate, and is again a Fortran-based model.
Fortran has found continued use in solid mechanics packages for implementing finite element methods.
Popular packages such as ANSYS [41], ABAQUS [10], and LS-DYNA [42] are written in Fortran or
accept Fortran subroutines. Similarly, in earthquake modeling, the SPECFEM3D [36] package leverages
Fortran for simulations.
The list goes on. Code Saturne [3], developed by Électricité de France, and NEK5000 [48], are For-
tran open-source computational fluid dynamics packages. Code Saturne allows for user customization
via Fortran subroutines, which is just one application domain for FKB. NEK5000 is actively used in
the Center for Exascale Simulation of Advanced Reactors (CESAR) projects. Fortran has also been
continually used for molecular modeling within chemistry and physics. The Chemistry at Harvard
Macromolecular Mechanics (CHARMM) Development Project has produced a powerful molecular sim-
ulation program in Fortran [14]. This simulation program primarily targets biological systems but can
also be used for inorganic materials. A similar tool, NWChem, has been developed by the Molecular Sci-
ences Software Group at the Pacific Northwest National Laboratory [64]. NWChem is a computational
chemistry software that includes quantum chemical and molecular dynamics functionalities. Within the

2
(a) (b)

Figure 1: (a) Usage of programming languages for machine learning and data science. Statistics are
from the 2018 Kaggle ML & DS Survey [32]. (b) Usage metrics of deep learning frameworks. Statistics
are from the 2019 Kaggle State of Data Science and Machine Learning report [33].

molecular physics domain, Fluktuierende Kaskade (FLUKA) is a proprietary tool for calculations of
particle transport and interactions with matter [19].
The models mentioned above and projects can leverage the FKB library to leverage neural networks
within their codebases. For example, neural networks have proven useful in modeling sea surface tem-
perature cooling for typhoon forecasting [31]. Therefore the integration of FKB with tools like NEMO,
HYCOM, or WRF models is a possibility. In a recent study of computational fluid dynamics, Ling
et al. solve the Reynolds-averaged Navier-Stokes equations, similar to Code Saturne and NEK5000.
By implementing deep neural networks, the authors report that the architecture improved prediction
accuracy [40]. Finally, the Fluka tool contains a wide range of molecular physics applications, including
dosimetry calculations. Vega-Carrillo et al. have shown neural networks aided in the calculation of
neutron doses [65]. For global climate simulation, there is proof that deep neural networks can offer
skillful alternatives to assumption-prone approximations of sub-grid cloud and turbulence physics in the
atmosphere [53, 11]. We hope that the FKB library enables Fortran users to expand their research and
projects to include neural networks.
Having reviewed several Fortran based projects that can leverage FKB, we now introduce the two
sides of this bridge. The following sections will develop the foundations on which to anchor each side of
this two-way bridge. We start by introducing the deep learning anchor.

3 The Python Anchor (Deep Learning)

Many programming languages offer tools and libraries for implementing artificial neural networks. How-
ever, in recent years, Python has emerged as the clear favorite within this domain. Metrics in Figure 1a
display Python’s dominance. Python is used nearly 50% more than the second most popular language,
R. Python’s ubiquitous presence in machine learning makes it the obvious choice to leverage existing
libraries for Fortran. The question then becomes, which available software library within Python, is
best suited to bridge to Fortran?
Of the available deep learning libraries, Keras [16] is the most popular among practitioners (Figure
1b). Keras is an Application Programming Interface (API) built on top of Tensorflow [1], that provides
users the ability to implement quickly, train, and test networks. This convenience encapsulates much
of the low-level complexity one must manage when implementing deep networks from scratch. Keras
abstracts many of the complicated aspects of Tensorflow while still providing customizability and ease of
use. This combination makes Keras the first choice of many for deep learning applications. As a result
of its popularity and ease of use, Keras is the clear choice on which to build one end of the two-way
bridge.
Figure 2, depicts the positioning of the Python anchor, FKB/P, within the deep learning ecosystem.

3
Figure 2: Positioning of FKB within Fortran and Python ecosystems.

The Keras API leverages Python to build deep neural networks. FKB/P resides on top of Keras to
access models produced from Keras and transmit them to the Fortran anchor, FKB/F. This structure
allows for integration with Fortran applications that wish to leverage deep neural network architectures.
Having described the deep learning anchor within Python, the next section develops the foundation for
anchoring the bridge with Fortran.

4 The Fortran Anchor (Scientific Computing)

Several attempts have been made to implement neural networks in Fortran, with some success [17, 7,
8, 13, 43]. However, many implementations resort to hacking a single-use neural network by hand,
or binding code from other languages [43]. Along these lines, one may consider accessing Python
functionality directly from Fortran, by running a Python instance within Fortran. While providing
flexibility and ease of use, this is vulnerable to extreme deficiencies in speed and computational resources.
As a result, this solution becomes untenable for large-scale computation projects like the ones described
in Section 2.
There are a small number of existing neural network libraries in Fortran [43, 39, 17]. The most
recent and well developed library is Neural Fortran [17], a lightweight neural network library, written
natively in Fortran. The Neural Fortran library provides the ability to implement artificial neural
networks of arbitrary size with data-based parallelism. Additionally, in benchmark studies, Neural
Fortran was shown to have comparable compute performance with Keras while maintaining a lower
memory footprint. This library offers a foundation to anchor the Fortran side of the two-way bridge,
FKB/F. By extending - and building on top of - Neural Fortran, we can convert Keras models to ones
readily available in Fortran and implement them in existing Fortran projects.
The positioning of FKB within the scientific computing ecosystem is shown in Figure 2. The For-
tran anchor, FKB/F, can use models originally constructed and trained in Keras, which can then be
transferred to Fortran via FKB/P. To use these models, the Fortran side of FKB implements a neural
network library. This portion of FKB can be used within large-scale scientific computation software,
like the projects identified in Section 2.
By leveraging FKB, it becomes seamless to train networks in Python and transfer them to Fortran,
to run inside large scale simulations. Similarly, neural network models constructed in Fortran can be
transferred to Python for additional analysis, expansion, and optimization - including hyperparameter
searches using available tools in Python [28, 57, 6]. As both sides of the bridge have been properly
introduced, the following section will describe the specific features and functionalities of FKB.

4
5 Features of FKB
Once a neural network is trained in high-level APIs like Keras, the practitioner has few practical avenues
for using this model in Fortran-based projects. One approach may be to hard code network operations
inside Fortran while manually moving parameters from the Keras model. Several examples of this can
been seen in climate modeling [53, 11, 23, 22].
To provide one specific example, in [53], the authors trained a deep neural network (DNN) to repre-
sent sub-grid cloud and convective energy transport processes, in Keras. To assess its credibility, they
needed to test the DNN’s two-way interactions when thousands of replicates of it were embedded within
a coarse-resolution global atmospheric model, written in Fortran – neural network emulated clouds in-
teracting with determinstic physical calculations of planetary geophysical fluid dynamics. As the global
atmospheric simulator does not offer native neural network support, the authors hardcoded their DNN
model into the global simulation software framework. This approach has obvious disadvantages. Every
minor change made to the model in Keras requires rewriting the Fortran code. If one wishes to test a
suite of models in Fortran, this approach becomes untenable. As each network may require different
hyperparameters and, as a result, necessitates rewriting and compiling the Fortran code for every new
model. This process drastically limits the breadth of available models to be tested within the simula-
tor. This bottleneck is currently a significant roadblock to ongoing debates in the climate simulation
community, more broadly, about whether or not to use DNN representations of subgrid physics in next-
generation climate modeling. Insufficient testing of diverse candidate neural networks (NN) means that
little is known about how minor imperfections in the fit of one NN can amplify when the NN is coupled
to fluid dynamics, which is just beginning to be explored [12].
These issues demand a solution, in the form of a bridge between Keras and Fortran. The FKB
software solves these issues via two key elements. First, it provides a neural network library imple-
mented in Fortran (FKB/F). Second, it offers the ability to parse existing Keras models into formats
consistent with the Fortran neural network library (FKB/P). As a result, users can switch, seamlessly,
back and forth between Python and Fortran. This context provides a way for iterative neural network
tuning (Python) and testing (Fortran), with a simple way to translate between the two software envi-
ronments. Additionally, FKB offers currently unavailable Fortran specific features for neural networks.
It will be useful to highlight those new features while documenting the format to which FKB adheres.
The following subsections describe the Python and Fortran anchors’ features, FKB/P and FKB/F,
respectively.

5.1 FKB/P
Keras models - once built, trained, and saved - are stored in Hierarchical Data Format 5 (HDF5) files.
These files contain the network architecture, weights, biases, and additional information - optimizers,
learning rates, gradients, etc. From the HDF5 file, FKB/P parses the network architecture, extracting
the number of layers, activation functions, nodes per layer, and all weights and biases. This information
is converted to match the Fortran neural network configuration in FKB/F. This allows users to build
an equivalent network in Fortran, which can easily be loaded and used within a Fortran environment. If
any modifications to the model are made inside Fortran, FKB/P will parse this back into the equivalent
HDF5 file to be used in Keras once again.
On the other hand, networks may be initially constructed in Fortran. After initial training and
testing, a user can switch to Keras for further evaluation. From Keras, users can conduct additional
testing or hyperparameter tuning where these tools are readily available [28].
The ability to seamlessly pass neural network architectures between Python and Fortran is essential
for any practitioner working in this space. This bridge allows users to take advantage of the high-level
Keras API - training on computationally efficient GPUs - then to insert their trained model into a
Fortran codebase. The functionality provided bridges the chasm between Keras and Fortran.

5.2 FKB/F
The Fortran anchor of FKB leverages and extends the original Neural Fortran library. Below we intro-
duce newly implemented features to make Neural Fortran more flexible and able to communicate on the
two-way bridge.

5
5.2.1 Custom Layers
To implement neural networks in Fortran, FKB leverages and extends the Neural Fortran library [17].
The prototype Neural Fortran library format that we build on was only capable of implementing a
fully connected layer. Forward and backward operations occurred outside this layer - in the network
module. An example of this is shown in Listing 1. From the listing, one can observe hard-coded
matrix multiplication of layer weights, the addition of biases, and the activation functions inside the
network module. This network-level subroutine accesses and modifies individual layer attributes. This
rigid format is inconsistent with modern neural network implementation paradigms [16, 1, 47], but it
makes it impossible to implement other layers or custom operations. To increase the library’s flexibility,
operations must be encapsulated inside the layer, consistent with current practice.

pure subroutine fwdprop(self, x)

! Performs the forward propagation and stores arguments to activation
! functions and activations themselves for use in backprop.
class(network_type), intent(in out) :: self
real(rk), intent(in) :: x(:)
integer(ik) :: n
associate(layers => self % layers)
layers(1) % a = x
do n = 2, size(layers)
layers(n) % z = matmul(transpose(layers(n-1) % w), layers(n-1) % a) + layers(n) % b
layers(n) % a = self % layers(n) % activation(layers(n) % z)
end do
end associate
end subroutine fwdprop

Listing 1: Original code from [17]. Layer operations occur inside the network module, limiting flexibility.

In FKB we introduce an extendable layer type module (Listing 2). To implement a layer, one simply
extends the layer type and specifies the construction of the forward and backward functions. Adhering
to this format offers several advantages. By restructuring the format of the library, we offer the ability
to implement arbitrary layers. Additionally, in the network module, all layers are stored in an array
of pointers. This leads to the encapsulated version shown in Listing 2 wherein a forward pass, in the
network module, calls the layer-specific forward function. In this way, all operations are confined to the
layer module, and the output from one layer is passed as input to the next.

function output(self, input) result(last_layer_output)

...
! iterate through layers passing activation forward
do n = 1, size(layers)
call layers(n) % p % forward(layers(n-1) % p % o)
end do
! get output from last layer
last_layer_output = layers(size(layers)) % p % o
end function output

Listing 2: Forward pass in the FKB network module. Each layer simply calls its own forward function.
The technical operations occur within each layer.

FKB supports fully connected or dense layers, dropout [58, 4], and batch normalization [30]. Shown
in Listing 3 is an example of extending the layer type to implement a Batch Normalization layer.
This format translates to increased functionality and customizability to the user. As a result, more
standard layers from Keras are available, while giving users the flexibility to implement their own
custom operations.

! BatchNorm layer - extends from base layer_type

! Implements batch normalization
type, extends(layer_type) :: BatchNorm
! epsilon parameter
real(rk) :: epsilon
contains
procedure, public, pass(self) :: forward => batchnorm_forward

6
procedure, public, pass(self) :: backward => batchnorm_backward
end type BatchNorm

Listing 3: Example of extending the layer type to implement Batch Normalization

5.2.2 Training in Fortran

It is necessary to distinguish between the terms offline versus online for the following section. These
terms serve to distinguish two different settings in which a neural network can be used in a Fortran
computing package. Both settings can make use of historical or simulated data to train an artificial
network. The distinguishing feature is how the predictions of a model are used. In an online setting,
predictions from the model are used to evolve a physical process. The predictions at one time step effect
how the system acts at the following time step. As a result, inputs to the model will change based on
how the model acted in the past. In offline settings, this is not the case. Predictions made in the past
do not affect the input to the model in the future.
In many cases, offline training may be sufficient to learn a model, if enough prior data is available.
However, in some cases, online training may be the method of choice. To this end, FKB is equipped to
handle backpropagation for gradient descent optimization of a specified cost function.
The layer encapsulation mentioned above of forward and backward operations (Section 5.2.1) be-
comes extremely valuable in training. Instead of all computations occurring within the network module
[17], they are contained in layer-specific functions. Much like the forward pass, backward operations
occur in the layer. In this fashion, each layer is responsible for computing its gradients with respect to
its parameters and returning the gradient with respect to the layer below it.
Online training can serve a variety of purposes. First, a neural network model may be learned
entirely in Fortran, based on the evolving state variables during the integration of a physical dynamical
system simulation, and then transferred to Keras after the fact. In this setting, the ground truth,
from the simulator, is passed to the network for it to calculate its errors and update its parameters
accordingly through backpropagation. Second, online training could serve to provide gentle corrections
to an imperfect pretrained model, for instance, to hedge against the amplification of its imperfections
that are only revealed once the NN is coupled to other physical calculations. Here a model is trained
offline in Keras and transferred to Fortran (Section 5.1). In some cases, for a variety of reasons, the offline
training data may have a differing distribution than that of the online data. In such a setting, it proves
beneficial to offer slight corrections to the network. Finally, a secondary model may be constructed to
learn and compensate for the deficiencies in the primary model. In this way, the two networks work
together to balance out any instability issues.
The ease of use and proper format directly results from the encapsulation of layer operations. Online
training offers a solution to tackle a suite of potential problems. As a result, models may be updated
with slight corrections or learned entirely online.

5.2.3 Custom Loss Functions

In many applications, practitioners may wish to optimize a unique quantity - a function other than
a mean squared error or cross-entropy. This is common when target variables interact or additional
information is known about their relationship in a desired application. For example, in modeling any
physical system, predictions from a neural network must not violate physical constraints - energy cannot
be created or destroyed in the system. To satisfy this restriction, a loss function can be written to
quantify the amount of violation of physical properties. This construction can then be minimized to
alleviate constraint infractions [9].
The implementation of custom loss functions is standard for high-level APIs like Keras, Tensorflow,
and PyTorch to provide this ability in their codebase [16, 1, 47]. As FKB is designed for those working
in the physical sciences where environmental, physical, or application-specific constraints are common,
it provides the ability to implement custom loss functions. To take advantage of this functionality, users
must implement their desired loss function, just as they would in Keras. As FKB does not provide
automatic differentiation, the derivatives with respect to the input are also required for training. Once
these functions have been specified they can be dropped into the existing framework and run normally,
much like Keras.

real(rk) function crossentropy_loss(self, y_true, y_pred)

! Given predicted and expected output, returns the scalar loss

7
class(network_type), intent(in out) :: self
real(rk), intent(in) :: y_true(:), y_pred(:)

loss = - sum(y_true * log(y_pred))

end function loss

function d_crossentropy_loss(self, y_true, y_pred) result(loss)

! Given predicted and expected output
! returns the loss with respect to softmax input
class(network_type), intent(in out) :: self
real(rk), intent(in) :: y_true(:), y_pred(:)
real(rk), allocatable :: loss(:)

loss = y_pred - y_true

end function d_loss

Listing 4: Implementation of crossentropy loss function and the corresponding derivation with respect
to the input logits.

This capability is demonstrated through the implementation of the cross-entropy loss function in
Listing 4. To implement this previously unavailable loss function, we first declare two functions. First,
the cross-entropy scalar loss is. Second, the loss with respect to the input logits is derived. These two
functions are then referenced as the loss and d loss, respectively. By providing this functionality, users
may leverage a variety of loss functions that can be used to minimize application-specific quantities.
Once described, they may be included with the existing framework and used during online training.

5.2.4 Ensembles
Ensembles consist of different models, each trained on the same, or bootstrapped, data. The output of
the ensemble will be an average of all its member’s predictions. In machine learning, ensembles of models
typically perform better than any one of its members alone. The ensemble strategy exploits the fact
that each model will make different errors. Therefore, when averaged together, these predictions become
more accurate, as certain errors get smoothed out. A consensus from machine learning practitioners is
ensembling gives 1-2% improvement in performance [15].
As a result of this averaging, ensembles provide a boost in performance as well as additional robust-
ness. In domains where physical constraint violations yield stability issues, ensembles may be applied
to dampen these problems. By averaging across many networks, the instability of any one model will
be drastically reduced in the presence of more sound predictions.
The functionality provided requires the user to specify a directory that contains the models of interest
and a desired amount of noise. The ensemble type will read in each model and construct a network
corresponding to each of them. To get a prediction from the ensemble, an input vector is passed to it.
For non-zero amounts of noise, Gaussian noise is applied to the input vector each time it is passed to an
ensemble member. This allows each member to see a slightly different variant of the input, increasing the
robustness of prediction around that point. This operation runs in parallel using OpenMP, where each
network can be given its thread to expedite computation; such an approach could easily be adapted via
OpenACC for GPU-based threading of large ensemble network calculations. Following the computation,
the predictions are averaged together, and the final output is given.

6 Case Study
The following section provides a case study demonstrating an application of FKB to experimental
next-generation climate modeling. The Superparameterized Community Atmospheric Model version
3.0 (SPCAM3) is used for all simulations in this study. SuperParameterization is an approach that con-
fronts the decades-long problem of representing subgrid cloud physics in climate models by embedding
thousands of limited-domain explicit sub-models of moist convection within a conventional planetary-
scale model of the large scale atmosphere [26, 35, 34, 60]. This approach tends to involve two orders
of magnitude more computational intensity per unit area of the simulated earth, but recently Rasp et
al. used a deep neural network to emulate all of the expensive subgrid cloud resolving models’ (CRM)
influence on the planetary host at drastically reduced computational expense [53]. This study, along
with others in the emerging climate modeling literature [11] have demonstrated the potential advantages

8
Name Options Parameter Type
Batch Normalization [yes, no] Choice
Dropout [0, 0.25] Continuous
Leaky ReLU coefficient [0 - 0.4] Continuous
Learning Rate [0.00001 - 0.01] Continuous (log)
Nodes per Layer [128,256,512] Discrete
Number of layers [4 - 11] Discrete
Optimizer [Adam, RMSProp, SGD ]Choice

Table 1: Hyperparameter Space

of a data-driven approach for addressing the critical unresolved effects of clouds and convection on plan-
etary climate, as compared to previous, heuristic based, approximations to subgrid physics. However,
the idea of emulating turbulence in climate simulation is still an emerging one, with unclear trade-offs,
including frequent instabilities when NN emulators are coupled with fluid dynamics, which the commu-
nity is seeking to learn how to control [11]. It has even been questioned whether the offline skill of such
emulators, during their training, is predictive of their online performance [52, 21], an important open
question.
These questions are understudied primarily due to the lack of the simple software interface that
FKB now enables for climate scientists to test diverse candidate neural networks, and ensembles within
planetary climate models.
To illustrate an advance on this front we now apply FKB to shed new light on two related questions
currently in debate:
1. Does offline performance translate to online model performance [52, 21]?
2. Which neural network hyperparameters most affect online performance?
Using FKB, the study can be broken into two stages. First, a suite of 108 candidate neural network
models of convection are trained, via Keras, on simulated data from the SPCAM3. Second, the models
are converted to Fortran and run online (i.e. coupled to planetary fluid dynamics) in the SPCAM3
simulator. The number of steps serves as a preliminary metric of performance until catastrophic failure.
It is clear that in the absence of the FKB library, running hundreds of candidate neural network
submodels of convection within the Fortran based model of the rest of the planet’s atmosphere would
be nearly impossible. As each network contains various hyperparameters, each with different weights
and biases learned during training, including layer-specific properties such as optional use of dropout
or batch-normalization. To leverage the FKB library with SPCAM3, we simply compile the neural
network library in advance and link it to the compilation of SPCAM3. Documentation steps for the
implementation of this case study are provided here: https://github.com/scientific-computing/
FKB/blob/master/SPCAM_Instructions.md.
The input to this neural network model is a 94-dimensional vector. Features include vertically re-
solved vectors representing the large scale (host model) temperature, humidity, meridional wind vertical
structure, surface pressure, incoming solar radiation, sensible heat flux, and latent heat flux scalars.
The output of the network is a 65-dimensional vector composed of the embedded models’ influence on
their host - i.e. the sum of the CRM and radiative heating rates, the CRM moistening rate, the net
radiative fluxes at the top of the atmosphere and surface of the earth, and the precipitation.
The training data used here are challenging to fit, as they come from an enhanced version of the CRM
training data that was originally studied by [53]. In superparameterized simulations, one can control
the degrees of freedom of the interior resolved scale through the room available for interesting forms of
sub-grid storm organization to form. One can control the physical extent (i.e. number of columns used
in) each embedded CRM array [50]. In [53], CRM arrays with only 8 columns (32-km extent, given the
4-km horizontal resolution) were used. Here we quadruple the extent (from 32 km to 128 km, i.e. from
8-columns to 32-columns) to improve its physical realism. Despite several attempts, these data have
never been fit successfully. NNs trained from the enriched data tend to produce crashes within just a
few simulated weeks after they are embedded in the climate model (see discussion of “NN-unstable” by
[12] for details).
Our working hypothesis is that historical failures in free-running tests when emulators are trained
on higher quality CRM training data reflect a broader issue of insufficient hyperparameter tuning in
climate model applications. To address this, we conducted neural network optimization via a random
search using SHERPA [28], a Python library for hyperparameter tuning. We detail the hyperparameters

9
Figure 3: The time-evolution of the tropospheric (a) temperature and (b) humidity biases, colorized by
the offline validation error

of interest in Table 1, as well as the range of available options during the search. The hyperparameters
of interest consisted of whether or not to use batch normalization, the amount of dropout, the leaky
ReLU coefficient, learning rate, nodes per layer, the number of layers, and the optimizer. The random
search algorithm has the advantage of making no assumptions about the structure of the hyperparameter
search problem and is ideal for exploring a variety of settings.
We attained 108 candidate neural network model configurations, each trained for 25 epochs with
early stopping monitoring the validation loss. Following the offline training stage, the neural network
models were converted into their Fortran counterparts and ran inside SPCAM3. We underscore that this
critical step would have been prohibitive using standard tools that have required manual translation of
each candidate model. However, by leveraging the FKB library, each model was loaded independently
into Fortran and run as the subgrid physics emulator inside SPCAM3’s host planetary model, of the
large-scale atmospheric state. Each model was coupled to fluid dynamics, to run a wide ensemble of
prognostic tests across an unprecedented diversity of candidate neural network architectures. Each of
the one hundred and eight candidate neural network models - with their various numbers of layers,
layer-specific settings (batch-normalization, relu magnitude, etc), nodes per layer, weights, and biases -
were run online, all without rewriting any Fortran code.
In order to address the first question and evaluate a neural network model’s performance, we compare
its validation MSE during training with the time-to-failure of the online tests in which 8,192 instances
of the NN, spaced at regular intervals around the globe, are coupled interactively to their host global
atmospheric model of large scale geophysical fluid dynamics. This yields Figure 4a, which sheds new
light on the offline vs. online relationship.
The results in this figure demonstrate a relationship between offline validation error and online per-
formance. There is a distinct, negative, relationship between offline MSE and online stability (Spearman
correlation of −0.73; p = 4.961e−19 . Intriguingly, the mean-squared error loss of our multi-layer percep-
tron is a reasonable predictor of stability once coupled to the climate model, insofar as the time-to-failure
is concerned. This finding is interesting in the context of the recent speculation by [52] that such a re-
lationship might not exist using similar NNs in a similar setting, as well as the comments by [21] about
similar incongruities even in reduced-order dynamical systems when emulated with GANs.
Of course, stability alone is a necessary but not a sufficient condition of prognostic success, which
also requires an in-depth analysis of biases in the simulated climate. Figure 3 shows the time-evolution
of the tropospheric temperature and humidity biases, colorized by the offline validation error. These
metrics reveal that although our search has uncovered many runs that are “stable” - can run without
catastrophically crashing for several months - most of these runs would not be very useful in an opera-
tional setting. Almost all NNs exhibit major errors in the simulated climate, having drifted to erroneous
attractors with root-mean-square errors in temperature frequently above 10 K. However, the NN that
produced the best offline validation error stands out as having the combined desired qualities of stability
and skill with temperature biases of less than 2 K, competitive with [53]. Interestingly, coupling instead
to the ensemble mean of a few of the best-ranked models (magenta dashed lines) does not outperform
coupling to the best fit model, the value of having found it using SHERPA (Figure 3).
In short, we have produced a successful coupled simulation that was particularly challenging without
formal hyper-parameter tuning and FKB. This result suggests that sufficient hyperparameter tuning may

10
(a) (b) (c)

(d) (e) (f)

(g) (h) (i)

Figure 4: Offline performance - validation mean squared error (MSE) - vs online performance - number
of steps until crash.(a) All models. (b) By batch normalization usage. (c) By Dropout amount. (d) By
leaky ReLU coefficient. (e) By learning rate. (f) By number of dense nodes per layer. (g) By number
of layers. (h) By total number of model parameters. (i) By optimizer type.

be critical to solving chronic instability in climate model applications of DNNs for subgrid physics.
The second question naturally arises as to which of the hyperparameters are most impactful to the
online performance. To assess this, Figure 2b-i decomposes the sensitivity of the baseline relationship
to individual hyperparameter choices. The choice of optimizer is shown to correlate most strongly with
online performance (Figure 4i). This finding is confirmed by Spearman values, shown in Table 2. The
optimizer hyperparameter has the largest absolute correlation value with online performance. No other
hyperparameter shows as clear a distinction in correlation that is evident in the choice of optimizer,
including the network depth and total number of parameters, which are known to be important to
offline fits for this problem [24], but are surprisingly not as predictive of coupled skill as the choice of
optimizer, whose impact has not previously been isolated (for this application).
Further investigation into the specific optimizer used, reveals the SGD optimizer to perform poorly;
NNs fit with SGD never run longer than 1,000 steps when coupled online (Figure 4i). Again the visual
intuition from Figure 4i is confirmed by Spearman correlation values. SGD, Adam, and RMSProp have
Spearman values of −0.6670, 0.5936, 0.0586 respectively. These values demonstrate that the use of
SGD is negatively correlated with online performance, whereas Adam positively correlates with online
performance. This result leads one to speculate that increased improvements in online skill may be
realized from more advanced optimizers with enhanced gradient update schedules.
Finally, after answering the two questions motivating this case study, we can compare the results
of the best performing model with that of previously published models of [53] when applied to the
challenging limit of CRMs with 32-km horizontal extent. The model proposed by Rasp et al. was a
single deep neural network. The hyperparameter space of this model was not fully explored online in

11
Correlation P-Value
BatchNorm 0.0859 3.7896e-01
Dropout 0.1919 4.7591e-02
Leaky ReLU 0.0055 9.5465e-01
Learning Rate -0.2087 3.0923e-02
Dense Nodes 0.1427 1.4249e-01
Layers 0.0410 6.7491e-01
Optimizer -0.6998 5.0177e-17
Parameters 0.1528 1.1609e-01

Table 2: Spearman correlation of corresponding hyperparameter with online performance, and associ-
ated p-value.

large part due to the laborious process required to transfer those models into Fortran. The Rasp et
al. model (provided by the authors) ran for 128 steps before crashing due to instability issues. The
five best models achieved in this study ran to completion of a 5-year simulation, i.e. for 87,840 steps;
of these, two of the five models further exhibited root-mean-square errors in simulated tropospheric
temperature of less than 2 degrees Celsius. This dramatic improvement in stability is a direct result of
the ease with which a wide variety of models (identified by SHERPA) can be transferred between Python
and Fortran (thanks to FKB). We also note that this method is preferable to another approach that
was recently proposed to begin stabilizing the same model, through small-amplitude Gaussian input
perturbation [12] - a strategy that, while promising, adds computational expense and introduces out-of-
sample extrapolation issues that can be avoided with the brute-force optimization and wide-ensemble
prognostic testing path to stabilization we have outlined here.
This case study has investigated two closely entangled questions: 1) Does offline performance cor-
respond to online model performance? 2) What neural network hyperparameters most effect online
performance? Both of these questions have been answered by leveraging the FKB library. The li-
brary offers the ability to expeditiously transfer models trained in Keras to Fortran, where they may
be run online in existing simulators. In the absence of FKB, neither one of these questions could be
approached without unreasonable human intervention, as the operational target is a climate model with
over a hundred thousand lines of code written in Fortran.

7 Conclusion
The ubiquitousness of deep learning has resulted from extensive free and open source libraries [16, 1, 47].
Deep learning’s success and popularity merit its integration in large-scale computing packages, like those
written in Fortran. Instead of rewriting all existing libraries in Fortran, we introduced a two-way bridge
between low-level, Fortran, and Python through the FKB Library. The library provides researchers the
ability to implement neural networks into Fortran code bases while being able to transfer them back
and forth with Keras.
Fortran, which has been a staple within computationally intensive fields for decades, will undoubtedly
see continued use due to its fast computational ability and vast amounts of legacy code. The FKB library
enables users to access many features of the Keras API directly in Fortran, including the ability to create
custom layers and loss functions to suit their needs. We demonstrate the integrability of FKB through
our case study involving the SPCAM3 simulator. An advantage of FKB is its ease of use, demonstrated
by its ability to be compiled in advance and once linked can be easily leveraged in existing large scale
simulators, as we have illustrated for the application of multi-scale physical simulations of the global
atmosphere.

8 Acknowledgments
The work of JO and PB is supported by NSF NRT grant 1633631. MP acknowledges NSF funding from
OAC-1835863 and AGS-1734164. This research also used HPC resources of the Extreme Science and
Engineering Discovery Environment (XSEDE), which is supported by the National Science Foundation
under grant number ACI-1548562 [62] and allocation number TG-ATM190002.

12
9 Conflict of Interest
The authors declare that there is no conflict of interest regarding the publication of this paper.

References
[1] Martin Abadi et al. “Tensorflow: A system for large-scale machine learning”. In: 12th Symposium
on Operating Systems Design and Implementation 2016). 2016, pp. 265–283.
[2] Forest Agostinelli et al. “Solving the Rubik’s cube with deep reinforcement learning and search”.
In: Nature Machine Intelligence 1.8 (2019), pp. 356–363.
[3] Frédéric Archambeau, Namane Méchitoua, and Marc Sakiz. “Code Saturne: A Finite Volume Code
for the computation of turbulent incompressible flows - Industrial Applications”. In: International
Journal on Finite Volumes 1.1 (Feb. 2004), http://www.latp.univ–mrs.fr/IJFV/spip.php?article3.
url: https://hal.archives-ouvertes.fr/hal-01115371.
[4] Pierre Baldi and Peter Sadowski. “The dropout learning algorithm”. In: Artificial intelligence 210
(2014), pp. 78–122.
[5] Yohai Bar-Sinai et al. “Learning data-driven discretizations for partial differential equations”. In:
Proceedings of the National Academy of Sciences 116.31 (2019), pp. 15344–15349.
[6] James Bergstra, Dan Yamins, and David D Cox. “Hyperopt: A python library for optimizing the
hyperparameters of machine learning algorithms”. In: Proceedings of the 12th Python in science
conference. Citeseer. 2013, pp. 13–20.
[7] Javier Bernal. NEURBT: A Program for Computing Neural Networks for Classification using
Batch Learning. Feb. 2015. doi: 10.6028/NIST.IR.8037.
[8] Javier Bernal and Jose Torres-Jimenez. “SAGRAD: A Program for Neural Network Training with
Simulated Annealing and the Conjugate Gradient Method”. In: Journal of research of the National
Institute of Standards and Technology 120 (June 2015), p. 113. doi: 10.6028/jres.120.009.
[9] Tom Beucler et al. “Enforcing analytic constraints in neural-networks emulating physical systems”.
In: arXiv preprint arXiv:1909.00912 (2020).
[10] L Börgesson. “Abaqus”. In: Developments in geotechnical engineering. Vol. 79. Elsevier, 1996,
pp. 565–570.
[11] Noah D Brenowitz and Christopher S Bretherton. “Prognostic validation of a neural network
unified physics parameterization”. In: Geophysical Research Letters 45.12 (2018), pp. 6289–6298.
[12] Noah D. Brenowitz et al. Interpreting and Stabilizing Machine-learning Parametrizations of Con-
vection. 2020. arXiv: 2003.06549 [physics.ao-ph].
[13] Phil Brierley. Fortran90 MLP Backprop Code. url: http://www.philbrierley.com/phil.html.
[14] Bernard Brooks et al. “CHARMM: A Program for Macromolecular Energy, Minimization, and
Dynamics Calculations”. In: Journal of Computational Chemistry 4 (Sept. 2004), pp. 187–217.
doi: 10.1002/jcc.540040211.
[15] Francois Chollet. Deep Learning mit Python und Keras: Das Praxis-Handbuch vom Entwickler der
Keras-Bibliothek. MITP-Verlags GmbH & Co. KG, 2018.
[16] François Chollet et al. Keras. https://github.com/fchollet/keras. 2015.
[17] Milan Curcic. “A parallel Fortran framework for neural networks and deep learning”. In: ACM
SIGPLAN Fortran Forum. Vol. 38. 1. ACM. 2019, pp. 4–21.
[18] M. A. Donelan et al. “Modeling waves and wind stress”. In: Journal of Geophysical Research:
Oceans 117.C11 (2012).
[19] Alfredo Ferrari et al. “FLUKA: a multi-particle transport code”. In: CERN Yellow report 2005-10
(Jan. 2005). doi: 10.2172/877507.
[20] FORTRAN. Mar. 2011. url: https : / / www . ibm . com / ibm / history / ibm100 / us / en / icons /
fortran/.

13
[21] David John Gagne II et al. “Machine Learning for Stochastic Parameterization: Generative Adver-
sarial Networks in the Lorenz ’96 Model”. In: Journal of Advances in Modeling Earth Systems 12.3
(2020). e2019MS001896 10.1029/2019MS001896, e2019MS001896. doi: 10.1029/2019MS001896.
eprint: https://agupubs.onlinelibrary.wiley.com/doi/pdf/10.1029/2019MS001896. url:
https://agupubs.onlinelibrary.wiley.com/doi/abs/10.1029/2019MS001896.
[22] David John Gagne, Chih-Chieh Chen, and Andrew Gettelman. “Emulation of Bin Microphysical
Processes with Machine Learning”. In: 100th American Meteorological Society Annual Meeting.
AMS. 2020.
[23] David John Gagne et al. “Machine Learning Parameterization of the Surface Layer: Bridging the
Observation-Modeling Gap”. In: AGUFM 2019 (2019), IN44A–04.
[24] Pierre Gentine et al. “Could machine learning break the convection parameterization deadlock?”
In: Geophysical Research Letters 45.11 (2018), pp. 5742–5751.
[25] Jean-Christophe Golaz et al. “The DOE E3SM coupled model version 1: Overview and evaluation
at standard resolution”. In: Journal of Advances in Modeling Earth Systems 11.7 (2019), pp. 2089–
2129.
[26] W. W. Grabowski. “Coupling cloud processes with the large-scale dynamics using the Cloud-
Resolving Convection Paramaterization (CRCP)”. In: Journal of the Atmospheric Sciences 58.9
(2001), pp. 978–997.
[27] IM Held et al. “Structure and performance of GFDL’s CM4. 0 climate model”. In: Journal of
Advances in Modeling Earth Systems 11.11 (2019), pp. 3691–3727.
[28] Lars Hertel et al. “Sherpa: Robust Hyperparameter Optimization for Machine Learning”. In:
Submitted to SoftwareX (2020).
[29] James W Hurrell et al. “The community earth system model: a framework for collaborative re-
search”. In: Bulletin of the American Meteorological Society 94.9 (2013), pp. 1339–1360.
[30] Sergey Ioffe and Christian Szegedy. “Batch normalization: Accelerating deep network training by
reducing internal covariate shift”. In: arXiv preprint arXiv:1502.03167 (2015).
[31] Guo-Qing Jiang, Jing Xu, and Jun Wei. “A Deep Learning Algorithm of Neural Network for
the Parameterization of Typhoon-Ocean Feedback in Typhoon Forecast Models”. In: Geophysi-
cal Research Letters 45.8 (2018), pp. 3706–3716. doi: 10.1002/2018GL077004. eprint: https:
/ / agupubs . onlinelibrary . wiley . com / doi / pdf / 10 . 1002 / 2018GL077004. url: https :
//agupubs.onlinelibrary.wiley.com/doi/abs/10.1002/2018GL077004.
[32] Kaggle. 2018 Kaggle ML & DS Survey. 2018. url: https://www.kaggle.com/kaggle/kaggle-
survey-2018.
[33] Kaggle. State of Data Science and Machine Learning 2019. 2019. url: https://www.docdroid.
net/qzyxCr4/kaggle-state-of-data-science-and-machine-learning-2019.pdf.
[34] Marat Khairoutdinov, Charlotte DeMott, and David Randall. “Evaluation of the simulated inter-
annual and subseasonal variability in an AMIP-style simulation using the CSU multiscale modeling
framework”. In: Journal of Climate 21.3 (2008), pp. 413–431.
[35] Marat Khairoutdinov, David Randall, and Charlotte DeMott. “Simulations of the atmospheric
general circulation using a cloud-resolving model as a superparameterization of physical processes”.
In: Journal of the Atmospheric Sciences 62.7 (2005), pp. 2136–2154.
[36] D. Komatitsch et al. SPECFEM3D Cartesian v2.0.2 [software]. Computational Infrastructure for
Geodynamics, 2012. doi: http://doi.org/NoDOI.
[37] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. “Imagenet classification with deep convo-
lutional neural networks”. In: Advances in neural information processing systems. 2012, pp. 1097–
1105.
[38] Nicholas LaHaye et al. “Multi-Modal Object Tracking and Image Fusion With Unsupervised Deep
Learning”. In: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
12.8 (2019), pp. 3056–3066.
[39] DJ Lary, MD Müller, and HY Mussa. “Using neural networks to describe tracer correlations”. In:
Atmospheric Chemistry and Physics 4.1 (2004), pp. 143–146.

14
[40] Julia Ling, Andrew Kurzawski, and Jeremy Templeton. “Reynolds averaged turbulence modelling
using deep neural networks with embedded invariance”. In: Journal of Fluid Mechanics 807 (2016),
pp. 155–166. doi: 10.1017/jfm.2016.615.
[41] Erdogan Madenci and Ibrahim Guven. The finite element method and applications in engineering
using ANSYS .
R Springer, 2015.

[42] Yvonne D Murray et al. Users manual for LS-DYNA concrete material model 159. Tech. rep.
United States. Federal Highway Administration. Office of Research . . ., 2007.
[43] Steffen Nissen. “Implementation of a Fast Artificial Neural Network library (FANN)”. In: (Dec.
2003).
[44] Jordan Ott, Abigail Atchison, and Erik J Linstead. “Exploring the applicability of low-shot learn-
ing in mining software repositories”. In: Journal of Big Data 6.1 (2019), p. 35.
[45] Jordan Ott et al. “A deep learning approach to identifying source code in images and video”. In:
2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR). IEEE.
2018, pp. 376–386.
[46] Jordan Ott et al. “Learning lexical features of programming languages from imagery using con-
volutional neural networks”. In: Proceedings of the 26th Conference on Program Comprehension.
ACM. 2018, pp. 336–339.
[47] Adam Paszke et al. “Automatic differentiation in pytorch”. In: (2017).
[48] James W. Lottes Paul F. Fischer and Stefan G. Kerkemeier. nek5000 Web page. http://nek5000.mcs.anl.gov.
2008.
[49] J. G. Powers et al. “The Weather Research and Forecasting Model: Overview, System Efforts, and
Future Directions”. In: Bulletin of the American Meteorological Society 98.8 (2017), pp. 1717–
1737. doi: 10.1175/BAMS-D-15-00308.1.
[50] Michael S Pritchard, Christopher S Bretherton, and Charlotte A DeMott. “Restricting 32–128
km horizontal scales hardly affects the MJO in the Superparameterized Community Atmosphere
Model v. 3.0 but the number of cloud-resolving grid columns constrains vertical mixing”. In:
Journal of Advances in Modeling Earth Systems 6.3 (2014), pp. 723–739.
[51] Maziar Raissi, Paris Perdikaris, and George E Karniadakis. “Physics-informed neural networks:
A deep learning framework for solving forward and inverse problems involving nonlinear partial
differential equations”. In: Journal of Computational Physics 378 (2019), pp. 686–707.
[52] Stephan Rasp. “Online learning as a way to tackle instabilities and biases in neural network
parameterizations”. In: arXiv preprint arXiv:1907.01351 (2019).
[53] Stephan Rasp, Michael S Pritchard, and Pierre Gentine. “Deep learning to represent subgrid
processes in climate models”. In: Proceedings of the National Academy of Sciences 115.39 (2018),
pp. 9684–9689.
[54] Samuel H Rudy et al. “Data-driven discovery of partial differential equations”. In: Science Ad-
vances 3.4 (2017), e1602614.
[55] Jürgen Schmidhuber. “Deep learning in neural networks: An overview”. In: Neural Networks 61
(2015), pp. 85–117.
[56] David Silver et al. “Mastering the game of Go with deep neural networks and tree search”. In:
nature 529.7587 (2016), p. 484.
[57] Jasper Snoek, Hugo Larochelle, and Ryan P Adams. “Practical bayesian optimization of machine
learning algorithms”. In: Advances in neural information processing systems. 2012, pp. 2951–2959.
[58] Nitish Srivastava et al. “Dropout: a simple way to prevent neural networks from overfitting”. In:
The Journal of Machine Learning Research 15.1 (2014), pp. 1929–1958.
[59] NEMO System Team. NEMO ocean engine. Scientific Notes of Climate Modelling Center 27.
Zenodo. doi: 10.5281/zenodo.1464816.
[60] Katherine Thayer-Calder and David A Randall. “The role of convective moistening in the Madden–
Julian oscillation”. In: Journal of the Atmospheric Sciences 66.11 (2009), pp. 3297–3312.
[61] Jonathan Tompson et al. “Real-time continuous pose recovery of human hands using convolutional
networks”. In: ACM Transactions on Graphics (ToG) 33.5 (2014), p. 169.

15
[62] John Towns et al. “XSEDE: Accelerating Scientific Discovery”. In: Computing in Science Engi-
neering 16.5 (Sept. 2014), pp. 62–74. issn: 1558-366X. doi: 10.1109/MCSE.2014.80.
[63] Gregor Urban et al. “Deep Learning Achieves near Human-level Polyp Detection in Screening
Colonoscopy”. In: Gastroenterology 155.4 (2018), pp. 1069–1078.
[64] M. Valiev et al. “NWChem: a comprehensive and scalable open-source solution for large scale
molecular simulations”. In: Computer Physics Communications 181.9 (2010), pp. 1477–1489.
[65] H. R. Vega-Carrillo et al. “Artificial neural networks in neutron dosimetry”. In: Radiation Pro-
tection Dosimetry 118.3 (Oct. 2005), pp. 251–259. issn: 0144-8420. doi: 10.1093/rpd/nci354.
eprint: https://academic.oup.com/rpd/article-pdf/118/3/251/4543434/nci354.pdf. url:
https://doi.org/10.1093/rpd/nci354.
[66] A.J. Wallcraft et al. “Global Ocean Prediction Using HYCOM”. In: July 2007, pp. 259–262. isbn:
978-0-7695-3088-5. doi: 10.1109/HPCMP-UGC.2007.36.
[67] Xiao Xiang Zhu et al. “Deep learning in remote sensing: A comprehensive review and list of
resources”. In: IEEE Geoscience and Remote Sensing Magazine 5.4 (2017), pp. 8–36.

DL Lab Manual
No ratings yet
DL Lab Manual
65 pages
Lista de Spare Parts / Spare Parts List: Electrical
No ratings yet
Lista de Spare Parts / Spare Parts List: Electrical
3 pages
Full download Feminist Art in Resistance: Aesthetics, Methods and Politics of Art in Turkey Elif Dastarlı pdf docx
No ratings yet
Full download Feminist Art in Resistance: Aesthetics, Methods and Politics of Art in Turkey Elif Dastarlı pdf docx
40 pages
Oblicon Cases Fourth and Fifth Assignment (Lim To Alipio)
No ratings yet
Oblicon Cases Fourth and Fifth Assignment (Lim To Alipio)
275 pages
Atkinson 1997
No ratings yet
Atkinson 1997
12 pages
FINAL Lab Report #2
No ratings yet
FINAL Lab Report #2
9 pages
Hse Powerpoint - pptx1
100% (3)
Hse Powerpoint - pptx1
161 pages
3_machine_learning_tools
No ratings yet
3_machine_learning_tools
69 pages
Machine Learning in Python Main Developments and T
100% (1)
Machine Learning in Python Main Developments and T
44 pages
Day5 FDP IoT Part1
No ratings yet
Day5 FDP IoT Part1
89 pages
Data Sets
No ratings yet
Data Sets
36 pages
UNIT-II
No ratings yet
UNIT-II
83 pages
Shear Tests On Masonry Triplets With Different Soft Layer Membranes
No ratings yet
Shear Tests On Masonry Triplets With Different Soft Layer Membranes
47 pages
dsbda Unit4
No ratings yet
dsbda Unit4
110 pages
DesignSafe Bootcamp V1
No ratings yet
DesignSafe Bootcamp V1
129 pages
Grafcut-73H-paper-cutter-operator-manual-v.06.2014
No ratings yet
Grafcut-73H-paper-cutter-operator-manual-v.06.2014
47 pages
(Undergraduate Texts in Computer Science) David R. Brooks (Auth.) - Problem Solving With Fortran 90 - For Scientists and Engineers-Springer-Verlag New York (1997) PDF
100% (1)
(Undergraduate Texts in Computer Science) David R. Brooks (Auth.) - Problem Solving With Fortran 90 - For Scientists and Engineers-Springer-Verlag New York (1997) PDF
711 pages
machine learning
No ratings yet
machine learning
39 pages
Green Procedure For Highly Efficient Rapid Synthesis of Imidazo 1 2 A Pyridine and Its Late Stage Functionalization
No ratings yet
Green Procedure For Highly Efficient Rapid Synthesis of Imidazo 1 2 A Pyridine and Its Late Stage Functionalization
10 pages
40 Most Popular Python Scientific Libraries
No ratings yet
40 Most Popular Python Scientific Libraries
9 pages
Re Cat 2 Material PDF
No ratings yet
Re Cat 2 Material PDF
47 pages
Machine Learning Python
No ratings yet
Machine Learning Python
48 pages
Ch6 - Variable Costing and Segment Reporting
No ratings yet
Ch6 - Variable Costing and Segment Reporting
33 pages
PDS Labmanualword
No ratings yet
PDS Labmanualword
32 pages
A Survey On Distributed Machine Learning
No ratings yet
A Survey On Distributed Machine Learning
33 pages
Playbook PG Students 2019 1 PDF
No ratings yet
Playbook PG Students 2019 1 PDF
82 pages
Face Mask Detection
No ratings yet
Face Mask Detection
32 pages
Eeb131 Intro to Ai and It-03
No ratings yet
Eeb131 Intro to Ai and It-03
23 pages
to send
No ratings yet
to send
6 pages
20180000413
No ratings yet
20180000413
26 pages
deep-learning-r18-jntuh-lab-manual
No ratings yet
deep-learning-r18-jntuh-lab-manual
20 pages
Revision Python For Computer Vision
No ratings yet
Revision Python For Computer Vision
50 pages
penerbit0053
No ratings yet
penerbit0053
14 pages
2 - Data Science Tools
No ratings yet
2 - Data Science Tools
21 pages
Abc Analysis of Inventory Management
No ratings yet
Abc Analysis of Inventory Management
10 pages
Deep Learning With Python: TH TH
No ratings yet
Deep Learning With Python: TH TH
36 pages
AI Tools.R
No ratings yet
AI Tools.R
15 pages
Unit V
No ratings yet
Unit V
13 pages
Fan Modelling For Front End Cooling With CFD: Master'S Thesis
No ratings yet
Fan Modelling For Front End Cooling With CFD: Master'S Thesis
58 pages
Learned Multiphysics Inversion With Differentiable Programming and Machine Learning
No ratings yet
Learned Multiphysics Inversion With Differentiable Programming and Machine Learning
13 pages
A Parallel Fortran Framework For Neural Networks and Deep Learning
No ratings yet
A Parallel Fortran Framework For Neural Networks and Deep Learning
12 pages
Neko: A Modern, Portable, and Scalable Framework For High-Fidelity Computational Fluid Dynamics
No ratings yet
Neko: A Modern, Portable, and Scalable Framework For High-Fidelity Computational Fluid Dynamics
13 pages
Micro Project Report Format
No ratings yet
Micro Project Report Format
11 pages
Deep Learning Manual
No ratings yet
Deep Learning Manual
57 pages
ML Libraries Ppt(3.3)
No ratings yet
ML Libraries Ppt(3.3)
10 pages
F2PY A Tool For Connecting Fortran and Python Prog
No ratings yet
F2PY A Tool For Connecting Fortran and Python Prog
11 pages
Hba Checklist
No ratings yet
Hba Checklist
10 pages
Practical 1
No ratings yet
Practical 1
8 pages
Sec-D ML Practical File PDF
No ratings yet
Sec-D ML Practical File PDF
19 pages
KERAS-123
No ratings yet
KERAS-123
7 pages
Climate Models Challenges For Fortran Development Tools
No ratings yet
Climate Models Challenges For Fortran Development Tools
8 pages
“Transfer Learning” for Bridging the Gap Between Data Sciences and the Deep Learning
No ratings yet
“Transfer Learning” for Bridging the Gap Between Data Sciences and the Deep Learning
9 pages
Fortran: The Introduction
No ratings yet
Fortran: The Introduction
13 pages
Top 20 Incredibly Impressive Trending Python Libraries To Work With
No ratings yet
Top 20 Incredibly Impressive Trending Python Libraries To Work With
15 pages
To Study The Characteristic Curve of Centrifugal Compressor.
50% (2)
To Study The Characteristic Curve of Centrifugal Compressor.
19 pages
Alaska Textile Co., Inc. v. Chase Manhattan Bank, N.A., 982 F.2d 813, 2d Cir. (1992)
No ratings yet
Alaska Textile Co., Inc. v. Chase Manhattan Bank, N.A., 982 F.2d 813, 2d Cir. (1992)
17 pages
1002799.Stancin_Jovic
No ratings yet
1002799.Stancin_Jovic
6 pages
Python Libraries For Data Science
No ratings yet
Python Libraries For Data Science
6 pages
An Overview and Comparison of Free Python Libraries For Data Mining and Big Data Analysis
No ratings yet
An Overview and Comparison of Free Python Libraries For Data Mining and Big Data Analysis
6 pages
Done Assignment
No ratings yet
Done Assignment
9 pages
Deep Learning Lab Manual
100% (1)
Deep Learning Lab Manual
19 pages
Etude Rondo PDF
0% (1)
Etude Rondo PDF
2 pages
On PDA
100% (1)
On PDA
13 pages
IEEE Xplore Reference Download 2025.2.9.22.39.11
No ratings yet
IEEE Xplore Reference Download 2025.2.9.22.39.11
3 pages
CV of Ranjana
No ratings yet
CV of Ranjana
3 pages
LIBRARIES FOR DATA SCIENCE_CBS_PDS
No ratings yet
LIBRARIES FOR DATA SCIENCE_CBS_PDS
2 pages
Solid Wood Flooring vs. Engineered Wood Flooring
No ratings yet
Solid Wood Flooring vs. Engineered Wood Flooring
7 pages
Core Libraries For Machine Learning
No ratings yet
Core Libraries For Machine Learning
5 pages
Cvpro
No ratings yet
Cvpro
3 pages
Commission Agreement Template Signaturely
No ratings yet
Commission Agreement Template Signaturely
3 pages
A.C. No. 10583 [Formerly CBD 09-2555]
No ratings yet
A.C. No. 10583 [Formerly CBD 09-2555]
2 pages
1963 Republican Constitution
No ratings yet
1963 Republican Constitution
2 pages
26feb05 4 Closed
100% (1)
26feb05 4 Closed
5 pages
The First Artificial Neuron
No ratings yet
The First Artificial Neuron
2 pages
Admit 2
No ratings yet
Admit 2
1 page
Course C-2023 Beam-Bearing Plates and Column Base Plates by ASD/LRFD Steel Construction Manual
No ratings yet
Course C-2023 Beam-Bearing Plates and Column Base Plates by ASD/LRFD Steel Construction Manual
24 pages
Deep Learning Applications and Image Processing
No ratings yet
Deep Learning Applications and Image Processing
5 pages
Assignment 2
No ratings yet
Assignment 2
4 pages
Deep Learning Development Environment Construction
No ratings yet
Deep Learning Development Environment Construction
3 pages
Poster Cal Tech
No ratings yet
Poster Cal Tech
1 page
Python Vs Fortran in Scientific Computing
No ratings yet
Python Vs Fortran in Scientific Computing
2 pages
ML Week1 Tools Services PDF
No ratings yet
ML Week1 Tools Services PDF
1 page
Kerasr: R Interface To The Keras Deep Learning Library
No ratings yet
Kerasr: R Interface To The Keras Deep Learning Library
1 page
PESTEL Analysis
No ratings yet
PESTEL Analysis
3 pages
Introduction To Python and ML Libraries
No ratings yet
Introduction To Python and ML Libraries
11 pages
2016 PACSA CONVENTION GUIDELINES Final PDF
No ratings yet
2016 PACSA CONVENTION GUIDELINES Final PDF
1 page
30 Amazing Machine Learning Projects For The Past Year (v.2018)
No ratings yet
30 Amazing Machine Learning Projects For The Past Year (v.2018)
22 pages
Fortran Programming Essentials: Definitive Reference for Developers and Engineers
From Everand
Fortran Programming Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Mathematical and Optimization Computing with FORTRAN
From Everand
Mathematical and Optimization Computing with FORTRAN
Pasquale De Marco
No ratings yet
Caffe Deep Learning Framework Essentials: Definitive Reference for Developers and Engineers
From Everand
Caffe Deep Learning Framework Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

A Fortran-Keras Deep Learning Bridge For Scientific Computing

Uploaded by

A Fortran-Keras Deep Learning Bridge For Scientific Computing

Uploaded by

A Fortran-Keras Deep Learning Bridge for Scientific Computing

3 The Python Anchor (Deep Learning)

4 The Fortran Anchor (Scientific Computing)

pure subroutine fwdprop(self, x)

function output(self, input) result(last_layer_output)

! BatchNorm layer - extends from base layer_type

Listing 3: Example of extending the layer type to implement Batch Normalization

5.2.2 Training in Fortran

5.2.3 Custom Loss Functions

real(rk) function crossentropy_loss(self, y_true, y_pred)

loss = - sum(y_true * log(y_pred))

function d_crossentropy_loss(self, y_true, y_pred) result(loss)

loss = y_pred - y_true

Table 1: Hyperparameter Space

(d) (e) (f)

(g) (h) (i)

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.