fortran_dnn_from_tf

Barebones method of implementing a pre-trained DNN from TensorFlow in a Fortran script.

https://github.com/ajfurlong/fortran_dnn_from_tf

Keywords

deep-learning deep-neural-network fortran neural-network tensorflow

Repository

Basic Info

Host: GitHub
Owner: ajfurlong
License: mit
Language: Fortran
Default Branch: main
Homepage:
Size: 429 KB

Statistics

Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created almost 2 years ago · Last pushed over 1 year ago

Using Pre-Trained TensorFlow DNNs in Fortran

The purpose of this project is to provide a simple framework for using deep neural networks (DNNs) previously trained with TensorFlow to make predictions in a Fortran project, with results that can be checked directly against the original TensorFlow predictions. This approach allows Fortran programs to leverage the predictive power of TensorFlow-trained models without requiring complex integration or dependencies beyond HDF5 for data handling. There are a few other projects that focus on training and creating DNNs within Fortran, but this project does one thing that is easy to understand and implement. The basic idea behind this workflow is to combine the ease of model training in Python with TensorFlow and a straightforward integration of the trained model into Fortran.

No conversion of any information is required for this script, and it will take model.h5 files directly from the TensorFlow script without outside manipulation.

Relevant verification cases and narrative can be found here.

Project Overview

This project offers a minimalistic solution for incorporating TensorFlow-trained DNNs into Fortran applications. The project also includes data processing routines that facilitate benchmarking against original TensorFlow predictions to verify accuracy and consistency. Users are responsible for defining the network architecture manually, but the instructions provided make this process manageable and hopefully intuitive. The system is designed to process large HDF5 files for input and output data.

Prerequisites

Fortran Compiler: Ensure you have a Fortran compiler installed (e.g., gfortran).
HDF5 Library: Required for handling HDF5 files. Install via package manager or from HDF5 Group.
Python: Required for running the TensorFlow training script (no separate model conversion step is needed).
TensorFlow: Install via pip install tensorflow for training and saving models.

Project Structure

bin/: Directory for storing compiled executables.
src/: Contains Fortran source files, including modules and the main program.
obj/: Stores compiled object and module files.
example/: Example case to train a DNN model using TensorFlow in Python, in this case sinusoid.py.
models/: Stores model weights and biases in a model.h5 file and scaler parameters in a metadata.h5 file.
data/: Stores input and output datasets in a single data.h5 file.
output/: Contains saved "verification mode" performance metrics in a .txt file.

Usage: Example

The included example, sinusoid.py, covers the entire process from training the DNN on the TensorFlow side to running inference within Fortran. The example directory includes the data, model, and source script for a DNN learning a modified sinusoid.

Step 1: Train the DNN using TensorFlow

The sinusoid.py script should already be configured correctly, so just verify that you have the correct packages installed and run it. This script first generates 5,000 points with a small amount of stochastic noise (reproducible because the random seed is set). The data is shuffled and standardized (to mu=0 and sigma=1 to reduce bias from magnitude), and then 80% is used to train the model, with 20% held out for testing.
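The data preparation described above can be sketched in plain Python. This is only an illustration and not the contents of sinusoid.py: the target function, the seed value, the noise level, and the choice to compute the statistics over the full data set before splitting are all assumptions made for the sketch.

```python
import math
import random
import statistics

random.seed(42)  # assumed seed; makes the noise and the shuffle reproducible

# Hypothetical stand-in for the "modified sinusoid": two inputs, one noisy output.
N = 5000
rows = []
for _ in range(N):
    x1 = random.uniform(0.0, 2.0 * math.pi)
    x2 = random.uniform(0.0, 2.0 * math.pi)
    y = math.sin(x1) * math.cos(x2) + random.gauss(0.0, 0.05)
    rows.append((x1, x2, y))

random.shuffle(rows)

def column_stats(values):
    """Return (mean, population standard deviation) of one column."""
    return statistics.fmean(values), statistics.pstdev(values)

# One (mean, std) pair per column: input1, input2, output.
# These are the kind of parameters that would be stored alongside the model.
stats = [column_stats([r[j] for r in rows]) for j in range(3)]

# Standardize every column to mu = 0, sigma = 1.
scaled = [
    tuple((r[j] - stats[j][0]) / stats[j][1] for j in range(3))
    for r in rows
]

# 80% for training, 20% held out for testing.
split = int(0.8 * N)
train, test = scaled[:split], scaled[split:]
print(len(train), len(test))
```

With 5,000 points the split yields 4,000 training rows and 1,000 test rows, which matches the size of the test set used later in the example.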

The 1,000 test points (inputs, true outputs, and the TensorFlow model's predictions) are exported into an HDF5 file within the example/data/ directory. The model is saved as an HDF5 file to example/model/. This is technically a "legacy" format, but it is easy to extract weights from and will be supported by TensorFlow for the foreseeable future.

The output of this script includes some performance statistics. Directly under these are the input/output means and standard deviations used during the standardization process. These standardization parameters are automatically stored in the metadata.h5 file saved under models/.

Step 2: Modify main.f90 network architecture (or whatever module you'd like the predictions to go to)

This is where you specify the network structure and configuration. This is much easier than in the first version of the DNN framework, which required the user to dig through the source network module. Now, a network architecture is defined with two pieces of information within whatever main program you are using. First, specify the number of inputs for the network with num_inputs. You will also need to add more variables at the top of main.f90 to reflect the number of inputs you have. Then, initialize the network with the number of layers and the number of "neurons" per layer, which is done with initialize_network([network structure]). The network structure is defined by [input_neurons, layer1_neurons, ..., output_neurons]. If you are predicting a single parameter, output_neurons is one.

num_inputs = 2
call initialize_network([2,16,16,2])

call load_weights(model_path)
call load_metadata(metadata_path, x_mean, y_mean, x_std, y_std)

! Assign activation functions for each layer
layer_activations(1)%func => relu_fn
layer_activations(2)%func => relu_fn
layer_activations(3)%func => no_activation
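To make the structure list concrete, the following Python sketch shows how a list such as [2,16,16,2] maps to per-layer weight shapes, and what a forward pass through dense layers with ReLU and linear activations computes. It is not the project's Fortran code. The function names (layer_shapes, dense, predict) and the toy weights are hypothetical, and the weight layout assumed here is the Keras Dense kernel layout of (inputs, outputs).

```python
def relu(v):
    """Element-wise max(0, x)."""
    return [max(0.0, x) for x in v]

def identity(v):
    """Linear output layer: no activation applied."""
    return list(v)

def layer_shapes(structure):
    """Map [n_in, n_1, ..., n_out] to the (inputs, outputs) shape of each layer."""
    return [(structure[k], structure[k + 1]) for k in range(len(structure) - 1)]

def dense(x, weights, biases):
    """y[j] = sum_i x[i] * weights[i][j] + biases[j]."""
    return [
        sum(x[i] * weights[i][j] for i in range(len(x))) + biases[j]
        for j in range(len(biases))
    ]

def predict(x, layers):
    """Apply each (weights, biases, activation) triple in order."""
    for weights, biases, activation in layers:
        x = activation(dense(x, weights, biases))
    return x

# The structure from the example has three weight layers.
print(layer_shapes([2, 16, 16, 2]))  # [(2, 16), (16, 16), (16, 2)]

# A tiny hand-checkable network with structure [2, 2, 1] and made-up weights.
toy_layers = [
    ([[1.0, -1.0], [0.5, 2.0]], [0.0, -4.0], relu),
    ([[2.0], [1.0]], [0.5], identity),
]
toy_output = predict([1.0, 2.0], toy_layers)
print(toy_output)  # [4.5]
```

A structure list with N entries therefore needs N-1 activation assignments, one per weight layer, which is why the example above assigns three activations.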

Step 3: Modify main.f90 data handling

The first thing to change is the definition of the data arrays (e.g., input1(:), input2(:), y_data(:)). Add or remove these depending on how many input parameters you have. If you are not in "verification mode" comparing outputs of the Fortran implementation against those of the TensorFlow model, remove y_data(:) and y_pred_tf(:) and other relevant mentions throughout main.f90.

When it comes to the read_dataset calls, you will need to adjust the string to match the names of the datasets in the HDF5 file and the input array you would like to send that information to. The other arguments will be automatically adjusted. For our example, where there are two inputs, one true output, and the TensorFlow predicted output, this becomes:

! Read datasets from data.h5 file
print *, 'Reading datasets...'
call read_dataset(filename, 'input1', input1, num_entries, debug)
call read_dataset(filename, 'input2', input2, num_entries, debug)
call read_dataset(filename, 'output_true', y_data, num_entries, debug)
call read_dataset(filename, 'output_pred', y_pred_tf, num_entries, debug)

The x_data array is then allocated with one column per input (num_inputs). Add the relevant lines to accommodate your input data channels:

! Allocate and combine input data into a single array
allocate(x_data(num_entries, num_inputs))
x_data(:, 1) = input1(:)
x_data(:, 2) = input2(:)

The list of standardized channels can then be modified as well:

! Standardize datasets if needed (if you are using physical data)
if (standardize_data) then
    print *, 'Standardizing datasets...'
    call standardize(x_data(:, 1), x_mean(1), x_std(1))
    call standardize(x_data(:, 2), x_mean(2), x_std(2))
end if
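As a rough illustration of what the standardization step does with the stored means and standard deviations, here is a short Python sketch. The standardize function mirrors the idea of the Fortran routine of the same name but is not its source. The unstandardize helper is hypothetical, and the assumption that predictions are mapped back to physical units with y_mean and y_std is an inference from the metadata that is loaded, not something the text above states.

```python
def standardize(values, mean, std):
    """Scale physical values to zero mean and unit standard deviation."""
    return [(v - mean) / std for v in values]

def unstandardize(values, mean, std):
    """Hypothetical inverse: map standardized values back to physical units."""
    return [v * std + mean for v in values]

# Made-up parameters standing in for one entry of x_mean and x_std.
x_mean, x_std = 10.0, 2.0
column = [8.0, 10.0, 14.0]

scaled = standardize(column, x_mean, x_std)
print(scaled)  # [-1.0, 0.0, 2.0]

restored = unstandardize(scaled, x_mean, x_std)
print(restored)  # [8.0, 10.0, 14.0]
```

The key point is that the same means and standard deviations used during training must be applied at inference time, which is why they travel with the model in the metadata file.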

All of the necessary modifications should now be complete. Again, this is set up in "verification mode", which assumes that you have known outputs (y_data) and TensorFlow predictions (y_pred_tf), which are then compared against the Fortran predictions (y_pred). If "production mode" is desired, feel free to remove the references to these two variables and to compute_metrics(). The call to predict() is currently configured to produce a full list of predictions, but you can call it for individual x_data vectors by removing it from the do loop and adjusting the array dimensions elsewhere.
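The kind of comparison made in "verification mode" can be sketched as follows. The exact metrics that compute_metrics() reports are not listed here, so the two functions below (rmse and max_abs_diff) and all numbers are hypothetical examples of how Fortran predictions might be checked against the true outputs and against the TensorFlow predictions.

```python
import math

def rmse(a, b):
    """Root-mean-square error between two equal-length sequences."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)) / len(a))

def max_abs_diff(a, b):
    """Largest element-wise absolute difference."""
    return max(abs(p - q) for p, q in zip(a, b))

# Made-up values for illustration only.
y_data = [1.0, 2.0, 3.0, 4.0]            # known outputs
y_pred_tf = [1.5, 2.0, 2.5, 4.0]         # TensorFlow predictions
y_pred = [1.5000001, 2.0, 2.4999999, 4.0]  # Fortran predictions

# Model accuracy: how far the Fortran predictions are from the truth.
print("RMSE vs. true outputs:", rmse(y_pred, y_data))

# Implementation agreement: how closely Fortran reproduces TensorFlow.
print("Max |Fortran - TF|:", max_abs_diff(y_pred, y_pred_tf))
```

Separating the two comparisons is useful: a large error against the true outputs points to the trained model, while a large difference against the TensorFlow predictions points to the Fortran setup (for example, a wrong activation function or missing standardization).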

Step 4: Compile

Head back out to the parent directory fortran_dnn_from_tf and compile with the Makefile. Ensure that you have the necessary packages installed: a Fortran compiler (gfortran) and HDF5. The Makefile is currently set up as it would be on macOS, so modify it as needed for your system.

make

This will produce an executable in the bin/ directory.

Step 5: Run the Fortran Program

./bin/main path/to/datafile path/to/modelfile path/to/metadatafile [options]

Options:
standardize: applicable in most cases; standardizes your incoming data with the specified means/stds from the metadata file
debug: prints detailed information to help determine where an issue lies

In the case of the example, the number of entries in the testing set is 1000, and the data file that was exported contained physical values, which will need to be standardized.

./bin/main data/sinusoid_test_data.h5 models/sinusoid_model_tf.h5 models/sinusoid_metadata_tf.h5 standardize

The example is configured in "verification mode", so it will print a table of performance values and save this table to a .txt file under output/.

Future Enhancements

• Additional Activation Functions: Expand the set of supported activation functions to accommodate more complex models.
• Automatic Network Configuration: Allow users to define network architecture and parameters via an input file, improving flexibility.
• OR: Configure automatically from the attributes located in the model.h5 file.
• Support for Current TensorFlow Model Format: Implement compatibility with the current TensorFlow model save format for broader usability (unlikely due to complexity and lack of necessity).

Owner

Login: ajfurlong
Kind: user

Repositories: 1
Profile: https://github.com/ajfurlong

Citation (CITATION.cff)

cff-version: 1.4.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Furlong
    given-names: Aidan
    orcid: https://orcid.org/0000-0002-6291-2959
title: "Making Predictions in Fortran using Pre-trained TensorFlow DNN Models "
version: 1.4.0
date-released: 2025-01-18
