
I'm trying to run a neural network multiple times with different parameters in order to calibrate the network's parameters (dropout probabilities, learning rate, etc.). However, even when I keep the parameters the same, I still get a different solution each time I run the network in a loop as follows:

filename = create_results_file()
for i in range(3):
  g = tf.Graph()
  with g.as_default():
    accuracy_result, average_error = network.train_network(
        parameters, inputHeight, inputWidth, inputChannels, outputClasses)
    f, w = get_csv_writer(filename)
    w.writerow([accuracy_result, "did run %d" % i, average_error])
    f.close()

I am using the following code at the start of my train_network function, before setting up the layers and error function of my network:

np.random.seed(1)
tf.set_random_seed(1)

I have also tried adding this code before the TensorFlow graph creation, but I keep getting different solutions in my results output.

I am using an AdamOptimizer and am initializing network weights using tf.truncated_normal. Additionally, I am using np.random.permutation to shuffle the incoming images for each epoch.
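
For reference, such a per-epoch shuffle typically looks like the following self-contained sketch (train_images and train_labels are placeholder names, not taken from the question):

import numpy as np

np.random.seed(1)  # without this, the permutation differs on every run
train_images = np.arange(10).reshape(10, 1)  # placeholder data
train_labels = np.arange(10)

# np.random.permutation draws from NumPy's global RNG, so the epoch
# order is only reproducible because the seed was set above.
perm = np.random.permutation(len(train_images))
train_images, train_labels = train_images[perm], train_labels[perm]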

– Waanders
  • Possible duplicate of [Reproducible results in Tensorflow with tf.set_random_seed](https://stackoverflow.com/questions/51249811/reproducible-results-in-tensorflow-with-tf-set-random-seed) – STJ Sep 27 '19 at 22:24

8 Answers


Setting the current TensorFlow random seed affects the current default graph only. Since you are creating a new graph for your training and setting it as default (with g.as_default():), you must set the random seed within the scope of that with block.

For example, your loop should look like the following:

for i in range(3):
  g = tf.Graph()
  with g.as_default():
    tf.set_random_seed(1)
    accuracy_result, average_error = network.train_network(
        parameters, inputHeight, inputWidth, inputChannels, outputClasses)

Note that this will use the same random seed for each iteration of the outer for loop. If you want to use a different—but still deterministic—seed in each iteration, you can use tf.set_random_seed(i + 1).
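
A self-contained version of the idea (TF 1.x API; tf.random_normal stands in here for the network's random initializers, since network.train_network is not shown):

import tensorflow as tf  # TF 1.x

for i in range(3):
  g = tf.Graph()
  with g.as_default():
    tf.set_random_seed(i + 1)   # deterministic, but distinct per iteration
    x = tf.random_normal([2])   # placeholder for the network's initializers
    with tf.Session(graph=g) as sess:
      print(i, sess.run(x))     # differs across i, identical across re-runs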

– mrry
  • I believe my set_random_seed(1) was already within the g.as_default() block, as it is one of the first lines of the train_network code. Nonetheless, I have tried putting the code as in your example, but I am still getting unstable results: accuracy / labels / error: 0.9805 / did run0 / 2.96916; 0.9807 / did run1 / 2.96494; 0.9804 / did run2 / 2.95215 – Waanders Mar 30 '16 at 13:32
  • I'm having the same problem: with `tensorflow` `0.12.1`, setting the random seed as specified, I see slight differences in probability outputs from run to run. – Luke Jan 31 '17 at 20:27
  • The reason will depend on what your function is, but it's likely that small differences in the accuracy calculation are caused by a non-deterministic parallel reduction in ops like `tf.reduce_sum()`. (These ops treat floating point addition as associative, when in reality it isn't, and changes in the reduction order can lead to slight errors in the result....) – mrry Jan 31 '17 at 20:41
  • @mrry, I'm facing a similar error with `DropoutWrapper`. When I keep `keep_prob=1`, I get consistent results. However, different runs give me different values with `keep_prob=0.8`. What could be the reason? I've moved it to http://stackoverflow.com/questions/42156296/ – martianwars Feb 10 '17 at 10:36
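
To illustrate the reduction-order point from the comments above, here is a small self-contained NumPy sketch showing that float32 addition is not associative, which is why a parallel reduction can produce slightly different results depending on operand order:

import numpy as np

a = np.float32(1e8)
b = np.float32(-1e8)
c = np.float32(1.0)

print((a + b) + c)  # 1.0 -- the large terms cancel first
print(a + (b + c))  # 0.0 -- c is rounded away into b before the cancellation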

Deterministic behaviour can be obtained by supplying either a graph-level or an operation-level seed. Both worked for me. A graph-level seed can be set with tf.set_random_seed. An operation-level seed can be supplied, e.g., in a variable initializer as in:

myvar = tf.Variable(tf.truncated_normal((10, 10), stddev=0.1, seed=0))
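
For comparison, a minimal sketch of the graph-level variant (TF 1.x style; as in the accepted answer, the seed must be set inside the graph's scope):

import tensorflow as tf  # TF 1.x

g = tf.Graph()
with g.as_default():
    tf.set_random_seed(0)  # graph-level seed; ops in g derive their seeds from it
    myvar = tf.Variable(tf.truncated_normal((10, 10), stddev=0.1))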
– ssegvic

Tensorflow 2.0 Compatible Answer: For TensorFlow versions 2.0 and above, if we want to set the global random seed, the command used is tf.random.set_seed.

If we are migrating from TensorFlow version 1.x to 2.x, we can use the command tf.compat.v2.random.set_seed.

Note that tf.function acts like a re-run of a program in this case.

To set the operation-level seed (as answered above), we can pass a seed argument to the op, e.g. tf.random.uniform([1], seed=1).
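
A minimal sketch of the TF 2.x behavior (assumes TF >= 2.0 with eager execution): once the global seed is set, a seeded op returns the same value after every reset of the global seed.

import tensorflow as tf

tf.random.set_seed(1)               # global (graph-level) seed
a = tf.random.uniform([1], seed=1)  # value fixed by (global seed, op seed)

tf.random.set_seed(1)               # resetting the global seed replays the sequence
b = tf.random.uniform([1], seed=1)
assert bool(tf.reduce_all(a == b))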

For more details, refer to the TensorFlow documentation for tf.random.set_seed.

– Tensorflow Support

Please add all the random seed functions before your code runs:

import random
import numpy as np
import tensorflow as tf

tf.compat.v1.reset_default_graph()  # only needed in TF1-style graph mode
tf.random.set_seed(0)               # tf.set_random_seed(0) in TF 1.x
random.seed(0)
np.random.seed(0)

I think some models in TensorFlow use NumPy or Python's built-in random module.
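
A small sketch of why that matters: if a pipeline mixes all three RNG sources, each one needs its own seed to make the whole block reproducible.

import random
import numpy as np
import tensorflow as tf  # assumes TF 2.x

random.seed(0)
np.random.seed(0)
tf.random.set_seed(0)

idx = list(range(5))
random.shuffle(idx)           # Python's built-in RNG
noise = np.random.randn(3)    # NumPy's RNG
init = tf.random.normal([3])  # TensorFlow's RNG
print(idx, noise, init)       # identical on every run of this block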

– Zhihui Shao

It seems as if none of these answers will fully work, due to underlying implementation issues in cuDNN.

You can get a bit more determinism by adding an extra flag:

import os
import random
import numpy as np
import tensorflow as tf

SEED = 0  # any fixed value

os.environ['PYTHONHASHSEED'] = str(SEED)
os.environ['TF_CUDNN_DETERMINISTIC'] = '1'  # new flag present in tf 2.0+
random.seed(SEED)
np.random.seed(SEED)
tf.set_random_seed(SEED)  # tf.random.set_seed(SEED) in TF 2.x (see the answer below)

But this still won't be entirely deterministic. To get an even more exact solution, you need to use the procedure outlined in this NVIDIA repo.

– Luke
  • It should be noted that setting `PYTHONHASHSEED` within the process does nothing - it needs to be set outside the process itself, or the script must open a new process with that environmental variable set beforehand. – craymichael Jan 13 '21 at 04:39

Backend setup: CUDA 10.1, cuDNN 7, tensorflow-gpu 2.1.0, keras 2.2.4-tf, and a customized VGG19 model.

After looking into the issue of unstable results for the TensorFlow backend with GPU training and large Keras-based neural network models, I was finally able to get reproducible (stable) results as follows:

  1. Import only those libraries required for setting the seed and initialize a seed value:
import tensorflow as tf
import os
import numpy as np
import random

SEED = 0
  2. Define a function to initialize the seeds of all libraries that might have stochastic behavior:
def set_seeds(seed=SEED):
    os.environ['PYTHONHASHSEED'] = str(seed)
    random.seed(seed)
    tf.random.set_seed(seed)
    np.random.seed(seed)
  3. Activate TensorFlow's deterministic behavior:
def set_global_determinism(seed=SEED):
    set_seeds(seed=seed)

    os.environ['TF_DETERMINISTIC_OPS'] = '1'
    os.environ['TF_CUDNN_DETERMINISTIC'] = '1'
    
    tf.config.threading.set_inter_op_parallelism_threads(1)
    tf.config.threading.set_intra_op_parallelism_threads(1)

# Call the above function with seed value
set_global_determinism(seed=SEED)

Important notes:

  • Please call the above code before executing any other code
  • Model training might become slower since the code is deterministic, hence there's a tradeoff
  • I experimented several times with a varying number of epochs and different settings (including model.fit() with shuffle=True) and the above code gives me reproducible results.

References:

  1. https://suneeta-mall.github.io/2019/12/22/Reproducible-ml-tensorflow.html
  2. https://keras.io/getting_started/faq/#how-can-i-obtain-reproducible-results-using-keras-during-development
  3. https://www.tensorflow.org/api_docs/python/tf/config/threading/set_inter_op_parallelism_threads
  4. https://www.tensorflow.org/api_docs/python/tf/random/set_seed?version=nightly
– Dan

I'm using TensorFlow 2 (2.2.0) and running code in JupyterLab. I've tested this on macOS Catalina and in Google Colab with the same results. I'll add a few things to Tensorflow Support's answer.

When I do some training using the model.fit() method, I do it in one cell and do other work in other cells. This is the code I run in the training cell:

# To always get the same results, place this at the top of the cell
tf.random.set_seed(1)

(x_train, y_train), (x_test, y_test) = load_data_onehot_grayscale()
model = get_mlp_model_compiled() # Creates the model, compiles it and returns it

history = model.fit(x=x_train, y=y_train,
                    epochs=30,
                    callbacks=get_mlp_model_callbacks(),
                    validation_split=.1,
                   )

This is what I understand:

  1. TensorFlow has some random processes that happen at different stages (initialization, shuffling, ...). Every time one of those processes happens, TensorFlow uses a random function. When you set the seed using tf.random.set_seed(1), you make those processes use it, and if the seed is set and the processes don't change, the results will be the same.
  2. Now, in the code above, if I move tf.random.set_seed(1) below the line model = get_mlp_model_compiled(), my results change; I believe it's because get_mlp_model_compiled() uses randomness and isn't using the seed I want.
  3. Caveat about point 2: if I run the cell 3 times in a row, I do get the same results. I believe this happens because, in run no. 1, get_mlp_model_compiled() isn't using TensorFlow's internal counter with my seed, while in run no. 2 and every subsequent run it is, so from run no. 2 onward the results stay the same (see the sketch below).
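
A minimal sketch of that counter behavior (assumes TF 2.x with eager execution):

import tensorflow as tf

tf.random.set_seed(1)
print(tf.random.uniform([1]))  # some value A (global seed, counter at start)
print(tf.random.uniform([1]))  # a different value B (counter has advanced)

tf.random.set_seed(1)          # resets the seed and the internal counter
print(tf.random.uniform([1]))  # value A again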

I might have some information wrong, so feel free to correct me.

To understand what's going on, you should read the docs; they're not long and are fairly easy to understand.

– loco.loop

This answer is an addition to Luke's answer, for TF v2.2.0:

import numpy as np
import os
import random
import tensorflow as tf # 2.2.0

SEED = 42
os.environ['PYTHONHASHSEED']=str(SEED)
os.environ['TF_CUDNN_DETERMINISTIC'] = '1'  # TF 2.1
random.seed(SEED)
np.random.seed(SEED)
tf.random.set_seed(SEED)
– Harsh
  • Using `TF_CUDNN_DETERMINISTIC` breaks the notebook with tf 2.3.0, not sure why. This made me spend two days straight debugging my code! Don't use this. – Kaushal28 Sep 22 '20 at 07:17