backgammon/tensorflow_impl_tests/eager_main.py

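"""Eager-execution timing experiments for the backgammon value network:
batched vs. per-example inference, plus a hand-rolled gradient update."""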

import time
import numpy as np
import tensorflow as tf
from board import Board
import tensorflow.contrib.eager as tfe

tf.enable_eager_execution()

xavier_init = tf.contrib.layers.xavier_initializer()
opt = tf.train.MomentumOptimizer(learning_rate=0.1, momentum=1)

output_size = 1
hidden_size = 40
input_size = 30
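
# Two-layer sigmoid network: input_size features -> hidden_size units -> one output value.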
model = tf.keras.Sequential([
    tf.keras.layers.Dense(hidden_size, activation="sigmoid",
                          kernel_initializer=xavier_init,
                          input_shape=(1, input_size)),
    tf.keras.layers.Dense(output_size, activation="sigmoid",
                          kernel_initializer=xavier_init)
])
#tfe.Saver(model.variables).restore(tf.train.latest_checkpoint("./"))
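
# 26-entry board list for the standard backgammon starting position (sign encodes player).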
board = [0, 2, 0, 0, 0, 0, -5, 0, -3, 0, 0, 0, 5,
         -5, 0, 0, 0, 3, 0, 5, 0, 0, 0, 0, -2, 0]
all_input = np.array([Board.board_features_quack_fat(board, 1) for _ in range(20)])
single_in = Board.board_features_quack_fat(board, 1)
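
# Time one batched forward pass over all 20 boards.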
start = time.time()
all_predictions = model.predict_on_batch(all_input)
print(all_predictions)
print(time.time() - start)
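
# Time 20 individual forward passes for comparison.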
start = time.time()
all_predictions = [model(single_in) for _ in range(20)]
#print(all_predictions[:10])
print(time.time() - start)
print("-"*30)
with tf.GradientTape() as tape:
    val = model(single_in)
grads = tape.gradient(val, model.variables)
# grads = [0.1*val-np.random.uniform(-1,1)+grad for grad, trainable_var in zip(grads, model.variables)]
# print(model.variables[0][0])
weights_before = model.weights[0]
# [trainable_var.assign_add(0.1*val-0.3+grad) for grad, trainable_var in zip(grads, model.variables)]
start = time.time()
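# Manually apply the update 0.1 * (0.9 - val) * grad, nudging the prediction toward 0.9.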
for gradient, trainable_var in zip(grads, model.variables):
    backprop_calc = 0.1 * (0.9 - val) * gradient
    trainable_var.assign_add(backprop_calc)
# opt.apply_gradients(zip(grads, model.variables))
print(time.time() - start)
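
# The prediction should now be closer to 0.9 than before the update.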
print(model(single_in))
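
# Batched re-prediction; overwrite one entry to sanity-check np.argmax below.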
vals = model.predict_on_batch(all_input)
vals = list(vals)
vals[3] = 4
print(vals)
print(np.argmax(np.array(vals)))
# tfe.Saver(model.variables).save("./tmp_ckpt")