backgammon/bot.py

from cup import Cup
import tensorflow as tf
from network import Network
import numpy as np
from board import Board
import subprocess
import random
import sys

class Bot:

    def __init__(self, sym, config = None):
        self.config = config
        self.cup = Cup()
        self.sym = sym
        self.graph = tf.Graph()
        with self.graph.as_default():
            self.session = tf.Session()
            self.network = Network(self.session, config)
            self.network.restore_model()
            

    def roll(self):
        print("{} rolled: ".format(self.sym))
        roll = self.cup.roll()
#        print(roll)
        return roll
        

    def switch(self,cur):
        return -1 if cur == 1 else 1

    def restore_model(self):
        with self.graph.as_default():
            self.network.restore_model()

    def get_session(self):
        return self.session
        
    def get_sym(self):
        return self.sym

    def get_network(self):
        return self.network

    def make_random_move(self, board, sym, roll):
        legal_moves = Board.calculate_legal_states(board, sym, roll)
        return random.choice(list(legal_moves))


    # TODO: Test this, the score results should be deterministic
    def make_pubeval_move(self, board, sym, roll):
        legal_moves = Board.calculate_legal_states(tuple(board), sym, roll)
        moves_and_scores = []
        for board in legal_moves:
            call_argument = ["./pubeval/pubeval"]
            for x in Board.board_features_to_pubeval(board, sym):
                call_argument.append(str(x))
            data = subprocess.check_output(call_argument)
            moves_and_scores.append([board, float(bytes.decode(data))])
        scores = [ x[1] for x in moves_and_scores ]
        best_move_pair = moves_and_scores[np.array(scores).argmax()]
        return best_move_pair
        
                
    def make_move(self, board, sym, roll):
        # print(Board.pretty(board))
        legal_moves = Board.calculate_legal_states(board, sym, roll)
        moves_and_scores = [ (move, self.network.eval_state(np.array(move).reshape(1,26))) for move in legal_moves ]
        scores = [ x[1] for x in moves_and_scores ]
        best_move_pair = moves_and_scores[np.array(scores).argmax()]
        #print("Found the best state, being:", np.array(move_scores).argmax())
        return best_move_pair
Now actually has a bot playing, although randomly, it still works 2018-02-05 22:50:31 +00:00			`from cup import Cup`
Potentially functioning network 2018-03-04 16:35:36 +00:00			`import tensorflow as tf`
			`from network import Network`
			`import numpy as np`
does not work, but is improved 2018-02-13 13:38:49 +00:00			`from board import Board`
Added a bunch of pubeval stuff 2018-03-11 19:00:24 +00:00			`import subprocess`
Now actually has a bot playing, although randomly, it still works 2018-02-05 22:50:31 +00:00			`import random`
Added a bunch of pubeval stuff 2018-03-11 19:00:24 +00:00			`import sys`
Now actually has a bot playing, although randomly, it still works 2018-02-05 22:50:31 +00:00
			`class Bot:`

woooow 2018-03-08 15:27:16 +00:00			`def __init__(self, sym, config = None):`
			`self.config = config`
Now actually has a bot playing, although randomly, it still works 2018-02-05 22:50:31 +00:00			`self.cup = Cup()`
			`self.sym = sym`
Potentially functioning network 2018-03-04 16:35:36 +00:00			`self.graph = tf.Graph()`
			`with self.graph.as_default():`
			`self.session = tf.Session()`
woooow 2018-03-08 15:27:16 +00:00			`self.network = Network(self.session, config)`
save and restore number of trained episodes 2018-03-09 23:22:20 +00:00			`self.network.restore_model()`
Potentially functioning network 2018-03-04 16:35:36 +00:00
Now actually has a bot playing, although randomly, it still works 2018-02-05 22:50:31 +00:00
			`def roll(self):`
			`print("{} rolled: ".format(self.sym))`
			`roll = self.cup.roll()`
Might be able to learn now (?) 2018-03-06 15:23:08 +00:00			`# print(roll)`
Now actually has a bot playing, although randomly, it still works 2018-02-05 22:50:31 +00:00			`return roll`


			`def switch(self,cur):`
Changed look of board, now using -1 and 1 as players, cleaner, easier and faster 2018-02-06 22:29:51 +00:00			`return -1 if cur == 1 else 1`
Now actually has a bot playing, although randomly, it still works 2018-02-05 22:50:31 +00:00
Now only using one bot again. Also changed learning rate to 0.1 2018-03-07 13:44:17 +00:00			`def restore_model(self):`
			`with self.graph.as_default():`
			`self.network.restore_model()`
I think we can play against a random bot now 2018-03-08 12:32:40 +00:00
Potentially functioning network 2018-03-04 16:35:36 +00:00			`def get_session(self):`
			`return self.session`

Changed look of board, now using -1 and 1 as players, cleaner, easier and faster 2018-02-06 22:29:51 +00:00			`def get_sym(self):`
			`return self.sym`
Now actually has a bot playing, although randomly, it still works 2018-02-05 22:50:31 +00:00
Potentially functioning network 2018-03-04 16:35:36 +00:00			`def get_network(self):`
			`return self.network`

I think we can play against a random bot now 2018-03-08 12:32:40 +00:00			`def make_random_move(self, board, sym, roll):`
			`legal_moves = Board.calculate_legal_states(board, sym, roll)`
			`return random.choice(list(legal_moves))`
Added a bunch of pubeval stuff 2018-03-11 19:00:24 +00:00

pubeval evaluation 2018-03-11 23:11:40 +00:00			`# TODO: Test this, the score results should be deterministic`
Added a bunch of pubeval stuff 2018-03-11 19:00:24 +00:00			`def make_pubeval_move(self, board, sym, roll):`
pubeval evaluation 2018-03-11 23:11:40 +00:00			`legal_moves = Board.calculate_legal_states(tuple(board), sym, roll)`
Added a bunch of pubeval stuff 2018-03-11 19:00:24 +00:00			`moves_and_scores = []`
			`for board in legal_moves:`
remove pubeval binary and change expected binary location 2018-03-11 23:09:33 +00:00			`call_argument = ["./pubeval/pubeval"]`
Added a bunch of pubeval stuff 2018-03-11 19:00:24 +00:00			`for x in Board.board_features_to_pubeval(board, sym):`
			`call_argument.append(str(x))`
			`data = subprocess.check_output(call_argument)`
remove pubeval binary and change expected binary location 2018-03-11 23:09:33 +00:00			`moves_and_scores.append([board, float(bytes.decode(data))])`
Added a bunch of pubeval stuff 2018-03-11 19:00:24 +00:00			`scores = [ x[1] for x in moves_and_scores ]`
			`best_move_pair = moves_and_scores[np.array(scores).argmax()]`
			`return best_move_pair`



I think we can play against a random bot now 2018-03-08 12:32:40 +00:00
Bot reimplemented with new representation. Fixed bug where black could not bear off. More tests written. 2018-02-22 13:01:28 +00:00			`def make_move(self, board, sym, roll):`
			`# print(Board.pretty(board))`
			`legal_moves = Board.calculate_legal_states(board, sym, roll)`
flips 2018-03-06 12:04:47 +00:00			`moves_and_scores = [ (move, self.network.eval_state(np.array(move).reshape(1,26))) for move in legal_moves ]`
			`scores = [ x[1] for x in moves_and_scores ]`
fix bot.py 2018-03-06 12:08:01 +00:00			`best_move_pair = moves_and_scores[np.array(scores).argmax()]`
flips 2018-03-06 12:04:47 +00:00			`#print("Found the best state, being:", np.array(move_scores).argmax())`
fix bot.py 2018-03-06 12:08:01 +00:00			`return best_move_pair`
Now actually has a bot playing, although randomly, it still works 2018-02-05 22:50:31 +00:00
I think we can play against a random bot now 2018-03-08 12:32:40 +00:00