backgammon

Author	SHA1	Message	Date
Christoffer Müller Madsen	1062b72bda	fix typo	2018-04-19 16:04:49 +02:00
Alexander Munch-Hansen	66589dfde3	fixed global step, now using exp decay	2018-04-19 16:01:19 +02:00
Alexander Munch-Hansen	cba0f67ae2	fixed the bug	2018-04-19 15:22:00 +02:00
Pownie	611f6cdba0	Changed alpha to learning_rate	2018-04-15 23:53:35 +02:00
Pownie	7d29fc02f2	Added global step + exponential decay	2018-04-14 23:11:20 +02:00
Christoffer Müller Madsen	17f5b62e9b	proper Tesauro board representation	2018-03-28 14:36:52 +02:00
Christoffer Müller Madsen	fda2c6e08d	parametric board representation in network	2018-03-28 12:00:47 +02:00
Christoffer Müller Madsen	abce56dd40	fix typo	2018-03-27 23:13:59 +00:00
alex	95b12a6c35	Added another board_rep	2018-03-28 00:33:39 +02:00
Christoffer Müller Madsen	2654006222	fix wrongful mergings	2018-03-27 13:02:36 +02:00
Christoffer Müller Madsen	c248ca0452	Merge branch 'fuck_git' into 'rework-1' # Conflicts: # network.py	2018-03-27 10:15:51 +00:00
alex	f43108c239	Training using slightly revamped version of our own board rep. Not sure if works yet.	2018-03-27 04:06:08 +02:00
alex	006f791727	Functioning network using board representation shamelessly ripped from Tesauro	2018-03-27 02:26:15 +02:00
Christoffer Müller Madsen	4c43bf19a3	Add evaluation variance benchmark To do a benchmark for `pubeval`, run `python3 main.py --bench-eval-scores --eval-methods pubeval` Logs will be placed in directory `bench` Use `plot_bench(data_path)` in `plot.py` for plotting	2018-03-26 16:45:26 +02:00
Christoffer Müller Madsen	1f1e806306	fix errant whitespace	2018-03-26 15:55:48 +02:00
Christoffer Müller Madsen	98c9af72e7	rework network	2018-03-22 15:30:47 +01:00
Alexander Munch-Hansen	b7e6dd10af	move evaluation code into network.py	2018-03-20 13:17:38 +01:00
Alexander Munch-Hansen	99783ee4f8	clean up and move things to network.py	2018-03-20 13:03:21 +01:00
Christoffer Müller Madsen	2fc7a2a09c	fixed dumb bugs; still messy	2018-03-14 20:42:09 +01:00
Christoffer Müller Madsen	55898d0e66	renaming parameters	2018-03-12 00:11:55 +01:00
Christoffer Müller Madsen	9bc1a8ba9f	save and restore number of trained episodes	2018-03-10 00:22:20 +01:00
Christoffer Müller Madsen	150036a6cb	plot-plot	2018-03-08 17:13:25 +01:00
Christoffer Müller Madsen	30183448ec	woooow	2018-03-08 16:27:16 +01:00
Alexander Munch-Hansen	bae1e73692	Now only using one bot again. Also changed learning rate to 0.1	2018-03-07 14:44:17 +01:00
Alexander Munch-Hansen	11d25603cf	Might be able to learn now (?)	2018-03-06 16:23:08 +01:00
Anders Ladefoged	e7fe827ceb	Merge branch 'master' of gitfub.space:Pownie/backgammon	2018-03-06 12:22:45 +01:00
Anders Ladefoged	c9e4446a52	Custom activation (2*tanh(x)) function implemented with tensorflow primitives.	2018-03-06 12:19:04 +01:00
Christoffer Müller Madsen	00033a7aca	re-enable Tensorflow logging	2018-03-06 12:04:56 +01:00
Alexander Munch-Hansen	5845edf084	works now	2018-03-06 11:53:42 +01:00
Alexander Munch-Hansen	22870b90d3	things are better now	2018-03-06 11:06:38 +01:00
Alexander Munch-Hansen	d3fe3c918c	Potentially functioning network	2018-03-04 17:35:36 +01:00
Alexander Munch-Hansen	c2118d0549	Added network	2018-02-07 15:31:05 +01:00

32 Commits