Switch from cutechess-cli to fastchess #2119

vondele · 2024-07-13T11:15:26Z

This PR switches from cutechess-cli to fast-chess.

cutechess-cli has been serving us well in the past years, however, some issues have accumulated, namely the difficulty of compiling cutechess-cli, the observed timeouts at high concurrency and short TC, and e.g. slowness when indexing larger books. fast-chess https://github.com/Disservin/fast-chess has addressed these issues, and has now probably become mature enough to serve as the game manager for fishtest.

As an example of its ability to deal with short TC and high concurrency: https://dfts-0.pigazzini.it/tests/view/669249cdbee8253775cede32 with concurrency 25, and TC 1+0.01s no timeouts are observed.

fast-chess is built from sources, with the zip download as well as the binary cached as needed. There is fine-grained control over which version of fast-chess is used, so we can easily upgrade for new features.

In this PR, fast-chess is built in cutechess compatibility to facilitate integration, and to benefit from the existing fishtest checks. Once validated, we should be able to switch easily to its native mode, which can output trinomial and pentanomial results, and we should be able significantly simplify the worker's book-keeping.

vondele · 2024-07-13T11:16:32Z

The PR is ready for review from the fishtest side, still in a draft state to allow for a couple of fixes and updates to be made to fast-chess.

vondele · 2024-07-15T16:04:09Z

current pushed version runs correctly for fishtest AFAICT. Still not master fast-chess, but nearly there.

I think the most important thing now verifying that fast-chess is building with all the version of the compilers we need, and in all environments.

Current fishtest version check is gcc >= 7.3 and clang >= 8.0 (current online workers are gcc >= 9.3 and clang >= 13.0).

vondele · 2024-07-15T16:32:06Z

updated to master now.

vondele · 2024-07-15T19:11:53Z

updated to current fast-chess, verified to compile with gcc 7.3.0 and clang 8.0.0, the fishtest minimum required versions.

worker/games.py

worker/worker.py

Disservin · 2024-07-15T20:43:04Z

there was some code somewhere which made high concurrency worker only pickup ltc tasks, arguably we don't need that bit anymore with fc, but can be done later

vondele · 2024-07-15T20:43:53Z

there was some code somewhere which made high concurrency worker only pickup ltc tasks, arguably we don't need that bit anymore with fc, but can be done later

OK, will remove that, it is a server change.

Disservin · 2024-07-16T10:52:27Z

Should we also first compile the binary with tests, to make sure that everything works on the host ? make -j tests && ./fast-chess-tests
Not quite sure

vondele · 2024-07-16T11:41:57Z

To be considered... will the tests work if fast-chess is compiled with 'USE_CUTE' and how long does it take?
Are there additional dependencies if done (gtest or similar) ?

Disservin · 2024-07-16T12:11:40Z

will the tests work if fast-chess is compiled with 'USE_CUTE'

it is a separate binary which can only run the tests

how long does it take?

the tests are rather quick, ~3s.. compiling them takes considerably more time... i need to check if that can be improved

Are there additional dependencies if done (gtest or similar) ?

there's a depedency on doctest, but the repo has source code for that

vondele · 2024-07-16T12:48:41Z

testing worked locally with gcc 7.3 and clang 8.0, and is simple to add. Trying in CI.

working. Timing added is not so important, this happens just once and the binary is reused.

ppigazzini · 2024-07-16T15:12:11Z

fishtest/worker/games.py

Lines 1157 to 1180 in 12981ff

    
           # Run cutechess-cli binary. 
        
           # Stochastic rounding and probability for float N.p: (N, 1-p); (N+1, p) 
        
           idx = cmd.index("_spsa_") 
        
           cmd = ( 
        
               cmd[:idx] 
        
               + [ 
        
                   "option.{}={}".format( 
        
                       x["name"], math.floor(x["value"] + random.uniform(0, 1)) 
        
                   ) 
        
                   for x in w_params 
        
               ] 
        
               + cmd[idx + 1 :] 
        
           ) 
        
           idx = cmd.index("_spsa_") 
        
           cmd = ( 
        
               cmd[:idx] 
        
               + [ 
        
                   "option.{}={}".format( 
        
                       x["name"], math.floor(x["value"] + random.uniform(0, 1)) 
        
                   ) 
        
                   for x in b_params 
        
               ] 
        
               + cmd[idx + 1 :] 
        
           )

if I recall correctly fast-chess does not need stochastic rounding.

vondele · 2024-07-16T15:13:50Z

if I recall correctly fast-chess does not need stochastic rounding.

Isn't that more a property of SF? I think parameters are integers in SF, or at least only that is well tested.

ppigazzini · 2024-07-16T15:31:37Z

cutechess accepts only integers, but it's a pleasure to start blaming SF devs for the convoluted SPSA parameters rounding.

vondele · 2024-07-16T15:32:56Z

They consulted their legal department, which points to the UCI spec

		* spin
			a spin wheel that can be an integer in a certain range

ppigazzini · 2024-07-16T15:44:52Z

No way a legal department takes can be as a specification.

Disservin · 2024-07-16T16:39:20Z

working. Timing added is not so important, this happens just once and the binary is reused.

just for documentation, we get a one time addition of 2min - 4min

vondele · 2024-07-16T16:42:56Z

ah, you mean in fishtest CI. For workers, I think it is acceptable.
Also, this is CI, where I hardcoded concurrency to 1. Let me change that to 4, I recall that's the concurrency that is available to actions (and concurrency is used for the compilation with -j N)

Disservin · 2024-07-30T17:41:06Z

I think the culprit is that fastchess_WLD_results["games"] is not computed as W+L+D (which would include the saved_stats).

Mh fastchess updates calculates this as w+l+d.. now what comes to my mind is that maybe some race could lead to flawed updates but this shouldn't happend because all relevant parts are behind a lock. Can you give more information perhaps i.e. was the assertion wrong by 1 or 2? meaning a pair was missed or something was not updated in pairs.. also are you on the latest version of this pr? I think there were issues in the past not sure if they were ever included here or fixed before

vdbergh · 2024-07-30T17:47:46Z

I think the culprit is that fastchess_WLD_results["games"] is not computed as W+L+D (which would include the saved_stats).

Mh fastchess updates calculates this as w+l+d.. now what comes to my mind is that maybe some race could lead to flawed updates but this shouldn't happend because all relevant parts are behind a lock. Can you give more information perhaps i.e. was the assertion wrong by 1 or 2? meaning a pair was missed or something was not updated in pairs.. also are you on the latest version of this pr? I think there were issues in the past not sure if they were ever included here or fixed before

There is nothing wrong with fastchess. In the case of SPSA the worker computes WLD of a task as the sum of the WLD for the "mini tasks" (corresponding to a choice of parameters). The same should be done for the total number of games.

vdbergh · 2024-07-30T18:03:11Z

One may wonder however what the point is of this summing of WLD data corresponding to different sets of parameters...

Disservin · 2024-07-30T18:34:24Z

There is nothing wrong with fastchess. In the case of SPSA the worker computes WLD of a task as the sum of the WLD for the "mini tasks" (corresponding to a choice of parameters). The same should be done for the total number of games.

oh! i missed the fact that both assertion errors came from a spsa test, thanks

vondele · 2024-08-02T07:43:23Z

I got two assertion errors https://tests.stockfishchess.org/actions?max_actions=2&action=&user=&text=AssertionError&before=1722328311.350552&run_id=

I think the culprit is that fastchess_WLD_results["games"] is not computed as W+L+D (which would include the saved_stats).

I am bit surprised however that this issue (if present) does not cause more problems.

@vdbergh is this the proper fix (quick local test suggest it is):

diff --git a/worker/games.py b/worker/games.py
index 88a26c2..cc82c02 100644
--- a/worker/games.py
+++ b/worker/games.py
@@ -990,7 +990,12 @@ def parse_fastchess_output(
                 spsa["losses"] = fastchess_WLD_results["losses"]
                 spsa["draws"] = fastchess_WLD_results["draws"]
 
-            num_games_finished = fastchess_WLD_results["games"]
+            num_games_finished = (
+                fastchess_WLD_results["games"]
+                + saved_stats["wins"]
+                + saved_stats["losses"]
+                + saved_stats["draws"]
+            )
 
             fastchess_ptnml_results = None
             fastchess_WLD_results = None

vdbergh · 2024-08-02T07:45:31Z

Yes I think so.

vdbergh · 2024-08-08T08:55:29Z

For reference: there are some more assertion errors https://tests.stockfishchess.org/actions?max_actions=2&action=&user=&text=AssertionError&before=1723033368.540148&run_id=

I have not debugged it yet (it is again for SPSA tests).

vdbergh · 2024-08-08T17:46:31Z

Ok this is master

            assert num_games_finished == 2 * sum(rounds["pentanomial"])
            assert num_games_finished <= num_games_updated + batch_size
            assert num_games_finished <= games_to_play

This is this PR

            assert num_games_finished == 2 * sum(result["stats"]["pentanomial"])
            assert num_games_finished <= num_games_updated + batch_size
            assert num_games_finished <= games_to_play

num_games_finished is the number of games finished in the current mini task (so the fix in #2119 (comment) is actually wrong, sorry about this).

The difference between master and this PR is the right hand side of the first assert.

2 * sum(rounds["pentanomial"])

in master vs

2 * sum(result["stats"]["pentanomial"])

in this PR.

The difference it that the first expression only refers to the current mini task and the second expression refers to the aggregated results.

So I think the second expression should be replaced by

2*sum(fastchess_ptnml_results)

vdbergh · 2024-08-09T09:15:09Z

This would be the patch wrt the current PR. I am now testing it

diff --git a/worker/games.py b/worker/games.py
index cc82c02..9ef2796 100644
--- a/worker/games.py
+++ b/worker/games.py
@@ -990,15 +990,7 @@ def parse_fastchess_output(
                 spsa["losses"] = fastchess_WLD_results["losses"]
                 spsa["draws"] = fastchess_WLD_results["draws"]
 
-            num_games_finished = (
-                fastchess_WLD_results["games"]
-                + saved_stats["wins"]
-                + saved_stats["losses"]
-                + saved_stats["draws"]
-            )
-
-            fastchess_ptnml_results = None
-            fastchess_WLD_results = None
+            num_games_finished = fastchess_WLD_results["games"]
 
             assert (
                 2 * sum(result["stats"]["pentanomial"])
@@ -1006,10 +998,13 @@ def parse_fastchess_output(
                 + result["stats"]["losses"]
                 + result["stats"]["draws"]
             )
-            assert num_games_finished == 2 * sum(result["stats"]["pentanomial"])
+            assert num_games_finished == 2 * sum(fastchess_ptnml_results)
             assert num_games_finished <= num_games_updated + batch_size
             assert num_games_finished <= games_to_play
 
+            fastchess_ptnml_results = None
+            fastchess_WLD_results = None
+
             # Send an update_task request after a batch is full or if we have played all games.
             if (num_games_finished == num_games_updated + batch_size) or (
                 num_games_finished == games_to_play

worker/games.py

Viren6 · 2024-08-17T16:07:17Z

Seems time losses can be a lot higher than games played:

Checking the pgn shows the time losses is exactly twice as high as it should be, so there is probably an error between pairs vs games somewhere.

gahtan-syarif · 2024-08-17T16:11:10Z

Seems time losses can be a lot higher than games played:

Checking the pgn shows the time losses is exactly twice as high as it should be, so there is probably an error between pairs vs games somewhere.

it seems i found out what went wrong, fastchess when encountering time losses displays a warning to the user like below:

Warning; Engine Engine2 loses on time
Finished game 3 (Engine1 vs Engine2): 1-0 {Black loses on time}

this warning does not exist on cutechess.
fishtest counts the occurrences of the string "loses on time" which means it counts twice for every timeloss on fastchess output. the same applies to crashes as fc also sends a warning for disconnects

Disservin · 2024-08-17T16:18:53Z

Seems time losses can be a lot higher than games played:

Checking the pgn shows the time losses is exactly twice as high as it should be, so there is probably an error between pairs vs games somewhere.

it seems i found out what went wrong, fastchess when encountering time losses displays a warning to the user like below:
Warning; Engine Engine2 loses on time
Finished game 3 (Engine1 vs Engine2): 1-0 {Black loses on time}
this warning does not exist on cutechess. fishtest counts the occurrences of the string "loses on time" which means it counts twice for every timeloss on fastchess output. the same applies to crashes as fc also sends a warning for disconnects

Lest just get rid of that I don’t see any value in the warning report

gahtan-syarif · 2024-08-17T16:19:45Z

Seems time losses can be a lot higher than games played:

Checking the pgn shows the time losses is exactly twice as high as it should be, so there is probably an error between pairs vs games somewhere.

it seems i found out what went wrong, fastchess when encountering time losses displays a warning to the user like below:
Warning; Engine Engine2 loses on time
Finished game 3 (Engine1 vs Engine2): 1-0 {Black loses on time}
this warning does not exist on cutechess. fishtest counts the occurrences of the string "loses on time" which means it counts twice for every timeloss on fastchess output. the same applies to crashes as fc also sends a warning for disconnects
Lest just get rid of that I don’t see any value in the warning report

i suggest making it logger::trace so it could still show up on the log file

vondele · 2024-08-17T18:19:28Z

Updated fast-chess after the fix, thanks for analysis and fixes.

gahtan-syarif · 2024-08-25T06:50:45Z

        # Parse line like this:
        # Finished game 1 (stockfish vs base): 0-1 {White disconnects}
        if "disconnects" in line or "connection stalls" in line:
            result["stats"]["crashes"] += 1

        if "on time" in line:
            result["stats"]["time_losses"] += 1

i suggest changing to:

        # Parse line like this:
        # Finished game 1 (stockfish vs base): 0-1 {White disconnects}
        if "disconnect" in line or "stall" in line:
            result["stats"]["crashes"] += 1

        if "on time" in line or "timeout" in line:
            result["stats"]["time_losses"] += 1

might also want to rename timelosses to timeouts
based on: https://github.com/cutechess/cutechess/blob/780065637f9936bc29cc592c6f0b2007ccbf66de/projects/lib/src/board/result.cpp#L109-L132

vondele · 2024-09-02T05:32:28Z

converting to draft, awaiting a change in convention for mate in plies / moves in the pgn output:

Disservin/fastchess#696

Edit: Fixed, PR updated.

gahtan-syarif · 2024-10-02T09:40:50Z

                "-tournament",
                "gauntlet",

better to get rid of this since its unecessary and fastchess currently supports only roundrobin format

vondele marked this pull request as draft July 13, 2024 11:15

vondele force-pushed the fastchessPR branch 2 times, most recently from f340f91 to 91babcd Compare July 13, 2024 11:22

This was referenced Jul 13, 2024

Dealing with invalid options.. Disservin/fastchess#535

Closed

output message order at very fast time control Disservin/fastchess#554

Closed

gahtan-syarif mentioned this pull request Jul 15, 2024

Hardcode source and header files in the makefile Disservin/fastchess#562

Closed

vondele force-pushed the fastchessPR branch from 91babcd to 92fd18d Compare July 15, 2024 15:58

vondele force-pushed the fastchessPR branch from 92fd18d to dc5ae98 Compare July 15, 2024 19:10

vondele force-pushed the fastchessPR branch from dc5ae98 to 080e2c7 Compare July 15, 2024 19:17

Disservin reviewed Jul 15, 2024

View reviewed changes

worker/games.py Outdated Show resolved Hide resolved

worker/worker.py Outdated Show resolved Hide resolved

vondele force-pushed the fastchessPR branch 2 times, most recently from 4cdb5f2 to deaf11b Compare July 15, 2024 21:33

vondele force-pushed the fastchessPR branch from 26cf015 to 7067393 Compare July 16, 2024 13:02

Fix num games finished, updated fast-chess

13254f6

Fix num_games_finished

5309d48

Viren6 reviewed Aug 17, 2024

View reviewed changes

worker/games.py Outdated Show resolved Hide resolved

Fix rounds

42ea7e5

Update fast-chess, fix 'loses on time' count.

8b63a22

Update fastchess

5135d43

Generalize match for crashes and time_losses

43927b6

vondele marked this pull request as draft September 2, 2024 05:31

Update fast-chess, fix pgn mate scores

089a471

vondele marked this pull request as ready for review September 3, 2024 07:14

vondele added 2 commits September 3, 2024 17:42

Update fastchess

f45cd1c

fastchess update

f30ed35

Update fastchess, remove gauntlet

e86863b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Switch from cutechess-cli to fastchess #2119

Switch from cutechess-cli to fastchess #2119

vondele commented Jul 13, 2024

vondele commented Jul 13, 2024

vondele commented Jul 15, 2024

vondele commented Jul 15, 2024

vondele commented Jul 15, 2024

Disservin commented Jul 15, 2024

vondele commented Jul 15, 2024

Disservin commented Jul 16, 2024

vondele commented Jul 16, 2024

Disservin commented Jul 16, 2024

vondele commented Jul 16, 2024 •

edited

Loading

ppigazzini commented Jul 16, 2024

vondele commented Jul 16, 2024

ppigazzini commented Jul 16, 2024

vondele commented Jul 16, 2024

ppigazzini commented Jul 16, 2024 •

edited

Loading

Disservin commented Jul 16, 2024

vondele commented Jul 16, 2024 •

edited

Loading

Disservin commented Jul 30, 2024

vdbergh commented Jul 30, 2024

vdbergh commented Jul 30, 2024

Disservin commented Jul 30, 2024

vondele commented Aug 2, 2024

vdbergh commented Aug 2, 2024

vdbergh commented Aug 8, 2024

vdbergh commented Aug 8, 2024

vdbergh commented Aug 9, 2024

Viren6 commented Aug 17, 2024

gahtan-syarif commented Aug 17, 2024 •

edited

Loading

Disservin commented Aug 17, 2024

gahtan-syarif commented Aug 17, 2024

vondele commented Aug 17, 2024

gahtan-syarif commented Aug 25, 2024 •

edited

Loading

vondele commented Sep 2, 2024 •

edited

Loading

gahtan-syarif commented Oct 2, 2024

Switch from cutechess-cli to fastchess #2119

Are you sure you want to change the base?

Switch from cutechess-cli to fastchess #2119

Conversation

vondele commented Jul 13, 2024

vondele commented Jul 13, 2024

vondele commented Jul 15, 2024

vondele commented Jul 15, 2024

vondele commented Jul 15, 2024

Disservin commented Jul 15, 2024

vondele commented Jul 15, 2024

Disservin commented Jul 16, 2024

vondele commented Jul 16, 2024

Disservin commented Jul 16, 2024

vondele commented Jul 16, 2024 • edited Loading

ppigazzini commented Jul 16, 2024

vondele commented Jul 16, 2024

ppigazzini commented Jul 16, 2024

vondele commented Jul 16, 2024

ppigazzini commented Jul 16, 2024 • edited Loading

Disservin commented Jul 16, 2024

vondele commented Jul 16, 2024 • edited Loading

Disservin commented Jul 30, 2024

vdbergh commented Jul 30, 2024

vdbergh commented Jul 30, 2024

Disservin commented Jul 30, 2024

vondele commented Aug 2, 2024

vdbergh commented Aug 2, 2024

vdbergh commented Aug 8, 2024

vdbergh commented Aug 8, 2024

vdbergh commented Aug 9, 2024

Viren6 commented Aug 17, 2024

gahtan-syarif commented Aug 17, 2024 • edited Loading

Disservin commented Aug 17, 2024

gahtan-syarif commented Aug 17, 2024

vondele commented Aug 17, 2024

gahtan-syarif commented Aug 25, 2024 • edited Loading

vondele commented Sep 2, 2024 • edited Loading

gahtan-syarif commented Oct 2, 2024

vondele commented Jul 16, 2024 •

edited

Loading

ppigazzini commented Jul 16, 2024 •

edited

Loading

vondele commented Jul 16, 2024 •

edited

Loading

gahtan-syarif commented Aug 17, 2024 •

edited

Loading

gahtan-syarif commented Aug 25, 2024 •

edited

Loading

vondele commented Sep 2, 2024 •

edited

Loading