add testgen to generate test cases #1801

mewmew · 2019-10-12T19:49:05Z

Usage:

make -f MakefileVC testgen

mkdir -p testdata
wine testgen.exe

Or run the gen_tests.sh script which runs the above commands:

./gen_tests.sh

Edit: this Git history of this one is really hacky. So feel free to squash.

AJenbo · 2019-10-21T18:36:50Z

So how does this thing work? Do we need to run something afterward to make use of the test cases?
How does a success look versus a fail?

mewmew · 2019-10-22T12:22:19Z

So how does this thing work? Do we need to run something afterward to make use of the test cases?
How does a success look versus a fail?

The idea is we use gen_tests.sh to generate "golden" output for the DRLG algorithms. These correspond to the actual contents of the dungeon array (i.e. the tile ID map). Then, what we do is we compute the SHA1 hashsum of these outputs, and add them as expected output for the test cases.

$ sha1sum testdata/tiles_dlvl=1,quest_id=255,seed=123.bin
12a0410904ebf2507b6b7017f0ae191ae476686b  testdata/tiles_dlvl=1,quest_id=255,seed=123.bin
$ sha1sum testdata/tiles_dlvl=2,quest_id=6,seed=123.bin
659b95eec3e1c18d13b7f9932de108b88b356b9b  testdata/tiles_dlvl=2,quest_id=6,seed=123.bin

Now, we create a test case in C++ using Google test or something like that, and essentially set up the pre-condition (i.e. the state of global variables) before calling Createl5Dungeon. Then we compute the SHA1 hash of the result we got (i.e. the contents of the dungeon array). If the hashes match, we pass the test, if not we fail.

I played around with this and implemented a PoC in Go. For reference, see https://github.com/sanctuary/djavul/blob/e1ed5212bc6e67e0bdf6270f5da1bb474fe1bf97/d1/l1/l1_testxxx.go#L33, a simplified extract of which is provided below:

golden := []struct {
		dungeonName      string
		dlvl             int
		dtype            enum.DType
		questID          enum.QuestID
		seed             int32
		wantTiles        string
		wantDPieces      string
		wantArches       string
		wantTransparency string
	}{
		{
			dungeonName:      "Cathedral",
			dlvl:             1,
			dtype:            enum.DTypeCathedral,
			questID:          enum.QuestIDNone,
			seed:             123,
			wantTiles:        "12a0410904ebf2507b6b7017f0ae191ae476686b",
			wantDPieces:      "e15a7afb7505cb01b0b3d1befce5b8d4833ae1c6",
			wantArches:       "5438e3d7761025a2ee6f7fec155c840fc289f5dd",
			wantTransparency: "1269467cb381070f72bc6c8e69938e88da7e58cc",
		},
...
		{
			dungeonName:      "The Butcher",
			dlvl:             quests.QuestData[enum.QuestIDTheButcher].DLvlSingle,
			dtype:            enum.DTypeCathedral,
			questID:          enum.QuestIDTheButcher,
			seed:             123,
			wantTiles:        "659b95eec3e1c18d13b7f9932de108b88b356b9b",
			wantDPieces:      "15f2209ff5d066cfd568a1eab77e4328d08474e8",
			wantArches:       "42941df3ada356ebf87ce2987d26a06c44da711a",
			wantTransparency: "74c24e596ec57a91261bc3a559270f31d6811336",
		},
}

multi.MaxPlayers = 1 // single player.
pass := true
for _, g := range golden {
	// Establish pre-conditions.
	gendung.DLvl = g.dlvl
	gendung.DType = g.dtype
	for i := range quests.Quests {
		quests.Quests[i].QuestID = enum.QuestID(i)
		quests.Quests[i].QuestState = 0
	}
	gendung.IsSetLevel = false
	if g.questID != enum.QuestIDNone {
		quests.Quests[g.questID].QuestState = 1
		quests.Quests[g.questID].DLvl = g.dlvl
	}
	entry := int(0)
	CreateDungeon(uint32(g.seed), entry)
	if err := check(gendung.TileIDMap, "tiles", g.seed, g.wantTiles); err != nil {
		// FAIL
		pass = false
	}
}
if !pass {
	fmt.Println("test case failed")
} else {
	fmt.Println("test case passed")
}

Helper function to compute SHA1 hash:

// check validates the data against the given SHA1 hashsum.
func check(data interface{}, name string, seed int32, want string) error {
	buf := &bytes.Buffer{}
	if err := binary.Write(buf, binary.LittleEndian, data); err != nil {
		return errors.WithStack(err)
	}
	sum := sha1.Sum(buf.Bytes())
	got := fmt.Sprintf("%040x", sum[:])
	if got != want {
		return errors.Errorf("SHA1 hash mismatch for %v, seed 0x%08X; expected %q, got %q", name, seed, want, got)
	}
	return nil
}

AJenbo · 2019-10-22T15:54:12Z

Ok, so if I understand this correctly the goal is to provide a way for others to easily verify there important gains that of devilution, not to check devilution against vanilla?

We should take care not to use the SHA1bad implementation 😅

mewmew · 2019-10-22T18:45:32Z

Ok, so if I understand this correctly the goal is to provide a way for others to easily verify there important gains that of devilution, not to check devilution against vanilla?

Ideally, we would add this hook to the vanilla Diablo.exe executable, so we could get the ensured golden SHA1 hashes. Then adding test cases to Devilution would help ensure that we don't mess things up when doing future cleanups. Adding test cases to DevilutionX becomes especially important since DevilutionX seeks to do quite a few cleanups, any of which could change something subtle in how the DRLG works (e.g. signed vs. unsigned integers, 32- vs 64-bit integers, etc).

AJenbo · 2019-10-22T18:51:51Z

Ok, I will be looking forward for part two then :) Really exciting that this can also be used with vanilla Diablo.exe

Note, SHA1 hash of testdata/tiles_123.bin is correct for input seed 123. 12a0410904ebf2507b6b7017f0ae191ae476686b

…f drlg_l1

This seed generates a dungeon with an opening into nothing at the end of the map.

Also, add dump_dun.sh script to convert DUN file to TMX format.

In other words, this test case checks that "Levels/L1Data/SKngDO.DUN" is loaded correctly on dungeon level 3. This raises test case coverage of d1/l1 from 88.3% to 88.6%.

This helps track when TMX maps need to be regenerated based on time stmaps.

Add all broken seeds from https://github.com/sanctuary/graphics/tree/master/l1/broken d1/l1 test coverage is now at 91%.

This takes d1/l1 test coverage from 91% to 91.3%.

mewmew · 2021-01-02T22:57:44Z

@AJenbo should we close this PR now that DevilutionX has integrated test cases?

AJenbo · 2021-01-03T01:28:47Z

My understanding was that this could be used to generate test data for the tests in DevilutionX. So I was thinking we keep this around until we have done so and implemented the tests for the dungeon generators (and verified it against original game exe). I'm currently seeing some issues with the Crypt levels in DevilutionX where save games appear to not match the original layout, but I haven't fully verified it yet.

mewmew requested a review from AJenbo October 12, 2019 19:49

mewmew mentioned this pull request Feb 15, 2020

Write tests diasurgical/devilutionX#583

Open

mewmew added 23 commits March 23, 2020 23:45

testgen: prepare for test generation

7de8112

testgen: rename MakefileVC to Makefile

cb22e77

testgen: add testgen build to Makefile

3976dfc

testgen: add test case for drlg_l1

3599bd3

Note, SHA1 hash of testdata/tiles_123.bin is correct for input seed 123. 12a0410904ebf2507b6b7017f0ae191ae476686b

testgen: dump test data for dungeon pieces, arches and transparency o…

2e59003

…f drlg_l1

testgen: add table for test cases

26d2add

testgen: add more test cases for drlg_l1

dde50c3

move Makefile back to MakefileVC

5d0e48d

move MakefileGCC back to Makefile

3545d9d

testgen: add TESTGEN define to use WinMain of SourceT/main.cpp

8bf78a9

testgen: use TESTGEN to handle testgen.exe specific changes

e41b800

clean testgen objects

8f30306

testgen: add test case for Skeleton King's Lair

55ecd40

testgen: fix build after rebasing on master

3eeba0b

testgen: set setlvlnum to zero when not in a set level

e2e688e

testgen: add broken seed 7281

cbe5638

This seed generates a dungeon with an opening into nothing at the end of the map.

cmd/viewdun: add viewdun tool to convert TILES.BIN to DUN

adcdb45

Also, add dump_dun.sh script to convert DUN file to TMX format.

d1/l1: add test case for entrance to Skeleton King's Lair

4a463fc

In other words, this test case checks that "Levels/L1Data/SKngDO.DUN" is loaded correctly on dungeon level 3. This raises test case coverage of d1/l1 from 88.3% to 88.6%.

testgen: replace dump_dun.sh script with equivalent Makefile

02c214f

This helps track when TMX maps need to be regenerated based on time stmaps.

testgen: invoke LoadSetMap for instead CreateL5Dungeon of set levels

64425bc

testgen: add more broken seeds

5ad5ee8

Add all broken seeds from https://github.com/sanctuary/graphics/tree/master/l1/broken d1/l1 test coverage is now at 91%.

testgen: format source code

4ee568d

testdata: add 1666 seed

bad2361

This takes d1/l1 test coverage from 91% to 91.3%.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add testgen to generate test cases #1801

add testgen to generate test cases #1801

mewmew commented Oct 12, 2019 •

edited

Loading

AJenbo commented Oct 21, 2019

mewmew commented Oct 22, 2019 •

edited

Loading

AJenbo commented Oct 22, 2019

mewmew commented Oct 22, 2019

AJenbo commented Oct 22, 2019

mewmew commented Jan 2, 2021

AJenbo commented Jan 3, 2021

add testgen to generate test cases #1801

Are you sure you want to change the base?

add testgen to generate test cases #1801

Conversation

mewmew commented Oct 12, 2019 • edited Loading

AJenbo commented Oct 21, 2019

mewmew commented Oct 22, 2019 • edited Loading

AJenbo commented Oct 22, 2019

mewmew commented Oct 22, 2019

AJenbo commented Oct 22, 2019

mewmew commented Jan 2, 2021

AJenbo commented Jan 3, 2021

mewmew commented Oct 12, 2019 •

edited

Loading

mewmew commented Oct 22, 2019 •

edited

Loading