Canonical test ordering - lexicographic #78570

RenechCDDA · 2024-12-14T18:04:00Z

Summary

Infrastructure "Tests now run in a guaranteed order (lexicographic) by default (can be overriden with cli argument)"

Purpose of change

Fix vehicle_turret test #78567 (comment) exposed some good questions. What is the ordering in how our tests are run? Apparently it's partly determined by linking order, which makes reproducing some test fails impossible without using the same link order (e.g. being on the same commit and using the same build method).

This is an unnecessary impediment when we're trying to diagnose test failures.

Describe the solution

Run the tests in lexicographical order instead.

Note that this works across subsets of tests. As an example I used the tests from #78524 which are DECLARED in the order
NPC rules (avoid doors)
NPC rules (close doors)
NPC rules (avoid locks)

By default that is the order they are run in (declared order). Running the tests with the arguments of "[npc_rules]" --wait-for-keypress exit --order lex instead gets us the following order:
NPC rules (avoid doors)
NPC rules (avoid locks)
NPC rules (close doors)

So the desired order(lexicographical) remains consistent even when additional arguments are provided, or subsets of tests are run.

Describe alternatives you've considered

We could run in the provided rand order instead. I initially thought this would be useful if we needed to specify subsets of tests to run, but lexicographical has us covered there already.

Since the random order was made subset stable, we promise that given the same random seed, the order of test cases will be the same across different platforms, as long as the tests were compiled against identical version of Catch2. We reserve the right to change the relative order of tests cases between Catch2 versions, but it is unlikely to happen often.

I thought random order might also be useful if we needed to shuffle the order of test cases for some reason, but the more I thought about it I failed to find any 'some reason'.

Testing

Works on my machine

Would like to see github CI all green before this is considered mergeable

Additional context

There is a second, mostly-unrelated commit on this branch, intentionally.

It enables --wait-for-keypress exit on MSVC builds, because it is really annoying that the default is to close the tests after being run.

This should only be the case when a human being manually runs the test (and only when using MSVC). It should NOT run when github tests does its auto magic, otherwise they'll get stuck waiting for a keypress that never comes.

If it turns out to be problematic... well that's why it's in a separate commit, we can just get rid of it.

PatrikLundell · 2024-12-14T18:16:07Z

The only reason I can see to run tests in a random order would be to find poorly designed tests that rely on conditions they aren't setting up properly, but then we'd run into the problem of tracking the cause down when it happens. I'd say it's better not to detect such errors or encounter them deterministically, especially since the tests aren't intended to test that the tests are good, but that the tested code does what it's intended to do.

RenechCDDA · 2024-12-14T18:16:40Z

The only reason I can see to run tests in a random order would be to find poorly designed tests that rely on conditions they aren't setting up properly, but then we'd run into the problem of tracking the cause down when it happens. I'd say it's better not to detect such errors or encounter them deterministically, especially since the tests aren't intended to test that the tests are good, but that the tested code does what it's intended to do.

Agree on all points! :)

RenechCDDA · 2024-12-14T18:47:59Z

...Oh goodness it looks like we're gonna have some fixing to do

mqrause · 2024-12-15T11:55:15Z

There is a second, mostly-unrelated commit on this branch, intentionally.

It enables --wait-for-keypress exit on MSVC builds, because it is really annoying that the default is to close the tests after being run.

I'm a bit confused by this. I don't think that does anything when you put it there? Plus it doesn't just close for me. I don't think it ever did and I don't remember ever changing some setting. Though it's been too long since I used VS2019 honestly.

RenechCDDA mentioned this pull request Dec 14, 2024

Fix vehicle_turret test #78567

Merged

github-actions bot added json-styled JSON lint passed, label assigned by github actions astyled astyled PR, label is assigned by github actions labels Dec 14, 2024

github-actions bot added Vehicles Vehicles, parts, mechanics & interactions [C++] Changes (can be) made in C++. Previously named `Code` labels Dec 14, 2024

RenechCDDA force-pushed the canonical_test_order branch from 17141e6 to c5d5c18 Compare December 17, 2024 19:22

github-actions bot added the NPC / Factions NPCs, AI, Speech, Factions, Ownership label Dec 17, 2024

RenechCDDA force-pushed the canonical_test_order branch 2 times, most recently from 20d729f to fbb058d Compare December 17, 2024 20:42

github-actions bot added the Melee Melee weapons, tactics, techniques, reach attack label Dec 17, 2024

RenechCDDA force-pushed the canonical_test_order branch 2 times, most recently from a351191 to a1b5ada Compare December 17, 2024 23:35

RenechCDDA added 2 commits December 19, 2024 19:24

Run tests in lexographical order

dfe6c86

MSVC manual test runs wait for keypress to exit

515e30d

RenechCDDA force-pushed the canonical_test_order branch from a1b5ada to e1414e4 Compare December 20, 2024 00:29

github-actions bot added the Items: Containers Things that hold other things label Dec 20, 2024

Fix some tests that relied on previous ordering, maybe

c4b5075

RenechCDDA force-pushed the canonical_test_order branch from e1414e4 to c4b5075 Compare December 20, 2024 01:34

github-actions bot added the EOC: Effects On Condition Anything concerning Effects On Condition label Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Canonical test ordering - lexicographic #78570

Canonical test ordering - lexicographic #78570

RenechCDDA commented Dec 14, 2024 •

edited

Loading

PatrikLundell commented Dec 14, 2024

RenechCDDA commented Dec 14, 2024

RenechCDDA commented Dec 14, 2024

mqrause commented Dec 15, 2024

Canonical test ordering - lexicographic #78570

Are you sure you want to change the base?

Canonical test ordering - lexicographic #78570

Conversation

RenechCDDA commented Dec 14, 2024 • edited Loading

Summary

Purpose of change

Describe the solution

Describe alternatives you've considered

Testing

Additional context

PatrikLundell commented Dec 14, 2024

RenechCDDA commented Dec 14, 2024

RenechCDDA commented Dec 14, 2024

mqrause commented Dec 15, 2024

RenechCDDA commented Dec 14, 2024 •

edited

Loading