Experiments are setups to test predicitons about the behavior of systems.
In software development, tests are automated experiments to verify that the behavior of the system has not changed in unwanted ways, after having changed the code and the "rules" of the system. Changes are meant to be additive only or to change behavior in predicted ways.
Since writing "tests" is shorter and writing actual code and tests that test them is easier than setting and showing how experiments are wrong, I will use software from now on. But the epistomological argumentative mistakes apply to regular scientific experiments.
The difficulty with tests, is that for them to be meaningful, the setup, data and actions of running them has to be "reasonably" identical.
There are multiple easy ways to do this wrong. Assuming we have a behavior like a natural law or piece of code we want to test and were running a test to verify:
rule: x**0=1 test(x) return type(x) == numberThe test returns a false positive.