A few thoughts about testing chess engines

My adventure with chess engine development started a good few years ago, and one of the issues I’ve definitely struggled with a lot was: how to test if a new build with a very sound feature is actually performing better than the old one. This article is a summary of the experience I gained after four engines, where each of them had a different methodology.