You prepared your ratings, configured a target search engine, selected a few metrics (Precision, Recall, Ndcg…) and ran an evaluation.
The search quality evaluation dashboard has a handy Overview of the results, showing the historical progress of the metrics of your interest across your data collections and experiments, warnings are raised automatically in case of suspicious results.