Performance metrics for testing statistical calculations in interlaboratory comparisons