A comparison of rater calibration methods