A distance-based measure of data quality