Text mining using data compression models : doctoral thesis