Baseline transliteration corpus for improved english-amharic machine translation