Zero-shot classification for unstructured text of archival value