Slovenščina 2.0, 2020 (2)

ENCODING POLYLEXICAL UNITS WITH TEI LEX-0: A CASE STUDY

Toma TASOVAC
Belgrade Center for Digital Humanities, Belgrade, Serbia

Ana SALGADO
NOVA CLUNL, Universidade NOVA de Lisboa, Lisbon, Portugal; Academia das Ciências de Lisboa, Lisbon, Portugal

Rute COSTA
NOVA CLUNL, Universidade NOVA de Lisboa, Lisbon, Portugal

Tasovac, T., Salgado, A., Costa, R. (2020): Encoding polylexical units with TEI Lex-0: A case study. Slovenščina 2.0, 8(2): 28–57.
DOI: https://doi.org/10.4312/slo2.0.2020.2.28-57

The modelling and encoding of polylexical units, i.e. recurrent sequences of lexemes that are perceived as independent lexical units, is a topic that has not been covered adequately and in sufficient depth by the Guidelines of the Text Encoding Initiative (TEI), a de facto standard for the digital representation of textual resources in the scholarly research community. In this paper, we use the Dictionary of the Portuguese Academy of Sciences as a case study for presenting our ongoing work on encoding polylexical units using TEI Lex-0, an initiative aimed at simplifying and streamlining the encoding of lexical data with TEI in order to improve interoperability. We introduce the notion of macro- and microstructural relevance to differentiate between polylexicals that serve as headwords for their own independent dictionary entries and those which appear inside entries for different headwords. We develop the notion of lexicographic transparency to distinguish between those units which are not accompanied by an explicit definition and those that are: the former are encoded as