This three-year institute series will offer an opportunity for advanced exploration and training in three areas of critical importance to scholarly users of the TEI Guidelines. Building on two previous seminar series, we will offer advanced three-day institutes on TEI customization, XSLT for digital humanists, and bringing text encoding into the digital humanities classroom. These seminars will be aimed at an audience of faculty, archivists, librarians, and digital humanists who have had some experience with TEI and need grounding in more advanced tools, systems, and approaches. Each seminar will emphasize discussion, hands-on practice, and close attention to the specifics of participants’ own projects. The seminars will be led by the WWP staff, who are internationally recognized as experts in TEI encoding, project design, and digital humanities, together with guest instructors with expertise in specific domains appropriate to each seminar.
Many scholars, archivists, librarians, technologists, and students working with TEI at smaller institutions often do so without peer or technical support. Having completed the considerable task of encoding their text, they often ask: now that I have my document encoded, what do I do with it? TEI has huge potential for multimedia presentation, data mashups, visualizations, or sophisticated print layout. However, without knowledge of XSLT and other XML technologies, the options are either inaccessible or difficult to learn and use. The TEI Archiving, Publishing, and Access Service (TAPAS) Project seeks to build a service that will allow users with TEI-encoded texts to store, share, and discuss their materials, and transform them into forms appropriate for reading, visualizing, and participating in the web ecosystem. We will design and develop the User Interface and User Experience for this service and create an intuitive and simple-to-use environment for working with TEI-encoded data.
This presentation is a general introduction to the Text Encoding Initiative offered at the DH@Madrid Summer School at LINHD in July 2015.
This article proposes a method for marking up textual lacunae in TEI compliant XML.
The New Handbook of Syriac Literature (NHSL) is a born-digital TEI-encoded reference work for the study of Syriac literature. The first volume, Bibliotheca Hagiographica Syriaca Electronica, was published by Syriaca.org in 2016 using a simple TEI schema to describe a single genre (hagiography) (Saint-Laurent et al. 2016; see also Saint-Laurent 2016, Zanetti 2016). Past TEI-encoding practice has focused on describing specific manuscripts or creating editions of works. By contrast, the NHSL seeks to describe abstract or conceptual works (including unpublished ones) and to relate them to people, places, and other works, as well as to the manuscripts, editions, and translations that embody them. Two key features of this encoding model include using for description of works and fully leveraging @source for scholarly citations. In preparation for expanding the NHSL to include other genres, Syriaca.org is revising the TEI schema used for hagiographic works. Among other revisions, the new model will employ RDF classes and properties as @type and @ref values, respectively, in order to aid RDF serialization. The authors actively seek feedback, suggestions, and criticism concerning this revised schema.
Digital technology in teaching the work of Karen Tei Yamashita
On the teaching of Karen Tei Yamahita’s Brazil-Maru
En este artículo, tras analizar el sistema de transcripción electrónico semipaleográfi- co del Hispanic Seminar of Medieval Studies (HSMS) de la Universidad de Madison, diseñado para un fin específico, la redacción de un diccionario del español medieval, se presenta el lenguaje XML y sus posibilidades para la codificación digital de textos medievales. Para ello se ofrece una introducción ampliamente ejemplificada de las po- sibilidades que puede tener el eXtended Markup Language y en la especificación de la Text Encoding Initiative (TEI). Y se da cuenta de aplicación que se está haciendo a los manuscritos del Libro de la caza de las aves de Pero López de Ayala. This article begins with a detailed analysis of the semi-paleographic electronic-trans- cription system devised by the Hispanic Seminar of Medieval Studies (HMSM) for the construction of a database to be used for compiling a dictionary of Old Spanish. The reader is then introduced to the possibilities offered by the eXtended Markup Lan- guage (XML) and the Text Encoding Initiative (TEI) standard. These possibilities are exhaustively exemplified with digital transcriptions from the extant manuscripts of Pero López de Ayala’s Libro de la caza de las aves.
On the teaching of Karen Tei Yamashita’s I Hotel in the People’s Republic of China
The paper describes the preparation of a Buddhist corpus in the Middle Indo-Aryan language Pāli, which is available only in a flat TEI format, for content-based analysis. This task includes transforming the file into a hierarchical TEI P5 representation, followed by tokenisation (including sandhi resolution), lemmatisation, and POS tagging.