Introduction to automatic collation

collation.png

 

Description en français dans le programme complet en PDF ci-dessous

 

This two-day course (26 & 27.03.2020) offers an introduction to the theory and the practice of automatic collation.

Collation is the practice of comparing texts in order to identify their variant forms. It is used in the preparation of critical editions, as well in genetic criticism and in the study of textual transmissions. More in general, the alignment of two or more texts, which is the basis of collation, can be useful for analysing text reuse and intertextuality. Computers have been used to automatize collation since the 60’s, but even though several examples of stable software are available nowadays, this tedious task is still often done by hand. This workshop will show participants how to take advantage of automatic collation tools in the field of textual and genetic criticism. A brief history of collation together with a reflection about the practices of the research community in Humanities and Social Sciences when working on the digital ecosystem will contextualise the analysis of the tools.

During the course, we will address the basic principles of the theory of automatic collation and briefly present a number of programs. In the hands-on part of the course, we will use CollateX, a collation software, to put into practice what the participants have learned. Eventually, the last part of the course can be devoted to find the right software and parameters for the use cases brought by the participants.

Invited Speaker

Elli Bleeker (Royal Netherlands Academy of Arts and Sciences) : "What has been left out: The perks and pitfalls of collating multi-layered manuscripts."

Elli Bleeker works as postdoctoral researcher in the Research and Development Team at the Humanities Cluster, part of the Royal Netherlands Academy of Arts and Sciences. She specializes in digital scholarly editing and computational philology, with a focus on modern manuscripts, genetic criticism and semi-automated collation. As a Research Fellow in the Marie Sklodowska-Curie funded network DiXiT (2013 – 2017), she received advanced training in manuscript studies, text modeling, and XML technologies. She also enjoys spending time in archives looking for the forgotten writers from the 20th century.

For whom

The course is open to students and researchers working with texts in any discipline.

Requirements. No prior knowledge of automatic collation nor programming is required.

Your materials. Participants will have the opportunity to send their texts in advance, so they can benefit from a personalised support during the course.

Installation. There is no need to install software to follow the course, because we will be using virtual machines. If, nevertheless, participants want to have their own installation, they may follow the instructions that they will receive after registration.

 

PROGRAMME COMPLET (ENG/FR)

IntroCollation.pdf  (119 Ko)

Informations pratiques

Intitulé du cours Introduction à la collation automatique
Enseignants Helena Bermudez Sabel, Elisa Nury, Elena Spadini
Date 26 et 27 mars 2020
Lieu UNIL, Géopolis 2215
Horaire 9h30 - 18h

Enseignantes

Helena Bermudez Sabel est chercheuse post-doctorante en linguistique computationnelle à l’Université de Lausanne. Elle a enseigné différents aspects de la philologie numérique

Elisa Nury est chercheuse post-doctorante en à l'Université de Genève. Sa thèse porte sur la collation automatique et les éditions critiques numériques.

Elena Spadini est chercheuse post-doctorante en philologie numérique à l'Université de Lausanne où elle enseigne notamment la collation automatique.

Programme

 

Day 1 - 26.03.2020

9h30-9h45 Reception

9h45-11h00 First session. Explore your computer (command line, Jupyter Notebooks, Python)

11h00 – 11h30 Coffee break

11h30 – 12h30 Second session. [Continuation] Explore your computer (command line, Jupyter Notebooks, Python)

12h30 – 14h Lunch

14h – 15h15 Third session. Theory of automatic collation and reflection about the use of digital tools for textual criticism.

15h15 – 15h45 Coffee break

15h45 – 17h00 Fourth session. [Continuation] Theory of automatic collation and reflection about the use of digital tools for textual criticism.

17h15 Keynote. Elli Bleeker, Royal Academy of Arts and Sciences (The Netherlands).

Evening. Social dinner.

 

Day 2 - 27.03.2020

9h30 – 10h45 First session. Automatic collation practise: environment configuration, making a simple collation.

10h45 – 11h15 Coffee break

11h15 – 12h30 Invited speaker: Rudolf Mahrer (UNIL) will present the platform Variance. Second session. [Continuation] Automatic collation practise: making a simple collation.

12h30 – 14h00 Lunch

14h – 15h15 Third session. Automatic collation practise: making a simple collation with the possibility to work with the texts of the participants.

15h15 – 15h45 Coffee break

15h45 – 17h00 Fourth session. Automatic collation practise: making a simple collation with the possibility to work with the texts of the participants.

Partagez: