Category Archives: Announcement

TRACER tutorial, Göttingen, May 2017!

We’re excited to announce that eTRAP will be giving its next text reuse tutorial as a pre-conference workshop of the Datech International Conference being held in Göttingen, Germany!

The tutorial will run on 30th May at the Historical Library Building (“Vortragsraum”, Papendiek 14, first floor) of the University of Göttingen.

The tutorial builds on eTRAP’s research activities, most of which deploy our TRACER machine. TRACER is a suite of algorithms aimed at investigating text reuse in different corpora, be those prose, poetry, in Italian, Latin, Ancient Greek or medieval German. TRACER provides researchers with statistical information about the texts under investigation, and its integrated reuse visualiser, TRAViz, displays the reuses in a more readable format for further study.

This tutorial is for anyone wishing to independently understand, use and run TRACER on their own data. For the purpose of the tutorial, participants will initially be working on an English data-set provided by eTRAP. Depending on the overall progress, we may also allocate some time for investigating the participants’ own data-sets! For more information about previous editions of this tutorial, visit our Events page.

If you’re interested in exploring text reuse between two or multiple texts (in the same language) and would like to learn how to do it semi-automatically, then this tutorial is for you! In order to provide everyone with adequate (technical) assistance, the workshop can only accommodate 15 participants. To apply to the tutorial, please send a short CV and a brief motivation letter to contact(at)etrap(dot)eu by 30th April 2017. Those accepted will have to register for the conference at http://ddays.digitisation.eu/registration/

In summary:
WHAT: TRACER tutorial for computational text reuse detection
WHEN: 30th May 2017, 9am-6pm
WHERE: GCDH, Seminar Room 1 (ground floor), Heyne Haus, Papendiek 16, 37073 Göttingen, Germany
WHO: For humanists and computer scientists alike who bring their own laptop
HOW MANY: Maximum of 15 participants
HOW: You may attend by applying to the email address provided and then registering for the conference. Conference registration is required to attend the workshop. There will be an extra charge of €50 for catering at the workshop and for the conference pack.
LANGUAGE: The workshop will be in English, with assistance in German should it be necessary
OTHER: You will receive very clear instructions on what to bring and prepare before the workshop

We look forward to seeing you in Göttingen!

TRACER tutorial, Rome 2017

We’re very pleased to announce that eTRAP will be giving a text reuse tutorial in collaboration with DiXiT at the annual conference of the Italian Association for Digital Humanities (AIUCD) in Rome, Italy, this coming January!

The tutorial will run on 23rd and 24th January at the Sapienza University in Rome.

The tutorial builds on eTRAP’s research activities, most of which deploy our TRACER machine. TRACER is a suite of algorithms aimed at investigating text reuse in multifarious corpora, be those prose, poetry, in Italian, Latin, Ancient Greek or medieval German. TRACER provides researchers with statistical information about the texts under investigation and its integrated reuse visualiser, TRAViz, displays the reuses in a more readable format for further study.

This tutorial seeks to teach participants to independently understand, use and run TRACER. For the purpose of the tutorial and to ensure the smoothest possible outcome, participants will initially be working on an English data-set provided by eTRAP. Depending on the overall progress, we may also allocate some time to investigating the participants’ own data-sets, provided these comply with the TRACER format. A detailed description of the tutorial can be DOWNLOADED HERE.

The workshop will be conducted in English, with assistance in Italian should it be necessary. For more information about previous editions of this tutorial, visit our Events page.

Eligibility, Requirements and Bursaries

If you’re interested in exploring text reuse between two or multiple texts (in the same language) and would like to learn how to do it semi-automatically, then this tutorial is for you. In order to provide everyone with adequate (technical) assistance, the workshop can only accommodate 12 participants. To apply to the tutorial, please send a short CV and a brief motivation letter to contact(at)etrap(dot)eu by 16th December 2016. Those accepted will have to register for the AIUCD conference at https://www.conftool.net/aiucd2017/

La Sapienza University offers travel bursaries to early career researchers who submit an abstract to the EADH day. Should you be eligible for the bursary and wish to attend our tutorial, you must submit both an abstract to EADH and a CV with motivation letter to eTRAP. You may also apply for the tutorial without an EADH submission, but you will not be eligible for a bursary in that case.

We look forward to seeing you in beautiful Rome!

Announcement: Winner of the Göttingen Dialog in Digital Humanities (GDDH) award 2016

The board of the Göttingen Dialog in Digital Humanities is pleased to announce the three best contributions of this year’s GDDH series. The winner will be handed a prize of €500 and candidates in the second and third positions will receive a notable mention.

We are delighted to announce that the winner of the seminar series of 2016 is:

Hazel Wilkinson
from the University of Cambridge, United Kingdom

with
“A database of printers’ ornaments”

Hazel Wilkinson presenting at GDDH16 on June 27th

The prize is awarded on the basis of an evaluation of both the paper and the quality of the presentation, for which this candidate received 85.73/100!

The winner is followed by yet another worthy candidate with a paper entitled “Inferring standard name form, gender and nobility from historical texts using stable model semantics”. The paper, written by Davor Lauc and Darko Vitek and presented by Davor Lauc from the University of Zagreb in Croatia, receives a notable mention for its high standard and well-presented research results. This candidate received a score of 79.84/100.

The second notable mention is awarded to the paper “Experiments of distributional semantics in stylometry” by Giulia Benotto from the Institute of Computer Linguistics (CNR) in Pisa, Italy. This paper and presentation follow with a total score of 75.68/100. This candidate was appreciated for the originality of the topic and the clear explanation of the methodology.

The slides and videos of these talks are available here.

Evaluation Method


ADHO Award 2016

Just like the happy ending of a fairy tale: the Franzini sisters win an ADHO Bursary Award of 2016 for research on the Grimm brothers!

Once upon a time there were two sisters, Greta and Emily. The sisters lived in an old building called Heyne Haus in the small German town of Göttingen. As Digital Humanities elves they busied themselves to find something that would keep them occupied indoors during the cold winter of 2015. And like the Grimm brothers who lived in that very same town two hundred years before them, they started, together with a group of loyal companions, collecting stories and motifs. Fairy tale motifs to be precise…

The Digital Breadcrumbs of Brothers Grimm project, which began in October 2015, is collecting and automatically detecting folktale motifs as text reuse units or minimal primitives. In fact, with this project Emily, Greta and their colleagues (constituting the early career research group eTRAP) are addressing two specific challenges of their field: text reuse detection at scale and cross-lingual text reuse detection. While they’ve already experimented and shown good results for the former (download the DH 2016 slides and poster from their website here), work on the latter is still ongoing: they’re manually collecting the data needed to train TRACER, a text reuse detection engine comprising 700 different algorithms, and other software to detect motifs across multiple languages. For this challenge, they’ve selected three Grimm fairy tales to work with: Snow White, Puss in Boots and The Fisherman and his Wife. To push their research forward, the sisters and their team find and read as many versions of these tales as they can (original versions, not translations, both predating and following the Grimm collection), collect motifs therein and add them to a matrix that maps languages against one another. In the second stage of the project this multilingual dataset will be used by the software tools to automatically find other matches at web scale. Furthermore, the dataset will be integrated with existing ontological resources and will lead to an exploration of folktale motifs as Linked Open Data. With the Digital Breadcrumbs of Brothers Grimm project, Emily, Greta and their team are not only able to advance research in automatic text reuse detection, but can also support folklorists and literary scholars in tackling the large amount of folkloristic materials now available online.

For their ideas and work the Franzini sisters have been awarded the ADHO Bursary Award of 2016. The award is given to promising young scholars of the Digital Humanities who make a new valuable contribution to the field. The ceremony took place during this year’s Digital Humanities Conference in Kraków. Greta has been exploring this area of study since 2009 when she began a Master’s in Digital Humanities at King’s College London (KCL). She is now at the end of a PhD in Digital Humanities at University College London (UCLDH), while working as a Research Associate in the Institute of Computer Science at Göttingen University. Emily was introduced later to the field, when she was hired as a Researcher at the Humboldt Chair of Digital Humanities in Leipzig in 2013 and later began working as a Research Associate at the Institute of Computer Science in Göttingen.


AIUCD 2016, Venice

We’re very pleased to announce that eTRAP will be giving a text reuse tutorial at the annual conference of the Italian Association for Digital Humanities in Venice, Italy, this coming September! It’s the only tutorial of the conference and it will run on 6th and 7th September at the Ca’ Foscari University.

The tutorial builds on eTRAP’s research activities, most of which deploy Marco Büchler’s TRACER tool. TRACER is a suite of algorithms aimed at investigating text reuse in multifarious corpora, be those prose, poetry, in Italian or medieval German. TRACER provides researchers with statistical information about the texts under investigation and its integrated reuse visualiser, TRAViz, displays the reuses in a more readable format for further study.

This tutorial seeks to teach participants to independently understand, use and run TRACER. For the purpose of the tutorial and to ensure the smoothest possible outcome, participants will initially be working on data-sets provided by eTRAP. Depending on the overall progress, we may also allocate some time to investigating the participants’ own data-sets, provided these comply with the TRACER format [1].

The workshop will be conducted in English. An Italian version of the tutorial flyer is available here. For more information about previous editions of this tutorial, visit our Events page.

Eligibility & Requirements

If you’re interested in exploring text reuse between two or multiple texts (in the same language) and would like to learn how to do it semi-automatically, then this tutorial is for you. In order to provide everyone with adequate (technical) assistance, the workshop can only accommodate 12 participants. To apply to the tutorial, please send your CV and a motivation letter to etrap-applications(at)gcdh(dot)de by July 31st, 2016. Those accepted will have to register for the AIUCD conference.

We look forward to seeing you in Venice!


[1] Should you be interested in investigating your own texts, please send us an email to the address above so that we can send you the requirements.

Current open paid positions for Student Assistants

Two Transcribers wanted!

(Targeted at students of German Literature or other Humanities subjects)

The early career research group eTRAP is looking for Student Assistants. The research group is associated with the Institute of Computer Science and operates from the Göttingen Centre for Digital Humanities (GCDH). Further information about the research group and its work can be found at http://etrap.gcdh.de.

Job description
We are looking for applicants interested in joining the research group on TrAIN, a new project which was recently awarded the sum of €20,000 by the University of Göttingen. TrAIN, which stands for Tracing Authorship in Noise, will run for the duration of six months from 1st June 2016. The aim of the project is to obtain digital and searchable copies of the original correspondence of the Grimm brothers – the famous authors of the Kinder- und Hausmärchen. The digital copies will be obtained in two different ways, namely by the use of an HTR (Handwritten Text Recognition) tool and multiple OCR (Optical Character Recognition) tools. The output of such work will then be used to further research in the fields of stylometry and authorship attribution.
We are hiring 2 students for the duration of 3 months (extendable contract) who will act as the transcribers of the team. They will work with Transkribus, an HTR tool used to transcribe handwritten texts.


Grant awarded!

We are very pleased to announce that eTRAP has been awarded a €20,000 grant from the University of Göttingen for a six-month pilot project. The project, TrAiN (Tracing Authorship in Noise), seeks to investigate the complex relation between noisy OCR’d data and automatic text analyses. In particular, we will investigate and attempt to define the maximum noise threshold that will allow us to adequately conduct authorship and text reuse analyses on a number of texts selected for this study. Our research questions: at which point does OCR/HTR noise interfere with the automatic identification of stable linguistic and stylistic markers? What is the minimum amount of noise we need to correct?
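To make "noise" concrete, one common way to quantify it is the character error rate (CER): the edit distance between a clean reference transcription and the OCR/HTR output, divided by the length of the reference. The sketch below is only an illustration of that general measure; it is not the project's method, and the function names `levenshtein` and `char_error_rate` are our own hypothetical choices.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of single-character insertions,
    deletions and substitutions needed to turn a into b."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute (or keep) the character
            ))
        prev = curr
    return prev[-1]


def char_error_rate(reference: str, ocr_output: str) -> float:
    """Character error rate of ocr_output relative to a clean reference."""
    if not reference:
        raise ValueError("reference must not be empty")
    return levenshtein(reference, ocr_output) / len(reference)


# A typical OCR confusion: "m" misread as "rn".
print(char_error_rate("Grimm", "Grirnm"))
```

A threshold study of the kind described above could then, for example, compare analysis results across texts grouped by their CER.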

The project includes a joint research workshop with stylometry experts to optimise existing algorithms, and to exchange ideas and knowledge.

Congratulations, team!

Project Co-PIs: Marco Büchler, Greta Franzini, Emily Franzini, Gabriela Rotari, Maria Moritz.

Article: Sentence Shortening in Historical Language Learning

eTRAP’s article “Sentence Shortening via Morpho-Syntactic Annotated Data in Historical Language Learning”, authored by Maria Moritz, Barbara Pavlek, Greta Franzini and Gregory Crane, is now published in the current issue of the ACM Journal on Computing and Cultural Heritage (JOCCH). The work was supported by the Federal Ministry of Education and Research (BMBF) and the European Social Fund (ESF). Here is the abstract:

We present an approach to shorten Ancient Greek sentences by using morpho-syntactic information attached to each word in a sentence. This work underpins the content of our eLearning application, AncientGeek, whose unique teaching technique draws from primary Greek sources. By applying a technique that skips the clausal dependents of a main verb, we reached a well-formed rate of 89% of the sentences.
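As a rough illustration of the idea in the abstract (and not the authors' implementation), the sketch below drops every subtree that hangs off the main verb through a clausal relation. Everything specific here is an assumption: the `Token` structure, the `shorten` function, and the label set `CLAUSAL`, which borrows Universal Dependencies-style relation names as a stand-in for whatever annotation scheme the article actually uses. An English sentence is used for readability.

```python
from dataclasses import dataclass


@dataclass
class Token:
    idx: int      # 1-based position in the sentence
    form: str     # surface form of the word
    head: int     # idx of the syntactic head, 0 for the root
    deprel: str   # dependency relation to the head


# Assumed label set (UD-style names), purely for illustration.
CLAUSAL = {"advcl", "ccomp", "xcomp", "acl", "csubj"}


def shorten(tokens: list[Token]) -> list[str]:
    """Return the word forms left after skipping clausal dependents of the main verb."""
    root = next(t for t in tokens if t.head == 0)
    # Start with the clausal dependents attached directly to the main verb.
    dropped = {t.idx for t in tokens
               if t.head == root.idx and t.deprel in CLAUSAL}
    # Then drop everything below them, until no more tokens are affected.
    changed = True
    while changed:
        changed = False
        for t in tokens:
            if t.idx not in dropped and t.head in dropped:
                dropped.add(t.idx)
                changed = True
    return [t.form for t in tokens if t.idx not in dropped]


sentence = [
    Token(1, "She", 2, "nsubj"),
    Token(2, "left", 0, "root"),
    Token(3, "because", 5, "mark"),
    Token(4, "it", 5, "nsubj"),
    Token(5, "rained", 2, "advcl"),
]
print(" ".join(shorten(sentence)))  # She left
```

Whether the shortened sentence is still well-formed depends on the sentence and the annotation, which is what the 89% figure in the abstract measures.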

Call for Papers: 2016 Göttingen Dialog in Digital Humanities


The Göttingen Dialog in Digital Humanities has established a forum for the discussion of digital methods applied to all areas of the Humanities and Social Sciences, including Classics, Philosophy, History, Literature, Law, Languages, Archaeology and more. The initiative is organized by the Göttingen Centre for Digital Humanities (GCDH) with the involvement of DARIAH.EU.

The dialogs will take place every Monday from April 11th until early July 2016 in the form of 90-minute seminars. Presentations will be 45 minutes long and delivered in English, followed by 45 minutes of discussion and student participation. Seminar content should be of interest to humanists, digital humanists, librarians and computer scientists. Furthermore, we proudly announce that Prof. Dr. Stefan Gradmann (KU Leuven) will be giving the opening keynote on April 11th.

We invite submissions of abstracts describing research which employs digital methods, resources or technologies in an innovative way in order to enable a better or new understanding of the humanities, both in the past and present. We also encourage contributions describing ‘work-in-progress’. Themes may include – but are not limited to – text mining, machine learning, network analysis, time series, sentiment analysis, agent-based modelling, lexical and conceptual resources for DH, or efficient visualization of big and humanities-relevant data.


Announcement: Winner of the Göttingen Dialog in Digital Humanities (GDDH) award 2015

The board of the Göttingen Dialog in Digital Humanities is pleased to announce the winners of this year’s dialog series award. The winner will be handed a prize of €500 and candidates in the second and third position will receive a notable mention.

The winner of the seminar series of 2015 is the paper:

Automated Pattern Analysis in Gesture Research: Similarity Measuring in 3D Motion Capture Models of Communicative Action
by
Daniel Schüller et al.
in combination with the presentation given by
Daniel Schüller, Christian Beecks & Irene Mittelberg
from RWTH Aachen University, Germany and University of Alberta, Canada
on 23rd June

The prize is awarded on the basis of an evaluation of both the paper and the quality of the presentation, for which this candidate received 85/100. “It was awesome” and “Valuable for studying the meaning of gestures” are comments that accompanied the scores. The scores were given for content quality, significance for theory or practice, level of innovation and presentation style, by the reviewers for the papers and by the audience for the presentations.
