Digital humanities and global interaction Eetu Mäkelä This presentation: http://j.mp/dhgi-td
What to learn if you’re a humanist? 1. Knowledge of easy to use end-user data processing and visualization tools ○ Easy to use for their intended purpose, but limited 2. Knowledge of the fundamentals concepts of programming ○ Frees you to process your data more efficiently ○ Allows you to more freely apply visualizations etc based on ready libraries and tutorials on the Internet 3. High-level understanding of what types of things can be accomplished with advanced CS methods ○ To be able to communicate in collaborative projects
Collaborative digital humanities Data Science
HELDIG?
Legal Tech
Digital Social Science
Digital Humanities Digital Religion
Theses ● Digital humanities makes it possible to study global interaction without regard to artificial boundaries, be they geographical, administrative or institutional ● However, doing so requires ambitious interdisciplinary cooperation
Digital humanities research process analysis tools
raw data
Iterative exploration of data
processing tools
results
research articles
Digital humanities research process analysis tools
cleanup tools
Iterative cleanup, exploration of data, with attendant tool development
raw data
understanding data
clean data
processing tools
results
research articles
How does it actually work?
Dissecting food reputation claims in requests for EU geographical protection with Marjo Särkkä-Tirkkonen, Ruralia Institute, UH 9378 geographical 5295 specific 2601 traditional 2421 natural 2309 high 2197 fresh 1983 local 1887 particular 1457 national 1428 organoleptic 1230 special 1190 typical 1155 small 1116 characteristic 825 unique
Dissecting food reputation claims in requests for EU geographical protection with Marjo Särkkä-Tirkkonen, Ruralia Institute, UH ●
●
●
●
Physical qualities: “It therefore enjoys a great reputation with connoisseurs who consider it to be a prestige cheese, known for its organoleptic qualities and its great delicacy.” Historical importance: “Belia, a researcher at the Modern Greek Historical Studies Centre of the Athens Academy, shows that the Finiki area was known for its olive-growing and the economic importance of its olive oil production at least 180 years ago.” Current reputation: “The reputation enjoyed by Cantabria meat can be deduced from a recent study showing that it was known to 90.4 % of the people living in Cantabria.” Cultural significance: “Many poets and painters have in the past written well known poems and painted pictures with Longjing cha as the theme, inspired by the landscape.”
180 people, 33 countries, led by Oxford
Digital humanities research process analysis tools
cleanup tools integration tools
Iterative integration, cleanup, exploration of data, all with attendant tool development understanding data
clean data
processing tools
results
research articles
Digital humanities research process
research articles??
research articles??
research articles??
The COMHIS collective: Mikko Tolonen, Eetu Mäkelä, Jani Marjanen, Leo Lahti, Ville Vaara, Antti Kanner, Hege Roivainen, Laura Tarkka, Viivi Lähteenoja – computer scientists, linguists, historians with loosely aligned interests
Public discourse and knowledge production in 18th century Britain and 17th to 19th century Finland CERL
ESTC
analysis tools
cleanup tools
KUNGLIGA
integration tools
Iterative integration, cleanup, exploration of data, all with attendant tool development
FENNICA DFN BLN ECCO EEBO
understanding data
clean data
processing tools
results
research articles
Cleaning up data ●
80 % of statistical analysis is tidying up of the data. Often neglected yet implicitly assumed by many tools.
“Man without Government is of all Creatures the most unfit for Society” Discussion on luxury & pride: “What the Luxury of Military Men consists in”, ... What’s not discussed? “Why Man's craving Flesh for Food is unnatural”, ...
Number of newspapers
Number of words in the newspapers (logarithmic)
100%
50%
0% 1800 100%
50%
0%
1850
1900
A major problem.
●
●
Wiki-style correcting of (originally automatically OCR’d) transcriptions with original shown side by side. Suggested corrections increased tenfold after moving to this type of UI from old-style “send feedback” box
Digital Sinology
Ancient Name Dropping, with Hans Wietzke, Stanford Detecting clusters in references to authority in ancient greek texts on natural science – co-citations hand curated by Hans Wietzke
This presentation: http://j.mp/dhgi-td
Interfacing structured and unstructured data in sociolinguistic research on language change
Academy of Finland DIGIHUM programme
LINGUISTIC QUESTIONS Social meaning of spelling variation in historical periods of English and Finnish Neologisms in early English correspondence
TOOLS AND MATERIALS Developing a modular research toolkit for sociolinguistic analysis RESEARCH GROUP: Terttu Nevalainen (PI; University of Helsinki) Samuli Kaislaniemi (University of Helsinki)
Anni Sairio (University of Helsinki)
Taru Nordlund (PI; University of Helsinki)
Eetu Mäkelä (Aalto University)
Tanja Säily (University of Helsinki)
Katja Litola (University of Helsinki)
Poika Isokoski (PI; University of Tampere)
Anna Merikallio (University of Helsinki)
Johanna Utriainen (University of Helsinki)
Harri Siirtola (University of Tampere)
Fibra, with Dan Edelstein and Nicole Coleman, Stanford Making massive open cultural heritage databases usable for an individual scholar in the humanities
http://tinyurl.com/y9wcakhr
Dissecting food reputation claims in requests for EU geographical protection with Marjo Särkkä-Tirkkonen, Ruralia Institute, UH 9378 geographical 5295 specific 2601 traditional 2421 natural 2309 high 2197 fresh 1983 local 1887 particular 1457 national 1428 organoleptic 1230 special 1190 typical 1155 small 1116 characteristic 825 unique