• David Bamman (2020), “Born-Literary Natural Language Processing,” Debates in Digital Humanities.
  • David Bamman and Patrick J. Burns (2020), “Latin BERT: A Contextual Language Model for Classical Philology.”
  • David Bamman, Olivia Lewke and Anya Mansoor (2020), “An Annotated Dataset of Coreference in English Literature,” LREC 2020.
  • Peter M. Broadwell and Timothy R. Tangherlini. 2020. “Comparative K-Pop Choreography Analysis through Deep-Learning Pose Estimation across A Large Video Corpus.” DH Quarterly (forthcoming)
  • Shadi Shahsavari, Ehsan Ebrahimzadeh, Behnam Shahbazi, Misagh Falahi, Pavan Holur, Roja Bandari, Timothy R. Tangherlini, Vwani Roychowdhury. “An Automated Pipeline for Character and Relationship Extraction from Readers Literary Book Reviews on” In: Proceedings of WebSci 2020, Southampton, England. Association for Computing Machinery.
  • Matthew Sims and David Bamman (2020), “Measuring Information Propagation in Literary Social Networks,” EMNLP 2020.
  • Tangherlini, Timothy R., Shadi Shahsavari, Behnam Shahbazi, Ehsan Ebrahimzadeh, and Vwani Roychowdhury. “An automated pipeline for the discovery of conspiracy and conspiracy theory narrative frameworks: Bridgegate, Pizzagate and storytelling on the web.” PLOS ONE 15, no. 6 (2020): e0233879.


  • David Bamman, Sejal Popat and Sheng Shen (2019), “An Annotated Dataset of Literary Entities,” NAACL 2019.
  • Jon Gillick, Adam Roberts, Jesse Engel, Douglas Eck and David Bamman (2019), “Learning to Groove with Inverse Sequence Transformations,” ICML 2019.
  • Jon Gillick, Carmine-Emanuele Cella and David Bamman (2019), “Estimating Unobserved Audio Features for Target-Based Orchestration,” ISMIR 2019.
  • Jon Gillick and David Bamman (2019), “Breaking Speech Recognizers to Imagine Lyrics,” NeurIPS Workshop on Machine Learning for Creativity and Design.
  • Matthew Sims, Jong Ho Park and David Bamman (2019), “Literary Event Detection,” ACL 2019.


  • Peter M. Broadwell, Peter Leonard, and Timothy R. Tangherlini. 2018. “‘Hvad der byggedes om dagen, blev revet ned om natten …’: Word Sequence Repetition in Danish Legend Tradition.” Svenska landsmål och svenskt folkliv 140(2017): 9-27.
  • Broadwell, Peter M. and Timothy R. Tangherlini. 2018. “Confusing the Modern Breakthrough: Naïve Bayes Classification of Authors and Works.” Human IT 14(2): 19-42.
  • Jon Gillick and David Bamman (2018), “Please Clap: Modeling Applause in Campaign Speeches,” NAACL 2018.
  • Jon Gillick and David Bamman (2018), “Telling Stories with Soundtracks: An Empirical Analysis of Music in Film,” NAACL 2018 Storytelling Workshop.
  • Kimiko Ryokai, Elena Durán López, Noura Howell, Jon Gillick, and David Bamman (2018), “Capturing, Representing, and Interacting with Laughter,” CHI 2018.
  • Storm, Ida, and Timothy R. Tangherlini. 2018. “‘En temmelig lang fodtur’: hGIS, Text Mining, and Folklore Collection in 19th Century Denmark.” Human IT 14(2): 43-81.
  • Schmitt, Christoph and Timothy R. Tangherlini. 2018. “Folklore Archives Online. Zur Sichtbarmachung, Auswertbarkeit und Interoperabilität einer dänischen und einer nordostdeutschen Sammlung.” Jahrbuch für Europäische Ethnologie 2018 13(3): 181-204.
  • Ted Underwood, David Bamman, and Sabrina Lee (2018), “The Transformation of Gender in English-Language Fiction,” Cultural Analytics.


  • David Bamman, Michelle Carney, Jon Gillick, Cody Hennesy, and Vijitha Sridhar (2017), “Estimating the Date of First Publication in a Large-Scale Digital Library,” JCDL 2017.
  • David Bamman (2017), “Natural Language Processing for the Long Tail,” Digital Humanities 2017.
  • Roja Bandari, Zicong Zhou, Hai Qian, Timothy R. Tangherlini, Vwani Roychowdhury. 2017. “A Resistant Strain: The Grassroots Rise of the Anti-Vaccination Movement among Online Moms.” Computer (November 2017): 2-9.
  • Peter M. Broadwell and Timothy R. Tangherlini. 2017. “GhostScope: Conceptual Mapping of Supernatural Phenomena in a Large Folklore Corpus.” In, Maths meets myths: Quantitative approaches to ancient texts. Edited by Ralph Kenna, Máirín MacCarron, Pádraig MacCarron. Cham, Switzerland: Springer. Pp. 131-158.
  • Peter M. Broadwell, Timothy R. Tangherlini, and Hyung Kyong Chang. 2017. “Online Knowledge Bases and Cultural Technology: Analyzing Production Networks in Korean Popular Music.” In, Proceedings of the 2016 International Conference of Digital Archives and Digital Humanities. NTNU, Taiwan.
  • Peter M. Broadwell, David Mimno, and Timothy R. Tangherlini. 2017. “The Tell-Tale Hat: Surfacing the Uncertainty in Folklore Classification.” Journal of Cultural Analytics. DOI: 10.7910/DVN/SYZ1PZ
  • Jianbo Gao, Matthew Jockers, John Laudun, and Timothy R. Tangherlini. 2017. “A multiscale theory for the dynamical evolution of sentiment in novels.” In, Proceedings of BESC 2016.
  • Lara McConnaughey, Jennifer Dai and David Bamman (2017), “The Labeled Segmentation of Printed Books,” EMNLP 2017.
  • Ida Storm, Holly Nicol, Georgia Broughton, Timothy R. Tangherlini. 2017. “Folklore Tracks: Historical GIS and Folklore Collection in 19th Century Denmark.” In, DH 2016. Proceedings of the International Symposium on Digital Humanities (Växjö, Sweden). Edited by Koraljka Golub and Marcelo Milrad. CEUR Workshop Proceedings, vol 20-21: 75-98.
  • Timothy R. Tangherlini, Vwani Roychowdhury, Beth Glenn, Catherine M. Crespi, Roja Bandari, Akshay Wadia, Misagh Falahi, Ehsan Ebrahimzadeh, Roshan Bastani. 2017. “‘Mommy Blogs’ and the Vaccination Exemption Narrative: Results from A Machine-Learning Approach for Story Aggregation on Parenting Social Media Sites.” JMIR Public Health and Surveillance 2(2): e166.


  • Smitha Milli and David Bamman (2016), “Beyond Canonical Texts: A Computational Analysis of Fanfiction,” EMNLP 2016.
  • Timothy R. Tangherlini, ed. 2016. “Computational Folkloristics.” Special issue of Journal of American Folklore 129(511).
  • Timothy R. Tangherlini and Peter M. Broadwell. 2016. “WitchHunter: GeoSemantic Browsing in a Large Folklore Corpus.” Journal of American Folklore 129(511): 14-42.


  • Timothy R. Tangherlini. 2015. “Facebook for Vikings: Social Network Analysis and Egil’s Saga.” In Egil, The Viking Poet: New Approaches to Egils saga. Edited by Karl Helgason, Laurence De Looze, Russell Poole, and Torfi Tulinius. Toronto Old Norse-Icelandic Series. Pp. 149-172. Toronto: University of Toronto Press.
  • Timothy R. Tangherlini, Aurelijus Vijūnas, Kryztof Urban, and Peter M. Broadwell. 2014. “IceMorph: An Automated Morphological Analyzer and English Language Look-up Tool for Old Icelandic”. Scandinavian Studies 86(4): 425-50.
  • Kryztof Urban, Timothy R. Tangherlini, Aurelijus Vijūnas, and Peter M. Broadwell. 2014. “Semi-Supervised Morphosyntactic Classification of Old Icelandic.” PLOS ONE. DOI: 10.1371/journal.pone.0102366
  • Timothy R. Tangherlini. 2014. “Big Folklore: The Archive in the Age of Big Data.” Arv: 193-197.
  • Timothy R. Tangherlini and Peter Leonard. 2013. “Trawling in the Sea of the Great Unread: Sub-Corpus Topic Modeling and Humanities Research”. Poetics 41(6): 725-749.
  • Timothy R. Tangherlini. 2013. “The Folklore Macroscope: Challenges for a Computational Folkloristics.” The 34th Archer Taylor Memorial Lecture. Western Folklore vol 72(1): 7-27.
  • Peter M. Broadwell and Timothy R. Tangherlini. 2013. “aiSelections: Computational Techniques for Matching Faculty Research Profiles to Library Acquisitions.” Proceedings of the 5th International Conference on Qualitative and Quantitative Methods in Libraries.
  • Peter Broadwell and Timothy R. Tangherlini. 2012. “TrollFinder: Geo-Semantic Exploration of a Very Large Corpus of Danish Folklore.” In Proceedings of LREC. Istanbul, Turkey.
  • James Abello, Peter Broadwell and Timothy R. Tangherlini. 2012. “Computational Folkloristics.” Communications of the Association for Computing Machinery. 55(7): 60-70.
  • Timothy R. Tangherlini. 2011. “Ghostly distribution: Applications from machine learning for belief tale research.” Proceedings of the International Society for Folk Narrative Research. Athens: The Academy of Athens.
  • Timothy R. Tangherlini, Todd Presner and Zoe Borovsky. 2008. “Thick Viewing: Integrated Visualization Environments for Humanities Research on Complex Corpora.” In Electronic Techtonics: Thinking at the Interface. Proceedings of the First International HASTAC Conference. Edited by Erin Ennis, et al. Pp. 130-137.
  • Timothy R. Tangherlini. 2003. “‘And All Anyone Heard’: Crystallization in Paramedic Storytelling.” In, Dynamics of Tradition: Perspectives on Oral Poetry and Folk Belief. Ed. Lotte Tarkka. Pp. 343-358. Studia Fennica Folkloristica 13.