Skip to main content Skip to docs navigation

Research, Scholarly, and Creative Achievements

Check out my Google Scholar profile.

Journal Articles

  1. , , , , , , , , , and . . Building datasets to support information extraction and structure parsing from electronic theses and dissertations.” International Journal on Digital Libraries. 10.1007/s00799-024-00395-4
  2. , , , and . . Teaching Natural Language Processing through Big Data Text Summarization with Problem-Based Learning.” Data and Information Management, Vol. 4 (1), pp. 1843. 10.2478/dim-2020-0003
  3. , , and . . Summarizing ETDs with deep learning.” Cadernos BAD, Vol. 1 , pp. 4652. 10.48798/cadernosbad.2014
  4. , , , , , , , , and . . Overly Honest Data Repository Development.” The Code4Lib Journal, (34).
  5. , , , , and . . Developments in Digital Preservation at the University of Illinois: The Hub and Spoke Architecture for Supporting Repository Interoperability and Emerging Preservation Standards.” Library Trends, Vol. 57 (3), pp. 556579. 10.1353/lib.0.0052

Book Chapters

  1. and . . Archives, Digital Search, and AI Ethics.” In The Routledge Companion to Libraries, Archives, and the Digital Humanities, edited by Isabel Galina Russell and Glen Layne-Worthey. Routledge, pp. 479492. 10.4324/9781003327738-38

Conference Papers

  1. , , , , , , , and . . Integrated Digital Library System for Long Documents and their Elements.” In Proceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries (JCDL ’23), Santa Fe, New Mexico, USA, pp. 1324. Nominated for Best Student Paper Award. 10.1109/JCDL57899.2023.00012
  2. , , , , , and . . MetaEnhance: Metadata Quality Improvement for Electronic Theses and Dissertations of University Libraries.” In Proceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries (JCDL ’23), Santa Fe, New Mexico, USA, pp. 6165. Best Short Paper Award. 10.1109/JCDL57899.2023.00019
  3. , , , , and . . Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations.” In Proceedings of the 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL ’21), Virtual Event, pp. 230233. 10.1109/JCDL52503.2021.00066

Workshop Papers

  1. , , , , and . . A New Annotation Method and Dataset for Layout Analysis of Long Documents.” In Companion Proceedings of the ACM Web Conference 2023 (WWW ’23 Companion), Austin, TX, USA, pp. 834842. As part of 3rd International Workshop on Scientific Knowledge Representation, Discovery, and Assessment (Sci-K 2023). 10.1145/3543873.3587609
  2. , , , , , , and . . A Study of Computational Reproducibility Using URLs Linking to Open Access Datasets and Software.” In Companion Proceedings of the Web Conference 2022 (WWW ’22 Companion), Virtual Event, Lyon, France, pp. 784788. As part of Sci-K 2022 - International Workshop on Scientific Knowledge: Representation, Discovery, and Assessment. 10.1145/3487553.3524658
  3. , , , and . . Applications of Data Analysis on Scholarly Long Documents.” In 2022 IEEE International Conference on Big Data (Big Data ’22), Osaka, Japan, pp. 24732481. As part of The 7th Computational Archival Science (CAS) Workshop. 10.1109/BigData55660.2022.10020935

Extended Abstracts

  1. , , and . . Maximizing Equitable Reach and Accessibility of ETDs.” In Proceedings of the 23rd ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL ’23), Santa Fe, New Mexico, USA, pp. 256257. Poster Presentation. 10.1109/JCDL57899.2023.00049
  2. , , , , and . . AI and Public Archives: Collaborative Leadership for Responsible Adoption.” In Proceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries (JCDL ’23), Santa Fe, New Mexico, USA, pp. 323324. Panel Discussion. 10.1109/JCDL57899.2023.00079
  3. , , and . . Electronic Theses and Dissertations: A Research Corpus of Scholarly Big Data.” Presented at GL24 — Twenty-Fourth International Conference on Grey Literature, GreyNet International.
  4. , , , , , and . . Analyzing and Navigating ETDs Using Topic Models.” Presented at 25th International Symposium on Electronic Theses and Dissertations, Novi Sad, Serbia.
  5. , , , and . . Applications of Mining ETDs.” Presented at 24th International Symposium on Electronic Theses and Dissertations, Abu Dhabi, UAE.
  6. and . . Why and How We Went Serverless, and How You Can Too.” Presented at CNI: Coalition for Networked Information Spring 2021 Membership Meeting.
  7. . . Mining ETDs for Trends in Graduate Research.” Presented at CNI: Coalition for Networked Information Fall 2020 Membership Meeting.
  8. . . Bringing Computational Access to Book-length Documents Via an ETD Pilot.” Presented at CNI: Coalition for Networked Information Fall 2019 Membership Meeting.

Tutorials

  1. and . . Preparing Code and Data for Computational Reproducibility.” (Half-day tutorial). In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020 (JCDL ’20), Virtual Event, China, pp. 565566. 10.1145/3383583.3398714
  2. and . . Introduction to Digital Libraries.” (Half-day tutorial). In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020 (JCDL ’20), Virtual Event, China, pp. 567568. 10.1145/3383583.3398501
  3. and . . Preparing Code and Data for Computational Reproducibility: A Hands-On Workshop.” (Half-day tutorial). Presented at 22nd International Symposium on Electronic Theses and Dissertations.

Workshops (Hosting/Organizing)

  1. Leading the Future of AI and Public Archives: Toward a Shared AI Ethics Framework.” November 16, 2022–November 17, 2022. Virtual Workshop. William A. Ingram, Virginia Tech University Libraries; Sylvester A. Johnson, Virginia Tech Center for Humanities; Abigail Potter, Library of Congress Labs; Meghan Ferriter, Library of Congress Labs; Rebecca Dikow, Smithsonian OCIO Data Science Lab; Mike Trizna, Smithsonian OCIO Data Science Lab; and Jill Reilly, National Archives Office of Innovation. Part two of a workshop series aimed at developing a shared AI Ethics Framework for galleries, libraries, archives, and museums. Keynote speakers included Afua Bruce and Isaac Johnson. Activities focused on creating an institutional AI ethics statement and operationalizing AI in LAMs. .
  2. Leading the Future of AI and Public Archives.” May 6, 2022. Virtual Workshop. William A. Ingram, Virginia Tech University Libraries; Sylvester A. Johnson, Virginia Tech Center for Humanities; Abigail Potter, Library of Congress; Meghan Ferriter, Smithsonian; and Jill Reilly, National Archives Office of Innovation. Part one of a workshop series aimed at leaders and collaborators from institutions with public digital collections and archives programs. Activities included a leadership roundtable, lessons learned, problem definition, and action priority matrix. Keynote speaker: Elham Tabassi. .
  3. Ensuring Scholarly Access to Government Archives and Records.” April 16, 2021–May 7, 2021. Virtual Workshop. William A. Ingram, Virginia Tech University Libraries and Sylvester A. Johnson, Virginia Tech Center for Humanities. Virginia Tech and NARA convened archivists, librarians, humanists, technologists, and scientists for a set of five weekly workshops to plan for ensuring future access to government records through AI and machine learning. Sponsored by the Andrew W. Mellon Foundation. .

Other Reports and White Papers

  1. and . . Ensuring Scholarly Access to Government Archives and Records.” Virginia Tech. . Sponsored by The Andrew W. Mellon Foundation.
  2. , , , , and . . Classification and Extraction of Information from ETD Documents.” Virginia Tech. .
  3. , , , , , and . . Big Data Text Summarization: Using Deep Learning to Summarize Theses and Dissertations.” Virginia Tech. .