e-learning

Transcribing Audio and Video files with Automated Speech Recognition

Abstract

Audio and media files are a rich source in the social sciences and the humanities.

About This Material

This is a Hands-on Tutorial from the GTN which is usable either for individual self-study, or as a teaching material in a classroom.

Questions this will address

  • How can you convert audio and video files into written text?
  • How can you extract passages from certain speakers for further analysis?

Learning Objectives

  • Use WhisperX in Galaxy to convert your media to machine-readable text.
  • Use Regular Expressions (RegEx) to extract meaningful passages and clean the text.

Licence: Creative Commons Attribution 4.0 International

Keywords: Digital Humanities, audio, video

Target audience: Students

Resource type: e-learning

Version: 1

Status: Active

Learning objectives:

  • Use WhisperX in Galaxy to convert your media to machine-readable text.
  • Use Regular Expressions (RegEx) to extract meaningful passages and clean the text.

Date modified: 2026-04-08

Date published: 2026-04-08

Authors: Daniela Schneider

Contributors: Armin Dadras, Saskia Hiltemann, Daniela Schneider


Activity log