Transcribing Audio and Video files with Automated Speech Recognition - TeSS (Training eSupport System)

Content provider

The GTN is a global community of teachers, trainers, developers, administrators, and learners based around the Galaxy Platform!

For Learners
Our community provides over 370 high quality, peer-reviewed training materials from a global community of 330 instructors, with a focus on reproducibility and FAIRness.

Not Just Galaxy
We've expanded our content to include data science, software development, and even alternative SciWMSs like snakemake!

For Teachers
All of our materials are FAIR, and we put a lot of effort into the platform's accessibility for all audiences.. If you're interested in contributing your own materials we'd love to help you share them with the world!

Most of our materials are licensed CC-BY-SA so you're welcome to use these materials for your own course and classes! If you want to use a specific version of a material you may be interested in accessing them via our archive.

For Administrators
If you're a budding Galaxy Administrator, or an experienced one looking to catch up, the GTN is the source for all of those materials. We also work to make it easy for you as an admin to support tutorials on your infrastructure, you'll find a list of tools and their installation instructions at the bottom of every tutorial.

e-learning

Transcribing Audio and Video files with Automated Speech Recognition

View material

Abstract

Audio and media files are a rich source in the social sciences and the humanities.

About This Material

This is a Hands-on Tutorial from the GTN which is usable either for individual self-study, or as a teaching material in a classroom.

Questions this will address

How can you convert audio and video files into written text?
How can you extract passages from certain speakers for further analysis?

Learning Objectives

Use WhisperX in Galaxy to convert your media to machine-readable text.
Use Regular Expressions (RegEx) to extract meaningful passages and clean the text.

Licence: Creative Commons Attribution 4.0 International

Keywords: Digital Humanities, audio, video

Competency level: • Beginner

Target audience: Students

Resource type: e-learning

Version: 1

Status: Active

Learning objectives:

Use WhisperX in Galaxy to convert your media to machine-readable text.
Use Regular Expressions (RegEx) to extract meaningful passages and clean the text.

Date modified: 2026-04-08

Date published: 2026-04-08

Authors: Daniela Schneider

Contributors: Armin Dadras, Saskia Hiltemann, Daniela Schneider

External resources:

Associated Workflows

Galaxy

Associated Training Datasets