Parallel Computing with R using SPRINT on post-genomic data @ EPCC
Date: 29 - 30 May 2014
Please note the course will be held in Wales:
Digital Technium Lab 002
University of Swansea
Singleton Park
Swansea SA2 8PP
Description
SPRINT provides easy access to high-performance computing (HPC) for the analysis of high-throughput post-genomic data using the statistical programming language R. SPRINT contains several already parallelised functions of use in statistical analysis, machine learning pattern analysis and processing of biological “big data” (e.g. microarrays, next-generation sequencing and high-throughput screens). It lets R users exploit HPC without having to write parallel code themselves.
This course gives an introduction to HPC (what it is, how to access it, and complications of use), followed by a detailed introduction to SPRINT, practical use cases and tutorials with SPRINT on an HPC platform.
Pre-requisites
Basic familiarity with running statistical analyses in R.
Basic familiarity with analysis of large biological data sets (e.g. microarrays or next-generation sequencing).
Basic ability to program.
Learning Outcomes
Ability to use SPRINT to speed up specific R statistical analyses on an HPC platform.
An appreciation of HPC concepts applicable to post-genomic analyses.
Event types:
- Workshops and courses