Data Carpentry – Genomics! August 19 & 20

RC is proud to be one of the few sites hosting these innovative workshops. The aim of Data Carpentry is to teach researchers basic concepts, skills, and tools for working with data so that they can get more done in less time and with less pain. Data Carpentry is a sister organization to the highly successful Software Carpentry.

This workshop focuses on managing and analyzing data for genomics research. We will cover:
- metadata organization
- connecting to and using a high-performance computing environment (Odyssey)
- the command line for sequence quality control and bioinformatics workflows
- automating your workflows for scaling on HPC systems
- R for data analysis and visualization

We will not teach any particular bioinformatics tools (though we will use several during the exercises). Instead, we will teach the foundational skills that will allow you to conduct any analysis and to analyze the output of a genomics pipeline.

Information on the workshop itself and the materials covered can be found on our FASRC Workshop page. There are a limited number of slots available, though we may run this workshop several times over the course of the next year.

When: Wednesday & Thursday, August 19 & 20

What: Project planning; Basic automation and text transformations with the Unix shell; Data Quality Assessment; Scaling Analyses and Visualization; and Utilizing Cloud/HPC Resources.

Who: 40 slots for RC users. The workshop is taught by experienced teachers and helpers.

Cost: $25 registration fee to cover coffee break snacks and beverages.

Where: Jefferson Hall 356, Harvard Main Campus

Sign-up: First-come, first-served; preference will be given to eligible RC account holders and Odyssey cluster users. If this workshop is successful, we will run it again over the next year.

Register: Starting June 22nd, on our EventBrite page