Speaker
Description
Delving into the world of data-driven research presents exciting possibilities. However, there are a few topics that need to be addressed before one can jump right in.
Research data management (RDM) planning, software version control, automation, and data cleaning might appear dry at first glance, and they might indeed be to many. Despite these topics and practices seeming like ‘checkbox’ activities, expending some time actively thinking about them, and setting them up can save a lot of time and set you up for success in your research endeavour.
This workshop aims to introduce you to these topics. By the end of the workshop you will be prepared to independently self-teach these topics.
We will begin by exploring some academic research project examples and how RDM and RDM planning could be approached.
Moving on, I will introduce software version control via Git, with remotes on the developer platform, GitHub, and touching on working as a team on GitHub. This exercise will also explore automation via GitHub Actions and Docker.
We finish with data cleaning and wrangling concepts, and a practical demonstration in Python.
Participation by following along is encouraged but not expected.
Github management, workflows, exercises