
The Handbook¶
Welcome!
This handbook is a living resource about why and – more importantly – how to use DataLad. It aims to provide novices and advanced users of all backgrounds with both the basics of DataLad and start-to-end use cases of specific applications. If you want to get hands-on experience and learn DataLad, the Basics part of this book will teach you. If you want to know what is possible, the use cases will show you. And if you want to help others to get started with DataLad, the companion repository provides free and open source teaching material tailored to the handbook.
Before you read on, please note that the handbook is based on DataLad version 0.12, but the section Installation and configuration will set you up with what you need if you currently do not have DataLad 0.12 or higher installed.
If you’re new here, please start the handbook here. Alternatively, try to identify with one of several user-types in this user specific guide to the handbook.
Important
PLEASE NOTE: This is an archived version of the DataLad handbook If you would be willing to provide feedback on its contents, please corresponding to its 0.14 version (February 2021 - September 2021), which in turn was corresponding to the 0.13.x series of DataLad.
This handbook version is not a complete documentation of all functionality in DataLad 0.13, but the state the handbook was in at this time. Find the latest released version of the handbook at handbook.datalad.org/en/stable, and its most recent version (including general fixes, visual improvements, and additions of existing commands or workflows based on existing functionality) at handbook.datalad.org/en/latest. The CHANGELOG summarizes the contents and additions that happened between Handbook versions.
What this is all about
The fundamentals of DataLad
Beyond the Basics
Hands-on real-world applications with step-by-step recipes
- Use cases
- A typical collaborative data management workflow
- Basic provenance tracking
- Writing a reproducible paper
- Student supervision in a research project
- An automatically and computationally reproducible neuroimaging analysis from scratch
- Scaling up: Managing 80TB and 15 million files from the HCP release
- Building a scalable data storage for scientific computing
- Using Globus as a data store for the Canadian Open Neuroscience Portal
- DataLad for reproducible machine-learning analyses
- Contributing
Appendix¶
Further information and references
- Glossary
- Frequently Asked Questions
- DataLad cheat sheet
- Contributing
- Teaching with the DataLad Handbook
- Acknowledgements
- Tell me what you are and I tell you where to start
- Handbook Poster from the 2020 (virtual) OHBM
- OpenNeuro Quickstart Guide: Accessing OpenNeuro datasets via DataLad
- So… Windows… eh?
Code lists from chapters¶
Easy access to copy-paste snippets for workshops
- About code lists
- Code from chapter: 01_dataset_basics
- Code from chapter: 02_reproducible_execution
- Code from chapter: 10_yoda
- OHBM Brainhack TrainTrack: DataLad
- OHBM 2020 Open Science Room: Reproducible Research Objects with DataLad
- An introduction to DataLad with a focus on ML
- DataLad tutorial at the MPI Leipzig
- An introduction to DataLad at the MPI Berlin
- How to use DataLad
- An introduction to DataLad for the ABCD ReproNim course week 8b
- How to use DataLad