Skip to content

Commit

Permalink
Merge pull request #104 from PennLINC/update-datalad
Browse files Browse the repository at this point in the history
update datalad
  • Loading branch information
mattcieslak authored Nov 12, 2024
2 parents 2ff2f68 + 7f618c9 commit 26dd6d1
Showing 1 changed file with 10 additions and 5 deletions.
15 changes: 10 additions & 5 deletions docs/DataWorkflows/Datalad.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ nav_order: 0
has_toc: false
---

[Datalad](https://handbook.datalad.org/en/latest/) is basically GitHub for data.
[Datalad](https://handbook.datalad.org/en/latest/) is basically git for data.

Building on top of Git and git-annex, DataLad allows you to version control arbitrarily large files in datasets, without the need for custom data structures, central infrastructure, or third party services.

Expand All @@ -19,11 +19,16 @@ Building on top of Git and git-annex, DataLad allows you to version control arbi

### Installing Datalad

1. The best way to install datalad on HPC systems like cubic is using conda. First, make sure Miniconda is installed in your project folder (see instructions [here](https://pennlinc.github.io/docs/cubic#installing-miniconda-in-your-project-the-hard-way)).
1. The best way to install datalad on HPC systems like cubic is using conda. First, make sure Miniforge is installed in your project folder (see instructions [here](https://pennlinc.github.io/docs/cubic#installing-miniforge-in-your-project)).
2. Then, create an environment for this:
`conda create -n dlad python=3.10`
`conda activate dlad`
```bash
conda create -n dlad python=3.11
conda activate dlad
```
3. Then, install datalad:
`conda install -c conda-forge datalad`
```bash
conda install datalad git git-annex
pip install --upgrade datalad # datalad from conda-forge might not be the most update
```

Note that this page is still under construction and more information may be added at a later date.

0 comments on commit 26dd6d1

Please sign in to comment.