-
Notifications
You must be signed in to change notification settings - Fork 2
Fix mddatasrc for all studyforrest datasets #62
Description
At the moment, the Studyforrest datasets hosted here on GitHub are all broken. The reason for this is a faulty special remote mddatasrc pointing to psydata.ovgu.de, which used to redirect to datapub.fz-juelich.de (where the data was migrated to), but was taken down recently. The first user issue that brought this problem to light is psychoinformatics-de/studyforrest-data-visualrois#6.
Although I've only probed a handful of repositories/subdatasets in this repo, I believe they all have a now broken mddatasrc special remote registered. I suggest we put in a coordinated effort to fixing this with as many people as possible. @bpoldrack outlined a fix for this issue in psychoinformatics-de/studyforrest-data-visualrois#6. Here's my translation for the general procedure that anyone can follow:
- Take a repo from the list below, tick if off so that others don't duplicate efforts, and clone it from GitHub.
- Check if you see errors about
mddatasrcduring cloning. If not, nevertheless try to retrieve data to make sure it all works. If everything works, move to the next dataset; if not, move to 3. - As a first sanity check, investigate
remote.logand make sure there is only onemddatasrcspecial remote (git cat-file -p git-annex:remote.logis the command to do it). If there are two, leave a note, and move to the next dataset for now. - Make a note of the UUID of the
mddatasrcspecial remote inremote.log - Go to https://datapub.fz-juelich.de/studyforrest/studyforrest/ and find the folder that corresponds to the dataset you're handling. The names aren't always identical, but should be easily inferable. If unsure, compare directory contents and filenames. If you can't find a corresponding directory, ask for help in the chat. Make a note of the URL (e.g., https://datapub.fz-juelich.de/studyforrest/studyforrest/aligned) and append
/.git - In the cloned dataset, remove the git remote
mddatasrcusinggit remote remove mddatasrc - Fix the special remote
mddatasrcusing its UUID as an identifier, and the URL you constructed from datapub.fz-juelich.de (see example below) to fix the location information:
git annex enableremote 7dd5970d-cee5-404e-a3be-6430ec03657f location=https://datapub.fz-juelich.de/studyforrest/studyforrest/aligned/.git
- Retrieve a file using
datalad getto confirm that this fix worked, and retrieval frommddatasrcis possible again - The fix caused an update in the git-annex branch.
datalad pushthe changes back to GitHub. There is no need (or possibility) to do a pull request. Make sure that the git-annex branch gets successfully pushed. If you run into permission errors, seek help in the chat. - After pushing, re-clone the repo, and retry data retrieval. If things don't work, add a comment to this issue and seek help.
List of repositories:
- https://github.com/psychoinformatics-de/studyforrest-data-phase2
- https://github.com/psychoinformatics-de/studyforrest-data-structural
- https://github.com/psychoinformatics-de/studyforrest-data-freesurfer
- https://github.com/psychoinformatics-de/studyforrest-data-confoundsannotation (no
mddatasrcerror) - https://github.com/psychoinformatics-de/studyforrest-data-perceivedemotions (no
mddatasrcerror) - https://github.com/psychoinformatics-de/studyforrest-data-eyemovementlabels
- https://github.com/psychoinformatics-de/studyforrest-data-multires3t
- https://github.com/psychoinformatics-de/studyforrest-data-aggregate
(nofixed!mddatasrcerror; connection errors duringcloneandget) - https://github.com/psychoinformatics-de/studyforrest-data-aligned
- https://github.com/psychoinformatics-de/studyforrest-data-annotations (no
mddatasrcerror) - https://github.com/psychoinformatics-de/studyforrest-data-visualrois Has 2 special remotes
mddatasrc, leave for later! - https://github.com/psychoinformatics-de/studyforrest-data-templatetransforms (fixed, but some get "impossible" and "errors" remain. See
datalad get sub-01) - https://github.com/psychoinformatics-de/studyforrest-data-retinotopy
- https://github.com/psychoinformatics-de/studyforrest-data-multires7t
- https://github.com/psychoinformatics-de/studyforrest-data-phase2-denoised (no
mddatasrcerror; getting.gzfiles works, but.txtfiles look like availability was never pushed. Seesub-05/ses-movie/func/sub-05_ses-movie_task-movie_run-1_desc-MELODICSm5_componentLabels.txt) -> this dataset was an external contribution, and has been replaced by a fork of the corresponding maintained openneuro dataset (https://github.com/psychoinformatics-de/studyforrest-data-phase2-denoised_openneuro)