Skip to content

Fix mddatasrc for all studyforrest datasets #62

@adswa

Description

@adswa

At the moment, the Studyforrest datasets hosted here on GitHub are all broken. The reason for this is a faulty special remote mddatasrc pointing to psydata.ovgu.de, which used to redirect to datapub.fz-juelich.de (where the data was migrated to), but was taken down recently. The first user issue that brought this problem to light is psychoinformatics-de/studyforrest-data-visualrois#6.

Although I've only probed a handful of repositories/subdatasets in this repo, I believe they all have a now broken mddatasrc special remote registered. I suggest we put in a coordinated effort to fixing this with as many people as possible. @bpoldrack outlined a fix for this issue in psychoinformatics-de/studyforrest-data-visualrois#6. Here's my translation for the general procedure that anyone can follow:

  1. Take a repo from the list below, tick if off so that others don't duplicate efforts, and clone it from GitHub.
  2. Check if you see errors about mddatasrc during cloning. If not, nevertheless try to retrieve data to make sure it all works. If everything works, move to the next dataset; if not, move to 3.
  3. As a first sanity check, investigate remote.log and make sure there is only one mddatasrc special remote (git cat-file -p git-annex:remote.log is the command to do it). If there are two, leave a note, and move to the next dataset for now.
  4. Make a note of the UUID of the mddatasrc special remote in remote.log
  5. Go to https://datapub.fz-juelich.de/studyforrest/studyforrest/ and find the folder that corresponds to the dataset you're handling. The names aren't always identical, but should be easily inferable. If unsure, compare directory contents and filenames. If you can't find a corresponding directory, ask for help in the chat. Make a note of the URL (e.g., https://datapub.fz-juelich.de/studyforrest/studyforrest/aligned) and append /.git
  6. In the cloned dataset, remove the git remote mddatasrc using git remote remove mddatasrc
  7. Fix the special remote mddatasrc using its UUID as an identifier, and the URL you constructed from datapub.fz-juelich.de (see example below) to fix the location information:
git annex enableremote 7dd5970d-cee5-404e-a3be-6430ec03657f   location=https://datapub.fz-juelich.de/studyforrest/studyforrest/aligned/.git  
  1. Retrieve a file using datalad get to confirm that this fix worked, and retrieval from mddatasrc is possible again
  2. The fix caused an update in the git-annex branch. datalad push the changes back to GitHub. There is no need (or possibility) to do a pull request. Make sure that the git-annex branch gets successfully pushed. If you run into permission errors, seek help in the chat.
  3. After pushing, re-clone the repo, and retry data retrieval. If things don't work, add a comment to this issue and seek help.

List of repositories:

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions