Replies: 12 comments 7 replies
-
|
quite fuzzy vision, inspired by https://colinraffel.com/blog/a-call-to-build-models-like-we-build-open-source-software.html: "Can we manage scientific data as open source software?" |
Beta Was this translation helpful? Give feedback.
-
|
what are the incentives for maintaining something like this? Had a couple of proposals killed because "it is not science". How can we, as a community, fix the incentive structure for the developers and maintainers of data infrastructure? |
Beta Was this translation helpful? Give feedback.
-
|
On the adoption front, training/education plays a big part. What standards/guidelines/technologies/tools would you teach new undergrads today? |
Beta Was this translation helpful? Give feedback.
-
|
Also: what are the minimum technical meta-requirements for a standard/ontology/schema/data model to be adopted?
I think these could apply to things we typically think of as standards, but also simply datasets/databases published on the web. Does adoption mean something different for software, specifications and data? |
Beta Was this translation helpful? Give feedback.
-
|
I guess for experimental data is is a large challenge but some groups have dome something to get there. There (used to be) HTE-MEAD, then there is HTEM and others. At my university we have some thing called KADI4MAT. Most ELN are append only which would make them amendable for something like merge requests etc. What I find more important though and what could be a future vision is that you absolutely need to store the analysis and plotting code alongside your (experimental) analysed data. Gold standard would probably be a versioned docker container of some micro data analysis and plotting service with that you can just rerun the analysis of old data. |
Beta Was this translation helpful? Give feedback.
-
|
Might be a bit too philosophical, but I really like how Nielsen phrases the problem as collective action problem. The way to overcome this according to Olson is:
and it seems also intuitive that these would be the key ingredients for something that works in the sciences. an interesting tangent then is, however, how this plays with the cathedral and bazar can we come up with the key elements of good governance for such data infrastructure in the sciences? Completely disorganized does not work, but also completely centralized does not work. |
Beta Was this translation helpful? Give feedback.
-
|
From @jschrier's keynote:
|
Beta Was this translation helpful? Give feedback.
-
|
From chat (Michael)
|
Beta Was this translation helpful? Give feedback.
-
|
From Chat (Chloe)
|
Beta Was this translation helpful? Give feedback.
-
|
in chat (Kenneth)
|
Beta Was this translation helpful? Give feedback.
-
|
@shyamd from chat
|
Beta Was this translation helpful? Give feedback.
-
|
As I just mentioned in the breakout session, I think an interesting model to look at is the one of 2i2c that is developing models to make sustainable the development and deployment of open-source infrastructure for interactive computing in research and education (mostly coming from the Jupyter ecosystem, but I think they are open to more). Pinging here the director Chris Holdgraf, so he's aware of this workshop, and also he might be a person to be in contact with for further discussions: @choldgraf |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
This thread is for discussing, finalizing and organizing a breakout with the theme in the title above.
Link to current description
Beta Was this translation helpful? Give feedback.
All reactions