Skip to content

Commit

Permalink
clean training material
Browse files Browse the repository at this point in the history
  • Loading branch information
annefou committed Jan 10, 2024
1 parent 34b62d6 commit 267aa89
Show file tree
Hide file tree
Showing 7 changed files with 72 additions and 11 deletions.
2 changes: 1 addition & 1 deletion workshop/_toc.yml
Original file line number Diff line number Diff line change
Expand Up @@ -9,8 +9,8 @@ chapters:
- file: terminology
- file: fair_os
- file: current
- file: registration
- file: example
- file: registration
- file: exercises
- file: collaboration
- file: next
Expand Down
1 change: 0 additions & 1 deletion workshop/collaboration.md

This file was deleted.

12 changes: 7 additions & 5 deletions workshop/current.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,8 +2,15 @@

## Identifying current practices

## An additional burden for scientists or an opportunity

As a whole classroom or in small groups (3-4) depending on the total number of participants.

- What are the current practices?
- What are the current bottlenecks?
- Are FAIR and Open Science practices an additional burden or an opportunity ?


The main goal of this exercise is to understand the current practices of researchers (how they effectively work in their lab) with a goal to express the main bottlenecks.

- You can use the figure below that sketches the research life cycle:
Expand All @@ -30,10 +37,5 @@ And you can also consider the following list of questions:
- Is your University/organisation taking into account all your contributions or only scientific papers published in some journals?
- How do you collaborate? With internal collaborators? With external collaborators?

And list the biggest issues in your work (we give some examples below):
- Too much administrative work and not enough time for my own research;
- Only scientific papers are taken into account by my managements so I cannot afford to waste time on FAIR and Open Science;
- Access to compute and storage resources appropriate for my work;
- ...


59 changes: 58 additions & 1 deletion workshop/example.md
Original file line number Diff line number Diff line change
@@ -1,2 +1,59 @@
# Work along session with an example
# Examples of services to realize Open Science in practice
To practice Open Science, we need to share data, code, tools, and results (see image below), early and definitely not at the end of a project. The goal of this section is to introduce some of the EOSC services that can help you to do that.

## EOSC | Explore https://search.marketplace.eosc-portal.eu

Browse through over 3 millions of research and innovation tools and services, thousands of datasets from a wide scale of research domains from renowned European service providers.

## OpenAIRE | Explore https://explore.openaire.eu


A comprehensive and open dataset of research information covering 174M publications, 60M research data, 397K research software items, from 129K data sources, linked to 3M grants and 204K organizations. All linked together through citations and semantics.

## ROHub, Research Data Management Hub https://www.rohub.org

Research Objects are used to gather the data, code, tools and results so when publishing a paper, the RO DOI can be provided and would contain all the required information.

## EGI Datahub https://datahub.egi.eu


DataHub allows you to bring data close to computing to exploit it efficiently, and to publish a dataset and make it available to a specific community, or worldwide, across federated sites.



## Jupyter notebook services:

### EGI notebooks https://notebooks.egi.eu/hub

Notebooks is a browser-based tool for interactive analysis of data using EGI storage and compute services.

- General purpose:
- Number of CPU Cores 1 - 8
- Amount of RAM per CPU core 1 - 4 GB
- Local disk 10 - 40 GB
- Access type opportunistic - reserved


### EGI Replay https://replay.notebooks.egi.eu


EGI Replay allows to reproduce and share custom computing environments effortlessly. With Replay one can replicate the execution of their analysis in a notebooks-based platform ensuring that others can easily access and interact with the content. The service builds on BinderHub, and allows to build docker images from a Git repository and then makes them available through your browser with EGI resources (up to 6GB RAM, no hard limit on the session time per user, etc.).

## European Galaxy Server https://usegalaxy.eu



The European Galaxy server is an open web-based platform for data intensive research which provides access to computing and storage resources without charge, but also tools, training material and a Training Infrastructure as a Service (TIAaS). Taking it further EuroScienceGateway (http://www.eurosciencegateway.eu) is a project which aims at connecting Galaxy to large e-infrastructures in order to be able to distribute jobs across Europe (Pulsar Network).


## Pangeo@EOSC

And you can also “become” a service provider. For instance, EOSC-Pangeo http://pangeo-eosc.vm.fedcloud.eu


The latest version of the Pangeo stack and, depending on your selection:
- A Jupyter notebook with Pangeo Python core packages (Xarray, Iris, Dask)
- A Machine Learning notebook (with Tensorflow and a GPU)
- A similar Pytorch notebook (also with GPU)
- A DataScience notebook (with also Julia and R)

5 changes: 5 additions & 0 deletions workshop/exercises.md
Original file line number Diff line number Diff line change
@@ -1 +1,6 @@
# Try it out yourself

Select a service (for which the enrollment is “automatic”) and start exploring. Ask your instructor if you do not know where to start.



2 changes: 1 addition & 1 deletion workshop/fair_os.md
Original file line number Diff line number Diff line change
Expand Up @@ -13,4 +13,4 @@
<!-- Google embed ends -->
</div>

## An additional burden for scientists or an opportunity

2 changes: 0 additions & 2 deletions workshop/next.md

This file was deleted.

0 comments on commit 267aa89

Please sign in to comment.