diff --git a/workshop/_toc.yml b/workshop/_toc.yml index 96c9503..98dd616 100644 --- a/workshop/_toc.yml +++ b/workshop/_toc.yml @@ -9,8 +9,8 @@ chapters: - file: terminology - file: fair_os - file: current -- file: registration - file: example +- file: registration - file: exercises - file: collaboration - file: next diff --git a/workshop/collaboration.md b/workshop/collaboration.md deleted file mode 100644 index f735b17..0000000 --- a/workshop/collaboration.md +++ /dev/null @@ -1 +0,0 @@ -# Why and how getting involved diff --git a/workshop/current.md b/workshop/current.md index a1ba6d9..51ee244 100644 --- a/workshop/current.md +++ b/workshop/current.md @@ -2,8 +2,15 @@ ## Identifying current practices +## An additional burden for scientists or an opportunity + As a whole classroom or in small groups (3-4) depending on the total number of participants. +- What are the current practices? +- What are the current bottlenecks? +- Are FAIR and Open Science practices an additional burden or an opportunity ? + + The main goal of this exercise is to understand the current practices of researchers (how they effectively work in their lab) with a goal to express the main bottlenecks. - You can use the figure below that sketches the research life cycle: @@ -30,10 +37,5 @@ And you can also consider the following list of questions: - Is your University/organisation taking into account all your contributions or only scientific papers published in some journals? - How do you collaborate? With internal collaborators? With external collaborators? -And list the biggest issues in your work (we give some examples below): -- Too much administrative work and not enough time for my own research; -- Only scientific papers are taken into account by my managements so I cannot afford to waste time on FAIR and Open Science; -- Access to compute and storage resources appropriate for my work; -- ... diff --git a/workshop/example.md b/workshop/example.md index 23b4385..c189372 100644 --- a/workshop/example.md +++ b/workshop/example.md @@ -1,2 +1,59 @@ -# Work along session with an example +# Examples of services to realize Open Science in practice +To practice Open Science, we need to share data, code, tools, and results (see image below), early and definitely not at the end of a project. The goal of this section is to introduce some of the EOSC services that can help you to do that. + +## EOSC | Explore https://search.marketplace.eosc-portal.eu + +Browse through over 3 millions of research and innovation tools and services, thousands of datasets from a wide scale of research domains from renowned European service providers. + +## OpenAIRE | Explore https://explore.openaire.eu + + +A comprehensive and open dataset of research information covering 174M publications, 60M research data, 397K research software items, from 129K data sources, linked to 3M grants and 204K organizations. All linked together through citations and semantics. + +## ROHub, Research Data Management Hub https://www.rohub.org + +Research Objects are used to gather the data, code, tools and results so when publishing a paper, the RO DOI can be provided and would contain all the required information. + +## EGI Datahub https://datahub.egi.eu + + +DataHub allows you to bring data close to computing to exploit it efficiently, and to publish a dataset and make it available to a specific community, or worldwide, across federated sites. + + + +## Jupyter notebook services: + +### EGI notebooks https://notebooks.egi.eu/hub + +Notebooks is a browser-based tool for interactive analysis of data using EGI storage and compute services. + +- General purpose: +- Number of CPU Cores 1 - 8 +- Amount of RAM per CPU core 1 - 4 GB +- Local disk 10 - 40 GB +- Access type opportunistic - reserved + + +### EGI Replay https://replay.notebooks.egi.eu + + +EGI Replay allows to reproduce and share custom computing environments effortlessly. With Replay one can replicate the execution of their analysis in a notebooks-based platform ensuring that others can easily access and interact with the content. The service builds on BinderHub, and allows to build docker images from a Git repository and then makes them available through your browser with EGI resources (up to 6GB RAM, no hard limit on the session time per user, etc.). + +## European Galaxy Server https://usegalaxy.eu + + + +The European Galaxy server is an open web-based platform for data intensive research which provides access to computing and storage resources without charge, but also tools, training material and a Training Infrastructure as a Service (TIAaS). Taking it further EuroScienceGateway (http://www.eurosciencegateway.eu) is a project which aims at connecting Galaxy to large e-infrastructures in order to be able to distribute jobs across Europe (Pulsar Network). + + +## Pangeo@EOSC + +And you can also “become” a service provider. For instance, EOSC-Pangeo http://pangeo-eosc.vm.fedcloud.eu + + +The latest version of the Pangeo stack and, depending on your selection: +- A Jupyter notebook with Pangeo Python core packages (Xarray, Iris, Dask) +- A Machine Learning notebook (with Tensorflow and a GPU) +- A similar Pytorch notebook (also with GPU) +- A DataScience notebook (with also Julia and R) diff --git a/workshop/exercises.md b/workshop/exercises.md index 2056b44..8bcdc19 100644 --- a/workshop/exercises.md +++ b/workshop/exercises.md @@ -1 +1,6 @@ # Try it out yourself + +Select a service (for which the enrollment is “automatic”) and start exploring. Ask your instructor if you do not know where to start. + + + diff --git a/workshop/fair_os.md b/workshop/fair_os.md index 5c9c5c4..e1c9862 100644 --- a/workshop/fair_os.md +++ b/workshop/fair_os.md @@ -13,4 +13,4 @@ -## An additional burden for scientists or an opportunity + diff --git a/workshop/next.md b/workshop/next.md deleted file mode 100644 index c830877..0000000 --- a/workshop/next.md +++ /dev/null @@ -1,2 +0,0 @@ -# Discussion, summary and next steps -