From 6d7b1016514722a15037f966d2159d27285742af Mon Sep 17 00:00:00 2001
From: Jonny Pearson
Date: Wed, 1 May 2024 16:11:35 +0100
Subject: [PATCH 1/9] Addition of PhDIntern Page

---
 docs/PhDInterns.md | 29 +++++++++++++++++++++++++++++
 mkdocs.yml         |  1 +
 2 files changed, 30 insertions(+)
 create mode 100644 docs/PhDInterns.md

diff --git a/docs/PhDInterns.md b/docs/PhDInterns.md
new file mode 100644
index 00000000..45f0d343
--- /dev/null
+++ b/docs/PhDInterns.md
@@ -0,0 +1,29 @@
+---
+hide:
+  - navigation
+---
+
+# PhD Intern Scheme
+
+Our internships are aimed at current PhD students looking for an industrial placement of around five months with the right to work in the UK. The projects are focussed on innovation, in particular around getting the most value out of NHS data.
+
+The projects often have a focus on emerging data science techniques and so we advertise mainly to data science programmes, however previous interns have come from other disciplines such as clinical, mathematics, computer science and bioinformatics, which have added huge value through the range of approaches and knowledge.
+
+For more information and details on how to apply see the [Scheme Overview](https://nhsx.github.io/nhsx-internship-projects/overview.html) page on the microsite
+
+For details on open projects see the [Projects](https://nhsx.github.io/nhsx-internship-projects/projects.html) page on the microsite
+
+Available outputs from previous projects can also be seen at [Previous Projects](https://nhsx.github.io/nhsx-internship-projects/completed.html) on the microsite
+
+### Current Projects
+Currently our interns are working on the following projects in two waves. These are the original briefs they applied to and their work and outputs will be available on our organisation GitHub.
+
+
+
+|**Wave 6** | February - July 2024 |
+|---|---|
+|![area](https://img.shields.io/badge/NLP-yellow)   ![technique](https://img.shields.io/badge/Curation-lightgrey)   ![datamodality](https://img.shields.io/badge/MultiModal-blueviolet)|[NHS Language Corpus Extension](https://nhsx.github.io/nhsx-internship-projects/nhs-language-corpus/)|
+|![area](https://img.shields.io/badge/Explainability-orange)   ![technique](https://img.shields.io/badge/Fairness-lightgrey)   ![datamodality](https://img.shields.io/badge/MultiModal-blueviolet) | [Understanding Fairness and Explainability in Multi-modal Approaches within Healthcare](https://nhsx.github.io/nhsx-internship-projects/advances-modalities-explainability/)|
+|**Wave 7**| July - December 2024 |
+|![area](https://img.shields.io/badge/NLP-yellow)   ![technique](https://img.shields.io/badge/NER-lightgrey)   ![datamodality](https://img.shields.io/badge/Text-9cf) |[Evaluating NER-focussed models and LLMs for identifying key entities in histopathology reports – working with GOSH DRIVE](https://nhsx.github.io/nhsx-internship-projects/eval-ner-histopathology-goshdrive/)|
+|![area](https://img.shields.io/badge/NLP-yellow)   ![technique](https://img.shields.io/badge/Foundation-lightgrey)   ![datamodality](https://img.shields.io/badge/MultiModal-blueviolet) | [Investigating Privacy Concerns and Mitigations for Healthcare Language and Foundation Models](https://nhsx.github.io/nhsx-internship-projects/language-foundation-privacy-concern-mitigation/)|
diff --git a/mkdocs.yml b/mkdocs.yml
index 18a2f243..0acc78be 100644
--- a/mkdocs.yml
+++ b/mkdocs.yml
@@ -19,6 +19,7 @@ nav:
   - What is Data Science?:
     - ... | flat | what_is_data_science/*.md
   - About the team: about.md
+  - PHDInterns: PhDInterns.md
   - Articles:
     - ... | flat | articles/*.md
   - Projects:
From d4a5e478ed8a32610057b6235cb04c51cfa9a1a0 Mon Sep 17 00:00:00 2001
From: Jonny Pearson
Date: Mon, 13 May 2024 09:24:41 +0100
Subject: [PATCH 2/9] Move page and minor correction to "interns"

---
 docs/PhDInterns.md | 2 +-
 mkdocs.yml         | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/PhDInterns.md b/docs/PhDInterns.md
index 45f0d343..c1c76d54 100644
--- a/docs/PhDInterns.md
+++ b/docs/PhDInterns.md
@@ -3,7 +3,7 @@ hide:
   - navigation
 ---
 
-# PhD Intern Scheme
+# PhD Internership Scheme
 
 Our internships are aimed at current PhD students looking for an industrial placement of around five months with the right to work in the UK. The projects are focussed on innovation, in particular around getting the most value out of NHS data.
 
diff --git a/mkdocs.yml b/mkdocs.yml
index 0acc78be..61305470 100644
--- a/mkdocs.yml
+++ b/mkdocs.yml
@@ -19,7 +19,6 @@ nav:
   - What is Data Science?:
     - ... | flat | what_is_data_science/*.md
   - About the team: about.md
-  - PHDInterns: PhDInterns.md
   - Articles:
     - ... | flat | articles/*.md
   - Projects:
@@ -222,6 +221,7 @@ nav:
     - ... | flat | regex=our_work/(?!template-project\.md).*\.md
   # - Tools:
     # - ... | flat | tools/*.md
+  - PHDInterns: PhDInterns.md
   - Playbooks: playbooks.md
   # - Codebases: codebases.md
   - Useful links: useful_links.md
From 152ff2fc7d04601ab93f8ef4057da9c24af7c3fe Mon Sep 17 00:00:00 2001
From: amaiaita <114224821+amaiaita@users.noreply.github.com>
Date: Tue, 14 May 2024 07:54:51 +0100
Subject: [PATCH 3/9] Update template-project.md

---
 docs/our_work/template-project.md | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/docs/our_work/template-project.md b/docs/our_work/template-project.md
index 4e565f15..6a6a3064 100644
--- a/docs/our_work/template-project.md
+++ b/docs/our_work/template-project.md
@@ -29,5 +29,7 @@ Case Study|
 Technical report|[e.g. biorxiv.org]()
 Algorithmic Impact Assessment|e.g. N/A
 
+When you finish creating your project page, make sure you add it into the appropriate navigation sections in the [mkdocs.yml](https://github.com/nhsengland/datascience/blob/main/mkdocs.yml) for website navigation purposes!
+
 [comment]: <> (The below header stops the title from being rendered (as mkdocs adds it to the page from the "title" attribute) - this way we can add it in the main.html, along with the summary.)
 #
From a8d12c3111e5fab18dc49906689032134c296c09 Mon Sep 17 00:00:00 2001
From: Harriet Sands
Date: Tue, 14 May 2024 10:22:44 +0000
Subject: [PATCH 4/9] Makes capitalisation of Data Science, Data Scientists, Analysts and Engineers consistent.

---
 .../Benefits of Data Science in the NHS.md  |  2 +-
 .../How you can learn Data Science.md       |  4 ++--
 docs/what_is_data_science/index.md          | 10 +++++-----
 3 files changed, 8 insertions(+), 8 deletions(-)

diff --git a/docs/what_is_data_science/Benefits of Data Science in the NHS.md b/docs/what_is_data_science/Benefits of Data Science in the NHS.md
index 881f1d28..142a8eaf 100644
--- a/docs/what_is_data_science/Benefits of Data Science in the NHS.md
+++ b/docs/what_is_data_science/Benefits of Data Science in the NHS.md
@@ -1,6 +1,6 @@
 # Benefits of Data Science in the NHS
 
-Data Science can be helpful in [solving a number of problems](./index.md). Specifically, this could lead to benefits such as:
+Data science can be helpful in [solving a number of problems](./index.md). Specifically, this could lead to benefits such as:
 
 - **better resource planning** (e.g. [improving bed allocation using AI](../our_work/bed-allocation.md))
 - **increased responsiveness to demand and seasonal pressures** (e.g. A&E demand prediction)
diff --git a/docs/what_is_data_science/How you can learn Data Science.md b/docs/what_is_data_science/How you can learn Data Science.md
index a238a2fa..6e94aa17 100644
--- a/docs/what_is_data_science/How you can learn Data Science.md
+++ b/docs/what_is_data_science/How you can learn Data Science.md
@@ -1,6 +1,6 @@
 # How you can learn Data Science
 
-Data Science isn't just for Data Scientists! As a profession, we're passionate about sharing these skills and techniques.
+Data science isn't just for data scientists! As a profession, we're passionate about sharing these skills and techniques.
 
 For this purpose we've put together a **monthly newsletter** with valuable **insights**, **training opportunities** and **events** for people interested in learning more about the various aspects of data science, further developing their skills, and progressing in their career:
 
@@ -12,7 +12,7 @@ For this purpose we've put together a **monthly newsletter** with valuable **ins
 
 The newsletter is targeted towards members of the NHS England Data Science team, so some links may only be accessible to those with the necessary login credentials, however the newsletter and its archive are available for all at the link above.
 
-Through [AnalystX] we also support the [NHS Data Science community](https://data-science-community.analystx.uk/) which is the home of spreading data science knowledge within the NHS.
+Through [AnalystX] we also support the [NHS Data Science Community](https://data-science-community.analystx.uk/) which is the home of spreading data science knowledge within the NHS.
 
 You can also learn a lot about data science by simply getting to know the wider cross-government/health community:
 
diff --git a/docs/what_is_data_science/index.md b/docs/what_is_data_science/index.md
index e281b049..e65c29fd 100644
--- a/docs/what_is_data_science/index.md
+++ b/docs/what_is_data_science/index.md
@@ -4,7 +4,7 @@ Quoting the [Digital, Data and Technology (DDaT) Capability Framework](https://d
 
 > Data science is a broad and fast-moving field spanning maths, statistics, software engineering and communications. Data scientists will often work as part of a multidisciplinary team, using data and analytics to inform and achieve organisational goals.
 
-Specifically, this means that often data scientists use programming languages like Python and R (though there are others too!) to solve problems within the business, or inform about courses of action, working with other colleagues such as analysts and data engineers, in particular.
+Data scientists will often use programming languages such as Python and R (among others!) to solve problems within a business or to inform decisions, working with other colleagues such as analysts and data engineers.
 
 Some examples of the kinds of problems data science can help with are:
 
@@ -14,9 +14,9 @@ Some examples of the kinds of problems data science can help with are:
 - **enriching and transforming data**, such as through linkage, feature engineering, artificial and synthetic data generation
 - **classification and regression** - that is saying what something is, or if something will or won't happen, and/or quantifying something unknown.
 
-**Artificial Intelligence** and **Machine Learning** are techniques which are widely associated with Data Science and Data Scientists, and which can be applied to basically any of the problems listed above. See also this article on the ["Seven Patterns of AI"](https://www.forbes.com/sites/cognitiveworld/2019/09/17/the-seven-patterns-of-ai/?sh=448cf51812d0).
+**Artificial Intelligence** and **Machine Learning** are techniques which are widely associated with data science and data scientists, and which can be applied to basically any of the problems listed above. See also this article on the ["Seven Patterns of AI"](https://www.forbes.com/sites/cognitiveworld/2019/09/17/the-seven-patterns-of-ai/?sh=448cf51812d0).
 
-??? info "How does Data Scientists differ from Analysts and Data engineers?"
-    Data Science can be quite hard to pin down, as it covers a lot of different techniques, and problems, and data scientists themselves often have a lot of overlap with Analysts and Data Engineers. Analysts and Engineers might well use data science techniques in their work!
+??? info "How do data scientists differ from analysts and Data engineers?"
+    Data science can be quite hard to pin down, as it covers a lot of different techniques, and problems, and data scientists themselves often have a lot of overlap with analysts and Data engineers. Analysts and engineers might well use data science techniques in their work!
 
-    However, generally, data scientists are slightly more focussed on looking ahead, embracing and exploiting new techniques, e.g. unstructured data, such as text, images, audio.
+    However, generally data scientists are more focused on looking ahead, embracing and exploiting new techniques across a range of different types of data, e.g. unstructured data, such as text, images, audio.
From 6109c96152f20826de7e4418131d4c40345886b6 Mon Sep 17 00:00:00 2001
From: Harriet Sands
Date: Tue, 14 May 2024 10:34:07 +0000
Subject: [PATCH 5/9] correct missed data engineer capitalisation

---
 docs/what_is_data_science/index.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/what_is_data_science/index.md b/docs/what_is_data_science/index.md
index e65c29fd..2189b080 100644
--- a/docs/what_is_data_science/index.md
+++ b/docs/what_is_data_science/index.md
@@ -16,7 +16,7 @@ Some examples of the kinds of problems data science can help with are:
 
 **Artificial Intelligence** and **Machine Learning** are techniques which are widely associated with data science and data scientists, and which can be applied to basically any of the problems listed above. See also this article on the ["Seven Patterns of AI"](https://www.forbes.com/sites/cognitiveworld/2019/09/17/the-seven-patterns-of-ai/?sh=448cf51812d0).
 
-??? info "How do data scientists differ from analysts and Data engineers?"
-    Data science can be quite hard to pin down, as it covers a lot of different techniques, and problems, and data scientists themselves often have a lot of overlap with analysts and Data engineers. Analysts and engineers might well use data science techniques in their work!
+??? info "How do data scientists differ from analysts and data engineers?"
+    Data science can be quite hard to pin down, as it covers a lot of different techniques, and problems, and data scientists themselves often have a lot of overlap with analysts and data engineers. Analysts and engineers might well use data science techniques in their work!
 
     However, generally data scientists are more focused on looking ahead, embracing and exploiting new techniques across a range of different types of data, e.g. unstructured data, such as text, images, audio.
From c5bcf00a663a82517c16977578d2d0329461ef0c Mon Sep 17 00:00:00 2001
From: Harriet Sands
Date: Tue, 14 May 2024 10:35:47 +0000
Subject: [PATCH 6/9] Initially capitalise and complete list items

---
 docs/what_is_data_science/index.md | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/docs/what_is_data_science/index.md b/docs/what_is_data_science/index.md
index 2189b080..8f30e6dc 100644
--- a/docs/what_is_data_science/index.md
+++ b/docs/what_is_data_science/index.md
@@ -8,11 +8,11 @@ Data scientists will often use programming languages such as Python and R (among
 
 Some examples of the kinds of problems data science can help with are:
 
-- **getting more value out of unstructured data** (e.g. text, images, audio) through "Natural Language Processing", "Neural Networks" and recently "Large Language Models" (e.g. ChatGPT).
-- **modelling systems and forecasting** (though compared with statisticians and economists, this tends to be more empirical, requiring more focus on evaluation)
-- **explaining existing models and their performance** - communication is a key pillar of data science: making the complex understandable to everyone else.
-- **enriching and transforming data**, such as through linkage, feature engineering, artificial and synthetic data generation
-- **classification and regression** - that is saying what something is, or if something will or won't happen, and/or quantifying something unknown.
+- **Getting more value out of unstructured data** (e.g. text, images, audio) through "Natural Language Processing", "Neural Networks" and recently "Large Language Models" (e.g. ChatGPT).
+- **Modelling systems and forecasting** (though compared with statisticians and economists, this tends to be more empirical, requiring more focus on evaluation).
+- **Explaining existing models and their performance** - communication is a key pillar of data science: making the complex understandable to everyone else.
+- **Enriching and transforming data**, such as through linkage, feature engineering, artificial and synthetic data generation.
+- **Classification and regression** - that is saying what something is, or if something will or won't happen, and/or quantifying something unknown.
 
 **Artificial Intelligence** and **Machine Learning** are techniques which are widely associated with data science and data scientists, and which can be applied to basically any of the problems listed above. See also this article on the ["Seven Patterns of AI"](https://www.forbes.com/sites/cognitiveworld/2019/09/17/the-seven-patterns-of-ai/?sh=448cf51812d0).
 
From adf8649d722a9bd3ac3664a82576f78480998388 Mon Sep 17 00:00:00 2001
From: Harriet Sands
Date: Tue, 14 May 2024 10:37:33 +0000
Subject: [PATCH 7/9] correct capitalisation

---
 docs/what_is_data_science/How you can learn Data Science.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/what_is_data_science/How you can learn Data Science.md b/docs/what_is_data_science/How you can learn Data Science.md
index 6e94aa17..85cb3b48 100644
--- a/docs/what_is_data_science/How you can learn Data Science.md
+++ b/docs/what_is_data_science/How you can learn Data Science.md
@@ -1,4 +1,4 @@
-# How you can learn Data Science
+# How You Can Learn Data Science
 
 Data science isn't just for data scientists! As a profession, we're passionate about sharing these skills and techniques.
 
From c289af570d6b38bdfe4362d4bd97963b124d8d8a Mon Sep 17 00:00:00 2001
From: Harriet Sands
Date: Tue, 14 May 2024 10:41:38 +0000
Subject: [PATCH 8/9] Consistent capitalisation

---
 docs/PhDInterns.md   | 2 +-
 docs/about.md        | 4 ++--
 docs/useful_links.md | 4 ++--
 3 files changed, 5 insertions(+), 5 deletions(-)

diff --git a/docs/PhDInterns.md b/docs/PhDInterns.md
index c1c76d54..371affe1 100644
--- a/docs/PhDInterns.md
+++ b/docs/PhDInterns.md
@@ -3,7 +3,7 @@ hide:
   - navigation
 ---
 
-# PhD Internership Scheme
+# PhD Internship Scheme
 
 Our internships are aimed at current PhD students looking for an industrial placement of around five months with the right to work in the UK. The projects are focussed on innovation, in particular around getting the most value out of NHS data.
 
diff --git a/docs/about.md b/docs/about.md
index d25a051b..8496677e 100644
--- a/docs/about.md
+++ b/docs/about.md
@@ -24,7 +24,7 @@ We are the [NHS England](https://www.england.nhs.uk/) Data Science Team.
-## How are we different from analytical teams?
+## How We Are Different From Analytical Teams
@@ -54,7 +54,7 @@ We are the [NHS England](https://www.england.nhs.uk/) Data Science Team.
-## Our missions
+## Our Missions
 === "Deliver"

**Deliver problem led data science products to commissioners.**

diff --git a/docs/useful_links.md b/docs/useful_links.md
index 33573f0a..fcd16d35 100644
--- a/docs/useful_links.md
+++ b/docs/useful_links.md
@@ -3,7 +3,7 @@ hide:
   - navigation
 ---
 
-# Useful links
+# Useful Links
 
 This is a list, mostly copied from [RAP guidance pages](https://harrietrs.github.io/rap-community-of-practice/useful_links/) and shows a range of resources throughout the Government and other areas. Let us know if you have any links to useful information, resources or guides that could be added to the list.
 
@@ -47,7 +47,7 @@ The [Turing Way](https://the-turing-way.netlify.app/index.html#) have also produ
 
 - The [NHS Digital Github](https://github.com/NHSDigital/data-analytics-services) contains code shared from NHS Digital (and NHS England) projects. Well worth a look.
 
-## Community spaces
+## Community Spaces
 
 There are several slack channels that discuss RAP and related topics: the govdatascience.slack.com RAP channel, the NHS-R community, and the NHS-pycom community
 We have an MS Teams page (internal to NHS Digital)
From 9ec54e9089aa1c0797746efed710c47ffd881845 Mon Sep 17 00:00:00 2001
From: Harriet Sands
Date: Tue, 14 May 2024 10:41:38 +0000
Subject: [PATCH 9/9] Consistent capitalisation

---
 docs/PhDInterns.md   | 2 +-
 docs/about.md        | 6 +++---
 docs/useful_links.md | 4 ++--
 3 files changed, 6 insertions(+), 6 deletions(-)

diff --git a/docs/PhDInterns.md b/docs/PhDInterns.md
index c1c76d54..371affe1 100644
--- a/docs/PhDInterns.md
+++ b/docs/PhDInterns.md
@@ -3,7 +3,7 @@ hide:
   - navigation
 ---
 
-# PhD Internership Scheme
+# PhD Internship Scheme
 
 Our internships are aimed at current PhD students looking for an industrial placement of around five months with the right to work in the UK. The projects are focussed on innovation, in particular around getting the most value out of NHS data.
 
diff --git a/docs/about.md b/docs/about.md
index d25a051b..7e8bbb11 100644
--- a/docs/about.md
+++ b/docs/about.md
@@ -24,7 +24,7 @@ We are the [NHS England](https://www.england.nhs.uk/) Data Science Team.
-## How are we different from analytical teams?
+## How We Are Different From Analytical Teams
@@ -54,7 +54,7 @@ We are the [NHS England](https://www.england.nhs.uk/) Data Science Team.
-## Our missions
+## Our Missions
 === "Deliver"

**Deliver problem led data science products to commissioners.**

@@ -72,7 +72,7 @@ We are the [NHS England](https://www.england.nhs.uk/) Data Science Team.

**Devise a great place to work where group work solves great problems​​​​.​**

 ## Our Members
-??? "Our members"
+??? "Our Members"