From 14b4c1e503f5bca70b23f6e0275a4bf71238f493 Mon Sep 17 00:00:00 2001 From: Quarto GHA Workflow Runner Date: Fri, 23 Aug 2024 21:38:59 +0000 Subject: [PATCH] Built site for gh-pages --- .nojekyll | 2 +- case-os.html | 110 ++- change-management.html | 178 +++- contributors.html | 130 ++- datatrials.html | 809 ++++++++++++++++++ definitions.html | 110 ++- index.html | 124 ++- modern-sce.html | 108 ++- multi-modal.html | 106 ++- os-depends.html | 106 ++- references.html | 108 ++- search.json | 140 +-- shiny-csr.html | 106 ++- site_libs/bootstrap/bootstrap.min.css | 4 +- .../quarto-syntax-highlighting.css | 2 + site_libs/quarto-html/quarto.js | 21 +- site_libs/quarto-nav/quarto-nav.js | 38 +- site_libs/quarto-search/quarto-search.js | 69 +- smallmidpharma.html | 761 ++++++++++++++++ validate-shiny.html | 171 ++-- 20 files changed, 2747 insertions(+), 456 deletions(-) create mode 100644 datatrials.html create mode 100644 smallmidpharma.html diff --git a/.nojekyll b/.nojekyll index cf7f170..254a596 100644 --- a/.nojekyll +++ b/.nojekyll @@ -1 +1 @@ -c41ddf21 \ No newline at end of file +c800ca80 \ No newline at end of file diff --git a/case-os.html b/case-os.html index 8ce41a6..38b8fc5 100644 --- a/case-os.html +++ b/case-os.html @@ -2,12 +2,12 @@ - + -R/Pharma round tables - 7  The case for OS +7  The case for OS – R/Pharma round tables + + + + + + + + + + + + + + + + + + + + + + + + + + + +
+
+ +
+ +
+ + +
+ + + +
+ +
+
+

9  Can we do data better?

+
+ + + +
+ + + + +
+ + + +
+ + +
+

9.1 2024

+

Chairs: Stephanie Lussier and Doug Kelkhoff

+
+

9.1.1 databases

+

A lot of thought currently about databases, but not a lot of companies using it in primary data flows (although it is used in curated trial data for secondary use, e.g. Novartis’ Data42 and Roche’s EDIS).

+
+
+

9.1.2 Blockers

+
    +
  • Dependence on CROs who deliver SAS datasets generated by SAS code is a factor.
  • +
  • Often fear from IT groups about the cloud, which is sometimes confusing when platforms like medidate are already cloud-based and other companies already have STDM/ADaM in AWS S3/cloud.
  • +
  • Unclear justification for changes, particularly what are we getting from databases for current STDM/ADaM primary use; existing systems are mostly functional.
  • +
  • Challenges with concurrent data access by multiple teams in some file based approaches, leading to errors.
  • +
+
+
+

9.1.3 an approach around tortoiseSVN

+
    +
  • One company had been using tortoiseSVN for a while, and is considering moving to snowflake.
  • +
  • Pros: Integration with version control and modern cloud storage solutions.
  • +
  • Cons: +
      +
    • Higher entry threshold for users.
    • +
    • Gap in a user friendly GUI
    • +
    • Storing data in ‘normal’ version control rather than tools designed for data versioning rapidly leads to bloated repositories.
    • +
  • +
+
+
+

9.1.4 Version Control and Data Storage

+
    +
  • Alignment code versioning in Git; data versioning in tools like S3 versioning
  • +
  • S3 can be accessed as a mounted drive (e.g. Lustre) and the S3 API.
  • +
+
+
+

9.1.5 Denodo as Data Fabric Mesh

+

One company uses Denodo as a data fabric mesh; users interact via Denodo, which serves as an API layer. No direct interaction with the source data by users.

+
+
+

9.1.6 Nontabular Data

+
    +
  • Not common for statistical programmers working on clinical trial data.
  • +
+
+
+

9.1.7 CDISC Dataset JSON vs. Manifest JSON

+

Writing CDISC JSON is super slow and potentially not sufficient for regular working data.

+
+
+

9.1.8 Popularity and Concerns with Parquet Datasets

+
    +
  • Admiral tool generates Parquet directly; others convert from SAS to Parquet.
  • +
  • Questions about the longevity and maintenance requirements of Parquet as it’s a blob (vs a ‘human readable’ format like CSV/JSON)
  • +
+
+
+

9.1.9 Handling Legacy Data

+
    +
  • Suggest stacking legacy data into a database if for secondary data use
  • +
+
+
+

9.1.10 Change Management

+
    +
  • For statistical programming, direct instruction to new systems is necessary.
  • +
  • Emphasize direct support over broad training.
  • +
  • Simplify systems for users to reduce friction.
  • +
  • Consider a GUI similar to Azure.
  • +
  • Focus on reducing the user burden.
  • +
+
+
+

9.1.11 Different Data Use Cases

+

Differences in data use (e.g., Shiny App vs. regulatory documents). Dashboards directly accessing EDC without needing snapshots.

+
+
+

9.1.12 Summary

+

Uncertain value in moving from CDISC data standards to databases. Limited interest and action in this area across the organization. Not a high priority given other ongoing organizational changes. Ongoing shift away from SAS-based datasets and file storage to cloud-based systems, with increasing use of Parquet.

+
+
+

9.1.13 Action Items

+
    +
  • SCE whitepaper - mark bynum from J&J
  • +
  • Is there actual value / gain in databases?
  • +
  • Not the best investment relative to other non-data changes going on across organization (e.g. R, containers, etc)
  • +
+ + +
+
+ +
+ + +
+ + + + + \ No newline at end of file diff --git a/definitions.html b/definitions.html index b67e82e..775fb9a 100644 --- a/definitions.html +++ b/definitions.html @@ -2,12 +2,12 @@ - + -R/Pharma round tables - Definitions +Definitions – R/Pharma round tables + + + + + + + + + + + + + + + + + + + + + + + + + + + +
+
+ +
+ +
+ + +
+ + + +
+ +
+
+

8  Small/mid pharma & OS

+
+ + + +
+ + + + +
+ + + +
+ + +

Chair: Katie Igartua and Kas Yousefi

+
+

8.0.1 Proposal

+

A pressing topic I think we should discuss is how best to engage and enable small/mid-sized pharma to use open source Ecosystems.

+

It can be sometimes overwhelming and would be helpful to have an idea where to start to join the open source journey and utilize the resources and approaches already available.

+

Some questions:

+
    +
  1. How to go about setting up an open source Environment? What are the hurdles? How can the crowd-sourced groups help?
  2. +
  3. Could small pharma use R for IB/DSUR? Does this need to be in a validated environment?
  4. +
  5. How do other small pharma companies uses open-source environments? and what would be areas for collaboration?
  6. +
+
+
+

8.0.2 Expected impact

+

I believe this topic would be a benefit because small pharma often needs to be innovative, but due to resource limits may rely on traditional approaches and heavily rely on CROs. Having stronger small pharma presence could help in new resources and approaches for open-sourced solutions

+
+
+

8.0.3 Prior discussions/work

+

Some prior work includes……

+ +
+
+

9 Round table

+
    +
  • Open sourcing adoption/contribution barrier +
      +
    • risk taking when you only have one product
    • +
    • cost: small companies often fully outsource regulatory work to CROs, hard to justify additional investment on infra or open source work for non regulatory tasks
    • +
    • IT resource: posit installation for small pharma - small company may not have the right in house IT talent to even get posit running
    • +
    • Are you ready to be a shiny dev ops person & R admin?
    • +
  • +
  • What are use cases to use open source? mainly in non validated env +
      +
    • data review, monitoring, visualization, patient profile, DSUR
    • +
    • IDCC uses shiny for IDMC close sessions (instead of long pdf)
    • +
  • +
  • Asks to the community +
      +
    • can large pharma open source their infra config? such as AMS yaml file? +
        +
      • from the perspective at the intersection of IT, DS, stat
      • +
    • +
    • what about a “single” opinionated workflow (pharmaverse is seen more as a comprehensive tool box) +
        +
      • from an independent body such as r consortium?
      • +
      • or can individual big pharma publish the whole workflow
      • +
    • +
  • +
  • GxP set up: timing tradeoff - is it good to set up early or late? setting up early can put on restrictions that is hard to change later. balance of flexibility, best practice and cost
  • +
+ + +
+ +
+ + +
+ + + + + \ No newline at end of file diff --git a/validate-shiny.html b/validate-shiny.html index cde7ec9..66a2c6f 100644 --- a/validate-shiny.html +++ b/validate-shiny.html @@ -2,12 +2,12 @@ - + -R/Pharma round tables - 2  Validate shiny? +2  Validate shiny? – R/Pharma round tables