Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Distinction between external/internal datasets #16

Open
kaijagahm opened this issue Aug 20, 2024 · 0 comments
Open

Distinction between external/internal datasets #16

kaijagahm opened this issue Aug 20, 2024 · 0 comments
Labels
episode:datasets Any issues related to episode 4. Minimal Reproducible Data

Comments

@kaijagahm
Copy link
Collaborator

The problem with loading the ratdat package is that it means the datasets are already loaded. In the Data episode, there were some places where we wanted to show the audience "see, we're working with this dataset, but others might not be able to find the csv file on their computer..." I think this was confusing because in fact the audience could find the dataset on their computer... by loading the ratdat package!

I think it would be good to have the learners working mostly or partly from a csv for the data portion so they can actually see how hard it is to share a csv file. I.e. consider reading in the complete dataset from csv instead of using the version stored in the ratdat package.

@kaijagahm kaijagahm added the episode:datasets Any issues related to episode 4. Minimal Reproducible Data label Aug 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
episode:datasets Any issues related to episode 4. Minimal Reproducible Data
Projects
None yet
Development

No branches or pull requests

1 participant