Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SAFI farms cognitive load #49

Open
mikerspencer opened this issue Apr 23, 2021 · 4 comments
Open

SAFI farms cognitive load #49

mikerspencer opened this issue Apr 23, 2021 · 4 comments

Comments

@mikerspencer
Copy link

The SAFI farms dataset has quite a high cognitive load for those without regional, development or farming knowledge. The lessons may work better for a wider audience with a more general social dataset, e.g. a census.

@atheobold
Copy link

@mikerspencer -- As maintainers, we have often discussed the possibility of changing the dataset for these lessons, however, this is a major change that would need to be approved by the Curriculum Advisory Council. Since you have brought up the issue, we would be happy to relay the suggestion of transitioning to a different dataset.

To ensure we accurately summarize your concerns, could you respond with a list of reasons why you believe the SAFI dataset is a poor fit for these lessons? Additionally, would you have any datasets that you would suggest the lesson using?

Thanks!

@mikerspencer
Copy link
Author

Hi Allison,

Problems:

  • Dataset is location specific, place names are difficult for those not familiar with the area
  • Specific terms, e.g. mud daub, which are difficult for non-English speakers (or anyone not making houses from mud)
  • Agricultural knowledge required, e.g. irrigation, crop types

Solutions/ideas:

  • What types of data do we need, categorical, numeric, nested?
  • What data structures do we need? single file/table, but also lookups/joins
  • Dataset which doesn't depend on place, unless perhaps global?
  • Topic which is widely accessible

I've previously used subsets of the Stackoverflow developer with success, but generic population census data could be even better.

I appreciate, change will be difficult and there's likely no perfect answer.

@brownsarahm
Copy link
Contributor

I want to vote strongly against stackoverflow developer data. The target of these curricula is social scientists, the data should be survey data, not logs.

the data also needs to be plausibly messy for cleaning with openrefine.

Census could be okay, but I actually prefer incorporating more about reading the meta-data into the spreadsheets lesson (which is supposed to be a precursor to the rest of the lessons) so that over the course of a workshop, people learn enough about the data for it to make sense later.

@USNRyan
Copy link

USNRyan commented Jan 9, 2024

I agree with @mikerspencer . However, I think the SAFI dataset itself isn't problematic, maybe a more detailed introduction about the background of the study would help social scientists feel more at home. In the lesson introduction, they provide a brief description of the dataset, which I think is helpful, but there needs more detail that is social science specific. I attended a physical/in-person carpentries workshop in Norway, and the facilitator provided pictures of the different types of roofs found in the dataset, added info about the type of family structures in the area, etc - the context she built made the dataset "come to life", so to speak. But I guess this is really up to the workshop holders to give this added info.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants