Code for our ICWSM 2025 paper "Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models."

mariateleki/masculine-defaults

Code Structure

The code is organized as a set of scripts. The following diagram captures their dependency structure (each script depends on the output of the one before it):

For D-WEAT: The results were plotted in get_results_figs.ipynb.

For GDCF: The final CSVs from get_all-features-results.ipynb were imported into Google Sheets and analyzed there.

Modules

LDA

Link: https://scikit-learn.org/stable/modules/generated/sklearn.discriminant_analysis.LinearDiscriminantAnalysis.html

inaSpeechSegmenter

Link: https://github.com/ina-foss/inaSpeechSegmenter

english-fisher-annotations

Link: https://github.com/pariajm/english-fisher-annotations

We had to modify this code, so we provide our modified version in this repository as a subdirectory.

Spotify Podcast Dataset

Link: https://podcastsdataset.byspotify.com/

This dataset is maintained by Spotify, which also determines who is granted access to it.

OpenAI Embeddings API

Link: https://platform.openai.com/docs/guides/embeddings
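
For readers unfamiliar with the Embeddings API, the following is a minimal sketch of how text can be embedded and two embeddings compared. It is not taken from the scripts in this repository. The helper names (cosine_similarity, embed_texts) and the model name are assumptions chosen for illustration, and the model used in the paper may differ. The API call requires the openai Python package (v1 or later) and an OPENAI_API_KEY environment variable.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def embed_texts(
    texts: List[str],
    model: str = "text-embedding-3-small",  # assumption: example model name only
) -> List[List[float]]:
    """Embed a list of texts with the OpenAI Embeddings API.

    Hypothetical helper: assumes the `openai` package (v1+) is installed
    and that OPENAI_API_KEY is set in the environment.
    """
    # Imported lazily so cosine_similarity can be used without the package.
    from openai import OpenAI

    client = OpenAI()
    response = client.embeddings.create(model=model, input=texts)
    # Each item in response.data carries one embedding vector.
    return [item.embedding for item in response.data]
```

With valid credentials, a call such as embed_texts(["first text", "second text"]) should return one vector per input, and any two of those vectors can be passed to cosine_similarity. The import is kept inside embed_texts so that the similarity helper stays usable on machines without API access.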
