README.txt

Code Structure

The code is structured as a set of scripts. The following diagram captures the dependency structure of the scripts (the following script depends on the output of the previous script):

For D-WEAT: The results were plotted in get_results_figs.ipynb. For GDCF: The final csvs from get_all-features-results.ipynb were imported into Google Sheets, and analyzed there.

Modules

We had to modify this code, so we provide the code here as a subdirectory.

Spotify Podcast Dataset

Link: https://podcastsdataset.byspotify.com/

This dataset is maintained by Spotify, and access to the dataset is determined by Spotify.

Open AI Embeddings API

Link: https://platform.openai.com/docs/guides/embeddings

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
english-fisher-annotations		english-fisher-annotations
img		img
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
ablations.py		ablations.py
get_30sec10min_correlations.ipynb		get_30sec10min_correlations.ipynb
get_LDA_df.ipynb		get_LDA_df.ipynb
get_LDA_topic_coherence.ipynb		get_LDA_topic_coherence.ipynb
get_all-features-df.ipynb		get_all-features-df.ipynb
get_all-features-results.ipynb		get_all-features-results.ipynb
get_dfs.ipynb		get_dfs.ipynb
get_english_nonenglish.ipynb		get_english_nonenglish.ipynb
get_inaSpeechSegmenter_annotations.py		get_inaSpeechSegmenter_annotations.py
get_low_words.ipynb		get_low_words.ipynb
get_manual_inaSpeechSegmenter.ipynb		get_manual_inaSpeechSegmenter.ipynb
get_results_figs.ipynb		get_results_figs.ipynb
get_whisperx_googleasr_table.ipynb		get_whisperx_googleasr_table.ipynb
get_whisperx_transcriptions.py		get_whisperx_transcriptions.py
run_experiment.py		run_experiment.py
tb.py		tb.py
topic-ablations.py		topic-ablations.py
topic-script.py		topic-script.py
topic60.txt		topic60.txt
topic62.txt		topic62.txt
utils_embeddings.py		utils_embeddings.py
utils_general.py		utils_general.py
utils_podcasts.py		utils_podcasts.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README.txt

Code Structure

Modules

LDA

inaSpeechSegmenter

english-fisher-annotations

Spotify Podcast Dataset

Open AI Embeddings API

About

Releases

Packages

Languages

License

mariateleki/masculine-defaults

Folders and files

Latest commit

History

Repository files navigation

README.txt

Code Structure

Modules

LDA

inaSpeechSegmenter

english-fisher-annotations

Spotify Podcast Dataset

Open AI Embeddings API

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages