Commit
Update README.md - small spelling mistake
harrygcoppock authored May 19, 2023
1 parent 88b15ae commit cb1e107
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
@@ -44,7 +44,7 @@ The full UK COVID-19 Vocal Audio Dataset is not publicly available as is classed
[https://www.gov.uk/government/publications/accessing-ukhsa-protected-data/accessing-ukh]{https://www.gov.uk/government/publications/accessing-ukhsa-protected-data/accessing-ukhsa-protected-data}


-We understand that this might not be practical for a number of users interested in our work and therefore we have created a new curated dataset which has been classed as 'Open Access' data (there will be a downloadable link which anyone can use, without the need to even register). In order to achieve this the 'sentence' modality has been removed, leaving behind the 'cough', 'three cough' and 'exahaltion' modalities. In addition, to meet open access requirements, some select attributes of the meta data have been aggregated (to prevent groups of individuals of smaller than 3 being singled out on selection of attributes). This means that the 'sentence' modality results are not replicable or the creation of the train-test splits. We note that this just applys for the the open access version of the data and that our full stack is replicable with the original dataset which can be accessed following the instructions above. We note that we provide the train-test splits in _.csv_ form so that the machine learning experiments can be replicated with the open access data. This open access dataset has been created however, is waiting final UKHSA approval before we upload it to zenodo.
+We understand that this might not be practical for a number of users interested in our work and therefore we have created a new curated dataset which has been classed as 'Open Access' data (there will be a downloadable link which anyone can use, without the need to even register). In order to achieve this the 'sentence' modality has been removed, leaving behind the 'cough', 'three cough' and 'exahaltion' modalities. In addition, to meet open access requirements, some select attributes of the meta data have been aggregated (to prevent groups of individuals of smaller than 3 being singled out on selection of attributes). This means that the 'sentence' modality results are not replicable or the creation of the train-test splits. We note that this just applies for the the open access version of the data and that our full stack is replicable with the original dataset which can be accessed following the instructions above. We note that we provide the train-test splits in _.csv_ form so that the machine learning experiments can be replicated with the open access data. This open access dataset has been created however, is waiting final UKHSA approval before we upload it to zenodo.

### Demo!
To easily run the code yourself using your own voice recordings, (no need to download the data), we have provided a short demo hosted on google colab. Please follow this [link](https://colab.research.google.com/drive/1Hdy2H6lrfEocUBfz3LoC5EDJrJr2GXpu?usp=sharing) to have a go yourself!
