guideline about your paper #1

Open
parisa1984 opened this issue Jan 15, 2023 · 0 comments

Comments

@parisa1984

Hi,
I am working on sound source localization. I have read your papers "Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and Detection" and "A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone Arrays", and I see that you tested the approach on 60-second audio files. I ran your implementation (https://github.com/thomeou/SALSA) on the TAU-NIGENS Spatial Sound Events 2021 dataset. Everything runs without errors, but the network output is correct only for the audio in the dataset. For data I recorded myself with a 4-channel microphone array (the ReSpeaker USB Mic Array), the output is completely wrong. I am wondering what is wrong with my data. It is 4-channel, fp16, and uses the same PCM encoding.
Which device did you use for recording?
My recordings contain some echo. Could a small amount of echo significantly degrade the performance of your localization algorithm?
Thanks for your help.
