discussion / Acoustics  / 5 February 2025

Giving different types of labeled data to the community - solutions?

Hello everyone!

During my master thesis in engineering (past autumn), I developed Machine Learning for anti-poaching. As such, I recorded a bunch of data that I now would like to share/put somewhere useful. The data is audio recordings of gunshots and other sounds on the savannah (.wav files) + geophone data with associated pictures of geophone location + accelerometer data from fence-breaching. The data is labeled with varying quality, but generally quite good. All data was collected in South Africa, Limpopo. 

The data is currently stored on Dropbox and google drive, with all data maybe amassing to a few hundred GB (including raw files, so only keeping the data that can be used out-of-the-box without any work would reduce the storage a lot). 

What ways are there to contribute to the open-source data community with this? I remember reading about data services for specifically wildlife on here before, but I am not sure what the best solution is. Preferably I would not like to pay for a data storage service.