MLEnd London Sounds

A dataset for acoustic scence

About Dataset

Auditory perception is one of the most intriguing abilities of humans and many animal species. In addition to allowing us to recognise speech or music, auditory perception can help us make sense of our environment by recognising acoustic scenes. Can we develop machines that have the same ability?

The MLEnd London Sounds dataset will give you an opportunity to explore machine listening, specifically problems around acoustic scene recognition. The MLEnd London Sounds dataset consists of more than 2,500 audio files recorded across London, at iconic places such The British Museum, Covent Garden and The Southbank Centre.

The MLEnd datasets have been created by students at the School of Electronic Engineering and Computer Science, Queen Mary University of London. Other datasets include the MLEnd Spoken Numerals and the MLEnd Hums and Whistles datasets, also available on Kaggle. Do not hesitate to reach out if you want to know more about how we did it.

Enjoy!

Sample Dataset

Here are some samples of London Sounds dataset.

British Museum: Forecourt

Euston: Library

Euston: Gardens

Kensington: Marine

MLEnd Campus: Canal

MLEnd Campus: Square

Southbank: Bridge

Westend: Market

Westend: Trafalgar

Download Data

Install mlend

To download the Spoken Numerals data, first step is to install mlend library. Use pip to install library.

pip install mlend

Download subset of data

To download subset of the data, only one area ‘British Meusum’ with two spots namely; ‘forecourt’,’greatcourt’, use following piece of code:

import mlend
from mlend import download_london_sounds, london_sounds_load

subset = {'Area':['british_museum'], 'Spot':['forecourt','greatcourt']}

datadir = download_london_sounds(save_to = '../MLEnd', subset = subset,pbar_style='colab')

This code will download data in given path (‘../MLEnd’) and returns the path of data as datadir (='../MLEnd/london_sounds')

Download full dataset

To download full dataset, use empty subset, as in following piece of code:

import mlend
from mlend import download_london_sounds, london_sounds_load

subset = {}
datadir = download_london_sounds(save_to = '../MLEnd', subset = subset,pbar_style='colab')

Load the Data and benchmark sets

After downloading partial or full dataset, mlend allows you to load the dataset with specified method (‘Benchmark A’ or ‘random’) of training and testing split. Note, mlend doesn’t read and load the audio files in memory, instead it reads the path of files, for further reading and cleaning data as per requirement of the model. For more details, check help(london_sounds_load).

import mlend
from mlend import download_london_sounds, london_sounds_load

subset = {'Area':['british_museum'], 'Spot':['forecourt','greatcourt']}

datadir = download_london_sounds(save_to = '../MLEnd', subset = subset,pbar_style='colab'))

TrainSet,TestSet, MAPs = mlend.london_sounds_load(datadir_main = datadir,
                                            train_test_split = 'Benchmark_A', 
                                            verbose=1,encode_labels=True)

MLEnd Documentation

For mlend documentation use help(fun) in python terminal or Jupyter-notebook. Alternately, check out