Welcome, Guest: Register On Nairaland / LOGIN! / Trending / Recent / New
Stats: 3,150,189 members, 7,807,642 topics. Date: Wednesday, 24 April 2024 at 04:44 PM

Know How Music-identification Apps Work - Phones - Nairaland

Nairaland Forum / Science/Technology / Phones / Know How Music-identification Apps Work (319 Views)

Anybody Know How To Redeem Coinme bitcoin voucher / FG Orders Cancellation Of ₦20 National Identification Number Retrieval Charge / Facial Identification On Smart 2/ 2 Pro (2) (3) (4)

(1) (Reply)

Know How Music-identification Apps Work by Kalapizim(m): 1:34pm On Dec 05, 2018
Music-Identification apps help us in identifying the song playing around you. From a user’s perspective, it’s simple: Start the app, press a button, and let your phone listen to the song. After a few seconds, even with background noise and distortion, the app will tell you what the song is. It works so quickly and so well that it almost seems like magic – but, as with most magical things these days, it’s mostly run by algorithms. Let's now the idea behind them and how they work.

All the music-identification services work basically the same way: they have a big database of song information, an algorithm that can quickly extract information from your song sample, and an app to let you interface with those things. Technically, you don’t even need a smartphone.

Shazam was originally usable on old-fashioned flip phones by just recording a song and texting it to the service. Soundhound has actually gone a few steps further by also enabling you to sing or hum into their app which they match against a user-submitted database of other singing/humming recordings.

In simple terms, the process looks like this:

The database of the app contains a huge collection of song “fingerprints,” or small pieces of data about the song’s unique sound patterns.

When a user hits the “Record” button, the app listens to the music and creates a fingerprint based on the few seconds of audio it hears.
This fingerprint is checked against the database of existing fingerprints. If your ten-second fingerprint is a match to part of a song, you get your song result (hopefully correct). If it’s not, you’ll get back an error.
If you’re just looking for a surface-level explanation, that’s all you need to know. The really interesting part is how you actually get that fingerprint.

It all starts with a spectrogram (like the one in the graph) taken from a paper written by one of Shazam’s founders, Avery Wang. This is essentially a graph with time on the x-axis (horizontal), frequency on the y-axis (vertical), and amplitude represented by different levels of color intensity. Any sequence of sounds can thus be converted into a spectrogram, and any point on the spectrogram can be assigned a set of coordinates. Just like that, notes can be numbers.

If all you needed to do was match a few sounds to each other, you could stop here. If you want to look through a database full of millions of songs, though, a full-detail spectrogram has way too many data points to look through at any sort of speed.

The big breakthrough in music recognition was the realization that you can identify sounds with only a few pieces of data: the peaks, or the most intense parts. Not only does getting rid of most of a song’s lower-energy parts decrease the size of the spectrogram, but it makes the apps less susceptible to identifying dull, consistent background noise as part of the target sounds. Imagine a city skyline – the most identifiable parts are the tops of buildings, not the middle floors, and that’s what you can see from farthest away.

Every second of every song is stripped down to just a few of the most intense data points; everything on the city skyline is removed except the very top. But that’s still not quite efficient enough to be immediately searchable, so the next step is to “hash” this sequence of peaks. Hashing simply takes a set of inputs, runs them through an algorithm, and assigns them an integer output. In this case, the hash is generated by taking two of the high-intensity peaks, measuring the time between them, and adding their two frequencies together.

The result is a string of numbers, easily storable and searchable. When a computer reads this hash, it will recognize them as representing frequency and time-distance. Once all the peaks in the song have been identified and hashed, the transformation is complete: the song now has a unique 32-bit number that serves as its ID in the database. More importantly, every second of the song is represented by the numbers.

When your phone hears music, it goes through this exact process: it filters out everything but the highest points, hashes them, and creates a fingerprint for the few seconds it has recorded. Once this is complete, your phone just needs to see where the corresponding strings of numbers appear in the database, allowing it to match the detected frequencies and timing to the correct song and returning it to you in seconds.

This technology has been most widely used for music recognition, but sound recognition apps can also work with movies, commercials, TV shows, bird songs, and more. Shazam and Soundhound are the most well known, butyou can also now ask Google what song is playing and get an accurate response.

So this is the entire process of how music identification apps work.

http://www.infinix.club/ng/forum/686/859929
Re: Know How Music-identification Apps Work by jetz(m): 6:54pm On Dec 05, 2018
U try o if only i could keep up
Re: Know How Music-identification Apps Work by teewhydope(m): 8:05pm On Dec 05, 2018
wao. understood it a little bit shocked

(1) (Reply)

Wanna Sell Or Swap My Brand New Umidigi S2 Lite For Umidigi One / Help My Facebook Account Was Hacked?!!! / Urgently Needed - Brand New Sony Xperia XZ3

(Go Up)

Sections: politics (1) business autos (1) jobs (1) career education (1) romance computers phones travel sports fashion health
religion celebs tv-movies music-radio literature webmasters programming techmarket

Links: (1) (2) (3) (4) (5) (6) (7) (8) (9) (10)

Nairaland - Copyright © 2005 - 2024 Oluwaseun Osewa. All rights reserved. See How To Advertise. 15
Disclaimer: Every Nairaland member is solely responsible for anything that he/she posts or uploads on Nairaland.