Thursday, January 17, 2008

MP3

MPEG-1 Audio Layer 3, more commonly referred to as MP3, is a digital audio encoding format.

This encoding format is used to create an MP3 file, a way to store a single segment of audio, commonly a song, so that it can be organized or easily transferred between computers and other devices such as MP3 players.

MP3 uses a lossy compression algorithm that is designed to greatly reduce the amount of data required to represent the audio recording, yet still sound like a faithful reproduction of the original uncompressed audio to most listeners. An MP3 digital file created using the mid-range bitrate setting of 128 kbit/s results in a file that is typically about 1/10th the size of the digital data found on an audio CD.

MP3 is an audio-specific format. It was invented by a team of international engineers at Philips, CCETT (Centre commun d'études de télévision et télécommunications), IRT, AT&T-Bell Labs and Fraunhofer Society, and it became an ISO/IEC standard in 1991. The compression works by reducing accuracy of certain parts of sound that are deemed beyond the auditory resolution ability of most people. This method is commonly referred to as Perceptual Coding [1].

It provides a representation of sound within the frequency domain, by using psychoacoustic models to discard or reduce precision of components less audible to human hearing, and recording the remaining information in an efficient manner. This is relatively similar from the principles used by, say, JPEG, an image compression format.

File structure

An MP3 file is made up of multiple MP3 frames, which consist of the MP3 header and the MP3 data. This sequence of frames is called an Elementary stream. Frames are not independent items ("byte reservoir") and therefore cannot be extracted on arbitrary frame boundaries. The MP3 data is the actual audio payload. The diagram shows that the MP3 header consists of a sync word, which is used to identify the beginning of a valid frame. This is followed by a bit indicating that this is the MPEG standard and two bits that indicate that layer 3 is being used, hence MPEG-1 Audio Layer 3 or MP3. After this, the values will differ depending on the MP3 file. ISO/IEC 11172-3 defines the range of values for each section of the header along with the specification of the header. Most MP3 files today contain ID3 metadata, which precedes or follows the MP3 frames; this is also shown in the diagram.