Skip to main content icon/video/no-internet

Database studies in music research, also known as corpus-based musicology, have their roots in the advent of digital encoding of music during the 1970s and are closely influenced by corpus-based work undertaken in linguistics. In music research, database studies are an example of a data-rich approach advocated by empirical musicology.

The central idea of database studies is that the research question is formulated in such a way that the answers can be obtained by analyzing large quantities of materials coded in a systematic way into a database. Databases usually consist of music coded in a particular way and the relevant metadata but can also consist of musical behavior or historical information about the sources.

The central benefits of database studies are transparency, impartiality, and generalizibility in comparison to the conventional music scholarship that usually operates using a few handpicked case examples. In database studies, transparency is achieved in the way databases are structured and through unambiguous definition of the fields and objects of databases. This allows the analyst to accurately define the objects of music that can be subjected to analysis. In principle, anyone can replicate the results of a database study using an identical set of retrieval commands and the database. In comparison with traditional music analysis, which values intuition, stylistic knowledge, and possibly subtle judgments about the aesthetic value of the analysis content, database studies can be considered to be more impartial in the analysis process. This is not to say that intuition and stylistic knowledge are not important in database studies but these need to be explicitly coded in the queries made for the database. Generalizibility of database studies refers to the possibility of identifying common factors in the music of a given collection due to the possibility of processing much larger samples of music than is possible in a conventional analysis.

Databases Types and Tools

In music research, the types of databases can be broadly divided into (1) visual, (2) audio, (3) editorial, and (4) crow sourced.

Visual data represents conventional notation in its many forms (score, tabulature) as well as the most computer-friendly symbolic representations of music (MIDI, kern, MuseData). These latter representations are often referred to as symbolically encoded music, where note symbols and other information often conveyed by notation have been encoded in a machine-readable format. The benefit of this encoding is that it captures the common notational aspects of music, is extremely compact representation, and can be converted into notation, tabulature, or audible form using music software. The downside is that the notational schemes may not be adequate to capture the important aspects of music from many traditions.

Audio representation encodes the complex waveform of a music performance using uncompressed (WAV or AIFF) or compressed (MPEG or MP3) formats. While the audio captures all nuances of the actual performance, extracting meaningful analytic elements from audio is often problematic. Various low-level features such as the spectral centroid can be reliably extracted, but identifying the exact pitches on a polyphonic material is a task not reliably undertaken by computational algorithms.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading