Online music teaching resources

Today is my last day of in-person teaching at the New School for the near (foreesable?) future, and NYU has already moved classes online. Many of you are probably in the same boat. In this post, I share my curated collection of online music teaching, learning and creation resources, and my plan for my classes going forward. This is not a comprehensive list of online resources, just the ones that I myself use.

Big collections

A spreadsheet of online music theory resources and projects, plus my New School syllabus that uses many of these things.

A spreadsheet of online music technology resources and projects.

The NYU Steinhardt Music Experience Design Lab has a variety of interesting web interactives. Two notable ones:

The Groove Pizza

Groove Pizza logo

Some activities and examples:

The aQWERTYon

Some useful links:

Other recommended sites

I will be adding more materials here over the coming weeks, so check back! Wash your hands and stay safe.

The Groove Pizzeria

For his NYU music technology masters thesis, Tyler Bisson created a web app called Groove Pizzeria, a polyrhythmic/polymetric extension of the Groove Pizza. Click the image to try it for yourself.

<img data-attachment-id="18497" data-permalink="http://www.ethanhein.com/wp/2019/the-groove-pizzeria/groove-pizzaria/#main" data-orig-file="https://i2.wp.com/www.ethanhein.com/wp/wp-content/uploads/2019/05/Groove-Pizzaria.png?fit=2529%2C1458" data-orig-size="2529,1458" data-comments-opened="1" data-image-meta="{"aperture":"0","credit":"","camera":"","caption":"","created_timestamp":"0","copyright":"","focal_length":"0","iso":"0","shutter_speed":"0","title":"","orientation":"0"}" data-image-title="Groove Pizzeria" data-image-description="

Groove Pizzeria

” data-medium-file=”https://i2.wp.com/www.ethanhein.com/wp/wp-content/uploads/2019/05/Groove-Pizzaria.png?fit=300%2C173″ data-large-file=”https://i2.wp.com/www.ethanhein.com/wp/wp-content/uploads/2019/05/Groove-Pizzaria.png?fit=680%2C392″ class=”alignnone size-large wp-image-18497″ src=”https://i2.wp.com/www.ethanhein.com/wp/wp-content/uploads/2019/05/Groove-Pizzaria.png?resize=680%2C392″ alt=”” width=”680″ height=”392″ data-recalc-dims=”1″ />

Note that the Groove Pizzeria is still a prototype, and it doesn’t yet have the full feature set that the Groove Pizza does. As of this writing, there are no presets, no saving, no exporting of audio or MIDI, and no changing drum kits. You can record the Groove Pizzeria’s output using Audio Hijack, however.

Like the Groove Pizza, the Groove Pizzeria is based on the idea of the rhythm necklace, a circular representation of musical rhythm. The Groove Pizza is a set of three concentric rhythm necklaces, each of which controls one drum sound, e.g. kick, snare and hi-hat. The Groove Pizzeria gives you two sets of concentric rhythm necklaces, each of which can have its own time duration and subdivisions. This means that you can use the Groove Pizzeria to make polyrhythm and polymeter.

The words “polyrhythm” and “polymeter” are frequently used interchangeably, but they are not the same thing. Tyler’s thesis contains the clearest definition of the terms that I know of, which I paraphrase here.

  • Polyrhythm is two or more concurrent loops of equal duration. Each loop consists of a set of evenly-spaced subdivisions or rhythmic onsets. The loops contain different numbers of onsets, meaning that the subdivisions of each loop are not same length. Finally, the ratio of the number of onsets in each loop is not a whole number (otherwise one loop would just be an even subdivision of the other). When people talk about 4:3 or 5:2 polyrhythm, this is what they mean. In Western music, polyrhythms usually only occur for short time spans in the form of tuplets, but in West African drumming, polyrhythms are a core structural feature. 
  • Polymeter is two or more concurrent loops of different duration. The onsets in each loop have the same duration, but each loop has a different number of onsets. This is much more common in Western music than polyrhythm. In Western music, you mostly see polymeter over short time spans in the form of hemiola or syncopation.

With these two definitions in mind, let’s take a look at the Groove Pizzeria interface. For each loop, you can control both the number of subdivisions (the number of onsets) in each loop and the length (duration) of each subdivision. The basic time unit in the Groove Pizzeria is one sixteenth note. Each of the “teeth” on the outer radius of each circle represents the duration of one sixteenth note. If you change the Time Units setting, you make the sixteenth notes shorter, and the radius of the circle gets smaller to preserve the cumulative distances between each tooth of the loop. The easiest way to understand the difference is just to draw some rhythm patterns on the grid, play with the sliders, and see what happens. Notice that the Groove Pizzeria visualizes the compound pattern formed by the two loops in the top left corner of the screen.

Here’s a 5:4 polyrhythm created by taking two loops that are the same length and dividing them into five and four steps respectively:

Simple 5 against 4 polyrhythm on the Groove Pizzeria

If you want a 5:4 polymeter rather than a polyrhythm, then you will need to adjust the number of time units in each loop as well. (The patterns aren’t perfectly symmetric so you can hear where they start and end.)

Simple 5 against 4 polymeter

Here’s a less exotic sound, a 4:3 polymeter, also known as hemiola. On the left is a 4/4 hip-hop pattern. On the right, I made a 12-beat-long pattern that repeats four times in the same amount of time as it takes the hip-hip pattern to repeat three times.

4 vs 3 polymeter, also known as hemiola

Here’s a less familiar sound, an 11:5 polyrhythm. On the left, I made the closest thing to a hip-hop pattern that’s possible in 11/8 time, and on the right I made a simple quintuplet pattern. This will probably sound weird to you at first, but if you listen to it for a while, it will eventually start to make a wonky kind of sense.

11 against 5 polyrhythm

How about some real-world examples? Genuine polyrhythm is unusual in popular music, but it’s not unheard of. James Blake uses a quintuplet hi-hat pattern in his song “Unluck.”

Here’s my Groove Pizzeria representation of this beat. On the left is the kick and snare playing a straight quarter note pattern in 4/4, and on the right is the hi-hat pattern (though it’s not playing back on a hi-hat sound.)

Hip-hop producers sometimes use polyrhythms to create specific varieties of swing. On drum machines, swing (sometimes called shuffle) shortens and lengthens each alternate beat. At zero swing, also known as 1:1 swing, the beats within each pair are the same length. At maximum swing, the first beat in each pair will be twice as long as the second beat in the pair. This is known as 2:1 swing, sometimes called “triplet swing” because it’s as if the first beat is two triplets long, while the second is one triplet long. In real life, you usually want your swing setting somewhere between these two extremes. (Click here for a more detailed explanation of swing.)

One way to get a swing ratio in between 1:1 and 2:1 is to use a quintuplet grid. If you think of the first three quintuplets in each group as being one “beat” in a pair and the last two as being the “beat” in the pair, you get the equivalent of 5:3 swing. Slynk explains how to set this up in Ableton:

Here’s a neo soul groove I made using pentuplet swing:

Neo soul pentuplet swing groove

For an even narrower swing ratio, you can use septuplet swing. It’s the same idea, except now you’re grouping together the first four septuplets into one “beat” in the pair, and the last three septuplets into the other “beat”. This gives you a 4:3 swing ratio. This is pretty close to no swing at all, but it’s noticeably “off,” in a way that gives you a nice J Dilla “drunken drummer” feel. Slynk explains again:

Here’s a neo soul groove I made using septuplet swing:

Neo soul septuplet swing groove

Beyond complex rhythms, the Groove Pizzeria can teach another useful musical concept called event fusion. When a rhythm gets fast enough, you stop hearing individual beats and start to hear a continuous thrum. The transition happens at around twenty beats per second. If you play the rhythm even faster, the thrum becomes a steady pitch, and the higher the tempo, the faster the pitch. Here’s how you can experiment with event fusion yourself. First, put a clap on every sixteenth note. Next, reduce the number of time units to a small number (5 is fine) and set the tempo to 300 bpm. Now reduce the number of steps. Listen for the point when the claps fuse into a single tone. You can control the pitch of this tone by changing the number of steps.

Event fusion at extreme tempo

If you think of more interesting music learning or creation applications for the Groove Pizzeria, please let me know. Happy drumming!

Chord pizzas

The Groove Pizza uses geometry to help visualize rhythms. The MusEDLab is planning to create a similar tool for visualizing music theory by merging the aQWERTYon with the Scale Wheel. When you put the twelve pitch classes in a circle, you can connect the dots between different notes in a chord or scale to form shapes. My hypothesis is that seeing these shapes along with hearing the notes will help people learn music theory more easily. In this post, I’ll talk through some concept images.

First, let’s look at two different ways to represent the pitch classes on a circle. On the left is the chromatic circle, showing the notes in the order of pitch height (the way they are on a piano keyboard.) On the right is the circle of fifths. These two circles have an interesting relationship: the circle of fifths is the involute of the chromatic circle. Notice that C, D, E, G-flat, A-flat and B-flat are in the same places on both circles, while the other six notes trade places across the circle. Pretty cool!

The chromatic circle and the circle of fifths

The colors represent the harmonic function of each note relative to the root C. Purple notes are perfect (neither major nor minor.) Green notes are major or natural. Blue notes are minor or flatted. You could technically think of, say, B-flat as being the sharp sixth rather than the flat seventh, but that usage is rare in real life. G-flat is a special case–it’s equally likely to be the sharp fourth or flat fifth. I represented this ambiguity by making it blue-green. (We could make it blue if we knew it was flat fifth from Locrian mode, or green if it was the sharp fourth from Lydian mode.)

Once the Scale Wheel and aQWERTYon get combined, then whenever you play more than one note at a time, they will be connected on the circle. Here are some common chord progressions, and what their shapes can tell us about how they function. First, let’s look at the I-vi-ii-V jazz turnaround in C major.

Major scale chords

Seeing things on the circle really helps you understand the voice leading. You can see how the notes move very little from one chord to the next. To get from Cmaj7 to Am7, you just move the B to A while keeping the other three notes the same. To get from Am7 to Dm7, you move the G to F and the E to D while keeping the other two notes the same. To get from Dm7 to G7, you move the A to G and the C to B while keeping the other two notes the same. Finally, to get from G7 back to Cmaj7, you move the D to C and the F to E while keeping the other two notes the same. In general, any chord you can produce by moving the notes as little as possible from the current chord is likely to sound smooth and logical.

The pitch circle doesn’t represent musical “real life” perfectly–while pitch classes are circular, actual notes belong to specific octaves. That makes the voice leading harder to figure out, because you will need to introduce some jumps or additional chord voices to make it work. That said, thinking in terms of pitch class rather than pitch makes it easier to learn the concept; then you can work out the logistics of voice leading actual pitches from a place of understanding.

Next, let’s look at the Mixolydian mode I-bVII-IV-I turnaround that’s ubiquitous in rock, e.g. the “na na na” section in the Beatles’ “Hey Jude.”

Mixoydian mode chords

The circle of fifths view is more clear here. Getting from the Bb to the F is just a matter of rotating the little triangle clockwise by one slot. If you voice the C7 chord like a jazz musician and leave out the G, then the voice leading in this progression becomes exquisitely clear and simple.

Finally, here’s a more exotic-sounding progression from Phrygian dominant, the I-bvii you hear in Middle Eastern and Jewish music like “Hava Nagilah.”

Phrygian dominant mode chords

Seeing these chords on the circle of fifths is not very enlightening–while Western functional harmony keeps things close together on the circle of fifths, non-Western harmony jumps around a lot more. But on the chromatic circle, you can see exactly what’s happening: To get from C7 to Bb-7, B-flat stays the same, but all the other notes move one scale degree clockwise. To get from Bb-7 back to C7, B-flat stays the same while the other notes move one scale degree counterclockwise. This is very close to the way I conceptualize this progression in my head. It’s like the notes in Bb-7 are lifting or pulling away from their homes in C7, and when you release them, they snap back into place. You could also think of this progression as being iv-V7 in the key of F minor, in which case the Bb-7 is acting more like C7sus(b9 #5). Here the suspension metaphor makes even more sense.

Beyond the fact that it looks cool, seeing geometric representations of music gives you insight into why it works the way it does. The main insight you get from the circles is that perfect symmetry is boring. On the Groove Pizza, squares and equilateral triangles produce steady isochronous rhythms, like the four on the floor kick drum pattern. These rhythms are musical, but they’re boring, because they’re perfectly predictable. The more exciting rhythms come from shapes that don’t evenly fit the metrical grid. On a sixteen-step grid, pentagons produce clave patterns, while hexagons make habanera and tresillo.

The same concept applies to the pitch wheel. A square on the pitch wheel is a diminished seventh chord; an equilateral triangle is an augmented triad; and a hexagon is a whole tone scale. (Interestingly, this is true both on the chromatic circle and the circle of fifths.) These sounds are fine for occasional use or special effects, but they get tedious very quickly if you repeat them too much. By contrast, the harmonic devices we use most commonly, like major and minor triads and seventh chords, are uneven and asymmetrical. The same uneven seven-sided figure produces the major scale and its modes on the pitch wheel, and the “standard bell pattern” on the Groove Pizza. Food (ha) for thought.

The aQWERTYon pitch wheels and the future of music theory visualization

The MusEDLab will soon be launching a revamped version of the aQWERTYon with some enhancements to its visual design, including a new scale picker. Beyond our desire to make our stuff look cooler, the scale picker represents a challenge that we’ve struggled with since the earliest days of aQW development. On the one hand, we want to offer users a wide variety of intriguing and exotic scales to play with. On the other hand, our audience of beginner and intermediate musicians is likely to be horrified by a list of terms like “Lydian dominant mode.” I recently had the idea to represent all the scales as colorful icons, like so:

Read more about the rationale and process behind this change here. In this post, I’ll explain what the icons mean, and how they can someday become the basis for a set of new interactive music theory visualizations.

Musical pitches rise and fall linearly, but pitch class is circular. When you go up or down the chromatic scale, the note names “wrap around” every twelve notes. This naming convention reflects the fact that we hear notes an octave apart as being “the same”, probably because they share so many overtones. (Non-human primates hear octaves as being equivalent too.)

chromatic circle

The note names and numbers are all based on the C major scale, which is Western music’s “default setting.” The scale notes C, D, E, F, G, A and B (the white keys on the piano) are the “normal” notes. (Why do they start on C and not A? I have no idea.) You get D-flat, E-flat, G-flat, A-flat and B-flat (the black keys on the piano) by lowering (flatting) their corresponding white key notes. Alternately, you can get the black key notes by raising or sharping the white key notes, in which case they’ll be called C-sharp, D-sharp, F-sharp, G-sharp, and A-sharp. (Let’s just briefly acknowledge that the imagery of the “normal” white and “deviant” black keys is just one of many ways that Western musical culture is super racist, and move on.)

You can represent any scale on the chromatic circle just by “switching” notes on and off. For example, if you activate the notes C, D, E-flat, F, G, A-flat and B, you get C harmonic minor. (Alternatively, you could just deactivate D-flat, E, G-flat, A, and B-flat.) Here’s how the scale looks when you write it this way:

C harmonic minor - monochrome

This is how I conceive scales in my head, as a pattern of activated and deactivated chromatic scale notes. As a guitarist, it’s the most intuitive way to think about them, because each box on the circular grid corresponds to a fret, so you can read the fingering pattern right off the circle. When I think “harmonic minor,” I don’t think of note names, I think “pattern of notes and gaps with one unusually wide gap.”

Another beauty of the circle view is that you can get the other eleven harmonic minor scales just by rotating the note names while keeping the pattern of activated/deactivated notes the same. If I want E-flat harmonic minor, I just have to grab the outer ring and rotate it counterclockwise a few notches:

E-flat harmonic minor

My next thought was to color-code the scale tones to give an indication of their sound and function:

C harmonic minor scale necklace

Here’s how the color scheme works:

  • Green – major, natural, sharp, augmented
  • Blue – minor, flat, diminished
  • Purple – perfect (neither major nor minor)
  • Grey – not in the scale

Scales with more green in them sound “happier” or brighter. Scales with more blue sound “sadder” or darker. Scales with a mixture of blue and green (like harmonic minor) will have a more complex and ambiguous feeling.

My ambition with the pitch wheels is not just to make the aQWERTYon’s scale menu more visually appealing. I’d eventually like to have it be an interactive way to visualize chords too. Followers of this blog will notice a strong similarity between the circular scale and the rhythm necklaces that inspired the Groove Pizza. Just like symmetries and patterns on the rhythm necklace can tell you a lot about how beats work, so too can symmetries and patterns on the scale necklace can tell you how harmony works. So here’s my dream for the aQWERTYon’s future theory visualization interface. If you load the app and set it to C harmonic minor, here’s how it would look. To the right is a staff notation view with the appropriate key signature.

When you play a note, it would change color on the keyboard and the wheel, and appear on the staff. The app would also tell you which scale degree it is (in this case, seven.)

If you play two notes simultaneously, in this case the third and seventh notes in C Mixolydian mode, the app would draw a line between the two notes on the circle:

If you play three notes at a time, like the first, fourth and fifth notes in C Lydian, you’d get a triangle.

If your three notes spell out a chord, like the second, fourth and sixth notes in C Phrygian mode, the app would recognize it and shows the chord symbol on the staff.

The pattern continues if you play four notes at a time:

Or five notes at a time:

By rotating the outer ring of the pitch wheel, you could change the root of the scale, like I showed above with C harmonic minor. And if you rotated the inner ring, showing the scale degrees, you could get different modes of the scale. Modes are one of the most difficult concepts in music theory. That is, they’re difficult until you learn to imagine them as rotations of the scale necklace, at which point they become nothing harder than a memorization exercise.

I’m designing this system to be used with the aQWERTYon, but there’s no reason it couldn’t take ordinary MIDI input as well. Wouldn’t it be nice to have this in a window in your DAW or notation program?

Music theory is hard. There’s a whole Twitter account devoted to retweeting students’ complaints about it. Some of this difficulty is due to the intrinsic complexity of modern harmony. But a lot of it is due to terminology and notation. Our naming system for notes and chords is a set of historically contingent kludges. No rational person would design it this way from the ground up. Thanks to path dependency, we’re stuck with it, much like we’re stuck with English grammar and the QWERTY keyboard layout. Fortunately, technology gives us a lot of new ways to make all the arcana more accessible, by showing multiple representations simultaneously and by making those representations discoverable through playful tinkering.

Do you find this idea exciting? Would you like it to be functioning software, and not just a bunch of flat images I laboriously made by hand? Help the MusEDLab find a partner to fund the developer and designer time. A grant or gift would work, and we’d also be open to exploring a commercial partnership. The aQW has been a labor of volunteer love for the lab so far, and it’s already one of the best music theory pedagogy tools on the internet. But development would go a lot faster if we could fund it properly. If you have ideas, please be in touch!

Update: Will Kuhn’s response to this post.

Groove challenges with the Groove Pizza

One of our key design principles at the NYU MusEDLab is not to confront beginners with a blank canvas. We want to introduce people to our tools by giving them specific, real-world music to play around with. That was the motivation behind creating presets for the aQWERTYon, and a similar impulse informs Ableton’s approach to their online music tutorials. The Groove Pizza comes with some preset patterns (specials), but there aren’t direct prompts for creative beatmaking. This post introduces some prototype prompts.

Groove Pizza logo

The Funky Drummer boom-bap challenge

The pattern below is the first quarter note of the kick and snare pattern in Clyde Stubblefield’s classic drum break. Fill in the missing kick and snare hits to make your own golden age breakbeat. Try removing some hi-hats as well.

Musical inspiration:

The Levee break asymmetrical kick challenge

The groove below uses the kick and snare pattern from “When The Levee Breaks” by Led Zeppelin. Add hi-hats and customize the kick and snare to best convey the Awesome Majesty of Rock.

Musical inspiration:

Four-on-the-floor squares challenge

These two squares make a classic dance beat, kicks on the quarter notes with hi-hats in between. Add snares and break up the symmetry to make a dance floor filler.

Musical inspiration:

So Fresh So Clean challenge

The pattern below is the basis for a sixteenth note hip-hop groove. Place more kicks and snares to make a crunk Dirty South beat in the spirit of OutKast.

Musical inspiration:

It’s A Trap challenge

The pattern below is the basis for a thirty-second note groove. Add kicks and snares and remove hi-hats to make a radio-friendly trap beat.

Musical inspiration: I would include a link to a Future song but can’t find one that whose lyrics aren’t extremely objectionable. Just turn on the radio.

Deconstructing the bassline in Herbie Hancock’s “Chameleon”

If you have even a passing interest in funk, you will want to familiarize yourself with Herbie Hancock’s “Chameleon.” And if you are preoccupied and dedicated to the preservation of the movement of the hips, then the bassline needs to be a cornerstone of your practice.

Chameleon - circular bass

Here’s a transcription I did in Noteflight – huge props to them for recently introducing sixteenth note swing.

And here’s how it looks in the MIDI piano roll:

The “Chameleon” bassline packs an incredible amount of music into just two bars. To understand how it’s put together, it’s helpful to take a look at the scale that Herbie built the tune around, the B-flat Dorian mode. Click the image below to play it on the aQWERTYon. I recommend doing some jamming with it over the song before you move on.

B-flat Dorian

Fun fact: this scale contains the same pitches as A-flat major. If you find that fact confusing, then feel free to ignore it. You can learn more about scales and modes in my Soundfly course.

The chord progression

The opening section of “Chameleon” is an endless loop of two chords, B♭-7 and E♭7. You build both of them using the notes in B-flat Dorian. To make B♭-7, start on the root of the scale, B-flat. Skip over the second scale degree to land on the third, D-flat. Skip over the fourth scale degree to land on the fifth, F. Then skip over the sixth to land on the seventh, A-flat. If you want to add extensions to the chord, just keep skipping scale degrees, like so:

B-flat Dorian mode chords

To make E♭7, you’re going to use the same seven pitches in the same order, but you’re going to treat E-flat as home base rather than B-flat. You could think of this new scale as being E-flat Mixolydian, or B-flat Dorian starting on E-flat; they’re perfectly interchangeable. Click to play E-flat Mixolydian on the aQWERTYon. You build your E♭7 chord like so:

B-flat Dorian mode chords on E-flat

Once you’ve got the sound of B♭-7 and E♭7 in your head, let’s try an extremely simplified version of the bassline.

Chord roots only

At the most basic level, the “Chameleon” bassline exists to spell out the chord progression in a rhythmically interesting way. (This is what all basslines do.) Here’s a version of the bassline that removes all of the notes except the ones on the first beat of each bar. They play the roots of the chords, B-flat and E-flat.

That’s boring, but effective. You can never go wrong playing chord roots on the downbeat.

Simple arpeggios

Next, we’ll hear a bassline that plays all of the notes in B♭-7 and E♭7 one at a time. When you play chords in this way, they’re called arpeggios.

The actual arpeggios

The real “Chameleon” bassline plays partial arpeggios–they don’t have all of the notes from each chord. Also, the rhythm is a complicated and interesting one.

Below, you can explore the rhythm in the Groove Pizza. The orange triangle shows the rhythm of the arpeggio notes, played on the snare. The yellow quadrilateral shows the rhythm of the walkups, played on the kick–we’ll get to those below.

The snare rhythm has a hit every three sixteenth notes. It’s a figure known in Afro-Latin music as tresillo, which you hear absolutely everywhere in all styles of American popular and vernacular music. Tresillo also forms the front half of the equally ubiquitous son clave. (By the way, you can also use the Groove Pizza to experiment with the “Chameleon” drum pattern.)

As for the pitches: Instead of going root-third-fifth-seventh, the bassline plays partial arpeggios. The figure over B♭-7 is just the root, seventh and root again, while the one over E♭7 is the root, fifth and seventh.

Adding the walkups

Now let’s forget about the arpeggios for a minute and go back to just playing the chord roots on the downbeats. The bassline walks up to each of these notes via the chromatic scale, that is, every pitch on the piano keyboard.

Chromatic walkups are a great way to introduce some hip dissonance into your basslines, because they can include notes that aren’t in the underlying scale. In “Chameleon” the walkups include A natural and D natural. Both of these notes sound really weird if you sustain them over B-flat Dorian, but in the context of the walkup they sound perfectly fine.

Putting it all together

The full bassline consists of the broken arpeggios anticipated by the walkups.

If you’re a guitarist or bassist, you can play this without even shifting position. Use your index on the third fret, your middle on the fourth fret, your ring on the fifth fret, and your pinkie on the sixth fret.

              .          . .
G|----------.-3----------3-6--|
D|----------6-----------------|
A|---------------3-4-5-6------|
E|--3-4-5-6-------------------|

If you’ve got this under your fingers, maybe you’d like to figure out the various keyboard and horn parts. They aren’t difficult, but you’ll need one more scale, the B-flat blues scale. Click the image to jam with it over the song and experience how great it sounds.

B-flat blues

There you have it, one of the cornerstones of funk. Good luck getting it out of your head!

Freedom ’90

Since George Michael died, I’ve been enjoying all of his hits, but none of them more than this one. Listening to it now, it’s painfully obvious how much it’s about George Michael’s struggles with his sexual orientation. I wonder whether he was being deliberately coy in the lyrics, or if he just wasn’t yet fully in touch with his identity. Being gay in the eighties must have been a nightmare.

This is the funkiest song that George Michael ever wrote, which is saying something. Was he the funkiest white British guy in history? Quite possibly. 

The beat

There are five layers to the drum pattern: a simple closed hi-hat from a drum machine, some programmed bongos and congas, a sampled tambourine playing lightly swung sixteenth notes, and finally, once the full groove kicks in, the good old Funky Drummer break. I include a Noteflight transcription of all that stuff below, but don’t listen to it, it sounds comically awful.

George Michael uses the Funky Drummer break on at least two of the songs on Listen Without Prejudice Vol 1. Hear him discuss the break and how it informed his writing process in this must-watch 1990 documentary.

The intro and choruses

Harmonically, this is a boilerplate C Mixolydian progression: the chords built on the first, seventh and fourth degrees of the scale. You can hear the same progression in uncountably many classic rock songs.

C Mixolydian chords

For a more detailed explanation of this scale and others like it, check out Theory For Producers.

The rhythm is what makes this groove so fresh. It’s an Afro-Cuban pattern full of syncopation and hemiola. Here’s an abstraction of it on the Groove Pizza. If you know the correct name of this rhythm, please tell me in the comments!

The verses

There’s a switch to plain vanilla C major, the chords built on the fifth, fourth and root of the scale.

C major chords

Like the chorus, this is standard issue pop/rock harmonically speaking, but it also gets its life from a funky Latin rhythm. It’s a kind of clave pattern, five hits spread more or less evenly across the sixteen sixteenth notes in the bar. Here it is on the Groove Pizza.

The prechorus and bridge

This section unexpectedly jumps over to C minor, and now things get harmonically interesting. The chords are built around a descending chromatic bassline: C, B, B-flat, A. It’s a simple idea but with complicated implications, because it implies four chords built on three different scales between them. First, we have the tonic triad in C natural minor, no big deal there. Next comes the V chord in C harmonic minor. Then we’re back to C natural minor, but with the seventh in the bass. Finally, we go to the IV chord in C Dorian mode. Really, all that we’re doing is stretching C natural minor to accommodate a couple of new notes, B natural in the second chord, and A natural in the fourth one.

C minor - descending chromatic bassline

The rhythm here is similar but not identical to the clave-like pattern in the verse–the final chord stab is a sixteenth note earlier. See and hear it on the Groove Pizza.

I don’t have the time to transcribe the whole bassline, but it’s absurdly tight and soulful. The album credits list bass played both by Deon Estus and by George Michael himself. Whichever one of them laid this down, they nailed it.

Song structure

“Freedom ’90” has an exceedingly peculiar structure for a mainstream pop song. The first chorus doesn’t hit until almost two minutes in, which is an eternity–most pop songs are practically over that that point. The graphic below shows the song segments as I marked them in Ableton.

Freedom '90 structure

The song begins with a four bar instrumental intro, nothing remarkable about that. But then it immediately moves into an eight bar section that I have trouble classifying. It’s the spot that would normally be occupied by verse one, but this part uses the chorus harmony and is different from the other verses. I labeled it “intro verse” for lack of a better term. (Update: upon listening again, I realized that this section is the backing vocals from the back half of the chorus. Clever, George Michael!) Then there’s an eight bar instrumental break, before the song has really even started. George Michael brings you on board with this unconventional sequence because it’s all so catchy, but it’s definitely strange.

Finally, twenty bars in, the song settles into a more traditional verse-prechorus-chorus loop. The verses are long, sixteen bars. The prechorus is eight bars, and the chorus is sixteen. You could think of the chorus as being two eight bar sections, the part that goes “All we have to do…” and the part that goes “Freedom…” but I hear it as all one big section.

After two verse-prechorus-chorus units, there’s a four bar breakdown on the prechorus chord progression. This leads into sixteen bar bridge, still following the prechorus form. Finally, the song ends with a climactic third chorus, which repeats and fades out as an outtro. All told, the song is over six minutes. That’s enough time (and musical information) for two songs by a lesser artist.

A word about dynamics: just from looking at the audio waveform, you can see that “Freedom ’90” has very little contrast in loudness and fullness over its duration. It starts sparse, but once the Funky Drummer loop kicks in at measure 13, the sound stays constantly big and full until the breakdown and bridge. These sections are a little emptier without the busy piano part. The final chorus is a little bigger than the rest of the song because there are more vocals layered in, but that still isn’t a lot of contrast. I guess George Michael decided that the groove was so hot, why mess with it by introducing contrast for the sake of contrast? He was right to feel that way.

The Groove Pizza now exports MIDI

Since its launch, you’ve been able to export your Groove Pizza beats as WAV files, or continue working on them in Soundtrap. But now, thanks to MusEDLab developer Jordana Bombi, you can also save your beats as MIDI files as well.

Groove Pizza MIDI export

You can bring these MIDI files into your music production software tool of choice: Ableton Live, Logic, Pro Tools, whatever. How cool is that?

There are a few limitations at the moment: your beats will be rendered in 4/4 time, regardless of how many slices your pizza has. You can always set the right time signature after you bring the MIDI into your production software. Also, your grooves will export with no swing–you’ll need to reinstate that in your software as well.

We have some more enhancements in the pipeline, aside from fixing the limitations just mentioned. We’re working on a “continue in Noteflight” feature, real-time MIDI input and output, and live performance using the QWERTY keyboard. I’ll keep you posted.

Design for Real Life – QWERTYBeats research

Writing assignment for Design For The Real World with Claire Kearney-Volpe and Diana Castro – research about a new rhythm interface for blind and low-vision novice musicians

Definition

I propose a new web-based accessible rhythm instrument called QWERTYBeats.Traditional instruments are highly accessible to blind and low-vision musicians. Electronic music production tools are not. I look at the history of accessible instruments and software interfaces, give an overview of current electronic music hardware and software, and discuss the design considerations underlying my project.

QWERTYBeats logo

Historical overview

Acoustic instruments give rich auditory and haptic feedback, and pose little obstacle to blind musicians. We need look no further for proof than the long history of iconic blind musicians like Ray Charles and Stevie Wonder. Even sighted instrumentalists rarely look at their instruments once they have attained a sufficient level of proficiency. Music notation is not accessible, but Braille notation has existed since the language’s inception. Also, a great many musicians both blind and sighted play entirely by ear anyway.

Most of the academic literature around accessibility issues in music education focuses on wider adoption of and support for Braille notation. See, for example, Rush, T. W. (2015). Incorporating Assistive Technology for Students with Visual Impairments into the Music Classroom. Music Educators Journal, 102(2), 78–83. For electronic music, notation is rarely if ever a factor.

Electronic instruments pose some new accessibility challenges. They may use graphical interfaces with nested menus, complex banks of knobs and patch cables, and other visual control surfaces. Feedback may be given entirely with LED lights and small text labels. Nevertheless, blind users can master these devices with sufficient practice, memorization and assistance. For example, Stevie Wonder has incorporated synthesizers and drum machines in most of his best-known recordings.

Most electronic music creation is currently done not with instruments, but rather using specialized software applications called digital audio workstations (DAWs). Keyboards and other controllers are mostly used to access features of the software, rather than as standalone instruments. The most commonly-used DAWs include Avid Pro Tools, Apple Logic, Ableton Live, and Steinberg Cubase. Mobile DAWs are more limited than their desktop counterparts, but are nevertheless becoming robust music creation tools in their own right. Examples include Apple GarageBand and Steinberg Cubasis. Notated music is commonly composed using score editing software like Sibelius and Finale, whose functionality increasingly overlaps with DAWs, especially in regard to MIDI sequencing.

DAWs and notation editors pose steep accessibility challenges due to their graphical and spatial interfaces, not to mention their sheer complexity. In class, we were given a presentation by Leona Godin, a blind musician who records and edits audio using Pro Tools by means of VoiceOver. While it must have taken a heroic effort on her part to learn the program, Leona demonstrates that it is possible. However, some DAWs pose insurmountable problems even to very determined blind users because they do not use standard operating system elements, making them inaccessible via screen readers.

Technological interventions

There are no mass-market electronic interfaces specifically geared toward blind or low-vision users. In this section, I discuss one product frequently hailed for its “accessibility” in the colloquial rather than blindess-specific sense, along with some more experimental and academic designs.

Ableton Push

Push layout for IMPACT Faculty Showcase

Ableton Live has become the DAW of choice for electronic music producers. Low-vision users can zoom in to the interface and modify the color scheme. However, Live is inaccessible via screen readers.

In recent years, Ableton has introduced a hardware controller, the Push, which is designed to make the software experience more tactile and instrument-like. The Push combines an eight by eight grid of LED-lit touch pads with banks of knobs, buttons and touch strips. It makes it possible to create, perform and record a piece of music from scratch without looking at the computer screen. In addition to drum programming and sampler performance, the Push also has an innovative melodic mode which maps scales onto the grid in such a way that users can not play a wrong note. Other comparable products exist; see, for example, the Native Instruments Maschine.

There are many pad-based drum machines and samplers. Live’s main differentiator is its Session view, where the pads launch clips: segments of audio or MIDI that can vary in length from a single drum hit to the length of an entire song. Clip launching is tempo-synced, so when you trigger a clip, playback is delayed until the start of the next measure (or whatever the quantization interval is.) Clip launching is a forgiving and beginner-friendly performance method, because it removes the possibility of playing something out of rhythm. Like other DAWs, Live also gives rhythmic scaffolding in its software instruments by means of arpeggiators, delay and other tempo-synced features.

The Push is a remarkable interface, but it has some shortcomings for blind users. First of all, it is expensive, $800 for the entry-level version and $1400 for the full-featured software suite. Much of its feedback is visual, in the form of LED screens and color-coded lighting on the pads. It switches between multiple modes which can be challenging to distinguish even for sighted users. And, like the software it accompanies, the Push is highly complex, with a steep learning curve unsuited to novice users, blind or sighted.

The aQWERTYon

Most DAWs enable users to perform MIDI instruments on the QWERTY keyboard. The most familiar example is the Musical Typing feature in Apple GarageBand.

GarageBand musical typing

Musical Typing makes it possible to play software instruments without an external MIDI controller, which is convenient and useful. However, its layout counterintuively follows the piano keyboard, which is an awkward fit for the computer keyboard. There is no easy way to distinguish the black and white keys, and even expert users find themselves inadvertantly hitting the keyboard shortcut for recording while hunting for F-sharp.

The aQWERTYon is a web interface developed by the NYU Music Experience Design Lab specifically intended to address the shortcomings of Musical Typing.

aQWERTYon screencap

Rather than emulating the piano keyboard, the aQWERTYon draws its inspiration from the chord buttons of an accordion. It fills the entire keyboard with harmonically related notes in a way that supports discovery by naive users. Specifically, it maps scales across the rows of keys, staggered by intervals such that each column forms a chord within the scale. Root notes and scales can be set from pulldown menus within the interface, or preset using URL parameters. It can be played as a standalone instrument, or as a MIDI controller in conjunction with a DAW. Here is a playlist of music I created using the aQWERTYon and GarageBand or Ableton Live:

The aQWERTYon is a completely tactile experience. Sighted users can carefully match keys to note names using the screen, but more typically approach the instrument by feel, seeking out patterns on the keyboard by ear. A blind user would need assistance loading the aQWERTYon initially and setting the scale and root note parameters, but otherwise, it is perfectly accessible. The present project was motivated in large part by a desire to make exploration of rhythm as playful and intuitive as the aQWERTYon makes exploring chords and scales.

Soundplant

The QWERTY keyboard can be turned into a simple drum machine quite easily using a free program called Soundplant. The user simply drags audio files onto a graphical key to have it triggered by that physical key. I was able to create a TR-808 kit in a matter of minutes:

Soundplant with 808 samples

After it is set up and configured, Soundplant can be as effortlessly accessible as the aQWERTYon. However, it does not give the user any rhythmic assistance. Drumming in perfect time is an advanced musical skill, and playing drum machine samples out of time is not much more satisfying than banging on a metal bowl with a spoon out of time. An ideal drum interface would offer beginners some of the rhythmic scaffolding and support that Ableton provides via Session view, arpeggiators, and the like.

The Groove Pizza

Drum machines and their software counterparts offer an alternative form of rhythmic scaffolding. The user sequences patterns in a time-unit box system or piano roll, and the computer performs those patterns flawlessly. The MusEDLab‘s Groove Pizza app is a web-based drum sequencer that wraps the time-unit box system into a circle.

Groove Pizza - Bembe

The Groove Pizza was designed to make drum programming more intuitive by visualizing the symmetries and patterns inherent in musical-sounding rhythms. However, it is totally unsuitable for blind or low-vision users. Interaction is only possible through the mouse pointer or touch, and there are no standard user interface elements that can be parsed by screen readers.

Before ever considering designing for the blind, the MusEDLab had already considered the Groove Pizza’s limitations for younger children and users with special needs: there is no “live performance” mode, and there is always some delay in feedback between making a change in the drum pattern and hearing the result. We have been considering ways to make a rhythm interface that is more immediate, performance-oriented and tactile. One possible direction would be to create a hardware version of the Groove Pizza; indeed, one of the earliest prototypes was a hardware version built by Adam November out of a pizza box. However, hardware design is vastly more complex and difficult than software, so for the time being, software promises more immediate results.

Haenselmann-Lemelson-Effelsberg MIDI sequencer

This experimental interface is described in Haenselmann, T., Lemelson, H., & Effelsberg, W. (2011). A zero-vision music recording paradigm for visually impaired people. Multimedia Tools and Applications, 5, 1–19.

Haenselmann-Lemelson-Effelsberg MIDI sequencer

The authors create a new mode for a standard MIDI keyboard that maps piano keys to DAW functions like playback, quantization, track selection, and so on. They also add “earcons” (auditory icons) to give sonic feedback when particular functions have been activated that normally only give graphical feedback. For example, one earcon sounds when recording is enabled; another sounds for regular playback. This interface sounds promising, but there are significant obstacles to its adoption. While the authors have released the source code as a free download, that requires a would-be user to be able to compile and run it. This is presuming that they could access the code in the first place; the download link given in the paper is inactive. It is an all-too-common fate of academic projects to never get widespread usage. By posting our projects on the web, the MusEDLab hopes to avoid this outcome.

Statement

Music education philosophy

My project is animated by a constructivist philosophy of music education, which operates by the following axiomatic assumptions:

  • Learning by doing is better than learning by being told.
  • Learning is not something done to you, but rather something done by you.
  • You do not get ideas; you make ideas. You are not a container that gets filled with knowledge and new ideas by the world around you; rather, you actively construct knowledge and ideas out of the materials at hand, building on top of your existing mental structures and models.
  • The most effective learning experiences grow out of the active construction of all types of things, particularly things that are personally or socially meaningful, that you develop through interactions with others, and that support thinking about your own thinking.

If an activity’s challenge level is beyond than your ability, you experience anxiety. If your ability at the activity far exceeds the challenge, the result is boredom. Flow happens when challenge and ability are well-balanced, as seen in this diagram adapted from Csikszentmihalyi.

Flow

Music students face significant obstacles to flow at the left side of the Ability axis. Most instruments require extensive practice before it is possible to make anything that resembles “real” music. Electronic music presents an opportunity here, because even a complete novice can produce music with a high degree of polish quickly. It is empowering to use technologies that make it impossible to do anything wrong; it frees you to begin exploring what you find to sound right. Beginners can be scaffolded in their pitch explorations with MIDI scale filters, Auto-Tune, and the configurable software keyboards in apps like Thumbjam and Animoog. Rhythmic scaffolding is more rare, but it can be had via Ableton’s quantized clip launcher, by MIDI arpeggiators, and using the Note Repeat feature on many drum machines.

QWERTYBeats proposal

My project takes drum machine Note Repeat as its jumping off point. When Note Repeat is activated, holding down a drum pad triggers the corresponding sound at a particular rhythmic interval: quarter notes, eighth notes, and so on. On the Ableton Push, Note Repeat automatically syncs to the global tempo, making it effortless to produce musically satisfying rhythms. However, this mode has a major shortcoming: it applies globally to all of the drum pads. To my knowledge, no drum machine makes it possible to simultaneously have, say, the snare drum playing every dotted eighth note while the hi-hat plays every sixteenth note.

I propose a web application called QWERTYBeats that maps drums to the computer keyboard as follows:

  • Each row of the keyboard triggers a different drum/beatbox sound (e.g. kick, snare, closed hi-hat, open hi-hat).
  • Each column retriggers the sample at a different rhythmic interval (e.g. quarter note, dotted eighth note).
  • Circles dynamically divide into “pie slices” to show rhythmic values.

The rhythm values are shown below by column, with descriptions followed by the time interval as shown as a fraction of the tempo in beats per minute.

  1. quarter note (1)
  2. dotted eighth note (3/4)
  3. quarter note triplet (2/3)
  4. eighth note (1/2)
  5. dotted sixteenth note (3/8)
  6. eighth note triplet (1/3)
  7. sixteenth note (1/4)
  8. dotted thirty-second note (3/16)
  9. sixteenth note triplet (1/6)
  10. thirty-second note (1/8)

By simply holding down different combinations of keys, users can attain complex syncopations and polyrhythms. If the app is synced to the tempo of a DAW or music playback, the user can perform good-sounding rhythms over any song that is personally meaningful to them.

The column layout leaves some unused keys in the upper right corner of the keyboard: “-“, “=”, “[“, “]”, “”, etc. These can be reserved for setting the tempo and other UI elements.

The app defaults to Perform Mode, but clicking Make New Kit opens Sampler mode, where users can import or record their own drum sounds:

  • Keyboard shortcuts enable the user to select a sound, audition it, record, set start and end point, and set its volume level.
  • A login/password system enables users to save kits to the cloud where they can be accessed from any computer. Kits get unique URL identifiers, so users can also share them via email or social media.

It is my goal to make the app accessible to users with the widest possible diversity of abilities.

  • The entire layout will use plain text, CSS and JavaScript to support screen readers.
  • All user interface elements can be accessed via the keyboard: tab to change the keyboard focus, menu selections and parameter changes via the up and down arrows, and so on.

Perform Mode:

QWERTYBeats concept images - Perform mode

Sampler Mode:

sampler-mode

Mobile version

The present thought is to divide up the screen into a grid mirroring the layout of the QWERTY keyboard. User testing will determine whether this will produce a satisfying experience.

QWERTYDrum - mobile

Prototype

I created a prototype of the app using Ableton Live’s Session View.

QWERTYBeats - Ableton prototype

Here is a sample performance:

There is not much literature examining the impact of drum programming and other electronic rhythm sequencing on students’ subsequent ability to play acoustic drums, or to keep time more accurately in general. I can report anecdotally that my own time spent sequencing and programming drums improved my drumming and timekeeping enormously (and mostly inadvertently.) I will continue to seek further support for the hypothesis that electronically assisted rhythm creation builds unassisted rhythmic ability. In the meantime, I am eager to prototype and test QWERTYBeats.

Why hip-hop is interesting

The title of this post is also the title of a tutorial I’m giving at ISMIR 2016 with Jan Van Balen and Dan Brown. The conference is organized by the International Society for Music Information Retrieval, and it’s the fanciest of its kind. You may be wondering what Music Information Retrieval is. MIR is a specialized field in computer science devoted to teaching computers to understand music, so they can transcribe it, organize it, find connections and similarities, and, maybe, eventually, create it.

So why are we going to talk to the MIR community about hip-hop? So far, the field has mostly studied music using the tools of Western classical music theory, which emphasizes melody and harmony. Hip-hop songs don’t tend to have much going on in either of those areas, which makes the genre seem like it’s either too difficult to study, or just too boring. But the MIR community needs to find ways to engage this music, if for no other reason than the fact that hip-hop is the most-listened to genre in the world, at least among Spotify listeners.

Hip-hop has been getting plenty of scholarly attention lately, but most of it has been coming from cultural studies. Which is fine! Hip-hop is culturally interesting. When humanities people do engage with hip-hop as an art form, they tend to focus entirely on the lyrics, treating them as a subgenre of African-American literature that just happens to be performed over beats. And again, that’s cool! Hip-hop lyrics have literary interest. If you’re interested in the lyrical side, we recommend this video analyzing the rhyming techniques of several iconic emcees. But what we want to discuss is why hip-hop is musically interesting, a subject which academics have given approximately zero attention to.

Much of what I find exciting (and difficult) about hip-hop can be found in Kanye West’s song “Famous” from his album The Life Of Pablo.

The song comes with a video, a ten minute art film that shows Kanye in bed sleeping after a group sexual encounter with his wife, his former lover, his wife’s former lover, his father-in-law turned mother-in-law, various of his friends and collaborators, Bill Cosby, George Bush, Taylor Swift, and Donald Trump. There’s a lot to say about this, but it’s beyond the scope of our presentation, and my ability to verbalize thoughts. The song has some problematic lyrics. Kanye drops the n-word in the very first line and calls Taylor Swift a bitch in the second. He also speculates that he might have sex with her, and that he made her famous. I find his language difficult and objectionable, but that too is beyond the scope. Instead, I’m going to focus on the music itself.

“Famous” has a peculiar structure, shown in the graphic below.

The track begins with a six bar intro, Rihanna singing over a subtle gospel-flavored organ accompaniment in F-sharp major. She’s singing few lines from “Do What You Gotta Do” by Jimmy Webb. This song has been recorded many times, but for Kanye’s listeners, the most significant one is by Nina Simone.

Next comes a four-bar groove, a more aggressive organ part over a drum machine beat, with Swizz Beatz exclaiming on top. The beat is a minimal funk pattern on just kick and snare, treated with cavernous artificial reverb. The organ riff is in F-sharp minor, which is an abrupt mode change so early in the song. It’s sampled from the closing section of “Mi Sono Svegliato E…Ho Chiuso Gli Occhi” by Il Rovescio della Medaglia, an Italian prog-rock band I had never heard of until I looked the sample up just now. The song is itself built around quotes of Bach’s Well-Tempered Clavier–Kanye loves sampling material built from samples.

Verse one continues the same groove, with Kanye alternating between aggressive rap and loosely pitched singing. Rap is widely supposed not to be melodic, but this idea collapses immediately under scrutiny. The border between rapping and singing is fluid, and most emcees cross it effortlessly. Even in “straight” rapping, though, the pitch sequences are deliberate and meaningful. The pitches might not fall on the piano keys, but they are melodic nonetheless.

The verse is twelve bars long, which is unusual; hip-hop verses are almost always eight or sixteen bars. The hook (the hip-hop term for chorus) comes next, Rihanna singing the same Jimmy Webb/Nina Simone quote over the F-sharp major organ part from the intro. Swizz Beatz does more interjections, including a quote of “Wake Up Mr. West,” a short skit on Kanye’s album Late Registration in which DeRay Davis imitates Bernie Mac.

Verse two, like verse one, is twelve bars on the F-sharp minor loop. At the end, you think Rihanna is going to come back in for the hook, but she only delivers the pickup. The section abruptly shifts into an F-sharp major groove over fuller drums, including a snare that sounds like a socket wrench. The lead vocal is a sample of “Bam Bam” by Sister Nancy, which is a familiar reference for hip-hop fans–I recognize it from “Lost Ones” by Lauryn Hill and “Just Hangin’ Out” by Main Source. The chorus means “What a bum deal.” Sister Nancy’s track is itself sample-based–like many reggae songs, it uses a pre-existing riddim or instrumental backing, and the chorus is a quote of the Maytals.

Kanye doesn’t just sample “Bam Bam”, he also reharmonizes it. Sister Nancy’s original is a I – bVII progression in C Mixolydian. Kanye pitch shifts the vocal to fit it over a I – V – IV – V progression in F-sharp major. He doesn’t just transpose the sample up or down a tritone; instead, he keeps the pitches close by changing their chord function. Here’s Sister Nancy’s original:

And here’s Kanye’s version:

The pitch shifting gives Sister Nancy the feel of a robot from the future, while the lo-fidelity recording places her in the past. It’s a virtuoso sample flip.

After 24 bars of the Sister Nancy groove, the track ends with the Jimmy Webb hook again. But this time it isn’t Rihanna singing. Instead, it’s a sample of Nina Simone herself.It reminds me of Kanye’s song “Gold Digger“, which includes Jamie Foxx imitating Ray Charles, followed by a sample of Ray Charles himself. Kanye is showing off here. It would be a major coup for most producers to get Rihanna to sing on a track, and it would be an equally major coup to be able to license a Nina Simone sample, not to mention requiring the chutzpah to even want to sample such a sacred and iconic figure. Few people besides Kanye could afford to use both Rihanna and Nina Simone singing the same hook, and no one else would dare. I don’t think it’s just a conspicuous show of industry clout, either; Kanye wants you to feel the contrast between Rihanna’s heavily processed purr and Nina Simone’s stark, preacherly tone.

Here’s a diagram of all the samples and samples of samples in “Famous.”

In this one track, we have a dense interplay of rhythms, harmonies, timbres, vocal styles, and intertextual meaning, not to mention the complexities of cultural context. This is why hip-hop is interesting.

You probably have a good intuitive idea of what hip-hop is, but there’s plenty of confusion around the boundaries. What are the elements necessary for music to be hip-hop? Does it need to include rapping over a beat? When blues, rock, or R&B singers rap, should we retroactively consider that to be hip-hop? What about spoken-word poetry? Does hip-hop need to include rapping at all? Do singers like Mary J. Blige and Aaliyah qualify as hip-hop? Is Run-DMC’s version of “Walk This Way” by Aerosmith hip-hop or rock? Is “Love Lockdown” by Kanye West hip-hop or electronic pop? Do the rap sections of “Rapture” by Blondie or “Shake It Off” by Taylor Swift count as hip-hop?

If a single person can be said to have laid the groundwork for hip-hop, it’s James Brown. His black pride, sharp style, swagger, and blunt directness prefigure the rapper persona, and his records are a bottomless source of classic beats and samples. The HBO James Brown documentary is a must-watch.

Wikipedia lists hip-hop’s origins as including funk, disco,
electronic music, dub, R&B, reggae, dancehall, rock, jazz, toasting, performance poetry, spoken word, signifyin’, The Dozens, griots, scat singing, and talking blues. People use the terms hip-hop and rap interchangeably, but hip-hop and rap are not the same thing. The former is a genre; the latter is a technique. Rap long predates hip-hop–you can hear it in classicalrock, R&B, swingjazz fusion, soul, funkcountry, and especially blues, especially especially the subgenre of talking blues. Meanwhile, it’s possible to have hip-hop without rap. Nearly all current pop and R&B are outgrowths of hip-hop. Turntablists and controllerists have turned hip-hop into a virtuoso instrumental music.

It’s sometimes said that rock is European harmony combined with African rhythm. Rock began as dance music, and rhythm continues to be its most important component. This is even more true of hip-hop, where harmony is minimal and sometimes completely absent. More than any other music of the African diaspora, hip-hop is a delivery system for beats. These beats have undergone some evolution over time. Early hip-hop was built on funk, the product of what I call The Great Cut-Time Shift, as the underlying pulse of black music shifted from eighth notes to sixteenth notes. Current hip-hop is driving a Second Great Cut-Time Shift, as the average tempo slows and the pulse moves to thirty-second notes.

Like all other African-American vernacular music, hip-hop uses extensive syncopation, most commonly in the form of a backbeat. You can hear the blues musician Taj Mahal teach a German audience how to clap on the backbeat. (“Schvartze” is German for “black.”) Hip-hop has also absorbed a lot of Afro-Cuban rhythms, like the omnipresent son clave. This traditional Afro-Cuban rhythm is everywhere in hip-hop: in the drums, of course, but also in the rhythms of bass, keyboards, horns, vocals, and everywhere else. You can hear son clave in the snare drum part in “WTF” by Missy Elliott.

The NYU Music Experience Design Lab created the Groove Pizza app to help you visualize and interact with rhythms like the ones in hip-hop beats. You can use it to explore classic beats or more contemporary trap beats. Hip-hop beats come from three main sources: drum machines, samples, or (least commonly) live drummers.

Hip-hop was a DJ medium before emcees became the main focus. Party DJs in the disco era looped the funkiest, most rhythm-intensive sections of the records they were playing, and sometimes improvised toasts on top. Sampling and manipulating recordings has become effortless in the computer age, but doing it with vinyl records requires considerable technical skill. In the movie Wild Style, you can see Grandmaster Flash beat juggle and scratch “God Make Me Funky” by the Headhunters and “Take Me To The Mardi Gras” by Bob James (though the latter song had to be edited out of the movie for legal reasons.)

The creative process of making a modern pop recording is very different from composing on paper or performing live. Hip-hop is an art form about tracks, and the creativity is only partially in the songs and the performances. A major part of the art form is the creation of sound itself. It’s the timbre and space that makes the best tracks come alive as much as any of the “musical” components. The recording studio gives you control over the finest nuances of the music that live performers can only dream of. Most of the music consists of synths and samples that are far removed from a “live performance.” The digital studio erases the distinction between composition, improvisation, performance, recording and mixing. The best popular musicians are the ones most skilled at “playing the studio.”

Hip-hop has drawn much inspiration from the studio techniques of dub producers, who perform mixes of pre-existing multitrack tape recordings by literally playing the mixing desk. When you watch The Scientist mix Ted Sirota’s “Heavyweight Dub,” you can see him shaping the track by turning different instruments up and down and by turning the echo effect on and off. Like dub, hip-hop is usually created from scratch in the studio. Brian Eno describes the studio as a compositional tool, and hip-hop producers would agree.

Aside from the human voice, the most characteristic sounds in hip-hop are the synthesizer, the drum machine, the turntable, and the sampler. The skills needed by a hip-hop producer are quite different from the ones involved in playing traditional instruments or recording on tape. Rock musicians and fans are quick to judge electronic musicians like hip-hop producers for not being “real musicians” because sequencing electronic instruments appears to be easier to learn than guitar or drums. Is there something lazy or dishonest about hip-hop production techniques? Is the guitar more of a “real” instrument than the sampler or computer? Are the Roots “better” musicians because they incorporate instruments?

Maybe we discount the creative prowess of hip-hop producers because we’re unfamiliar with their workflow. Fortunately, there’s a growing body of YouTube videos that document various aspects of the process:

Before affordable digital samplers became available in the late 1980s, early hip-hop DJs and producers did most of their audio manipulation with turntables. Record scratching  demands considerable skill and practice, and it has evolved into a virtuoso form analogous to bebop saxophone or metal guitar shredding.

Hip-hop is built on a foundation of existing recordings, repurposed and recombined. Samples might be individual drum hits, or entire songs. Even hip-hop tracks without samples very often started with them; producers often replace copyrighted material with soundalike “original” beats and instrumental performances for legal reasons. Turntables and samplers make it possible to perform recordings like instruments.

The Amen break, a six-second drum solo, is one of the most important samples of all time. It’s been used in uncountably many hip-hop songs, and is the basis for entire subgenres of electronic music. Ali Jamieson gives an in-depth exploration of the Amen.

There are few artistic acts more controversial than sampling. Is it a way to enter into a conversation with other artists? An act of liberation against the forces of corporatized mass culture? A form of civil disobedience against a stifling copyright regime? Or is it a bunch of lazy hacks stealing ideas, profiting off other musicians’ hard work, and devaluing the concept of originality? Should artists be able to control what happens to their work? Is complete originality desirable, or even possible?

We look to hip-hop to tell us the truth, to be real, to speak to feelings that normally go unspoken. At the same time, we expect rappers to be larger than life, to sound impossibly good at all times, and to live out a fantasy life. And many of our favorite artists deliberately alter their appearance, race, gender, nationality, and even species. To make matters more complicated, we mostly experience hip-hop through recordings and videos, where artificiality is the nature of the medium. How important is authenticity in this music? To what extent is it even possible?

The “realness” debate in hip-hop reached its apogee with the controversy over Auto-Tune. Studio engineers have been using computer software to correct singers’ pitch since the early 1990s, but the practice only became widely known when T-Pain overtly used exaggerated Auto-Tune as a vocal effect rather than a corrective. The “T-Pain effect” makes it impossible to sing a wrong note, though at the expense of making the singer sound like a robot from the future. Is this the death of singing as an art form? Is it cheating to rely on software like this? Does it bother you that Kanye West can have hits as a singer when he can barely carry a tune? Does it make a difference to learn that T-Pain has flawless pitch when he turns off the Auto-Tune?

Hip-hop is inseparable from its social, racial and political environment. For example, you can’t understand eighties hip-hop without understanding New York City in the pre-Giuliani era. Eric B and Rakim capture it perfectly in the video for “I Ain’t No Joke.”

Given that hip-hop is the voice of the most marginalized people in America and the world, why is it so compelling to everyone else? Timothy Brennan argues that the musical African diaspora of which hip-hop is a part helps us resist imperialism through secular devotion. Brennan thinks that America’s love of African musical practice is related to an interest in African spiritual practice. We’re unconsciously drawn to the musical expression of African spirituality as a way of resisting oppressive industrial capitalism and Western hegemony. It isn’t just the defiant stance of the lyrics that’s doing the resisting. The beats and sounds themselves are doing the major emotional work, restructuring our sense of time, imposing a different grid system onto our experience. I would say that makes for some pretty interesting music.