Where APLS data comes from

On this page
  1. Where APLS audio data comes from
  2. From audio data to APLS

Where APLS audio data comes from

APLS contains audio recordings of one-on-one sociolinguistic interviews from fieldwork conducted in 2003–2005 with native Pittsburghers in four Pittsburgh neighborhoods: the Hill District (abbreviated HD in APLS), Lawrenceville (LV), Forest Hills (FH), and Cranberry Township (CB).1 These interviews typically consisted of several sections (though not all interviews had all of these):

  • A long conversation
    • Sometimes including chatter about “Pittsburghese” and/or African American English
  • 1–2 reading tasks
  • A minimal pairs task

APLS includes just a subset of the audio files from this fieldwork. All interviewees in APLS are natives of the Pittsburgh area, and all interviewees consented to make their data publicly available. In addition, APLS currently contains only files that had just one interviewee.

From audio data to APLS

The audio files in APLS have been transcribed by research assistants according to a specific set of conventions that facilitate analysis in LaBB-CAT. (This takes a ton of time and effort!) After a transcription file is uploaded with an audio file to APLS, APLS generates numerous layers for the transcript, using dictionaries with compositional representations of words (e.g., morphological parsing), machine learning models (e.g., the [Hidden Markov Toolkit][htk] for determining time-alignments of individual speech sounds), and other techniques. Additionally, metadata about the participant and the transcript are uploaded to APLS.

On this page
  1. TOC
  1. In keeping with Pittsburgh parlance, we use neighborhood to encompass geographic areas inside or outside Pittsburgh city limits. Technically, only the Hill District and Lawrenceville are within city limits. Forest Hills is a borough within Allegheny County, and Cranberry Township is a township just outside Allegheny County. In the original fieldwork, these sites were chosen to reflect a distinction (between inner-city, inner-ring suburb, and outer-ring suburb) that shows up in some classic sociolinguistic literature (e.g., Bailey et al. 1993, Eckert 2000)Â