Layers: Layer typology
The layers in APLS can be categorized along some important properties, in terms of how they appear, how theyâre generated, how they can be searched, and other properties. This page outlines those categories.
On this page
Scope
Not all annotations in a transcript are equally âwideâ (in duration). Some layers contain annotations that pertain to individual words (e.g., part_of_speech), others to individual speech sounds (e.g., segment), and others to multiple words (e.g., redaction). This distinction is captured by layer scope.
Layers can have one of four possible scopes, defined by how long (in time) they can span. From longest to shortest, these are:
Scope | Meaning | Notes |
---|---|---|
Span | Annotations can span beyond an individual turn of talk, up to the length of the entire transcript | Â |
Phrase | Annotations can span beyond an individual word, up to the length of an entire turn | Â |
Word | Annotations usually span the length of a word | Some word layers (like syllables) have annotations that span just part of a word. These annotations are called horizontal peers, discussed below |
Segment | Annotations span the length of a speech sound | Â |
What youâll see in APLS
- Individual transcript pages
- You can usually tell what each layerâs scope is by just glancing at the transcript. For example:
- The noise annotations arenât bounded by a single line, so noise is clearly a span layer. (Note that transcripts always display the noise and comment layers.)
- Both overlap and speech_rate have annotations that span multiple words, so they are clearly phrase layers. You can tell where these annotations start and end because of the colored arcs above each line.
- Each foll_pause annotation is lined up above each word annotation, so foll_pause is clearly a word layer
- Segment layers like segment arenât quite as obvious as the other scopes, but you can tell them apart because theyâre plain text rather than links, and theyâre always underneath word
- Finally, note that layers are always in scope order from longest to shortest
- You can usually tell what each layerâs scope is by just glancing at the transcript. For example:
- Search
- In the layered search matrix, layers are organized into columns by scope:
Only a few layers appear when you first load the search page, but you can select projects to show more (see below).
- In the layered search matrix, layers are organized into columns by scope:
- Exporting data
- Layers are organized into columns by scope
- Depending on how youâre exporting data, there might be additional columns. For example, when exporting a formatted transcript, you can also export participant and transcript attributes:
Layers by scope
Span | Phrase | Word | Segment |
---|---|---|---|
|
|
|
|
Alignment and horizontal peers
Some annotations take up just part of their scope, and some annotations take up their entire scope. For example, syllables annotations can take up just part of their corresponding word (if the word is multisyllabic); by contrast, each word has just one frequency_in_corpus annotation. When annotations take up just part of their scope, we say they have horizontal peers: âpeersâ because they share a scope with other annotations, âhorizontalâ because we often conceptualize time on an x-axis (i.e., from left to right). This distinction is captured by layer alignment.
Layers can have one of three possible alignments:
Alignment | Symbol* | Meaning | Notes |
---|---|---|---|
Complete interval | Annotations always span their entire scope | Â | |
Sub-interval | Annotations can span part of their scope, or their entire scope Annotations that share their scope with other annotations are called horizontal peers | There may or may not be gaps between annotations within a scope. For example, there are usually long gaps between comment annotations within a transcript | |
Timepoint | Annotations donât have a start and end time, just a time | There arenât currently any timepoint layers in APLS. If we wanted to store vowel measurements in a layer, then we might do so in a timepoint layer (i.e., at the vowelâs midpoint) |
* Used on individual transcript pages and the search page
What youâll see in APLS
- Individual transcript pages
- In the layer selector, the symbols
denote alignments
- When multiple annotations share a word, they crowd into the space above the word (with spaces separating labels). For example:
- Most words in this line have one part_of_speech annotation, but donât and itâs each have two (nât is an adverb, and âs is a present-tense 3rd-person-singular verb)
- Most words in this line are monosyllabic and just have one syllables annotation, but Family has three syllables annotations and Dollar has two.
- In the layer selector, the symbols
- Search
- In the layer selector, the symbols
denote alignments
- [Like with cross-scope search, you can do anchoring] [But you canât do e.g. multiple syllables within a word]
- In the layer selector, the symbols
- Exporting data
Layers by alignment
Since alignment is relative to the layerâs scope, these are broken down by scope too:
Scope | Alignment | |
---|---|---|
Complete interval | Sub-interval | |
Span | |
|
Phrase |
|
|
Word |
|
|
Segment |
|
|
There arenât currently any timepoint layers in APLS.
Vertical peers
Some layers allow vertical peers: 2 or more annotations that occupy an identical timespan. For example, the dictionary_phonemes layer represents all possible phonemic representations of a word, while the phonemes layer represents the speech sounds actually in a word. Since the word the can be pronounced /Ă°i/ or /Ă°É/, the has two annotations on the dictionary_phonemes layerâbut since only one of these is how the word was actually pronounced, the only has one phonemes annotation.
Unlike horizontal peers, which divide the timespan of their scope, we can think of vertical peers as being âstackedâ on top of one another within the same timespan.
Allow vertical peers? | Symbol* | Meaning |
---|---|---|
True | A single timespan may contain multiple annotations | |
False | (none) | A single timespan can contain at most one annotation |
* Used on individual transcript pages
What youâll see in APLS
- Individual transcript pages
- In the layer selector, the symbol
denotes layers that allow vertical peers
- Only one vertical peer is visible on the transcript page, whichever is first in alphabetical order
- Example: The dictionary_phonemes layer only shows
Ă°É
for the word the, even though the has two dictionary_phonemes annotations: - To check whether an annotation has vertical peers, export the line to TextGrid instead
- Example: The dictionary_phonemes layer only shows
- In the layer selector, the symbol
- Transcripts or fragments exported to Praat TextGrids
- Normally, TextGrid transcripts/fragments show one tier per layer per speaker. However, if a layer allows vertical peers, then that layer can have multiple tiers per speaker.
- Example: The word times has two lemma annotations, time and times,1 so this TextGrid shows two lemma tiers for CB17:
- Example: The word times has two lemma annotations, time and times,1 so this TextGrid shows two lemma tiers for CB17:
- If a layer allows multiple peers, but the exported transcript/fragment doesnât actually have vertical peers on that layer, then that layer will only have one tier.
- Example: None of the words in this fragment have multiple lemma annotations, so this TextGrid shows just one lemma tier for CB17:
- Example: None of the words in this fragment have multiple lemma annotations, so this TextGrid shows just one lemma tier for CB17:
- An exported TextGrid will only show as many tiers per layer as there are vertical peers on that layer in the exported transcript/fragment. As a result, longer transcripts/fragments tend to have more tiers.
- Example: When the entire transcript for CB17interview3.eaf is exported to TextGrid, there are three lemma tiers for CB17 for the whole TextGrid, even though there is only one word in the transcript with three lemma annotations
- Example: When the entire transcript for CB17interview3.eaf is exported to TextGrid, there are three lemma tiers for CB17 for the whole TextGrid, even though there is only one word in the transcript with three lemma annotations
- Normally, TextGrid transcripts/fragments show one tier per layer per speaker. However, if a layer allows vertical peers, then that layer can have multiple tiers per speaker.
- Search
- Searching a layer that allows vertical peers will match any of the vertical peers for a given word
- Example: All tokens of the have two dictionary_phonemes annotations:
D@
andDi
. Thus, a search forthe
on orthography andD@
on dictionary_phonemes returns the same set of matches asthe
on orthography andDi
on dictionary_phonemes - As a result, âdoesnât matchâ searches on a layer that allows vertical peers can return false positives
- Example: Searching doesnât match
D@
on dictionary_phonemes will still return all the tokens, because theDi
vertical peer doesnât matchD@
- Example: Searching doesnât match
- Example: All tokens of the have two dictionary_phonemes annotations:
- Searching a layer that allows vertical peers will match any of the vertical peers for a given word
- Exporting data
Layers by vertical peers
True | False |
---|---|
|
|
Data type
Different layers contain different kinds of annotations. For example, foll_segment annotations are speech sounds, while foll_pause annotations are numbers. This distinction is captured by a layerâs data type:
Data type | Symbol* | Meaning | Notes |
---|---|---|---|
Phonological layers | ![]() | Annotations are labeled with speech sounds | Some phonological layersâ annotations are individual sounds (like segment); othersâ are sequences of sounds (like syllables). |
Numeric layers | ![]() | Annotations are labeled with a measurement | Some numeric layersâ annotations are counts (like frequency_in_corpus); othersâ are decimal numbers (like foll_pause). |
Text layers | ![]() | Annotations are labeled with text | Most text layersâ annotations are normal English spelling (like word) or lowercase English spelling (like orthography) |
Timing-only layers | (none) | Annotations donât have a label, only a start/end time | The two timing-only layers represent temporal subdivisions of a transcript: turn and utterance |
* Used on individual transcript pages
What youâll see in APLS
This distinction is mostly important for search.
- Individual transcript pages
- In the layer selector, alignment is denoted by the symbols in the previous table
- Search
Layers by data type
Phonological | Numeric | Text | Timing-only |
---|---|---|---|
|
|
|
|
Notation system
While data type describes the kinds of annotation that different layers contain, notation systems are what those annotations actually look like. In other words, notation systems are the details of how layers represent their data. For example, the phonemes layer represents speech sounds as symbols in the DISC phonemic alphabet, such as fIS
for the word fish.
Almost all layers have a primary notation system. (The exceptions are the timing-only layers, turn and utterance, since their annotations donât have labelsâsee above.) Some layers have additional notation, depending on what their annotations need to represent. For example, the syllables layer uses the DISC phonemic alphabet for speech sounds plus stress markers for stress, such as 'fIS
for the word fish.
Here are brief descriptions of primary notation systems, with links to more details on the notation systems page if applicable:
Data type | Primary notation | Description |
---|---|---|
Phonological | DISC | DISC phonemic alphabet |
Phonological | Stress markers | ' (primary stress)" (secondary stress)0 (unstressed) |
Numeric | Count | Positive whole numbers |
Numeric | Decimal | Decimal numbers |
Text | Boolean | True or False |
Text | English spelling | Â |
Text | English spelling (lowercase) | Â |
Text | Treebank part-of-speech tags | Â |
Here are additional notations:
Data type | Additional notation | Brief description |
---|---|---|
Phonological | DISC pause | . |
Phonological | Stress markers | ' (primary stress)" (secondary stress)0 (unstressed) |
Phonological | Syllable boundary | - |
Text | Hesitation marker | ~ (at the end of an incomplete word) |
Text | Morpheme boundary | + |
Text | Transcription pause/question markers | . (short pause)- (long pause)? (question) |
What youâll see in APLS
- Search
- [More about how users will interact w/ notation systems than what theyâll see]
Layers by notation system
Primary notation system:
Boolean | Count | DISC | Decimal | English spelling | English spelling (lowercase) | Stress markers | Treebank part-of-speech tags |
---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
Additional notation:
DISC pause | Hesitation marker | Morpheme boundary | Stress markers | Syllable boundary | Transcription pause/question markers |
---|---|---|---|---|---|
|
|
|
|
|
|
Alignment dependency
Some layers need information about individual speech sounds: which sounds are in a word, and where these sounds begin and end. For example, a wordâs phonemes annotation is only possible if the word has segment annotations. Other layers (like part_of_speech) donât depend on segment annotations. This distinction is captured by alignment dependency:
Alignment-dependent? | Meaning |
---|---|
True | Annotations are only present if the line contains segment annotationsâin other words, if the line is aligned |
False | Annotations can be present even if the line is not aligned |
A layer can be alignment-dependent even if the segment layer isnât an input to the layer. For example, speech_rate (a phrase layer) takes syllables as input, which takes segment as input; if segment has no annotations in a turn, then syllables wonât have any annotations, which means speech_rate wonât have any annotations.
What youâll see in APLS
- Individual transcript pages
- [Whole turns w/ big chunks of no annotations]
- Search
- [False negatives in search because turn isnât aligned]
Layers by alignment dependency
True | False |
---|---|
|
|
Project
Different layers pertain to different levels of linguistic representation. For example, morphemes annotations pertain to how words are represented in the lexicon, while speech_rate annotations pertain to the timing of a particular line of speech. As a result, APLS categorizes layers into projects:
Project | Meaning | Notes |
---|---|---|
syntax | Annotations pertain to syntactic structure | Â |
lexicon | Annotations pertain to how words are represented in the lexicon | Â |
timing | Annotations pertain to timing/rhythm of speech and speaker turns | Â |
phonology | Annotations pertain to the speech sounds in a word | All of these layers are phonological layers, see data type above |
imported | Annotations were imported from the original transcription | Â |
temp | Annotations are either metadata on how other layers were generated, or their only job is to provide data for other layers | Â |
(none) | Layers whose check-boxes are always shown by default on the search and transcript pages | Â |
What youâll see in APLS
The main reason projects exist is to reduce visual clutter on the search and transcript pages. When the page loads, only a few layers are selectable (those without a project): [IMAGE] Click a layerâs project to make the layer selectable: [IMAGE]
Unlike other layer properties, projects donât affect anything about how layers are represented, how you need to search for them, export them, etc.
Layers by project
(none) | syntax | lexicon | timing | phonology | imported | temp |
---|---|---|---|---|---|---|
|
|
|
|
|
|
|
Layers in the âtempâ project isnât meaningful for end-users, so the other âlayers by propertyâ tables on this page omit these layers.
-
The lemma layer represents all possible base forms of the word. Depending on context, the word times could be the plural of the base form time, or times could be the base form itself (as in âtwo times fourâ), hence the multiple annotations. ↩