Speaker Clustering Results
The following XML shows a single record produced by speaker clustering.
<output>
<record>
<timestamp>
...
</timestamp>
<trackname>SpeakerClustering.Result</trackname>
<ClusterSpeechResult>
<id>8fb22011-d041-4694-a925-5bf87df5d6b7</id>
<label>Cluster_3</label>
</ClusterSpeechResult>
</record>
</output>
The record contains the following information:
- The
idelement provides a unique identifier for the section of audio. -
The
labelelement describes who was speaking.The first speaker is named
Cluster_1, the second is namedCluster_2, and so on. If you know the maximum number of speakers in your audio, OpenText recommends setting the parameter MaxSpeakers.This element can also have a value of
<SIL>or<s>, which indicates a period of audio without speech, such as silence or background noise.<SIL>indicates silence that probably has no linguistic role.<s>is more likely to end a chain of words, for example when a speaker begins a new sentence.