
[museg] music/speech segmentation
The [museg] SDK segments audio input into sections containing music and sections containing speech. It can be used, for example, for broadcast monitoring.
The SDK consists of two parts: a training part and a classification/segmentation part. By training with an individually selected set of files, the algorithm can be adapted to special use cases, which can increase classification performance significantly.
The classification/segmentation system consists of three processing stages:
- Feature Extraction: a set of meaningful and characteristic parameters is extracted from the audio samples in a sliding time window.
- Classification: for each time window, the extracted feature vector is transformed to a lower-dimensional space, and a decision is made as to which class this observation belongs to.
- Segmentation: the classification result is post-processed with a path-finding algorithm that tries to determine the path with the highest overall probability across all time windows of the audio file. In particular, this stage avoids short segments with low probability and thus smooths the overall result.
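The segmentation stage can be illustrated with a small Viterbi-style path search over per-window class probabilities. This is only a minimal sketch under stated assumptions, not the [museg] implementation: the function name `smooth_labels`, the `switch_penalty` parameter, and the example probabilities are all hypothetical, and the SDK's actual path-finding algorithm is not documented on this page.

```python
import math

def smooth_labels(window_probs, switch_penalty=2.0):
    """Return the most probable label path over all time windows.

    window_probs: one list per window, [p(class 0), p(class 1), ...].
    switch_penalty: hypothetical cost (in log units) for changing class
    between neighbouring windows; larger values suppress short segments.
    """
    n_classes = len(window_probs[0])
    eps = 1e-12  # guards against log(0)
    # score[c] = best log-probability of any path ending in class c
    score = [math.log(p + eps) for p in window_probs[0]]
    backptr = []
    for probs in window_probs[1:]:
        new_score, ptr = [], []
        for c in range(n_classes):
            # Staying in the same class is free; switching pays the penalty.
            cands = [score[k] - (0.0 if k == c else switch_penalty)
                     for k in range(n_classes)]
            best = max(range(n_classes), key=lambda k: cands[k])
            ptr.append(best)
            new_score.append(cands[best] + math.log(probs[c] + eps))
        score = new_score
        backptr.append(ptr)
    # Trace the best path backwards from the best final class.
    label = max(range(n_classes), key=lambda c: score[c])
    path = [label]
    for ptr in reversed(backptr):
        label = ptr[label]
        path.append(label)
    path.reverse()
    return path

# Assumed toy input: class 0 = speech, class 1 = music.
# One weakly "music" window sits inside a run of confident "speech" windows.
probs = [[0.9, 0.1]] * 3 + [[0.4, 0.6]] + [[0.9, 0.1]] * 3
print(smooth_labels(probs))
```

A per-window argmax would label the fourth window as music, whereas the path search keeps it in the surrounding speech segment because a switch there costs more than it gains. A sustained change of class still produces a segment boundary.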
The training system contains the same feature extraction stage as the classification system. It then iteratively optimizes the dimensionality-reduction transformation matrix for these features to improve the classification results.
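To make the idea of iteratively optimizing a projection concrete, the sketch below fits a one-dimensional linear projection of the feature vectors by gradient descent on a logistic loss. This is a stand-in technique chosen for illustration, since the actual [museg] training procedure is not described here. The function names, the two-element feature vectors, and the labels are all assumed.

```python
import math

def train_projection(features, labels, steps=500, lr=0.5):
    """Iteratively fit weights w and bias b so that sigmoid(w.x + b)
    approximates the 0/1 label of each feature vector (hypothetical
    stand-in for the SDK's transformation-matrix optimization)."""
    dim = len(features[0])
    n = len(features)
    w, b = [0.0] * dim, 0.0
    for _ in range(steps):
        grad_w, grad_b = [0.0] * dim, 0.0
        for x, y in zip(features, labels):
            z = sum(wi * xi for wi, xi in zip(w, x)) + b
            p = 1.0 / (1.0 + math.exp(-z))
            err = p - y  # gradient of the logistic loss w.r.t. z
            for i in range(dim):
                grad_w[i] += err * x[i]
            grad_b += err
        # One gradient-descent update per pass over the training set.
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def classify(x, w, b):
    """Project x onto w and decide the class from the sign of the result."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if z > 0 else 0

# Assumed toy training set: two features per window, 0 = speech, 1 = music.
train_x = [[0.1, 0.9], [0.2, 0.8], [0.9, 0.1], [0.8, 0.3]]
train_y = [0, 0, 1, 1]
w, b = train_projection(train_x, train_y)
print(classify([0.15, 0.85], w, b), classify([0.85, 0.2], w, b))
```

The point of the sketch is the structure: the same features are extracted for training and classification, and only the projection is learned from the chosen training files.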
references
The [museg] segmentation engine is successfully used by:
- SoundAware MusicTracker
pricing & licensing
Please visit our licensing page for more information. We are happy to discuss license fees with you directly.