This class represents audio event - object describing audio event detection result (segment) and containing multiple Tensor objects (inference results) attached by multiple models. For example, it can be audio event with detected speech and converts speech to text. It can be produced by a pipeline with gvaaudiodetect with detection model and gvaspeechtotext element with speechtotext model. Such AudioEvent will have start and end timestamps filled and will have 2 Tensor objects attached - 1 Tensor object with detection result and other with speech to text tensor objectresult.
More...
|
Segment< gulong > | segment () const |
| Get Segment of AudioEvent as start and end timestamps, timestamps are presentation time. More...
|
|
std::string | label () const |
| Get AudioEvent label. More...
|
|
double | confidence () const |
| Get AudioEvent detection confidence (set by gvaaudiodetect) More...
|
|
std::vector< Tensor > | tensors () const |
| Get all Tensor instances added to this AudioEvent. More...
|
|
Tensor | add_tensor (const std::string &name) |
| Add new tensor (inference result) to this AudioEvent with name set. To add detection tensor, set name to "detection". More...
|
|
Tensor | detection () |
| Returns detection Tensor, last added to this AudioEvent. As any other Tensor, returned detection Tensor can contain arbitrary information. If you use AudioEvent based on GstGVAAudioEventMeta attached by gvaaudiodetect by default, then this Tensor will contain "label_id", "confidence", "start_timestamp", "end_timestamp" fields. If AudioEvent doesn't have detection Tensor, it will be created in-place. More...
|
|
int | label_id () const |
| Get label_id from detection Tensor, last added to this AudioEvent. More...
|
|
| AudioEvent (GstGVAAudioEventMeta *meta) |
| Construct AudioEvent instance from GstGVAAudioEventMeta. After this, AudioEvent will obtain all tensors (detection & inference results) from GstGVAAudioEventMeta. More...
|
|
void | set_label (std::string label) |
| Set AudioEvent label. More...
|
|
GstGVAAudioEventMeta * | _meta () const |
| Internal function, don't use or use with caution. More...
|
|
This class represents audio event - object describing audio event detection result (segment) and containing multiple Tensor objects (inference results) attached by multiple models. For example, it can be audio event with detected speech and converts speech to text. It can be produced by a pipeline with gvaaudiodetect with detection model and gvaspeechtotext element with speechtotext model. Such AudioEvent will have start and end timestamps filled and will have 2 Tensor objects attached - 1 Tensor object with detection result and other with speech to text tensor objectresult.