Model Info Section#
The OpenVINO™ Intermediate Representation (IR) includes an XML file with description of network topology as well as conversion and runtime metadata.
If “model_proc” file is not present, Intel® Deep Learning Streamer parses “model_info” section located at the end of the XML model file. An example section looks as in the code snippet below:
<rt_info>
...
<model_info>
<iou_threshold value="0.7" />
<labels value="person bicycle ... " />
<model_type value="yolo_v8" />
<pad_value value="114" />
<resize_type value="fit_to_window_letterbox" />
<reverse_input_channels value="True" />
<scale_values value="255" />
</model_info>
</rt_info>
Intel® Deep Learning Streamer supports the following fields in the model info section:
Field |
Type |
Possible values or example |
Description |
Corresponding ‘model-proc’ key |
---|---|---|---|---|
model_type |
string |
label
detection_output
yolo_v8
|
The converter to parse output tensors and map to GStreamer meta data. |
converter |
confidence_threshold |
float |
[0.0, 1.0] |
The confidence level to report inference results, typically depends on training accuracy. |
threshold (command line param) |
iou_threshold |
float |
[ 0.0, 1.0 ] |
Threshold for non-maximum suppression (NMS) intersection over union (IOU) filtering. |
iou_threshold |
multilabel |
boolean |
True
False
|
Classification model predicts a set of labels per input image. |
method=multi |
output_raw_scores |
boolean |
True
False
|
Classification model outputs all non-normalized scores for all detected labels. |
method=softmax |
labels |
string list |
person bicycle … |
List of labels for predicted object classes. |
labels |
resize_type |
string |
crop
standard
fit_to_window
fit_to_window_letterbox
|
Resize method to map input video images to model input tensor. |
resize |
reverse_input_channels |
boolean |
True
False
|
Convert input video image to RGB format (model trained with RGB images) |
color_space=”RGB” |
scale |
float |
255.0 |
Divide input image values by ‘scale’ before mapping to model input tensor (typically used when model was trained with input data normalized in <0,1> range). |
range: [0.0, 1.0] |
Please also refer to Deep Learning Workbench for more information on the “model_info” section.