
Totally off accuracies for anomaly detection with quantized I/O model #140

Closed

i3abghany opened this issue Jul 23, 2023 · 3 comments

@i3abghany

Hello,

I am trying to run inference for the Anomaly Detection benchmark against the model with weights, activations, inputs, and outputs quantized. I am getting totally off results for the average AUC.

I changed nothing except the input handling before inference, since the data has to be quantized (scaled and converted to np.int8, just like in the other benchmarks). Here's the code for that:

import numpy
import tensorflow as tf

def run_inference(model_path, data):
    interpreter = tf.lite.Interpreter(model_path=model_path)
    interpreter.allocate_tensors()

    input_details = interpreter.get_input_details()
    input_scale, zero_point = input_details[0]['quantization']
    # Quantize the float input, just like the other benchmarks
    input_data = numpy.array(data / input_scale + zero_point, dtype=numpy.int8)

    output_details = interpreter.get_output_details()
    output_data = numpy.empty_like(data)

    # Run the model one sample at a time
    for i in range(input_data.shape[0]):
        interpreter.set_tensor(input_details[0]['index'], input_data[i:i+1, :])
        interpreter.invoke()
        output_data[i:i+1, :] = interpreter.get_tensor(output_details[0]['index'])

    return output_data

The data parameter comes from the untouched inference code in 03_tflite_test.py and model_path is trained_models/model_ToyCar_quant_fullint_micro_intio.tflite.

The average AUC is 0.5564.

Exactly the same code (without quantizing the input data) works for the trained_models/model_ToyCar_quant_fullint_micro.tflite model.

I tried to scale the input representative dataset using the following code in the conversion script:

def representative_dataset_gen():
    for sample in train_data[::5]:
        sample = numpy.expand_dims(sample.astype(numpy.float32), axis=0)
        sample = sample / numpy.max(numpy.abs(sample), axis=0)
        yield [sample]

However, this makes the average AUC even worse: 0.4605.

Any hints would be appreciated,
Thanks

@i3abghany (Author)

PS: I am aware of this issue: #110. The author seems to have had a similar problem, but there is no solution on the issue page.

@nemcekova

Hello,

I managed to get results with the int-I/O model similar to those of the float-I/O model.

How I did it:

  • Scale the input data and cast it to int8, as you did.
  • "Dequantize" the output of the model:

    scale, zp = output_details[0]['quantization']
    out = output_data.astype(numpy.float32)
    out = scale * (out - zp)

  • Compute the errors variable from the input data in float (before quantization) and the output data after dequantization.

I have average AUC 0.8408.

Hope this helps. :)
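The quantize/dequantize round trip described above can be sketched as a self-contained example. Note that quantize and dequantize are hypothetical helper names, and the scale/zero point here are made up for illustration; in the benchmark they come from input_details[0]['quantization'] and output_details[0]['quantization'] respectively, and the reconstruction error should be computed between the float input and the dequantized output:

```python
import numpy

def quantize(data, scale, zero_point):
    # Float -> int8, as done before feeding the int-I/O model
    q = numpy.round(data / scale + zero_point)
    return numpy.clip(q, -128, 127).astype(numpy.int8)

def dequantize(q, scale, zero_point):
    # int8 model output -> float, as suggested above
    return scale * (q.astype(numpy.float32) - zero_point)

# Round trip with made-up quantization parameters
scale, zp = 0.05, -3
x = numpy.array([0.1, -0.4, 1.2], dtype=numpy.float32)
x_hat = dequantize(quantize(x, scale, zp), scale, zp)

# Quantization error is bounded by roughly half a step (scale / 2)
print(numpy.max(numpy.abs(x - x_hat)))
```

Without the dequantization step, the raw int8 outputs are compared against float inputs directly, which makes the reconstruction error (and hence the AUC) meaningless.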

@cskiraly (Contributor)

@i3abghany did you manage to solve the issue based on the suggestions above? If so, I would close the issue.
