TensorRT inference time

Hi,
I understand that my TensorFlow model should run faster on the Jetson TX2 using TensorRT.
But after converting my TF model to TensorRT, I found that inference is slower with the TensorRT engine: 80 ms instead of 20 ms.

My net:
Input 1 of shape 1x448x576
Input 2 of shape 1x448x576
Output of shape 5x233x297

After converting to UFF, I run this function once:

import numpy as np
import pycuda.autoinit
import pycuda.driver as cuda

def preprare_inference(self, channel_size, height, width, batch_size):
    # Allocate page-locked host memory for the output
    self.output = cuda.pagelocked_empty(5 * 233 * 297, dtype=np.float32)
    # Allocate device memory (float32 -> 4 bytes per element)
    self.d_input1 = cuda.mem_alloc(1 * 448 * 576 * 4)
    self.d_input2 = cuda.mem_alloc(1 * 448 * 576 * 4)
    self.d_output = cuda.mem_alloc(1 * 5 * 233 * 297 * 4)

    self.stream = cuda.Stream()
    self.bindings = [int(self.d_input1), int(self.d_input2), int(self.d_output)]
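
Side note: memcpy_htod_async is only fully asynchronous when the source host buffer is page-locked. A minimal sketch of page-locked staging buffers for the two inputs as well (the h_input1 / h_input2 names are hypothetical, not from the original code):

    # Hypothetical addition inside preprare_inference: page-locked host staging
    # buffers, so the async host-to-device copies do not fall back to synchronous
    # copies from pageable numpy memory.
    self.h_input1 = cuda.pagelocked_empty(1 * 448 * 576, dtype=np.float32)
    self.h_input2 = cuda.pagelocked_empty(1 * 448 * 576, dtype=np.float32)

In do_infer the inputs would then first be copied into these buffers, e.g. np.copyto(self.h_input1, input1.ravel()) followed by cuda.memcpy_htod_async(self.d_input1, self.h_input1, self.stream).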

Inference then runs with the following code:

def do_infer(self, input1, input2):
    input1 = input1.astype(np.float32)
    input2 = input2.astype(np.float32)
    cuda.memcpy_htod_async(self.d_input1, input1, self.stream)
    cuda.memcpy_htod_async(self.d_input2, input2, self.stream)

    # execute the model (batch size 1)
    self.context.enqueue(1, self.bindings, self.stream.handle, None)

    # transfer predictions back and wait for the stream to finish
    cuda.memcpy_dtoh_async(self.output, self.d_output, self.stream)
    self.stream.synchronize()

    return np.reshape(self.output, (5, 233, 297))
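
For reference, a minimal timing sketch around do_infer (the warm-up count, iteration count, random inputs, and the net object name are placeholders, not from the actual measurement):

import time

# Placeholder inputs, only for the sketch
input1 = np.random.rand(1, 448, 576).astype(np.float32)
input2 = np.random.rand(1, 448, 576).astype(np.float32)

# Warm-up runs so one-time CUDA/TensorRT initialization is not included in the timing
for _ in range(10):
    net.do_infer(input1, input2)

# Timed runs; do_infer synchronizes the stream, so wall-clock time is meaningful
n = 100
start = time.time()
for _ in range(n):
    net.do_infer(input1, input2)
print("average inference time: %.1f ms" % ((time.time() - start) / n * 1000.0))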

Can you please help me understand how this is possible?
Thanks

Hello,

Can you share the UFF with us? What versions of TF and TRT are you using?
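
For example, the versions can be printed from Python (assuming both packages expose __version__, which recent releases do):

import tensorflow as tf
import tensorrt as trt

# Print the installed TensorFlow and TensorRT versions
print("TensorFlow: " + tf.__version__)
print("TensorRT: " + trt.__version__)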

thanks