Thumbnail

Sentiment Classification on the IMDB Movie Reviews dataset

TransformerRoBERTaEnd-to-End DevelopmentONXX Runtime OptimizationPerformance Benchmarking

Implementation details:

Optimization:

Used Optimum(Huggingface) to convert model to ONXX and used the ORTOptimizer to perform the following optimizations (Guide):

Benchmark Results:

Model Latency(ms)
Original finetuned model 151.81 ± 5.58
Optimized model 144.87 ± 1.96