Product

Updated on 5 June 2025

AI Inference Optimization Framework

Quentin Sinig

Head of Go-to-Market at Pruna AI

Paris, France

About

Pruna is an inference optimization framework built for AI developers that helps you deliver faster, more efficient models with minimal overhead. The open-source package offers two main features: optimization algorithms that can be combined (including caching, quantization, pruning, distillation, and compilation techniques), and evaluation metrics that help you understand how compression affects your models across different dimensions, from output quality to resource requirements.

Organisation

Pruna AI

Company

Paris, France