Product
Updated on 5 June 2025
AI Inference Optimization Framework
About
Pruna is an open-source inference optimization framework for AI developers that helps you deliver faster, more efficient models with minimal overhead. The package offers two main features: optimization algorithms that can be combined (including caching, quantization, pruning, distillation, and compilation techniques), and evaluation metrics that show how compression affects your models across different dimensions, from output quality to resource requirements.
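To make the pairing of an optimization step with an evaluation step concrete, here is a minimal, self-contained sketch in plain Python. It does not use Pruna's API: the names `quantize_int8`, `dequantize`, `evaluate`, and `QuantizedWeights` are hypothetical, and symmetric int8 quantization of a small weight list stands in for the much richer algorithms a real framework would apply to a full model.

```python
# Illustrative sketch only: NOT Pruna's API. All names here are hypothetical.
from dataclasses import dataclass


@dataclass
class QuantizedWeights:
    values: list[int]  # int8 codes in [-127, 127]
    scale: float       # multiply a code by this to recover an approximate float


def quantize_int8(weights: list[float]) -> QuantizedWeights:
    """Symmetric per-tensor quantization: map floats onto int8 codes."""
    max_abs = max((abs(w) for w in weights), default=0.0)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return QuantizedWeights(codes, scale)


def dequantize(q: QuantizedWeights) -> list[float]:
    """Recover approximate float weights from the int8 codes."""
    return [code * q.scale for code in q.values]


def evaluate(weights: list[float], q: QuantizedWeights) -> dict:
    """Report a quality metric (error) next to a resource metric (bytes)."""
    restored = dequantize(q)
    max_err = max((abs(a - b) for a, b in zip(weights, restored)), default=0.0)
    return {
        "max_abs_error": max_err,
        "bytes_fp32": 4 * len(weights),   # 4 bytes per float32 weight
        "bytes_int8": len(weights) + 4,   # 1 byte per code plus one float32 scale
    }


weights = [0.5, -1.27, 0.03, 1.0]
q = quantize_int8(weights)
report = evaluate(weights, q)
print(report)
```

The report puts both sides of the trade-off in one place: how much accuracy the compression costs and how much memory it saves.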
Organisation
Similar opportunities
Expertise
Neuromorphic computing and Edge AI
Marcel van Gerven
Professor of Artificial Intelligence at Radboud University
Nijmegen, Netherlands
Service
"AI Efficiency Fundamentals" Training
Quentin Sinig
Head of Go-to-Market at Pruna AI
Paris, France