Product
Updated on 5 June 2025
AI Inference Optimization Framework
About
Pruna is an inference optimization framework built for AI developers, enabling you to deliver faster, more efficient models with minimal overhead. The open-source package offers two main features: optimization algorithms that can be combined (including caching, quantization, pruning, distillation, and compilation techniques), and evaluation metrics that help you understand how compression affects your models across different dimensions, from output quality to resource requirements.
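To make the trade-off between compression and quality concrete, here is a minimal sketch in plain Python. It is not Pruna's API: the function names (`quantize_int8`, `dequantize`, `evaluate`) are hypothetical, and the example assumes simple symmetric int8 quantization of a float32 weight list, reporting one quality metric (mean squared error) and one resource metric (storage size).

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization of floats to the int8 range [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero tensor, where any scale works.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Map int8 values back to approximate floats."""
    return [q * scale for q in quantized]


def evaluate(weights):
    """Report a quality metric and a resource metric for the compressed weights."""
    quantized, scale = quantize_int8(weights)
    restored = dequantize(quantized, scale)
    mse = sum((a - b) ** 2 for a, b in zip(weights, restored)) / len(weights)
    original_bytes = len(weights) * 4        # assumes float32 storage
    compressed_bytes = len(weights) * 1 + 4  # int8 values plus one float32 scale
    return {
        "mse": mse,
        "original_bytes": original_bytes,
        "compressed_bytes": compressed_bytes,
    }


if __name__ == "__main__":
    report = evaluate([0.5, -1.0, 0.25, 0.0])
    print(report)
```

The same pattern, compress and then measure both output error and footprint, is what an evaluation step would do for a full model, only with task-level metrics and real memory or latency measurements in place of these toy numbers.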