Pruna AI Unveils Open-Source AI Model Optimization Framework

April 10, 2025

Pruna AI, a European startup focused on developing compression algorithms for AI models, is set to release its optimization framework as open source this Thursday. The framework applies several efficiency techniques, including caching, pruning, quantization, and distillation, to a given AI model.

John Rachwan, co-founder and CTO of Pruna AI, explained to TechCrunch that their framework not only applies these methods but also standardizes the process of saving, loading, and evaluating compressed models. This allows users to assess any potential quality loss and the performance improvements achieved through compression.

Rachwan likened Pruna AI's role to that of Hugging Face, which standardized the use of transformers and diffusers. "We are doing the same, but for efficiency methods," he stated, emphasizing the standardization of how these methods are applied and managed.

Major AI labs have already adopted similar compression techniques. For example, OpenAI has relied on distillation to build faster versions of its flagship models, which is likely how it developed GPT-4 Turbo. Similarly, Black Forest Labs created Flux.1-schnell, a distilled version of its Flux.1 model. Distillation uses a "teacher-student" approach: a larger model's outputs are used to train a smaller, more efficient model.
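The teacher-student idea can be made concrete with a small sketch. The code below is not taken from Pruna AI, OpenAI, or Black Forest Labs; it shows one common formulation, in which the student is trained to match the teacher's temperature-softened output distribution by minimizing a KL divergence. The function names and the default temperature are illustrative assumptions.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert raw scores to probabilities; a higher temperature flattens them."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) between temperature-softened distributions.

    Illustrative only: real training pipelines usually combine this term
    with a standard loss on ground-truth labels.
    """
    p = softmax(teacher_logits, temperature)  # soft targets from the teacher
    q = softmax(student_logits, temperature)  # the student's current guess
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
```

The loss is zero when the student reproduces the teacher's distribution exactly and grows as the two diverge, which is what lets a smaller model be trained toward the behavior of a larger one.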

Rachwan pointed out that while large companies often develop these tools internally, the open-source community typically focuses on single methods. "But you cannot find a tool that aggregates all of them, makes them all easy to use and combine together," he said, highlighting Pruna AI's unique value proposition.

Left to right: Rayan Nait Mazi, Bertrand Charpentier, John Rachwan, Stephan Günnemann. Image Credits: Pruna AI

Although Pruna AI's framework supports a wide range of models, including large language models, diffusion models, speech-to-text models, and computer vision models, the company is currently focusing on image and video generation models. Existing users of Pruna AI include Scenario and PhotoRoom.

In addition to the open-source version, Pruna AI offers an enterprise edition with advanced optimization features, including an upcoming compression agent. Rachwan described this agent as a tool that automatically finds the best compression combination for a model based on user-specified performance and accuracy requirements.
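How such an agent works internally has not been described, so the following is only a rough sketch of the general idea: given measurements for several compression combinations, keep those that satisfy the user's latency and accuracy limits and return the fastest. The `Candidate` class, the `pick_best` function, and every number in `TOY_CANDIDATES` are hypothetical placeholders, not Pruna AI's API or real benchmark results.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass(frozen=True)
class Candidate:
    """One compression combination with its (hypothetical) measured metrics."""
    methods: Tuple[str, ...]
    latency_ms: float
    accuracy: float


def pick_best(candidates: Iterable[Candidate],
              max_latency_ms: float,
              min_accuracy: float) -> Optional[Candidate]:
    """Return the fastest candidate that meets both requirements, or None."""
    feasible = [c for c in candidates
                if c.latency_ms <= max_latency_ms and c.accuracy >= min_accuracy]
    if not feasible:
        return None
    return min(feasible, key=lambda c: c.latency_ms)


# Made-up values for illustration only; these are not measurements.
TOY_CANDIDATES = [
    Candidate((), 100.0, 0.90),
    Candidate(("quantization",), 60.0, 0.89),
    Candidate(("pruning",), 70.0, 0.88),
    Candidate(("quantization", "pruning"), 40.0, 0.85),
    Candidate(("quantization", "caching"), 45.0, 0.89),
]

best = pick_best(TOY_CANDIDATES, max_latency_ms=80.0, min_accuracy=0.88)
```

With these placeholder numbers, the combination of quantization and pruning is the fastest overall but falls below the accuracy floor, so the search settles on quantization plus caching. A real agent would also have to produce the measurements itself, which is the expensive part this sketch leaves out.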

Pruna AI's pro version is billed by the hour, similar to renting a GPU on cloud services like AWS. By optimizing models, users can significantly reduce inference costs. For instance, Pruna AI managed to compress a Llama model to one-eighth its original size with minimal quality loss, demonstrating the potential cost savings.

The company recently secured a $6.5 million seed funding round from investors including EQT Ventures, Daphni, Motier Ventures, and Kima Ventures. Pruna AI pitches its compression framework as an investment that pays for itself through lower inference costs.

Comments (30)
PaulRoberts April 25, 2025 at 12:04:39 AM EDT

Pruna AI's open-source framework is a blessing for us DIY AI enthusiasts! It's like having a Swiss Army knife for optimizing models. I managed to shrink my models without losing much accuracy, which is incredible. The only problem? The documentation could be more detailed. Still, I can't wait to see what else they release! 🚀

DouglasMitchell April 24, 2025 at 1:25:23 PM EDT

Pruna AI's open-source framework is a gift for us DIY AI enthusiasts. It's like having a Swiss Army knife for optimizing models! I've been able to shrink my models without losing much accuracy, which is great. The only drawback is that the documentation could be more thorough. Even so, I can't wait to see what else they put out! 🚀

WillieMartinez April 19, 2025 at 9:20:47 PM EDT

Pruna AI's open-source framework sounds promising, but the setup was a bit of a headache. Once I got it running, the optimization really sped up my models. Just wish the documentation was clearer. Still, it's a solid tool for anyone looking to optimize AI models! 🤓

JamesLopez April 18, 2025 at 6:46:00 PM EDT

Pruna AI's open-source framework is a godsend for us DIY AI enthusiasts! It's like having a Swiss Army knife for optimizing models. I've been able to shrink my models without losing much accuracy, which is just awesome. The only hiccup? The documentation could use a bit more love. Still, can't wait to see what else they roll out! 🚀

CharlesNelson April 18, 2025 at 3:07:22 PM EDT

Pruna AI's open-source framework sounds promising, but I'm not a tech whiz, so I'm a bit lost. The idea of optimizing AI models is cool, but I wish they had more user-friendly tutorials. Maybe they'll release something simpler soon? 🤔🧠

JerryMoore April 17, 2025 at 5:56:48 AM EDT

Pruna AI's open-source framework is promising, but I'm not tech-savvy, so I'm a bit confused. Optimizing AI models is interesting, but I wish there were more user-friendly tutorials. Will they release something simpler soon? 🤔🧠
