option
Home News New open source AI company Deep Cogito releases first models and they’re already topping the charts

New open source AI company Deep Cogito releases first models and they’re already topping the charts

release date release date June 6, 2025
views views 2

New open source AI company Deep Cogito releases first models and they’re already topping the charts

Deep Cogito Emerges with Revolutionary AI Models

In a groundbreaking move, Deep Cogito, a cutting-edge AI research startup located in San Francisco, has officially unveiled its first line of open-source large language models (LLMs), named Cogito v1. These models, fine-tuned from Meta’s Llama 3.2, boast hybrid reasoning capabilities that allow them to respond swiftly or engage in introspective thinking—a feature reminiscent of OpenAI’s “o” series and DeepSeek R1.

Deep Cogito envisions pushing AI beyond conventional human oversight constraints by fostering iterative self-improvement within its models. Their ultimate goal? To develop superintelligence—AI that surpasses human capabilities across all fields. Yet, the company assures that all models will remain open-source.

Drishan Arora, CEO and co-founder of Deep Cogito, previously served as a Senior Software Engineer at Google, leading the development of LLMs for Google’s generative search product. He confidently stated on X that these models are among the strongest open models at their scale, outperforming competitors like LLaMA, DeepSeek, and Qwen.

The Model Lineup

The initial offering includes five base sizes—3 billion, 8 billion, 14 billion, 32 billion, and 70 billion parameters—and is already accessible on platforms such as Hugging Face, Ollama, and APIs via Fireworks and Together AI. These models operate under Llama licensing terms, allowing commercial use for up to 700 million monthly users before requiring a paid license from Meta.

Deep Cogito intends to roll out even larger models, potentially reaching 671 billion parameters, in the near future.

Training Approach: Iterated Distillation and Amplification (IDA)

Arora introduced IDA, a novel method distinct from traditional reinforcement learning from human feedback (RLHF) or teacher-model distillation. IDA focuses on allocating additional computational resources to generate superior solutions, subsequently embedding this enhanced reasoning into the model itself—a continuous feedback loop aimed at boosting capabilities. This approach mirrors Google AlphaGo’s self-play strategy adapted for natural language processing.

Benchmarks and Evaluations

Deep Cogito presented comprehensive evaluation results comparing Cogito models against open-source counterparts in areas such as general knowledge, mathematical reasoning, and multilingual tasks. Key findings include:

  • Cogito 3B (Standard): Outperforms LLaMA 3.2 3B on MMLU by 6.7 percentage points (65.4% vs. 58.7%) and on Hellaswag by 18.8 points (81.1% vs. 62.3%).
  • Cogito 3B (Reasoning Mode): Scores 72.6% on MMLU and 84.2% on ARC.
  • Cogito 8B (Standard): Achieves 80.5% on MMLU, outscoring LLaMA 3.1 8B by 12.8 points.
  • Cogito 8B (Reasoning Mode): Scores 83.1% on MMLU and 92.0% on ARC.
  • Cogito 70B (Standard): Leads LLaMA 3.3 70B on MMLU by 6.4 points (91.7% vs. 85.3%) and surpasses LLaMA 4 Scout 109B on aggregate benchmarks (54.5% vs. 53.3%).

While Cogito models excel in reasoning mode, certain trade-offs exist, particularly in mathematical tasks.

Native Tool Calling

Deep Cogito also assessed its models’ native tool-calling performance, a crucial aspect for agent and API-integrated systems.

  • Cogito 3B: Supports four tool-calling tasks and excels in simple and multiple tool calls.
  • Cogito 8B: Demonstrates strong performance across all tool call types, outperforming LLaMA 3.1 8B significantly.

Future Plans

Looking forward, Deep Cogito plans to introduce larger models, including mixture-of-experts variants at 109B, 400B, and 671B parameters, alongside ongoing updates to existing checkpoints. The company views IDA as a sustainable pathway toward scalable self-improvement, reducing reliance on human or static teacher models.

Arora highlighted that real-world utility and adaptability are the ultimate measures of success, emphasizing that this is merely the start of a promising journey. Deep Cogito collaborates with renowned entities like Hugging Face, RunPod, Fireworks AI, Together AI, and Ollama, ensuring all models remain open-source and freely accessible.

Related article
구글의 인공지능 미래 펀드는 신중하게 접근해야 할 수 있다 구글의 인공지능 미래 펀드는 신중하게 접근해야 할 수 있다 구글의 새로운 AI 투자 이니셔티브: 규제 심사 속 전략적 전환 구글의 최근 AI 퓨처스 펀드 발표는 기술 거인의 인공지능 미래 구축 노력에서 큰 움직임을 나타냅니다. 이 이니셔티브는 스타트업들에게 필요한 자본을 제공하고, 아직 개발 중인 첨단 AI 모델에 대한 조기 접근권을 부여하며, 구글 내부 전문가들의 멘토링을 제
구글의 AI 도약 속内幕: Gemini 2.5는 더 깊이 생각하고, 더 영리하게 말하며, 더 빠르게 코딩한다 구글의 AI 도약 속内幕: Gemini 2.5는 더 깊이 생각하고, 더 영리하게 말하며, 더 빠르게 코딩한다 구글, 보편적인 AI 어시스턴트 비전 실현에 한 걸음 더 다가서다올해 구글 I/O 행사에서 구글은 Gemini 2.5 시리즈의 중요한 업데이트를 공개했다. 특히 다양한 차원에서 기능을 개선하는 데 초점을 맞췄다. 최신 버전인 Gemini 2.5 Flash와 2.5 Pro는 이제 더 지능적이며 효율적이다. 이러한 발전은 구
Oura, AI 기반 혈당 추적 및 식사 기록 기능 추가 Oura, AI 기반 혈당 추적 및 식사 기록 기능 추가 Oura, 대사 건강에 대한 약속을 강화하기 위해 두 가지 흥미로운 신규 기능 추가Oura는 Glucose 추적 및 식사 기록이라는 두 가지 혁신적인 AI 기반 기능을 통해 대사 건강 분야에서 새로운 경지를 개척하고 있습니다. Glucose 추적 기능은 최근 Dexcom과의 전략적 파트너십 이후 발표된 Stelo 지속형
Comments (0)
0/200
Back to Top
OR