Hugging Face Enhances LeRobot Platform with Self-Driving Machine Training Data
Last year, Hugging Face, a well-known AI development platform, introduced LeRobot, a comprehensive suite of open AI models, datasets, and tools aimed at enhancing real-world robotics applications. This week, they've taken a significant step forward by partnering with AI startup Yaak to enrich LeRobot. Together, they've introduced the Learning to Drive (L2D) dataset, a massive training set designed specifically for autonomous navigation in varied environments, like bustling city streets.
The L2D dataset is colossal, spanning over a petabyte, and it's packed with data gathered from sensors mounted on cars used in German driving schools. This dataset captures a wide range of driving experiences, including everything from navigating construction zones to cruising on highways, all recorded through cameras, GPS, and vehicle dynamics sensors. This unique collection offers a real-world perspective from both instructors and students behind the wheel.
While there are other open self-driving training datasets out there, such as those from Alphabet’s Waymo and Comma AI, they often focus on specific planning tasks like object detection and tracking. These require meticulous annotations, which can be a bottleneck for scaling, according to the creators of L2D. In contrast, L2D is built to facilitate "end-to-end" learning, allowing AI models to predict actions directly from raw sensor inputs, like deciding when a pedestrian might step into the road based on camera footage.

A sampling of the data in the L2D dataset, captured by a number of sensors. Image Credits: Hugging Face
Yaak co-founder Harsimrat Sandhawalia and Remi Cadene, from Hugging Face's AI for robotics team, expressed their excitement in a blog post, stating, "The AI community can now build end-to-end self-driving models. L2D aims to be the largest open-source self-driving dataset that empowers the AI community with unique and diverse 'episodes' for training end-to-end spatial intelligence."
Hugging Face and Yaak are gearing up for some real-world action this summer. They plan to test models trained on L2D and LeRobot in actual driving conditions, with a safety driver on board, of course. They're inviting the AI community to contribute by submitting models and suggesting specific tasks, like navigating tricky roundabouts or mastering tight parking spaces, to be evaluated during these tests.
Related article
WordPress.com now allows AI agents to write and publish posts, plus more
WordPress.com, the popular web hosting and publishing platform, is now embracing AI agents—a move that could reshape the look and feel of the web. The company announced Friday that it will allow AI agents to draft, edit, and publish content on custom
Anthropic's experimental AI Claude completes negotiations and transactions in e-commerce test
As artificial intelligence advances rapidly, Anthropic quietly rolled out an internal experiment called "Project Deal" last Friday, showcasing AI's potential in e-commerce. The experiment had its AI model Claude autonomously handle buying, selling, a
DeepSeek Code poised for launch
As AI technology accelerates, DeepSeek is at a thrilling juncture. The AI company recently revealed it has secured over 70 billion yuan in funding. Leadership has emphasized a commitment to groundbreaking AI research over immediate commercial gains.
Related Special Topic Recommendations
Comments (14)
0/500
Whoa, Hugging Face diving into self-driving data? That's wild! LeRobot just got a serious upgrade. Wonder if they'll open-source the whole pipeline. 🤖🚗
自動運転データも統合したんですってね、Hugging FaceとYaakの連携、進化が速すぎて追いつけない!ロボット学習用のデータセットが拡充されることで、研究室レベルの技術が身近になってきてる感じがします。個人的には家庭用ロボットが早く安価に提供されてほしいんですけど、セキュリティ面の懸念はどうクリアするんでしょう?🤖🤔 オープンモデルが増えるのはいいけど、学習データの質の問題は常に気になります。
Diese Entwicklung bei LeRobot zeigt, wie schnell sich Robotik und KI verbinden. Ich frage mich, ob solche Open-Source-Projekte langfristig mit Googles Robotik-Initiativen mithalten können. Die Partnerschaft mit Yaak könnte neue Standards setzen - hoffentlich bleiben ethische Fragen dabei nicht auf der Strecke. 🤖
Encore une collaboration qui promet ! HF+Yaak pour améliorer les données de robots autonomes, j'espère que ça ne finira pas comme les voitures qui ne reconnaissent pas les piétons par temps de pluie 😅 C'est cool mais la sécurité d'abord, non ?
Last year, Hugging Face, a well-known AI development platform, introduced LeRobot, a comprehensive suite of open AI models, datasets, and tools aimed at enhancing real-world robotics applications. This week, they've taken a significant step forward by partnering with AI startup Yaak to enrich LeRobot. Together, they've introduced the Learning to Drive (L2D) dataset, a massive training set designed specifically for autonomous navigation in varied environments, like bustling city streets.
The L2D dataset is colossal, spanning over a petabyte, and it's packed with data gathered from sensors mounted on cars used in German driving schools. This dataset captures a wide range of driving experiences, including everything from navigating construction zones to cruising on highways, all recorded through cameras, GPS, and vehicle dynamics sensors. This unique collection offers a real-world perspective from both instructors and students behind the wheel.
While there are other open self-driving training datasets out there, such as those from Alphabet’s Waymo and Comma AI, they often focus on specific planning tasks like object detection and tracking. These require meticulous annotations, which can be a bottleneck for scaling, according to the creators of L2D. In contrast, L2D is built to facilitate "end-to-end" learning, allowing AI models to predict actions directly from raw sensor inputs, like deciding when a pedestrian might step into the road based on camera footage.

Yaak co-founder Harsimrat Sandhawalia and Remi Cadene, from Hugging Face's AI for robotics team, expressed their excitement in a blog post, stating, "The AI community can now build end-to-end self-driving models. L2D aims to be the largest open-source self-driving dataset that empowers the AI community with unique and diverse 'episodes' for training end-to-end spatial intelligence."
Hugging Face and Yaak are gearing up for some real-world action this summer. They plan to test models trained on L2D and LeRobot in actual driving conditions, with a safety driver on board, of course. They're inviting the AI community to contribute by submitting models and suggesting specific tasks, like navigating tricky roundabouts or mastering tight parking spaces, to be evaluated during these tests.
WordPress.com now allows AI agents to write and publish posts, plus more
WordPress.com, the popular web hosting and publishing platform, is now embracing AI agents—a move that could reshape the look and feel of the web. The company announced Friday that it will allow AI agents to draft, edit, and publish content on custom
Anthropic's experimental AI Claude completes negotiations and transactions in e-commerce test
As artificial intelligence advances rapidly, Anthropic quietly rolled out an internal experiment called "Project Deal" last Friday, showcasing AI's potential in e-commerce. The experiment had its AI model Claude autonomously handle buying, selling, a
DeepSeek Code poised for launch
As AI technology accelerates, DeepSeek is at a thrilling juncture. The AI company recently revealed it has secured over 70 billion yuan in funding. Leadership has emphasized a commitment to groundbreaking AI research over immediate commercial gains.
Whoa, Hugging Face diving into self-driving data? That's wild! LeRobot just got a serious upgrade. Wonder if they'll open-source the whole pipeline. 🤖🚗
自動運転データも統合したんですってね、Hugging FaceとYaakの連携、進化が速すぎて追いつけない!ロボット学習用のデータセットが拡充されることで、研究室レベルの技術が身近になってきてる感じがします。個人的には家庭用ロボットが早く安価に提供されてほしいんですけど、セキュリティ面の懸念はどうクリアするんでしょう?🤖🤔 オープンモデルが増えるのはいいけど、学習データの質の問題は常に気になります。
Diese Entwicklung bei LeRobot zeigt, wie schnell sich Robotik und KI verbinden. Ich frage mich, ob solche Open-Source-Projekte langfristig mit Googles Robotik-Initiativen mithalten können. Die Partnerschaft mit Yaak könnte neue Standards setzen - hoffentlich bleiben ethische Fragen dabei nicht auf der Strecke. 🤖
Encore une collaboration qui promet ! HF+Yaak pour améliorer les données de robots autonomes, j'espère que ça ne finira pas comme les voitures qui ne reconnaissent pas les piétons par temps de pluie 😅 C'est cool mais la sécurité d'abord, non ?





Home






