## Short Segments Fine-tuning AI models just got more accessible with a new step-by-step tutorial for Liquid AI's LFM2. We'll also explore how MIT researchers are teaching AI to interpret charts, and Nous Research's Hermes Desktop brings a new interface to AI agents. Coming up, NVIDIA's Cosmos 3 unifies physical reasoning and action generation in a single model. Fine-tuning LFM2 with QLoRA and DPO is now easier than ever. A new tutorial on Google Colab walks users through the process of fine-tuning Liquid AI's LFM2 model using QLoRA and DPO. This comprehensive guide covers loading the base LFM2 checkpoint, preparing a chat-style supervised fine-tuning dataset, and training a lightweight LoRA adapter. The tutorial also demonstrates how to merge the adapter back into the model and extend the workflow with DPO for improved response preference. By the end, users have a practical pipeline that moves from a base LFM2 model to a preference-aligned checkpoint, ready for further testing or deployment. This development makes fine-tuning more accessible and efficient, allowing users to achieve better model performance with less effort. Nous Research releases Hermes Desktop, a cross-platform front end for Hermes Agent. Hermes Desktop, now in public preview, provides a graphical interface for the open-source Hermes Agent, available on macOS, Windows, and Linux. This native application allows users to interact with Hermes Agent without needing a command-line interface, offering a more user-friendly experience. The desktop version shares configuration, API keys, sessions, skills, and memory with the CLI and gateway, ensuring seamless integration across platforms. With features like streaming responses, live tool activity, and a file browser, Hermes Desktop enhances the usability of AI agents for everyday tasks. This release marks a significant step in making AI agents more accessible to a broader audience, moving beyond developer tools to products that companies can standardize around. MIT researchers develop ChartNet to teach AI models to interpret charts. In a bid to improve AI's ability to summarize and interpret charts, MIT and the MIT-IBM Computing Research Lab have created ChartNet, a multifaceted resource for AI users. This novel dataset includes over a million varied charts, encoding visual, linguistic, and numerical components to enable robust reasoning. Using ChartNet, researchers trained open-source vision-language models that outperformed larger commercial models in tasks like data extraction and chart summarization. By enabling smaller models to excel, ChartNet offers small firms with limited budgets the opportunity to leverage AI for business trend analysis and scientific figure interpretation. This development could democratize access to advanced AI capabilities, allowing more organizations to benefit from AI-driven insights. ## Feature Story NVIDIA's Cosmos 3 unifies physical reasoning and action generation in a single model. The newly released Cosmos 3 by NVIDIA is a groundbreaking omnimodal world model for physical AI, combining physical reasoning, world generation, and action generation within one open model. This release targets robotics, autonomous vehicles, and warehouse monitoring teams, offering a unified approach to perceiving, predicting, and acting in the physical world. Cosmos 3's Mixture-of-Transformers architecture features two towers: the reasoner tower, a vision-language model that interprets images, videos, and text, and the generator tower, which produces future observations and action sequences. The reasoner tower acts as the model's brain, understanding motion and object interactions, while the generator tower uses a diffusion-based process for physics-aware video and actions. This architecture allows a single model to handle reasoning and generation together, streamlining processes that previously required separate models. By open-sourcing the Cosmos 3 models, training scripts, deployment tools, and datasets, NVIDIA is making advanced physical AI capabilities more accessible to developers and researchers. This release could accelerate innovation in fields that rely on autonomous systems, providing a robust foundation for simulating and understanding the physical world. As the AI industry continues to evolve, Cosmos 3 represents a significant step forward in the integration of reasoning and action generation, paving the way for more sophisticated and capable AI systems.