The Hailo-10H is Hailo's second-generation AI accelerator featuring powerful generative AI capabilities.
The Hailo-10H complements the Hailo-8's vision AI performance with the ability to run large language models (LLMs), vision-language models (VLMs), and other generative AI models entirely on-device, without relying on cloud connectivity.
The Hailo-10H is fully compatible with Hailo’s software stack. It empowers developers to run state-of-the-art vision and generative AI models directly on edge devices. By processing data locally, the Hailo-10H ensures strong data privacy, since personally identifiable information remains on the device, and minimizes cloud bandwidth usage.
Because inference does not depend on a cloud connection, it remains available even in environments with limited or no internet access.
The Hailo-10H is designed for edge devices across consumer, enterprise, and automotive markets, including media centers, home gateways, and automotive cockpit systems. It enables advanced use cases such as natural-language human-machine interaction, visual awareness, and multi-modal AI within the power and cost constraints typical of edge environments.
In performance benchmarks, the Hailo-10H has achieved a first-token latency of under 1 second and a throughput of over 10 tokens per second on a variety of 2B-parameter language and vision-language models. For video analytics, the Hailo-10H enables state-of-the-art object detection (e.g., YOLOv11m) on a real-time 4K video stream.
All of this comes at a typical power consumption of 2.5 W. The Hailo-10H is automotive-qualified to AEC-Q100 Grade 2 standards and is aimed at automotive designs with a 2026 start of production.
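As a rough illustration of what these figures imply for an interactive application, the sketch below estimates the time and energy needed to generate a reply of a given length. It treats the quoted bounds (1 second first-token latency, 10 tokens per second, 2.5 W) as fixed values, so the result is closer to a worst case than a measurement. The function name, the 100-token reply length, and the assumption that power stays at the typical figure for the whole generation are illustrative and are not Hailo data.

```python
def estimate_response(num_tokens, first_token_latency_s=1.0,
                      tokens_per_second=10.0, power_w=2.5):
    """Estimate (seconds, joules) to generate num_tokens tokens.

    Defaults are the bounds quoted in the text, used here as assumptions:
    under 1 s to first token, over 10 tokens/s, 2.5 W typical power.
    """
    if num_tokens < 1:
        raise ValueError("num_tokens must be at least 1")
    # The first token arrives after the first-token latency;
    # the remaining tokens stream out at the decode rate.
    decode_s = (num_tokens - 1) / tokens_per_second
    total_s = first_token_latency_s + decode_s
    # Energy = power x time, assuming constant typical power draw.
    energy_j = total_s * power_w
    return total_s, energy_j


seconds, joules = estimate_response(100)
print(f"{seconds:.1f} s, {joules:.2f} J")  # 10.9 s, 27.25 J
```

Under these assumptions, a 100-token reply takes at most about 10.9 seconds and roughly 27 joules; actual figures depend on the model, prompt length, and workload.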
To start integrating Hailo-10H into your next product, contact Hailo.