Milestone Advances AI Strategy With Generative Plug-In for XProtect

Milestone executives discuss how a new generative AI plug-in for XProtect marks the company’s next step toward ethical, contextual and developer-driven video intelligence.
Nov. 4, 2025

Milestone Systems is introducing a generative AI-powered plug-in for its XProtect video management software (VMS), a move the company says represents the next step in how operators manage and learn from visual data.

Developed in collaboration with NVIDIA, the new tool is designed to automate video review, reduce false alarms and streamline operator workflows, signaling a broader evolution in Milestone’s technology strategy. A beta version is debuting at Smart City Expo World Congress in Barcelona, Nov. 4-6, with general availability coming later this year.

SecurityInfoWatch held an exclusive interview with Andrew Burnett, Interim CTO, and Edward Mauser, Director and Product Lead for Hafnia at Milestone Systems, to discuss the company’s new generative AI plug-in and what it signals for the future of video management.

Beyond traditional VMS

While Milestone remains rooted in its video management platform, the company is clearly pushing beyond conventional definitions of VMS functionality.

“VMS is still at our core,” said Burnett, who noted that the platform has steadily evolved through recent additions such as cloud services, analytics solutions and the Hafnia vision language model (VLM). “It’s about taking VMS as a concept for our customers and partners and bringing that into the modern era,” he explained. “Generative AI plays a huge part in that… and we play a role as a trusted provider to bridge that gap for customers and partners.”

That gap, Burnett said, includes the challenge of helping organizations engage responsibly with AI in ways that are both compliant and ethical. The company’s new plug-in is intended to provide that assurance while expanding the value of XProtect for operators and integrators alike.

Generative AI vs. traditional analytics

According to Mauser, the new approach relies on a vision language model that understands scenes in full context rather than focusing on isolated events or objects.

“A visual language model is trained on a very large amount of data that understands the context of the world better than a conventional model used on devices today,” he said. “This solution actually builds on and amplifies existing analytics to support use cases that previously couldn’t be solved without heavy investments.”

By integrating this model directly with XProtect’s rule engine, Milestone aims to simplify deployment while enhancing performance. Mauser said installation takes only a few minutes and requires no additional infrastructure or GPUs, because the VLM operates as a service.

About the Author

Rodney Bosch

Editor-in-Chief/SecurityInfoWatch.com

Rodney Bosch is the Editor-in-Chief of SecurityInfoWatch.com. He has covered the security industry since 2006 for multiple major security publications. Reach him at [email protected].
