For years, the "last mile" of digital transformation has been plagued by forgotten PDFs and ignored training manuals.
Organizations spend hundreds of thousands on sophisticated software like SAP or Salesforce, only for employees to struggle with basic navigation. Now, as the era of agentic AI arrives, companies face a double-edged sword: they must train human employees to collaborate with AI while simultaneously teaching AI agents to navigate the labyrinthine interfaces of the modern enterprise.
One idea that appears to be gaining momentum among AI-forward companies: using screen recordings, tutorials and walkthroughs of someone performing an enterprise task, be it creating a new ticket or processing an invoice, and training AI to replicate the flow based on the screen capture. Just this week, a startup called Standard Intelligence went viral on X showing an early demo of an open-ended version of this for the physical and digital world.
But the truth is, there are already players tackling this problem for the enterprise square-on. Case in point: Guidde, an Israeli startup born during the video-centric years of the COVID-19 pandemic, today announced an oversubscribed $50 million Series B funding round led by PSG Equity to address this exact information infrastructure crisis.
Instead of feeding an agent a static PDF manual, Guidde provides high-fidelity "Video Ground Truth": a rich stream of data captured from real human experts as they navigate complex software.
The funding signals a shift in how the tech industry views documentation: not as a static byproduct of work, but as the critical telemetry needed to train the next generation of autonomous digital agents.
Technology: from video capture to world models
At its core, Guidde is an AI Digital Adoption Platform (ADAP). However, its technological breakthrough lies in what happens behind the scenes during a recording.
Guidde isn't just recording pixels; it’s capturing every click, scroll, and latent interaction with the HTML page: the subtle pauses, the specific scroll depths, and the corrections a human makes when a system lags. This telemetry transforms raw video into a Vision-Language-Action (VLA) training set.
Meanwhile, the platform's Magic Redaction automatically obscures sensitive data like passwords or credit card numbers during capture, ensuring materials remain secure and HIPAA-aligned.
"Every time you click a button, you drag-and-drop, you scroll, you type, we gather the interaction… all of it, we do cleanse it; there's no private information," explained Guidde co-founder and CEO Yoav Einav in an exclusive interview with VentureBeat.
Under the hood, the platform captures the underlying metadata and DOM (Document Object Model) changes synchronized with the video frames. The differentiator is the telemetry hidden beneath the surface.
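To make the idea of frame-synchronized, cleansed telemetry concrete, here is a minimal Python sketch. It is not Guidde's implementation: the `InteractionEvent` fields, the `redact` and `build_training_rows` helpers, and the card-number pattern are all hypothetical, chosen only to illustrate how an interaction log could be masked and paired with video frames.

```python
import re
from bisect import bisect_right
from dataclasses import dataclass, replace

# Hypothetical pattern: 13 to 16 digits, optionally separated by spaces or hyphens.
CARD_PATTERN = re.compile(r"\b\d(?:[ -]?\d){12,15}\b")


@dataclass(frozen=True)
class InteractionEvent:
    """One captured interaction (all field names are assumptions)."""
    timestamp_ms: int          # time since the recording started
    action: str                # e.g. "click", "scroll", "type"
    selector: str              # DOM selector of the target element
    x: int                     # pointer coordinates on the page
    y: int
    value: str = ""            # typed text, if any
    is_password: bool = False  # True when the target is a password field


def redact(event: InteractionEvent) -> InteractionEvent:
    """Return a copy of the event with sensitive text masked."""
    if event.is_password:
        return replace(event, value="[REDACTED]")
    return replace(event, value=CARD_PATTERN.sub("[REDACTED]", event.value))


def frame_index_for(timestamp_ms: int, frame_times_ms: list[int]) -> int:
    """Index of the latest frame shown at or before the timestamp.

    Assumes frame_times_ms is sorted and non-empty; timestamps earlier
    than the first frame are clamped to frame 0.
    """
    return max(bisect_right(frame_times_ms, timestamp_ms) - 1, 0)


def build_training_rows(events, frame_times_ms):
    """Pair each redacted event with the video frame it occurred on."""
    return [
        (frame_index_for(e.timestamp_ms, frame_times_ms), redact(e))
        for e in events
    ]
```

Each resulting row links a frame index to a cleansed action and its target, which is roughly the shape of data a vision-language-action training set would need. A production system would have to cover many more categories of sensitive data than this single pattern does.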
This rich metadata creates a "digital world model" of enterprise software. And since each enterprise uses its own unique combination of apps and processes, Guidde is creating a data moat that allows enterprise agents to reason through legacy UIs with the same spatial awareness as a human, ensuring that automation actually works in a production environment rather than just a lab demo.
For a human, it’s a tutorial. For an AI agent, it’s a high-fidelity map of the interface. This allows agents to "see" and reason through complex UIs the way humans do, solving the "last mile" of automation where agents previously failed due to a lack of specific enterprise and in-situ usage context.
In a sense, Guidde is building a "self-driving car," like a Waymo, for computer use.
Product: three pillars of Guidd-ance
The platform has evolved into three distinct products designed to scale with an organization's maturity:
Guidde Create: The engine for subject matter experts to turn workflows into documentation in minutes.
Guidde Broadcast: A personalized recommendation engine, often compared to Netflix, that delivers answers inside the tools people actually use. It knows who the user is and what department they’re in to surface relevant content exactly when needed.
Guidde Discover: The newly launched "agentic" pillar. Like Waze mapping roads by observing drivers, Discover maps software routes by monitoring how employees work. It understands the workflow, creates the content, and updates it automatically when the UI changes.
Training humans to use AI, and AI using humans
The most non-obvious aspect of Guidde’s growth is its dual-purpose mission. "We're the only platform that trains both humans and agents," Einav stated.
As companies roll out AI tools like Microsoft 365 Copilot or ServiceNow agents, they hit a proficiency gap. One of Guidde’s largest customers revealed they were paying over $1 million a year for an advanced AI tool, yet "nobody knows how to use them because they did like a 30-minute training session, and then that’s it." Guidde closes this gap by providing "bite-sized" video tutorials in the flow of work.
Simultaneously, these videos train the AI agents themselves. Foundation models like Gemini or GPT-4 often hallucinate when tasked with specific enterprise workflows because they weren't trained on the highly specific, internal "vanilla workflows" found in private enterprise systems. Guidde provides the "starting point," the "metadata," and the "x, y coordinates of the button" that an agent needs to complete an action without getting stuck.
The multimodal advantage
To maintain this level of accuracy, Guidde employs a multimodal infrastructure. The system doesn't rely on a single model; instead, it uses a "fleet" of models that evaluate one another.
Google Gemini: Often used for visual tasks like analyzing PDFs or PowerPoints.
Anthropic Claude: Leveraged for writing the storyline and narrative scripts.
Feedback Loops: When a user edits a video, that data is fed back into the model to prevent the same errors from occurring in future captures.
This approach allows Guidde to replace a legacy stack of six or seven disconnected tools (Loom for capture, Adobe Premiere for editing, 11Labs for text-to-speech, and Synthesia for avatars) with a single, AI-native platform. "We basically pack everything for you," Einav says, "and automate the entire process based on your brand guidelines."
Video-first origin story
The genesis of Guidde lies in a frustration familiar to any product leader. Before founding the company, Einav and co-founder Dan Sahar spent years mastering video traffic at Qwilt, a company they started in 2010 to analyze how people watched Netflix and Disney+.
When COVID-19 hit, they saw a massive opportunity to apply that video expertise to the workplace. They observed that short video explainers could increase free-to-paid account conversions by 30%, but the friction of creating them was unsustainable.
In an interview, Einav recalled the "tedious work" of the old world: "My team in Israel were creating the content, someone in the US with a US accent was doing the narration, someone in the marketing team would write the script… and someone in the enablement team would do the edit." This fragmented workflow meant a single video took two to three weeks to produce. "And then two weeks later, the product changes, and you need to redo it from scratch," Einav added.
Guidde was built to collapse this cycle into seconds. By automating the "Magic Capture" of a workflow, the platform generates a structured narrative script and professional AI voiceover instantly. This removes the editing bottleneck, transforming subject matter experts into "training powerhouses."
Licensing and market impact
Guidde’s pricing structure reflects its transition from a utility to a core piece of enterprise infrastructure:
Free: $0 (Up to 25 videos, web-app support).
Pro: $18/creator/month (Unlimited videos, brand kits).
Business: $39/creator/month (Unlimited text-to-voice, analytics).
Enterprise: Custom pricing (Multi-language translation, SSO, Magic Redaction).
The platform's impact is already visible in the numbers: a 41% reduction in video creation time and 34% fewer inbound support tickets.
For customers like Emerson, this translates to 40–60% faster information creation. Support teams, in particular, are finding they can offload 80% of their ticket volume with agents, but only if those agents have the content to be useful.
"The agent without the content is useless," Einav warns, noting that most enterprise documentation is either years out of date or nonexistent.
Early community and industry reception
Guidde already claims 4,500 enterprise customers and seeks to expand this number with its new round of funding. Support and operations leaders have been vocal about the platform's ease of use. Christopher Cummings, VP of User Experience at DocNetwork, highlighted its ability to provide "quick, personalized video responses to customer questions."
Meanwhile, Wren Cotrone, a Director of Customer Support, noted that "Once you set the branding the way you want, you can really zoom through these things."
Ronen Nir, Managing Director at PSG, summarized the investment thesis: "Guidde is solving one of the biggest blockers to successful AI adoption: the information infrastructure."
Why this matters now
The paradigm shift from text-only LLMs to agentic video intelligence is the defining trend of 2026. Guidde’s Series B signals that the "ground truth" for enterprise agents will come from raw video observation, not static documentation.
By capturing how work gets done across millions of workflows, Guidde is building a dataset that few others possess.
As Einav put it: "It starts with humans in the loop, and over time moves toward full autonomy." For the modern enterprise, the map is no longer a static document; it’s a living, breathing video intelligence layer that guides both the workforce and the agents that support them.

