Popular on EntSun
- Century City Alumnae Chapter Of Delta Sigma Theta Sorority, Inc. Presents The 2026 Entertainment Career Summit At Emerson College Los Angeles - 166
- RAS AP Consulting Advances to RFP Stage in Heidelberg Materials' SAP Vendor & Customer Master Data Modernization Initiative - 135
- Contracting Resources Group and Aalis Management Consulting Launch ARG Joint Venture Under SBA Mentor-Protégé Program - 131
- Advanced TeleSensors Appoints AgeTech Innovator Tiffany Wey, MBA as Vice President of Sales & Marketing - 127
- Slipaway Food Truck Park & Marina to host Kentucky Derby watch party and Cinco de Mayo celebration - 119
- Keenethics enters the ChatGPT Apps ecosystem as a new growth opportunity for businesses - 116
- Most Americans Choose Their Water Brand Because of Its Natural Source — Yet Fewer Than 3 in 10 Understand What Spring Water Actually Is - 114
- Lecture/Performance on Jewish Magicians in Juneau on Monday, May 4, 2026 - 112
- A Hidden Magical World Awaits in Ashley Gayheart's Upcoming Young Adult Fantasy, Rosewood Academy: The Awakening - 112
- Dual-Engine Growth Strategy Unleashed Targeting a $9.1B Market and the Exploding AI Biotech Revolution: KALA BIO (N A S D A Q: KALA) - 107
Similar on EntSun
- The AI Direction Deficit: TripleTen Study Finds Staff Get Told to Use AI — But Not Trained to Use It
- All About Technology Celebrates 25 Years of Bridging Detroit's Digital Divide
- iatroX surpasses 500,000 clinical queries and expands specialist exam coverage
- MSBG Corporation Acquires GridWatch US Telemetry Automation System
- TAYP Expands Athlete Exposure Platform Beyond Georgia With New Push Into Virginia and the 757
- The Millennium Alliance Appoints Former Adweek Executive Eric Hayden Shakun as Chief Financial Officer to Accelerate Next Phase of Growth
- $4.8M in Contracted AI Revenue with Projections of $30M Over 6-12 Months for Diversified AI Software and Platform-Based Services Provider XMax Inc
- Larry R. Wasion's Jump Gate III RoadMaker Blends Cutting-Edge Sci-Fi with High-Stakes Space Exploration and Complex Technologies
- SpeedyIndex Rolls Out Automated API for Mass URL Verification, Solving the Backlink Blind Spot for SEO Agencies
- DLT Resolution, Inc. (Stock Symbol: DLTI) Expands Into the $224 Billion Life Settlements Market While Accelerating Telecom Growth Across Canada
Stream Releases Open-Source AI Agent That Reads Your Face and Adapts How It Speaks
EntSun News/11092325
BOULDER, Colo. - EntSun -- Built on Vision Agents with Anam and Inworld to demonstrate emotionally aware, video-first AI
Stream released an open-source AI agent that responds to a user's facial expressions, gaze, and engagement in real time. The agent, called Crashout Buddy, is live at visionagents.ai.
The era of the floating orb is over. Most voice agents today are blind. They convert speech to text, run it through an LLM, and read the response back in a flat tone regardless of whether the user is laughing, frustrated, or close to tears. Built on Stream's Vision Agents framework in collaboration with Anam and Inworld, Crashout Buddy watches the user's face and shapes both what the agent says and how it says it. When the user goes quiet, it notices. When they look like they're about to lose it, it softens.
How It Works
The agent runs a multimodal perception stack on Stream's global edge network. MediaPipe tracks 52 facial blendshapes at 8 fps to classify emotion, gaze, and engagement. That signal is injected into the LLM (Gemini) on every turn, which steers Inworld's TTS-2 voice model using natural-language direction such as [say warmly with light, easy energy]. Anam renders a photorealistic, lip-synced avatar. Deepgram handles speech-to-text.
More on EntSun News
The same pattern (facial state, rich agent context, expressive voice, lip-synced avatar) suits apps in dating, coaching, recruitment, tutoring, and customer support.
Key capabilities include:
Availability
The full project is open source. Try the demo at visionagents.ai, read the guide on the Stream blog, or explore the code at: https://github.com/GetStream/Vision-Agents
Stream released an open-source AI agent that responds to a user's facial expressions, gaze, and engagement in real time. The agent, called Crashout Buddy, is live at visionagents.ai.
The era of the floating orb is over. Most voice agents today are blind. They convert speech to text, run it through an LLM, and read the response back in a flat tone regardless of whether the user is laughing, frustrated, or close to tears. Built on Stream's Vision Agents framework in collaboration with Anam and Inworld, Crashout Buddy watches the user's face and shapes both what the agent says and how it says it. When the user goes quiet, it notices. When they look like they're about to lose it, it softens.
How It Works
The agent runs a multimodal perception stack on Stream's global edge network. MediaPipe tracks 52 facial blendshapes at 8 fps to classify emotion, gaze, and engagement. That signal is injected into the LLM (Gemini) on every turn, which steers Inworld's TTS-2 voice model using natural-language direction such as [say warmly with light, easy energy]. Anam renders a photorealistic, lip-synced avatar. Deepgram handles speech-to-text.
More on EntSun News
- Summer Daily Activities Kick Off at Elklook
- iatroX surpasses 500,000 clinical queries and expands specialist exam coverage
- Inside-Out Hollywood: The Relentless Rise of Joseph Nybyk (AKA Joseph Neibich)
- SRK Collective Media Group Launches with a Modern Approach to Media, Authority Building, and Cultural Visibility
- MSBG Corporation Acquires GridWatch US Telemetry Automation System
The same pattern (facial state, rich agent context, expressive voice, lip-synced avatar) suits apps in dating, coaching, recruitment, tutoring, and customer support.
Key capabilities include:
- Emotion, gaze, and engagement classification with hysteresis to prevent flicker
- Natural-language voice steering in 100+ languages via Inworld TTS-2
- Photorealistic lip-synced avatar via Anam's CARA model
- Proactive re-engagement when the user drifts off-camera or goes quiet
- Composable processors running at independent frame rates
Availability
The full project is open source. Try the demo at visionagents.ai, read the guide on the Stream blog, or explore the code at: https://github.com/GetStream/Vision-Agents
Source: Getstream.io
0 Comments
Latest on EntSun News
- T. Jones Group Named Finalist Across Multiple Categories at the 2026 Georgie Awards
- The Simplest Small Business You're Probably Not Thinking About
- San Francisco Writer Wins Webby Award, Internet's Highest Honor, for Website Based on her Novel
- MetroLagoons announces Memorial Day weekend festivities
- EDC Weekend Comedy Special Featuring Don Barnhart & Friends — Use Promo Code FRIEND for 50% Off
- N Y S E: OTH Off The Hook YS Is Building a Vertically Integrated Marine Empire — And Investors Are Starting to Notice
- Concierge Title Agency Merges with Independence Title, Inc. to Deliver an Expanded Concierge Closing Experience Across South Florida
- Grow My Security Company Launches Next-Generation Website and Expands Strategic Marketing Solutions for the Security Industry
- $4.8M in Contracted AI Revenue with Projections of $30M Over 6-12 Months for Diversified AI Software and Platform-Based Services Provider XMax Inc
- Michelangelo's Great Secret Hiding in Plain Sight
- Virginia Marchese's Paradox: A Nation Still Deciding Who Belongs Examines Race, Migration, Law, and America's Unfinished Struggle for Equality
- From Blank Page to Published Book
- Larry R. Wasion's Jump Gate III RoadMaker Blends Cutting-Edge Sci-Fi with High-Stakes Space Exploration and Complex Technologies
- American Mensa and Davidson Institute Join Forces To Strengthen Support for Profoundly Gifted Youth
- 16th Annual Art Of Brooklyn Film Fest Returns June 1-10 with 55 New Indies
- 360 Sound And Vision Releases The Ubiquitous Compact Disc and America's Most Deadly UFO Encounters
- SpeedyIndex Rolls Out Automated API for Mass URL Verification, Solving the Backlink Blind Spot for SEO Agencies
- DJ Serving Grand Rapids, Detroit, and Ann Arbor Areas Walks Clients Through the Process
- KLEKT Announces Appointment of Jay Kimpton to Board of Directors
- Michigan Attorney General Closed FGM Licensing Investigations Months Before Federal Case Ended, Records Show