⚡🔋OpenAI's New Voice AI Can Now Hear Your Emotions and Talk Back

And more: OpenAI Snags Meta's AR Genius; GPT-4o Powers Robot Arms Anyone Can Build

In partnership with

Good morning aspiring leaders of the next gen!☀️

🚀 Here is what you are going to find in The Supercharged today:

  • OpenAI spreading voice mode love. Time to have deeper conversations with your AI! 🗣️✨

  • Meta's Llama models joining national security. The Pentagon gets a new digital pet! 🦙🛡️

  • Meta's Orion hardware guru jumps ship to OpenAI. The tech talent carousel spins again! 🎠💼

  • Anduril playing eeny-meeny with factory locations. Three states enter, and one state wins! 🏭🎲

  • Anthropic makes Haiku pricier. Poetry just got more expensive! 💰📝

  • Amazon adds AI recaps to Prime Video. Too lazy to watch? Let AI do it! 🎬🤖

  • Perplexity CEO offers to replace NYT strikers. Talk about reading the room! 📰😬

  • Apple simplifies ChatGPT Plus upgrades. One-click path to AI premium! 🍎💫

  • Coatue passing the hat for $1B in AI bets. Another day, another billion-dollar AI fund! 🎲💸

For Those Who Seek Unbiased News.

Be informed with 1440! Join 3.5 million readers who enjoy our daily, factual news updates. We compile insights from over 100 sources, offering a comprehensive look at politics, global events, business, and culture in just 5 minutes. Free from bias and political spin, get your news straight.

OpenAI released its advanced voice mode to more people. Here’s how to get it.

OpenAI has announced a significant expansion of its Advanced Voice Mode for ChatGPT, bringing sophisticated voice interaction capabilities to a broader user base. This enhancement represents a major step forward in natural AI communication, offering features like mid-sentence interruption and emotional response adaptation.

Technical Features and Implementation

The new Advanced Voice Mode marks a substantial improvement over the standard voice interface previously available to paid users. The system now offers more dynamic interaction capabilities, including the ability to interrupt the AI's responses naturally - a feature that was notably absent in the mobile app's previous iteration. The technology can now analyze and respond to emotional cues in users' voices, adjusting its responses accordingly.

OpenAI has introduced five new AI voices - Arbor, Maple, Sol, Spruce, and Vale - created through collaboration with professional voice actors worldwide. This development came after earlier controversy regarding the similarity of its original voice, Sky, to actress Scarlett Johansson's voice in the movie "Her." The company emphasizes that these new voices were carefully selected for their warmth, approachability, and ability to engage in extended conversations.

Accessibility and Safety Measures

The rollout strategy reflects a careful balance between expansion and control. Initially, access is limited to Plus users ($20/month) and Team users ($30/month), with plans to extend to Enterprise and Edu tiers. However, the company has implemented geographic restrictions, excluding users in the EU, UK, Switzerland, Iceland, Norway, and Liechtenstein.

Safety testing has been a crucial component of the development process, with external experts representing 45 different languages and 29 geographies evaluating the system. The GPT-4o system card outlines specific safeguards against generating violent or erotic speech, unauthorized voice imitation, and copyrighted content reproduction.

The closed-source nature of OpenAI's models has raised some concerns among researchers, as it makes independent evaluation of safety, bias, and potential harm more challenging compared to open-source alternatives. This limitation highlights the ongoing tension between proprietary technology and transparent safety validation in AI development.

The technology's ability to remember user-specific information and improve pronunciation in non-English languages demonstrates OpenAI's commitment to creating a more personalized and globally accessible tool. Early user feedback has highlighted the system's impressive speed and natural interaction capabilities, though the limited availability has frustrated some users.

The development of Advanced Voice Mode represents a significant milestone in the evolution of AI communication interfaces. Its ability to process and respond to emotional cues could open new possibilities in areas such as education, customer service, and therapeutic applications.

Looking ahead, the technology raises important questions about the future of human-AI interaction. The incorporation of emotional recognition and response capabilities suggests a movement toward more empathetic AI systems, though this also raises ethical considerations about the appropriate boundaries of AI emotional engagement.

The selective rollout strategy indicates OpenAI's awareness of the need to balance innovation with responsible deployment. By gradually expanding access while maintaining strict safety protocols, the company aims to ensure that the technology's benefits can be realized while minimizing potential risks.

The timing of this release coincides with increasing competition in the voice AI space, as other major tech companies develop their own advanced voice interaction systems. OpenAI's approach, focusing on emotional intelligence and natural interruption capabilities, could set new standards for voice-based AI interactions.

The successful implementation of these features could significantly influence the direction of voice AI development across the industry. As users become more comfortable with emotionally aware AI assistants, we may see increased demand for such capabilities in various applications and services.

As Advanced Voice Mode continues to evolve, its impact on human-AI interaction patterns and user expectations will be closely watched by industry observers and researchers alike. The technology's success could herald a new era in how we communicate with AI systems, potentially reshaping our understanding of machine-human interaction.

Meta says it’s making its Llama models available for US national security applications

Meta has announced it will make its Llama AI models available to U.S. government agencies and contractors for national security applications, partnering with companies like Accenture, AWS, Lockheed Martin, and Microsoft. This decision comes after reports that Chinese military researchers used Llama 2 for defense applications, which Meta called "unauthorized." While the company typically prohibits using Llama for military purposes, it's making exceptions for the U.S. and allied nations like the UK, Canada, Australia, and New Zealand. The move has sparked debate about AI in defense, with some experts warning about risks while Meta argues open AI can advance defense research while promoting U.S. interests.

Meta’s former hardware lead for Orion is joining OpenAI

Caitlin Kalinowski, former head of Meta's AR glasses team, is joining OpenAI to lead its robotics and consumer hardware initiatives. Kalinowski, who previously oversaw Meta's Orion AR prototype and VR goggles development, and worked at Apple designing MacBooks, will focus on bringing AI into the physical world through robotics work and partnerships. She may collaborate with former Apple executive Jony Ive on a new AI hardware device that OpenAI and Ive's LoveFrom are developing. This move comes as OpenAI rebuilds its robotics team after a four-year hiatus, while several companies, including Apple and Figure, are already incorporating OpenAI's models into their hardware.

📔#1 Insights Today on AI. Click the Links to Read

  1. Anduril is considering Arizona, Ohio, or Texas for its massive manufacturing facility, source says

  2. Anthropic hikes the price of its Haiku model

  3. Amazon brings generative AI-powered recaps to Prime Video

  4. Perplexity CEO offers AI company’s services to replace striking NYT staff

  5. Apple users can soon upgrade to ChatGPT Plus within the Settings app

  6. Coatue is raising $1B for AI bets

Easy tool for image creation: https://createnow.xyz/ 

Create easy your AI music: https://suno.com/

🧑 What specific AI-related skill or knowledge do you want to acquire in the next 6 months?

Help us to support your goals with the best infomation

Login or Subscribe to participate in polls.

Membership Spotlight
Membership Spotlight

🤖 Which Area of Your Business Do You Want to Enhance with AI?

Help us understand your AI implementation priorities

Login or Subscribe to participate in polls.

Business Problem:

I’m managing multiple email accounts for clients and need an efficient way to process incoming messages in real time without missing any critical ones.

Answer: Centralize Accounts:

Use a single email management tool (like Front, Zoho Mail, or Help Scout) to consolidate all client emails into one dashboard.

Set Priority Filters:

Create rules for each account to flag, label, or move critical emails into a “Priority” folder based on sender, keywords, or urgency indicators.

Real-Time Notifications😀

Enable notifications for only critical emails. Tools like Spark or Superhuman allow VIP notifications, alerting you immediately for prioritized messages only.

At set times, quickly scan low-priority messages to ensure nothing essential is missed. Handle flagged critical emails in real-time.

Do you have a business problem keeping you up at night? Here’s your chance to get it solved! Share your most staggering challenges with us, and I’ll use the power of AI to find solutions tailored just for you. I’ll feature the answers in one of our upcoming Supercharged issues—let’s tackle it together!

Membership Spotlight
Membership Spotlight

AI is not just about creating systems that can process information quickly; it's about developing new ways to represent and understand knowledge. Each advance in AI reveals something new about the nature of intelligence and learning.

Leslie Pack Kaelbling

Leslie Pack Kaelbling is an American roboticist and the Panasonic Professor of Computer Science and Engineering at the Massachusetts Institute of Technology.

AI Meets Affordability: A $120 Robot Arm That Cleans with GPT-4o

In a remarkable demonstration of how artificial intelligence can democratize robotics, two researchers have accomplished something both simple and revolutionary: they taught a pair of inexpensive robot arms to clean up spills using OpenAI's GPT-4o, and they did it in just four days.

Jannik Grothusen from UC Berkeley and Kaspar Janssen from ETH Zurich have shown that sophisticated robotics doesn't always require expensive hardware or months of development. Using robot arms that cost just $120 each, they created a visual language model for human-robot interaction that can understand and execute basic cleaning tasks.

The project's success hinges on GPT-4o's ability to process and learn from visual demonstrations. The researchers trained the system using approximately 100 demonstrations, teaching the robots to recognize and respond to spills effectively. This efficient training process represents a significant step forward in making robotic assistance more accessible.

What makes this project particularly noteworthy is its openness and reproducibility. The Robot Studio has released detailed plans on YouTube for building these budget-friendly arms, meaning anyone with an interest in robotics can potentially recreate this setup at home.

This development represents more than just a clever hack; it's a powerful example of how large language models are transforming robotics. By combining affordable hardware with sophisticated AI, researchers are breaking down the traditional barriers to entry in robotics development, potentially paving the way for more accessible and practical robotic applications in everyday life.

In an era where robotics often seems the domain of well-funded labs and tech giants, this project demonstrates that innovation can come from anywhere, especially when powered by open-source tools and generative AI.

RAREMINTSWe deliver daily curated Web 3 news in under 5 minutes, for free.

AI Playground: Code & Craft

🔧 This Week’s Tool: 🎮 "HeyGen Video Challenge: Hyper Realistic Videos

HeyGen is an innovative AI video generation tool that simplifies video creation by allowing users to transform text prompts into engaging, high-quality videos. It’s especially valuable for marketers, business owners, and content creators looking to produce professional video content quickly and affordably.

This Week’s Task: Create a short, compelling video using HeyGen to answer a creative prompt. Showcase your message or product in a 30-second video that captivates your audience!

Why Is It Better Than Other Tools?

HeyGen sets itself apart with its user-friendly interface, powerful customization options, and seamless AI-driven automation for video creation.

  • Easy to Use: HeyGen’s intuitive design allows users to create videos with minimal effort. No prior editing experience is required, making it ideal for beginners and pros alike.

  • Adaptable to Personal Data: Users can customize their videos with personal branding elements, voiceovers, and styles to resonate with their target audience. Whether highlighting product features or sharing a story, HeyGen adapts to your content needs.

  • Accessible: HeyGen is cloud-based and offers fast video generation, allowing users to create and access videos on any device without heavy software requirements. You can produce studio-quality videos in 175 languages without a camera or crew.

Step-by-Step Instructions

  1. Sign Up or Log In to HeyGen:

    • Head to HeyGen and log in or create a new account.

  2. Create Your Personal Avatar:

    • Follow the prompts in HeyGen to create your own avatar. You can choose facial features, expressions, and even voice options. Make it look and sound as close to you as possible to increase engagement.

  3. Write a Short Script:

    • Think about what you’d like to share with your audience. Here are some ideas:

      • Introduce yourself or your brand.

      • Share a quick tip related to your industry.

      • Highlight a special offer or announcement.

  4. Upload the Script into HeyGen:

    • Paste your text into HeyGen’s editor. You can adjust the tone, speed, and other settings to make it sound natural.

  5. Customize the Video:

    • Add background colors, your brand’s logo, or select visual effects that match your brand’s style. HeyGen’s tools make it easy to add a personalized touch to every aspect of your video.

  6. Preview and Generate:

    • Preview your video to ensure everything looks and sounds just right. Once you’re happy with it, hit “Generate” and let HeyGen work its magic!

  7. Download and Share:

    • Download the video and share it across your social media channels. Add a catchy caption to grab attention and engage viewers.

      What to Share with Us:

    Send us the link to your video, or tag us in your social media post with #TheSuperchargedChallenge!

The winner this week is P.U. creator of “Power Couple”! 🎉

Thank you for creating with Midjourney! Thank you for participating!

Image 2


Let’s go and create with AI—your next masterpiece awaits! 🚀

No more playing catch-up. It's time to GET AHEAD!!! 🚀🚀🚀

🙌🏼

Elena

⚡︎🔋 The Supercharged - loved by thousands of readers ❤️🙋‍♀️

Did The Supercharged bring you the ultimate satisfaction this week?

How did we do?

Login or Subscribe to participate in polls.

The Supercharged is aiming to be the world's #1 AI business magazine and is on a mission to empower 1,000,000 entrepreneurs worldwide by 2025, guiding them through the transition into the AI-driven creative age. We're dedicated to breaking down complex technologies, sharing actionable insights, and fostering a community that thrives on innovation, to become the ultimate resource for businesses navigating the AI revolution.

The Supercharged is the #1 AI Newsletter for Entrepreneurs, with 25,000 + readers working at the world’s leading startups and enterprises. The Supercharged is free for the readers. Main ads are typically sold out 2 weeks in advance. You can book future ad spots here.

I'm sending this email because you registered for one of our workshops or our affiliates brought you. You can unsubscribe at the bottom of each email at any time.

Reply

or to participate.