- Supercharged With AI
- Posts
- ⚡🔋Stability AI Turns Photos into Immersive 3D Videos
⚡🔋Stability AI Turns Photos into Immersive 3D Videos
And more: Anthropic Developing Voice Mode for Claude AI; Google’s Gemini AI Can Remove Watermarks
Good morning ☀️, leader of the next gen.
The future belongs to you. Let’s make conscious leadership the norm and embrace innovation as the driving force for positive change 🌍✨
WHAT’S AT STAKE TODAY ⚡
- 📷🌆 Stability AI turns photos into 3D scenes. Flat images get depth perception!
- 🗣️🎙️ Anthropic reportedly prepping Claude voice mode. AI assistant finds its voice!
- 📱✨ Tufa.io promises smarter social management. Another AI tool wants attention!
- 💼🔄 Intel facing changes under new CEO Tan. Chip giant gets fresh leadership!
- 🧪🔬 OpenAI exec leaves for materials science venture. AI talent explores new frontiers!
- 🦙📈 Zuckerberg brags about 1B Llama downloads. Meta's AI models go viral!
- 🎮💰 Arcade raises $12M to improve AI agents. Making digital assistants less annoying!
- 💻💸 Anthropic-backed code review platform scores funding. Graphite gets cash injection!
- 🎨🔊 Google adds 'canvas' to Gemini, plus Audio Overview. AI notebook gets creative!
⚡ Latest in AI
Stability AI’s new AI model turns photos into 3D scenes

Stability AI's Stable Virtual Camera generates 3D scenes from 2D images
Stability AI has unveiled Stable Virtual Camera, a groundbreaking AI model that can convert static 2D images into immersive, navigable 3D videos with realistic depth and perspective. This latest release represents a significant advancement in the company's generative AI capabilities and could potentially transform workflows across visual media industries including film, gaming, and virtual reality.
The new model leverages AI to reimagine traditional virtual camera technology—tools that have long been essential in digital filmmaking and 3D animation for scene navigation and visualization. By introducing generative capabilities to this established concept, Stability has created a system that can synthesize novel viewpoints of a scene from limited visual information, effectively "hallucinating" what parts of a scene might look like from angles not present in the original images.
According to Stability's announcement, Stable Virtual Camera can process up to 32 input images simultaneously, using them to construct a coherent 3D representation of the captured scene. Users can then specify custom camera angles and movements to generate videos that simulate different perspectives. The model offers several preset camera paths, including "Spiral," "Dolly Zoom," "Move," and "Pan," allowing for diverse cinematographic effects without requiring traditional camera equipment or complex 3D modeling software.
This research preview supports multiple aspect ratios—square (1:1), portrait (9:16), and landscape (16:9)—and can generate sequences up to 1,000 frames in length. However, Stability acknowledges certain limitations in the current implementation. The model reportedly struggles with scenes containing humans, animals, or dynamic textures like water. "Highly ambiguous scenes, complex camera paths that intersect objects or surfaces, and irregularly-shaped objects can cause flickering artifacts," the company notes, "especially when target viewpoints differ significantly from the input images."
From a technical perspective, Stable Virtual Camera likely builds upon recent advancements in neural radiance fields (NeRF) and view synthesis technologies. These approaches allow AI systems to infer the three-dimensional structure of a scene from multiple viewpoints and then render new perspectives with consistent lighting, shadows, and occlusion effects. The 1,000-frame capability suggests significant improvements in temporal consistency compared to earlier models in this domain, which often struggled with maintaining coherence across extended sequences.
The potential applications for this technology span numerous creative fields. Filmmakers could use it to explore alternative camera angles for scenes without reshooting, or to create complex camera movements that would be physically impossible or prohibitively expensive to capture with traditional equipment. Game developers might leverage the tool for rapid prototyping of environments or cinematics. Visual effects artists could generate reference footage for complex compositing tasks. Architects and real estate professionals could create virtual tours from still photographs of properties.
Stability has made Stable Virtual Camera available for research use under a noncommercial license through the AI development platform Hugging Face. This release strategy aligns with the company's historical approach of making research models accessible to the broader AI community while reserving commercial applications for potential future monetization.
The launch comes at a pivotal moment for Stability AI, which has faced significant business challenges over the past year. The company, best known for developing the popular image generation model Stable Diffusion, reportedly encountered financial difficulties under former CEO Emad Mostaque, whose management led to staff resignations, a failed partnership with design platform Canva, and investor concerns. Recent months have seen substantial organizational changes, including the appointment of a new CEO, the addition of "Titanic" director James Cameron to its board of directors, and the release of several new generative models.
In early March, Stability also announced a partnership with chipmaker Arm to bring audio generation capabilities to mobile devices running Arm chips, suggesting a strategic pivot toward diversifying both its technology portfolio and potential revenue streams. Last year, the company secured additional funding from investors including former Google CEO Eric Schmidt and Napster founder Sean Parker, who are reportedly working to restructure the business for long-term sustainability.
Within the broader AI industry, Stability's new offering enters an increasingly competitive landscape for 3D-oriented generative models. OpenAI's Sora has demonstrated the ability to generate dynamic videos from text prompts, while Google's Lumiere similarly creates motion from textual descriptions. Neither of these, however, focuses specifically on navigating 3D space from static images in the manner of Stable Virtual Camera. Runway ML has developed Gen-2, which can create limited camera movements around subjects in generated videos, while more specialized startups like Luma AI have focused explicitly on NeRF-based 3D reconstruction.
As the technology matures, questions about its implications for creative industries will likely intensify. The ability to generate convincing camera movements and perspectives from limited visual data could potentially reduce the need for certain types of specialized photography and videography. Conversely, it might also democratize sophisticated visual storytelling techniques, making advanced cinematography more accessible to independent creators with limited budgets and equipment.
Discover 100 Game-Changing Side Hustles for 2025
In today's economy, relying on a single income stream isn't enough. Our expertly curated database gives you everything you need to launch your perfect side hustle.
Explore vetted opportunities requiring minimal startup costs
Get detailed breakdowns of required skills and time investment
Compare potential earnings across different industries
Access step-by-step launch guides for each opportunity
Find side hustles that match your current skills
Ready to transform your income?
⚡ The companies of the future
Anthropic is reportedly prepping a voice mode for Claude

Anthropic Voice Mode for Claude
Anthropic is reportedly developing voice capabilities for its AI chatbot Claude. Chief Product Officer Mike Krieger told the Financial Times that the company has internal prototypes and plans to launch experiences allowing users to talk to Claude models.
Krieger suggested that voice would provide a more natural interface, especially when using Claude to operate a computer. The report indicates Anthropic has held discussions with Amazon (a major investor and partner) and voice-focused AI startup ElevenLabs about potentially powering Claude's voice features, though no deals have been finalized.
Krieger confirmed Anthropic has spoken with "a bunch of partners" to possibly accelerate the launch of a voice experience.
Why it matters: With voice AI growing in importance, Anthropic's move signals an effort to compete with OpenAI's ChatGPT voice features, aiming to create more intuitive and seamless AI interactions.
⚡ Smarter Social Media Management with AI

Tufa.io is an AI-powered social media management tool that automates post creation and scheduling across multiple platforms.
It uses artificial intelligence to generate industry-specific, engaging content, helping businesses and marketers streamline their social media workflows.
With automated scheduling and content generation, users can save time, maintain consistency, and boost engagement without requiring extensive effort or expertise. Learn more about how AI can enhance your social media strategy.
⚡ More AI Bites
- 💼🔄 Intel facing changes under new CEO Tan. Chip giant gets fresh leadership!
- 🧪🔬 OpenAI exec leaves for materials science venture. AI talent explores new frontiers!
- 🦙📈 Zuckerberg brags about 1B Llama downloads. Meta's AI models go viral!
- 🎮💰 Arcade raises $12M to improve AI agents. Making digital assistants less annoying!
- 💻💸 Anthropic-backed code review platform scores funding. Graphite gets cash injection!
- 🎨🔊 Google adds 'canvas' to Gemini, plus Audio Overview. AI notebook gets creative!
⚡ Trends for the Future
Google's Gemini AI Sparks Controversy Over Watermark Removal Capability

Google Gemini AI Watermark Removal
The details:
Users on social media have discovered a controversial application for Google's new Gemini 2.0 Flash model: removing watermarks from images, including those from Getty Images and other prominent stock media providers.
Google recently expanded access to Gemini 2.0 Flash's image generation feature, which allows users to create and edit image content. While impressively powerful, the technology appears to have minimal protective limitations. Beyond watermark removal, the model willingly generates images of celebrities and copyrighted characters.
Social media posts demonstrate that Gemini 2.0 Flash not only removes watermarks but attempts to reconstruct the underlying image data, filling gaps created when watermarks are deleted. Though other AI tools offer similar functionality, Gemini's version stands out for both its effectiveness and being free to use.
The feature is currently labeled "experimental" and "not for production use," available only through Google's developer interfaces like AI Studio. The model also shows limitations, struggling with semi-transparent watermarks and those covering large portions of images.
This capability raises significant copyright concerns. Competing AI models like Anthropic's Claude 3.7 Sonnet and OpenAI's GPT-4o explicitly refuse watermark removal requests. Claude specifically describes the practice as "unethical and potentially illegal."
Indeed, removing watermarks without permission from the original owner is generally considered illegal under U.S. copyright law, with few exceptions.
When contacted by TechCrunch, Google responded: "Using Google's generative AI tools to engage in copyright infringement is a violation of our terms of service. As with all experimental releases, we're monitoring closely and listening for developer feedback."
The discovery highlights ongoing tensions between rapidly advancing AI capabilities and intellectual property protections. While AI companies continue pushing technological boundaries, content creators and media organizations face increasing challenges safeguarding their assets in a digital landscape where protection measures can be circumvented with increasingly accessible tools.
What makes this crucial: Google's response suggests the company recognizes these concerns, though whether additional safeguards will be implemented remains to be seen as Gemini's experimental features mature toward potential production release.

Do you have a business problem keeping you up at night?
Here’s your chance to get it solved! Share your most staggering challenges with us, and I’ll use the power of AI to find solutions tailored just for you. I’ll feature the answers in one of our upcoming Supercharged issues—let’s tackle it together!

AI is not just about creating intelligent machines; it's about developing new ways to understand the fundamental nature of intelligence itself. Each advance in AI helps us see dimensions of cognition—both human and artificial—that were previously invisible to us.
Noriko Arai is a Japanese mathematician and AI researcher who leads the Todai Robot Project. She's known for her work exploring the boundaries between human and machine intelligence, particularly in analyzing how AI systems process and understand information compared to humans.
🤖 AI Playground: Transform Your Workflow 🤖
🔧 This Week’s Tool: IBM Watsonx 🔧
Overview: IBM Watsonx is a comprehensive AI platform designed to empower businesses with advanced AI capabilities. It offers a suite of tools, including a studio for AI development, a data store for managing vast datasets, and governance tools to ensure compliance and ethical AI use. Watsonx supports multiple large language models (LLMs), providing flexibility and scalability for various AI applications. 🚀
Why Is It Better Than Other Tools? ✨
- ⚡ Comprehensive AI Suite: Combines AI development, data management, and governance in a single platform, streamlining workflows and reducing integration challenges.
- 🤖 Flexible Model Support: Supports various LLMs, including IBM's own Granite series and open-source models like LLaMA-2 and Mistral, allowing businesses to choose models that best fit their needs.
- 🔒 Robust Governance: Provides tools to manage AI risks, ensure compliance with evolving regulations, and promote ethical AI practices.
What Does It Do Best? 🌟
- 🛠️ AI Development: Offers a comprehensive studio for building, training, and deploying AI models, catering to both novice and experienced AI developers.
- 📊 Data Management: Facilitates seamless access to data, whether stored in the cloud or on-premises, through a single entry point, ensuring data security and compliance.
- 🔄 AI Governance: Assists organizations in implementing comprehensive AI lifecycle governance, managing risks, and maintaining compliance with evolving AI and industry regulations.
Applications 💼:
- 💻 IT Support: Automate routine IT tasks and provide instant support to employees, reducing downtime and increasing productivity.
- 📝 Human Resources: Streamline HR processes, such as employee onboarding and benefits management, through AI-driven automation.
- 🏢 Facilities Management: Optimize maintenance schedules and manage facility resources efficiently using predictive analytics.
- 📚 Knowledge Management: Ensure quick access to company policies, procedures, and FAQs through intelligent data retrieval systems.
- 🔧 Operational Efficiency: Automate routine tasks, allowing teams to focus on strategic initiatives and innovation.
Follow This Simple Guide to Get Started with IBM Watsonx:
- 🌐 Visit the Website: Go to ibm.com/watsonx to learn more about their offerings.
- 🔗 Request a Demo: Schedule a demonstration to see how Watsonx can be tailored to your organization's needs.
- 🛠️ Integrate with Your Tools: Work with the IBM team to integrate the platform with your existing systems and workflows.
- 🚀 Launch and Train: Introduce Watsonx to your employees and provide training to ensure a smooth transition.
- 🔄 Monitor and Optimize: Utilize Watsonx's analytics to continuously improve and adapt the platform to better serve your organization.
IBM Watsonx is your partner in transforming workplace operations, enhancing productivity, and ensuring employee satisfaction. 🌟
💡 Challenge: Identify a common process in your organization and implement Watsonx to automate its resolution. Share your experience by replying to this email for a chance to win a special prize! 🎁 Start revolutionizing your workplace with IBM Watsonx today! 🚀

No more playing catch-up. It's time to GET AHEAD!!! 🚀🚀🚀, Elena
🌡️ Use the Satisfaction Thermometer to show us how much you enjoyed The Supercharged this week ;)How did we do? |
⚡︎🔋 The Supercharged - loved by thousands of readers ❤️🙋♀️
The Supercharged is aiming to be the world's #1 AI business magazine and is on a mission to empower 1,000,000 entrepreneurs worldwide by 2025, guiding them through the transition into the AI-driven creative age. We're dedicated to breaking down complex technologies, sharing actionable insights, and fostering a community that thrives on innovation, to become the ultimate resource for businesses navigating the AI revolution.
The Supercharged is the #1 AI Newsletter for Entrepreneurs, with 25,000 + readers working at the world’s leading startups and enterprises. The Supercharged is free for the readers. Main ads are typically sold out 2 weeks in advance. You can book future ad spots here.
I'm sending this email because you registered for one of our workshops or our affiliates brought you. You can unsubscribe at the bottom of each email at any time.
Reply