AI can think Visually now
Also learn how to use ideogram.ai for generating AI-based images
Hey there! Hanoomaan here — making AI simple, useful, and powerful for you.
OpenAI is once again pushing the boundaries of artificial intelligence with the release of o3 and o4-mini, cutting-edge multimodal models that combine vision, language, and autonomous tool use. These next-gen AIs don’t just process images; they reason through them like a human would. From coding to file analysis, these models can handle complex tasks without missing a beat. Their potential to transform research, automation, and human-computer interaction is staggering.
Today’s AI Menu
OpenAI unleashes o3 & o4-mini: The AI models that think with images and act autonomously
Tutorial: How to use ideogram.ai for generating AI-based images?
Everything else you should know today
5 new AI tools to boost your productivity
AI Daily Prompt, Ride the Latest Social Media Trends and much more
TODAY IN WORLD OF AI
OpenAI just dropped o3 and o4-mini, and they're not just seeing, they're thinking.

Credit: VentureBeat made with Midjourney
Multimodal AI just took a quantum leap. OpenAI has unveiled o3 and o4-mini, its newest family of AI models with an upgraded core capability:
They don’t just see images. They reason with them.
🧠Why this is a big deal:
We’ve had AI that can generate images.
We’ve had AI that can describe what it sees.
But now we’re entering a new frontier: AI that can combine visual understanding and textual reasoning all in one continuous, intelligent workflow.
Imagine this 👇
▪️ A physics student uploads a dense diagram of forces and vectors
▪️ The AI scans it, rotates the 3D visual mentally, identifies a missing equation
▪️ Then runs code to simulate the motion and suggests a fix
▪️ Then generates a new version of the image with the correction
This is no longer “image captioning” or “OCR.” This is multimodal cognition.
⚙️ Key capabilities:
✅ Visual Analysis: Parse and reason through complex images
✅ Web Search: Pull context from online sources in real-time
✅ File Interpretation: Understand documents and embedded visuals
✅ Code Execution: Run logic and simulations mid-task
✅ Image Generation: Create new visuals in the context of what’s understood
All within one prompt. One task. One intelligent loop.
🔬 What this means for builders:
For developers, educators, researchers, and startup founders, this unlocks wild new use cases:
▪️ Teaching AI to debug engineering diagrams
▪️ Letting scientists analyze and annotate microscopy images
▪️ Building agents that can read PDFs, zoom into charts, ask clarifying questions, and return actionable insight
▪️ Visual-first workflows that mimic how humans observe, analyze, and explain
This is the closest we’ve come to AI with visual intuition.
This isn’t just an upgrade. It’s the beginning of a shift from linear prompting to fluid multimodal dialogue.
We’re going from “what you tell AI” to “what you show it + how it reasons back.”
The question is no longer: Can AI see?
THE AI INSTITUTE
How to use ideogram.ai for generating AI-based images?

▪️ Visit Ideogram.ai
▪️ Sign Up / Log In (Use your Google account or email to log in.)
▪️ In the text box, describe the image you want.
Example prompts:
“A neon sign that says ‘Dream Big’ in cursive, night background”
“Motivational quote on a mountain background”
▪️ Select a Style (Optional): typography, logo, poster, painting, 3D render, realistic photo
▪️ Click Generate, wait a few seconds, and it will show you 4 AI-generated images.
▪️ Download or Share
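If you write a lot of prompts, a tiny helper can keep them consistent. The sketch below only builds the text you would paste into Ideogram's prompt box; it does not call Ideogram in any way. The function name `build_image_prompt` and its parameters are made up for illustration, and wrapping the words you want rendered in quotes is an assumption based on the first example prompt above.

```python
def build_image_prompt(subject, quoted_text=None, details=()):
    """Assemble a text-to-image prompt string (hypothetical helper).

    subject     -- what the image shows, e.g. "A neon sign"
    quoted_text -- words that should appear inside the image (optional)
    details     -- extra descriptors such as lettering or background
    """
    prompt = subject.strip()
    if quoted_text:
        # Assumption: quoting the text signals it should be rendered literally.
        prompt += f' that says "{quoted_text.strip()}"'
    # Drop empty descriptors so we never emit stray commas.
    extras = [d.strip() for d in details if d.strip()]
    if extras:
        prompt += ", " + ", ".join(extras)
    return prompt


neon = build_image_prompt(
    "A neon sign", "Dream Big", ["cursive lettering", "night background"]
)
print(neon)
```

Leaving out `quoted_text` and `details` returns the subject unchanged, so plain prompts like the second example still work.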
AI & TECH NEWS
Everything else you need to know today

Perplexity AI Inc. is in discussions with Samsung Electronics Co. about integrating its assistant on the smartphone giant’s devices. Photographer: Yuki Iwamura/Bloomberg
Recallution: xAI's Grok just got a memory upgrade. It now remembers past conversations to deliver personalized responses. Think of it as your AI buddy that truly knows you. This feature, currently in beta, is available on Grok.com and mobile apps, excluding EU and UK users. You can manage or delete memories directly from the chat interface, giving you control over your data.
Phonvasion: Perplexity AI is negotiating with Samsung and has secured a deal with Motorola to integrate its assistant into their smartphones. This move positions Perplexity as a formidable competitor to established AI assistants, aiming to redefine mobile user experiences.
Codex: OpenAI is in talks to acquire Windsurf, an AI-powered coding tool, for $3 billion, potentially its largest acquisition yet. Formerly known as Codeium, Windsurf could bolster OpenAI's capabilities in AI-driven software development.
Ad Guard: Google has enhanced its ad safety measures with over 50 AI-driven updates to its large language models. These improvements led to the removal of 5.1 billion ads and the suspension of 39.2 million advertiser accounts in 2024. By combating impersonation scams and enforcing stricter policies, Google aims to protect consumers and maintain a trustworthy advertising ecosystem.
Fundraising
Carevolution: Chapter, a Medicare advisory startup co-founded by former presidential candidate Vivek Ramaswamy, has secured $75 million in funding, elevating its valuation to $1.5 billion. The company distinguishes itself by prioritizing seniors' needs over insurer profits, offering tailored Medicare plan recommendations.
FOOD FOR PRODUCTIVITY
5 AI Tools of the Day
🖌️ Ideogram.ai: Turns text prompts into stylish AI-generated images with embedded typography.
SPECIAL
When more isn’t always Smarter in AI

Image credit: VentureBeat with DALL-E 3
Token Trap: More tokens ≠ more intelligence. In a surprising new paper from Microsoft Research, researchers revealed that longer outputs during AI inference can sometimes mean worse reasoning, not better.
Yes, you read that right. We’ve long assumed that increasing a model’s output length (i.e., letting it generate more tokens) gives us richer thought, deeper logic, or “more complete” answers.
But their study shows:
Longer output doesn’t always mean deeper reasoning. Sometimes, it’s just the model looping, guessing, or stalling.
In other words, when AI starts talking too much... it may be thinking less, not more.
Why This Matters:
If you’re building AI-powered apps or automations, this has huge implications:
▪️ Longer ≠ Smarter
▪️ More tokens can lead to reasoning drift
▪️ Model confidence isn’t always reflected in verbosity
▪️ Trimming outputs might actually increase accuracy in decision-making tasks
This challenges the default “bigger, longer, smarter” mindset in AI product design.
The future isn’t just about scaling.
It’s about teaching models to know when to stop and say something meaningful.
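For builders who want a quick signal that an output is "looping" rather than reasoning, here is a minimal sketch. It is not the method from the Microsoft Research paper; it is a simple illustrative heuristic that measures how many word n-grams in a response are repeats. The function names and the 0.3 threshold are assumptions chosen for the example and would need tuning on real outputs.

```python
from collections import Counter


def repeated_ngram_ratio(text, n=3):
    """Return the fraction of word n-grams that repeat an earlier n-gram."""
    words = text.lower().split()
    if len(words) < n:
        return 0.0
    ngrams = [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]
    counts = Counter(ngrams)
    # Every occurrence beyond the first counts as a repeat.
    repeats = sum(c - 1 for c in counts.values())
    return repeats / len(ngrams)


def looks_like_looping(text, n=3, threshold=0.3):
    """Flag outputs whose repeat ratio reaches the (arbitrary) threshold."""
    return repeated_ngram_ratio(text, n) >= threshold


looped = "the answer is 4 the answer is 4 the answer is 4"
print(repeated_ngram_ratio(looped))
print(looks_like_looping(looped))
```

A check like this could run before you accept a long response, so you can retry with a tighter output limit instead of trusting the extra tokens.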
BOOST YOUR WORKFLOW
6 hidden Android features that can instantly boost your workflow

Alex Tai/SOPA Images/LightRocket via Getty Images
Android Alchemy: Your Android phone has hidden powers. Most people never use them. We all use Android daily, but did you know it hides tools that can make your life faster, safer, and more efficient?
6 underrated features you should explore right now:
📶 Share Wi-Fi with a QR Code
No more spelling out passwords. Just tap → generate QR → done.
🔋 Limit background processes
Go to Developer Options to set process limits. Your battery will thank you.
🎧 Enable sound amplifier
Perfect for noisy environments or accessibility support.
🎥 Screen pinning for guest mode
Lock a single app on your screen when handing your phone to someone else. Great for kids or coworkers.
📷 Translate text in real-time via Google Lens
Point. Scan. Translate - instantly.
⚙️ Developer options for power users
Want to fake GPS, change animation speed, or monitor GPU usage? Flip this switch and unlock full control.
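Curious what the Wi-Fi QR code from the first feature actually contains? Wi-Fi QR codes commonly encode a short `WIFI:` text string, a convention documented by the ZXing barcode project. The sketch below builds that string, assuming Android's share screen follows the same convention. The function names are hypothetical, and the code only produces the text payload; turning it into a scannable image would need a separate QR library.

```python
def escape_wifi_field(value):
    """Backslash-escape characters that are special in a WIFI: payload."""
    out = []
    for ch in value:
        if ch in '\\;,":':
            out.append("\\")
        out.append(ch)
    return "".join(out)


def wifi_qr_payload(ssid, password="", security="WPA", hidden=False):
    """Build the text encoded in a Wi-Fi QR code (WIFI: convention)."""
    if not password:
        # Open networks are marked with the "nopass" type.
        security = "nopass"
    parts = [f"T:{security}", f"S:{escape_wifi_field(ssid)}"]
    if password:
        parts.append(f"P:{escape_wifi_field(password)}")
    if hidden:
        parts.append("H:true")
    # Fields are separated by ";" and the payload ends with ";;".
    return "WIFI:" + ";".join(parts) + ";;"


print(wifi_qr_payload("HomeNet", "pa;ss"))
print(wifi_qr_payload("Cafe"))
```

The escaping step matters because a password containing `;` or `:` would otherwise be read as a field separator by the scanning phone.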
Why this matters:
Most people look for better apps. But sometimes, the upgrade is already in your phone.
These features don’t just improve productivity. They protect privacy, boost security, and make your phone feel smarter.
Android isn’t just customizable. It’s quietly one of the most powerful OS experiences, if you know where to look.
AI PROMPT OF THE DAY
Prompt for creating a resume for a Sales Executive
Prompt:
Create a professional and impactful resume for a Sales Executive role. Highlight key sections including a strong summary statement, sales achievements, quotas met/exceeded, client relationship management, and skills in CRM tools, negotiation, and lead generation. Format it for clarity and results, using metrics and action verbs to showcase performance. Make it ATS-friendly and tailored for B2B or B2C sales, as needed.
SOCIAL MEDIA TRENDS

🚀 Efficiency: These 10 AI tools are shaping how we work in 2025, streamlining tasks for creators, educators, and professionals alike.
⚖️ Responsibility: Quantum computing holds incredible power, but its future depends on ethical governance that ensures access, equity, and global collaboration.
🧠 Upgrade: OpenAI introduces its latest models, which combine tools like web, code, and vision, bringing us closer to truly autonomous, intelligent reasoning.
🧮 Breakthrough: Solving Olympiad-level math problems in record time, this AI model showcases just how fast and flexible next-gen reasoning can be.
🖼️ Smart Touch: Aura’s new Aspen frame blends elegant design with intelligent features, making memories look good and think smart.
Thanks for reading.
Until tomorrow!
Shinky & the Hanoomaan AI team