Stanford research reveals that leading AI models like GPT-5 and Google Gemini maintain high accuracy on vision benchmarks even with the images removed, highlighting a significant flaw in how AI vision is evaluated. The finding could prompt engineers to reassess model reliability in real-world applications.
Holy shit… Stanford University just exposed a massive flaw in AI vision.
GPT-5, Google Gemini, and Claude scored 70–80% accuracy… with no images at all.
They call it the "mirage effect" →
→ Researchers removed images from 6 major benchmarks
→ Models kept answering like…
👁 932 views · ❤ 10 · 🔁 6 · 💬 3 · 🔖 2 · 2.0% eng
AI research · vision systems · Stanford · GPT-5 · Google Gemini
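The test behind the headline is easy to sketch: score the same benchmark twice, with and without its images, and compare. A minimal Python outline, assuming a hypothetical `ask_model` stub in place of any specific vendor API:

```python
# "Blind" ablation sketch: run a visual QA benchmark with and without
# images. `ask_model` is a hypothetical stub for a chat-completion call;
# benchmark rows are (image, question, choices, answer) tuples.

def ask_model(question: str, choices: list[str], image=None) -> str:
    """Hypothetical model call; returns one of `choices`."""
    raise NotImplementedError("wire up a real model API here")

def accuracy(benchmark, use_images: bool) -> float:
    correct = 0
    for image, question, choices, answer in benchmark:
        pred = ask_model(question, choices, image=image if use_images else None)
        correct += (pred == answer)
    return correct / len(benchmark)

def mirage_gap(benchmark) -> float:
    """Difference between sighted and blind accuracy."""
    return accuracy(benchmark, use_images=True) - accuracy(benchmark, use_images=False)
```

If blind accuracy sits far above the benchmark's guess rate while the gap stays small, the score is measuring language priors rather than vision.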
This tweet discusses a novel soft robot design that utilizes heat-responsive materials and embedded electronics for movement without traditional mechanical systems. Senior engineers may find the innovative approach to robotics and materials science relevant for future applications in AI and automation.
A new origami-inspired soft robot uses heat-responsive materials and embedded electronics to move, fold, and reshape itself, without motors, pumps, or bulky mechanical systems.
@Princeton
ConvApparel is a new dataset aimed at improving LLM-based user simulators by quantifying the 'realism gap.' This could be relevant for engineers focused on enhancing conversational agent training methodologies.
Introducing ConvApparel, a new human-AI conversation dataset, as well as a comprehensive evaluation framework designed to quantify the "realism gap" in LLM-based user simulators and improve the training of robust conversational agents.
Read all about it →
goo.gle/41k5eff
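As a rough illustration of what quantifying a "realism gap" can look like (a stand-in, not the paper's actual framework), one can compare surface statistics of human versus simulated user turns; a faithful simulator should make the two distributions hard to tell apart:

```python
# Toy realism-gap metric: compare surface statistics of human user turns
# vs. LLM-simulated user turns. Illustrative only -- the ConvApparel
# evaluation framework is certainly more sophisticated than this.

from statistics import mean

def turn_stats(turns: list[str]) -> dict[str, float]:
    words = [t.split() for t in turns]
    return {
        "avg_turn_len": mean(len(w) for w in words),
        "type_token_ratio": len({w for ws in words for w in ws})
                            / max(1, sum(len(ws) for ws in words)),
    }

def realism_gap(human_turns: list[str], simulated_turns: list[str]) -> float:
    """Mean relative difference across tracked statistics; 0.0 = identical."""
    h, s = turn_stats(human_turns), turn_stats(simulated_turns)
    return mean(abs(h[k] - s[k]) / max(h[k], 1e-9) for k in h)

human = ["any rain jackets?", "under $100 please"]
simulated = ["I am seeking a waterproof outerwear garment.",
             "My budget constraint is one hundred dollars."]
print(realism_gap(human, simulated))  # larger gap = less realistic simulator
```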
This tweet discusses a research paper exploring how effectively AI agents can find and utilize their skills independently. Senior engineers may find the insights valuable for understanding agent behavior and improving AI system design.
How well do agent skills actually work when agents must find and use them on their own?
Check out the latest work from our lab!
arxiv.org/abs/2604.04323
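The question decomposes into two failure modes: can the agent retrieve the right skill, and can it then apply it correctly? A toy sketch of the retrieval half, with keyword overlap standing in for the embedding search real agent frameworks use:

```python
# Toy skill-retrieval sketch: given a task, pick a skill from a library
# by matching descriptions. Real agents typically embed descriptions and
# search by cosine similarity; keyword overlap stands in here.

SKILLS = {
    "csv_summary": "load a csv file and report column statistics",
    "web_fetch": "download a web page and extract its main text",
    "unit_convert": "convert quantities between measurement units",
}

def select_skill(task: str) -> str:
    task_words = set(task.lower().split())
    return max(SKILLS, key=lambda name: len(task_words & set(SKILLS[name].split())))

# Retrieval and use are separate failures: the evaluation must also check
# whether the selected skill, once invoked, actually completes the task.
print(select_skill("load the csv file and report statistics for sales.csv"))
```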
The Memory Intelligence Agent (MIA) proposes an architecture that lets 7B models outperform GPT-5.4 via a Manager-Planner-Executor framework with continual test-time learning. This could be of interest to engineers looking for novel strategies in AI agent design.
MIA: Memory Intelligence Agent
Evolves deep research agents from passive record-keepers into active strategists, enabling 7B models to outperform GPT-5.4 via a Manager-Planner-Executor architecture with continual test-time learning.
👁 1,897 views · ❤ 43 · 🔁 15 · 💬 2 · 🔖 19 · 3.2% eng
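Reading the architecture off the description (the real implementation is surely richer), a Manager-Planner-Executor loop with test-time memory might be shaped like the sketch below, where `llm` is a hypothetical stub for per-role model calls:

```python
# Hedged sketch of a Manager-Planner-Executor loop with continual
# test-time learning, inferred from the tweet -- not MIA's actual code.

def llm(role: str, prompt: str) -> str:
    """Hypothetical per-role model call."""
    raise NotImplementedError

def research(question: str, memory: list[str], max_rounds: int = 5) -> str:
    for _ in range(max_rounds):
        # Manager: pick a strategy using lessons accumulated so far.
        strategy = llm("manager", f"Question: {question}\nLessons: {memory}")
        # Planner: expand the strategy into concrete steps.
        plan = llm("planner", strategy)
        # Executor: carry out the plan (search, read, compute).
        result = llm("executor", plan)
        # Continual test-time learning: distill what worked into memory,
        # turning a passive record-keeper into an active strategist.
        memory.append(llm("manager", f"Distill a lesson from: {result}"))
        if result.startswith("FINAL:"):
            return result
    return "no answer within budget"
```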
A researcher has built a tool that identifies Google's SynthID watermark in Gemini-generated images with 90% accuracy and strips it while preserving image quality. This finding could have implications for the robustness of watermarking techniques for AI-generated content.
One researcher beat Google's watermark with a math trick.
So Google puts an invisible watermark in every image Gemini generates.
They call it SynthID.
And this researcher figured out exactly how it works and built a tool to remove it.
90% detection accuracy. 43+ dB image
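The "43+ dB" figure is almost certainly PSNR, the standard measure of how little an edit perturbs an image. The removal technique itself is not described in the tweet, so only the quality metric is worked out here:

```python
# PSNR in dB between the watermarked original and the cleaned output.
# Only the metric is sketched; the removal method is not reconstructed.

import numpy as np

def psnr(original: np.ndarray, cleaned: np.ndarray, peak: float = 255.0) -> float:
    """10 * log10(peak^2 / MSE); higher means a less visible change."""
    mse = np.mean((original.astype(np.float64) - cleaned.astype(np.float64)) ** 2)
    return float("inf") if mse == 0 else 10.0 * np.log10(peak**2 / mse)

# At 43 dB on 8-bit pixels, MSE ~= 255**2 / 10**4.3 ~= 3.3 -- under two
# gray levels of RMS difference, i.e. visually indistinguishable.
```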
Announcement of a research presentation on AI's role in security, specifically focusing on a project called 'HTTP Terminator.' Senior engineers may find the insights relevant for understanding AI's application in security contexts.
I'm thrilled to announce "Can AI Do Novel Security Research? Meet the HTTP Terminator" will premiere at
@BlackHatEvents
#BHUSA! Check out the abstract:
👁 8,260 views · ❤ 181 · 🔁 32 · 💬 8 · 🔖 55 · 2.7% eng
This tweet discusses a new method presented at NLP2026 for resolving notation variations in medical department names using an LLM, achieving a high accuracy rate. Senior engineers may find the approach and results relevant for improving NLP applications in healthcare.
Published a new article on the KAKEHASHI Tech Blog.
We presented at NLP2026 a method that resolves "notation variations" in medical department names using an LLM, achieving a 97.5% accuracy rate with GPT-5. Please take a look.
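The task shape, as far as the tweet reveals it, is mapping free-form name variants onto a canonical list. A minimal sketch assuming a hypothetical `complete` LLM call and an invented English canonical list (the blog post has the actual prompt and pipeline):

```python
# Minimal LLM-based notation normalization sketch. `complete` and the
# canonical list are assumptions for illustration, not the paper's setup.

CANONICAL = ["Internal Medicine", "Pediatrics", "Orthopedic Surgery", "Dermatology"]

def complete(prompt: str) -> str:
    """Hypothetical LLM call."""
    raise NotImplementedError("call your LLM of choice here")

def normalize_department(raw_name: str) -> str:
    prompt = (
        "Map this department name to exactly one entry from the list and "
        f"output only that entry.\nList: {CANONICAL}\nName: {raw_name}"
    )
    answer = complete(prompt).strip()
    # Guard against off-list outputs so accuracy can be scored against a
    # labeled gold set (how a figure like 97.5% gets measured).
    return answer if answer in CANONICAL else raw_name

# e.g. "ortho", "orthopedics", and "整形外科" should all normalize to
# "Orthopedic Surgery".
```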
Anthropic's new research explores using a weak AI model to supervise the training of a stronger one, potentially accelerating alignment research. This could have implications for how AI systems are developed and aligned in the future.
New Anthropic Fellows research: developing an Automated Alignment Researcher.
We ran an experiment to learn whether Claude Opus 4.6 could accelerate research on a key alignment problem: using a weak AI model to supervise the training of a stronger one.
👁 11,980 views · ❤ 252 · 🔁 47 · 💬 21 · 🔖 88 · 2.7% eng
AI alignment · research · Anthropic · Claude Opus · machine learning
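The weak-to-strong setup itself (the problem Claude was asked to accelerate, not Anthropic's code) fits in a few lines: a weak model labels data, a stronger model trains on those imperfect labels, and both are scored against ground truth. The open question is when the student generalizes past its supervisor rather than imitating it:

```python
# Generic weak-to-strong supervision sketch: the "weak" supervisor sees
# only 2 of 20 features; the "strong" student sees all 20 but trains on
# the supervisor's imperfect labels. Both are scored against ground truth.

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(3000, 20))
y = (X[:, :5].sum(axis=1) > 0).astype(int)  # ground-truth concept
X_train, y_train, X_test, y_test = X[:2000], y[:2000], X[2000:], y[2000:]

weak = LogisticRegression().fit(X_train[:, :2], y_train)
strong = RandomForestClassifier(random_state=0).fit(
    X_train, weak.predict(X_train[:, :2]))  # trained on weak labels only

print("weak supervisor:", weak.score(X_test[:, :2], y_test))
print("strong student: ", strong.score(X_test, y_test))
```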