This tweet discusses a research paper exploring how effectively AI agents can find and utilize their skills independently. Senior engineers may find the insights valuable for understanding agent behavior and improving AI system design.
How well do agent skills actually work when agents must find and use them on their own?
Check out the latest work from our lab!
arxiv.org/abs/2604.04323
The tweet highlights the growth in downloads of six major AI agent frameworks, indicating a strong market trend towards AI agents. Senior engineers should note the increasing traction and potential for these frameworks in production systems.
developers already decided AI agents work. the download data is unanimous.
six major agent frameworks. all accelerating, zero declining.
- @LangChain at 8.2M weekly downloads, +3.5%.
- @OpenAI Agents at 965K, +11.8%.
the last time every framework in a category grew
Tags: AI agents, frameworks, downloads, market trends, infrastructure
Z.ai's GLM-5.1 is currently the top open-source model in Code Arena, outperforming several notable competitors. This ranking indicates the competitive landscape of AI models and may influence future development and adoption decisions.
With GLM-5.1, Z.ai maintains the top spot among open-source models in Code Arena, currently trailing the overall leader by just about 20 points while outperforming Claude Sonnet 4.6, Opus 4.5, GPT-5.4 High, and Gemini-3.1 Pro. Open-source models
Alibaba has released its Qwen 3.6+ model, achieving top scores on multiple benchmarks, including 61.6 on terminal-bench and 80.9 on multilingual agentic coding. This performance indicates a significant advancement in AI model capabilities that builders should monitor.
breaking.. alibaba mass dropped qwen 3.6-plus and it's embarrassing every frontier model right now
61.6 on terminal-bench (beats claude 4.5 opus)
56.6 on swe-bench pro (1st place)
80.9 on multilingual agentic coding (1st place)
58.7 on claw-eval real world agent (1st place)
A new version of the Huihui-gemma model shows improved perplexity metrics compared to its original, indicating potential quality enhancements. This release may interest engineers looking for better-performing models in their AI systems.
An absolutely unexpected result: tested with llama-perplexity, the ablated version actually has a lower PPL than the original model.
The smaller the PPL value, the higher the model quality.
We will upload the Huihui-gemma-4-31B-it-abliteratedv2 version, with fewer warnings and
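For context on the metric the tweet relies on, perplexity is just the exponential of the mean per-token negative log-likelihood, which is why lower values indicate a better model. A minimal sketch, assuming a hypothetical list of per-token NLLs in nats rather than the actual llama-perplexity output format:

```python
import math

def perplexity(nlls):
    """Perplexity = exp(mean per-token negative log-likelihood).

    Lower PPL means the model assigns higher probability to the
    held-out text, i.e. higher quality. `nlls` here is an
    illustrative list of per-token NLLs in nats, not the raw
    output of the llama-perplexity tool.
    """
    return math.exp(sum(nlls) / len(nlls))

# A model that is more confident about the same text scores lower:
print(perplexity([2.0, 2.0, 2.0]))  # exp(2.0) ≈ 7.39
print(perplexity([1.5, 1.5, 1.5]))  # exp(1.5) ≈ 4.48
```

This is why an abliterated model posting a lower PPL than the original is surprising: ablation usually degrades likelihood rather than improving it.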
GPT-5.4 has set a new top-1 entry on PostTrainBench, improving performance from 20.2% to 28.2% using a simple reprompting technique. This indicates a significant advancement in model performance that could influence future AI development strategies.
New top-1 entry on PostTrainBench: GPT-5.4 with a simple reprompting loop ("You still have
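The quoted prompt is cut off in the tweet, but a reprompting loop of this kind is simple to sketch: feed the model its own answer back with a nudge to keep improving. A minimal sketch, assuming a hypothetical `query_model(prompt) -> str` API and an illustrative nudge string (not the actual PostTrainBench prompt or harness):

```python
def reprompt_loop(query_model, task_prompt, max_rounds=5):
    """Toy reprompting loop: repeatedly show the model its own
    previous attempt and ask it to improve. `query_model` is a
    hypothetical callable standing in for a real model API, and
    the nudge text below is illustrative only.
    """
    answer = query_model(task_prompt)
    for _ in range(max_rounds - 1):
        followup = (
            task_prompt
            + "\n\nYour previous attempt:\n" + answer
            + "\n\nPlease improve on it."
        )
        answer = query_model(followup)
    return answer
```

The appeal of the technique is that it needs no training: the 20.2% to 28.2% jump reported here comes purely from spending more inference rounds on each task.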
The Memory Intelligence Agent (MIA) proposes a new architecture that enhances 7B models to outperform GPT-5.4 through a Manager-Planner-Executor framework with continual learning. This could be of interest to engineers looking for novel strategies in AI model development.
MIA: Memory Intelligence Agent
Evolves deep research agents from passive record-keepers into active strategists, enabling 7B models to outperform GPT-5.4 via a Manager-Planner-Executor architecture with continual test-time learning.
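The Manager-Planner-Executor split described above can be illustrated with a toy loop; everything below is an illustrative stand-in for the roles named in the tweet, not the paper's implementation:

```python
class ManagerPlannerExecutor:
    """Toy sketch of a Manager-Planner-Executor agent loop with a
    simple memory store standing in for continual test-time
    learning. All method bodies are illustrative placeholders.
    """

    def __init__(self):
        self.memory = []  # outcomes kept across tasks

    def plan(self, goal):
        # Planner: decompose the goal into steps (trivial split here;
        # a real planner would call a model).
        return [s.strip() for s in goal.split(";") if s.strip()]

    def execute(self, step):
        # Executor: carry out one step and record the outcome so the
        # agent can act as a strategist rather than a record-keeper.
        result = f"done: {step}"
        self.memory.append(result)
        return result

    def manage(self, goal):
        # Manager: orchestrate planning and execution end to end.
        return [self.execute(step) for step in self.plan(goal)]

agent = ManagerPlannerExecutor()
print(agent.manage("search literature; summarize findings"))
# ['done: search literature', 'done: summarize findings']
```

The claim is that this division of labor, plus learning from memory at test time, is what lets a 7B model compete with a much larger one.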