AI Daily — 2026-05-31

English 中文

OpenAI launches Rosalind Biodefense for vetted developers and government partners · OpenAI Roboti...

Covering 19 AI news items

🔥 Top Stories

1. OpenAI launches Rosalind Biodefense for vetted developers and government partners

OpenAI expands trusted access to GPT-Rosalind for vetted developers and U.S. government partners to advance biodefense, public health, and pandemic preparedness. The initiative aims to strengthen societal resilience by enabling AI-powered defense and public health tools for approved users. Source-x

2. OpenAI Robotics advances; hiring engineers to build physical-world AI

OpenAI says its robotics program is evolving into OpenAI Robotics with rapid progress toward AI that can operate in the physical world. The team is hiring full-stack hardware, ops, systems, and ML engineers to help program and manufacture robots that assist society, initially supporting skilled workers and later enabling personal robots. Led by Aditya Ramesh, the program emphasizes co-design of hardware and ML research, and invites applicants to apply by emailing their background and accomplishments. Source-x

3. Parakeet STT Ported to ggml, Matches NeMo Output, Faster

Ported NVIDIA’s Parakeet speech-to-text models to pure C++/ggml, enabling CPU and GPU execution with no Python or PyTorch. Output is byte-for-byte identical to NeMo, with WER 0 on f32/f16 paths, and it runs significantly faster (up to ~5x on GPU for large models, ~1.86x on CPU when quantized) while using less memory. It includes quantized GGUF variants (f16, q8_0, q6_k, q5_k, q4_k), cache-aware streaming, real-time end-of-utterance, word-level timestamps with confidence, and a small C API for embedding. Source-reddit

📰 Featured

Open Source & LLM Development

Open-Source LLM Training From Scratch on GitHub — A GitHub project demonstrates training a transformer-based LLM from scratch using PyTorch on a single GPU. This could lower barriers for researchers to experiment with small-to-medium models, though true large-scale training remains resource-intensive. Source-github
Llama Studio v0.2.0 Adds Per-Model Scripts, Multi-GPU, Autoload — Llama Studio v0.2.0 updates the llama-server WebUI by switching to per-model shell scripts and adds multi-GPU support, session storage, and autoload on startup for headless servers. Source-reddit

AI Safety & Evaluation

13 Abliterated Gemma-4 E2B Variants Benchmarked in 44 GPU Hours — Abliterlitics evaluated 13 abliterated Gemma-4 E2B variants across 9 creators in 44 GPU hours; coder3101’s variant achieved 96% ASR with preserved capabilities, while treadon reaches 100% ASR but loses 3 points on GSM8K, casting doubt on claims of preserved capabilities across variants. Full dataset, graphs, and logs are on HuggingFace. Source-reddit

AI Ethics & Research / IP

Three Turing Laureates Republish Key AI Methods, Credit Disputes Emerge — Three Turing Award laureates republish influential AI methods without credit to original creators, highlighting ongoing attribution disputes tracked by IDSIA’s AI priority page. Source-x

Tools & Agentic AI

Codex Controls Browser in Real-Time, Viscerally Compelling — A demonstration shows Codex controlling a browser to perform tasks beyond its harness, described as a holy shit moment with notes about HLS playback and browser automation. Source-x

LLM Models & Industry

Claude trial vs Codex XHigh; 5.5 remains superior — An AI enthusiast reports trying Claude for a few days and returning to Codex XHigh, arguing that 5.5 remains significantly better than Claude and Codex XHigh. Source-x

Open Source & LLM Tools

Open-Source LLM Training From Scratch on GitHub — (Already listed under Open Source & LLM Development) Additional emphasis on community-driven learning and reproducibility. Source-github

Hardware & WebUI

Llama Studio v0.2.0 Adds Per-Model Scripts, Multi-GPU, Autoload — (Already listed) Noting continued tooling improvements for multi-GPU workflows. Source-reddit

Platform Updates & Open Access

Millions of users cheer as limits reset tomorrow — A social post celebrates upcoming usage-limit resets, comparing model capabilities and signaling platform dynamics in practice. Source-x

Models, Code & Community

Claude trial vs Codex XHigh; 5.5 remains superior — (Also categorized under LLM Models & Industry) Highlighting ongoing community debates about model performance and alignment trends. Source-x

⚡ Quick Bites

What happens when LLaMA spills from VRAM to system RAM — Explains memory paging and performance trade-offs when LLaMA data spills from VRAM to system RAM. Source-reddit
Home data center powers nine GPUs for ML experiments — Describes a home setup powering nine GPUs for ML experiments and practical trade-offs. Source-reddit
APEX-MTP GGUF release for Qwen3.6-35B-A3B-Claude — GGUF weight release enabling faster inference across models. Source-reddit
Dell XPS with NVIDIA N1X Confirmed at Computex — Dell confirms a laptop lineup featuring NVIDIA N1X at Computex. Source-reddit
GPT Realtime 2 Lets You Control Your PC With Voice — Adds voice-based PC control capabilities via GPT Realtime 2. Source-x
Hermes Agent Ships 100+ Pre-Enabled Skills — Hermes Agent ships with 100+ pre-enabled skills for agents. Source-x
G7 agrees on shared language around open-source AI and open weights AI — G7 seeks a common framework for discussing open-source AI and open weights. Source-reddit
Opinions Sought on Quantizing KV Cache for Qwen3.6b-27b — Community input requested on KV cache quantization for various Qwen models. Source-reddit
Semantic Step Prediction in LLM Reasoning Trajectories — Examines semantic step prediction in multi-step LLM reasoning paths. Source-reddit

Generated by AI News Agent | 2026-05-31