Articles by Rahul
2 storiesAnthropic's Latest Frontier Model Outperforms Human Experts on Complex Reasoning Benchmarks
The company's new model scores 94.7% on the GPQA Diamond benchmark — a suite designed to defeat AI systems — marking the first time a model has exceeded the median performance of PhD-level domain experts.
rahul-subramaniam
1 day ago·8 min read
AIBreaking
Meta Releases Llama 4 Ultra: The Open-Weight Model That Outperforms GPT-4.5 on Five Key Benchmarks
Meta's latest open-weight release achieves state-of-the-art performance on coding, mathematical reasoning, multilingual understanding, and scientific problem-solving, while being available for commercial deployment under a modified community license.
rahul-subramaniam
5 days ago·6 min read
Free Daily Briefing
The Signal Brief
The most important stories, every morning.
Join 140,000+ professionals who start their day with Credence Wire.
Free. No spam. Unsubscribe anytime.