Daily Digest | New Horizon

01Microsoft OpenAI Partnership Update (2 minute read) [TLDR AI — 2026-04-28 (latest available)]OpenAI and Microsoft revised their agreement to increase flexibility, including non-exclusive IP licensing, multi-cloud support for OpenAI products, and capped revenue-sharing terms through 2030. → source
02OpenAI Misses Key Revenue, User Targets in High-Stakes Sprint Toward IPO (6 minute read) [TLDR AI — 2026-04-28 (latest available)]OpenAI missed its own targets for new users and revenue, raising concern among company leaders about whether it will be able to support its massive spending on data centers. The company's Chief Financial Officer has said that she is worried that OpenAI may not be able to pay for → source
03OpenAI Smartphone Rumors (3 minute read) [TLDR AI — 2026-04-28 (latest available)]Analyst Ming-Chi Kuo reported that OpenAI explored building a smartphone with partners like MediaTek and Qualcomm, potentially replacing app-centric interfaces with AI agents and hybrid on-device/cloud models. → source
04China Blocks Meta Manus Acquisition (2 minute read) [TLDR AI — 2026-04-28 (latest available)]China halted Meta's $2B acquisition of agentic AI startup Manus, ordering the deal unwound amid regulatory scrutiny, complicating Meta's push into AI agents and cross-border expansion. → source
05To Train or Not to Train (10 minute read) [TLDR AI — 2026-04-28 (latest available)]The companies integrating down into the model layer are doing it because, at their scale, the economics and differentiation arguments work out. Almost all of them are doing post-training, not pre-training from scratch. Companies should start collecting data and build small, speci → source
06Batch API is terrible for one agent. It might be great for a fleet (6 minute read) [TLDR AI — 2026-04-28 (latest available)]Batch API offers a 50% discount but adds latency, making it less suitable for single-agent use. For fleets of agents where multiple requests can be pooled, the batching approach becomes economically viable. Optimal usage involves routing slower, costlier models through batches, w → source
07GPT 5.5: The System Card (20 minute read) [TLDR AI — 2026-04-28 (latest available)]GPT-5.5 is a solid improvement and is competitive with Claude Opus. It seems to be better for factual queries, web searches, and straightforward, well-specified requests, while Claude Opus excels in more open-ended or interpretive purposes. The model is unlikely to pose new big r → source
08Stop stitching databases for AI agents (Sponsor) [TLDR AI — 2026-04-28 (latest available)]Oracle AI Database acts as a unified memory core for agents . Vector search, relational, JSON, and graph data live together so agents can reason over live enterprise data without extra vector stores, pipelines, or synchronization jobs. See how developers build agent memory → → source
09Codex Symphony Agent Orchestration (28 minute read) [TLDR AI — 2026-04-28 (latest available)]OpenAI's Symphony is an open-source specification that turns issue trackers into control planes for coding agents, reducing context switching and increasing pull request throughput by up to 5x. → source
10Amazon's Risk Evaluation Framework (18 minute read) [TLDR AI — 2026-04-28 (latest available)]Amazon researchers introduced ESRRSim, an agentic evaluation framework with a structured taxonomy to benchmark risks like deception and reward hacking, revealing wide variation in model behavior across 11 LLMs. → source
11Compressing AI vectors to 2–4 bits per number without losing accuracy (54 minute read) [TLDR AI — 2026-04-28 (latest available)]TurboQuant compresses each coordinate in large tables of high-dimensional vectors to 2-4 bits with provably near-optimal distortion, no memory overhead for scale factors, and no training or calibration. It is between four and six orders of magnitude faster than the alternatives a → source
12Recursive Language Models, clearly explained (3 minute read) [TLDR AI — 2026-04-28 (latest available)]MIT researchers have introduced Recursive Language Models (RLMs) to solve "context rot," a phenomenon where large language models experience reasoning degradation when processing massive context windows, even if they excel at basic retrieval tasks. Instead of forcing a model to i → source
13DeepSeek V4 Preview (April 24, 2026) [AI Models & Research]DeepSeek released a trillion-parameter Mixture-of-Experts model in two variants—V4-Pro (1.6T params, 49B active) and V4-Flash (284B params, 13B active)—priced at $0.14 and $1.74 per million tokens respectively. Independent benchmarks place V4-Pro within 7-8 points of Claude Opus → source
14Google TurboQuant (ICLR 2026, April 2026) [AI Models & Research]Google's new KV cache quantization technique achieves extreme compression to ~3 bits with zero accuracy loss, delivering 6x less memory usage and up to 8x faster inference speeds. The open-source PyTorch implementation and vLLM plugin enable drop-in deployment across existing inf → source
15Meta Llama 4 Scout & Maverick (April 2026) [AI Models & Research]Meta released two new models featuring alternating dense and MoE layers—Scout (109B total, 17B active) with 10M token context and Maverick (400B total, 17B active) with 1M token context. Both models include native multimodal capabilities and use the Llama License with a 700M MAU → source

Issue · 2026-04-29

Get the digest delivered