LLM Inference Optimization

LLM Data Mixture Breaks When Training Pools Shift: Causal Inference Offers Fix

LLM training data mixture optimization breaks when training pools shift — every prior proxy experiment becomes stale.

OpenAI and Broadcom unveil 'Jalapeño' Intelligence Processor for LLM inference

"A blank-slate design for modern LLM inference, not a general-purpose accelerator adapted from earlier AI workloads" ...

VentureBeat

New LLM optimization technique slashes memory costs up to 75%

Researchers at the Tokyo-based startup Sakana AI have developed a new technique that enables language models to use memory more efficiently, helping enterprises cut the costs of building applications ...

From ChatGPT to Chips: OpenAI Unveils Jalapeño to Power Faster LLMs and More Affordable AI

OpenAI and Broadcom unveiled Jalapeño, a custom AI inference chip designed for LLMs, promising higher efficiency, lower costs ...

OpenAI, Broadcom (AVGO) Unveil “Jalapeño” AI Accelerator for Enhanced LLM Inference

Broadcom Inc. (NASDAQ:AVGO) is one of the best stocks for beginners to buy now. On June 24, OpenAI and Broadcom introduced ...

Business Wire

MangoBoost Launches Mango LLMBoost™: AI Inference Optimization Software with Up to 12.6x Relative Performance Improvement and 92% Cost Savings

BELLEVUE, Wash.--(BUSINESS WIRE)--MangoBoost, a provider of cutting-edge system solutions designed to maximize AI data center efficiency, is announcing the launch of Mango LLMBoost™, system ...

10d

Show inaccessible results

LLM Data Mixture Breaks When Training Pools Shift: Causal Inference Offers Fix

OpenAI and Broadcom unveil 'Jalapeño' Intelligence Processor for LLM inference

New LLM optimization technique slashes memory costs up to 75%

From ChatGPT to Chips: OpenAI Unveils Jalapeño to Power Faster LLMs and More Affordable AI

OpenAI, Broadcom (AVGO) Unveil “Jalapeño” AI Accelerator for Enhanced LLM Inference

MangoBoost Launches Mango LLMBoost™: AI Inference Optimization Software with Up to 12.6x Relative Performance Improvement and 92% Cost Savings

OpenAI and Broadcom Unveil LLM-Optimized Intelligence Processor

Senior LLM Inference Engineer

AI Inference and World Model Startups Pull $1.8B in Two Days as Foundation Models Commoditize

OpenAI and Broadcom unveil Jalapeño Intelligence Processor for LLM workloads