Inference Models - Search News

29m

China debuts biggest AI model trained on local chips, as Meituan releases LongCat-2.0

LongCat-2.0 boasts 1.6 trillion parameters and a million-token context window, on par with DeepSeek’s latest flagship model.

2h

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

DSpark can make decoding faster, but acceptance quality still determines how much speed the system actually realizes.

5d

OpenAI unveils first custom AI inference chip, Jalapeño, with Broadcom — and its development was sped-up with OpenAI's own models

The companies attributed this speed to a deep software-hardware co-development process that actively used OpenAI’s own models ...

AI Inference and World Model Startups Pull $1.8B in Two Days as Foundation Models Commoditize

AI inference infrastructure investment pulled $1.8 billion in 48 hours as Baseten’s $1.5B round at a $13B valuation and ...

1don MSN

Faster AI, lower costs: DSpark eases inference bottlenecks and chip strain, says DeepSeek

Start-up unveils speculative decoding framework that speeds up inference by up to 85 per cent amid China's push to overcome ...

Barchart on MSN

Broadcom just quietly became OpenAI’s preferred choice for AI inference. Here’s what that means for AVGO stock.

In the second half of last year, OpenAI and Broadcom (AVGO) announced a deal for 10 gigawatts worth of compute capacity. Just ...

5d

This Artificial Intelligence (AI) Chip Stock Is Dominating the Inference Era. It Could Be the Biggest Winner of This Megatrend (Hint: It's Not AMD or Broadcom)

Demand for AI inference compute workloads is increasing rapidly, and Nvidia is dominating the market despite competition from ...

3d

The Most Expensive Part Of AI Might Not Be The Model

This matters because AI usage is growing fast. Goldman Sachs estimated that global AI infrastructure spending could reach ...

4d

What Is a Reasoning Model? The AI Breakthrough That Taught Machines to “Think”

In September 2024, OpenAI previewed a model that behaved differently from the AI systems most people had grown accustomed to.

Active inference as a model of collision avoidance behavior in human drivers

Collision avoidance – involving a rapid threat detection and quick execution of the appropriate evasive maneuver – is a critical aspect of driving. However, existing models of human collision ...

Active inference and the two-step task

Sequential decision problems distill important challenges frequently faced by humans. Through repeated interactions with an uncertain world, unknown statistics need to be learned while balancing ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results