Mathematical Optimization Problems

23h

OpenAI engineers cut ChatGPT guest traffic to a few hundred Nvidia GPUs, with no new hardware deployed.

An OpenAI inference cost reduction cut the hardware serving ChatGPT guest traffic from tens of thousands of Nvidia GPUs to just a couple hundred, using software optimization alone. Engineers achieved more than 50% savings ...

Tech Times

OpenAI Halves Inference Costs With Software Alone: GPUs Drop to Hundreds

An OpenAI inference cost reduction cut the hardware serving ChatGPT guest traffic from tens of thousands of Nvidia GPUs to just a couple hundred, ...

Hub

The MVPs of data analytics

The Sports Analytics Research Group employs quantitative analysis to give teams the hard numbers they need to perform better ...

2d on MSN

Quantum computing is about to get a lot more real

A surge of funding and federal action is giving the once-futuristic technology a more immediate role in everything from ...

Why Pure Agentic AI Fails In Enterprise Settings And What Works Instead

If your agentic AI project is failing, your problem is likely that you treated the integration work as somebody else's issue ...

Aalto University

Doctoral Researcher in AI and Quantum-Inspired Optimization for Sustainable Energy Systems

Are you passionate about developing AI-based and quantum-inspired solutions for the next generation of sustainable energy systems? We are now looking for a fully funded Doctoral Researcher to work on ...

How does an On-device AI work?

Curious about how on-device AI works? Here is an explanation of how it works and what you can take from it.

I’ve used AI as a brain crutch, and that might be a problem

What I remember is the reflex—an almost-automatic pivot to an external brain to help me locate my own train of thought. I ...

Fable 5 Breach Leaks Cryptic AI Chain of Thought Shorthand

Fable 5's chain of thought has leaked, showing math-like shorthand, while its three-layer defense classifiers block most jailbreak attempts.

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

DSpark can make decoding faster, but acceptance quality still determines how much speed the system actually realizes.

Meituan open sources LongCat-2.0, the 1.6T, near-frontier agentic coding model that's been leading OpenRouter — trained entirely on Chinese chips

By registering the LongCat-2.0 repository under the open-source MIT License, Meituan positions the architecture with maximum ...

Reliability Determines Success in the Biotech Industry

Biotech companies are spending billions to perfect chemical formulas, only to be tripped up by the one variable they forgot ...
