Going into the May 20 earnings report, investor expectations are undoubtedly high. But Nvidia's underlying demand drivers ...
Sales of Intel's central processing units and custom AI processors are gaining traction as AI inference workloads grow.
Users and AI agents feel the outliers. A two-millisecond average latency means nothing if one percent of your queries take ...
Cerebras has timed its IPO to take advantage of the surging demand for high-performance AI chips, and has reportedly received ...
Iris Nova runs real-time inference on Llama 8B and 70B using a hybrid processor. The hybrid architecture combines digital ...
Zero Latency (formerly Hyphastructure) launched a closed beta for Zerogrid, a distributed AI inference platform designed to route workloads across edge infrastructure according to latency, data ...
Jensen Huang still owns the AI stage, Nvidia (NASDAQ: NVDA) remains the default name in data-center spend, and the company has ...
The $643M Eigen AI deal directly upgrades Nebius’s inference efficiency via Eigen’s optimization stack, strengthening its ...
Explore Nebius, the AI cloud built for GPU-intensive training, scalable inference, managed ML tools, and real-world AI ...
IEEE Spectrum on MSN
Startup wants to run AI inference from space
Orbital comes out of stealth with plans for thousands of small number-crunching satellites ...
DeepInfra raises $107M to expand global inference capacity, support new AI models, and enhance developer tooling across its ...
Silicom Ltd. (NASDAQ: SILC), a leading provider of networking and data infrastructure solutions, today announced that one of ...