News tagged AI inference at DIGITIMES

NEWS TAGGED AI INFERENCE

Monday 27 July 2026

Rebellions readies Rebel100 shipments as AI inference demand grows

South Korean AI chip designer Rebellions plans to begin shipping its next-generation Rebel100 accelerator in the second half of 2026, betting that wider use of AI agents and commercial...

Friday 24 July 2026

IEI pairs NANO-X100 with Ryzen AI for edge AI inference

Industrial PC maker IEI Integration joined Castec International at AMD Advancing AI 2026 in San Francisco on July 21 and 22 to showcase the new NANO-X100 4-inch single-board computer...

Wednesday 22 July 2026

Intel cuts data center jobs again despite rising AI server demand

Intel has confirmed a fresh round of job cuts in its Data Center Group (DCG), although it did not disclose the number of positions...

Wednesday 22 July 2026

Memory shortage to worsen in 2027 as AI inference strains supply, Silicon Motion CEO says

The global memory shortage will worsen in 2027 and may not begin to ease until the second half of 2028, with prices likely to remain elevated for the next two to three years, according...

Thursday 16 July 2026

China's TPU path gains traction with low-cost AI inference challenging GPU economics

Generative AI applications are expanding rapidly, making computing costs a growing bottleneck to commercial AI deployment. The AI accelerator...

Tuesday 14 July 2026

Rambus unveils DDR5 9600 chipset for next-generation AI servers

Rambus has introduced a new DDR5 9600 server RDIMM chipset aimed at faster, denser data center memory systems. The move matters beyond one supplier because higher bandwidth and better...

Thursday 9 July 2026

Nvidia expands alliance strategy as AI chip ecosystem shifts toward collaboration

Nvidia is increasingly embracing partnerships with emerging AI chip developers, signaling a broader shift from competing solely on hardware to enabling heterogeneous AI infrastructure,...

Monday 6 July 2026

Anthropic's reported chip plans with Samsung could ease inference costs, not chase top-end performance

Anthropic's reported move into in-house chip development could matter well beyond Silicon Valley if it helps lower the cost of running AI services worldwide. By prioritizing cheaper...

Friday 3 July 2026

Tsinghua chip veteran’s US$1.8bn 3D AI chip startup targets China’s GPU gap

China's AI chip sector has a heavyweight new entrant: veteran semiconductor figure Shaojun Wei has formally unveiled Shanghai Orient...

Wednesday 1 July 2026

OpenAI engineers claim to discover way to cut inference costs in half

OpenAI engineers claim to have figured out a way to halve the costs of inference using its models, according to The Information. The development comes as AI model developers...

Wednesday 1 July 2026

AI chip startup Rebellions' acquisition of SqueezeBits signals push beyond hardware

South Korean AI chip designer Rebellions said on June 30 that it is acquiring AI inference optimization company SqueezeBits, as part...

Monday 29 June 2026

Qualcomm stirs AI data center competition with CPUs, ASICs, and accelerators

Qualcomm officially unveiled its Dragonfly data center platform at this week's annual investor day, laying out a four-pronged push into cloud AI that spans SerDes, PAM4 DSP, and other...

Thursday 25 June 2026

SambaNova targets US$10B valuation as demand rises for cheaper AI inference

AI chipmaker SambaNova could raise between US$800 million and US$1 billion in a new funding round, according to its executive chairman and Intel CEO Lip-Bu Tan. This would raise SambaNova's...

Thursday 25 June 2026

ByteDance's reported AI chip orders mark breakthrough for Chinese GPU maker Iluvatar CoreX

ByteDance's reported plan to purchase at least 50,000 AI inference chips from Shanghai-based GPU developer Iluvatar CoreX could become one of the most significant commercial wins yet...

Thursday 25 June 2026

Nvidia and AWS deepen push to simplify AI infrastructure at scale

Nvidia and Amazon Web Services (AWS) are expanding tools that could make it easier for companies worldwide to build and run large-scale AI systems. The changes aim to improve speed,...

1/9 pages

BIZ FOCUS

Jul 30, 08:00

EDOM Accelerates Edge AI Deployment with NVIDIA Technologies

MOST-READ
7 DAYS NEWS

Full list