The future of AI compute is heterogeneous, according to Microsoft's GM of Azure Maia Andrew Wall. The implications of this are ...
Microsoft combines accelerated computing with cloud scale engineering to bring advanced AI capabilities to our customers. For years, we’ve worked with NVIDIA to integrate hardware, software and ...
These tech stocks look particularly well positioned to benefit from this opportunity.
As AI compute costs rise, Microsoft is seeking to reduce reliance on third-party chips, extending its push from custom ...
KubeCon Europe 2026 made AI inference its central focus with major CNCF donations including llm-d, Nvidia's GPU DRA driver ...
Calling it the highest-performance chip of any custom cloud accelerator, the company says Maia is optimized for AI inference across multiple models. Signaling that the future of AI may not just be how ...
The big four cloud giants are turning to Nvidia's Dynamo to boost inference performance, with the chip designer's new Kubernetes-based API helping to further ease complex orchestration. According to a ...
Gimlet Labs just raised an $80 million Series A for tech that lets AI run across NVIDIA, AMD, Intel, ARM, Cerebras and ...
Nvidia announcements show the current shortage of storage and memory could continue into the future, driving up prices and ...
Azilen launches Inference Engineering practice to optimize AI performance, reduce costs, and scale efficiently across ...