
Alibaba Cloud claims its modular datacenter architecture shrinks build times by 50 percent

Also reveals boosted utilization rates, upgraded IaaS and more – all in the name of AI apps


Alibaba Cloud has revealed a modular datacenter architecture it claims will help it to satisfy demand for AI infrastructure by improving performance and build times for new facilities.

Announced at its annual Apsara conference yesterday, the "CUBE DC 5.0" architecture was described as using "prefabricated modular designs" plus "advanced and proprietary technologies such as wind-liquid hybrid cooling system, all-direct current power distribution architecture and smart management system."

Alibaba Cloud hasn't explained those techs in depth, but claims the modular approach reduces deployment times by up to 50 percent compared to traditional datacenter building techniques.

The Register has asked Alibaba to explain the workings of a "wind-liquid hybrid cooling system." For what it's worth, machine-translating the term into Mandarin and feeding the result into search engines produced results describing cold plate cooling – a technique that sees thin reservoirs of cooled liquids placed on hardware, with cooling achieved by circulating liquid and/or blowing air across the plates.

Whatever the term describes, CEO of Alibaba Cloud Intelligence Eddie Wu told the conference his company "is investing heavily in building an AI infrastructure for the future."

"These enhancements are not just about keeping up with AI demands but about setting a global standard for efficiency and sustainability."

Other steps towards that goal include a scheduler said to better manage hardware resources so that they achieve utilization rates of up to 90 percent.

Alibaba Cloud's IaaS offering, the Enterprise Elastic Compute Service (ECS), has reached its ninth generation. Conference attendees were told it is better equipped for AI applications as it has improved recommendation engine speeds by 30 percent and database read/write queries per second by 17 percent.

Also at the conference, Alibaba Cloud announced an "Open Lake data utility" that integrates multiple big data engines so they can be used by generative AI applications. Another new offering, "DMS: OneMeta+OneOps," apparently combines and manages metadata from 40 different data sources.

It's 2024, so Alibaba Cloud also announced some AI news: the release of its Qwen 2.5 multimodal models, available in sizes from 0.5 to 72 billion parameters, supporting 29 languages and tuned for the needs of sectors including automotive and gaming. The new models are said to have "enhanced knowledge [and] stronger capabilities in math and coding."

A text-to-video AI model that works with both Chinese and English prompts, Tongyi Wanxiang, was also released.

"The new model is capable of generating high-quality videos in a wide variety of visual styles from realistic scenes to 3D animation," boasted Alibaba Cloud execs. ®
