Archivo de la categoría: AI

Alibaba’s LLM pricing challenges domestic and Western rivals

Alibaba Cloud LLM pricing strategy has taken a dramatic turn as the company announces an unprecedented 85% reduction in access costs for its most sophisticated large language models. The announcement, made via WeChat and reported by the South China Morning Post, positions the company’s flagship Qwen-VL-Max model at just 0.003 yuan ($0.00041) per thousand tokens, […]

The post Alibaba’s LLM pricing challenges domestic and Western rivals appeared first on Cloud Computing News.

Microsoft outperforms Amazon and Google in cloud AI

Cloud heavyweights AWS, Microsoft Azure, and Google Cloud Platform (GCP) are positioning themselves to use AI as a key driver of cloud infrastructure growth. According to IoT Analytics, Microsoft is emerging as a cloud AI leader, particularly in genAI, passing both AWS and Google. Microsoft leads in generative AI Microsoft accounted for 45% of new […]

The post Microsoft outperforms Amazon and Google in cloud AI appeared first on Cloud Computing News.

DC certifications for newcomers, experts, and sustainability pros

The data centre industry is rapidly evolving, driven by advances in AI, cybersecurity, and sustainability. Despite these changes, certifications remain essential for IT career growth. According to NetworkWorld, certifications have emerged to match the demands of cutting-edge technologies. For IT professionals, the appeal of certifications is clear—enhanced career prospects, higher salaries, and greater job security. […]

The post DC certifications for newcomers, experts, and sustainability pros appeared first on Cloud Computing News.

Oracle partners with Meta to power Llama AI models

Oracle has entered into with Meta, enabling the social media giant to leverage Oracle Cloud Infrastructure for training and deploying its Llama large language models (LLMs), as reported by TechRadar. The agreement was confirmed by Oracle’s Chief Technology Officer, Larry Ellison. “We just signed an agreement with Meta – for them to use Oracle’s AI […]

The post Oracle partners with Meta to power Llama AI models appeared first on Cloud Computing News.

Google Cloud partners with Air France-KLM to transform its data and generative AI strategy

Google Cloud has formed a strategic collaboration with Air France-KLM (AFKL), a leading global airline group, to help the airline leverage the power of its data, analytics, and generative AI (gen AI) technology to accelerate its data-centric and multicloud strategy, drive innovation, and reimagine the future of the travel industry. Through the enhancement and growing… Read more »

The post Google Cloud partners with Air France-KLM to transform its data and generative AI strategy appeared first on Cloud Computing News.

How cloud providers are tackling GPU shortages with custom chips

GPUs are the backbone of AI computing, but as demand exceeds supply, cloud providers are getting creative. Instead of waiting for more GPUs, as Network World reported, they’re creating custom chips to meet specific workloads, delivering faster, more efficient computing while keeping costs under control. The competition is heating up. At Microsoft’s Ignite conference last… Read more »

The post How cloud providers are tackling GPU shortages with custom chips appeared first on Cloud Computing News.

Building the future of AI systems at Meta

Meta’s Ye (Charlotte) Qi took the stage at QCon San Francisco 2024, to discuss the challenges of running LLMs at scale. As reported by InfoQ, her presentation focused on what it takes to manage massive models in real-world systems, highlighting the obstacles posed by their size, complex hardware requirements, and demanding production environments. She compared… Read more »

The post Building the future of AI systems at Meta appeared first on Cloud Computing News.

NetApp partners with Vultr Cloud Alliance for scalable AI solutions

NetApp has joined the Vultr Cloud Alliance, a collaboration of technology providers focused on delivering adaptable AI cloud services. The alliance leverages each member’s unique strengths to address the growing demand for scalable and composable infrastructure for cloud and AI workloads. The integration of NetApp’s ONTAP platform with Vultr’s global network of data centres provides… Read more »

The post NetApp partners with Vultr Cloud Alliance for scalable AI solutions appeared first on Cloud Computing News.

Revolutionising data centre sustainability with power capping

Have you ever noticed how the rise of AI and cloud computing has supercharged energy demands? Data centre, the unsung heroes of our digital world, are now grappling with a growing dilemma. While they power our online lives, they’re also some of the biggest energy guzzlers, contributing significantly to global power consumption. With generative AI… Read more »

The post Revolutionising data centre sustainability with power capping appeared first on Cloud Computing News.

Data centre cooling crisis: UT Austin’s game-changing fix

The relentless march of artificial intelligence (AI) is pushing data centre cooling systems to their absolute limits. Inside these massive computing facilities, densely packed servers generate enough heat to require industrial-scale cooling solutions, with some areas reaching critical temperatures exceeding 100°F (37.8°C). As AI workloads continue to multiply exponentially, traditional cooling methods are struggling to… Read more »

The post Data centre cooling crisis: UT Austin’s game-changing fix appeared first on Cloud Computing News.