Now Reading
Core42 Launches Open-Weight GPT-OSS Models on AI Cloud Platform

Core42 Launches Open-Weight GPT-OSS Models on AI Cloud Platform

Core42 AI Cloud showcasing GPT-OSS models deployment

Core42, a G42 company specializing in sovereign cloud and AI infrastructure, has introduced OpenAI’s latest open-weight models, gpt-oss-20B and gpt-oss-120B, on its AI Cloud platform. Consequently, enterprises, researchers, and developers can now access these models instantly through the Core42 Compass API. This deployment provides scalable, high-performance AI capabilities across global markets.

Furthermore, the models can run on a variety of leading silicon platforms. Therefore, organizations can leverage sovereign, scalable, and high-performance infrastructure while optimizing workloads for both speed and cost.

High-Performance AI with Compass API

Integrated into the Compass API, Core42 offers flexible access to a broad spectrum of compute platforms. As a result, the platform delivers inference speeds of up to 3,000 tokens per second per user, supporting real-time AI at global scale. Additionally, this infrastructure ensures workloads are matched to the most suitable hardware for maximum efficiency.

Kiril Evtimov, CEO of Core42 and Group CTO of G42, said, “Core42 AI Cloud, powered by silicon-diverse infrastructure, delivers the flexibility and performance needed for today’s AI workloads. Through the Compass API, organizations can access the latest open-weight AI models and choose the optimal platform to scale transformation, optimize performance and cost, and drive progress across global markets.”

Benefits of Open-Weight Model Deployment

The Compass API deployment brings several advantages:

  • Enterprise-scale performance: Supports demanding workloads at global scale, enabling advanced automation and real-time AI applications.

  • Sovereign-ready scalability: Provides in-country deployment with full sovereign controls, ideal for regulated sectors like healthcare, finance, and national security.

    See Also
    Esports World Cup arena in Riyadh with global players and massive audience

  • Optimized for committed environments: Ensures predictable cost and performance for organizations operating under dedicated infrastructure agreements.

  • Cost-efficient agentic AI: Enables low-cost agentic AI workloads while maintaining in-country deployment and compliance.

Additionally, these models allow organizations to run and fine-tune AI locally or in the cloud with full transparency. This flexibility helps align performance, cost, and regulatory compliance according to specific needs, reinforcing Core42’s commitment to secure and optimized global AI infrastructure.

View Comments (0)

Leave a Reply

Your email address will not be published.

© 2024 The Technology Express. All Rights Reserved.