FriendliAI is collaborating with Samsung SDS to deliver frontier model AI inference services to global startups and enterprises. The initiative brings together FriendliAI's high-performance inference stack and Samsung Cloud Platform (SCP)'s scalable NVIDIA B300 GPU infrastructure. By integrating FriendliAI?s platform with Samsung SCP?s B300 GPU infrastructure, the two companies will enable customers to run the latest frontier open-weight models ?
including GLM 5.1, MiniMax M2.5, NVIDIA Nemotron 3 Super, and DeepSeek v3.2 at maximum performance with production-grade reliability and competitive token pricing. The partnership is designed to provide organizations worldwide with a seamless and scalable experience when deploying frontier AI models. Customers will be able to access FriendliAI?s inference platform powered by Samsung SCP?s NVIDIA B300 GPU IaaS, combining inference optimization with GPU infrastructure.
Key benefits include: Day-0 Support for Frontier Open-Weight Models: Immediate support for models such as GLM 5.1, MiniMax M2.5, NVIDIA Nemotron 3 Super, and DeepSeek v3.2 enables companies to stay at the forefront of AI innovation without building and maintaining custom inference stacks. High-Performance Inference: FriendliAI?s optimized platform is built on its in-house inference engine featuring custom kernels, advanced quantization techniques, intelligent caching, and adaptive speculative decoding. Cost-Effective Scalability: The partnership enables high-speed inference with low operational cost and maximum scalability.
Friendli Serverless Endpoints provide a flexible, token-based pricing model and pay only for the tokens they process. Global Reach and Reliability: FriendliAI?s inference infrastructure, integrated with Samsung SCP?s B300 IaaS platform, provides a global footprint with high-availability guarantees, ensuring low-latency and stable service delivery for customers worldwide.

















