Integration Brings Cerebras Inference Capabilities to Hugging Face Hub

AI hardware company Cerebras has teamed up with Hugging Face, the open source platform and community for machine learning, to integrate its inference capabilities into the Hugging Face Hub. The collaboration gives more than 5 million developers access to models running on Cerebras' CS-3 system, the companies said in a statement, with reported inference speeds significantly higher than conventional GPU solutions.

Cerebras Inference, now available on Hugging Face, processes more than 2,000 tokens per second. Recent benchmarks indicate that models such as Llama 3.3 70B running on Cerebras' system can reach speeds exceeding 2,200 tokens per second, offering a performance increase over leading GPU-based solutions.

"By making Cerebras Inference available through Hugging Face, we're enabling developers to access alternative infrastructure for open source AI models," said Andrew Feldman, CEO of Cerebras, in a statement.

For Hugging Face's 5 million developers, the integration provides a streamlined way to leverage Cerebras' technology. Users can select "Cerebras" as their inference provider within the Hugging Face platform, instantly accessing one of the industry's fastest inference capabilities.

Demand for high-speed, high-accuracy AI inference is growing, especially for test-time compute and agentic AI applications. Open source models optimized for Cerebras' CS-3 architecture enable faster and more precise AI reasoning, the companies said, with speed gains ranging from 10 to 70 times compared to GPUs.

"Cerebras has been a leader in inference speed and performance, and we're thrilled to partner to bring this industry-leading inference on open source models to our developer community," commented Julien Chaumond, CTO of Hugging Face.

Developers can access Cerebras-powered AI inference by selecting supported models on Hugging Face, such as Llama 3.3 70B, and choosing Cerebras as their inference provider.
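For developers working in Python, that selection step can also be made programmatically. Below is a minimal sketch, assuming a recent version of the huggingface_hub library whose InferenceClient accepts a provider argument; the provider identifier "cerebras", the model ID meta-llama/Llama-3.3-70B-Instruct, and the placeholder access token are assumptions for illustration, not details taken from the announcement.

# Minimal sketch: routing a chat completion through Cerebras on the
# Hugging Face Hub. Assumes huggingface_hub's InferenceClient supports
# a `provider` argument (install with: pip install huggingface_hub).
from huggingface_hub import InferenceClient

client = InferenceClient(
    provider="cerebras",   # assumed provider identifier
    api_key="hf_xxx",      # placeholder Hugging Face access token
)

# Request a completion from a supported model (Llama 3.3 70B is named in the article).
response = client.chat_completion(
    model="meta-llama/Llama-3.3-70B-Instruct",
    messages=[{"role": "user", "content": "In one sentence, what is wafer-scale inference?"}],
    max_tokens=128,
)

print(response.choices[0].message.content)

If the provider is configured correctly, the script should print a short completion generated on Cerebras hardware; the same call pattern should apply to any other model that lists Cerebras as a supported inference provider.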

About the Author



John K. Waters is the editor in chief of a number of Converge360.com sites, with a focus on high-end development, AI, and future tech. He's been writing about cutting-edge technologies and the culture of Silicon Valley for more than two decades, and he's written more than a dozen books. He also co-scripted the documentary film Silicon Valley: A 100 Year Renaissance, which aired on PBS. He can be reached at [email protected].


