As enterprises move from the hype of generative AI to building governed, production-ready AI applications, several new challenges have emerged. Currently, many businesses route all of their prompts to massive, cloud-based LLMs, which can result in excessive costs, high latency, and unnecessary data exposure.
To solve this problem, Red Hat in collaboration with NVIDIA is bringing enterprise-grade AI development directly to the developer’s desk. We are excited to announce the development preview of Red Hat Enterprise Linux 10 (RHEL 10) on NVIDIA DGX Spark.
Powered by the NVIDIA GB10 Grace Blackwell architecture, NVIDIA DGX Spark boasts up to 1 Petaflop of performance, FP4 data format support, and 128GB of unified memory. By pairing this hardware with Red Hat’s trusted operating system and developer tooling, we are delivering a high performance AI developer workstation that enables the inner development loop and local agentic AI workloads at the edge.
This combination allows developers to build, test, and evaluate complex agentic workflows locally with familiar tools like Red Hat Desktop prior to pushing those workloads into production.
Why does this matter?
With RHEL on NVIDIA DGX Spark, we are tackling the primary roadblocks to enterprise AI adoption, including:
- Cost control and data sovereignty: By keeping sensitive traffic local and utilizing small language models (SLMs) for medium-to-low complexity tasks, businesses can reduce their cloud API spend while restricting their sensitive data to a local workstation. Additionally, the 128GB of unified memory on NVIDIA DGX Spark enables hosting large language models (LLMs) and can help further reduce cloud costs during model development.
- Evaluation-driven development: Autonomous AI agents are non-deterministic, making standard unit testing insufficient. Running RHEL on NVIDIA DGX Spark provides the ultimate sandbox for rigorous testing. Developers can use embedded MLflow instances for “glass box” trajectory tracing, capturing tool calls and model queries more effectively, and performing LLM-as-a-Judge evaluations locally before pushing anything to production.
- Core-to-cluster consistency: By standardizing on RHEL as the underlying OS, developers can focus on building their applications without worrying about infrastructure friction. RHEL combines production stability with developer agility to help accelerate application development across hybrid cloud environments and industry leading chip architectures. Plus, transitioning from local workstation development to large scale deployments on Red Hat OpenShift becomes a more streamlined, predictable pipeline.
The future of AI is hybrid, and it starts at the developer’s desk. Reach out to your Red Hat representative to learn more about RHEL 10 in developer preview on NVIDIA DGX Spark and how it can help accelerate your AI journey.
저자 소개
Luke Thompson is a Senior Product Manager for the Edge Computing portfolio at Red Hat, focused on delivering platform capabilities that simplify how edge technologies are adopted and used in real-world environments.
He brings a strong background in industrial automation and IT/OT convergence, with experience helping organizations integrate complex systems, streamline data flows, and turn operational data into actionable insights in manufacturing and industrial settings.
Today, Luke applies this expertise to bridge the gap between sophisticated IT infrastructure and the practical needs of users at the edge, making advanced technologies more accessible, usable, and impactful.
유사한 검색 결과
에이전틱 AI가 요구하는 새로운 인프라 스택: AMD와 Red Hat의 솔루션 제공
과거의 운영 방식에서 벗어나 IT의 미래 구축
Operating System Management | Compiler
Technically Speaking | Inside open source AI strategy
채널별 검색
오토메이션
기술, 팀, 인프라를 위한 IT 자동화 최신 동향
인공지능
고객이 어디서나 AI 워크로드를 실행할 수 있도록 지원하는 플랫폼 업데이트
오픈 하이브리드 클라우드
하이브리드 클라우드로 더욱 유연한 미래를 구축하는 방법을 알아보세요
보안
환경과 기술 전반에 걸쳐 리스크를 감소하는 방법에 대한 최신 정보
엣지 컴퓨팅
엣지에서의 운영을 단순화하는 플랫폼 업데이트
인프라
세계적으로 인정받은 기업용 Linux 플랫폼에 대한 최신 정보
애플리케이션
복잡한 애플리케이션에 대한 솔루션 더 보기
가상화
온프레미스와 클라우드 환경에서 워크로드를 유연하게 운영하기 위한 엔터프라이즈 가상화의 미래