Introduction
Anyone who is serious about big data, scale out applications and cloud infrastructure should want to intimately understand the benefits of scale out architecture and the resource elasticity of cloud services. As we continue our evolution into a deeper understanding of data, we see a need agile access to an elastic big data platform. Such a platform can allow us to capture, synthesize and quantify data into business value.
Enter OpenStack Sahara - the intersection of Hadoop and OpenStack.
As an OpenStack project started by Red Hat, Mirantis and Hortonworks during the OpenStack Havana summit in Portland, Sahara was incubated for the OpenStack Icehouse release and is expected to be integrated for OpenStack Juno by the end of 2014.
Sahara’s mission is to provide a scalable data processing stack and associated management interfaces. Sahara delivers on that mission by providing the ability to rapidly create and manage Apache Hadoop™ clusters and easily run workloads across them. All on OpenStack managed infrastructure, without having to deal with the details of cluster management.
With full cluster lifecycle management, provisioning, scaling and termination, Sahara allows the user to select different Hadoop versions, cluster topology and node hardware details.
Sahara key features and use cases:
- Fast and agile Hadoop cluster deployment
- An extensible framework for management and provisioning components
- Run Hadoop workloads in few clicks without expertise in Hadoop operations
- “Analytics as a Service” utilization of unused compute capacity for ad-hoc or bursty analytic workloads
- Sahara supports different types of jobs: MapReduce, Hive, Pig and Oozie workflows. The data could be taken from various sources: Swift, HDFS, NoSQL and SQL databases. It also supports various provisioning plugins.
- The intersection of two of the largest open source movements
- OpenStack provides the foundation and hub of innovation for cleanly managing infrastructure resources. While Apache Hadoop™ serves as the core and innovation driver for storing and processing data.
Bringing these two technologies together not only strengthens and catalyzes their ecosystems, but offers an increasing wealth of value to their users.
The OpenStack Sahara project aims to facilitate this combination and enable customers and partners alike to take advantage of a growing big data processing platform on OpenStack.
Our vision is to bring Big Data and OpenStack together, with a broad ecosystem of partner interoperability, reliability & choice.
You can use Sahara now in RDO and as technology preview in RHEL OSP 5
Over the next few months, we’ll bring you examples of how to use Sahara in RDO and RHEL OSP, how to get involved as a customer or partner, and tell you about the value provided by merging the infrastructure and data processing universes. Look for post by Keith Basil and Matthew Farrelle.
To learn more and get involved with the Sahara project, please visit the Sahara OpenStack Wiki at: https://wiki.openstack.org/wiki/Sahara
저자 소개
채널별 검색
오토메이션
기술, 팀, 인프라를 위한 IT 자동화 최신 동향
인공지능
고객이 어디서나 AI 워크로드를 실행할 수 있도록 지원하는 플랫폼 업데이트
오픈 하이브리드 클라우드
하이브리드 클라우드로 더욱 유연한 미래를 구축하는 방법을 알아보세요
보안
환경과 기술 전반에 걸쳐 리스크를 감소하는 방법에 대한 최신 정보
엣지 컴퓨팅
엣지에서의 운영을 단순화하는 플랫폼 업데이트
인프라
세계적으로 인정받은 기업용 Linux 플랫폼에 대한 최신 정보
애플리케이션
복잡한 애플리케이션에 대한 솔루션 더 보기
오리지널 쇼
엔터프라이즈 기술 분야의 제작자와 리더가 전하는 흥미로운 스토리
제품
- Red Hat Enterprise Linux
- Red Hat OpenShift Enterprise
- Red Hat Ansible Automation Platform
- 클라우드 서비스
- 모든 제품 보기
툴
체험, 구매 & 영업
커뮤니케이션
Red Hat 소개
Red Hat은 Linux, 클라우드, 컨테이너, 쿠버네티스 등을 포함한 글로벌 엔터프라이즈 오픈소스 솔루션 공급업체입니다. Red Hat은 코어 데이터센터에서 네트워크 엣지에 이르기까지 다양한 플랫폼과 환경에서 기업의 업무 편의성을 높여 주는 강화된 기능의 솔루션을 제공합니다.