Hadoop is gaining incredible traction in enterprise big data implementations1. But how do enterprises use Hadoop in concert with traditional analytics to gain actionable information from their big data? What impact does this goal have on Hadoop deployment decisions?
An IDC study commissioned by Red Hat, titled “Trends in Enterprise Hadoop Deployments” reports, “... 32% of respondents indicated that their firms have existing Hadoop deployments. An additional 31% indicated that they had plans to deploy it within 12 months. And finally 36% said that their Hadoop deployment schedule could go beyond 12 months.”
Enterprises are using more than Hadoop for big data analysis. The IDC study states, “Nearly 39% respondents indicated that they use NoSQL databases like HBase, Cassandra and MongoDB etc. while nearly 36% indicated that they use MPP databases like Greenplum and Vertica etc. This situation also underscores the importance of causality and correlation – in which traditional structured data sets are analyzed in conjunction with unstructured data from newer sources.”
The IDC study also explores the use cases for enterprise Hadoop deployments and confirms that “... businesses use Hadoop in more than one way:
- Analysis of raw data – whether it is operations data, data from machines or devices, point of sale systems or customer behavioral data gathered from ecommerce or retail systems – is one of the dominant use cases for Hadoop.
- Nearly 39% of respondents indicated that they use Hadoop for service innovation, which includes the analysis of secondary data sets for modeling of “if-then scenarios for products and services.
- Some of the less popular use cases for Hadoop include its deployment as a platform for non-analytic workloads (for example in conjunction with a SQL overlay for OLTP).”
As a result, enterprises are looking to alternative persistent storage systems. According to the IDC report, “File systems like IBM’s Global File system (GPFS), Red Hat Storage (GlusterFS), EMC Isilon OneFS and others that have earned a reputation for their robust, scale-out capabilities are clearly preferred as alternatives to HDFS. Of these three, only Red Hat offers an integrated open-source based enterprise Linux platform that combines a distributed file system with a Hadoop connector, enterprise middleware and the ability to run Hadoop computational workloads natively.”
The survey also found that most enterprises process big data both before and after Hadoop processing. This highlights another attractive feature of GlusterFS and other alternatives – the ability to keep the data in native POSIX format and use traditional analysis tools.
To download the full copy of the IDC Study, register here.
1IDC White Paper, Trends in Enterprise Hadoop Deployments, commissioned by Red Hat, August 2013
저자 소개
채널별 검색
오토메이션
기술, 팀, 인프라를 위한 IT 자동화 최신 동향
인공지능
고객이 어디서나 AI 워크로드를 실행할 수 있도록 지원하는 플랫폼 업데이트
오픈 하이브리드 클라우드
하이브리드 클라우드로 더욱 유연한 미래를 구축하는 방법을 알아보세요
보안
환경과 기술 전반에 걸쳐 리스크를 감소하는 방법에 대한 최신 정보
엣지 컴퓨팅
엣지에서의 운영을 단순화하는 플랫폼 업데이트
인프라
세계적으로 인정받은 기업용 Linux 플랫폼에 대한 최신 정보
애플리케이션
복잡한 애플리케이션에 대한 솔루션 더 보기
오리지널 쇼
엔터프라이즈 기술 분야의 제작자와 리더가 전하는 흥미로운 스토리
제품
- Red Hat Enterprise Linux
- Red Hat OpenShift Enterprise
- Red Hat Ansible Automation Platform
- 클라우드 서비스
- 모든 제품 보기
툴
체험, 구매 & 영업
커뮤니케이션
Red Hat 소개
Red Hat은 Linux, 클라우드, 컨테이너, 쿠버네티스 등을 포함한 글로벌 엔터프라이즈 오픈소스 솔루션 공급업체입니다. Red Hat은 코어 데이터센터에서 네트워크 엣지에 이르기까지 다양한 플랫폼과 환경에서 기업의 업무 편의성을 높여 주는 강화된 기능의 솔루션을 제공합니다.