INTRODUCTION
Red Hat and SAP® developed an integrated solution for the interactive analysis of big data. SAP Data Hub is a distributed computing solution that provides actionable business insights from big data. Red Hat® OpenShift® Container Platform delivers an enterprise Kubernetes environment for deploying and managing container-based workloads like SAP Data Hub.
SAP Data Hub helps companies orchestrate data of any volume, velocity, and variety. As a result, companies can progress toward an integrated big data landscape founded on an enterprise hub for all types of data—including streaming, Apache Hadoop, transactional, Internet of Things (IoT), and location data.
SUPPORTING THE FUTURE OF BIG DATA
SAP DATA HUB
SAP Data Hub provides an integration layer for data-driven processes across enterprise services to process and orchestrate data in the overall landscape for all user groups—IT, business analysts, data scientists, data engineering, and data stewards. It integrates and prepares data within a digital landscape to inform business decisions. It offers an open, big data architecture with open source integration, cloud deployments, and third-party interfaces. It uses massive distributed processing and serverless computing capabilities to deliver increased performance. With SAP Data Hub, you can easily package analysis from disparate data types with a common SQL interface, easy-to-use modeling tools, and extension of Apache Spark to include SAP HANA® platform access. As a result, you can gain insight based on relevant information from across your data environment.
RED HAT OPENSHIFT CONTAINER PLATFORM
Red Hat OpenShift Container Platform brings Linux® containers and Kubernetes to the enterprise. The platform helps organizations build, deploy, and manage new and existing applications across physical, virtual, or public cloud infrastructures. Red Hat is a leading contributor to the Kubernetes open source project and the Cloud Native Computing Foundation (CNCF). Red Hat OpenShift Container Platform is built on open source standards and is a proven, reliable container platform for SAP Data Hub.
INTEGRATED RED HAT AND SAP SOLUTION
SAP Data Hub running on Red Hat OpenShift Container Platform provides an integrated solution that supports enterprise applications and use cases and continues the strong partnership between the two companies. SAP Data Hub makes use of Kubernetes clusters, with services built into Red Hat OpenShift Container Platform containers. Components of SAP Data Hub, primarily SAP Spark Extension, are deployed on Hadoop. SAP Data Hub Distributed Processing then runs analytics engines and services in containers orchestrated by Kubernetes and collects big data via Spark, Hadoop, or directly from cloud environments.