The Sahara project provides a simple means to provision Hadoop clusters in an OpenStack® cloud infrastructure. It enables data processing on OpenStack and aims to eliminate tool management. This reference architecture describes how to install and configure Sahara on Red Hat® Enterprise Linux® OpenStack Platform 5. It also shares deployment best practices for optimizing data processing tools such as Hadoop. The configuration and deployment steps described in this document focus on a single user to small group use case. Users have the ability to spin up a cluster from a predefined template which reads input from and writes output to Swift for long term storage. The cluster remains active for the duration of the activity. Additional clusters can be launched from the same template as needed.