The need for high-throughput data analysis has grown tremendously since the introduction of next-generation sequencing (NGS) platforms. The massive amount of data produced creates a new class of resource barriers to be overcome including limited bandwidth, storage volume and compute power. Small research labs can hardly cope with the data generated. A solution to the mere resource problem are cloud computing environments as virtually unlimited and flexible resources.
The de.NBI Cloud is a full academic cloud federation, providing compute and storage re-sources free of charge for academic users. It provides a powerful IT infrastructure in combination with flexible bioinformatics workflows and analysis tools to the life science community in Germany. The de.NBI Cloud offers reliable IT security concepts and user access rules to en-sure secure data access and storage. It closes the gap of missing computational resources for life science researchers in Germany.
The de.NBI Cloud project started in 2016 as collaboration between the universities of Bielefeld, Freiburg, Gießen, Heidelberg and Tübingen. The close cooperation with the ELIXIR cloud ensures the connectivity and sustainability in the international context.
The de.NBI Cloud operates the major service levels:
• Infrastructure as a Service (IaaS)
suited for experienced power users that want full control over the compute environment; plain access to virtualized infrastructure
• Platform as a Service (PaaS)
suited for experienced users who utilize fully configured infrastructure for the deployment of custom workflows
• Software as a Service (SaaS)
suited for users without cloud experience who can use virtual machines (VMs) of pre-configured, state-of-the-art analysis tools and pipelines
Cloud computing requires initial efforts and skills to port existing workflows to these new mod-els. The same holds true for emerging programming models. Cloud environments can be difficult to use by scientists with little system administration and programming skills. Challenges exist in managing cloud environments as there is a lack of tools which simplify accessing and using these environments and helping bootstrap users by providing basic software stacks.
The goal of the tutorial is to provide a fundamental introduction to the underlying OpenStack infrastructure. Target audience are bioinformaticians or experienced computational data analysts who would like to utilize scalable and flexible cloud resources for their research. Participants will learn how to setup a cloud project and work with virtual instances, and how to efficiently utilize cloud computing resources. We will also address networking and security issues, demonstrate how to deploy bioinformatics tools in the cloud, and how to set up a customized compute cluster in a cloud environment using BiBiGrid. All topics will be covered by short talks and practical hands-on sessions.