HPC, Big Data, and Data Science

Using OpenStack to reduce HPC service complexity

... no, that is not an oxymoron!

February 5, 2022
3:00 PM – 3:30 PM

D.hpc

John Garbutt

<p>Why build #4 on the Green500 using OpenStack? It makes it easier to manage. Cambridge University started using OpenStack in 2015. Since mid 2020, all new hardware is controlled using OpenStack. Compute nodes, GPU nodes, Lustre nodes, Ceph nodes, almost everything. OpenStack allows large baremetal slurm clusters and dedicated TRE (trusted research environments) to share the same images. Is this a cloud native supercomputer?</p>

We will explore how OpenStack is used to manage a supercomputer as a shared pool of hardware resources, that can be partitioned between a multitude of different platforms required by a diverse group of scientists. Ranging from Trusted Research Environments (TREs), on demand dedicated AI platforms, dedicated big data platforms, and to traditional shared Slurm clusters. We will focus on providing a range of services from a single shared hardware pool, allowing for the delivery of both on demand interactive compute platforms for STFC's IRIS e-Infrastrcture and Slurm clusters such as the #4 in the Green500, called Wilkes-3: https://www.top500.org/system/179930/ This makes use of both OpenStack Ironic, for the baremetal deployment, and on-demand OpenStack KVM powered VMs running Cluster API provisioned Kubernetes, with KubeApps to deploy JuypterHub.

Additional information

Type	devroom

More sessions

2/5/22	Low-code data visualization and aggregation with OpenSearch Dashboards HPC, Big Data, and Data Science Olena Kutsenko D.hpc <p>Working with Big Data means that we need tools to organise and understand the data. And you don’t have to be a developer to search, aggregate and visualise your data. Whether you need an affordable business analytics tool or you want to analyse log data in near real time, OpenSearch can help you. And all of it through a visual interface of OpenSearch Dashboards.</p> <p>After listening to this talk you’ll understand the basics of working with an OpenSearch cluster and different use cases ...
2/5/22	Uncovering Arcon: A state-first Rust streaming analytics runtime HPC, Big Data, and Data Science Max Meldrum D.hpc <p>In this talk, I will present Arcon, a Rust-native streaming runtime that integrates seamlessly with the Apache Arrow ecosystem. The Arcon philosophy is streaming first, similarly to systems such as Apache Flink and Timely Dataflow. However, unlike all existing systems, Arcon features great flexibility when it comes to its application state. Arcon's TSS query language allows extracting and operating on state snapshots consistently based on application-time constraints and interfacing with ...
2/5/22	Build an Open Source Streaming Data Pipeline HPC, Big Data, and Data Science D.hpc <p>Any conversation about Big Data would be incomplete without talking about Apache Kafka and Apache Flink: the winning open source combination for high-volume streaming data pipelines.</p> <p>In this talk we'll explore how moving from long running batches to streaming data changes the game completely. We'll show how to build a streaming data pipeline, starting with Apache Kafka for storing and transmitting high throughput and low latency messages. Then we'll add Apache Flink, a distributed ...
2/5/22	Containers in HPC HPC, Big Data, and Data Science Christian Kniep D.hpc <p>This short talk will disect the container ecosystem for HPC in four segments and discusses what to look out for, what is already settled and how to navigate containers in 2022.</p>
2/5/22	This is The Way- A Crash Course on the Intricacies of Managing CPUs in K8s HPC, Big Data, and Data Science D.hpc <p>Optimizing CPU management improves cluster performance and security, but is daunting to almost everyone. CPU management may seem complex, but it can be explained in such a way that even your inner toddler will comprehend. With this talk, we will give a path to success.</p> <p>You may have a multi-socket node cluster where your AI/ML workloads care about the proximity of your CPUs to GPUs. You may be running scientific workloads where you want to pin in cores within containers instead of just ...
2/5/22	Making Apache Spark, Apache Mahout, Kubeflow, and Kubernetes Play Nice HPC, Big Data, and Data Science Trevor Grant D.hpc <p>Working with big data matrices is challenging, Kubernetes allows users to elastically scale, but can only have a pod as large as a node, which may not be large enough to fit the matrix in memory. While Kubernetes allows for other paradigms on top of it which allows pods to coordinate on individual jobs, setting them up and making them play nice with ML platforms is not straightforward. Using Apache Spark and Apache Mahout we can work with matrices of any dimension and distribute them across ...
2/6/22	HPC for Social & Crime Science HPC, Big Data, and Data Science Philipp M. Dau D.hpc <p>Many scientific disciplines have benefitted from the availability of big datasets to develop algorithm supported solutions. Recently, this trend has penetrated the fields of crime and police research. The presentation highlights use cases of big data computation and HPC for typical datasets in crime science: crime records, emergency call data, and police GPS data. The focus lies on spatiotemporal applications (i.e., geocoding, map matching, spatial and temporal algorithms). The datasets come ...

FOSDEM 2022

2/5/22 – 2/6/22

Event

Hackerkonferenzen

Created by @CCC 58 Follower

Event Calendar