Monitoring and Observability

Monitoring of a Large-Scale University Network: Lessons Learned and Future Directions

The complexity of network monitoring strongly depends on the size of the network under observation. Challenges in monitoring large-scale networks arise not only from dealing with a large volume of traffic, but also from keeping track of all traffic sources, destinations, and who-talks-to-whom communications. Analyzing this information allows to uncover new behaviors that would have not been visible by merely observing common metrics such as bytes and packets. The drawback is that extra pressure is put on the monitoring system as well and on the downstream data- and timeseries-stores. This talk presents a case study based on the monitoring of a large-scale university network. Challenges faced, findings, and lessons learned will be examined. It will be shown how to make sense of the input data to properly manage and reduce its scale as early as possible in the monitoring system. The discussion will also highlight the advantages and limitations of the opensource software components of the monitoring system. In particular, the opensource network monitoring tool ntopng and the timeseries-store InfluxDB will be considered. It will be shown what happens when ntopng and InfluxDB are pushed to their limits and beyond, and what it can be done to ensure their smooth operation. Relevant findings, behaviors uncovered in the network traffic, and future directions will conclude the talk. Intended audience is technical and managerial individuals who are familiar with network monitoring.
Main challenges of monitoring large-scale networks Case study based on the monitoring of a university network Availability, scalability and suitability of opensource software for network monitoring

Weitere Infos

Format devroom

Weitere Sessions

02.02.20
Monitoring and Observability
Richard Hartmann
UD2.120 (Chavanne)
Introduction and welcome to the monitoring and observability devroom
02.02.20
Monitoring and Observability
Juraci Paixão Kröhling
UD2.120 (Chavanne)
Distributed tracing is a tool that belongs to every developer's tool belt, but what it actually can do remains a mystery to most developers. In this slideless talk, we will introduce you to the world of distributed tracing by developing a cloud native application from scratch and applying all important distributed tracing concepts in practice, at first by hand and then by using existing libraries to automate our work. You will learn not only what distributed tracing is, but how it works, what it ...
02.02.20
Monitoring and Observability
Andrej Ocenas
UD2.120 (Chavanne)
This talk presents current capabilities of Grafana to integrate metrics, logs and traces and shows how to setup both Grafana and application code to be able to correlate all 3 in Grafana. It assumes some familiarity with Grafana to follow the How To steps but should be suitable for beginner users. Afterwards it shows future direction of Grafana in context of "Experiences", for even more seamless experience when correlating data from multiple data sources.
02.02.20
Monitoring and Observability
Deepika Upadhyay
UD2.120 (Chavanne)
Jaeger and Opentracing provide ready to use tracing services for distributed systems and are becoming widely used de-facto standard because of their ease of use. Making use of these libraries, Ceph, can reach to a much-improved monitoring state, supporting visibility to its background distributed processes. This would, in turn, add up to the way Ceph is being debugged, “making Ceph more transparent” in identifying abnormalities. In this session, the audience will get to learn about using ...
02.02.20
Monitoring and Observability
Richard Hartmann
UD2.120 (Chavanne)
Society would end if all ModBus stopped working overnight. Good thing it has zero security built in. Still, it's useful to get data out of industrial systems, be they a datacenter or a power plant.
02.02.20
Monitoring and Observability
Jean-Marc Davril
UD2.120 (Chavanne)
According to the United Nations, 2.5 billion more people will be living in cities by 2050. This trend has caused indoor farming to draw a lot of attention and effort in recent years, in an attempt to scale the production of highly nutritious, healthy food inside cities. Over the past 3 years, Agricool has recycled 20 industrial containers into farms that grow strawberries, herbs and salads, in the very heart of cities, and without any pesticide. These urban farms are currently operated in Paris ...
02.02.20
Monitoring and Observability
Rob Skillington
UD2.120 (Chavanne)
The cardinality of monitoring data we are collecting today continues to rise, in no small part due to the ephemeral nature of containers and compute platforms like Kubernetes. Querying a flat dataset comprised of an increasing number of metrics requires searching through millions and in some cases billions of metrics to select a subset to display or alert on. The ability to use wildcards or regex within the tag name and values of these metrics and traces are becoming less of a nice-to-have ...