Previously on OpenStack Crime Investigation … Two load balancers, running as virtual machines in our OpenStack-based cloud and sharing a keepalived-based highly available IP address, started to flap, switching the IP address back and forth. After ruling out a misconfiguration of keepalived and issues in the virtual network, I finally got the hint that the problem might originate not in the virtual, but in the bare metal world of our cloud. Maybe high IO was causing the gaps between the VRRP keepalive packets.
When I arrived at bare metal host node01, hosting virtual machine loadbalancer01, I was anxious to see the IO statistics. The machine had to be under heavy IO load if the virtual machine's messages were being delayed by up to five seconds.
I switched on my iostat flashlight and saw this:
$ iostat
Device:            tps    kB_read/s    kB_wrtn/s    kB_read    kB_wrtn
sda              12.00         0.00       144.00          0        144
sdc               0.00         0.00         0.00          0          0
sdb               6.00         0.00        24.00          0         24
sdd               0.00         0.00         0.00          0          0
sde               0.00         0.00         0.00          0          0
sdf              20.00         0.00       118.00          0        118
sdg               0.00         0.00         0.00          0          0
sdi              22.00         0.00       112.00          0        112
sdh               0.00         0.00         0.00          0          0
sdj               0.00         0.00         0.00          0          0
sdk              21.00         0.00        96.50          0         96
sdl               0.00         0.00         0.00          0          0
sdm               9.00         0.00        64.00          0         64
Nothing? Nothing at all? No IO on the disks? Maybe my bigger flashlight, iotop, could help:
$ iotop
Unfortunately, what I saw was too ghastly to show here, so I decided to omit the iotop screenshots. It was pure horror: six qemu processes eating the physical CPUs alive with IO.
So, no disk IO, but extremely high IO caused by qemu. It had to be network IO then. But all performance counters showed almost no network activity. What if this IO wasn't real, but virtual? It could be the virtual network driver! It had to be the virtual network driver.
I checked the OpenStack configuration. It was set to use the para-virtualized network driver vhost_net.
I checked the running qemu processes. They were also configured to use the para-virtualized network driver.
$ ps aux | grep qemu
libvirt+  6875 66.4  8.3 63752992 11063572 ?  Sl   Sep05 4781:47 /usr/bin/qemu-system-x86_64
    -name instance-000000dd -S ... -netdev tap,fd=25,id=hostnet0,vhost=on,vhostfd=27 ...
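For a cross-check on the libvirt level, one can also look at the interface definition in the guest's domain XML; a minimal sketch, using the instance name from the process listing above:

$ virsh dumpxml instance-000000dd | grep -A 5 "<interface"
# a <model type='virtio'/> element on the guest NIC means the para-virtualized
# driver is configured; with vhost active, the generated qemu command line
# additionally carries vhost=on and a vhostfd, as seen above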
I was getting closer! I checked the kernel modules. Kernel module vhost_net was loaded and active.
$ lsmod | grep net
vhost_net              18104  2
vhost                  29009  1 vhost_net
macvtap                18255  1 vhost_net
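To see whether vhost_net is not just loaded but actually serving a guest, one can also look for its kernel worker threads, which carry the PID of the qemu process they belong to; a sketch, not the exact command history of that night:

$ ps -ef | grep '\[vhost'
# kernel threads named [vhost-<qemu-pid>] only exist while a guest NIC
# is really being handled by vhost_net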
I checked the qemu-kvm configuration and froze.
$ cat /etc/default/qemu-kvm
# To disable qemu-kvm's page merging feature, set KSM_ENABLED=0 and
# sudo restart qemu-kvm
KSM_ENABLED=1
SLEEP_MILLISECS=200
# To load the vhost_net module, which in some cases can speed up
# network performance, set VHOST_NET_ENABLED to 1.
VHOST_NET_ENABLED=0

# Set this to 1 if you want hugepages to be available to kvm under
# /run/hugepages/kvm
KVM_HUGEPAGES=0
vhost_net was disabled by default for qemu-kvm. All packets were going through userspace and qemu instead of being handed directly to the kernel, as vhost_net does! That's where the lag was coming from!
I acted immediately to rescue the victims. I made the huge, extremely complicated, full one-byte change on all our compute nodes by changing VHOST_NET_ENABLED=0 to VHOST_NET_ENABLED=1, restarted all virtual machines, and finally, after days of constant screaming in pain, the flapping between the two load balancers stopped.
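For the record, the change itself boils down to something like this on each compute node; a minimal sketch, assuming the stock Ubuntu 14.04 setup, where /etc/default/qemu-kvm is evaluated by the qemu-kvm job (hence the "sudo restart qemu-kvm" hint in the file itself):

$ sudo sed -i 's/VHOST_NET_ENABLED=0/VHOST_NET_ENABLED=1/' /etc/default/qemu-kvm
$ sudo restart qemu-kvm        # re-run the job so it loads the vhost_net module
$ lsmod | grep vhost_net       # verify the module is present
# running guests pick up vhost=on only after their qemu processes are restarted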
I did it! I saved them!
But I couldn't stop there. I wanted to find out who had done this to the poor little load balancers. Who was behind this conspiracy of crippled network latency?
I knew there was only one way to finally catch the guy. I set a trap. I installed a fresh, clean, virgin Ubuntu 14.04 in a virtual machine and then, well, then I waited — for apt-get install qemu-kvm to finish:
$ sudo apt-get install qemu-kvm
Reading package lists... Done
Building dependency tree
Reading state information... Done
The following extra packages will be installed:
  acl cpu-checker ipxe-qemu libaio1 libasound2 libasound2-data libasyncns0
  libbluetooth3 libboost-system1.54.0 libboost-thread1.54.0 libbrlapi0.6
  libcaca0 libfdt1 libflac8 libjpeg-turbo8 libjpeg8 libnspr4 libnss3
  libnss3-nssdb libogg0 libpulse0 librados2 librbd1 libsdl1.2debian
  libseccomp2 libsndfile1 libspice-server1 libusbredirparser1 libvorbis0a
  libvorbisenc2 libxen-4.4 libxenstore3.0 libyajl2 msr-tools qemu-keymaps
  qemu-system-common qemu-system-x86 qemu-utils seabios sharutils
Suggested packages:
  libasound2-plugins alsa-utils pulseaudio samba vde2 sgabios debootstrap
  bsd-mailx mailx
The following NEW packages will be installed:
  acl cpu-checker ipxe-qemu libaio1 libasound2 libasound2-data libasyncns0
  libbluetooth3 libboost-system1.54.0 libboost-thread1.54.0 libbrlapi0.6
  libcaca0 libfdt1 libflac8 libjpeg-turbo8 libjpeg8 libnspr4 libnss3
  libnss3-nssdb libogg0 libpulse0 librados2 librbd1 libsdl1.2debian
  libseccomp2 libsndfile1 libspice-server1 libusbredirparser1 libvorbis0a
  libvorbisenc2 libxen-4.4 libxenstore3.0 libyajl2 msr-tools qemu-keymaps
  qemu-kvm qemu-system-common qemu-system-x86 qemu-utils seabios sharutils
0 upgraded, 41 newly installed, 0 to remove and 2 not upgraded.
Need to get 3631 kB/8671 kB of archives.
After this operation, 42.0 MB of additional disk space will be used.
Do you want to continue? [Y/n]
...
Setting up qemu-system-x86 (2.0.0+dfsg-2ubuntu1.3) ...
qemu-kvm start/running
Setting up qemu-utils (2.0.0+dfsg-2ubuntu1.3) ...
Processing triggers for ureadahead (0.100.0-16) ...
Setting up qemu-kvm (2.0.0+dfsg-2ubuntu1.3) ...
Processing triggers for libc-bin (2.19-0ubuntu6.3) ...
And then I sprang the trap:
$ cat /etc/default/qemu-kvm
# To disable qemu-kvm's page merging feature, set KSM_ENABLED=0 and
# sudo restart qemu-kvm
KSM_ENABLED=1
SLEEP_MILLISECS=200
# To load the vhost_net module, which in some cases can speed up
# network performance, set VHOST_NET_ENABLED to 1.
VHOST_NET_ENABLED=0

# Set this to 1 if you want hugepages to be available to kvm under
# /run/hugepages/kvm
KVM_HUGEPAGES=0
I could not believe it! It was Ubuntu's own default setting. Ubuntu, the very foundation of our cloud, had decided to turn vhost_net off by default, despite all modern hardware supporting it. Ubuntu was convicted, and I could finally rest.
This is the end of my detective story. I found and arrested the criminal Ubuntu default setting and was able to prevent it from further crippling our virtual network latency.
Please feel free to leave comments and ask questions about the details of my journey. I'm already negotiating to sell the movie rights. But maybe there will be another season of OpenStack Crime Investigation in the future. So stay tuned to the codecentric Blog.