"Key addition failed" blocks web interface

Create issue
Issue #61 new
diafygi created an issue

I'm seeing the following logs fairly often in db.log:

2018-07-19 18:31:16 add_keys_merge failed: Eventloop.SigAlarm
2018-07-19 18:31:16 Key addition failed: Eventloop.SigAlarm

When these logs happen, the web interface becomes unresponsive and sks db process spikes to 100% CPU. I guess this key merging is blocking the web interface from serving requests. I don't mind the CPU spike (it's only one thread), but the web unresponsiveness is getting me kicked out of the sks-keyservers.net pool frequently.

Possible solutions:

  • Split key merging into a separate process from serving web requests. I don't mind running another process (maybe something like sks keymerge) that will handle the high CPU spikes.
  • Figure out a way to not spike the CPU so high and not block web requests when merging huge keys. No idea if this is possible given the current code architecture.

Related mailing list threads:

Comments (5)

  1. Fleish

    I'm not sure extending the web timeout will solve the SigAlarm issue. I'm currently experiencing it and it seems to be related to trying to process certain keys received during the recon process. I've been trying to sort out which one is the issue and came up with these 2 common hashes that were always mentioned in the logs in the run leading up to when a server reported the failure:

    2309DD6AF2606DDCD801307FAC024E28

    5699F92E6316448E43FB6D18C6F0943F

  2. Yegor Timoshenko

    Would like to quote https://lists.nongnu.org/archive/html/sks-devel/2018-11/msg00074.html here. If someone wants to momentarily fix the issue, you can apply the following patch: https://lists.nongnu.org/archive/html/sks-devel/2018-07/msg00053.html

    However, all the poison key that's causing the issue is it's just a normal key with a lot of user packets where 5-10MB chunks of user packets were sent to different keyservers (see #60 for repro). Anyone can generate it.

    I'm not really sure how to fix the root cause of this issue, that recon goes on and on trying to merge this key. This issue in its current form means we can cause complete denial of service to any key we want to target, say we can make it so server won't be able to sync real (i.e. signed) user packets.

    Even if we check for signatures (see #41), that same attack could have used cryptographically sound signature packets and deny user any further changes to the key, destroying the network at the same time.

    If anyone has suggestions how to fix this that don't devolve into denial of service on any level (be it per-key or whole network), please tell!

  3. Log in to comment