ODB should be saved to disk periodically.

Issue #367 resolved
dd1 created an issue

See discussion on midas forum: https://daq00.triumf.ca/elog-midas/Midas/2539

Current saving of ODB on each db_close_database() slows down scripts that use odbedit (if the ODB is big and the disk is slow). It also stops all other ODB users for the time required to finish writing the ODB to disk (open/atomic write/close).

The originally intended scheme of saving ODB after shutdown of the last client results in never saving ODB to disk for most long-running experiments, so ODB contents and changes get lost on a crash, power loss or system reboot.

Ideally we should save ODB periodically (every 1 minute?) and do the writing to disk without holding the ODB lock. I.e. lock ODB, copy the ODB shared memory to a local memory buffer, unlock ODB, write the local memory buffer to disk.
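The lock/copy/unlock/write sequence above could look something like this minimal sketch (not MIDAS code: a pthread mutex stands in for the ODB semaphore and a plain byte array for the ODB shared memory):

```c
#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-ins: a mutex in place of the MIDAS ODB semaphore,
   and a plain byte array in place of the ODB shared memory. */
static pthread_mutex_t odb_lock = PTHREAD_MUTEX_INITIALIZER;
static char odb_shm[1024] = "odb contents";

/* Lock ODB, copy shared memory to a local buffer, unlock, then do the
   slow disk write without holding the lock. Returns 0 on success. */
int flush_odb(const char *path)
{
   char *copy = malloc(sizeof(odb_shm));
   if (!copy)
      return -1;              /* caller may fall back to a locked write */

   pthread_mutex_lock(&odb_lock);
   memcpy(copy, odb_shm, sizeof(odb_shm));
   pthread_mutex_unlock(&odb_lock);   /* other ODB clients proceed now */

   FILE *f = fopen(path, "wb");
   if (!f) {
      free(copy);
      return -1;
   }
   fwrite(copy, 1, sizeof(odb_shm), f);
   fclose(f);
   free(copy);
   return 0;
}
```

The point of the pattern is that the only work done under the lock is a memcpy(), which is fast; the slow open/write/close happens on the private copy.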

Challenges: two or more programs may try to do this at the same time (additional semaphore?), and memory allocation for the ODB copy buffer may fail (if the ODB is big and memory is low, i.e. a 100 Mbyte ODB on a 512 Mbyte RAM machine, typical for ARM SoCs, but even a 2 GB ODB on a 64 GB machine if 60 GBytes happen to be used by a simulation, a reconstruction job or just a memory leak). If writing ODB is done in a thread, it introduces multithreading into otherwise single-threaded programs, like odbedit, mserver and mlogger.

Perhaps instead of a thread one could use fork()/vfork(), but I am not sure how this would work on macos.
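For illustration, a fork()-based flush might look like the following sketch (hypothetical code, not from MIDAS). Two caveats worth noting: real SysV/POSIX shared memory is mapped shared and is NOT snapshotted by fork(), so the ODB copy would still have to be taken under the lock first; and vfork() is unsuitable here because the child touches memory before exiting.

```c
#include <stdio.h>
#include <stdlib.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical: a private buffer holding the ODB image, copied under
   the ODB lock before flush_in_child() is called. */
static char odb_copy[256] = "odb image";

/* Fork a child that performs the slow disk write; the parent can
   continue working. Returns the child's exit status (0 on success). */
int flush_in_child(const char *path)
{
   pid_t pid = fork();
   if (pid < 0)
      return -1;
   if (pid == 0) {                    /* child: do the disk write */
      FILE *f = fopen(path, "wb");
      if (f) {
         fwrite(odb_copy, 1, sizeof(odb_copy), f);
         fclose(f);
         _exit(0);
      }
      _exit(1);
   }
   /* parent: reap the child to avoid zombies (synchronously here for
      simplicity; a real implementation would handle SIGCHLD instead
      so the parent is not blocked during the write) */
   int status = 0;
   waitpid(pid, &status, 0);
   return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```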

K.O.

Comments (8)

  1. Stefan Ritt

    I’m tempted to put the flushing at the EOR of the logger. This way we have it in one well defined location, and we know when it happens (after the end of each run). This would solve the race condition between processes. Problem: If we don’t start/stop runs, no flush will happen. Concerning the memory allocation, I think it should be: first copy, then release the lock, then write. If the memory allocation fails, we write directly without the copy. Takes longer, but it is the best we can do (and the same as we have now).

    Stefan

  2. dd1 reporter

    concur on flush at EOR (and BOR?). concur on write-without-copy if memory allocation fails. also add flush on exit from odbedit (with a --no-flush-odb flag), a periodic flush in mhttpd and we have our bases covered. experiments that do not start/stop runs, do not run mhttpd and do not use odbedit form an empty set. manual changes to odb are done by odbedit (saved-to-disk on exit, as expected) and by mhttpd web interface (saved-to-disk periodically). K.O.

  3. Stefan Ritt

    Ok I implemented some periodic flushing. Here is what I did:

    • Created

      /System/Flush/Flush period : TID_UINT32
      /System/Flush/Last flush   : TID_UINT32

    which control the flushing to disk. The default value for “Flush period” is 60 seconds.

    • All clients call db_flush_database() through their cm_yield() function
    • db_flush_database() checks the “Last flush” and only flushes the ODB when the period has expired. This test is done inside the ODB semaphore so that we don’t get a race condition
    • If the period has expired, db_flush_database() calls ss_shm_flush()
    • ss_shm_flush() tries to allocate a buffer the size of the shared memory. If the allocation is not successful (out of memory), ss_shm_flush() writes directly to the binary file as before.
    • If the allocation is successful, ss_shm_flush() copies the shared memory to a buffer and passes this buffer to a dedicated thread which writes the buffer to the binary file. This causes ss_shm_flush() to return immediately and not block the calling program during the disk write operation.
    • Added back the “if (destroy_flag) ss_shm_flush()” so that the ODB is flushed for sure before the shared memory gets deleted.
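The period check and thread hand-off described above can be sketched roughly as follows (hypothetical code: flush_period and last_flush stand in for the "/System/Flush" ODB keys, a mutex for the ODB semaphore; these are not the real MIDAS internals):

```c
#include <pthread.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

static pthread_mutex_t odb_lock = PTHREAD_MUTEX_INITIALIZER;
static uint32_t flush_period = 60;     /* "Flush period", seconds */
static uint32_t last_flush   = 0;      /* "Last flush", unix time */
static char odb_shm[256] = "odb image";

/* dedicated writer thread: runs after the caller has already returned */
static void *writer_thread(void *arg)
{
   char *buf = (char *)arg;
   FILE *f = fopen("odb_image.bin", "wb");
   if (f) {
      fwrite(buf, 1, sizeof(odb_shm), f);
      fclose(f);
   }
   free(buf);
   return NULL;
}

/* returns 1 if a flush was started, 0 if the period has not expired */
int flush_if_due(uint32_t now)
{
   pthread_mutex_lock(&odb_lock);      /* period test inside the lock */
   if (now - last_flush < flush_period) {
      pthread_mutex_unlock(&odb_lock);
      return 0;
   }
   last_flush = now;
   char *copy = malloc(sizeof(odb_shm));
   if (copy)
      memcpy(copy, odb_shm, sizeof(odb_shm));
   pthread_mutex_unlock(&odb_lock);

   if (!copy) {                        /* out of memory: write directly */
      FILE *f = fopen("odb_image.bin", "wb");
      if (f) {
         fwrite(odb_shm, 1, sizeof(odb_shm), f);
         fclose(f);
      }
      return 1;
   }
   /* hand the copy to a writer thread and return immediately */
   pthread_t t;
   pthread_create(&t, NULL, writer_thread, copy);
   pthread_detach(t);                  /* detached here for brevity */
   return 1;
}
```

Because "Last flush" is updated inside the lock before the lock is released, only one client starts a flush per period even if several call flush_if_due() at the same time.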

    This means that now, under normal circumstances, exiting programs like odbedit do NOT flush the ODB. This allows calling many “odbedit -c” commands in a row without the flush penalty. Nevertheless, the ODB then gets flushed by other clients at most 60 seconds (or whatever the flush period is) after odbedit exits.

    Please note that ODB flushing has two purposes:

    1. When all programs exit, we need a persistent storage for the ODB. In most experiments this happens only rarely, maybe at the end of a beam time period.
    2. If the computer crashes, a recent version of the ODB is kept on disk to simplify recovery after the crash.

    Since crashes are not so frequent (during production periods we have maybe one hardware failure every few years), flushing the ODB too often does not make sense and just consumes resources. Flushing also does not help against corrupted ODBs, since the binary image will get corrupted as well. So the only reason for periodic flushes is to ease recovery after a total crash. I put the default to 60 seconds, but if people are really paranoid they can decrease it to 10 seconds or so. Or increase it to 600 seconds if their system does not crash every week and disks are slow.

    I made a dedicated branch feature/periodic_odb_flush so people can test the new functionality. If there are no complaints within the next few days, I will merge that into develop.

    Stefan

  4. Stefan Ritt

    After quite some testing of the periodic flushing of the ODB shared memory, I merged this feature branch into develop today.

  5. dd1 reporter

    there was one bug - a race condition between the flush thread and the program (i.e. odbedit) exiting. when a program exits without waiting for/reaping/joining all its threads, they are silently killed. best I can tell, normally the flush thread will be killed while it is inside the main write(), so the odb contents do get written to disk, but this is not guaranteed. the correct way is to wait for the thread to finish before exiting the program. two small buglets - a thread leak (no join/reap) and a missing write lock when updating the “last flushed” timestamp in ODB. K.O.
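The join-before-exit fix described here can be sketched as follows (hypothetical names, not the actual MIDAS patch): keep the writer thread handle instead of detaching it, and join it from the shutdown path so an in-flight flush completes and the thread is reaped.

```c
#include <pthread.h>

static pthread_t flush_thread;
static int flush_thread_active = 0;

/* placeholder for the real disk-writing thread body */
static void *writer(void *arg)
{
   (void)arg;
   /* ... write the ODB buffer to disk ... */
   return NULL;
}

void start_flush(void)
{
   if (flush_thread_active)
      pthread_join(flush_thread, NULL);   /* reap the previous flush:
                                             fixes the thread leak */
   pthread_create(&flush_thread, NULL, writer, NULL);
   flush_thread_active = 1;
}

/* call from the program's shutdown path, before exit();
   returns 0 on success (pthread_join convention) */
int join_flush_thread(void)
{
   if (!flush_thread_active)
      return 0;
   flush_thread_active = 0;
   return pthread_join(flush_thread, NULL);
}
```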

  6. Stefan Ritt

    From your message it’s not clear if you really fixed the race condition. I see some code of yours there. Do you consider this fixed now? If so, can you close this issue?

    Thanks,
    Stefan
