Lazylogger Segmentation fault

Issue #40 resolved
Ryan Loots created an issue

Hi

The lazylogger from the latest git pull still seems to cause a segmentation fault when trying to do a disk -> disk copy.

I found this "bug" a while back, but since we've had our own fork of midas for some time, I fixed it there and then forgot about it. I am now busy merging our fork of midas into the latest upstream (git) version, so I ran into this "bug" again.

Below is a breakdown of the error.

I'll post the gdb output and the offending code segment, as well as my impromptu fix.

#gdb segmentation fault output

Reading symbols from /home/xxx/new_midas/midas/linux/bin/lazylogger...done.
(gdb) run -c DISK
Starting program: /home/xxx/new_midas/midas/linux/bin/lazylogger -c DISK
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
[Lazy,ERROR] [odb.c:1229:db_open_database,ERROR] Removed ODB client 'Lazy_Disk', index 3 because process pid 6121 does not exists
[Lazy,INFO] Updated notify_count of "/Experiment/Security/RPC hosts/Allowed hosts" from 4 to 3
[Lazy,INFO] Removed open record flag from "/Runinfo/State"
[Lazy,INFO] Removed exclusive access mode from "/Runinfo/State"
[Lazy,INFO] Removed open record flag from "/Lazy/Disk/Settings"
[Lazy,INFO] Removed exclusive access mode from "/Lazy/Disk/Settings"
[Lazy,INFO] Removed open record flag from "/Lazy/Disk/Statistics"
[Lazy,INFO] Removed exclusive access mode from "/Lazy/Disk/Statistics"
[Lazy,INFO] Corrected 4 ODB entries
[Lazy,INFO] Deleted entry '/System/Clients/6121' for client 'Lazy_Disk' because it is not connected to ODB
Lazy_Disk starting... ! to exit 
output disk size 694119.8 MiB, free 63950.3 MiB
[Lazy_Disk,INFO] Output file '/home/xxx/new_midas/online/data/run00001.mid.gz' already exists, removing
[Lazy_Disk,INFO] Starting lazy_disk_copy '/home/xxx/online/data/run00001.mid.gz' to '/home/xxx/new_midas/online/data/run00001.mid.gz'
[Lazy_Disk,INFO] Starting lazy_disk_copy '/home/xxx/online/data/run00001.mid.gz' to '/home/xxx/new_midas/online/data/run00001.mid.gz'

Program received signal SIGSEGV, Segmentation fault.
lazy_disk_copy_loop (outfile=0x7fffffffdbc0 "/home/xxx/new_midas/online/data/run00001.mid.gz", infile=0x7fffffffdb40 "/home/xxx/online/data/run00001.mid.gz", fpout=0x6a9ff0, 
    fpin=0x6a9940) at src/lazylogger.cxx:1326
1326    {
(gdb) bt
#0  lazy_disk_copy_loop (outfile=0x7fffffffdbc0 "/home/xxx/new_midas/online/data/run00001.mid.gz", infile=0x7fffffffdb40 "/home/xxx/online/data/run00001.mid.gz", fpout=0x6a9ff0, 
    fpin=0x6a9940) at src/lazylogger.cxx:1326
#1  0x0000000000406ff9 in lazy_disk_copy (outfile=0x7fffffffdbc0 "/home/xxx/new_midas/online/data/run00001.mid.gz", infile=0x7fffffffdb40 "/home/xxx/online/data/run00001.mid.gz")
    at src/lazylogger.cxx:1468
#2  0x000000000040a3bf in lazy_main (channel=<optimized out>, pLall=<optimized out>) at src/lazylogger.cxx:2080
#3  0x00000000004047f9 in main (argc=1, argv=0x662d00 <lazyinfo>) at src/lazylogger.cxx:2506

#Setting a breakpoint

(gdb) run -c Disk
Starting program: /home/xxx/new_midas/midas/linux/bin/lazylogger -c Disk
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
Lazy_Disk starting... ! to exit 
output disk size 694119.8 MiB, free 63771.5 MiB
[Lazy_Disk,INFO] Output file '/home/xxx/new_midas/online/data/run00001.mid.gz' already exists, removing
[Lazy_Disk,INFO] Starting lazy_disk_copy '/home/xxx/online/data/run00001.mid.gz' to '/home/xxx/new_midas/online/data/run00001.mid.gz'
[Lazy_Disk,INFO] Starting lazy_disk_copy '/home/xxx/online/data/run00001.mid.gz' to '/home/xxx/new_midas/online/data/run00001.mid.gz'

Breakpoint 1, lazy_disk_copy_loop (outfile=0x7fffffffdbc0 "/home/xxx/new_midas/online/data/run00001.mid.gz", infile=0x7fffffffdb40 "/home/xxx/online/data/run00001.mid.gz", 
    fpout=0x6a9ff0, fpin=0x7fffffffd8a0) at src/lazylogger.cxx:1326
1326    {

(gdb) next
1338             int rd = fread(buf, 1, kBufSize, fpin);
(gdb) print kBufSize
$1 = <optimized out>
(gdb) print buf
Cannot access memory at address 0x7fffff5fd840

#Offending code snippet

int lazy_disk_copy_loop(const char *outfile, const char *infile, FILE* fpout, FILE* fpin)
{
.
.
.
 /* infinite loop while copying */
   while (1) {
      if (copy_continue) {
         const int kBufSize = 10*1024*1024;
         char buf[kBufSize];
         int rd = fread(buf, 1, kBufSize, fpin);

#Compiler

gcc -v: gcc version 4.8.5 20150623 (Red Hat 4.8.5-4) (GCC)

I also tested this on Debian with gcc version 4.9.2 (Debian 4.9.2-10); the exact same error was produced.

I've included the diff file comparing our older forked version of midas with the latest git pull.

The last commit we have from this git pull is:

commit e173b03742e7186288d425606ca831fa7401e958
Author: Stefan Ritt <stefan.ritt@psi.ch>
Date:   Tue Sep 27 13:13:16 2016 +0200

#ODB Settings

Period                          10
Maintain free space (%)         0
Stay behind                     0
Alarm Class                     
Running condition               ALWAYS
Data dir                        /home/xxx/online/data
Data format                     MIDAS
Filename format                 run%05d.mid.gz
Backup type                     Disk
Execute after rewind            
Path                            /home/xxx/new_midas/online/data
Capacity (Bytes)                5e+09
List label                      Disk
Execute before writing file     
Execute after writing file      
Modulo.Position                 
Tape Data Append                n

Maybe I overlooked something that caused the seg fault?

Comments (13)

  1. dd1

    The lazylogger code has not been touched for a long time, but there is a possibility of a bug. We mostly use the "SCRIPT" method for writing to CASTOR and EOS (at CERN) and DCACHE (at TRIUMF). The current implementation of the "DISK" method was done to test writing to HADOOP HDFS (it worked just fine), but it was never used in production, AFAIK (maybe in DEAP at SNOLAB?). K.O.

  2. dd1

    This rings a bell, this rings a bell. I just fixed the same problem in mhttpd. The problem is the large array being allocated on the stack (local variables inside functions are allocated on the stack). This used to work, but "suddenly" started to fail with segfaults. It was never clear how much stack space is actually available (there is contention for stack space between different threads), and when stack space runs out the only possible result is a segfault (whereas malloc() can at least return NULL). So maybe there was a recent change in the stack layouts, resulting in reduced available stack size. K.O.
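
    For reference, the stack limit the process actually got can be queried at runtime with getrlimit() - a minimal check, assuming Linux (illustration only, not midas code):

    #include <cstdio>
    #include <sys/resource.h>

    int main()
    {
       struct rlimit rl;
       // RLIMIT_STACK is the same per-process stack size limit that "ulimit -s" reports
       if (getrlimit(RLIMIT_STACK, &rl) == 0)
          std::printf("stack limit: soft %lu bytes, hard %lu bytes\n",
                      (unsigned long) rl.rlim_cur, (unsigned long) rl.rlim_max);
       return 0;
    }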

  3. dd1

    Ok, check this out. Run "ulimit -a" to check the user limits and observe "stack size": my CentOS 7 machine says 8192 kbytes, which is around 8 Mbytes (not sure if those are kbytes or kibytes, 1000 vs 1024). lazylogger demands 10 Mbytes, so it will crash for sure. Ouch. Your fix is correct: allocate the buffer on the heap using malloc(). K.O.
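
    For illustration, a heap-based copy loop could look roughly like this (a sketch only, not the attached diff and not the actual lazylogger code paths; copy_stream is a made-up name):

    #include <cstdio>
    #include <cstdlib>

    /* hypothetical stripped-down copy loop: the 10 MiB buffer lives on the
       heap, so it no longer competes with the ~8 MiB default stack limit */
    static int copy_stream(FILE* fpin, FILE* fpout)
    {
       const int kBufSize = 10*1024*1024;
       char *buf = (char*) malloc(kBufSize);
       if (buf == NULL)
          return -1;       /* malloc() can at least fail gracefully; a stack overflow cannot */
       int status = 0;
       while (1) {
          int rd = fread(buf, 1, kBufSize, fpin);
          if (rd <= 0)
             break;
          int wr = fwrite(buf, 1, rd, fpout);
          if (wr != rd) {
             status = -1;
             break;
          }
       }
       free(buf);          /* the catch: every exit path now has to release the buffer */
       return status;
    }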

  4. dd1

    It is impossible to replace the stack allocation with malloc() without rewriting the function completely - there are too many "return" paths everywhere, and I do not want to replace a crash with a memory leak. So: reduce the buffer size to be smaller than the stack size limit. Kludge, kludge, kludge. K.O.
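
    In code terms the kludge amounts to nothing more than this (sketch; the exact value chosen in the actual commit may differ):

       const int kBufSize = 1024*1024;   /* 1 MiB: safely below the 8 MiB default stack limit */
       char buf[kBufSize];
       int rd = fread(buf, 1, kBufSize, fpin);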

  5. Stefan Ritt
    auto buf = std::unique_ptr<char, decltype(free)*>{
                reinterpret_cast<char *>(malloc(10*1024*1024)),
                free };
    

    :-D

  6. Stefan Ritt

    Well, you were the first to come up with std::string. This is just a bit more advanced C++. A unique_ptr calls free() via its destructor when it goes out of scope, so no explicit free() is needed. Cool, eh? Shall I bring a C++11 book with me?
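
    A tiny standalone demo of the idea (illustration only, not midas code):

    #include <cstdio>
    #include <cstdlib>
    #include <memory>

    int main()
    {
       {
          auto buf = std::unique_ptr<char, decltype(free)*>{
                      reinterpret_cast<char *>(malloc(10*1024*1024)),
                      free };
          std::printf("buffer at %p\n", static_cast<void *>(buf.get()));
       }   // buf goes out of scope here and its deleter (free) runs automatically
       return 0;
    }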

  7. dd1

    It may be C++11, but it is unreadable gibberish; might as well program in C#. Behind the coolness and verbosity hides a performance bug: instead of allocating the buffer once, it is allocated and freed on each read/write cycle. The on-stack implementation did not have this bug because on-stack allocation is "free". The fix is to move the allocation to the outermost scope of the function. Also, one might as well use the C++ allocator "new" instead of malloc/free. BTW, use of C++11 makes the code unbuildable on SL6, where the standard compiler cannot do C++11. K.O.
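
    I.e. something along these lines, allocating once before the while(1) loop instead of once per read/write cycle (sketch only; the real function has more going on):

       const int kBufSize = 10*1024*1024;
       char *buf = new char[kBufSize];   /* or std::vector<char> buf(kBufSize), which frees itself */
       while (1) {
          if (copy_continue) {
             int rd = fread(buf, 1, kBufSize, fpin);
             /* ... fwrite to fpout, bookkeeping, break on EOF or error ... */
          }
       }
       delete[] buf;   /* still has to happen on every return path, which is the annoying part */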

  8. Stefan Ritt

    Well, when you put this into midas for the first time:

    std::cout << "Filename ="+fn << std::endl;
    

    it also looked like gibberish Klingon to me. And you just did that without asking anybody. I was a bit confused, but finally bit the bullet: I read some C++ books and even took a C++ course, and finally I got convinced that C++ is the way to go. My favourite C++ program is this one, which counts occurrences of words in a sentence:

    #include <iostream>
    #include <string>
    #include <vector>
    #include <map>
    
    using namespace std;
    
    int main()
    {
       vector<string> w = { "one", "two", "two", "three", "one" };
       map <string, int> words;
    
       for (const auto &i : w)
          words[i]++;
    
       for (const auto &i : words)
          cout << i.first << " : " << i.second << endl;
    
       return 0;
    }
    

    Try it! And then tell me how many lines of code it would take to program this in C!

    There are many more nice features in C++ 11 and later, like standard threads, mutexes with timeouts (I know you like that!), etc. There is even a fantastic JSON parser at

    https://github.com/nlohmann/json

    which allows you things like

    json j = {
      {"pi", 3.141},
      {"happy", true},
      {"answer", {
        {"everything", 42}
      }},
      {"list", {1, 0, 2}},
      {"object", {
        {"currency", "USD"},
        {"value", 42.99}
      }}
    };
    

    right in your C++ code!

    One should not neglect this because learning all this might take a bit of time. I finally went through this, driven by your C++ additions, now I expect you to do the same.

    Concerning the buffer allocation, I agree there is some overhead, but it's small. Try the following program:

    #include <iostream>
    #include <chrono>
    #include <cstring>
    #include <memory>
    
    #define SIZE 100*1024 // 100kB
    
    void mem_stack()
    {
       char buf[SIZE];
       std::memset(buf, 0, SIZE);
    }
    
    void mem_malloc()
    {
       std::unique_ptr<char[]> buf(new char[SIZE]);
       std::memset(&buf[0], 0, SIZE);
    }
    
    int main(int argc, const char * argv[]) {
    
       auto start = std::chrono::system_clock::now();
       std::chrono::duration<double> diff;
       int i = 0;
       do {
          for (int j=0 ; j<100 ; j++)
             mem_stack();
          auto now = std::chrono::system_clock::now();
          diff = now-start;
          i += 100;
       } while (diff.count() < 1);
    
       std::cout << "Stack allocation: " << i << " calls/s" << std::endl;
    
       start = std::chrono::system_clock::now();
       i = 0;
       do {
          for (int j=0 ; j<100 ; j++)
             mem_malloc();
          auto now = std::chrono::system_clock::now();
          diff = now-start;
          i += 100;
       } while (diff.count() < 1);
    
       std::cout << "Heap allocation:  " << i << " calls/s" << std::endl;
    
    
       return 0;
    }
    

    It compares stack allocation with dynamic heap allocation, using std::unique_ptr, which automatically de-allocates the array. I use a typical event size of 100k, and to do something with the buffer I fill it with zeros. On my laptop I get:

    Stack allocation: 325700 calls/s
    Heap allocation:  322300 calls/s
    

    telling me that dynamic memory allocation is "cheap": you can still do >> 100'000 events per second. It might have been different some years ago, but that has changed.

    And BTW, this program above compiles fine under SL5.1 with gcc 4.6.3 (just use the -std=c++0x flag).

    Stefan
