Copying symmetry expanded particle sets to scratch takes up a lot of space

Issue #58 resolved
Oliver Clarke created an issue

Hi,

Currently, if I run a refinement or classification with a symmetry expanded particle set, relion copies each particle (the "original" particle and all the symmetry expanded copies) to the scratch drive, taking up n-times more space than the original dataset.

This becomes problematic with large datasets or high order symmetry, and it doesn't seem necessary, since the original stacks are all that is referenced by the star file (with different Euler angles). Would there be any way to alter this behavior to only take up as much space as the original dataset?

Cheers Oli

Comments (7)

  1. Oliver Clarke reporter

    Thanks Takanori - also for 3.1 then (maybe?), it would be great if relion kept track of what is already on scratch, so if I run three jobs in a row with the same settings but different input (e.g three different classification runs), relion does not have to recopy the data each time. For large datasets the copying can take quite a while.

    Cheers Oli

  2. Takanori Nakane

    At LMB, every job goes into a random node in a cluster so RELION deletes all cache before and after a job by default (the two options above change this). And if one owns a personal dedicated workstation, they can copy particles to the cache and replace Extract/jobXXX by a symbolic link to it. Thus, I am afraid to say that the priority for persistent cache system is low here (but I welcome patches).

  3. Log in to comment