Documentation for cryoDRGN2.
CryoDRGN2 commands are now natively available in the top-of-tree on github, and will be available in the next official cryoDRGN version 2.0 release.
There are two commands for ab initio reconstruction: cryodrgn abinit_homo
(homogeneous ab initio reconstruction) and cryodrgn abinit_het
(heterogeneous ab initio reconstruction):
# homogeneous ab initio
cryodrgn abinit_homo -h
# heterogeneous ab initio
cryodrgn abinit_het -h
Downsample your particles to a box size of 128 either with cryodrgn downsample
or with other tools.
If you have a very large dataset (>500k images), I would recommend training on a subset of particles for initial testing. Use cryodrgn_utils select_random
to select a random subset of particles.
# get a random selection of 200k particles from a dataset of 1,423,124 particles
cryodrgn_utils select_random 1423124 -n 200000 -o ind200k.pkl
--ind ind200k.pkl
For reference, ab initio heterogeneous reconstruction of a recent dataset of ~218k 128x128 particles took 20 hours to train (1 V100 GPU).
# homogeneous reconstruction
cryodrgn abinit_homo [particles] --ctf [ctf.pkl] -o [output_directory] >> output.log
# heterogeneous reconstruction
cryodrgn abinit_het [particles] --ctf [ctf.pkl] --zdim 8 -o [output_directory] >> output.log
We have found that cryoDRGN2’s homogeneous reconstruction can work better than traditional ab initio approaches for challenging heterogeneous datasets, e.g. the RAG1-RAG2 complex (EMPIAR-10049) and the pre-catalytic spliceosome (EMPIAR-10076), potentially due regularization properties of the neural model. It may be interesting to try cryodrgn abinit_homo
for generating a consensus reconstruction or initial model. Please let us know if you find something interesting!
--t-extent 40
).--ps-freq
)cryodrgn analyze
), then extend training to 60 epochs. You can extend by rerunning with -n 60 --load latest
. If your dataset is very large, you may want to reduce --ps-freq
and -n
.