Frequently Asked Questions (FAQ)#
Pipeline and Configuration version mismatch#
I can’t run the pipeline because it says my TOML file has a version mismatch?
In order to try and manage compatibility for the pipeline, your configuration file has a version
key in it. This key must be compatible (within SemVer) with the installed version of vampires_dpp
. There are two approaches to fixing this:
(Recommended) Call
dpp upgrade
to try to automatically upgrade your configurationDowngrade
vampires_dpp
to match the version in your configuration
I’m getting warnings about centroid files, help!#
The blah blah explain it.
TODO
Performance#
It’s slow. It’s so, so slow. Help.
It’s hard to process data in the volumes that VAMPIRES produces, but there are some tips for speeding it up.
Use an SSD (over USB 3 or thunderbolt)
Faster storage media reduces slowdowns from opening and closing files, which happens a lot throughout the pipeline
Don’t save intermediate files
The time it takes to open a file, write to disk, and close it will add a lot to your overheads, in addition to the huge increase in data volume
Use multi-processing
Using more processes should improve some parts of the pipeline, but don’t expect multiplicative increases in speed since most operations are limited by the storage IO speed.
Tip: multiprocessing with numpy
Make sure when you’re multiprocessing to set the correct environment variables in numpy
OMP_NUM_THREADS=1 dpp run -j <num_proc> <config> <input_filenames>
OMP_NUM_THREADS=1 dpp run -j 4 20230101_ABAur_vampires.toml norm/*.fits