The Scalability Challenge: From a Single Image to a Mock Survey
The pipeline works for a single image, but how does it perform at scale? This week, I pointed RIPPLe at a full tract of mock LSST data, which contains thousands of potential targets. The results were illuminating. While the pipeline completed the task without crashing, the processing time was unacceptably long. The performance metrics clearly indicate that the current, sequential approach to data fetching and preprocessing is a major bottleneck. Each cutout is processed one by one, leaving the GPU idle for long periods. It's clear that to handle the volume of data LSST will produce, a more sophisticated, parallelized workflow is necessary. The next phase of this project will be dedicated to optimization.