Problem:
We are planning to include at least 30 repos for mirroring on a big EC2 instance with enough CPU to support several workers, we were trying to determine the best schedule for cron job to trigger the sync tasks, initially we thought of grouping repos with a certain pattern depending on their size so we will run in batches, let’s say, one group at 1am then next at 2am and so on, but we think that is inefficient since some tasks can finish within seconds when no updates are found, the question is, pulp is able to queue all the syncs if we trigger the tasks for example separated by 1 minute intervals in cron or some other pattern? in this case what I expect is if the server has 8 CPUs then 8 tasks will run and 22 will be in waiting state but at the end all of them will be run? If so what is the limit of this queue?
Thank you for your input!
Expected outcome:
Pulpcore version:
“core”: “3.48.0”,
Pulp plugins installed and their versions:
“versions”: {
“rpm”: “3.25.1”,
“core”: “3.48.0”,
“file”: “3.48.0”,
“certguard”: “3.48.0”
},
Operating system - distribution and version:
RHEL 9
Other relevant data: