In our pulp3 instance, there is a cron job doing repo sync once per day. The retain_repo_versions is set to 1. What the cron does for each rpm repo:
- do sync
- do publication (for this step, even there is not change in repo, a new publication still created)
- do distribution
For the moment, we have 39 rpm repo. Thepulp rpm distribution list
shows 39 rpm distributions, which corresponds to the total number of repos.
The pulp rpm distribution list
shows 431 publications, much higher, which is normal, since the cron job do repo publication every. The repo which changes less frequently has more publication “duplicated”, eg rocky8_extras, there are 2 versions, but there are 47 rpm publications pointing to the same version 1:
eg:
1 "repository_version": "/pulp/api/v3/repositories/rpm/rpm/019097b6-a5db-7d0d-bf25-4cf2f9aa0c4a/versions/0/",
47 "repository_version": "/pulp/api/v3/repositories/rpm/rpm/019097b6-a5db-7d0d-bf25-4cf2f9aa0c4a/versions/1/",
I am wondering, is it worth to cleaning rpm publications
to remove the duplicated ones? or reduce the sync frequence for these repo? Or Am I worried for nothing cos it is a normal thing, just leave as it is?
I suppose rpm publication
resouce doesn’t take much disk resources?
Pulpcore version:
“component”: “core”,
“version”: “3.54.1”,
“package”: “pulpcore”,
“module”: “pulpcore.app”,
“domain_compatible”: true
Pulp plugins installed and their versions:
{
“component”: “rpm”,
“version”: “3.27.1”,
“package”: “pulp-rpm”,
“module”: “pulp_rpm.app”,
“domain_compatible”: true
},
Operating system - distribution and version:
RedHat 9
Other relevant data: