-
Notifications
You must be signed in to change notification settings - Fork 108
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refresh test json templates due to removal of input data #11860
Comments
As I haven't made any progress on this for the last month, I am setting it back to the ToDo queue. |
Our templates have degraded even further and perhaps half of them are now broken. Most common issues are:
Here is a short summary of workflows (templates) and the problems found during Agent 2.3.7 validation:
We will have to work on it ASAP. |
Quick update on the SL6 workflows. They are indeed broken at the moment, but still supported in central production. It will be fixed with a new version of glideinWMS (not yet released). Further details in: https://its.cern.ch/jira/browse/CMSPROD-223 |
I went through our validation checklist and we can probably reduce our templates to about 20. Here is a breadown of the templates and how to organize them: ReReco
DQMHarvest
TaskChain
StepChain (almost a dup of the TaskChain templates - or vice-versa)
In addition, I would suggest to have 1 template of each spec under the |
Impact of the bug
WMCore validation in general
Describe the bug
As we get started with the HG2401 / WMAgent 2.2.6 validation, there are many workflows getting stuck in
assigned
status. Checking MSTransferor logs, one can see that many calls to Rucio are not yielding any results, meaning that data has been completely removed from the grid [1].How to reproduce it
Inject the relevant test json templates
Expected behavior
Matching those datasets against our test json templates, suggest that the following templates need to be remade/refactored because the RelVal data is no longer available:
test/data/ReqMgr/requests/Integration/SC_ReDigi_Harvest_Prod.json
test/data/ReqMgr/requests/Integration/SC_PY3_PURecyc.json
test/data/ReqMgr/requests/Integration/TaskChain_PUMCRecyc.json
and for the non-relval data that has been removed (e.g. DQMIO), the following needs to be remade:
test/data/ReqMgr/requests/DMWM/DQMHarvest_RunWhitelist.json
test/data/ReqMgr/requests/Integration/DQMHarvesting_MultiRun.json
test/data/ReqMgr/requests/Integration/DQMHarvesting.json
test/data/ReqMgr/requests/Integration/DQMHarvesting_LumiMask.json
Additional context and error message
[1] Relevant log from MStransferor
The text was updated successfully, but these errors were encountered: