dirac-dms-remove-catalog-replicas doesn't work? #7224
-
One of our sites has disk corruption. I tried to remove the replicas on its SE from the DFC and got the following errors:
I ran it as dirac_admin with the "FileCatalogManagement" property. The file "t_prodution" contains a list of files located in JINR_EOS.
-
Can you check whether an error is printed on the server side? Which DIRAC version are you using?
-
I found the reason. The problem occurs when the list contains roughly 1 million files or more. When I split the file list into smaller lists of 10 thousand files each, it works fine. It looks like the command can't handle removing replicas for more than about 1 million files in one call.
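For anyone hitting the same limit, the splitting step described above could look roughly like the sketch below. It assumes the list is a plain text file with one LFN per line; the function names `chunked` and `split_lfn_list` and the `.partNNNN` suffix are made up for illustration and are not part of DIRAC.

```python
from pathlib import Path
from typing import Iterator, List


def chunked(lines: List[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of `lines`, each holding at most `size` items."""
    for start in range(0, len(lines), size):
        yield lines[start:start + size]


def split_lfn_list(source: Path, out_dir: Path, chunk_size: int = 10_000) -> List[Path]:
    """Split a one-LFN-per-line file into smaller list files.

    Hypothetical helper: writes <source name>.part0000, .part0001, ...
    into `out_dir` and returns the paths in order. Blank lines are skipped.
    """
    lfns = [line.strip() for line in source.read_text().splitlines() if line.strip()]
    out_dir.mkdir(parents=True, exist_ok=True)
    written: List[Path] = []
    for index, chunk in enumerate(chunked(lfns, chunk_size)):
        target = out_dir / f"{source.name}.part{index:04d}"
        target.write_text("\n".join(chunk) + "\n")
        written.append(target)
    return written
```

Calling `split_lfn_list(Path("t_prodution"), Path("chunks"))` would write the smaller lists into a `chunks` directory; each of those files can then be given to the removal command in place of the original list.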
-
But I found another problem. After I finished the cleanup, the find command returned no files (dirac-dms-find-lfns -ddd --SE JINR-EOS Path=/juno/production vo=juno). But when I run "size -l /juno/production", I can still see replicas listed for JINR_EOS:
StorageElement Size Replicas
1 IHEP-STORM 96,822,387,786,900 1054954
It looks like the "size" command doesn't give the correct numbers. I have no idea what happened.
-
"rebuild" didn't work. After rebuilding, "size -l" failed.
-
It seems "rebuild" keeps running in the background after the command has returned. After a few hours, I tried again and "size -l" worked:
StorageElement Size Replicas