DOCS: Make it clear that 'Eqw' jobs remain in the queue until the error is fixed or the job is qdel:ed #120

Open
HenrikBengtsson opened this issue May 6, 2023 · 0 comments

HenrikBengtsson commented May 6, 2023

There are several left-over jobs in the error state (Eqw) that just sit in the queue.

$ qstat -u '*' | grep -E "\bEqw\b" | wc -l
302

$ qstat -u '*' | grep -E "\bEqw\b" | head
 999111 0.27944 dscpileup. alice     Eqw  01/06/2023 22:13:21    1
1123487 0.08937 GEXA7      bob       Eqw  01/17/2023 19:14:18   16
1123493 0.08937 GEXB1      bob       Eqw  01/17/2023 19:15:18   16
1123501 0.08937 GEXA7      bob       Eqw  01/17/2023 19:31:23   16
1123517 0.08875 GEXA7      bob       Eqw  01/17/2023 19:45:46   16
 970748 0.08045 nf-DADA2_A charlie   Eqw  03/16/2023 22:27:33   16
 971176 0.08007 nf-DADA2_A charlie   Eqw  03/16/2023 22:32:03   16
2423870 0.07663 dask-worke carol     Eqw  04/06/2023 14:32:48    1
2424271 0.07663 dask-worke carol     Eqw  04/06/2023 15:21:23    1
2423840 0.07662 dask-worke carol     Eqw  04/06/2023 14:32:47    1
...
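
To see who owns these jobs, the same listing can be grouped by user. This is only a rough sketch: the helper name `eqw_by_user` is made up for illustration, and it assumes the default `qstat` column layout shown above, with the user in column 4 and the state in column 5.

```shell
# Hypothetical helper: count Eqw jobs per user.
# Reads `qstat -u '*'` output on stdin; assumes user = column 4, state = column 5.
eqw_by_user() {
  awk '$5 == "Eqw" { n[$4]++ } END { for (u in n) print n[u], u }' |
    sort -k1,1nr -k2,2   # highest count first, ties sorted by user name
}
```

It would be used as `qstat -u '*' | eqw_by_user`, which prints one `count user` line per user with at least one Eqw job.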

At a minimum, we should explain that these jobs stay in the queue forever, unless the underlying error gets fixed, or the job is qdel:ed by the user.
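
For the docs, something along these lines might show users how to find and deal with their own Eqw jobs. It is a sketch, not tested on this cluster: `eqw_job_ids` is a hypothetical helper name, it assumes the default `qstat` column layout (job ID in column 1, user in column 4, state in column 5), and it assumes user names are short enough that `qstat` does not truncate them.

```shell
# Hypothetical helper: print the job IDs of Eqw jobs owned by user $1.
# Reads `qstat` output on stdin.
eqw_job_ids() {
  awk -v user="$1" '$4 == user && $5 == "Eqw" { print $1 }'
}

# Possible usage on a submit host (commented out; requires the SGE client tools):
#   for id in $(qstat -u "$USER" | eqw_job_ids "$USER"); do
#     qstat -j "$id" | grep 'error reason'   # show why the job is in the error state
#   done
#   qmod -cj <job_id>   # clear the error state once the underlying cause is fixed
#   qdel <job_id>       # or remove the job from the queue
```

Clearing the error state only helps if the cause reported by `qstat -j` has actually been fixed; otherwise the job would be expected to end up in Eqw again.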

PS. SGE keeps spending time on these jobs over and over.
