Preventing unnoticed thread death

dstorrs · April 27, 2022, 6:54pm

I'm expanding the majordomo2 task management module to accept a max-workers parameter such that number of running tasks can be constrained with newly-added tasks being queued until there are workers available.

My initial thought on how to do this is to have a central thread that will listen on an async-channel for messages regarding new tasks being queued, workers finishing or dying, etc. Based on these signals it would start new workers as needed. My concern is that if the monitor thread stops working then the whole thing breaks down and new tasks will never be run. I'm looking for a way to guarantee that I can detect when the monitoring thread dies.

I am aware of 16.3 Wills and Executors but don't have experience with them. The Reference section is terse and there's nothing in the Guide on them, so it's not clear to me if they are a good fit here.

Any advice?

greghendershott · April 27, 2022, 9:41pm

If the original, main thread exits, doesn't the whole Racket process terminate?

If so, what if you make that main thread do the monitoring?

In some programs the main thread has nothing interesting to do -- e.g. in a web server it might park forever in a (sync never-evt), just to keep the process from exiting -- in which case, yay, now it has a more-interesting role to play.

I think one problem (at least) with that answer, is that it's only an answer for an application. You're talking about a library, correct?

dstorrs · April 28, 2022, 1:00am

It's a library, yes.

greghendershott · April 28, 2022, 12:48pm

An executor needs another "monitor" thread to execute wills. If "guarantee" means you need to handle that thread stopping unexpectedly, this seems like it's going to be an infinite regress.

So instead of noticing the bad thing happening, I'd probably try to prevent it --- focus on trying to keep the monitor thread alive?

I'd start by making sure the monitor thread won't exit itself. Make sure it handles all uncaught exceptions, using with-handlers or call-with-exception-handler.

That leaves something else killing the thread, via kill-thread or shutting down its custodian. I think you can mostly guard against those by (a) not making that thread descriptor value available in most of your library (don't provide a variable holding its value) and (b) carefully creating the thread in its own custodian, and ditto not advertising that custodian value.

Strictly speaking this answer is cheating by redefining the question from "preventing unnoticed thread death" to "preventing thread death", but, that would be my advice. Probably other people have better!

dstorrs · April 28, 2022, 1:24pm

So, essentially, "Threads aren't going to die without a specific reason so as long as their function is simple and robust then you should stop obsessing about irrelevant things." :>

That works. Thank you!

greghendershott · April 28, 2022, 2:36pm

I will agree with this last part only because I think we share a sense of humor that includes a fair amount of self-deprecation. Of course seriously you asked a really good question.

I'm not certain my advice is best, but it's what I'd do until/unless I learned something better.

Topic		Replies	Views
What's the most reasonable way to do something seasonal in racket Questions & Answers question	4	314	March 3, 2022
Logging, custodians and larger applications Questions & Answers question , logging	2	818	March 7, 2022
Will executor not being executed on program exit unless `(collect-garbage`) is called General	3	40	November 15, 2024
Threads waiting for a worker thread to make progress General	2	225	January 1, 2022
Kill-safe cleanup of locally created threads Questions & Answers	1	138	April 16, 2024

Preventing unnoticed thread death

Related topics