How exactly do semaphores work, especially when combined with sync/timeout?

dstorrs · June 17, 2022, 4:48pm

I was writing some code using semaphores and realized that I wasn't sure I completely understand them.

Suppose I have something like this:

(define sema (make-semaphore 1))
(define user->id (make-hash (list (cons 'alice 1) (cons 'bill 2))))
(define id->user (make-hash (list (cons 1 'alice) (cons 2 'bill))))
(define (get-name id)    (hash-ref id->user id))
(define (get-id   name)  (hash-ref user->id name))

(define (add-user! name id)
  (call-with-semaphore sema
                       (thunk
                        (hash-set! user->id name id)
                        (hash-set! id->user id name))))

(add-user! 'charlie 3)
(get-name 2)
(get-id 'alice)

My understanding is that:

A. Racket is single-threaded and implements concurrency by periodically pausing the current thread and running a different one.
B. The semaphore ensures that if multiple threads want to call add-user! then they will take turns, thereby ensuring that the two hashes are always consistent.

The things I'm not sure about:

Do I need to worry about get-name seeing an inconsistent state if it's running in thread A and the add-user! is happening in thread B?
How exactly does this work? Does call-with-semaphore pause all other threads (or, rather, disable thread switching) while it finishes the protected block?
If the answer to 1 is yes then I believe the solution is to use the semaphore in both the accessors and mutators, while being careful to avoid deadlock. Is this right?
If the answer to 2 is yes then does the time spent in the protected block count against the duration of a sync/timeout in another thread?

samth · June 17, 2022, 5:13pm

Your program will ensure mutual exclusion between different calls to add-user!. But the semaphore doesn't protect against anything that isn't checking it, so calls like get-name could run in between the two hash-set! calls. So the answer to 1 is "yes, you need to worry about that".

If you only have this one data structure that you're protecting, then just using the same semaphore for everything should be fine. If you have a more complex situation than that, then I highly recommend using multiple threads and communicating via channels, rather than sharing mutable data between threads.

dstorrs · June 17, 2022, 5:18pm

Cool, thanks. So, in summary:

Semaphores do not disable thread switching
Therefore accessors and mutators must coordinate
Mutability = badness (this I knew, but the specific thing I'm working on is awkward to do without it)
Because threads can switch at (essentially) any time, a semaphore-protected block is not special in how much latency it generates as compared to any other thread

Do I have that correct?

samth · June 17, 2022, 5:21pm

Yes, that's all correct.

Mutation is fine; it's shared mutable data that's not fine. The way I'd do something like your example is have a thread that holds (and does not share) the mutable hash table, and then communicate with that thread via a channel to do things like add a user or query the table.

Topic		Replies	Views
Recommended Approach to Locks Questions & Answers question	4	311	August 15, 2022
Parallel HTTP Request Mutex Deadlock when Running from Dr Racket Questions & Answers question , drracket	2	108	February 25, 2024
Threads waiting for a worker thread to make progress General	2	224	January 1, 2022
Threading and paralelism Questions & Answers parallelism , chez , thread	2	103	February 12, 2025
Events and concurrency Questions & Answers question	4	246	May 25, 2022

How exactly do semaphores work, especially when combined with sync/timeout?

Related topics