Andy made an interesting suggestion to me.  What if the profile event code allocated all the counters to the requested event.  It could reallocate half of them if a second event was also requested, and so on, till we're down to one (unreliably interrupting) counter.  Or we could set a limit requiring at least three counters per event.
The problem would still exist, but this should throw the maximum amount of ammunition we have towards minimizing it.  Would this be worth the effort to implement?
-dl