Why multithread program are faster than a single threaded multiprocessor machine
Can anyone explain why a multithreaded program with one to one mapping between user level and kernel level thread performs faster than a single threaded multiprocessor machine? Is it because it allows blocking one or more system calls and enabling the smooth running of other threads? Please elaborate.








