When you observe that application transaction response times are throttled from increased usage, but the hardware resources of the server (memory and CPU) have enough head room, you can implement vertical clustering by adding two or more server nodes of the application on the same physical server.
The following figure depicts vertical clustering.