Tests performed on a real Model 3 board running Virtua Fighter 3 revealed that ping pong flip occurred about 66% of the frame time after IRQ2; 239 display lines + 40 blanking lines = 279 lines, and 279 / 424 = about 65.8% of the frame (!!!) I must admit I got quite excited when I figured this out

Both Sega Rally 2 and Star Wars Trilogy Arcade wait until IRQ2 before starting to process the next frame on the CPU side, whereas all other games wait for ping pong flip. With the current frame timing in Supermodel this means 33% of PowerPC cycles are wasted, but these games set it up so that ping pong flip occurs much later in the frame and so fewer cycles are wasted on real hardware (about 4% total).
In addition, I have deduced what IRQ8 and IRQ4 represent: they signal when Real3D has finished geometry T&L and rasterization respectively. The timing debug screens present in most games list GP as the time between IRQ8 and ping pong flip and DP as the time between IRQ4 and IRQ2. This basically confirms that 3D rendering is done in two separate stages on Model 3, which I had been theorizing about for a while.