https://www.cpu-monkey.com/en/cpu_benchmark-cpu_performance_...
What do we see at the top of this chart? TSMC 3nm (M3/M4), followed by TSMC 4nm (Ryzen 7000U/8000U), TSMC 5nm (M1/M2), TSMC 5nm/6nm mixed (Ryzen 7000H), and then finally we find something made on an Intel process node instead of TSMC.
The efficiency has more to do with the process node than which architecture it is.
It's too bad they don't have Epyc on that chart. Epyc 9845 is on TSMC N3E and that thing is running cores at a >2GHz base clock at less than 2.5W per core.
And this benchmark doesn't even include M4, which is even more efficient.
Your link is comparing the M3 against AMD chips with higher TDPs. Higher TDPs tank "single-core efficiency" because power consumption is non-linear with clock speed. Give a core near its limit three times the power budget and you're basically dividing the single-core efficiency by three because you burn three times more power and barely improve single-thread performance at all, and then that's exactly what you see there.
To have a useful comparison you have to compare the efficiency of CPUs when they're set to use the same amount of power.
They're saying the exact opposite of that. Their claim is that all the extra power is only juicing performance a little bit, so at similar power levels the performance is not all that far apart.