General advice is a modern CPU with low latency RAM and a fast CPU to RAM memory bus.
The precise CPU isn't so straightforward. Naively you may expect a CPU with a large cache to be better, but this isn't necessarily true. Modern CPUs can actually anticipate the memory the data they need to fetch from the RAM and prefetch it to the cache before it is needed. The microarchitecture can also vary between CPUs with similar clock speeds, which can mean some CPUs can perform calculations with fewer clock cycles than others. Unfortunately, I don't know what sorts of optimizations Toady has applied (or which compiler and the settings of said compiler), so a general all-rounder CPU could be the safest bet. Edit: I should add that larger caches are slower as connections are physically larger. Whether a large slow cache works well for DF depends if CPU prefetching is effective. I have no idea if that is the case for DF.
One of the biggest microarchitectural differences is the length of the pipeline. It determines how far in advance the CPU had to fetch information. Deep pipelines are faster, but if you get a branch prediction failure there will be longer delays. It would be good to do some science on this to see if there is an optimal microarchitecture.
A good science project would be to create a standardized fort (let's call it DFSF2018), save it, then compare FPS across different CPU/RAM combinations. If we could get enough data we might be able to work out if any specific CPU architectures are better than others, and whether the need for low RAM latency can be mitigated by a CPU with better prefetching, if deep pipelines are better than shallow ones, and how important cache size is. Note: to do all that we'll need an absolute ton of data.