Other than four cores, the most obvious difference is the widened SSE execution path. On the pre-Barcelona parts, SSE was done in 64-bit chunks, so if you wanted to do a 128-bit operation, you needed two passes, possibly more. Widening SSE to a full 128 bits should immediately double peak throughput on those instructions. Obviously media operations will benefit, but HPC and FP-heavy ops will get a solid kick in the pants too.
http://www.theinquirer.net/default.aspx?article=35011
Nice, this is a pretty good article... now if they could just hurry their asses up and let us see some f@#$'in benchmarks!!
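
For anyone wondering where the "double throughput" figure in the quote comes from, here's a rough back-of-the-envelope sketch. It's a toy model, not a description of the actual pipeline: it assumes the only cost is the number of passes through the SSE unit, and the function names (passes_per_op, peak_speedup) are made up for illustration. Real-world gains also depend on decode, load/store bandwidth and the instruction mix, so treat the 2x as a ceiling until benchmarks show up.

```python
def passes_per_op(op_bits: int, datapath_bits: int) -> int:
    """Passes through the execution unit needed for one SIMD operation.

    Toy assumption: an op wider than the datapath is split into
    datapath-sized chunks, one pass per chunk (ceiling division).
    """
    return -(-op_bits // datapath_bits)


def peak_speedup(op_bits: int, old_datapath_bits: int, new_datapath_bits: int) -> float:
    """Ratio of passes needed on the old datapath vs. the new one."""
    old = passes_per_op(op_bits, old_datapath_bits)
    new = passes_per_op(op_bits, new_datapath_bits)
    return old / new


if __name__ == "__main__":
    # 128-bit SSE op on a 64-bit datapath (pre-Barcelona, per the quote)
    print("old passes:", passes_per_op(128, 64))
    # Same op on a full 128-bit datapath
    print("new passes:", passes_per_op(128, 128))
    print("peak speedup:", peak_speedup(128, 64, 128))
```

Note that in this model a 64-bit op gets no benefit at all (one pass either way), which is why the win is specific to full-width 128-bit SSE work.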