Well, did you consider that it probably down clocks itself while idle? If it's downclocked then GPU-Z will report a lower fill number than it would be under load.
Whats the difference, if any, between what nVidia has for core clock speed and what you have for core clock speed, and the speed of the memory on each?