CPU Performance Counter based clock and stop-watch classes for Win2K++
were implemented in VC++. Details: see sources (I hope, these classes, especially StopWatch, have well-to-do interfaces;)).
Warning: no special precautions for multi-threading/multi-core CPU environment.
It's not so hard work to translate these codes in C (or other C++ implementations).
This codes are in the public domain.