Today I ported my old memory benchmark
from Borland C++ builder 5.0 to BDS2006 Turbo C++ and found out weird thing.
- exe from BCB5 runs OK and stable
- exe from BDS2006 measure OK only before main Form is started (inside its constructor) and if the benchmark is started again after main form is
Activated
or even after any VCL component change (for exampleCaption
of main form) then the speed of benchmark thread is strongly affected.
After some research I found out that:
- Does not mater if test is inside thread or not.
- The process/thread priority,affinity does not affect this either.
- Hide of any window (
Visibility,Enabled
) does not affect this. - call the test form
OnIdleEvent
does not affect this - does not mater if time is measured by
RDTSC
orPerformanceCounter
My conclusion is that VCL library runs some code/thread on the background so my questions are:
Is there a way to temporarily pause VCL code/stuff ?
ideal something like
Application->Pause();
andApplication->Resume();
or justForms
.what else could cause this behavior and how to avoid it ?
PS.
Test application has no VCL components other than main form. Benchmark is just a few memory transfers by rep stosd
with different block sizes (no funny stuff). Source is in this related Q/A. I know BDS2006 is out-dated but I am not looking for upgrade right now so please skip any comments about that they are not help-full at all.
Tested on Windows7 pro x64, 32bit
Application
I found out that
wndproc
in BDS2006::VCLinvalidates
CACHEs.I have tried to override
wndproc
bywinapi
for
Application->Handle
is this easy but it does not stop the processing of messages forForm
. When I triedForm1->Handle
as window thenerror 1400
occurs (not valid window handle)I have tried to override
wndproc
by VCLfor Application by
TApplication events
and forForm
by override of virtualwndproc
member. Message processing stops but their calling sequences remains and the problem is not solved either.So my conclusion after eliminating every possibility I can think off is that I need to flush CACHE more intensively somehow after setting process/thread for benchmarking.
In DOS I would done it by single instruction but on windows it is more tricky. Well The previous version of memory benchmark used just memory filling which is obviously not enough for BDS2006 exe. I think that instruction CACHE is involved in this problem not data cache so I change it a bit and it finally worked thing out.
Flushing the CPU CACHE:
Where dat is
128MB
allocated memory chunk (or bigger) and must be done after all process/thread priority and affinity changes or all winapi calls prior to benchmarking.