=begin
I do not think that pinning specific variables to CPU registers yields any real performance gain (benchmarks welcome!). It also generally interferes with the compiler's own register allocation and optimization, especially on register-starved CPUs such as IA-32/i386, and may result in less efficient code. Remove the i386/amd64-specific code.
As a side effect, removing it allows successful compilation with recent versions of clang/LLVM.
=end