the default number of threads should be set to the number of physical cores, not logical cores. If you have a 4 core machine with hyperthreads, the default number of threads is 8. This is actually a bit slower than if 4 threads were used, since each core is already fully utilized and the extra 4 threads incur extra context switching.