Hacker News

Golang also has a totally inaccurate routine for counting the CPUs in the machine.



I'm not necessarily questioning the accuracy, just that these routines generally don't consider affinities, and a lot of software assumes that the number of CPUs matches the concurrency available. If only half the CPUs are in the affinity set, the processes/threads could contend roughly twice as much as they need to. I guess depending on how the number is used, it could still improve throughput (e.g. when blocking on I/O).


Depends on the language used in the spec, but "CPUs available for scheduling" seems like the definition most software should use. However, I suspect most software is built using an interface that returns the total CPU count for the machine.


The language is typically vague, if there is any at all. POSIX also notably never had anything to say with regard to CPU affinity.


What's the accurate way?



At the very least you should be aware that these counts usually count hyperthreaded cores twice. Hyperthreading does provide an opportunity for increased parallelism, but it is noticeably worse than having another separate physical core.
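To make the logical-vs-physical distinction concrete, here is a minimal sketch that parses /proc/cpuinfo-style text (a Linux-only assumption) and counts both logical CPUs and distinct physical cores; the `countCores` helper and the sample string are my own, not from any library:

```go
package main

import (
	"bufio"
	"fmt"
	"strings"
)

// countCores parses /proc/cpuinfo-style text and returns the number of
// logical CPUs and the number of distinct physical cores. With
// hyperthreading, two logical CPUs share one (physical id, core id)
// pair, so physical < logical.
func countCores(cpuinfo string) (logical, physical int) {
	seen := map[string]bool{}
	var physID, coreID string
	flush := func() {
		if physID != "" || coreID != "" {
			seen[physID+"/"+coreID] = true
		}
		physID, coreID = "", ""
	}
	sc := bufio.NewScanner(strings.NewReader(cpuinfo))
	for sc.Scan() {
		line := sc.Text()
		k, v, ok := strings.Cut(line, ":")
		k, v = strings.TrimSpace(k), strings.TrimSpace(v)
		switch {
		case ok && k == "processor":
			logical++
		case ok && k == "physical id":
			physID = v
		case ok && k == "core id":
			coreID = v
		case line == "": // blank line ends one processor entry
			flush()
		}
	}
	flush()
	return logical, len(seen)
}

func main() {
	// Two logical CPUs that are hyperthread siblings of one core.
	sample := "processor\t: 0\nphysical id\t: 0\ncore id\t: 0\n\n" +
		"processor\t: 1\nphysical id\t: 0\ncore id\t: 0\n\n"
	l, p := countCores(sample)
	fmt.Println(l, p) // 2 1
}
```

On a real machine you would feed it the contents of /proc/cpuinfo; the example above reports 2 logical CPUs backed by 1 physical core.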


Probably something like hwloc which can be configured to show you only the cpus you're allowed to use.

BTW the same problem also happens for determining available memory in containers.
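For the memory side, a sketch of what container-aware code has to do, assuming cgroup v2 where the limit lives in /sys/fs/cgroup/memory.max as either a byte count or the literal string "max"; the `memoryLimit` helper is hypothetical:

```go
package main

import (
	"fmt"
	"os"
	"strconv"
	"strings"
)

// memoryLimit interprets the contents of a cgroup v2 memory.max file
// and returns the limit in bytes, or ok=false when the value is "max"
// (i.e. no limit is imposed).
func memoryLimit(contents string) (bytes int64, ok bool) {
	s := strings.TrimSpace(contents)
	if s == "max" {
		return 0, false
	}
	n, err := strconv.ParseInt(s, 10, 64)
	if err != nil {
		return 0, false
	}
	return n, true
}

func main() {
	// Inside a cgroup v2 container the limit lives here; fall back to
	// "max" when the file is absent (e.g. running outside a container).
	data, err := os.ReadFile("/sys/fs/cgroup/memory.max")
	if err != nil {
		data = []byte("max")
	}
	if n, ok := memoryLimit(string(data)); ok {
		fmt.Printf("container memory limit: %d bytes\n", n)
	} else {
		fmt.Println("no container memory limit")
	}
}
```

Naively asking the OS for total RAM would report the host's memory, not the container's, which is exactly the analogous trap to the CPU count.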


To calculate the number of processors available or the optimal number of threads/processes?


Well, specifically it would be cool if it counted the optimal number to set GOMAXPROCS to, since I think that's the main use of runtime.NumCPU().


Right. Currently runtime.NumCPU tries to be fancy by looking at the population count of the cpuset mask[1]. However in a hosted environment using containers there's no reason to believe that the cpuset will remain fixed over the life of the process. This can undercount the available CPUs, leaving you with a GOMAXPROCS that is too low.
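The popcount-of-the-cpuset idea mentioned above can be sketched in a few lines; this is an illustrative stand-alone helper (`cpusFromMask` is my name, not a runtime function), not the runtime's actual code:

```go
package main

import (
	"fmt"
	"math/bits"
)

// cpusFromMask counts set bits across the words of a CPU affinity
// mask. Each bit i means "this process may run on CPU i", so the
// popcount is the number of CPUs available for scheduling — the same
// idea runtime.NumCPU applies to the cpuset at startup.
func cpusFromMask(mask []uint64) int {
	n := 0
	for _, w := range mask {
		n += bits.OnesCount64(w)
	}
	return n
}

func main() {
	// Hypothetical mask: allowed to run on CPUs 0-3 only, even though
	// the machine may have many more CPUs installed.
	mask := []uint64{0b1111}
	fmt.Println(cpusFromMask(mask)) // 4
}
```

The catch described in this thread is that the mask is sampled once at startup: if the container's cpuset is later widened, a count derived this way is stale and too low.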

1: https://code.google.com/p/go/source/browse/src/pkg/runtime/t...


Anecdotally, it's very often not bad (and in fact sometimes good) to over-provision GOMAXPROCS. We have used as much as 3 to 6x the number of hyperthreaded cores with good results, depending on the workload. This could insulate you against some container changes.


Seems like if they did some other metric, it could overcount the number of CPUs instead, right?

What about changing GOMAXPROCS once per minute in a goroutine that calls NumCPU()?



