Re: Parallel make

bug-make

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Parallel make

From:	Kruppa, Jason
Subject:	Re: Parallel make
Date:	Wed, 29 Apr 2015 21:55:35 -0500

Last time I looked at distcc, it still ran the c++ preprocessor on the main 
machine (to eliminate the risk of different machines having different 
configurations), and then sent the preprocessed files to the various machines 
at its disposal.  This, in my experience, still does a huge portion of the 
build on the machine where make is invoked.

There are a number or things you can look at for reducing compile times.

You mention the job server capability like it is new, which probably means you 
build on windows machines.  My build performance on Linux is 3x faster via NFS 
than my windows builds are on local disk.  And network builds on windows are 
much slower than local disk.  We found that when the files are on a local disk, 
windows will cache the file contents in ram until ram is full.  This lets you 
basically store your entire source tree in ram.  I was unable to get the 
network share to behave this way on the windows machine, likely because windows 
does not assume it has exclusive access to the network file system.  To work 
around this in our windows environment we copy source locally and build, via a 
subst'd drive, and then copy the build to the network for consumption by the 
team at large.

We also found that our windows build performance did not scale well between 24 
and 48 core machines.  There is some resource that is pegged out that keeps the 
build speed from further reduction.

We ran into similar things in Linux due to NFS, autofs, and NFS via apparmor 
not scaling well when 100 compilers are trying to search for header files 
through a long number of sourcedirs.  Recent kernels have eliminated most of 
these spinlocks, but symlinks are still not scalable, and having them in your 
sourcedirs will kill performance if you've got enough cores.

I worked with someone today who thought their Linux build system was utilizing 
a precompiled header.  It wasn't actually using it, and ensuring the pch was 
getting used resulted in an immediate 33% reduction.  Turning off the updating 
of directory and file access time on the 80-core box provided us another 50% 
reduction in build time.  These two tweaks reduced a 12 minute build to a 4 
minute build.  Reducing the directory depth of the build will give us further 
improvements.

I hope some of these lessons are applicable to you.  Maybe faster builds are 
available to you without developing a distributed build system.  Good luck!

> On Apr 29, 2015, at 3:20 PM, Paul Smith <address@hidden> wrote:
> 
>> On Wed, 2015-04-29 at 13:50 -0600, Ryan P. Steele wrote:
>> The multithreaded version of make (-j#) is wonderful, and we have made
>> great use of it.  Because we're dealing with some very large code, 
>> however, it would be great to be able to parallelize compilation over 
>> multiple machines.  I can't seem to find any option for doing this, 
>> though.  Does such functionality exist?  It would appear to be a
>> fairly straightforward extension of multithreading, especially on a
>> network file system, but thus far, we haven't been able to make it
>> work.  Any help would be appreciated.
> 
> Just to be clear, GNU make is not multithreaded.  It simply spawns
> multiple processes and lets them run in parallel.
> 
> GNU make has no built-in capability to use multiple machines:
> conceptually it may be a straightforward extension but the effort needed
> to communicate between multiple systems over a network, send and receive
> results reliably, kill jobs when someone stops the main make process
> with ^C, etc. is FAR out of GNU make's current wheelhouse.
> 
> You can get a very cheap implementation, if you're willing to live with
> many prerequisite configuration requirements such as a networked
> filesystem, SSH access that doesn't require a password, etc. by writing
> your own script to forward the job via SSH and setting the GNU make
> SHELL variable to point to your script.
> 
> Luckily, if you are building C or C++ code someone has already done all
> the necessary work for you.  I recommend you investigate the distcc
> package: https://code.google.com/p/distcc/
> 
> Cheers!
> 
> 
> _______________________________________________
> Bug-make mailing list
> address@hidden
> https://lists.gnu.org/mailman/listinfo/bug-make

[Prev in Thread]

Current Thread

[Next in Thread]

Parallel make, Ryan P. Steele, 2015/04/29
- Re: Parallel make, Paul Smith, 2015/04/29
  - Re: Parallel make, Tim Murphy, 2015/04/29
  - Re: Parallel make, Kruppa, Jason <=
    - Re: Parallel make, Edward Welbourne, 2015/04/30
  - Re: Parallel make, Edward Welbourne, 2015/04/30
  - Re: Parallel make, SF Markus Elfring, 2015/04/30

Prev by Date: Re: Parallel make
Next by Date: Re: Parallel make
Previous by thread: Re: Parallel make
Next by thread: Re: Parallel make
Index(es):
- Date
- Thread