1====================================
2Getting Started with the LLVM System
3====================================
4
5.. contents::
6   :local:
7
8Overview
9========
10
11Welcome to the LLVM project!
12
13The LLVM project has multiple components. The core of the project is
14itself called "LLVM". This contains all of the tools, libraries, and header
15files needed to process intermediate representations and converts it into
16object files.  Tools include an assembler, disassembler, bitcode analyzer, and
17bitcode optimizer.  It also contains basic regression tests.
18
19C-like languages use the `Clang <https://clang.llvm.org/>`_ front end.  This
20component compiles C, C++, Objective C, and Objective C++ code into LLVM bitcode
21-- and from there into object files, using LLVM.
22
23Other components include:
24the `libc++ C++ standard library <https://libcxx.llvm.org>`_,
25the `LLD linker <https://lld.llvm.org>`_, and more.
26
27Getting the Source Code and Building LLVM
28=========================================
29
30The LLVM Getting Started documentation may be out of date.  The `Clang
31Getting Started <https://clang.llvm.org/get_started.html>`_ page might have more
32accurate information.
33
34This is an example workflow and configuration to get and build the LLVM source:
35
36#. Checkout LLVM (including related subprojects like Clang):
37
38   * ``git clone https://github.com/llvm/llvm-project.git``
39   * Or, on windows, ``git clone --config core.autocrlf=false
40     https://github.com/llvm/llvm-project.git``
41   * To save storage and speed-up the checkout time, you may want to do a
42     `shallow clone <https://git-scm.com/docs/git-clone#Documentation/git-clone.txt---depthltdepthgt>`_.
43     For example, to get the latest revision of the LLVM project, use
44     ``git clone --depth 1 https://github.com/llvm/llvm-project.git``
45
46#. Configure and build LLVM and Clang:
47
48   * ``cd llvm-project``
49   * ``mkdir build``
50   * ``cd build``
51   * ``cmake -G <generator> [options] ../llvm``
52
53     Some common build system generators are:
54
55     * ``Ninja`` --- for generating `Ninja <https://ninja-build.org>`_
56       build files. Most llvm developers use Ninja.
57     * ``Unix Makefiles`` --- for generating make-compatible parallel makefiles.
58     * ``Visual Studio`` --- for generating Visual Studio projects and
59       solutions.
60     * ``Xcode`` --- for generating Xcode projects.
61
62     Some Common options:
63
64     * ``-DLLVM_ENABLE_PROJECTS='...'`` --- semicolon-separated list of the LLVM
65       subprojects you'd like to additionally build. Can include any of: clang,
66       clang-tools-extra, lldb, compiler-rt, lld, polly, or cross-project-tests.
67
68       For example, to build LLVM, Clang, libcxx, and libcxxabi, use
69       ``-DLLVM_ENABLE_PROJECTS="clang" -DLLVM_ENABLE_RUNTIMES="libcxx;libcxxabi"``.
70
71     * ``-DCMAKE_INSTALL_PREFIX=directory`` --- Specify for *directory* the full
72       pathname of where you want the LLVM tools and libraries to be installed
73       (default ``/usr/local``).
74
75     * ``-DCMAKE_BUILD_TYPE=type`` --- Valid options for *type* are Debug,
76       Release, RelWithDebInfo, and MinSizeRel. Default is Debug.
77
78     * ``-DLLVM_ENABLE_ASSERTIONS=On`` --- Compile with assertion checks enabled
79       (default is Yes for Debug builds, No for all other build types).
80
81   * ``cmake --build . [--target <target>]`` or the build system specified
82     above directly.
83
84     * The default target (i.e. ``cmake --build .`` or ``make``) will build all of
85       LLVM.
86
87     * The ``check-all`` target (i.e. ``ninja check-all``) will run the
88       regression tests to ensure everything is in working order.
89
90     * CMake will generate build targets for each tool and library, and most
91       LLVM sub-projects generate their own ``check-<project>`` target.
92
93     * Running a serial build will be **slow**.  To improve speed, try running a
94       parallel build. That's done by default in Ninja; for ``make``, use the
95       option ``-j NN``, where ``NN`` is the number of parallel jobs, e.g. the
96       number of available CPUs.
97
98   * For more information see `CMake <CMake.html>`__
99
100   * If you get an "internal compiler error (ICE)" or test failures, see
101     `below`_.
102
103Consult the `Getting Started with LLVM`_ section for detailed information on
104configuring and compiling LLVM.  Go to `Directory Layout`_ to learn about the
105layout of the source code tree.
106
107Requirements
108============
109
110Before you begin to use the LLVM system, review the requirements given below.
111This may save you some trouble by knowing ahead of time what hardware and
112software you will need.
113
114Hardware
115--------
116
117LLVM is known to work on the following host platforms:
118
119================== ===================== =============
120OS                 Arch                  Compilers
121================== ===================== =============
122Linux              x86\ :sup:`1`         GCC, Clang
123Linux              amd64                 GCC, Clang
124Linux              ARM                   GCC, Clang
125Linux              Mips                  GCC, Clang
126Linux              PowerPC               GCC, Clang
127Linux              SystemZ               GCC, Clang
128Solaris            V9 (Ultrasparc)       GCC
129DragonFlyBSD       amd64                 GCC, Clang
130FreeBSD            x86\ :sup:`1`         GCC, Clang
131FreeBSD            amd64                 GCC, Clang
132NetBSD             x86\ :sup:`1`         GCC, Clang
133NetBSD             amd64                 GCC, Clang
134OpenBSD            x86\ :sup:`1`         GCC, Clang
135OpenBSD            amd64                 GCC, Clang
136macOS\ :sup:`2`    PowerPC               GCC
137macOS              x86                   GCC, Clang
138Cygwin/Win32       x86\ :sup:`1, 3`      GCC
139Windows            x86\ :sup:`1`         Visual Studio
140Windows x64        x86-64                Visual Studio
141================== ===================== =============
142
143.. note::
144
145  #. Code generation supported for Pentium processors and up
146  #. Code generation supported for 32-bit ABI only
147  #. To use LLVM modules on Win32-based system, you may configure LLVM
148     with ``-DBUILD_SHARED_LIBS=On``.
149
150Note that Debug builds require a lot of time and disk space.  An LLVM-only build
151will need about 1-3 GB of space.  A full build of LLVM and Clang will need around
15215-20 GB of disk space.  The exact space requirements will vary by system.  (It
153is so large because of all the debugging information and the fact that the
154libraries are statically linked into multiple tools).
155
156If you are space-constrained, you can build only selected tools or only
157selected targets.  The Release build requires considerably less space.
158
159The LLVM suite *may* compile on other platforms, but it is not guaranteed to do
160so.  If compilation is successful, the LLVM utilities should be able to
161assemble, disassemble, analyze, and optimize LLVM bitcode.  Code generation
162should work as well, although the generated native code may not work on your
163platform.
164
165Software
166--------
167
168Compiling LLVM requires that you have several software packages installed. The
169table below lists those required packages. The Package column is the usual name
170for the software package that LLVM depends on. The Version column provides
171"known to work" versions of the package. The Notes column describes how LLVM
172uses the package and provides other details.
173
174=========================================================== ============ ==========================================
175Package                                                     Version      Notes
176=========================================================== ============ ==========================================
177`CMake <http://cmake.org/>`__                               >=3.13.4     Makefile/workspace generator
178`GCC <http://gcc.gnu.org/>`_                                >=5.1.0      C/C++ compiler\ :sup:`1`
179`python <http://www.python.org/>`_                          >=3.6        Automated test suite\ :sup:`2`
180`zlib <http://zlib.net>`_                                   >=1.2.3.4    Compression library\ :sup:`3`
181`GNU Make <http://savannah.gnu.org/projects/make>`_         3.79, 3.79.1 Makefile/build processor\ :sup:`4`
182=========================================================== ============ ==========================================
183
184.. note::
185
186   #. Only the C and C++ languages are needed so there's no need to build the
187      other languages for LLVM's purposes. See `below` for specific version
188      info.
189   #. Only needed if you want to run the automated test suite in the
190      ``llvm/test`` directory.
191   #. Optional, adds compression / uncompression capabilities to selected LLVM
192      tools.
193   #. Optional, you can use any other build tool supported by CMake.
194
195Additionally, your compilation host is expected to have the usual plethora of
196Unix utilities. Specifically:
197
198* **ar** --- archive library builder
199* **bzip2** --- bzip2 command for distribution generation
200* **bunzip2** --- bunzip2 command for distribution checking
201* **chmod** --- change permissions on a file
202* **cat** --- output concatenation utility
203* **cp** --- copy files
204* **date** --- print the current date/time
205* **echo** --- print to standard output
206* **egrep** --- extended regular expression search utility
207* **find** --- find files/dirs in a file system
208* **grep** --- regular expression search utility
209* **gzip** --- gzip command for distribution generation
210* **gunzip** --- gunzip command for distribution checking
211* **install** --- install directories/files
212* **mkdir** --- create a directory
213* **mv** --- move (rename) files
214* **ranlib** --- symbol table builder for archive libraries
215* **rm** --- remove (delete) files and directories
216* **sed** --- stream editor for transforming output
217* **sh** --- Bourne shell for make build scripts
218* **tar** --- tape archive for distribution generation
219* **test** --- test things in file system
220* **unzip** --- unzip command for distribution checking
221* **zip** --- zip command for distribution generation
222
223.. _below:
224.. _check here:
225
226Host C++ Toolchain, both Compiler and Standard Library
227------------------------------------------------------
228
229LLVM is very demanding of the host C++ compiler, and as such tends to expose
230bugs in the compiler. We also attempt to follow improvements and developments in
231the C++ language and library reasonably closely. As such, we require a modern
232host C++ toolchain, both compiler and standard library, in order to build LLVM.
233
234LLVM is written using the subset of C++ documented in :doc:`coding
235standards<CodingStandards>`. To enforce this language version, we check the most
236popular host toolchains for specific minimum versions in our build systems:
237
238* Clang 3.5
239* Apple Clang 6.0
240* GCC 5.1
241* Visual Studio 2019
242
243Anything older than these toolchains *may* work, but will require forcing the
244build system with a special option and is not really a supported host platform.
245Also note that older versions of these compilers have often crashed or
246miscompiled LLVM.
247
248For less widely used host toolchains such as ICC or xlC, be aware that a very
249recent version may be required to support all of the C++ features used in LLVM.
250
251We track certain versions of software that are *known* to fail when used as
252part of the host toolchain. These even include linkers at times.
253
254**GNU ld 2.16.X**. Some 2.16.X versions of the ld linker will produce very long
255warning messages complaining that some "``.gnu.linkonce.t.*``" symbol was
256defined in a discarded section. You can safely ignore these messages as they are
257erroneous and the linkage is correct.  These messages disappear using ld 2.17.
258
259**GNU binutils 2.17**: Binutils 2.17 contains `a bug
260<http://sourceware.org/bugzilla/show_bug.cgi?id=3111>`__ which causes huge link
261times (minutes instead of seconds) when building LLVM.  We recommend upgrading
262to a newer version (2.17.50.0.4 or later).
263
264**GNU Binutils 2.19.1 Gold**: This version of Gold contained `a bug
265<http://sourceware.org/bugzilla/show_bug.cgi?id=9836>`__ which causes
266intermittent failures when building LLVM with position independent code.  The
267symptom is an error about cyclic dependencies.  We recommend upgrading to a
268newer version of Gold.
269
270Getting a Modern Host C++ Toolchain
271^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
272
273This section mostly applies to Linux and older BSDs. On macOS, you should
274have a sufficiently modern Xcode, or you will likely need to upgrade until you
275do. Windows does not have a "system compiler", so you must install either Visual
276Studio 2019 (or later), or a recent version of mingw64. FreeBSD 10.0 and newer
277have a modern Clang as the system compiler.
278
279However, some Linux distributions and some other or older BSDs sometimes have
280extremely old versions of GCC. These steps attempt to help you upgrade you
281compiler even on such a system. However, if at all possible, we encourage you
282to use a recent version of a distribution with a modern system compiler that
283meets these requirements. Note that it is tempting to install a prior
284version of Clang and libc++ to be the host compiler, however libc++ was not
285well tested or set up to build on Linux until relatively recently. As
286a consequence, this guide suggests just using libstdc++ and a modern GCC as the
287initial host in a bootstrap, and then using Clang (and potentially libc++).
288
289The first step is to get a recent GCC toolchain installed. The most common
290distribution on which users have struggled with the version requirements is
291Ubuntu Precise, 12.04 LTS. For this distribution, one easy option is to install
292the `toolchain testing PPA`_ and use it to install a modern GCC. There is
293a really nice discussions of this on the `ask ubuntu stack exchange`_ and a
294`github gist`_ with updated commands. However, not all users can use PPAs and
295there are many other distributions, so it may be necessary (or just useful, if
296you're here you *are* doing compiler development after all) to build and install
297GCC from source. It is also quite easy to do these days.
298
299.. _toolchain testing PPA:
300  https://launchpad.net/~ubuntu-toolchain-r/+archive/test
301.. _ask ubuntu stack exchange:
302  https://askubuntu.com/questions/466651/how-do-i-use-the-latest-gcc-on-ubuntu/581497#58149
303.. _github gist:
304  https://gist.github.com/application2000/73fd6f4bf1be6600a2cf9f56315a2d91
305
306Easy steps for installing GCC 5.1.0:
307
308.. code-block:: console
309
310  % gcc_version=5.1.0
311  % wget https://ftp.gnu.org/gnu/gcc/gcc-${gcc_version}/gcc-${gcc_version}.tar.bz2
312  % wget https://ftp.gnu.org/gnu/gcc/gcc-${gcc_version}/gcc-${gcc_version}.tar.bz2.sig
313  % wget https://ftp.gnu.org/gnu/gnu-keyring.gpg
314  % signature_invalid=`gpg --verify --no-default-keyring --keyring ./gnu-keyring.gpg gcc-${gcc_version}.tar.bz2.sig`
315  % if [ $signature_invalid ]; then echo "Invalid signature" ; exit 1 ; fi
316  % tar -xvjf gcc-${gcc_version}.tar.bz2
317  % cd gcc-${gcc_version}
318  % ./contrib/download_prerequisites
319  % cd ..
320  % mkdir gcc-${gcc_version}-build
321  % cd gcc-${gcc_version}-build
322  % $PWD/../gcc-${gcc_version}/configure --prefix=$HOME/toolchains --enable-languages=c,c++
323  % make -j$(nproc)
324  % make install
325
326For more details, check out the excellent `GCC wiki entry`_, where I got most
327of this information from.
328
329.. _GCC wiki entry:
330  https://gcc.gnu.org/wiki/InstallingGCC
331
332Once you have a GCC toolchain, configure your build of LLVM to use the new
333toolchain for your host compiler and C++ standard library. Because the new
334version of libstdc++ is not on the system library search path, you need to pass
335extra linker flags so that it can be found at link time (``-L``) and at runtime
336(``-rpath``). If you are using CMake, this invocation should produce working
337binaries:
338
339.. code-block:: console
340
341  % mkdir build
342  % cd build
343  % CC=$HOME/toolchains/bin/gcc CXX=$HOME/toolchains/bin/g++ \
344    cmake .. -DCMAKE_CXX_LINK_FLAGS="-Wl,-rpath,$HOME/toolchains/lib64 -L$HOME/toolchains/lib64"
345
346If you fail to set rpath, most LLVM binaries will fail on startup with a message
347from the loader similar to ``libstdc++.so.6: version `GLIBCXX_3.4.20' not
348found``. This means you need to tweak the -rpath linker flag.
349
350This method will add an absolute path to the rpath of all executables. That's
351fine for local development. If you want to distribute the binaries you build
352so that they can run on older systems, copy ``libstdc++.so.6`` into the
353``lib/`` directory.  All of LLVM's shipping binaries have an rpath pointing at
354``$ORIGIN/../lib``, so they will find ``libstdc++.so.6`` there.  Non-distributed
355binaries don't have an rpath set and won't find ``libstdc++.so.6``. Pass
356``-DLLVM_LOCAL_RPATH="$HOME/toolchains/lib64"`` to cmake to add an absolute
357path to ``libstdc++.so.6`` as above. Since these binaries are not distributed,
358having an absolute local path is fine for them.
359
360When you build Clang, you will need to give *it* access to modern C++
361standard library in order to use it as your new host in part of a bootstrap.
362There are two easy ways to do this, either build (and install) libc++ along
363with Clang and then use it with the ``-stdlib=libc++`` compile and link flag,
364or install Clang into the same prefix (``$HOME/toolchains`` above) as GCC.
365Clang will look within its own prefix for libstdc++ and use it if found. You
366can also add an explicit prefix for Clang to look in for a GCC toolchain with
367the ``--gcc-toolchain=/opt/my/gcc/prefix`` flag, passing it to both compile and
368link commands when using your just-built-Clang to bootstrap.
369
370.. _Getting Started with LLVM:
371
372Getting Started with LLVM
373=========================
374
375The remainder of this guide is meant to get you up and running with LLVM and to
376give you some basic information about the LLVM environment.
377
378The later sections of this guide describe the `general layout`_ of the LLVM
379source tree, a `simple example`_ using the LLVM tool chain, and `links`_ to find
380more information about LLVM or to get help via e-mail.
381
382Terminology and Notation
383------------------------
384
385Throughout this manual, the following names are used to denote paths specific to
386the local system and working environment.  *These are not environment variables
387you need to set but just strings used in the rest of this document below*.  In
388any of the examples below, simply replace each of these names with the
389appropriate pathname on your local system.  All these paths are absolute:
390
391``SRC_ROOT``
392
393  This is the top level directory of the LLVM source tree.
394
395``OBJ_ROOT``
396
397  This is the top level directory of the LLVM object tree (i.e. the tree where
398  object files and compiled programs will be placed.  It can be the same as
399  SRC_ROOT).
400
401Unpacking the LLVM Archives
402---------------------------
403
404If you have the LLVM distribution, you will need to unpack it before you can
405begin to compile it.  LLVM is distributed as a number of different
406subprojects. Each one has its own download which is a TAR archive that is
407compressed with the gzip program.
408
409The files are as follows, with *x.y* marking the version number:
410
411``llvm-x.y.tar.gz``
412
413  Source release for the LLVM libraries and tools.
414
415``cfe-x.y.tar.gz``
416
417  Source release for the Clang frontend.
418
419.. _checkout:
420
421Checkout LLVM from Git
422----------------------
423
424You can also checkout the source code for LLVM from Git.
425
426.. note::
427
428  Passing ``--config core.autocrlf=false`` should not be required in
429  the future after we adjust the .gitattribute settings correctly, but
430  is required for Windows users at the time of this writing.
431
432Simply run:
433
434.. code-block:: console
435
436  % git clone https://github.com/llvm/llvm-project.git
437
438or on Windows,
439
440.. code-block:: console
441
442  % git clone --config core.autocrlf=false https://github.com/llvm/llvm-project.git
443
444This will create an '``llvm-project``' directory in the current directory and
445fully populate it with all of the source code, test directories, and local
446copies of documentation files for LLVM and all the related subprojects. Note
447that unlike the tarballs, which contain each subproject in a separate file, the
448git repository contains all of the projects together.
449
450If you want to get a specific release (as opposed to the most recent revision),
451you can check out a tag after cloning the repository. E.g., `git checkout
452llvmorg-6.0.1` inside the ``llvm-project`` directory created by the above
453command.  Use `git tag -l` to list all of them.
454
455Sending patches
456^^^^^^^^^^^^^^^
457
458Please read `Developer Policy <DeveloperPolicy.html#one-off-patches>`_, too.
459
460We don't currently accept github pull requests, so you'll need to send patches
461either via emailing to llvm-commits, or, preferably, via :ref:`Phabricator
462<phabricator-reviews>`.
463
464You'll generally want to make sure your branch has a single commit,
465corresponding to the review you wish to send, up-to-date with the upstream
466``origin/main`` branch, and doesn't contain merges. Once you have that, you
467can start `a Phabricator review <Phabricator.html>`_ (or use ``git show`` or
468``git format-patch`` to output the diff, and attach it to an email message).
469
470However, using the "Arcanist" tool is often easier. After `installing arcanist`_, you
471will also need to apply a fix to your arcanist repo in order to submit a patch:
472
473.. code-block:: console
474
475  % cd arcanist
476  % git fetch https://github.com/rashkov/arcanist update_cacerts
477  % git cherry-pick e3659d43d8911e91739f3b0c5935598bceb859aa
478
479Once this is all done, you can upload the latest commit using:
480
481.. code-block:: console
482
483  % arc diff HEAD~1
484
485Additionally, before sending a patch for review, please also try to ensure it's
486formatted properly. We use ``clang-format`` for this, which has git integration
487through the ``git-clang-format`` script. On some systems, it may already be
488installed (or be installable via your package manager). If so, you can simply
489run it -- the following command will format only the code changed in the most
490recent commit:
491
492.. code-block:: console
493
494  % git clang-format HEAD~1
495
496Note that this modifies the files, but doesn't commit them -- you'll likely want
497to run
498
499.. code-block:: console
500
501  % git commit --amend -a
502
503in order to update the last commit with all pending changes.
504
505.. note::
506  If you don't already have ``clang-format`` or ``git clang-format`` installed
507  on your system, the ``clang-format`` binary will be built alongside clang, and
508  the git integration can be run from
509  ``clang/tools/clang-format/git-clang-format``.
510
511
512.. _commit_from_git:
513
514For developers to commit changes from Git
515^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
516
517Once a patch is reviewed, you should rebase it, re-test locally, and commit the
518changes to LLVM's main branch. This is done using `git push` if you have the
519required access rights. See `committing a change
520<Phabricator.html#committing-a-change>`_ for Phabricator based commits or
521`obtaining commit access <DeveloperPolicy.html#obtaining-commit-access>`_
522for commit access.
523
524Here is an example workflow using git. This workflow assumes you have an
525accepted commit on the branch named `branch-with-change`.
526
527.. code-block:: console
528
529  # Go to the branch with your accepted commit.
530  % git checkout branch-with-change
531  # Rebase your change onto the latest commits on Github.
532  % git pull --rebase origin main
533  # Rerun the appropriate tests if needed.
534  % ninja check-$whatever
535  # Check that the list of commits about to be pushed is correct.
536  % git log origin/main...HEAD --oneline
537  # Push to Github.
538  % git push origin HEAD:main
539
540LLVM currently has a linear-history policy, which means that merge commits are
541not allowed. The `llvm-project` repo on github is configured to reject pushes
542that include merges, so the `git rebase` step above is required.
543
544Please ask for help if you're having trouble with your particular git workflow.
545
546
547.. _git_pre_push_hook:
548
549Git pre-push hook
550^^^^^^^^^^^^^^^^^
551
552We include an optional pre-push hook that run some sanity checks on the revisions
553you are about to push and ask confirmation if you push multiple commits at once.
554You can set it up (on Unix systems) by running from the repository root:
555
556.. code-block:: console
557
558  % ln -sf ../../llvm/utils/git/pre-push.py .git/hooks/pre-push
559
560Bisecting commits
561^^^^^^^^^^^^^^^^^
562
563See `Bisecting LLVM code <GitBisecting.html>`_ for how to use ``git bisect``
564on LLVM.
565
566Reverting a change
567^^^^^^^^^^^^^^^^^^
568
569When reverting changes using git, the default message will say "This reverts
570commit XYZ". Leave this at the end of the commit message, but add some details
571before it as to why the commit is being reverted. A brief explanation and/or
572links to bots that demonstrate the problem are sufficient.
573
574Local LLVM Configuration
575------------------------
576
577Once checked out repository, the LLVM suite source code must be configured
578before being built. This process uses CMake.  Unlinke the normal ``configure``
579script, CMake generates the build files in whatever format you request as well
580as various ``*.inc`` files, and ``llvm/include/Config/config.h``.
581
582Variables are passed to ``cmake`` on the command line using the format
583``-D<variable name>=<value>``. The following variables are some common options
584used by people developing LLVM.
585
586+-------------------------+----------------------------------------------------+
587| Variable                | Purpose                                            |
588+=========================+====================================================+
589| CMAKE_C_COMPILER        | Tells ``cmake`` which C compiler to use. By        |
590|                         | default, this will be /usr/bin/cc.                 |
591+-------------------------+----------------------------------------------------+
592| CMAKE_CXX_COMPILER      | Tells ``cmake`` which C++ compiler to use. By      |
593|                         | default, this will be /usr/bin/c++.                |
594+-------------------------+----------------------------------------------------+
595| CMAKE_BUILD_TYPE        | Tells ``cmake`` what type of build you are trying  |
596|                         | to generate files for. Valid options are Debug,    |
597|                         | Release, RelWithDebInfo, and MinSizeRel. Default   |
598|                         | is Debug.                                          |
599+-------------------------+----------------------------------------------------+
600| CMAKE_INSTALL_PREFIX    | Specifies the install directory to target when     |
601|                         | running the install action of the build files.     |
602+-------------------------+----------------------------------------------------+
603| PYTHON_EXECUTABLE       | Forces CMake to use a specific Python version by   |
604|                         | passing a path to a Python interpreter. By default |
605|                         | the Python version of the interpreter in your PATH |
606|                         | is used.                                           |
607+-------------------------+----------------------------------------------------+
608| LLVM_TARGETS_TO_BUILD   | A semicolon delimited list controlling which       |
609|                         | targets will be built and linked into llvm.        |
610|                         | The default list is defined as                     |
611|                         | ``LLVM_ALL_TARGETS``, and can be set to include    |
612|                         | out-of-tree targets. The default value includes:   |
613|                         | ``AArch64, AMDGPU, ARM, AVR, BPF, Hexagon, Lanai,  |
614|                         | Mips, MSP430, NVPTX, PowerPC, RISCV, Sparc,        |
615|                         | SystemZ, WebAssembly, X86, XCore``.                |
616|                         |                                                    |
617+-------------------------+----------------------------------------------------+
618| LLVM_ENABLE_DOXYGEN     | Build doxygen-based documentation from the source  |
619|                         | code This is disabled by default because it is     |
620|                         | slow and generates a lot of output.                |
621+-------------------------+----------------------------------------------------+
622| LLVM_ENABLE_PROJECTS    | A semicolon-delimited list selecting which of the  |
623|                         | other LLVM subprojects to additionally build. (Only|
624|                         | effective when using a side-by-side project layout |
625|                         | e.g. via git). The default list is empty. Can      |
626|                         | include: clang, clang-tools-extra,                 |
627|                         | cross-project-tests, flang, libc, libclc, lld,     |
628|                         | lldb, mlir, openmp, polly, or pstl.                |
629+-------------------------+----------------------------------------------------+
630| LLVM_ENABLE_RUNTIMES    | A semicolon-delimited list selecting which of the  |
631|                         | runtimes to build. (Only effective when using the  |
632|                         | full monorepo layout). The default list is empty.  |
633|                         | Can include: compiler-rt, libc, libcxx, libcxxabi, |
634|                         | libunwind, or openmp.                              |
635+-------------------------+----------------------------------------------------+
636| LLVM_ENABLE_SPHINX      | Build sphinx-based documentation from the source   |
637|                         | code. This is disabled by default because it is    |
638|                         | slow and generates a lot of output. Sphinx version |
639|                         | 1.5 or later recommended.                          |
640+-------------------------+----------------------------------------------------+
641| LLVM_BUILD_LLVM_DYLIB   | Generate libLLVM.so. This library contains a       |
642|                         | default set of LLVM components that can be         |
643|                         | overridden with ``LLVM_DYLIB_COMPONENTS``. The     |
644|                         | default contains most of LLVM and is defined in    |
645|                         | ``tools/llvm-shlib/CMakelists.txt``. This option is|
646|                         | not available on Windows.                          |
647+-------------------------+----------------------------------------------------+
648| LLVM_OPTIMIZED_TABLEGEN | Builds a release tablegen that gets used during    |
649|                         | the LLVM build. This can dramatically speed up     |
650|                         | debug builds.                                      |
651+-------------------------+----------------------------------------------------+
652
653To configure LLVM, follow these steps:
654
655#. Change directory into the object root directory:
656
657   .. code-block:: console
658
659     % cd OBJ_ROOT
660
661#. Run the ``cmake``:
662
663   .. code-block:: console
664
665     % cmake -G "Unix Makefiles" -DCMAKE_INSTALL_PREFIX=/install/path
666       [other options] SRC_ROOT
667
668Compiling the LLVM Suite Source Code
669------------------------------------
670
671Unlike with autotools, with CMake your build type is defined at configuration.
672If you want to change your build type, you can re-run cmake with the following
673invocation:
674
675   .. code-block:: console
676
677     % cmake -G "Unix Makefiles" -DCMAKE_BUILD_TYPE=type SRC_ROOT
678
679Between runs, CMake preserves the values set for all options. CMake has the
680following build types defined:
681
682Debug
683
684  These builds are the default. The build system will compile the tools and
685  libraries unoptimized, with debugging information, and asserts enabled.
686
687Release
688
689  For these builds, the build system will compile the tools and libraries
690  with optimizations enabled and not generate debug info. CMakes default
691  optimization level is -O3. This can be configured by setting the
692  ``CMAKE_CXX_FLAGS_RELEASE`` variable on the CMake command line.
693
694RelWithDebInfo
695
696  These builds are useful when debugging. They generate optimized binaries with
697  debug information. CMakes default optimization level is -O2. This can be
698  configured by setting the ``CMAKE_CXX_FLAGS_RELWITHDEBINFO`` variable on the
699  CMake command line.
700
701Once you have LLVM configured, you can build it by entering the *OBJ_ROOT*
702directory and issuing the following command:
703
704.. code-block:: console
705
706  % make
707
708If the build fails, please `check here`_ to see if you are using a version of
709GCC that is known not to compile LLVM.
710
711If you have multiple processors in your machine, you may wish to use some of the
712parallel build options provided by GNU Make.  For example, you could use the
713command:
714
715.. code-block:: console
716
717  % make -j2
718
719There are several special targets which are useful when working with the LLVM
720source code:
721
722``make clean``
723
724  Removes all files generated by the build.  This includes object files,
725  generated C/C++ files, libraries, and executables.
726
727``make install``
728
729  Installs LLVM header files, libraries, tools, and documentation in a hierarchy
730  under ``$PREFIX``, specified with ``CMAKE_INSTALL_PREFIX``, which
731  defaults to ``/usr/local``.
732
733``make docs-llvm-html``
734
735  If configured with ``-DLLVM_ENABLE_SPHINX=On``, this will generate a directory
736  at ``OBJ_ROOT/docs/html`` which contains the HTML formatted documentation.
737
738Cross-Compiling LLVM
739--------------------
740
741It is possible to cross-compile LLVM itself. That is, you can create LLVM
742executables and libraries to be hosted on a platform different from the platform
743where they are built (a Canadian Cross build). To generate build files for
744cross-compiling CMake provides a variable ``CMAKE_TOOLCHAIN_FILE`` which can
745define compiler flags and variables used during the CMake test operations.
746
747The result of such a build is executables that are not runnable on the build
748host but can be executed on the target. As an example the following CMake
749invocation can generate build files targeting iOS. This will work on macOS
750with the latest Xcode:
751
752.. code-block:: console
753
754  % cmake -G "Ninja" -DCMAKE_OSX_ARCHITECTURES="armv7;armv7s;arm64"
755    -DCMAKE_TOOLCHAIN_FILE=<PATH_TO_LLVM>/cmake/platforms/iOS.cmake
756    -DCMAKE_BUILD_TYPE=Release -DLLVM_BUILD_RUNTIME=Off -DLLVM_INCLUDE_TESTS=Off
757    -DLLVM_INCLUDE_EXAMPLES=Off -DLLVM_ENABLE_BACKTRACES=Off [options]
758    <PATH_TO_LLVM>
759
760Note: There are some additional flags that need to be passed when building for
761iOS due to limitations in the iOS SDK.
762
763Check :doc:`HowToCrossCompileLLVM` and `Clang docs on how to cross-compile in general
764<https://clang.llvm.org/docs/CrossCompilation.html>`_ for more information
765about cross-compiling.
766
767The Location of LLVM Object Files
768---------------------------------
769
770The LLVM build system is capable of sharing a single LLVM source tree among
771several LLVM builds.  Hence, it is possible to build LLVM for several different
772platforms or configurations using the same source tree.
773
774* Change directory to where the LLVM object files should live:
775
776  .. code-block:: console
777
778    % cd OBJ_ROOT
779
780* Run ``cmake``:
781
782  .. code-block:: console
783
784    % cmake -G "Unix Makefiles" SRC_ROOT
785
786The LLVM build will create a structure underneath *OBJ_ROOT* that matches the
787LLVM source tree. At each level where source files are present in the source
788tree there will be a corresponding ``CMakeFiles`` directory in the *OBJ_ROOT*.
789Underneath that directory there is another directory with a name ending in
790``.dir`` under which you'll find object files for each source.
791
792For example:
793
794  .. code-block:: console
795
796    % cd llvm_build_dir
797    % find lib/Support/ -name APFloat*
798    lib/Support/CMakeFiles/LLVMSupport.dir/APFloat.cpp.o
799
800Optional Configuration Items
801----------------------------
802
803If you're running on a Linux system that supports the `binfmt_misc
804<http://en.wikipedia.org/wiki/binfmt_misc>`_
805module, and you have root access on the system, you can set your system up to
806execute LLVM bitcode files directly. To do this, use commands like this (the
807first command may not be required if you are already using the module):
808
809.. code-block:: console
810
811  % mount -t binfmt_misc none /proc/sys/fs/binfmt_misc
812  % echo ':llvm:M::BC::/path/to/lli:' > /proc/sys/fs/binfmt_misc/register
813  % chmod u+x hello.bc   (if needed)
814  % ./hello.bc
815
816This allows you to execute LLVM bitcode files directly.  On Debian, you can also
817use this command instead of the 'echo' command above:
818
819.. code-block:: console
820
821  % sudo update-binfmts --install llvm /path/to/lli --magic 'BC'
822
823.. _Program Layout:
824.. _general layout:
825
826Directory Layout
827================
828
829One useful source of information about the LLVM source base is the LLVM `doxygen
830<http://www.doxygen.org/>`_ documentation available at
831`<https://llvm.org/doxygen/>`_.  The following is a brief introduction to code
832layout:
833
834``llvm/cmake``
835--------------
836Generates system build files.
837
838``llvm/cmake/modules``
839  Build configuration for llvm user defined options. Checks compiler version and
840  linker flags.
841
842``llvm/cmake/platforms``
843  Toolchain configuration for Android NDK, iOS systems and non-Windows hosts to
844  target MSVC.
845
846``llvm/examples``
847-----------------
848
849- Some simple examples showing how to use LLVM as a compiler for a custom
850  language - including lowering, optimization, and code generation.
851
852- Kaleidoscope Tutorial: Kaleidoscope language tutorial run through the
853  implementation of a nice little compiler for a non-trivial language
854  including a hand-written lexer, parser, AST, as well as code generation
855  support using LLVM- both static (ahead of time) and various approaches to
856  Just In Time (JIT) compilation.
857  `Kaleidoscope Tutorial for complete beginner
858  <https://llvm.org/docs/tutorial/MyFirstLanguageFrontend/index.html>`_.
859
860- BuildingAJIT: Examples of the `BuildingAJIT tutorial
861  <https://llvm.org/docs/tutorial/BuildingAJIT1.html>`_ that shows how LLVM’s
862  ORC JIT APIs interact with other parts of LLVM. It also, teaches how to
863  recombine them to build a custom JIT that is suited to your use-case.
864
865``llvm/include``
866----------------
867
868Public header files exported from the LLVM library. The three main subdirectories:
869
870``llvm/include/llvm``
871
872  All LLVM-specific header files, and  subdirectories for different portions of
873  LLVM: ``Analysis``, ``CodeGen``, ``Target``, ``Transforms``, etc...
874
875``llvm/include/llvm/Support``
876
877  Generic support libraries provided with LLVM but not necessarily specific to
878  LLVM. For example, some C++ STL utilities and a Command Line option processing
879  library store header files here.
880
881``llvm/include/llvm/Config``
882
883  Header files configured by ``cmake``.  They wrap "standard" UNIX and
884  C header files.  Source code can include these header files which
885  automatically take care of the conditional #includes that ``cmake``
886  generates.
887
888``llvm/lib``
889------------
890
891Most source files are here. By putting code in libraries, LLVM makes it easy to
892share code among the `tools`_.
893
894``llvm/lib/IR/``
895
896  Core LLVM source files that implement core classes like Instruction and
897  BasicBlock.
898
899``llvm/lib/AsmParser/``
900
901  Source code for the LLVM assembly language parser library.
902
903``llvm/lib/Bitcode/``
904
905  Code for reading and writing bitcode.
906
907``llvm/lib/Analysis/``
908
909  A variety of program analyses, such as Call Graphs, Induction Variables,
910  Natural Loop Identification, etc.
911
912``llvm/lib/Transforms/``
913
914  IR-to-IR program transformations, such as Aggressive Dead Code Elimination,
915  Sparse Conditional Constant Propagation, Inlining, Loop Invariant Code Motion,
916  Dead Global Elimination, and many others.
917
918``llvm/lib/Target/``
919
920  Files describing target architectures for code generation.  For example,
921  ``llvm/lib/Target/X86`` holds the X86 machine description.
922
923``llvm/lib/CodeGen/``
924
925  The major parts of the code generator: Instruction Selector, Instruction
926  Scheduling, and Register Allocation.
927
928``llvm/lib/MC/``
929
930  The libraries represent and process code at machine code level. Handles
931  assembly and object-file emission.
932
933``llvm/lib/ExecutionEngine/``
934
935  Libraries for directly executing bitcode at runtime in interpreted and
936  JIT-compiled scenarios.
937
938``llvm/lib/Support/``
939
940  Source code that corresponding to the header files in ``llvm/include/ADT/``
941  and ``llvm/include/Support/``.
942
943``llvm/bindings``
944----------------------
945
946Contains bindings for the LLVM compiler infrastructure to allow
947programs written in languages other than C or C++ to take advantage of the LLVM
948infrastructure.
949LLVM project provides language bindings for Go, OCaml and Python.
950
951``llvm/projects``
952-----------------
953
954Projects not strictly part of LLVM but shipped with LLVM. This is also the
955directory for creating your own LLVM-based projects which leverage the LLVM
956build system.
957
958``llvm/test``
959-------------
960
961Feature and regression tests and other sanity checks on LLVM infrastructure. These
962are intended to run quickly and cover a lot of territory without being exhaustive.
963
964``test-suite``
965--------------
966
967A comprehensive correctness, performance, and benchmarking test suite
968for LLVM.  This comes in a ``separate git repository
969<https://github.com/llvm/llvm-test-suite>``, because it contains a
970large amount of third-party code under a variety of licenses. For
971details see the :doc:`Testing Guide <TestingGuide>` document.
972
973.. _tools:
974
975``llvm/tools``
976--------------
977
978Executables built out of the libraries
979above, which form the main part of the user interface.  You can always get help
980for a tool by typing ``tool_name -help``.  The following is a brief introduction
981to the most important tools.  More detailed information is in
982the `Command Guide <CommandGuide/index.html>`_.
983
984``bugpoint``
985
986  ``bugpoint`` is used to debug optimization passes or code generation backends
987  by narrowing down the given test case to the minimum number of passes and/or
988  instructions that still cause a problem, whether it is a crash or
989  miscompilation. See `<HowToSubmitABug.html>`_ for more information on using
990  ``bugpoint``.
991
992``llvm-ar``
993
994  The archiver produces an archive containing the given LLVM bitcode files,
995  optionally with an index for faster lookup.
996
997``llvm-as``
998
999  The assembler transforms the human readable LLVM assembly to LLVM bitcode.
1000
1001``llvm-dis``
1002
1003  The disassembler transforms the LLVM bitcode to human readable LLVM assembly.
1004
1005``llvm-link``
1006
1007  ``llvm-link``, not surprisingly, links multiple LLVM modules into a single
1008  program.
1009
1010``lli``
1011
1012  ``lli`` is the LLVM interpreter, which can directly execute LLVM bitcode
1013  (although very slowly...). For architectures that support it (currently x86,
1014  Sparc, and PowerPC), by default, ``lli`` will function as a Just-In-Time
1015  compiler (if the functionality was compiled in), and will execute the code
1016  *much* faster than the interpreter.
1017
1018``llc``
1019
1020  ``llc`` is the LLVM backend compiler, which translates LLVM bitcode to a
1021  native code assembly file.
1022
1023``opt``
1024
1025  ``opt`` reads LLVM bitcode, applies a series of LLVM to LLVM transformations
1026  (which are specified on the command line), and outputs the resultant
1027  bitcode.   '``opt -help``'  is a good way to get a list of the
1028  program transformations available in LLVM.
1029
1030  ``opt`` can also  run a specific analysis on an input LLVM bitcode
1031  file and print  the results.  Primarily useful for debugging
1032  analyses, or familiarizing yourself with what an analysis does.
1033
1034``llvm/utils``
1035--------------
1036
1037Utilities for working with LLVM source code; some are part of the build process
1038because they are code generators for parts of the infrastructure.
1039
1040
1041``codegen-diff``
1042
1043  ``codegen-diff`` finds differences between code that LLC
1044  generates and code that LLI generates. This is useful if you are
1045  debugging one of them, assuming that the other generates correct output. For
1046  the full user manual, run ```perldoc codegen-diff'``.
1047
1048``emacs/``
1049
1050   Emacs and XEmacs syntax highlighting  for LLVM   assembly files and TableGen
1051   description files.  See the ``README`` for information on using them.
1052
1053``getsrcs.sh``
1054
1055  Finds and outputs all non-generated source files,
1056  useful if one wishes to do a lot of development across directories
1057  and does not want to find each file. One way to use it is to run,
1058  for example: ``xemacs `utils/getsources.sh``` from the top of the LLVM source
1059  tree.
1060
1061``llvmgrep``
1062
1063  Performs an ``egrep -H -n`` on each source file in LLVM and
1064  passes to it a regular expression provided on ``llvmgrep``'s command
1065  line. This is an efficient way of searching the source base for a
1066  particular regular expression.
1067
1068``TableGen/``
1069
1070  Contains the tool used to generate register
1071  descriptions, instruction set descriptions, and even assemblers from common
1072  TableGen description files.
1073
1074``vim/``
1075
1076  vim syntax-highlighting for LLVM assembly files
1077  and TableGen description files. See the    ``README`` for how to use them.
1078
1079.. _simple example:
1080
1081An Example Using the LLVM Tool Chain
1082====================================
1083
1084This section gives an example of using LLVM with the Clang front end.
1085
1086Example with clang
1087------------------
1088
1089#. First, create a simple C file, name it 'hello.c':
1090
1091   .. code-block:: c
1092
1093     #include <stdio.h>
1094
1095     int main() {
1096       printf("hello world\n");
1097       return 0;
1098     }
1099
1100#. Next, compile the C file into a native executable:
1101
1102   .. code-block:: console
1103
1104     % clang hello.c -o hello
1105
1106   .. note::
1107
1108     Clang works just like GCC by default.  The standard -S and -c arguments
1109     work as usual (producing a native .s or .o file, respectively).
1110
1111#. Next, compile the C file into an LLVM bitcode file:
1112
1113   .. code-block:: console
1114
1115     % clang -O3 -emit-llvm hello.c -c -o hello.bc
1116
1117   The -emit-llvm option can be used with the -S or -c options to emit an LLVM
1118   ``.ll`` or ``.bc`` file (respectively) for the code.  This allows you to use
1119   the `standard LLVM tools <CommandGuide/index.html>`_ on the bitcode file.
1120
1121#. Run the program in both forms. To run the program, use:
1122
1123   .. code-block:: console
1124
1125      % ./hello
1126
1127   and
1128
1129   .. code-block:: console
1130
1131     % lli hello.bc
1132
1133   The second examples shows how to invoke the LLVM JIT, :doc:`lli
1134   <CommandGuide/lli>`.
1135
1136#. Use the ``llvm-dis`` utility to take a look at the LLVM assembly code:
1137
1138   .. code-block:: console
1139
1140     % llvm-dis < hello.bc | less
1141
1142#. Compile the program to native assembly using the LLC code generator:
1143
1144   .. code-block:: console
1145
1146     % llc hello.bc -o hello.s
1147
1148#. Assemble the native assembly language file into a program:
1149
1150   .. code-block:: console
1151
1152     % /opt/SUNWspro/bin/cc -xarch=v9 hello.s -o hello.native   # On Solaris
1153
1154     % gcc hello.s -o hello.native                              # On others
1155
1156#. Execute the native code program:
1157
1158   .. code-block:: console
1159
1160     % ./hello.native
1161
1162   Note that using clang to compile directly to native code (i.e. when the
1163   ``-emit-llvm`` option is not present) does steps 6/7/8 for you.
1164
1165Common Problems
1166===============
1167
1168If you are having problems building or using LLVM, or if you have any other
1169general questions about LLVM, please consult the `Frequently Asked
1170Questions <FAQ.html>`_ page.
1171
1172If you are having problems with limited memory and build time, please try
1173building with ninja instead of make. Please consider configuring the
1174following options with cmake:
1175
1176 * -G Ninja
1177   Setting this option will allow you to build with ninja instead of make.
1178   Building with ninja significantly improves your build time, especially with
1179   incremental builds, and improves your memory usage.
1180
1181 * -DLLVM_USE_LINKER
1182   Setting this option to lld will significantly reduce linking time for LLVM
1183   executables on ELF-based platforms, such as Linux. If you are building LLVM
1184   for the first time and lld is not available to you as a binary package, then
1185   you may want to use the gold linker as a faster alternative to GNU ld.
1186
1187 * -DCMAKE_BUILD_TYPE
1188
1189    - Debug --- This is the default build type. This disables optimizations while
1190      compiling LLVM and enables debug info. On ELF-based platforms (e.g. Linux)
1191      linking with debug info may consume a large amount of memory.
1192
1193    - Release --- Turns on optimizations and disables debug info. Combining the
1194      Release build type with -DLLVM_ENABLE_ASSERTIONS=ON may be a good trade-off
1195      between speed and debugability during development, particularly for running
1196      the test suite.
1197
1198 * -DLLVM_ENABLE_ASSERTIONS
1199   This option defaults to ON for Debug builds and defaults to OFF for Release
1200   builds. As mentioned in the previous option, using the Release build type and
1201   enabling assertions may be a good alternative to using the Debug build type.
1202
1203 * -DLLVM_PARALLEL_LINK_JOBS
1204   Set this equal to number of jobs you wish to run simultaneously. This is
1205   similar to the -j option used with make, but only for link jobs. This option
1206   can only be used with ninja. You may wish to use a very low number of jobs,
1207   as this will greatly reduce the amount of memory used during the build
1208   process. If you have limited memory, you may wish to set this to 1.
1209
1210 * -DLLVM_TARGETS_TO_BUILD
1211   Set this equal to the target you wish to build. You may wish to set this to
1212   X86; however, you will find a full list of targets within the
1213   llvm-project/llvm/lib/Target directory.
1214
1215 * -DLLVM_OPTIMIZED_TABLEGEN
1216   Set this to ON to generate a fully optimized tablegen during your build. This
1217   will significantly improve your build time. This is only useful if you are
1218   using the Debug build type.
1219
1220 * -DLLVM_ENABLE_PROJECTS
1221   Set this equal to the projects you wish to compile (e.g. clang, lld, etc.) If
1222   compiling more than one project, separate the items with a semicolon. Should
1223   you run into issues with the semicolon, try surrounding it with single quotes.
1224
1225 * -DLLVM_ENABLE_RUNTIMES
1226   Set this equal to the runtimes you wish to compile (e.g. libcxx, libcxxabi, etc.)
1227   If compiling more than one runtime, separate the items with a semicolon. Should
1228   you run into issues with the semicolon, try surrounding it with single quotes.
1229
1230 * -DCLANG_ENABLE_STATIC_ANALYZER
1231   Set this option to OFF if you do not require the clang static analyzer. This
1232   should improve your build time slightly.
1233
1234 * -DLLVM_USE_SPLIT_DWARF
1235   Consider setting this to ON if you require a debug build, as this will ease
1236   memory pressure on the linker. This will make linking much faster, as the
1237   binaries will not contain any of the debug information; however, this will
1238   generate the debug information in the form of a DWARF object file (with the
1239   extension .dwo). This only applies to host platforms using ELF, such as Linux.
1240
1241.. _links:
1242
1243Links
1244=====
1245
1246This document is just an **introduction** on how to use LLVM to do some simple
1247things... there are many more interesting and complicated things that you can do
1248that aren't documented here (but we'll gladly accept a patch if you want to
1249write something up!).  For more information about LLVM, check out:
1250
1251* `LLVM Homepage <https://llvm.org/>`_
1252* `LLVM Doxygen Tree <https://llvm.org/doxygen/>`_
1253* `Starting a Project that Uses LLVM <https://llvm.org/docs/Projects.html>`_
1254
1255.. _installing arcanist: https://secure.phabricator.com/book/phabricator/article/arcanist_quick_start/
1256