1FileCheck - Flexible pattern matching file verifier
2===================================================
3
4SYNOPSIS
5--------
6
7:program:`FileCheck` *match-filename* [*--check-prefix=XXX*] [*--strict-whitespace*]
8
9DESCRIPTION
10-----------
11
12:program:`FileCheck` reads two files (one from standard input, and one
13specified on the command line) and uses one to verify the other.  This
14behavior is particularly useful for the testsuite, which wants to verify that
15the output of some tool (e.g. :program:`llc`) contains the expected information
16(for example, a movsd from esp or whatever is interesting).  This is similar to
17using :program:`grep`, but it is optimized for matching multiple different
18inputs in one file in a specific order.
19
20The ``match-filename`` file specifies the file that contains the patterns to
21match.  The file to verify is read from standard input unless the
22:option:`--input-file` option is used.
23
24OPTIONS
25-------
26
27.. option:: -help
28
29 Print a summary of command line options.
30
31.. option:: --check-prefix prefix
32
33 FileCheck searches the contents of ``match-filename`` for patterns to
34 match.  By default, these patterns are prefixed with "``CHECK:``".
35 If you'd like to use a different prefix (e.g. because the same input
36 file is checking multiple different tool or options), the
37 :option:`--check-prefix` argument allows you to specify one or more
38 prefixes to match. Multiple prefixes are useful for tests which might
39 change for different run options, but most lines remain the same.
40
41.. option:: --check-prefixes prefix1,prefix2,...
42
43 An alias of :option:`--check-prefix` that allows multiple prefixes to be
44 specified as a comma separated list.
45
46.. option:: --input-file filename
47
48  File to check (defaults to stdin).
49
50.. option:: --match-full-lines
51
52 By default, FileCheck allows matches of anywhere on a line. This
53 option will require all positive matches to cover an entire
54 line. Leading and trailing whitespace is ignored, unless
55 :option:`--strict-whitespace` is also specified. (Note: negative
56 matches from ``CHECK-NOT`` are not affected by this option!)
57
58 Passing this option is equivalent to inserting ``{{^ *}}`` or
59 ``{{^}}`` before, and ``{{ *$}}`` or ``{{$}}`` after every positive
60 check pattern.
61
62.. option:: --strict-whitespace
63
64 By default, FileCheck canonicalizes input horizontal whitespace (spaces and
65 tabs) which causes it to ignore these differences (a space will match a tab).
66 The :option:`--strict-whitespace` argument disables this behavior. End-of-line
67 sequences are canonicalized to UNIX-style ``\n`` in all modes.
68
69.. option:: --implicit-check-not check-pattern
70
71  Adds implicit negative checks for the specified patterns between positive
72  checks. The option allows writing stricter tests without stuffing them with
73  ``CHECK-NOT``\ s.
74
75  For example, "``--implicit-check-not warning:``" can be useful when testing
76  diagnostic messages from tools that don't have an option similar to ``clang
77  -verify``. With this option FileCheck will verify that input does not contain
78  warnings not covered by any ``CHECK:`` patterns.
79
80.. option:: --enable-var-scope
81
82  Enables scope for regex variables.
83
84  Variables with names that start with ``$`` are considered global and
85  remain set throughout the file.
86
87  All other variables get undefined after each encountered ``CHECK-LABEL``.
88
89.. option:: -D<VAR=VALUE>
90
91  Sets a filecheck variable ``VAR`` with value ``VALUE`` that can be used in
92  ``CHECK:`` lines.
93
94.. option:: -version
95
96 Show the version number of this program.
97
98EXIT STATUS
99-----------
100
101If :program:`FileCheck` verifies that the file matches the expected contents,
102it exits with 0.  Otherwise, if not, or if an error occurs, it will exit with a
103non-zero value.
104
105TUTORIAL
106--------
107
108FileCheck is typically used from LLVM regression tests, being invoked on the RUN
109line of the test.  A simple example of using FileCheck from a RUN line looks
110like this:
111
112.. code-block:: llvm
113
114   ; RUN: llvm-as < %s | llc -march=x86-64 | FileCheck %s
115
116This syntax says to pipe the current file ("``%s``") into ``llvm-as``, pipe
117that into ``llc``, then pipe the output of ``llc`` into ``FileCheck``.  This
118means that FileCheck will be verifying its standard input (the llc output)
119against the filename argument specified (the original ``.ll`` file specified by
120"``%s``").  To see how this works, let's look at the rest of the ``.ll`` file
121(after the RUN line):
122
123.. code-block:: llvm
124
125   define void @sub1(i32* %p, i32 %v) {
126   entry:
127   ; CHECK: sub1:
128   ; CHECK: subl
129           %0 = tail call i32 @llvm.atomic.load.sub.i32.p0i32(i32* %p, i32 %v)
130           ret void
131   }
132
133   define void @inc4(i64* %p) {
134   entry:
135   ; CHECK: inc4:
136   ; CHECK: incq
137           %0 = tail call i64 @llvm.atomic.load.add.i64.p0i64(i64* %p, i64 1)
138           ret void
139   }
140
141Here you can see some "``CHECK:``" lines specified in comments.  Now you can
142see how the file is piped into ``llvm-as``, then ``llc``, and the machine code
143output is what we are verifying.  FileCheck checks the machine code output to
144verify that it matches what the "``CHECK:``" lines specify.
145
146The syntax of the "``CHECK:``" lines is very simple: they are fixed strings that
147must occur in order.  FileCheck defaults to ignoring horizontal whitespace
148differences (e.g. a space is allowed to match a tab) but otherwise, the contents
149of the "``CHECK:``" line is required to match some thing in the test file exactly.
150
151One nice thing about FileCheck (compared to grep) is that it allows merging
152test cases together into logical groups.  For example, because the test above
153is checking for the "``sub1:``" and "``inc4:``" labels, it will not match
154unless there is a "``subl``" in between those labels.  If it existed somewhere
155else in the file, that would not count: "``grep subl``" matches if "``subl``"
156exists anywhere in the file.
157
158The FileCheck -check-prefix option
159~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
160
161The FileCheck `-check-prefix` option allows multiple test
162configurations to be driven from one `.ll` file.  This is useful in many
163circumstances, for example, testing different architectural variants with
164:program:`llc`.  Here's a simple example:
165
166.. code-block:: llvm
167
168   ; RUN: llvm-as < %s | llc -mtriple=i686-apple-darwin9 -mattr=sse41 \
169   ; RUN:              | FileCheck %s -check-prefix=X32
170   ; RUN: llvm-as < %s | llc -mtriple=x86_64-apple-darwin9 -mattr=sse41 \
171   ; RUN:              | FileCheck %s -check-prefix=X64
172
173   define <4 x i32> @pinsrd_1(i32 %s, <4 x i32> %tmp) nounwind {
174           %tmp1 = insertelement <4 x i32>; %tmp, i32 %s, i32 1
175           ret <4 x i32> %tmp1
176   ; X32: pinsrd_1:
177   ; X32:    pinsrd $1, 4(%esp), %xmm0
178
179   ; X64: pinsrd_1:
180   ; X64:    pinsrd $1, %edi, %xmm0
181   }
182
183In this case, we're testing that we get the expected code generation with
184both 32-bit and 64-bit code generation.
185
186The "CHECK-NEXT:" directive
187~~~~~~~~~~~~~~~~~~~~~~~~~~~
188
189Sometimes you want to match lines and would like to verify that matches
190happen on exactly consecutive lines with no other lines in between them.  In
191this case, you can use "``CHECK:``" and "``CHECK-NEXT:``" directives to specify
192this.  If you specified a custom check prefix, just use "``<PREFIX>-NEXT:``".
193For example, something like this works as you'd expect:
194
195.. code-block:: llvm
196
197   define void @t2(<2 x double>* %r, <2 x double>* %A, double %B) {
198 	%tmp3 = load <2 x double>* %A, align 16
199 	%tmp7 = insertelement <2 x double> undef, double %B, i32 0
200 	%tmp9 = shufflevector <2 x double> %tmp3,
201                               <2 x double> %tmp7,
202                               <2 x i32> < i32 0, i32 2 >
203 	store <2 x double> %tmp9, <2 x double>* %r, align 16
204 	ret void
205
206   ; CHECK:          t2:
207   ; CHECK: 	        movl	8(%esp), %eax
208   ; CHECK-NEXT: 	movapd	(%eax), %xmm0
209   ; CHECK-NEXT: 	movhpd	12(%esp), %xmm0
210   ; CHECK-NEXT: 	movl	4(%esp), %eax
211   ; CHECK-NEXT: 	movapd	%xmm0, (%eax)
212   ; CHECK-NEXT: 	ret
213   }
214
215"``CHECK-NEXT:``" directives reject the input unless there is exactly one
216newline between it and the previous directive.  A "``CHECK-NEXT:``" cannot be
217the first directive in a file.
218
219The "CHECK-SAME:" directive
220~~~~~~~~~~~~~~~~~~~~~~~~~~~
221
222Sometimes you want to match lines and would like to verify that matches happen
223on the same line as the previous match.  In this case, you can use "``CHECK:``"
224and "``CHECK-SAME:``" directives to specify this.  If you specified a custom
225check prefix, just use "``<PREFIX>-SAME:``".
226
227"``CHECK-SAME:``" is particularly powerful in conjunction with "``CHECK-NOT:``"
228(described below).
229
230For example, the following works like you'd expect:
231
232.. code-block:: llvm
233
234   !0 = !DILocation(line: 5, scope: !1, inlinedAt: !2)
235
236   ; CHECK:       !DILocation(line: 5,
237   ; CHECK-NOT:               column:
238   ; CHECK-SAME:              scope: ![[SCOPE:[0-9]+]]
239
240"``CHECK-SAME:``" directives reject the input if there are any newlines between
241it and the previous directive.  A "``CHECK-SAME:``" cannot be the first
242directive in a file.
243
244The "CHECK-EMPTY:" directive
245~~~~~~~~~~~~~~~~~~~~~~~~~~~~
246
247If you need to check that the next line has nothing on it, not even whitespace,
248you can use the "``CHECK-EMPTY:``" directive.
249
250.. code-block:: llvm
251
252   foo
253
254   bar
255   ; CHECK: foo
256   ; CHECK-EMPTY:
257   ; CHECK-NEXT: bar
258
259Just like "``CHECK-NEXT:``" the directive will fail if there is more than one
260newline before it finds the next blank line, and it cannot be the first
261directive in a file.
262
263The "CHECK-NOT:" directive
264~~~~~~~~~~~~~~~~~~~~~~~~~~
265
266The "``CHECK-NOT:``" directive is used to verify that a string doesn't occur
267between two matches (or before the first match, or after the last match).  For
268example, to verify that a load is removed by a transformation, a test like this
269can be used:
270
271.. code-block:: llvm
272
273   define i8 @coerce_offset0(i32 %V, i32* %P) {
274     store i32 %V, i32* %P
275
276     %P2 = bitcast i32* %P to i8*
277     %P3 = getelementptr i8* %P2, i32 2
278
279     %A = load i8* %P3
280     ret i8 %A
281   ; CHECK: @coerce_offset0
282   ; CHECK-NOT: load
283   ; CHECK: ret i8
284   }
285
286The "CHECK-DAG:" directive
287~~~~~~~~~~~~~~~~~~~~~~~~~~
288
289If it's necessary to match strings that don't occur in a strictly sequential
290order, "``CHECK-DAG:``" could be used to verify them between two matches (or
291before the first match, or after the last match). For example, clang emits
292vtable globals in reverse order. Using ``CHECK-DAG:``, we can keep the checks
293in the natural order:
294
295.. code-block:: c++
296
297    // RUN: %clang_cc1 %s -emit-llvm -o - | FileCheck %s
298
299    struct Foo { virtual void method(); };
300    Foo f;  // emit vtable
301    // CHECK-DAG: @_ZTV3Foo =
302
303    struct Bar { virtual void method(); };
304    Bar b;
305    // CHECK-DAG: @_ZTV3Bar =
306
307``CHECK-NOT:`` directives could be mixed with ``CHECK-DAG:`` directives to
308exclude strings between the surrounding ``CHECK-DAG:`` directives. As a result,
309the surrounding ``CHECK-DAG:`` directives cannot be reordered, i.e. all
310occurrences matching ``CHECK-DAG:`` before ``CHECK-NOT:`` must not fall behind
311occurrences matching ``CHECK-DAG:`` after ``CHECK-NOT:``. For example,
312
313.. code-block:: llvm
314
315   ; CHECK-DAG: BEFORE
316   ; CHECK-NOT: NOT
317   ; CHECK-DAG: AFTER
318
319This case will reject input strings where ``BEFORE`` occurs after ``AFTER``.
320
321With captured variables, ``CHECK-DAG:`` is able to match valid topological
322orderings of a DAG with edges from the definition of a variable to its use.
323It's useful, e.g., when your test cases need to match different output
324sequences from the instruction scheduler. For example,
325
326.. code-block:: llvm
327
328   ; CHECK-DAG: add [[REG1:r[0-9]+]], r1, r2
329   ; CHECK-DAG: add [[REG2:r[0-9]+]], r3, r4
330   ; CHECK:     mul r5, [[REG1]], [[REG2]]
331
332In this case, any order of that two ``add`` instructions will be allowed.
333
334If you are defining `and` using variables in the same ``CHECK-DAG:`` block,
335be aware that the definition rule can match `after` its use.
336
337So, for instance, the code below will pass:
338
339.. code-block:: text
340
341  ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0]
342  ; CHECK-DAG: vmov.32 [[REG2]][1]
343  vmov.32 d0[1]
344  vmov.32 d0[0]
345
346While this other code, will not:
347
348.. code-block:: text
349
350  ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0]
351  ; CHECK-DAG: vmov.32 [[REG2]][1]
352  vmov.32 d1[1]
353  vmov.32 d0[0]
354
355While this can be very useful, it's also dangerous, because in the case of
356register sequence, you must have a strong order (read before write, copy before
357use, etc). If the definition your test is looking for doesn't match (because
358of a bug in the compiler), it may match further away from the use, and mask
359real bugs away.
360
361In those cases, to enforce the order, use a non-DAG directive between DAG-blocks.
362
363The "CHECK-LABEL:" directive
364~~~~~~~~~~~~~~~~~~~~~~~~~~~~
365
366Sometimes in a file containing multiple tests divided into logical blocks, one
367or more ``CHECK:`` directives may inadvertently succeed by matching lines in a
368later block. While an error will usually eventually be generated, the check
369flagged as causing the error may not actually bear any relationship to the
370actual source of the problem.
371
372In order to produce better error messages in these cases, the "``CHECK-LABEL:``"
373directive can be used. It is treated identically to a normal ``CHECK``
374directive except that FileCheck makes an additional assumption that a line
375matched by the directive cannot also be matched by any other check present in
376``match-filename``; this is intended to be used for lines containing labels or
377other unique identifiers. Conceptually, the presence of ``CHECK-LABEL`` divides
378the input stream into separate blocks, each of which is processed independently,
379preventing a ``CHECK:`` directive in one block matching a line in another block.
380If ``--enable-var-scope`` is in effect, all local variables are cleared at the
381beginning of the block.
382
383For example,
384
385.. code-block:: llvm
386
387  define %struct.C* @C_ctor_base(%struct.C* %this, i32 %x) {
388  entry:
389  ; CHECK-LABEL: C_ctor_base:
390  ; CHECK: mov [[SAVETHIS:r[0-9]+]], r0
391  ; CHECK: bl A_ctor_base
392  ; CHECK: mov r0, [[SAVETHIS]]
393    %0 = bitcast %struct.C* %this to %struct.A*
394    %call = tail call %struct.A* @A_ctor_base(%struct.A* %0)
395    %1 = bitcast %struct.C* %this to %struct.B*
396    %call2 = tail call %struct.B* @B_ctor_base(%struct.B* %1, i32 %x)
397    ret %struct.C* %this
398  }
399
400  define %struct.D* @D_ctor_base(%struct.D* %this, i32 %x) {
401  entry:
402  ; CHECK-LABEL: D_ctor_base:
403
404The use of ``CHECK-LABEL:`` directives in this case ensures that the three
405``CHECK:`` directives only accept lines corresponding to the body of the
406``@C_ctor_base`` function, even if the patterns match lines found later in
407the file. Furthermore, if one of these three ``CHECK:`` directives fail,
408FileCheck will recover by continuing to the next block, allowing multiple test
409failures to be detected in a single invocation.
410
411There is no requirement that ``CHECK-LABEL:`` directives contain strings that
412correspond to actual syntactic labels in a source or output language: they must
413simply uniquely match a single line in the file being verified.
414
415``CHECK-LABEL:`` directives cannot contain variable definitions or uses.
416
417FileCheck Pattern Matching Syntax
418~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
419
420All FileCheck directives take a pattern to match.
421For most uses of FileCheck, fixed string matching is perfectly sufficient.  For
422some things, a more flexible form of matching is desired.  To support this,
423FileCheck allows you to specify regular expressions in matching strings,
424surrounded by double braces: ``{{yourregex}}``. FileCheck implements a POSIX
425regular expression matcher; it supports Extended POSIX regular expressions
426(ERE). Because we want to use fixed string matching for a majority of what we
427do, FileCheck has been designed to support mixing and matching fixed string
428matching with regular expressions.  This allows you to write things like this:
429
430.. code-block:: llvm
431
432   ; CHECK: movhpd	{{[0-9]+}}(%esp), {{%xmm[0-7]}}
433
434In this case, any offset from the ESP register will be allowed, and any xmm
435register will be allowed.
436
437Because regular expressions are enclosed with double braces, they are
438visually distinct, and you don't need to use escape characters within the double
439braces like you would in C.  In the rare case that you want to match double
440braces explicitly from the input, you can use something ugly like
441``{{[{][{]}}`` as your pattern.
442
443FileCheck Variables
444~~~~~~~~~~~~~~~~~~~
445
446It is often useful to match a pattern and then verify that it occurs again
447later in the file.  For codegen tests, this can be useful to allow any register,
448but verify that that register is used consistently later.  To do this,
449:program:`FileCheck` allows named variables to be defined and substituted into
450patterns.  Here is a simple example:
451
452.. code-block:: llvm
453
454   ; CHECK: test5:
455   ; CHECK:    notw	[[REGISTER:%[a-z]+]]
456   ; CHECK:    andw	{{.*}}[[REGISTER]]
457
458The first check line matches a regex ``%[a-z]+`` and captures it into the
459variable ``REGISTER``.  The second line verifies that whatever is in
460``REGISTER`` occurs later in the file after an "``andw``".  :program:`FileCheck`
461variable references are always contained in ``[[ ]]`` pairs, and their names can
462be formed with the regex ``[a-zA-Z_][a-zA-Z0-9_]*``.  If a colon follows the name,
463then it is a definition of the variable; otherwise, it is a use.
464
465:program:`FileCheck` variables can be defined multiple times, and uses always
466get the latest value.  Variables can also be used later on the same line they
467were defined on. For example:
468
469.. code-block:: llvm
470
471    ; CHECK: op [[REG:r[0-9]+]], [[REG]]
472
473Can be useful if you want the operands of ``op`` to be the same register,
474and don't care exactly which register it is.
475
476If ``--enable-var-scope`` is in effect, variables with names that
477start with ``$`` are considered to be global. All others variables are
478local.  All local variables get undefined at the beginning of each
479CHECK-LABEL block. Global variables are not affected by CHECK-LABEL.
480This makes it easier to ensure that individual tests are not affected
481by variables set in preceding tests.
482
483FileCheck Expressions
484~~~~~~~~~~~~~~~~~~~~~
485
486Sometimes there's a need to verify output which refers line numbers of the
487match file, e.g. when testing compiler diagnostics.  This introduces a certain
488fragility of the match file structure, as "``CHECK:``" lines contain absolute
489line numbers in the same file, which have to be updated whenever line numbers
490change due to text addition or deletion.
491
492To support this case, FileCheck allows using ``[[@LINE]]``,
493``[[@LINE+<offset>]]``, ``[[@LINE-<offset>]]`` expressions in patterns. These
494expressions expand to a number of the line where a pattern is located (with an
495optional integer offset).
496
497This way match patterns can be put near the relevant test lines and include
498relative line number references, for example:
499
500.. code-block:: c++
501
502   // CHECK: test.cpp:[[@LINE+4]]:6: error: expected ';' after top level declarator
503   // CHECK-NEXT: {{^int a}}
504   // CHECK-NEXT: {{^     \^}}
505   // CHECK-NEXT: {{^     ;}}
506   int a
507
508Matching Newline Characters
509~~~~~~~~~~~~~~~~~~~~~~~~~~~
510
511To match newline characters in regular expressions the character class
512``[[:space:]]`` can be used. For example, the following pattern:
513
514.. code-block:: c++
515
516   // CHECK: DW_AT_location [DW_FORM_sec_offset] ([[DLOC:0x[0-9a-f]+]]){{[[:space:]].*}}"intd"
517
518matches output of the form (from llvm-dwarfdump):
519
520.. code-block:: text
521
522       DW_AT_location [DW_FORM_sec_offset]   (0x00000233)
523       DW_AT_name [DW_FORM_strp]  ( .debug_str[0x000000c9] = "intd")
524
525letting us set the :program:`FileCheck` variable ``DLOC`` to the desired value
526``0x00000233``, extracted from the line immediately preceding "``intd``".
527