1FileCheck - Flexible pattern matching file verifier
2===================================================
3
4SYNOPSIS
5--------
6
7:program:`FileCheck` *match-filename* [*--check-prefix=XXX*] [*--strict-whitespace*]
8
9DESCRIPTION
10-----------
11
12:program:`FileCheck` reads two files (one from standard input, and one
13specified on the command line) and uses one to verify the other.  This
14behavior is particularly useful for the testsuite, which wants to verify that
15the output of some tool (e.g. :program:`llc`) contains the expected information
16(for example, a movsd from esp or whatever is interesting).  This is similar to
17using :program:`grep`, but it is optimized for matching multiple different
18inputs in one file in a specific order.
19
20The ``match-filename`` file specifies the file that contains the patterns to
21match.  The file to verify is read from standard input unless the
22:option:`--input-file` option is used.
23
24OPTIONS
25-------
26
27Options are parsed from the environment variable ``FILECHECK_OPTS``
28and from the command line.
29
30.. option:: -help
31
32 Print a summary of command line options.
33
34.. option:: --check-prefix prefix
35
36 FileCheck searches the contents of ``match-filename`` for patterns to
37 match.  By default, these patterns are prefixed with "``CHECK:``".
38 If you'd like to use a different prefix (e.g. because the same input
39 file is checking multiple different tool or options), the
40 :option:`--check-prefix` argument allows you to specify one or more
41 prefixes to match. Multiple prefixes are useful for tests which might
42 change for different run options, but most lines remain the same.
43
44.. option:: --check-prefixes prefix1,prefix2,...
45
46 An alias of :option:`--check-prefix` that allows multiple prefixes to be
47 specified as a comma separated list.
48
49.. option:: --input-file filename
50
51  File to check (defaults to stdin).
52
53.. option:: --match-full-lines
54
55 By default, FileCheck allows matches of anywhere on a line. This
56 option will require all positive matches to cover an entire
57 line. Leading and trailing whitespace is ignored, unless
58 :option:`--strict-whitespace` is also specified. (Note: negative
59 matches from ``CHECK-NOT`` are not affected by this option!)
60
61 Passing this option is equivalent to inserting ``{{^ *}}`` or
62 ``{{^}}`` before, and ``{{ *$}}`` or ``{{$}}`` after every positive
63 check pattern.
64
65.. option:: --strict-whitespace
66
67 By default, FileCheck canonicalizes input horizontal whitespace (spaces and
68 tabs) which causes it to ignore these differences (a space will match a tab).
69 The :option:`--strict-whitespace` argument disables this behavior. End-of-line
70 sequences are canonicalized to UNIX-style ``\n`` in all modes.
71
72.. option:: --implicit-check-not check-pattern
73
74  Adds implicit negative checks for the specified patterns between positive
75  checks. The option allows writing stricter tests without stuffing them with
76  ``CHECK-NOT``\ s.
77
78  For example, "``--implicit-check-not warning:``" can be useful when testing
79  diagnostic messages from tools that don't have an option similar to ``clang
80  -verify``. With this option FileCheck will verify that input does not contain
81  warnings not covered by any ``CHECK:`` patterns.
82
83.. option:: --dump-input <mode>
84
85  Dump input to stderr, adding annotations representing currently enabled
86  diagnostics.  Do this either 'always', on 'fail', or 'never'.  Specify 'help'
87  to explain the dump format and quit.
88
89.. option:: --dump-input-on-failure
90
91  When the check fails, dump all of the original input.  This option is
92  deprecated in favor of `--dump-input=fail`.
93
94.. option:: --enable-var-scope
95
96  Enables scope for regex variables.
97
98  Variables with names that start with ``$`` are considered global and
99  remain set throughout the file.
100
101  All other variables get undefined after each encountered ``CHECK-LABEL``.
102
103.. option:: -D<VAR=VALUE>
104
105  Sets a filecheck variable ``VAR`` with value ``VALUE`` that can be used in
106  ``CHECK:`` lines.
107
108.. option:: -version
109
110 Show the version number of this program.
111
112.. option:: -v
113
114  Print directive pattern matches.
115
116.. option:: -vv
117
118  Print information helpful in diagnosing internal FileCheck issues, such as
119  discarded overlapping ``CHECK-DAG:`` matches, implicit EOF pattern matches,
120  and ``CHECK-NOT:`` patterns that do not have matches.  Implies ``-v``.
121
122.. option:: --allow-deprecated-dag-overlap
123
124  Enable overlapping among matches in a group of consecutive ``CHECK-DAG:``
125  directives.  This option is deprecated and is only provided for convenience
126  as old tests are migrated to the new non-overlapping ``CHECK-DAG:``
127  implementation.
128
129.. option:: --color
130
131  Use colors in output (autodetected by default).
132
133EXIT STATUS
134-----------
135
136If :program:`FileCheck` verifies that the file matches the expected contents,
137it exits with 0.  Otherwise, if not, or if an error occurs, it will exit with a
138non-zero value.
139
140TUTORIAL
141--------
142
143FileCheck is typically used from LLVM regression tests, being invoked on the RUN
144line of the test.  A simple example of using FileCheck from a RUN line looks
145like this:
146
147.. code-block:: llvm
148
149   ; RUN: llvm-as < %s | llc -march=x86-64 | FileCheck %s
150
151This syntax says to pipe the current file ("``%s``") into ``llvm-as``, pipe
152that into ``llc``, then pipe the output of ``llc`` into ``FileCheck``.  This
153means that FileCheck will be verifying its standard input (the llc output)
154against the filename argument specified (the original ``.ll`` file specified by
155"``%s``").  To see how this works, let's look at the rest of the ``.ll`` file
156(after the RUN line):
157
158.. code-block:: llvm
159
160   define void @sub1(i32* %p, i32 %v) {
161   entry:
162   ; CHECK: sub1:
163   ; CHECK: subl
164           %0 = tail call i32 @llvm.atomic.load.sub.i32.p0i32(i32* %p, i32 %v)
165           ret void
166   }
167
168   define void @inc4(i64* %p) {
169   entry:
170   ; CHECK: inc4:
171   ; CHECK: incq
172           %0 = tail call i64 @llvm.atomic.load.add.i64.p0i64(i64* %p, i64 1)
173           ret void
174   }
175
176Here you can see some "``CHECK:``" lines specified in comments.  Now you can
177see how the file is piped into ``llvm-as``, then ``llc``, and the machine code
178output is what we are verifying.  FileCheck checks the machine code output to
179verify that it matches what the "``CHECK:``" lines specify.
180
181The syntax of the "``CHECK:``" lines is very simple: they are fixed strings that
182must occur in order.  FileCheck defaults to ignoring horizontal whitespace
183differences (e.g. a space is allowed to match a tab) but otherwise, the contents
184of the "``CHECK:``" line is required to match some thing in the test file exactly.
185
186One nice thing about FileCheck (compared to grep) is that it allows merging
187test cases together into logical groups.  For example, because the test above
188is checking for the "``sub1:``" and "``inc4:``" labels, it will not match
189unless there is a "``subl``" in between those labels.  If it existed somewhere
190else in the file, that would not count: "``grep subl``" matches if "``subl``"
191exists anywhere in the file.
192
193The FileCheck -check-prefix option
194~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
195
196The FileCheck `-check-prefix` option allows multiple test
197configurations to be driven from one `.ll` file.  This is useful in many
198circumstances, for example, testing different architectural variants with
199:program:`llc`.  Here's a simple example:
200
201.. code-block:: llvm
202
203   ; RUN: llvm-as < %s | llc -mtriple=i686-apple-darwin9 -mattr=sse41 \
204   ; RUN:              | FileCheck %s -check-prefix=X32
205   ; RUN: llvm-as < %s | llc -mtriple=x86_64-apple-darwin9 -mattr=sse41 \
206   ; RUN:              | FileCheck %s -check-prefix=X64
207
208   define <4 x i32> @pinsrd_1(i32 %s, <4 x i32> %tmp) nounwind {
209           %tmp1 = insertelement <4 x i32>; %tmp, i32 %s, i32 1
210           ret <4 x i32> %tmp1
211   ; X32: pinsrd_1:
212   ; X32:    pinsrd $1, 4(%esp), %xmm0
213
214   ; X64: pinsrd_1:
215   ; X64:    pinsrd $1, %edi, %xmm0
216   }
217
218In this case, we're testing that we get the expected code generation with
219both 32-bit and 64-bit code generation.
220
221The "CHECK-NEXT:" directive
222~~~~~~~~~~~~~~~~~~~~~~~~~~~
223
224Sometimes you want to match lines and would like to verify that matches
225happen on exactly consecutive lines with no other lines in between them.  In
226this case, you can use "``CHECK:``" and "``CHECK-NEXT:``" directives to specify
227this.  If you specified a custom check prefix, just use "``<PREFIX>-NEXT:``".
228For example, something like this works as you'd expect:
229
230.. code-block:: llvm
231
232   define void @t2(<2 x double>* %r, <2 x double>* %A, double %B) {
233 	%tmp3 = load <2 x double>* %A, align 16
234 	%tmp7 = insertelement <2 x double> undef, double %B, i32 0
235 	%tmp9 = shufflevector <2 x double> %tmp3,
236                               <2 x double> %tmp7,
237                               <2 x i32> < i32 0, i32 2 >
238 	store <2 x double> %tmp9, <2 x double>* %r, align 16
239 	ret void
240
241   ; CHECK:          t2:
242   ; CHECK: 	        movl	8(%esp), %eax
243   ; CHECK-NEXT: 	movapd	(%eax), %xmm0
244   ; CHECK-NEXT: 	movhpd	12(%esp), %xmm0
245   ; CHECK-NEXT: 	movl	4(%esp), %eax
246   ; CHECK-NEXT: 	movapd	%xmm0, (%eax)
247   ; CHECK-NEXT: 	ret
248   }
249
250"``CHECK-NEXT:``" directives reject the input unless there is exactly one
251newline between it and the previous directive.  A "``CHECK-NEXT:``" cannot be
252the first directive in a file.
253
254The "CHECK-SAME:" directive
255~~~~~~~~~~~~~~~~~~~~~~~~~~~
256
257Sometimes you want to match lines and would like to verify that matches happen
258on the same line as the previous match.  In this case, you can use "``CHECK:``"
259and "``CHECK-SAME:``" directives to specify this.  If you specified a custom
260check prefix, just use "``<PREFIX>-SAME:``".
261
262"``CHECK-SAME:``" is particularly powerful in conjunction with "``CHECK-NOT:``"
263(described below).
264
265For example, the following works like you'd expect:
266
267.. code-block:: llvm
268
269   !0 = !DILocation(line: 5, scope: !1, inlinedAt: !2)
270
271   ; CHECK:       !DILocation(line: 5,
272   ; CHECK-NOT:               column:
273   ; CHECK-SAME:              scope: ![[SCOPE:[0-9]+]]
274
275"``CHECK-SAME:``" directives reject the input if there are any newlines between
276it and the previous directive.  A "``CHECK-SAME:``" cannot be the first
277directive in a file.
278
279The "CHECK-EMPTY:" directive
280~~~~~~~~~~~~~~~~~~~~~~~~~~~~
281
282If you need to check that the next line has nothing on it, not even whitespace,
283you can use the "``CHECK-EMPTY:``" directive.
284
285.. code-block:: llvm
286
287   declare void @foo()
288
289   declare void @bar()
290   ; CHECK: foo
291   ; CHECK-EMPTY:
292   ; CHECK-NEXT: bar
293
294Just like "``CHECK-NEXT:``" the directive will fail if there is more than one
295newline before it finds the next blank line, and it cannot be the first
296directive in a file.
297
298The "CHECK-NOT:" directive
299~~~~~~~~~~~~~~~~~~~~~~~~~~
300
301The "``CHECK-NOT:``" directive is used to verify that a string doesn't occur
302between two matches (or before the first match, or after the last match).  For
303example, to verify that a load is removed by a transformation, a test like this
304can be used:
305
306.. code-block:: llvm
307
308   define i8 @coerce_offset0(i32 %V, i32* %P) {
309     store i32 %V, i32* %P
310
311     %P2 = bitcast i32* %P to i8*
312     %P3 = getelementptr i8* %P2, i32 2
313
314     %A = load i8* %P3
315     ret i8 %A
316   ; CHECK: @coerce_offset0
317   ; CHECK-NOT: load
318   ; CHECK: ret i8
319   }
320
321The "CHECK-COUNT:" directive
322~~~~~~~~~~~~~~~~~~~~~~~~~~~~
323
324If you need to match multiple lines with the same pattern over and over again
325you can repeat a plain ``CHECK:`` as many times as needed. If that looks too
326boring you can instead use a counted check "``CHECK-COUNT-<num>:``", where
327``<num>`` is a positive decimal number. It will match the pattern exactly
328``<num>`` times, no more and no less. If you specified a custom check prefix,
329just use "``<PREFIX>-COUNT-<num>:``" for the same effect.
330Here is a simple example:
331
332.. code-block:: text
333
334   Loop at depth 1
335   Loop at depth 1
336   Loop at depth 1
337   Loop at depth 1
338     Loop at depth 2
339       Loop at depth 3
340
341   ; CHECK-COUNT-6: Loop at depth {{[0-9]+}}
342   ; CHECK-NOT:     Loop at depth {{[0-9]+}}
343
344The "CHECK-DAG:" directive
345~~~~~~~~~~~~~~~~~~~~~~~~~~
346
347If it's necessary to match strings that don't occur in a strictly sequential
348order, "``CHECK-DAG:``" could be used to verify them between two matches (or
349before the first match, or after the last match). For example, clang emits
350vtable globals in reverse order. Using ``CHECK-DAG:``, we can keep the checks
351in the natural order:
352
353.. code-block:: c++
354
355    // RUN: %clang_cc1 %s -emit-llvm -o - | FileCheck %s
356
357    struct Foo { virtual void method(); };
358    Foo f;  // emit vtable
359    // CHECK-DAG: @_ZTV3Foo =
360
361    struct Bar { virtual void method(); };
362    Bar b;
363    // CHECK-DAG: @_ZTV3Bar =
364
365``CHECK-NOT:`` directives could be mixed with ``CHECK-DAG:`` directives to
366exclude strings between the surrounding ``CHECK-DAG:`` directives. As a result,
367the surrounding ``CHECK-DAG:`` directives cannot be reordered, i.e. all
368occurrences matching ``CHECK-DAG:`` before ``CHECK-NOT:`` must not fall behind
369occurrences matching ``CHECK-DAG:`` after ``CHECK-NOT:``. For example,
370
371.. code-block:: llvm
372
373   ; CHECK-DAG: BEFORE
374   ; CHECK-NOT: NOT
375   ; CHECK-DAG: AFTER
376
377This case will reject input strings where ``BEFORE`` occurs after ``AFTER``.
378
379With captured variables, ``CHECK-DAG:`` is able to match valid topological
380orderings of a DAG with edges from the definition of a variable to its use.
381It's useful, e.g., when your test cases need to match different output
382sequences from the instruction scheduler. For example,
383
384.. code-block:: llvm
385
386   ; CHECK-DAG: add [[REG1:r[0-9]+]], r1, r2
387   ; CHECK-DAG: add [[REG2:r[0-9]+]], r3, r4
388   ; CHECK:     mul r5, [[REG1]], [[REG2]]
389
390In this case, any order of that two ``add`` instructions will be allowed.
391
392If you are defining `and` using variables in the same ``CHECK-DAG:`` block,
393be aware that the definition rule can match `after` its use.
394
395So, for instance, the code below will pass:
396
397.. code-block:: text
398
399  ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0]
400  ; CHECK-DAG: vmov.32 [[REG2]][1]
401  vmov.32 d0[1]
402  vmov.32 d0[0]
403
404While this other code, will not:
405
406.. code-block:: text
407
408  ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0]
409  ; CHECK-DAG: vmov.32 [[REG2]][1]
410  vmov.32 d1[1]
411  vmov.32 d0[0]
412
413While this can be very useful, it's also dangerous, because in the case of
414register sequence, you must have a strong order (read before write, copy before
415use, etc). If the definition your test is looking for doesn't match (because
416of a bug in the compiler), it may match further away from the use, and mask
417real bugs away.
418
419In those cases, to enforce the order, use a non-DAG directive between DAG-blocks.
420
421A ``CHECK-DAG:`` directive skips matches that overlap the matches of any
422preceding ``CHECK-DAG:`` directives in the same ``CHECK-DAG:`` block.  Not only
423is this non-overlapping behavior consistent with other directives, but it's
424also necessary to handle sets of non-unique strings or patterns.  For example,
425the following directives look for unordered log entries for two tasks in a
426parallel program, such as the OpenMP runtime:
427
428.. code-block:: text
429
430    // CHECK-DAG: [[THREAD_ID:[0-9]+]]: task_begin
431    // CHECK-DAG: [[THREAD_ID]]: task_end
432    //
433    // CHECK-DAG: [[THREAD_ID:[0-9]+]]: task_begin
434    // CHECK-DAG: [[THREAD_ID]]: task_end
435
436The second pair of directives is guaranteed not to match the same log entries
437as the first pair even though the patterns are identical and even if the text
438of the log entries is identical because the thread ID manages to be reused.
439
440The "CHECK-LABEL:" directive
441~~~~~~~~~~~~~~~~~~~~~~~~~~~~
442
443Sometimes in a file containing multiple tests divided into logical blocks, one
444or more ``CHECK:`` directives may inadvertently succeed by matching lines in a
445later block. While an error will usually eventually be generated, the check
446flagged as causing the error may not actually bear any relationship to the
447actual source of the problem.
448
449In order to produce better error messages in these cases, the "``CHECK-LABEL:``"
450directive can be used. It is treated identically to a normal ``CHECK``
451directive except that FileCheck makes an additional assumption that a line
452matched by the directive cannot also be matched by any other check present in
453``match-filename``; this is intended to be used for lines containing labels or
454other unique identifiers. Conceptually, the presence of ``CHECK-LABEL`` divides
455the input stream into separate blocks, each of which is processed independently,
456preventing a ``CHECK:`` directive in one block matching a line in another block.
457If ``--enable-var-scope`` is in effect, all local variables are cleared at the
458beginning of the block.
459
460For example,
461
462.. code-block:: llvm
463
464  define %struct.C* @C_ctor_base(%struct.C* %this, i32 %x) {
465  entry:
466  ; CHECK-LABEL: C_ctor_base:
467  ; CHECK: mov [[SAVETHIS:r[0-9]+]], r0
468  ; CHECK: bl A_ctor_base
469  ; CHECK: mov r0, [[SAVETHIS]]
470    %0 = bitcast %struct.C* %this to %struct.A*
471    %call = tail call %struct.A* @A_ctor_base(%struct.A* %0)
472    %1 = bitcast %struct.C* %this to %struct.B*
473    %call2 = tail call %struct.B* @B_ctor_base(%struct.B* %1, i32 %x)
474    ret %struct.C* %this
475  }
476
477  define %struct.D* @D_ctor_base(%struct.D* %this, i32 %x) {
478  entry:
479  ; CHECK-LABEL: D_ctor_base:
480
481The use of ``CHECK-LABEL:`` directives in this case ensures that the three
482``CHECK:`` directives only accept lines corresponding to the body of the
483``@C_ctor_base`` function, even if the patterns match lines found later in
484the file. Furthermore, if one of these three ``CHECK:`` directives fail,
485FileCheck will recover by continuing to the next block, allowing multiple test
486failures to be detected in a single invocation.
487
488There is no requirement that ``CHECK-LABEL:`` directives contain strings that
489correspond to actual syntactic labels in a source or output language: they must
490simply uniquely match a single line in the file being verified.
491
492``CHECK-LABEL:`` directives cannot contain variable definitions or uses.
493
494FileCheck Pattern Matching Syntax
495~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
496
497All FileCheck directives take a pattern to match.
498For most uses of FileCheck, fixed string matching is perfectly sufficient.  For
499some things, a more flexible form of matching is desired.  To support this,
500FileCheck allows you to specify regular expressions in matching strings,
501surrounded by double braces: ``{{yourregex}}``. FileCheck implements a POSIX
502regular expression matcher; it supports Extended POSIX regular expressions
503(ERE). Because we want to use fixed string matching for a majority of what we
504do, FileCheck has been designed to support mixing and matching fixed string
505matching with regular expressions.  This allows you to write things like this:
506
507.. code-block:: llvm
508
509   ; CHECK: movhpd	{{[0-9]+}}(%esp), {{%xmm[0-7]}}
510
511In this case, any offset from the ESP register will be allowed, and any xmm
512register will be allowed.
513
514Because regular expressions are enclosed with double braces, they are
515visually distinct, and you don't need to use escape characters within the double
516braces like you would in C.  In the rare case that you want to match double
517braces explicitly from the input, you can use something ugly like
518``{{[{][{]}}`` as your pattern.
519
520FileCheck Variables
521~~~~~~~~~~~~~~~~~~~
522
523It is often useful to match a pattern and then verify that it occurs again
524later in the file.  For codegen tests, this can be useful to allow any register,
525but verify that that register is used consistently later.  To do this,
526:program:`FileCheck` allows named variables to be defined and substituted into
527patterns.  Here is a simple example:
528
529.. code-block:: llvm
530
531   ; CHECK: test5:
532   ; CHECK:    notw	[[REGISTER:%[a-z]+]]
533   ; CHECK:    andw	{{.*}}[[REGISTER]]
534
535The first check line matches a regex ``%[a-z]+`` and captures it into the
536variable ``REGISTER``.  The second line verifies that whatever is in
537``REGISTER`` occurs later in the file after an "``andw``".  :program:`FileCheck`
538variable references are always contained in ``[[ ]]`` pairs, and their names can
539be formed with the regex ``[a-zA-Z_][a-zA-Z0-9_]*``.  If a colon follows the name,
540then it is a definition of the variable; otherwise, it is a use.
541
542:program:`FileCheck` variables can be defined multiple times, and uses always
543get the latest value.  Variables can also be used later on the same line they
544were defined on. For example:
545
546.. code-block:: llvm
547
548    ; CHECK: op [[REG:r[0-9]+]], [[REG]]
549
550Can be useful if you want the operands of ``op`` to be the same register,
551and don't care exactly which register it is.
552
553If ``--enable-var-scope`` is in effect, variables with names that
554start with ``$`` are considered to be global. All others variables are
555local.  All local variables get undefined at the beginning of each
556CHECK-LABEL block. Global variables are not affected by CHECK-LABEL.
557This makes it easier to ensure that individual tests are not affected
558by variables set in preceding tests.
559
560FileCheck Expressions
561~~~~~~~~~~~~~~~~~~~~~
562
563Sometimes there's a need to verify output which refers line numbers of the
564match file, e.g. when testing compiler diagnostics.  This introduces a certain
565fragility of the match file structure, as "``CHECK:``" lines contain absolute
566line numbers in the same file, which have to be updated whenever line numbers
567change due to text addition or deletion.
568
569To support this case, FileCheck allows using ``[[@LINE]]``,
570``[[@LINE+<offset>]]``, ``[[@LINE-<offset>]]`` expressions in patterns. These
571expressions expand to a number of the line where a pattern is located (with an
572optional integer offset).
573
574This way match patterns can be put near the relevant test lines and include
575relative line number references, for example:
576
577.. code-block:: c++
578
579   // CHECK: test.cpp:[[@LINE+4]]:6: error: expected ';' after top level declarator
580   // CHECK-NEXT: {{^int a}}
581   // CHECK-NEXT: {{^     \^}}
582   // CHECK-NEXT: {{^     ;}}
583   int a
584
585Matching Newline Characters
586~~~~~~~~~~~~~~~~~~~~~~~~~~~
587
588To match newline characters in regular expressions the character class
589``[[:space:]]`` can be used. For example, the following pattern:
590
591.. code-block:: c++
592
593   // CHECK: DW_AT_location [DW_FORM_sec_offset] ([[DLOC:0x[0-9a-f]+]]){{[[:space:]].*}}"intd"
594
595matches output of the form (from llvm-dwarfdump):
596
597.. code-block:: text
598
599       DW_AT_location [DW_FORM_sec_offset]   (0x00000233)
600       DW_AT_name [DW_FORM_strp]  ( .debug_str[0x000000c9] = "intd")
601
602letting us set the :program:`FileCheck` variable ``DLOC`` to the desired value
603``0x00000233``, extracted from the line immediately preceding "``intd``".
604