1FileCheck - Flexible pattern matching file verifier
2===================================================
3
4SYNOPSIS
5--------
6
7:program:`FileCheck` *match-filename* [*--check-prefix=XXX*] [*--strict-whitespace*]
8
9DESCRIPTION
10-----------
11
12:program:`FileCheck` reads two files (one from standard input, and one
13specified on the command line) and uses one to verify the other.  This
14behavior is particularly useful for the testsuite, which wants to verify that
15the output of some tool (e.g. :program:`llc`) contains the expected information
16(for example, a movsd from esp or whatever is interesting).  This is similar to
17using :program:`grep`, but it is optimized for matching multiple different
18inputs in one file in a specific order.
19
20The ``match-filename`` file specifies the file that contains the patterns to
21match.  The file to verify is read from standard input unless the
22:option:`--input-file` option is used.
23
24OPTIONS
25-------
26
27Options are parsed from the environment variable ``FILECHECK_OPTS``
28and from the command line.
29
30.. option:: -help
31
32 Print a summary of command line options.
33
34.. option:: --check-prefix prefix
35
36 FileCheck searches the contents of ``match-filename`` for patterns to
37 match.  By default, these patterns are prefixed with "``CHECK:``".
38 If you'd like to use a different prefix (e.g. because the same input
39 file is checking multiple different tool or options), the
40 :option:`--check-prefix` argument allows you to specify one or more
41 prefixes to match. Multiple prefixes are useful for tests which might
42 change for different run options, but most lines remain the same.
43
44.. option:: --check-prefixes prefix1,prefix2,...
45
46 An alias of :option:`--check-prefix` that allows multiple prefixes to be
47 specified as a comma separated list.
48
49.. option:: --input-file filename
50
51  File to check (defaults to stdin).
52
53.. option:: --match-full-lines
54
55 By default, FileCheck allows matches of anywhere on a line. This
56 option will require all positive matches to cover an entire
57 line. Leading and trailing whitespace is ignored, unless
58 :option:`--strict-whitespace` is also specified. (Note: negative
59 matches from ``CHECK-NOT`` are not affected by this option!)
60
61 Passing this option is equivalent to inserting ``{{^ *}}`` or
62 ``{{^}}`` before, and ``{{ *$}}`` or ``{{$}}`` after every positive
63 check pattern.
64
65.. option:: --strict-whitespace
66
67 By default, FileCheck canonicalizes input horizontal whitespace (spaces and
68 tabs) which causes it to ignore these differences (a space will match a tab).
69 The :option:`--strict-whitespace` argument disables this behavior. End-of-line
70 sequences are canonicalized to UNIX-style ``\n`` in all modes.
71
72.. option:: --implicit-check-not check-pattern
73
74  Adds implicit negative checks for the specified patterns between positive
75  checks. The option allows writing stricter tests without stuffing them with
76  ``CHECK-NOT``\ s.
77
78  For example, "``--implicit-check-not warning:``" can be useful when testing
79  diagnostic messages from tools that don't have an option similar to ``clang
80  -verify``. With this option FileCheck will verify that input does not contain
81  warnings not covered by any ``CHECK:`` patterns.
82
83.. option:: --dump-input-on-failure
84
85  When the check fails, dump all of the original input.
86
87.. option:: --enable-var-scope
88
89  Enables scope for regex variables.
90
91  Variables with names that start with ``$`` are considered global and
92  remain set throughout the file.
93
94  All other variables get undefined after each encountered ``CHECK-LABEL``.
95
96.. option:: -D<VAR=VALUE>
97
98  Sets a filecheck variable ``VAR`` with value ``VALUE`` that can be used in
99  ``CHECK:`` lines.
100
101.. option:: -version
102
103 Show the version number of this program.
104
105.. option:: -v
106
107  Print directive pattern matches.
108
109.. option:: -vv
110
111  Print information helpful in diagnosing internal FileCheck issues, such as
112  discarded overlapping ``CHECK-DAG:`` matches, implicit EOF pattern matches,
113  and ``CHECK-NOT:`` patterns that do not have matches.  Implies ``-v``.
114
115.. option:: --allow-deprecated-dag-overlap
116
117  Enable overlapping among matches in a group of consecutive ``CHECK-DAG:``
118  directives.  This option is deprecated and is only provided for convenience
119  as old tests are migrated to the new non-overlapping ``CHECK-DAG:``
120  implementation.
121
122.. option:: --color
123
124  Use colors in output (autodetected by default).
125
126EXIT STATUS
127-----------
128
129If :program:`FileCheck` verifies that the file matches the expected contents,
130it exits with 0.  Otherwise, if not, or if an error occurs, it will exit with a
131non-zero value.
132
133TUTORIAL
134--------
135
136FileCheck is typically used from LLVM regression tests, being invoked on the RUN
137line of the test.  A simple example of using FileCheck from a RUN line looks
138like this:
139
140.. code-block:: llvm
141
142   ; RUN: llvm-as < %s | llc -march=x86-64 | FileCheck %s
143
144This syntax says to pipe the current file ("``%s``") into ``llvm-as``, pipe
145that into ``llc``, then pipe the output of ``llc`` into ``FileCheck``.  This
146means that FileCheck will be verifying its standard input (the llc output)
147against the filename argument specified (the original ``.ll`` file specified by
148"``%s``").  To see how this works, let's look at the rest of the ``.ll`` file
149(after the RUN line):
150
151.. code-block:: llvm
152
153   define void @sub1(i32* %p, i32 %v) {
154   entry:
155   ; CHECK: sub1:
156   ; CHECK: subl
157           %0 = tail call i32 @llvm.atomic.load.sub.i32.p0i32(i32* %p, i32 %v)
158           ret void
159   }
160
161   define void @inc4(i64* %p) {
162   entry:
163   ; CHECK: inc4:
164   ; CHECK: incq
165           %0 = tail call i64 @llvm.atomic.load.add.i64.p0i64(i64* %p, i64 1)
166           ret void
167   }
168
169Here you can see some "``CHECK:``" lines specified in comments.  Now you can
170see how the file is piped into ``llvm-as``, then ``llc``, and the machine code
171output is what we are verifying.  FileCheck checks the machine code output to
172verify that it matches what the "``CHECK:``" lines specify.
173
174The syntax of the "``CHECK:``" lines is very simple: they are fixed strings that
175must occur in order.  FileCheck defaults to ignoring horizontal whitespace
176differences (e.g. a space is allowed to match a tab) but otherwise, the contents
177of the "``CHECK:``" line is required to match some thing in the test file exactly.
178
179One nice thing about FileCheck (compared to grep) is that it allows merging
180test cases together into logical groups.  For example, because the test above
181is checking for the "``sub1:``" and "``inc4:``" labels, it will not match
182unless there is a "``subl``" in between those labels.  If it existed somewhere
183else in the file, that would not count: "``grep subl``" matches if "``subl``"
184exists anywhere in the file.
185
186The FileCheck -check-prefix option
187~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
188
189The FileCheck `-check-prefix` option allows multiple test
190configurations to be driven from one `.ll` file.  This is useful in many
191circumstances, for example, testing different architectural variants with
192:program:`llc`.  Here's a simple example:
193
194.. code-block:: llvm
195
196   ; RUN: llvm-as < %s | llc -mtriple=i686-apple-darwin9 -mattr=sse41 \
197   ; RUN:              | FileCheck %s -check-prefix=X32
198   ; RUN: llvm-as < %s | llc -mtriple=x86_64-apple-darwin9 -mattr=sse41 \
199   ; RUN:              | FileCheck %s -check-prefix=X64
200
201   define <4 x i32> @pinsrd_1(i32 %s, <4 x i32> %tmp) nounwind {
202           %tmp1 = insertelement <4 x i32>; %tmp, i32 %s, i32 1
203           ret <4 x i32> %tmp1
204   ; X32: pinsrd_1:
205   ; X32:    pinsrd $1, 4(%esp), %xmm0
206
207   ; X64: pinsrd_1:
208   ; X64:    pinsrd $1, %edi, %xmm0
209   }
210
211In this case, we're testing that we get the expected code generation with
212both 32-bit and 64-bit code generation.
213
214The "CHECK-NEXT:" directive
215~~~~~~~~~~~~~~~~~~~~~~~~~~~
216
217Sometimes you want to match lines and would like to verify that matches
218happen on exactly consecutive lines with no other lines in between them.  In
219this case, you can use "``CHECK:``" and "``CHECK-NEXT:``" directives to specify
220this.  If you specified a custom check prefix, just use "``<PREFIX>-NEXT:``".
221For example, something like this works as you'd expect:
222
223.. code-block:: llvm
224
225   define void @t2(<2 x double>* %r, <2 x double>* %A, double %B) {
226 	%tmp3 = load <2 x double>* %A, align 16
227 	%tmp7 = insertelement <2 x double> undef, double %B, i32 0
228 	%tmp9 = shufflevector <2 x double> %tmp3,
229                               <2 x double> %tmp7,
230                               <2 x i32> < i32 0, i32 2 >
231 	store <2 x double> %tmp9, <2 x double>* %r, align 16
232 	ret void
233
234   ; CHECK:          t2:
235   ; CHECK: 	        movl	8(%esp), %eax
236   ; CHECK-NEXT: 	movapd	(%eax), %xmm0
237   ; CHECK-NEXT: 	movhpd	12(%esp), %xmm0
238   ; CHECK-NEXT: 	movl	4(%esp), %eax
239   ; CHECK-NEXT: 	movapd	%xmm0, (%eax)
240   ; CHECK-NEXT: 	ret
241   }
242
243"``CHECK-NEXT:``" directives reject the input unless there is exactly one
244newline between it and the previous directive.  A "``CHECK-NEXT:``" cannot be
245the first directive in a file.
246
247The "CHECK-SAME:" directive
248~~~~~~~~~~~~~~~~~~~~~~~~~~~
249
250Sometimes you want to match lines and would like to verify that matches happen
251on the same line as the previous match.  In this case, you can use "``CHECK:``"
252and "``CHECK-SAME:``" directives to specify this.  If you specified a custom
253check prefix, just use "``<PREFIX>-SAME:``".
254
255"``CHECK-SAME:``" is particularly powerful in conjunction with "``CHECK-NOT:``"
256(described below).
257
258For example, the following works like you'd expect:
259
260.. code-block:: llvm
261
262   !0 = !DILocation(line: 5, scope: !1, inlinedAt: !2)
263
264   ; CHECK:       !DILocation(line: 5,
265   ; CHECK-NOT:               column:
266   ; CHECK-SAME:              scope: ![[SCOPE:[0-9]+]]
267
268"``CHECK-SAME:``" directives reject the input if there are any newlines between
269it and the previous directive.  A "``CHECK-SAME:``" cannot be the first
270directive in a file.
271
272The "CHECK-EMPTY:" directive
273~~~~~~~~~~~~~~~~~~~~~~~~~~~~
274
275If you need to check that the next line has nothing on it, not even whitespace,
276you can use the "``CHECK-EMPTY:``" directive.
277
278.. code-block:: llvm
279
280   declare void @foo()
281
282   declare void @bar()
283   ; CHECK: foo
284   ; CHECK-EMPTY:
285   ; CHECK-NEXT: bar
286
287Just like "``CHECK-NEXT:``" the directive will fail if there is more than one
288newline before it finds the next blank line, and it cannot be the first
289directive in a file.
290
291The "CHECK-NOT:" directive
292~~~~~~~~~~~~~~~~~~~~~~~~~~
293
294The "``CHECK-NOT:``" directive is used to verify that a string doesn't occur
295between two matches (or before the first match, or after the last match).  For
296example, to verify that a load is removed by a transformation, a test like this
297can be used:
298
299.. code-block:: llvm
300
301   define i8 @coerce_offset0(i32 %V, i32* %P) {
302     store i32 %V, i32* %P
303
304     %P2 = bitcast i32* %P to i8*
305     %P3 = getelementptr i8* %P2, i32 2
306
307     %A = load i8* %P3
308     ret i8 %A
309   ; CHECK: @coerce_offset0
310   ; CHECK-NOT: load
311   ; CHECK: ret i8
312   }
313
314The "CHECK-COUNT:" directive
315~~~~~~~~~~~~~~~~~~~~~~~~~~~~
316
317If you need to match multiple lines with the same pattern over and over again
318you can repeat a plain ``CHECK:`` as many times as needed. If that looks too
319boring you can instead use a counted check "``CHECK-COUNT-<num>:``", where
320``<num>`` is a positive decimal number. It will match the pattern exactly
321``<num>`` times, no more and no less. If you specified a custom check prefix,
322just use "``<PREFIX>-COUNT-<num>:``" for the same effect.
323Here is a simple example:
324
325.. code-block:: text
326
327   Loop at depth 1
328   Loop at depth 1
329   Loop at depth 1
330   Loop at depth 1
331     Loop at depth 2
332       Loop at depth 3
333
334   ; CHECK-COUNT-6: Loop at depth {{[0-9]+}}
335   ; CHECK-NOT:     Loop at depth {{[0-9]+}}
336
337The "CHECK-DAG:" directive
338~~~~~~~~~~~~~~~~~~~~~~~~~~
339
340If it's necessary to match strings that don't occur in a strictly sequential
341order, "``CHECK-DAG:``" could be used to verify them between two matches (or
342before the first match, or after the last match). For example, clang emits
343vtable globals in reverse order. Using ``CHECK-DAG:``, we can keep the checks
344in the natural order:
345
346.. code-block:: c++
347
348    // RUN: %clang_cc1 %s -emit-llvm -o - | FileCheck %s
349
350    struct Foo { virtual void method(); };
351    Foo f;  // emit vtable
352    // CHECK-DAG: @_ZTV3Foo =
353
354    struct Bar { virtual void method(); };
355    Bar b;
356    // CHECK-DAG: @_ZTV3Bar =
357
358``CHECK-NOT:`` directives could be mixed with ``CHECK-DAG:`` directives to
359exclude strings between the surrounding ``CHECK-DAG:`` directives. As a result,
360the surrounding ``CHECK-DAG:`` directives cannot be reordered, i.e. all
361occurrences matching ``CHECK-DAG:`` before ``CHECK-NOT:`` must not fall behind
362occurrences matching ``CHECK-DAG:`` after ``CHECK-NOT:``. For example,
363
364.. code-block:: llvm
365
366   ; CHECK-DAG: BEFORE
367   ; CHECK-NOT: NOT
368   ; CHECK-DAG: AFTER
369
370This case will reject input strings where ``BEFORE`` occurs after ``AFTER``.
371
372With captured variables, ``CHECK-DAG:`` is able to match valid topological
373orderings of a DAG with edges from the definition of a variable to its use.
374It's useful, e.g., when your test cases need to match different output
375sequences from the instruction scheduler. For example,
376
377.. code-block:: llvm
378
379   ; CHECK-DAG: add [[REG1:r[0-9]+]], r1, r2
380   ; CHECK-DAG: add [[REG2:r[0-9]+]], r3, r4
381   ; CHECK:     mul r5, [[REG1]], [[REG2]]
382
383In this case, any order of that two ``add`` instructions will be allowed.
384
385If you are defining `and` using variables in the same ``CHECK-DAG:`` block,
386be aware that the definition rule can match `after` its use.
387
388So, for instance, the code below will pass:
389
390.. code-block:: text
391
392  ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0]
393  ; CHECK-DAG: vmov.32 [[REG2]][1]
394  vmov.32 d0[1]
395  vmov.32 d0[0]
396
397While this other code, will not:
398
399.. code-block:: text
400
401  ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0]
402  ; CHECK-DAG: vmov.32 [[REG2]][1]
403  vmov.32 d1[1]
404  vmov.32 d0[0]
405
406While this can be very useful, it's also dangerous, because in the case of
407register sequence, you must have a strong order (read before write, copy before
408use, etc). If the definition your test is looking for doesn't match (because
409of a bug in the compiler), it may match further away from the use, and mask
410real bugs away.
411
412In those cases, to enforce the order, use a non-DAG directive between DAG-blocks.
413
414A ``CHECK-DAG:`` directive skips matches that overlap the matches of any
415preceding ``CHECK-DAG:`` directives in the same ``CHECK-DAG:`` block.  Not only
416is this non-overlapping behavior consistent with other directives, but it's
417also necessary to handle sets of non-unique strings or patterns.  For example,
418the following directives look for unordered log entries for two tasks in a
419parallel program, such as the OpenMP runtime:
420
421.. code-block:: text
422
423    // CHECK-DAG: [[THREAD_ID:[0-9]+]]: task_begin
424    // CHECK-DAG: [[THREAD_ID]]: task_end
425    //
426    // CHECK-DAG: [[THREAD_ID:[0-9]+]]: task_begin
427    // CHECK-DAG: [[THREAD_ID]]: task_end
428
429The second pair of directives is guaranteed not to match the same log entries
430as the first pair even though the patterns are identical and even if the text
431of the log entries is identical because the thread ID manages to be reused.
432
433The "CHECK-LABEL:" directive
434~~~~~~~~~~~~~~~~~~~~~~~~~~~~
435
436Sometimes in a file containing multiple tests divided into logical blocks, one
437or more ``CHECK:`` directives may inadvertently succeed by matching lines in a
438later block. While an error will usually eventually be generated, the check
439flagged as causing the error may not actually bear any relationship to the
440actual source of the problem.
441
442In order to produce better error messages in these cases, the "``CHECK-LABEL:``"
443directive can be used. It is treated identically to a normal ``CHECK``
444directive except that FileCheck makes an additional assumption that a line
445matched by the directive cannot also be matched by any other check present in
446``match-filename``; this is intended to be used for lines containing labels or
447other unique identifiers. Conceptually, the presence of ``CHECK-LABEL`` divides
448the input stream into separate blocks, each of which is processed independently,
449preventing a ``CHECK:`` directive in one block matching a line in another block.
450If ``--enable-var-scope`` is in effect, all local variables are cleared at the
451beginning of the block.
452
453For example,
454
455.. code-block:: llvm
456
457  define %struct.C* @C_ctor_base(%struct.C* %this, i32 %x) {
458  entry:
459  ; CHECK-LABEL: C_ctor_base:
460  ; CHECK: mov [[SAVETHIS:r[0-9]+]], r0
461  ; CHECK: bl A_ctor_base
462  ; CHECK: mov r0, [[SAVETHIS]]
463    %0 = bitcast %struct.C* %this to %struct.A*
464    %call = tail call %struct.A* @A_ctor_base(%struct.A* %0)
465    %1 = bitcast %struct.C* %this to %struct.B*
466    %call2 = tail call %struct.B* @B_ctor_base(%struct.B* %1, i32 %x)
467    ret %struct.C* %this
468  }
469
470  define %struct.D* @D_ctor_base(%struct.D* %this, i32 %x) {
471  entry:
472  ; CHECK-LABEL: D_ctor_base:
473
474The use of ``CHECK-LABEL:`` directives in this case ensures that the three
475``CHECK:`` directives only accept lines corresponding to the body of the
476``@C_ctor_base`` function, even if the patterns match lines found later in
477the file. Furthermore, if one of these three ``CHECK:`` directives fail,
478FileCheck will recover by continuing to the next block, allowing multiple test
479failures to be detected in a single invocation.
480
481There is no requirement that ``CHECK-LABEL:`` directives contain strings that
482correspond to actual syntactic labels in a source or output language: they must
483simply uniquely match a single line in the file being verified.
484
485``CHECK-LABEL:`` directives cannot contain variable definitions or uses.
486
487FileCheck Pattern Matching Syntax
488~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
489
490All FileCheck directives take a pattern to match.
491For most uses of FileCheck, fixed string matching is perfectly sufficient.  For
492some things, a more flexible form of matching is desired.  To support this,
493FileCheck allows you to specify regular expressions in matching strings,
494surrounded by double braces: ``{{yourregex}}``. FileCheck implements a POSIX
495regular expression matcher; it supports Extended POSIX regular expressions
496(ERE). Because we want to use fixed string matching for a majority of what we
497do, FileCheck has been designed to support mixing and matching fixed string
498matching with regular expressions.  This allows you to write things like this:
499
500.. code-block:: llvm
501
502   ; CHECK: movhpd	{{[0-9]+}}(%esp), {{%xmm[0-7]}}
503
504In this case, any offset from the ESP register will be allowed, and any xmm
505register will be allowed.
506
507Because regular expressions are enclosed with double braces, they are
508visually distinct, and you don't need to use escape characters within the double
509braces like you would in C.  In the rare case that you want to match double
510braces explicitly from the input, you can use something ugly like
511``{{[{][{]}}`` as your pattern.
512
513FileCheck Variables
514~~~~~~~~~~~~~~~~~~~
515
516It is often useful to match a pattern and then verify that it occurs again
517later in the file.  For codegen tests, this can be useful to allow any register,
518but verify that that register is used consistently later.  To do this,
519:program:`FileCheck` allows named variables to be defined and substituted into
520patterns.  Here is a simple example:
521
522.. code-block:: llvm
523
524   ; CHECK: test5:
525   ; CHECK:    notw	[[REGISTER:%[a-z]+]]
526   ; CHECK:    andw	{{.*}}[[REGISTER]]
527
528The first check line matches a regex ``%[a-z]+`` and captures it into the
529variable ``REGISTER``.  The second line verifies that whatever is in
530``REGISTER`` occurs later in the file after an "``andw``".  :program:`FileCheck`
531variable references are always contained in ``[[ ]]`` pairs, and their names can
532be formed with the regex ``[a-zA-Z_][a-zA-Z0-9_]*``.  If a colon follows the name,
533then it is a definition of the variable; otherwise, it is a use.
534
535:program:`FileCheck` variables can be defined multiple times, and uses always
536get the latest value.  Variables can also be used later on the same line they
537were defined on. For example:
538
539.. code-block:: llvm
540
541    ; CHECK: op [[REG:r[0-9]+]], [[REG]]
542
543Can be useful if you want the operands of ``op`` to be the same register,
544and don't care exactly which register it is.
545
546If ``--enable-var-scope`` is in effect, variables with names that
547start with ``$`` are considered to be global. All others variables are
548local.  All local variables get undefined at the beginning of each
549CHECK-LABEL block. Global variables are not affected by CHECK-LABEL.
550This makes it easier to ensure that individual tests are not affected
551by variables set in preceding tests.
552
553FileCheck Expressions
554~~~~~~~~~~~~~~~~~~~~~
555
556Sometimes there's a need to verify output which refers line numbers of the
557match file, e.g. when testing compiler diagnostics.  This introduces a certain
558fragility of the match file structure, as "``CHECK:``" lines contain absolute
559line numbers in the same file, which have to be updated whenever line numbers
560change due to text addition or deletion.
561
562To support this case, FileCheck allows using ``[[@LINE]]``,
563``[[@LINE+<offset>]]``, ``[[@LINE-<offset>]]`` expressions in patterns. These
564expressions expand to a number of the line where a pattern is located (with an
565optional integer offset).
566
567This way match patterns can be put near the relevant test lines and include
568relative line number references, for example:
569
570.. code-block:: c++
571
572   // CHECK: test.cpp:[[@LINE+4]]:6: error: expected ';' after top level declarator
573   // CHECK-NEXT: {{^int a}}
574   // CHECK-NEXT: {{^     \^}}
575   // CHECK-NEXT: {{^     ;}}
576   int a
577
578Matching Newline Characters
579~~~~~~~~~~~~~~~~~~~~~~~~~~~
580
581To match newline characters in regular expressions the character class
582``[[:space:]]`` can be used. For example, the following pattern:
583
584.. code-block:: c++
585
586   // CHECK: DW_AT_location [DW_FORM_sec_offset] ([[DLOC:0x[0-9a-f]+]]){{[[:space:]].*}}"intd"
587
588matches output of the form (from llvm-dwarfdump):
589
590.. code-block:: text
591
592       DW_AT_location [DW_FORM_sec_offset]   (0x00000233)
593       DW_AT_name [DW_FORM_strp]  ( .debug_str[0x000000c9] = "intd")
594
595letting us set the :program:`FileCheck` variable ``DLOC`` to the desired value
596``0x00000233``, extracted from the line immediately preceding "``intd``".
597