1FileCheck - Flexible pattern matching file verifier
2===================================================
3
4SYNOPSIS
5--------
6
7:program:`FileCheck` *match-filename* [*--check-prefix=XXX*] [*--strict-whitespace*]
8
9DESCRIPTION
10-----------
11
12:program:`FileCheck` reads two files (one from standard input, and one
13specified on the command line) and uses one to verify the other.  This
14behavior is particularly useful for the testsuite, which wants to verify that
15the output of some tool (e.g. :program:`llc`) contains the expected information
16(for example, a movsd from esp or whatever is interesting).  This is similar to
17using :program:`grep`, but it is optimized for matching multiple different
18inputs in one file in a specific order.
19
20The ``match-filename`` file specifies the file that contains the patterns to
21match.  The file to verify is read from standard input unless the
22:option:`--input-file` option is used.
23
24OPTIONS
25-------
26
27.. option:: -help
28
29 Print a summary of command line options.
30
31.. option:: --check-prefix prefix
32
33 FileCheck searches the contents of ``match-filename`` for patterns to
34 match.  By default, these patterns are prefixed with "``CHECK:``".
35 If you'd like to use a different prefix (e.g. because the same input
36 file is checking multiple different tool or options), the
37 :option:`--check-prefix` argument allows you to specify one or more
38 prefixes to match. Multiple prefixes are useful for tests which might
39 change for different run options, but most lines remain the same.
40
41.. option:: --check-prefixes prefix1,prefix2,...
42
43 An alias of :option:`--check-prefix` that allows multiple prefixes to be
44 specified as a comma separated list.
45
46.. option:: --input-file filename
47
48  File to check (defaults to stdin).
49
50.. option:: --match-full-lines
51
52 By default, FileCheck allows matches of anywhere on a line. This
53 option will require all positive matches to cover an entire
54 line. Leading and trailing whitespace is ignored, unless
55 :option:`--strict-whitespace` is also specified. (Note: negative
56 matches from ``CHECK-NOT`` are not affected by this option!)
57
58 Passing this option is equivalent to inserting ``{{^ *}}`` or
59 ``{{^}}`` before, and ``{{ *$}}`` or ``{{$}}`` after every positive
60 check pattern.
61
62.. option:: --strict-whitespace
63
64 By default, FileCheck canonicalizes input horizontal whitespace (spaces and
65 tabs) which causes it to ignore these differences (a space will match a tab).
66 The :option:`--strict-whitespace` argument disables this behavior. End-of-line
67 sequences are canonicalized to UNIX-style ``\n`` in all modes.
68
69.. option:: --implicit-check-not check-pattern
70
71  Adds implicit negative checks for the specified patterns between positive
72  checks. The option allows writing stricter tests without stuffing them with
73  ``CHECK-NOT``\ s.
74
75  For example, "``--implicit-check-not warning:``" can be useful when testing
76  diagnostic messages from tools that don't have an option similar to ``clang
77  -verify``. With this option FileCheck will verify that input does not contain
78  warnings not covered by any ``CHECK:`` patterns.
79
80.. option:: --dump-input-on-failure
81
82  When the check fails, dump all of the original input.
83
84.. option:: --enable-var-scope
85
86  Enables scope for regex variables.
87
88  Variables with names that start with ``$`` are considered global and
89  remain set throughout the file.
90
91  All other variables get undefined after each encountered ``CHECK-LABEL``.
92
93.. option:: -D<VAR=VALUE>
94
95  Sets a filecheck variable ``VAR`` with value ``VALUE`` that can be used in
96  ``CHECK:`` lines.
97
98.. option:: -version
99
100 Show the version number of this program.
101
102.. option:: -v
103
104  Print directive pattern matches.
105
106.. option:: -vv
107
108  Print information helpful in diagnosing internal FileCheck issues, such as
109  discarded overlapping ``CHECK-DAG:`` matches, implicit EOF pattern matches,
110  and ``CHECK-NOT:`` patterns that do not have matches.  Implies ``-v``.
111
112.. option:: --allow-deprecated-dag-overlap
113
114  Enable overlapping among matches in a group of consecutive ``CHECK-DAG:``
115  directives.  This option is deprecated and is only provided for convenience
116  as old tests are migrated to the new non-overlapping ``CHECK-DAG:``
117  implementation.
118
119.. option:: --color
120
121  Use colors in output (autodetected by default).
122
123EXIT STATUS
124-----------
125
126If :program:`FileCheck` verifies that the file matches the expected contents,
127it exits with 0.  Otherwise, if not, or if an error occurs, it will exit with a
128non-zero value.
129
130TUTORIAL
131--------
132
133FileCheck is typically used from LLVM regression tests, being invoked on the RUN
134line of the test.  A simple example of using FileCheck from a RUN line looks
135like this:
136
137.. code-block:: llvm
138
139   ; RUN: llvm-as < %s | llc -march=x86-64 | FileCheck %s
140
141This syntax says to pipe the current file ("``%s``") into ``llvm-as``, pipe
142that into ``llc``, then pipe the output of ``llc`` into ``FileCheck``.  This
143means that FileCheck will be verifying its standard input (the llc output)
144against the filename argument specified (the original ``.ll`` file specified by
145"``%s``").  To see how this works, let's look at the rest of the ``.ll`` file
146(after the RUN line):
147
148.. code-block:: llvm
149
150   define void @sub1(i32* %p, i32 %v) {
151   entry:
152   ; CHECK: sub1:
153   ; CHECK: subl
154           %0 = tail call i32 @llvm.atomic.load.sub.i32.p0i32(i32* %p, i32 %v)
155           ret void
156   }
157
158   define void @inc4(i64* %p) {
159   entry:
160   ; CHECK: inc4:
161   ; CHECK: incq
162           %0 = tail call i64 @llvm.atomic.load.add.i64.p0i64(i64* %p, i64 1)
163           ret void
164   }
165
166Here you can see some "``CHECK:``" lines specified in comments.  Now you can
167see how the file is piped into ``llvm-as``, then ``llc``, and the machine code
168output is what we are verifying.  FileCheck checks the machine code output to
169verify that it matches what the "``CHECK:``" lines specify.
170
171The syntax of the "``CHECK:``" lines is very simple: they are fixed strings that
172must occur in order.  FileCheck defaults to ignoring horizontal whitespace
173differences (e.g. a space is allowed to match a tab) but otherwise, the contents
174of the "``CHECK:``" line is required to match some thing in the test file exactly.
175
176One nice thing about FileCheck (compared to grep) is that it allows merging
177test cases together into logical groups.  For example, because the test above
178is checking for the "``sub1:``" and "``inc4:``" labels, it will not match
179unless there is a "``subl``" in between those labels.  If it existed somewhere
180else in the file, that would not count: "``grep subl``" matches if "``subl``"
181exists anywhere in the file.
182
183The FileCheck -check-prefix option
184~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
185
186The FileCheck `-check-prefix` option allows multiple test
187configurations to be driven from one `.ll` file.  This is useful in many
188circumstances, for example, testing different architectural variants with
189:program:`llc`.  Here's a simple example:
190
191.. code-block:: llvm
192
193   ; RUN: llvm-as < %s | llc -mtriple=i686-apple-darwin9 -mattr=sse41 \
194   ; RUN:              | FileCheck %s -check-prefix=X32
195   ; RUN: llvm-as < %s | llc -mtriple=x86_64-apple-darwin9 -mattr=sse41 \
196   ; RUN:              | FileCheck %s -check-prefix=X64
197
198   define <4 x i32> @pinsrd_1(i32 %s, <4 x i32> %tmp) nounwind {
199           %tmp1 = insertelement <4 x i32>; %tmp, i32 %s, i32 1
200           ret <4 x i32> %tmp1
201   ; X32: pinsrd_1:
202   ; X32:    pinsrd $1, 4(%esp), %xmm0
203
204   ; X64: pinsrd_1:
205   ; X64:    pinsrd $1, %edi, %xmm0
206   }
207
208In this case, we're testing that we get the expected code generation with
209both 32-bit and 64-bit code generation.
210
211The "CHECK-NEXT:" directive
212~~~~~~~~~~~~~~~~~~~~~~~~~~~
213
214Sometimes you want to match lines and would like to verify that matches
215happen on exactly consecutive lines with no other lines in between them.  In
216this case, you can use "``CHECK:``" and "``CHECK-NEXT:``" directives to specify
217this.  If you specified a custom check prefix, just use "``<PREFIX>-NEXT:``".
218For example, something like this works as you'd expect:
219
220.. code-block:: llvm
221
222   define void @t2(<2 x double>* %r, <2 x double>* %A, double %B) {
223 	%tmp3 = load <2 x double>* %A, align 16
224 	%tmp7 = insertelement <2 x double> undef, double %B, i32 0
225 	%tmp9 = shufflevector <2 x double> %tmp3,
226                               <2 x double> %tmp7,
227                               <2 x i32> < i32 0, i32 2 >
228 	store <2 x double> %tmp9, <2 x double>* %r, align 16
229 	ret void
230
231   ; CHECK:          t2:
232   ; CHECK: 	        movl	8(%esp), %eax
233   ; CHECK-NEXT: 	movapd	(%eax), %xmm0
234   ; CHECK-NEXT: 	movhpd	12(%esp), %xmm0
235   ; CHECK-NEXT: 	movl	4(%esp), %eax
236   ; CHECK-NEXT: 	movapd	%xmm0, (%eax)
237   ; CHECK-NEXT: 	ret
238   }
239
240"``CHECK-NEXT:``" directives reject the input unless there is exactly one
241newline between it and the previous directive.  A "``CHECK-NEXT:``" cannot be
242the first directive in a file.
243
244The "CHECK-SAME:" directive
245~~~~~~~~~~~~~~~~~~~~~~~~~~~
246
247Sometimes you want to match lines and would like to verify that matches happen
248on the same line as the previous match.  In this case, you can use "``CHECK:``"
249and "``CHECK-SAME:``" directives to specify this.  If you specified a custom
250check prefix, just use "``<PREFIX>-SAME:``".
251
252"``CHECK-SAME:``" is particularly powerful in conjunction with "``CHECK-NOT:``"
253(described below).
254
255For example, the following works like you'd expect:
256
257.. code-block:: llvm
258
259   !0 = !DILocation(line: 5, scope: !1, inlinedAt: !2)
260
261   ; CHECK:       !DILocation(line: 5,
262   ; CHECK-NOT:               column:
263   ; CHECK-SAME:              scope: ![[SCOPE:[0-9]+]]
264
265"``CHECK-SAME:``" directives reject the input if there are any newlines between
266it and the previous directive.  A "``CHECK-SAME:``" cannot be the first
267directive in a file.
268
269The "CHECK-EMPTY:" directive
270~~~~~~~~~~~~~~~~~~~~~~~~~~~~
271
272If you need to check that the next line has nothing on it, not even whitespace,
273you can use the "``CHECK-EMPTY:``" directive.
274
275.. code-block:: llvm
276
277   declare void @foo()
278
279   declare void @bar()
280   ; CHECK: foo
281   ; CHECK-EMPTY:
282   ; CHECK-NEXT: bar
283
284Just like "``CHECK-NEXT:``" the directive will fail if there is more than one
285newline before it finds the next blank line, and it cannot be the first
286directive in a file.
287
288The "CHECK-NOT:" directive
289~~~~~~~~~~~~~~~~~~~~~~~~~~
290
291The "``CHECK-NOT:``" directive is used to verify that a string doesn't occur
292between two matches (or before the first match, or after the last match).  For
293example, to verify that a load is removed by a transformation, a test like this
294can be used:
295
296.. code-block:: llvm
297
298   define i8 @coerce_offset0(i32 %V, i32* %P) {
299     store i32 %V, i32* %P
300
301     %P2 = bitcast i32* %P to i8*
302     %P3 = getelementptr i8* %P2, i32 2
303
304     %A = load i8* %P3
305     ret i8 %A
306   ; CHECK: @coerce_offset0
307   ; CHECK-NOT: load
308   ; CHECK: ret i8
309   }
310
311The "CHECK-DAG:" directive
312~~~~~~~~~~~~~~~~~~~~~~~~~~
313
314If it's necessary to match strings that don't occur in a strictly sequential
315order, "``CHECK-DAG:``" could be used to verify them between two matches (or
316before the first match, or after the last match). For example, clang emits
317vtable globals in reverse order. Using ``CHECK-DAG:``, we can keep the checks
318in the natural order:
319
320.. code-block:: c++
321
322    // RUN: %clang_cc1 %s -emit-llvm -o - | FileCheck %s
323
324    struct Foo { virtual void method(); };
325    Foo f;  // emit vtable
326    // CHECK-DAG: @_ZTV3Foo =
327
328    struct Bar { virtual void method(); };
329    Bar b;
330    // CHECK-DAG: @_ZTV3Bar =
331
332``CHECK-NOT:`` directives could be mixed with ``CHECK-DAG:`` directives to
333exclude strings between the surrounding ``CHECK-DAG:`` directives. As a result,
334the surrounding ``CHECK-DAG:`` directives cannot be reordered, i.e. all
335occurrences matching ``CHECK-DAG:`` before ``CHECK-NOT:`` must not fall behind
336occurrences matching ``CHECK-DAG:`` after ``CHECK-NOT:``. For example,
337
338.. code-block:: llvm
339
340   ; CHECK-DAG: BEFORE
341   ; CHECK-NOT: NOT
342   ; CHECK-DAG: AFTER
343
344This case will reject input strings where ``BEFORE`` occurs after ``AFTER``.
345
346With captured variables, ``CHECK-DAG:`` is able to match valid topological
347orderings of a DAG with edges from the definition of a variable to its use.
348It's useful, e.g., when your test cases need to match different output
349sequences from the instruction scheduler. For example,
350
351.. code-block:: llvm
352
353   ; CHECK-DAG: add [[REG1:r[0-9]+]], r1, r2
354   ; CHECK-DAG: add [[REG2:r[0-9]+]], r3, r4
355   ; CHECK:     mul r5, [[REG1]], [[REG2]]
356
357In this case, any order of that two ``add`` instructions will be allowed.
358
359If you are defining `and` using variables in the same ``CHECK-DAG:`` block,
360be aware that the definition rule can match `after` its use.
361
362So, for instance, the code below will pass:
363
364.. code-block:: text
365
366  ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0]
367  ; CHECK-DAG: vmov.32 [[REG2]][1]
368  vmov.32 d0[1]
369  vmov.32 d0[0]
370
371While this other code, will not:
372
373.. code-block:: text
374
375  ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0]
376  ; CHECK-DAG: vmov.32 [[REG2]][1]
377  vmov.32 d1[1]
378  vmov.32 d0[0]
379
380While this can be very useful, it's also dangerous, because in the case of
381register sequence, you must have a strong order (read before write, copy before
382use, etc). If the definition your test is looking for doesn't match (because
383of a bug in the compiler), it may match further away from the use, and mask
384real bugs away.
385
386In those cases, to enforce the order, use a non-DAG directive between DAG-blocks.
387
388A ``CHECK-DAG:`` directive skips matches that overlap the matches of any
389preceding ``CHECK-DAG:`` directives in the same ``CHECK-DAG:`` block.  Not only
390is this non-overlapping behavior consistent with other directives, but it's
391also necessary to handle sets of non-unique strings or patterns.  For example,
392the following directives look for unordered log entries for two tasks in a
393parallel program, such as the OpenMP runtime:
394
395.. code-block:: text
396
397    // CHECK-DAG: [[THREAD_ID:[0-9]+]]: task_begin
398    // CHECK-DAG: [[THREAD_ID]]: task_end
399    //
400    // CHECK-DAG: [[THREAD_ID:[0-9]+]]: task_begin
401    // CHECK-DAG: [[THREAD_ID]]: task_end
402
403The second pair of directives is guaranteed not to match the same log entries
404as the first pair even though the patterns are identical and even if the text
405of the log entries is identical because the thread ID manages to be reused.
406
407The "CHECK-LABEL:" directive
408~~~~~~~~~~~~~~~~~~~~~~~~~~~~
409
410Sometimes in a file containing multiple tests divided into logical blocks, one
411or more ``CHECK:`` directives may inadvertently succeed by matching lines in a
412later block. While an error will usually eventually be generated, the check
413flagged as causing the error may not actually bear any relationship to the
414actual source of the problem.
415
416In order to produce better error messages in these cases, the "``CHECK-LABEL:``"
417directive can be used. It is treated identically to a normal ``CHECK``
418directive except that FileCheck makes an additional assumption that a line
419matched by the directive cannot also be matched by any other check present in
420``match-filename``; this is intended to be used for lines containing labels or
421other unique identifiers. Conceptually, the presence of ``CHECK-LABEL`` divides
422the input stream into separate blocks, each of which is processed independently,
423preventing a ``CHECK:`` directive in one block matching a line in another block.
424If ``--enable-var-scope`` is in effect, all local variables are cleared at the
425beginning of the block.
426
427For example,
428
429.. code-block:: llvm
430
431  define %struct.C* @C_ctor_base(%struct.C* %this, i32 %x) {
432  entry:
433  ; CHECK-LABEL: C_ctor_base:
434  ; CHECK: mov [[SAVETHIS:r[0-9]+]], r0
435  ; CHECK: bl A_ctor_base
436  ; CHECK: mov r0, [[SAVETHIS]]
437    %0 = bitcast %struct.C* %this to %struct.A*
438    %call = tail call %struct.A* @A_ctor_base(%struct.A* %0)
439    %1 = bitcast %struct.C* %this to %struct.B*
440    %call2 = tail call %struct.B* @B_ctor_base(%struct.B* %1, i32 %x)
441    ret %struct.C* %this
442  }
443
444  define %struct.D* @D_ctor_base(%struct.D* %this, i32 %x) {
445  entry:
446  ; CHECK-LABEL: D_ctor_base:
447
448The use of ``CHECK-LABEL:`` directives in this case ensures that the three
449``CHECK:`` directives only accept lines corresponding to the body of the
450``@C_ctor_base`` function, even if the patterns match lines found later in
451the file. Furthermore, if one of these three ``CHECK:`` directives fail,
452FileCheck will recover by continuing to the next block, allowing multiple test
453failures to be detected in a single invocation.
454
455There is no requirement that ``CHECK-LABEL:`` directives contain strings that
456correspond to actual syntactic labels in a source or output language: they must
457simply uniquely match a single line in the file being verified.
458
459``CHECK-LABEL:`` directives cannot contain variable definitions or uses.
460
461FileCheck Pattern Matching Syntax
462~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
463
464All FileCheck directives take a pattern to match.
465For most uses of FileCheck, fixed string matching is perfectly sufficient.  For
466some things, a more flexible form of matching is desired.  To support this,
467FileCheck allows you to specify regular expressions in matching strings,
468surrounded by double braces: ``{{yourregex}}``. FileCheck implements a POSIX
469regular expression matcher; it supports Extended POSIX regular expressions
470(ERE). Because we want to use fixed string matching for a majority of what we
471do, FileCheck has been designed to support mixing and matching fixed string
472matching with regular expressions.  This allows you to write things like this:
473
474.. code-block:: llvm
475
476   ; CHECK: movhpd	{{[0-9]+}}(%esp), {{%xmm[0-7]}}
477
478In this case, any offset from the ESP register will be allowed, and any xmm
479register will be allowed.
480
481Because regular expressions are enclosed with double braces, they are
482visually distinct, and you don't need to use escape characters within the double
483braces like you would in C.  In the rare case that you want to match double
484braces explicitly from the input, you can use something ugly like
485``{{[{][{]}}`` as your pattern.
486
487FileCheck Variables
488~~~~~~~~~~~~~~~~~~~
489
490It is often useful to match a pattern and then verify that it occurs again
491later in the file.  For codegen tests, this can be useful to allow any register,
492but verify that that register is used consistently later.  To do this,
493:program:`FileCheck` allows named variables to be defined and substituted into
494patterns.  Here is a simple example:
495
496.. code-block:: llvm
497
498   ; CHECK: test5:
499   ; CHECK:    notw	[[REGISTER:%[a-z]+]]
500   ; CHECK:    andw	{{.*}}[[REGISTER]]
501
502The first check line matches a regex ``%[a-z]+`` and captures it into the
503variable ``REGISTER``.  The second line verifies that whatever is in
504``REGISTER`` occurs later in the file after an "``andw``".  :program:`FileCheck`
505variable references are always contained in ``[[ ]]`` pairs, and their names can
506be formed with the regex ``[a-zA-Z_][a-zA-Z0-9_]*``.  If a colon follows the name,
507then it is a definition of the variable; otherwise, it is a use.
508
509:program:`FileCheck` variables can be defined multiple times, and uses always
510get the latest value.  Variables can also be used later on the same line they
511were defined on. For example:
512
513.. code-block:: llvm
514
515    ; CHECK: op [[REG:r[0-9]+]], [[REG]]
516
517Can be useful if you want the operands of ``op`` to be the same register,
518and don't care exactly which register it is.
519
520If ``--enable-var-scope`` is in effect, variables with names that
521start with ``$`` are considered to be global. All others variables are
522local.  All local variables get undefined at the beginning of each
523CHECK-LABEL block. Global variables are not affected by CHECK-LABEL.
524This makes it easier to ensure that individual tests are not affected
525by variables set in preceding tests.
526
527FileCheck Expressions
528~~~~~~~~~~~~~~~~~~~~~
529
530Sometimes there's a need to verify output which refers line numbers of the
531match file, e.g. when testing compiler diagnostics.  This introduces a certain
532fragility of the match file structure, as "``CHECK:``" lines contain absolute
533line numbers in the same file, which have to be updated whenever line numbers
534change due to text addition or deletion.
535
536To support this case, FileCheck allows using ``[[@LINE]]``,
537``[[@LINE+<offset>]]``, ``[[@LINE-<offset>]]`` expressions in patterns. These
538expressions expand to a number of the line where a pattern is located (with an
539optional integer offset).
540
541This way match patterns can be put near the relevant test lines and include
542relative line number references, for example:
543
544.. code-block:: c++
545
546   // CHECK: test.cpp:[[@LINE+4]]:6: error: expected ';' after top level declarator
547   // CHECK-NEXT: {{^int a}}
548   // CHECK-NEXT: {{^     \^}}
549   // CHECK-NEXT: {{^     ;}}
550   int a
551
552Matching Newline Characters
553~~~~~~~~~~~~~~~~~~~~~~~~~~~
554
555To match newline characters in regular expressions the character class
556``[[:space:]]`` can be used. For example, the following pattern:
557
558.. code-block:: c++
559
560   // CHECK: DW_AT_location [DW_FORM_sec_offset] ([[DLOC:0x[0-9a-f]+]]){{[[:space:]].*}}"intd"
561
562matches output of the form (from llvm-dwarfdump):
563
564.. code-block:: text
565
566       DW_AT_location [DW_FORM_sec_offset]   (0x00000233)
567       DW_AT_name [DW_FORM_strp]  ( .debug_str[0x000000c9] = "intd")
568
569letting us set the :program:`FileCheck` variable ``DLOC`` to the desired value
570``0x00000233``, extracted from the line immediately preceding "``intd``".
571