1Variable Formatting 2=================== 3 4.. contents:: 5 :local: 6 7LLDB has a data formatters subsystem that allows users to define custom display 8options for their variables. 9 10Usually, when you type ``frame variable`` or run some expression LLDB will 11automatically choose the way to display your results on a per-type basis, as in 12the following example: 13 14:: 15 16 (lldb) frame variable 17 (uint8_t) x = 'a' 18 (intptr_t) y = 124752287 19 20Note: ``frame variable`` without additional arguments prints the list of 21variables of the current frame. 22 23However, in certain cases, you may want to associate a different style to the 24display for certain datatypes. To do so, you need to give hints to the debugger 25as to how variables should be displayed. The LLDB type command allows you to do 26just that. 27 28Using it you can change your visualization to look like this: 29 30:: 31 32 (lldb) frame variable 33 (uint8_t) x = chr='a' dec=65 hex=0x41 34 (intptr_t) y = 0x76f919f 35 36In addition, some data structures can encode their data in a way that is not 37easily readable to the user, in which case a data formatter can be used to 38show the data in a human readable way. For example, without a formatter, 39printing a ``std::deque<int>`` with the elements ``{2, 3, 4, 5, 6}`` would 40result in something like: 41 42:: 43 44 (lldb) frame variable a_deque 45 (std::deque<Foo, std::allocator<int> >) $0 = { 46 std::_Deque_base<Foo, std::allocator<int> > = { 47 _M_impl = { 48 _M_map = 0x000000000062ceb0 49 _M_map_size = 8 50 _M_start = { 51 _M_cur = 0x000000000062cf00 52 _M_first = 0x000000000062cf00 53 _M_last = 0x000000000062d2f4 54 _M_node = 0x000000000062cec8 55 } 56 _M_finish = { 57 _M_cur = 0x000000000062d300 58 _M_first = 0x000000000062d300 59 _M_last = 0x000000000062d6f4 60 _M_node = 0x000000000062ced0 61 } 62 } 63 } 64 } 65 66which is very hard to make sense of. 67 68Note: ``frame variable <var>`` prints out the variable ``<var>`` in the current 69frame. 70 71On the other hand, a proper formatter is able to produce the following output: 72 73:: 74 75 (lldb) frame variable a_deque 76 (std::deque<Foo, std::allocator<int> >) $0 = size=5 { 77 [0] = 2 78 [1] = 3 79 [2] = 4 80 [3] = 5 81 [4] = 6 82 } 83 84which is what the user would expect from a good debugger. 85 86Note: you can also use ``v <var>`` instead of ``frame variable <var>``. 87 88It's worth mentioning that the ``size=5`` string is produced by a summary 89provider and the list of children is produced by a synthetic child provider. 90More information about these providers is available later in this document. 91 92 93There are several features related to data visualization: formats, summaries, 94filters, synthetic children. 95 96To reflect this, the type command has five subcommands: 97 98:: 99 100 type format 101 type summary 102 type filter 103 type synthetic 104 type category 105 106These commands are meant to bind printing options to types. When variables are 107printed, LLDB will first check if custom printing options have been associated 108to a variable's type and, if so, use them instead of picking the default 109choices. 110 111Each of the commands (except ``type category``) has four subcommands available: 112 113- ``add``: associates a new printing option to one or more types 114- ``delete``: deletes an existing association 115- ``list``: provides a listing of all associations 116- ``clear``: deletes all associations 117 118Type Format 119----------- 120 121Type formats enable you to quickly override the default format for displaying 122primitive types (the usual basic C/C++/ObjC types: int, float, char, ...). 123 124If for some reason you want all int variables in your program to print out as 125hex, you can add a format to the int type. 126 127This is done by typing 128 129:: 130 131 (lldb) type format add --format hex int 132 133at the LLDB command line. 134 135The ``--format`` (which you can shorten to -f) option accepts a :doc:`format 136name<formatting>`. Then, you provide one or more types to which you want the 137new format applied. 138 139A frequent scenario is that your program has a typedef for a numeric type that 140you know represents something that must be printed in a certain way. Again, you 141can add a format just to that typedef by using type format add with the name 142alias. 143 144But things can quickly get hierarchical. Let's say you have a situation like 145the following: 146 147:: 148 149 typedef int A; 150 typedef A B; 151 typedef B C; 152 typedef C D; 153 154and you want to show all A's as hex, all C's as byte arrays and leave the 155defaults untouched for other types (albeit its contrived look, the example is 156far from unrealistic in large software systems). 157 158If you simply type 159 160:: 161 162 (lldb) type format add -f hex A 163 (lldb) type format add -f uint8_t[] C 164 165values of type B will be shown as hex and values of type D as byte arrays, as in: 166 167:: 168 169 (lldb) frame variable -T 170 (A) a = 0x00000001 171 (B) b = 0x00000002 172 (C) c = {0x03 0x00 0x00 0x00} 173 (D) d = {0x04 0x00 0x00 0x00} 174 175This is because by default LLDB cascades formats through typedef chains. In 176order to avoid that you can use the option -C no to prevent cascading, thus 177making the two commands required to achieve your goal: 178 179:: 180 181 (lldb) type format add -C no -f hex A 182 (lldb) type format add -C no -f uint8_t[] C 183 184 185which provides the desired output: 186 187:: 188 189 (lldb) frame variable -T 190 (A) a = 0x00000001 191 (B) b = 2 192 (C) c = {0x03 0x00 0x00 0x00} 193 (D) d = 4 194 195Note, that qualifiers such as const and volatile will be stripped when matching types for example: 196 197:: 198 199 (lldb) frame var x y z 200 (int) x = 1 201 (const int) y = 2 202 (volatile int) z = 4 203 (lldb) type format add -f hex int 204 (lldb) frame var x y z 205 (int) x = 0x00000001 206 (const int) y = 0x00000002 207 (volatile int) z = 0x00000004 208 209Two additional options that you will want to look at are --skip-pointers (-p) 210and --skip-references (-r). These two options prevent LLDB from applying a 211format for type T to values of type T* and T& respectively. 212 213:: 214 215 (lldb) type format add -f float32[] int 216 (lldb) frame variable pointer *pointer -T 217 (int *) pointer = {1.46991e-39 1.4013e-45} 218 (int) *pointer = {1.53302e-42} 219 (lldb) type format add -f float32[] int -p 220 (lldb) frame variable pointer *pointer -T 221 (int *) pointer = 0x0000000100100180 222 (int) *pointer = {1.53302e-42} 223 224While they can be applied to pointers and references, formats will make no 225attempt to dereference the pointer and extract the value before applying the 226format, which means you are effectively formatting the address stored in the 227pointer rather than the pointee value. For this reason, you may want to use the 228-p option when defining formats. 229 230If you need to delete a custom format simply type type format delete followed 231by the name of the type to which the format applies.Even if you defined the 232same format for multiple types on the same command, type format delete will 233only remove the format for the type name passed as argument. 234 235To delete ALL formats, use ``type format clear``. To see all the formats 236defined, use type format list. 237 238If all you need to do, however, is display one variable in a custom format, 239while leaving the others of the same type untouched, you can simply type: 240 241:: 242 243 (lldb) frame variable counter -f hex 244 245This has the effect of displaying the value of counter as an hexadecimal 246number, and will keep showing it this way until you either pick a different 247format or till you let your program run again. 248 249Finally, this is a list of formatting options available out of which you can 250pick: 251 252+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 253| **Format name** | **Abbreviation** | **Description** | 254+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 255| ``default`` | | the default LLDB algorithm is used to pick a format | 256+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 257| ``boolean`` | B | show this as a true/false boolean, using the customary rule that 0 is | 258| | | false and everything else is true | 259+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 260| ``binary`` | b | show this as a sequence of bits | 261+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 262| ``bytes`` | y | show the bytes one after the other | 263+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 264| ``bytes with ASCII`` | Y | show the bytes, but try to display them as ASCII characters as well | 265+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 266| ``character`` | c | show the bytes as ASCII characters | 267+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 268| ``printable character`` | C | show the bytes as printable ASCII characters | 269+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 270| ``complex float`` | F | interpret this value as the real and imaginary part of a complex | 271| | | floating-point number | 272+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 273| ``c-string`` | s | show this as a 0-terminated C string | 274+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 275| ``decimal`` | d | show this as a signed integer number (this does not perform a cast, it | 276| | | simply shows the bytes as an integer with sign) | 277+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 278| ``enumeration`` | E | show this as an enumeration, printing the | 279| | | value's name if available or the integer value otherwise | 280+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 281| ``hex`` | x | show this as in hexadecimal notation (this does | 282| | | not perform a cast, it simply shows the bytes as hex) | 283+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 284| ``float`` | f | show this as a floating-point number (this does not perform a cast, it | 285| | | simply interprets the bytes as an IEEE754 floating-point value) | 286+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 287| ``octal`` | o | show this in octal notation | 288+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 289| ``OSType`` | O | show this as a MacOS OSType | 290+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 291| ``unicode16`` | U | show this as UTF-16 characters | 292+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 293| ``unicode32`` | | show this as UTF-32 characters | 294+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 295| ``unsigned decimal`` | u | show this as an unsigned integer number (this does not perform a cast, | 296| | | it simply shows the bytes as unsigned integer) | 297+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 298| ``pointer`` | p | show this as a native pointer (unless this is really a pointer, the | 299| | | resulting address will probably be invalid) | 300+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 301| ``char[]`` | | show this as an array of characters | 302+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 303| ``int8_t[], uint8_t[]`` | | show this as an array of the corresponding integer type | 304| ``int16_t[], uint16_t[]`` | | | 305| ``int32_t[], uint32_t[]`` | | | 306| ``int64_t[], uint64_t[]`` | | | 307| ``uint128_t[]`` | | | 308+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 309| ``float32[], float64[]`` | | show this as an array of the corresponding | 310| | | floating-point type | 311+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 312| ``complex integer`` | I | interpret this value as the real and imaginary part of a complex integer | 313| | | number | 314+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 315| ``character array`` | a | show this as a character array | 316+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 317| ``address`` | A | show this as an address target (symbol/file/line + offset), possibly | 318| | | also the string this address is pointing to | 319+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 320| ``hex float`` | | show this as hexadecimal floating point | 321+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 322| ``instruction`` | i | show this as an disassembled opcode | 323+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 324| ``void`` | v | don't show anything | 325+-----------------------------------------------+------------------+--------------------------------------------------------------------------+ 326 327Type Summary 328------------ 329 330Type formats work by showing a different kind of display for the value of a 331variable. However, they only work for basic types. When you want to display a 332class or struct in a custom format, you cannot do that using formats. 333 334A different feature, type summaries, works by extracting information from 335classes, structures, ... (aggregate types) and arranging it in a user-defined 336format, as in the following example: 337 338before adding a summary... 339 340:: 341 342 (lldb) frame variable -T one 343 (i_am_cool) one = { 344 (int) x = 3 345 (float) y = 3.14159 346 (char) z = 'E' 347 } 348 349after adding a summary... 350 351:: 352 353 (lldb) frame variable one 354 (i_am_cool) one = int = 3, float = 3.14159, char = 69 355 356There are two ways to use type summaries: the first one is to bind a summary 357string to the type; the second is to write a Python script that returns the 358string to be used as summary. Both options are enabled by the type summary add 359command. 360 361The command to obtain the output shown in the example is: 362 363:: 364 365(lldb) type summary add --summary-string "int = ${var.x}, float = ${var.y}, char = ${var.z%u}" i_am_cool 366 367Initially, we will focus on summary strings, and then describe the Python 368binding mechanism. 369 370Summary Strings 371--------------- 372 373Summary strings are written using a simple control language, exemplified by the 374snippet above. A summary string contains a sequence of tokens that are 375processed by LLDB to generate the summary. 376 377Summary strings can contain plain text, control characters and special 378variables that have access to information about the current object and the 379overall program state. 380 381Plain text is any sequence of characters that doesn't contain a ``{``, ``}``, ``$``, 382or ``\`` character, which are the syntax control characters. 383 384The special variables are found in between a "${" prefix, and end with a "}" 385suffix. Variables can be a simple name or they can refer to complex objects 386that have subitems themselves. In other words, a variable looks like 387``${object}`` or ``${object.child.otherchild}``. A variable can also be 388prefixed or suffixed with other symbols meant to change the way its value is 389handled. An example is ``${*var.int_pointer[0-3]}``. 390 391Basically, the syntax is the same one described Frame and Thread Formatting 392plus additional symbols specific for summary strings. The main of them is 393${var, which is used refer to the variable that a summary is being created for. 394 395The simplest thing you can do is grab a member variable of a class or structure 396by typing its expression path. In the previous example, the expression path for 397the field float y is simply .y. Thus, to ask the summary string to display y 398you would type ${var.y}. 399 400If you have code like the following: 401 402:: 403 404 struct A { 405 int x; 406 int y; 407 }; 408 struct B { 409 A x; 410 A y; 411 int *z; 412 }; 413 414the expression path for the y member of the x member of an object of type B 415would be .x.y and you would type ``${var.x.y}`` to display it in a summary 416string for type B. 417 418By default, a summary defined for type T, also works for types T* and T& (you 419can disable this behavior if desired). For this reason, expression paths do not 420differentiate between . and ->, and the above expression path .x.y would be 421just as good if you were displaying a B*, or even if the actual definition of B 422were: 423 424:: 425 426 struct B { 427 A *x; 428 A y; 429 int *z; 430 }; 431 432This is unlike the behavior of frame variable which, on the contrary, will 433enforce the distinction. As hinted above, the rationale for this choice is that 434waiving this distinction enables you to write a summary string once for type T 435and use it for both T and T* instances. As a summary string is mostly about 436extracting nested members' information, a pointer to an object is just as good 437as the object itself for the purpose. 438 439If you need to access the value of the integer pointed to by B::z, you cannot 440simply say ${var.z} because that symbol refers to the pointer z. In order to 441dereference it and get the pointed value, you should say ``${*var.z}``. The 442``${*var`` tells LLDB to get the object that the expression paths leads to, and 443then dereference it. In this example is it equivalent to ``*(bObject.z)`` in 444C/C++ syntax. Because ``.`` and ``->`` operators can both be used, there is no 445need to have dereferences in the middle of an expression path (e.g. you do not 446need to type ``${*(var.x).x}``) to read A::x as contained in ``*(B::x)``. To 447achieve that effect you can simply write ``${var.x->x}``, or even 448``${var.x.x}``. The ``*`` operator only binds to the result of the whole 449expression path, rather than piecewise, and there is no way to use parentheses 450to change that behavior. 451 452Of course, a summary string can contain more than one ${var specifier, and can 453use ``${var`` and ``${*var`` specifiers together. 454 455Formatting Summary Elements 456--------------------------- 457 458An expression path can include formatting codes. Much like the type formats 459discussed previously, you can also customize the way variables are displayed in 460summary strings, regardless of the format they have applied to their types. To 461do that, you can use %format inside an expression path, as in ${var.x->x%u}, 462which would display the value of x as an unsigned integer. 463 464You can also use some other special format markers, not available for formats 465themselves, but which carry a special meaning when used in this context: 466 467+------------+--------------------------------------------------------------------------+ 468| **Symbol** | **Description** | 469+------------+--------------------------------------------------------------------------+ 470| ``Symbol`` | ``Description`` | 471+------------+--------------------------------------------------------------------------+ 472| ``%S`` | Use this object's summary (the default for aggregate types) | 473+------------+--------------------------------------------------------------------------+ 474| ``%V`` | Use this object's value (the default for non-aggregate types) | 475+------------+--------------------------------------------------------------------------+ 476| ``%@`` | Use a language-runtime specific description (for C++ this does nothing, | 477| | for Objective-C it calls the NSPrintForDebugger API) | 478+------------+--------------------------------------------------------------------------+ 479| ``%L`` | Use this object's location (memory address, register name, ...) | 480+------------+--------------------------------------------------------------------------+ 481| ``%#`` | Use the count of the children of this object | 482+------------+--------------------------------------------------------------------------+ 483| ``%T`` | Use this object's datatype name | 484+------------+--------------------------------------------------------------------------+ 485| ``%N`` | Print the variable's basename | 486+------------+--------------------------------------------------------------------------+ 487| ``%>`` | Print the expression path for this item | 488+------------+--------------------------------------------------------------------------+ 489 490Since lldb 3.7.0, you can also specify ``${script.var:pythonFuncName}``. 491 492It is expected that the function name you use specifies a function whose 493signature is the same as a Python summary function. The return string from the 494function will be placed verbatim in the output. 495 496You cannot use element access, or formatting symbols, in combination with this 497syntax. For example the following: 498 499:: 500 501 ${script.var.element[0]:myFunctionName%@} 502 503is not valid and will cause the summary to fail to evaluate. 504 505 506Element Inlining 507---------------- 508 509Option --inline-children (-c) to type summary add tells LLDB not to look for a summary string, but instead to just print a listing of all the object's children on one line. 510 511As an example, given a type pair: 512 513:: 514 515 (lldb) frame variable --show-types a_pair 516 (pair) a_pair = { 517 (int) first = 1; 518 (int) second = 2; 519 } 520 521If one types the following commands: 522 523:: 524 525 (lldb) type summary add --inline-children pair 526 527the output becomes: 528 529:: 530 531 (lldb) frame variable a_pair 532 (pair) a_pair = (first=1, second=2) 533 534 535Of course, one can obtain the same effect by typing 536 537:: 538 539 (lldb) type summary add pair --summary-string "(first=${var.first}, second=${var.second})" 540 541While the final result is the same, using --inline-children can often save 542time. If one does not need to see the names of the variables, but just their 543values, the option --omit-names (-O, uppercase letter o), can be combined with 544--inline-children to obtain: 545 546:: 547 548 (lldb) frame variable a_pair 549 (pair) a_pair = (1, 2) 550 551which is of course the same as typing 552 553:: 554 555 (lldb) type summary add pair --summary-string "(${var.first}, ${var.second})" 556 557Bitfields And Array Syntax 558-------------------------- 559 560Sometimes, a basic type's value actually represents several different values 561packed together in a bitfield. 562 563With the classical view, there is no way to look at them. Hexadecimal display 564can help, but if the bits actually span nibble boundaries, the help is limited. 565 566Binary view would show it all without ambiguity, but is often too detailed and 567hard to read for real-life scenarios. 568 569To cope with the issue, LLDB supports native bitfield formatting in summary 570strings. If your expression paths leads to a so-called scalar type (the usual 571int, float, char, double, short, long, long long, double, long double and 572unsigned variants), you can ask LLDB to only grab some bits out of the value 573and display them in any format you like. If you only need one bit you can use 574the [n], just like indexing an array. To extract multiple bits, you can use a 575slice-like syntax: [n-m], e.g. 576 577:: 578 579 (lldb) frame variable float_point 580 (float) float_point = -3.14159 581 582:: 583 584 (lldb) type summary add --summary-string "Sign: ${var[31]%B} Exponent: ${var[30-23]%x} Mantissa: ${var[0-22]%u}" float 585 (lldb) frame variable float_point 586 (float) float_point = -3.14159 Sign: true Exponent: 0x00000080 Mantissa: 4788184 587 588In this example, LLDB shows the internal representation of a float variable by 589extracting bitfields out of a float object. 590 591When typing a range, the extremes n and m are always included, and the order of 592the indices is irrelevant. 593 594LLDB also allows to use a similar syntax to display array members inside a summary string. For instance, you may want to display all arrays of a given type using a more compact notation than the default, and then just delve into individual array members that prove interesting to your debugging task. You can tell LLDB to format arrays in special ways, possibly independent of the way the array members' datatype is formatted. 595e.g. 596 597:: 598 599 (lldb) frame variable sarray 600 (Simple [3]) sarray = { 601 [0] = { 602 x = 1 603 y = 2 604 z = '\x03' 605 } 606 [1] = { 607 x = 4 608 y = 5 609 z = '\x06' 610 } 611 [2] = { 612 x = 7 613 y = 8 614 z = '\t' 615 } 616 } 617 618 (lldb) type summary add --summary-string "${var[].x}" "Simple [3]" 619 620 (lldb) frame variable sarray 621 (Simple [3]) sarray = [1,4,7] 622 623The [] symbol amounts to: if var is an array and I know its size, apply this summary string to every element of the array. Here, we are asking LLDB to display .x for every element of the array, and in fact this is what happens. If you find some of those integers anomalous, you can then inspect that one item in greater detail, without the array format getting in the way: 624 625:: 626 627 (lldb) frame variable sarray[1] 628 (Simple) sarray[1] = { 629 x = 4 630 y = 5 631 z = '\x06' 632 } 633 634You can also ask LLDB to only print a subset of the array range by using the 635same syntax used to extract bit for bitfields: 636 637:: 638 639 (lldb) type summary add --summary-string "${var[1-2].x}" "Simple [3]" 640 641 (lldb) frame variable sarray 642 (Simple [3]) sarray = [4,7] 643 644If you are dealing with a pointer that you know is an array, you can use this 645syntax to display the elements contained in the pointed array instead of just 646the pointer value. However, because pointers have no notion of their size, the 647empty brackets [] operator does not work, and you must explicitly provide 648higher and lower bounds. 649 650In general, LLDB needs the square brackets ``operator []`` in order to handle 651arrays and pointers correctly, and for pointers it also needs a range. However, 652a few special cases are defined to make your life easier: 653 654you can print a 0-terminated string (C-string) using the %s format, omitting 655square brackets, as in: 656 657:: 658 659 (lldb) type summary add --summary-string "${var%s}" "char *" 660 661This syntax works for char* as well as for char[] because LLDB can rely on the 662final \0 terminator to know when the string has ended. 663 664LLDB has default summary strings for char* and char[] that use this special 665case. On debugger startup, the following are defined automatically: 666 667:: 668 669 (lldb) type summary add --summary-string "${var%s}" "char *" 670 (lldb) type summary add --summary-string "${var%s}" -x "char \[[0-9]+]" 671 672any of the array formats (int8_t[], float32{}, ...), and the y, Y and a formats 673work to print an array of a non-aggregate type, even if square brackets are 674omitted. 675 676:: 677 678 (lldb) type summary add --summary-string "${var%int32_t[]}" "int [10]" 679 680This feature, however, is not enabled for pointers because there is no way for 681LLDB to detect the end of the pointed data. 682 683This also does not work for other formats (e.g. boolean), and you must specify 684the square brackets operator to get the expected output. 685 686Python Scripting 687---------------- 688 689Most of the times, summary strings prove good enough for the job of summarizing 690the contents of a variable. However, as soon as you need to do more than 691picking some values and rearranging them for display, summary strings stop 692being an effective tool. This is because summary strings lack the power to 693actually perform any kind of computation on the value of variables. 694 695To solve this issue, you can bind some Python scripting code as a summary for 696your datatype, and that script has the ability to both extract children 697variables as the summary strings do and to perform active computation on the 698extracted values. As a small example, let's say we have a Rectangle class: 699 700:: 701 702 703 class Rectangle 704 { 705 private: 706 int height; 707 int width; 708 public: 709 Rectangle() : height(3), width(5) {} 710 Rectangle(int H) : height(H), width(H*2-1) {} 711 Rectangle(int H, int W) : height(H), width(W) {} 712 int GetHeight() { return height; } 713 int GetWidth() { return width; } 714 }; 715 716Summary strings are effective to reduce the screen real estate used by the 717default viewing mode, but are not effective if we want to display the area and 718perimeter of Rectangle objects 719 720To obtain this, we can simply attach a small Python script to the Rectangle 721class, as shown in this example: 722 723:: 724 725 (lldb) type summary add -P Rectangle 726 Enter your Python command(s). Type 'DONE' to end. 727 def function (valobj,internal_dict,options): 728 height_val = valobj.GetChildMemberWithName('height') 729 width_val = valobj.GetChildMemberWithName('width') 730 height = height_val.GetValueAsUnsigned(0) 731 width = width_val.GetValueAsUnsigned(0) 732 area = height*width 733 perimeter = 2*(height + width) 734 return 'Area: ' + str(area) + ', Perimeter: ' + str(perimeter) 735 DONE 736 (lldb) frame variable 737 (Rectangle) r1 = Area: 20, Perimeter: 18 738 (Rectangle) r2 = Area: 72, Perimeter: 36 739 (Rectangle) r3 = Area: 16, Perimeter: 16 740 741In order to write effective summary scripts, you need to know the LLDB public 742API, which is the way Python code can access the LLDB object model. For further 743details on the API you should look at the LLDB API reference documentation. 744 745 746As a brief introduction, your script is encapsulated into a function that is 747passed two parameters: ``valobj`` and ``internal_dict``. 748 749``internal_dict`` is an internal support parameter used by LLDB and you should 750not touch it. 751 752``valobj`` is the object encapsulating the actual variable being displayed, and 753its type is `SBValue`. Out of the many possible operations on an `SBValue`, the 754basic one is retrieve the children objects it contains (essentially, the fields 755of the object wrapped by it), by calling ``GetChildMemberWithName()``, passing 756it the child's name as a string. 757 758If the variable has a value, you can ask for it, and return it as a string 759using ``GetValue()``, or as a signed/unsigned number using 760``GetValueAsSigned()``, ``GetValueAsUnsigned()``. It is also possible to 761retrieve an `SBData` object by calling ``GetData()`` and then read the object's 762contents out of the `SBData`. 763 764If you need to delve into several levels of hierarchy, as you can do with 765summary strings, you can use the method ``GetValueForExpressionPath()``, 766passing it an expression path just like those you could use for summary strings 767(one of the differences is that dereferencing a pointer does not occur by 768prefixing the path with a ``*```, but by calling the ``Dereference()`` method 769on the returned `SBValue`). If you need to access array slices, you cannot do 770that (yet) via this method call, and you must use ``GetChildAtIndex()`` 771querying it for the array items one by one. Also, handling custom formats is 772something you have to deal with on your own. 773 774``options`` Python summary formatters can optionally define this 775third argument, which is an object of type ``lldb.SBTypeSummaryOptions``, 776allowing for a few customizations of the result. The decision to 777adopt or not this third argument - and the meaning of options 778thereof - is up to the individual formatter's writer. 779 780Other than interactively typing a Python script there are two other ways for 781you to input a Python script as a summary: 782 783- using the --python-script option to type summary add and typing the script 784 code as an option argument; as in: 785 786:: 787 788 (lldb) type summary add --python-script "height = valobj.GetChildMemberWithName('height').GetValueAsUnsigned(0);width = valobj.GetChildMemberWithName('width').GetValueAsUnsigned(0); return 'Area: %d' % (height*width)" Rectangle 789 790 791- using the --python-function (-F) option to type summary add and giving the 792 name of a Python function with the correct prototype. Most probably, you will 793 define (or have already defined) the function in the interactive interpreter, 794 or somehow loaded it from a file, using the command script import command. 795 LLDB will emit a warning if it is unable to find the function you passed, but 796 will still register the binding. 797 798Regular Expression Typenames 799---------------------------- 800 801As you noticed, in order to associate the custom summary string to the array 802types, one must give the array size as part of the typename. This can long 803become tiresome when using arrays of different sizes, Simple [3], Simple [9], 804Simple [12], ... 805 806If you use the -x option, type names are treated as regular expressions instead 807of type names. This would let you rephrase the above example for arrays of type 808Simple [3] as: 809 810:: 811 (lldb) type summary add --summary-string "${var[].x}" -x "Simple \[[0-9]+\]" 812 (lldb) frame variable 813 (Simple [3]) sarray = [1,4,7] 814 (Simple [2]) sother = [3,6] 815 816The above scenario works for Simple [3] as well as for any other array of 817Simple objects. 818 819While this feature is mostly useful for arrays, you could also use regular 820expressions to catch other type sets grouped by name. However, as regular 821expression matching is slower than normal name matching, LLDB will first try to 822match by name in any way it can, and only when this fails, will it resort to 823regular expression matching. 824 825One of the ways LLDB uses this feature internally, is to match the names of STL 826container classes, regardless of the template arguments provided. The details 827for this are found at FormatManager.cpp 828 829The regular expression language used by LLDB is the POSIX extended language, as 830defined by the Single UNIX Specification, of which macOS is a compliant 831implementation. 832 833Names Summaries 834--------------- 835 836For a given type, there may be different meaningful summary representations. 837However, currently, only one summary can be associated to a type at each 838moment. If you need to temporarily override the association for a variable, 839without changing the summary string for to its type, you can use named 840summaries. 841 842Named summaries work by attaching a name to a summary when creating it. Then, 843when there is a need to attach the summary to a variable, the frame variable 844command, supports a --summary option that tells LLDB to use the named summary 845given instead of the default one. 846 847:: 848 849 (lldb) type summary add --summary-string "x=${var.integer}" --name NamedSummary 850 (lldb) frame variable one 851 (i_am_cool) one = int = 3, float = 3.14159, char = 69 852 (lldb) frame variable one --summary NamedSummary 853 (i_am_cool) one = x=3 854 855When defining a named summary, binding it to one or more types becomes 856optional. Even if you bind the named summary to a type, and later change the 857summary string for that type, the named summary will not be changed by that. 858You can delete named summaries by using the type summary delete command, as if 859the summary name was the datatype that the summary is applied to 860 861A summary attached to a variable using the --summary option, has the same 862semantics that a custom format attached using the -f option has: it stays 863attached till you attach a new one, or till you let your program run again. 864 865Synthetic Children 866------------------ 867 868Summaries work well when one is able to navigate through an expression path. In 869order for LLDB to do so, appropriate debugging information must be available. 870 871Some types are opaque, i.e. no knowledge of their internals is provided. When 872that's the case, expression paths do not work correctly. 873 874In other cases, the internals are available to use in expression paths, but 875they do not provide a user-friendly representation of the object's value. 876 877For instance, consider an STL vector, as implemented by the GNU C++ Library: 878 879:: 880 881 (lldb) frame variable numbers -T 882 (std::vector<int>) numbers = { 883 (std::_Vector_base<int, std::allocator<int> >) std::_Vector_base<int, std::allocator<int> > = { 884 (std::_Vector_base<int, std::allocator&tl;int> >::_Vector_impl) _M_impl = { 885 (int *) _M_start = 0x00000001001008a0 886 (int *) _M_finish = 0x00000001001008a8 887 (int *) _M_end_of_storage = 0x00000001001008a8 888 } 889 } 890 } 891 892Here, you can see how the type is implemented, and you can write a summary for 893that implementation but that is not going to help you infer what items are 894actually stored in the vector. 895 896What you would like to see is probably something like: 897 898:: 899 900 (lldb) frame variable numbers -T 901 (std::vector<int>) numbers = { 902 (int) [0] = 1 903 (int) [1] = 12 904 (int) [2] = 123 905 (int) [3] = 1234 906 } 907 908Synthetic children are a way to get that result. 909 910The feature is based upon the idea of providing a new set of children for a 911variable that replaces the ones available by default through the debug 912information. In the example, we can use synthetic children to provide the 913vector items as children for the std::vector object. 914 915In order to create synthetic children, you need to provide a Python class that 916adheres to a given interface (the word is italicized because Python has no 917explicit notion of interface, by that word we mean a given set of methods must 918be implemented by the Python class): 919 920.. code-block:: python 921 922 class SyntheticChildrenProvider: 923 def __init__(self, valobj, internal_dict): 924 this call should initialize the Python object using valobj as the variable to provide synthetic children for 925 def num_children(self): 926 this call should return the number of children that you want your object to have 927 def get_child_index(self,name): 928 this call should return the index of the synthetic child whose name is given as argument 929 def get_child_at_index(self,index): 930 this call should return a new LLDB SBValue object representing the child at the index given as argument 931 def update(self): 932 this call should be used to update the internal state of this Python object whenever the state of the variables in LLDB changes.[1] 933 Also, this method is invoked before any other method in the interface. 934 def has_children(self): 935 this call should return True if this object might have children, and False if this object can be guaranteed not to have children.[2] 936 def get_value(self): 937 this call can return an SBValue to be presented as the value of the synthetic value under consideration.[3] 938 939As a warning, exceptions that are thrown by python formatters are caught 940silently by LLDB and should be handled appropriately by the formatter itself. 941Being more specific, in case of exceptions, LLDB might assume that the given 942object has no children or it might skip printing some children, as they are 943printed one by one. 944 945[1] This method is optional. Also, a boolean value must be returned (since lldb 9463.1.0). If ``False`` is returned, then whenever the process reaches a new stop, 947this method will be invoked again to generate an updated list of the children 948for a given variable. Otherwise, if ``True`` is returned, then the value is 949cached and this method won't be called again, effectively freezing the state of 950the value in subsequent stops. Beware that returning ``True`` incorrectly could 951show misleading information to the user. 952 953[2] This method is optional (since lldb 3.2.0). While implementing it in terms 954of num_children is acceptable, implementors are encouraged to look for 955optimized coding alternatives whenever reasonable. 956 957[3] This method is optional (since lldb 3.5.2). The `SBValue` you return here 958will most likely be a numeric type (int, float, ...) as its value bytes will be 959used as-if they were the value of the root `SBValue` proper. As a shortcut for 960this, you can inherit from lldb.SBSyntheticValueProvider, and just define 961get_value as other methods are defaulted in the superclass as returning default 962no-children responses. 963 964If a synthetic child provider supplies a special child named 965``$$dereference$$`` then it will be used when evaluating ``operator *`` and 966``operator ->`` in the frame variable command and related SB API 967functions. It is possible to declare this synthetic child without 968including it in the range of children displayed by LLDB. For example, 969this subset of a synthetic children provider class would allow the 970synthetic value to be dereferenced without actually showing any 971synthtic children in the UI: 972 973.. code-block:: python 974 975 class SyntheticChildrenProvider: 976 [...] 977 def num_children(self): 978 return 0 979 def get_child_index(self, name): 980 if name == '$$dereference$$': 981 return 0 982 return -1 983 def get_child_at_index(self, index): 984 if index == 0: 985 return <valobj resulting from dereference> 986 return None 987 988 989For examples of how synthetic children are created, you are encouraged to look 990at examples/synthetic in the LLDB trunk. Please, be aware that the code in 991those files (except bitfield/) is legacy code and is not maintained. You may 992especially want to begin looking at this example to get a feel for this 993feature, as it is a very easy and well commented example. 994 995The design pattern consistently used in synthetic providers shipping with LLDB 996is to use the __init__ to store the `SBValue` instance as a part of self. The 997update function is then used to perform the actual initialization. Once a 998synthetic children provider is written, one must load it into LLDB before it 999can be used. Currently, one can use the LLDB script command to type Python code 1000interactively, or use the command script import fileName command to load Python 1001code from a Python module (ordinary rules apply to importing modules this way). 1002A third option is to type the code for the provider class interactively while 1003adding it. 1004 1005For example, let's pretend we have a class Foo for which a synthetic children 1006provider class Foo_Provider is available, in a Python module contained in file 1007~/Foo_Tools.py. The following interaction sets Foo_Provider as a synthetic 1008children provider in LLDB: 1009 1010:: 1011 1012 (lldb) command script import ~/Foo_Tools.py 1013 (lldb) type synthetic add Foo --python-class Foo_Tools.Foo_Provider 1014 (lldb) frame variable a_foo 1015 (Foo) a_foo = { 1016 x = 1 1017 y = "Hello world" 1018 } 1019 1020LLDB has synthetic children providers for a core subset of STL classes, both in 1021the version provided by libstdcpp and by libcxx, as well as for several 1022Foundation classes. 1023 1024Synthetic children extend summary strings by enabling a new special variable: 1025``${svar``. 1026 1027This symbol tells LLDB to refer expression paths to the synthetic children 1028instead of the real ones. For instance, 1029 1030:: 1031 1032 (lldb) type summary add --expand -x "std::vector<" --summary-string "${svar%#} items" 1033 (lldb) frame variable numbers 1034 (std::vector<int>) numbers = 4 items { 1035 (int) [0] = 1 1036 (int) [1] = 12 1037 (int) [2] = 123 1038 (int) [3] = 1234 1039 } 1040 1041It's important to mention that LLDB invokes the synthetic child provider before 1042invoking the summary string provider, which allows the latter to have access to 1043the actual displayable children. This applies to both inlined summary strings 1044and python-based summary providers. 1045 1046 1047As a warning, when programmatically accessing the children or children count of 1048a variable that has a synthetic child provider, notice that LLDB hides the 1049actual raw children. For example, suppose we have a ``std::vector``, which has 1050an actual in-memory property ``__begin`` marking the beginning of its data. 1051After the synthetic child provider is executed, the ``std::vector`` variable 1052won't show ``__begin`` as child anymore, even through the SB API. It will have 1053instead the children calculated by the provider. In case the actual raw 1054children are needed, a call to ``value.GetNonSyntheticValue()`` is enough to 1055get a raw version of the value. It is import to remember this when implementing 1056summary string providers, as they run after the synthetic child provider. 1057 1058 1059In some cases, if LLDB is unable to use the real object to get a child 1060specified in an expression path, it will automatically refer to the synthetic 1061children. While in summaries it is best to always use ${svar to make your 1062intentions clearer, interactive debugging can benefit from this behavior, as 1063in: 1064 1065:: 1066 1067 (lldb) frame variable numbers[0] numbers[1] 1068 (int) numbers[0] = 1 1069 (int) numbers[1] = 12 1070 1071Unlike many other visualization features, however, the access to synthetic 1072children only works when using frame variable, and is not supported in 1073expression: 1074 1075:: 1076 1077 (lldb) expression numbers[0] 1078 Error [IRForTarget]: Call to a function '_ZNSt33vector<int, std::allocator<int> >ixEm' that is not present in the target 1079 error: Couldn't convert the expression to DWARF 1080 1081The reason for this is that classes might have an overloaded ``operator []``, 1082or other special provisions and the expression command chooses to ignore 1083synthetic children in the interest of equivalency with code you asked to have 1084compiled from source. 1085 1086Filters 1087------- 1088 1089Filters are a solution to the display of complex classes. At times, classes 1090have many member variables but not all of these are actually necessary for the 1091user to see. 1092 1093A filter will solve this issue by only letting the user see those member 1094variables they care about. Of course, the equivalent of a filter can be 1095implemented easily using synthetic children, but a filter lets you get the job 1096done without having to write Python code. 1097 1098For instance, if your class Foobar has member variables named A thru Z, but you 1099only need to see the ones named B, H and Q, you can define a filter: 1100 1101:: 1102 1103 (lldb) type filter add Foobar --child B --child H --child Q 1104 (lldb) frame variable a_foobar 1105 (Foobar) a_foobar = { 1106 (int) B = 1 1107 (char) H = 'H' 1108 (std::string) Q = "Hello world" 1109 } 1110 1111Objective-C Dynamic Type Discovery 1112---------------------------------- 1113 1114When doing Objective-C development, you may notice that some of your variables 1115come out as of type id (for instance, items extracted from NSArray). By 1116default, LLDB will not show you the real type of the object. it can actually 1117dynamically discover the type of an Objective-C variable, much like the runtime 1118itself does when invoking a selector. In order to be shown the result of that 1119discovery that, however, a special option to frame variable or expression is 1120required: ``--dynamic-type``. 1121 1122 1123``--dynamic-type`` can have one of three values: 1124 1125- ``no-dynamic-values``: the default, prevents dynamic type discovery 1126- ``no-run-target``: enables dynamic type discovery as long as running code on 1127 the target is not required 1128- ``run-target``: enables code execution on the target in order to perform 1129 dynamic type discovery 1130 1131If you specify a value of either no-run-target or run-target, LLDB will detect 1132the dynamic type of your variables and show the appropriate formatters for 1133them. As an example: 1134 1135:: 1136 1137 (lldb) expr @"Hello" 1138 (NSString *) $0 = 0x00000001048000b0 @"Hello" 1139 (lldb) expr -d no-run @"Hello" 1140 (__NSCFString *) $1 = 0x00000001048000b0 @"Hello" 1141 1142Because LLDB uses a detection algorithm that does not need to invoke any 1143functions on the target process, no-run-target is enough for this to work. 1144 1145As a side note, the summary for NSString shown in the example is built right 1146into LLDB. It was initially implemented through Python (the code is still 1147available for reference at CFString.py). However, this is out of sync with the 1148current implementation of the NSString formatter (which is a C++ function 1149compiled into the LLDB core). 1150 1151Categories 1152---------- 1153 1154Categories are a way to group related formatters. For instance, LLDB itself 1155groups the formatters for the libstdc++ types in a category named 1156gnu-libstdc++. Basically, categories act like containers in which to store 1157formatters for a same library or OS release. 1158 1159By default, several categories are created in LLDB: 1160 1161- default: this is the category where every formatter ends up, unless another category is specified 1162- objc: formatters for basic and common Objective-C types that do not specifically depend on macOS 1163- gnu-libstdc++: formatters for std::string, std::vector, std::list and std::map as implemented by libstdcpp 1164- libcxx: formatters for std::string, std::vector, std::list and std::map as implemented by libcxx 1165- system: truly basic types for which a formatter is required 1166- AppKit: Cocoa classes 1167- CoreFoundation: CF classes 1168- CoreGraphics: CG classes 1169- CoreServices: CS classes 1170- VectorTypes: compact display for several vector types 1171 1172If you want to use a custom category for your formatters, all the type ... add 1173provide a --category (-w) option, that names the category to add the formatter 1174to. To delete the formatter, you then have to specify the correct category. 1175 1176Categories can be in one of two states: enabled and disabled. A category is 1177initially disabled, and can be enabled using the type category enable command. 1178To disable an enabled category, the command to use is type category disable. 1179 1180The order in which categories are enabled or disabled is significant, in that 1181LLDB uses that order when looking for formatters. Therefore, when you enable a 1182category, it becomes the second one to be searched (after default, which always 1183stays on top of the list). The default categories are enabled in such a way 1184that the search order is: 1185 1186- default 1187- objc 1188- CoreFoundation 1189- AppKit 1190- CoreServices 1191- CoreGraphics 1192- gnu-libstdc++ 1193- libcxx 1194- VectorTypes 1195- system 1196 1197As said, gnu-libstdc++ and libcxx contain formatters for C++ STL data types. 1198system contains formatters for char* and char[], which reflect the behavior of 1199older versions of LLDB which had built-in formatters for these types. Because 1200now these are formatters, you can even replace them with your own if so you 1201wish. 1202 1203There is no special command to create a category. When you place a formatter in 1204a category, if that category does not exist, it is automatically created. For 1205instance, 1206 1207:: 1208 1209 (lldb) type summary add Foobar --summary-string "a foobar" --category newcategory 1210 1211automatically creates a (disabled) category named newcategory. 1212 1213Another way to create a new (empty) category, is to enable it, as in: 1214 1215:: 1216 1217 (lldb) type category enable newcategory 1218 1219However, in this case LLDB warns you that enabling an empty category has no 1220effect. If you add formatters to the category after enabling it, they will be 1221honored. But an empty category per se does not change the way any type is 1222displayed. The reason the debugger warns you is that enabling an empty category 1223might be a typo, and you effectively wanted to enable a similarly-named but 1224not-empty category. 1225 1226Finding Formatters 101 1227---------------------- 1228 1229Searching for a formatter (including formats, since lldb 3.4.0) given a 1230variable goes through a rather intricate set of rules. Namely, what happens is 1231that LLDB starts looking in each enabled category, according to the order in 1232which they were enabled (latest enabled first). In each category, LLDB does the 1233following: 1234 1235- If there is a formatter for the type of the variable, use it 1236- If this object is a pointer, and there is a formatter for the pointee type 1237 that does not skip pointers, use it 1238- If this object is a reference, and there is a formatter for the referred type 1239 that does not skip references, use it 1240- If this object is an Objective-C class and dynamic types are enabled, look 1241 for a formatter for the dynamic type of the object. If dynamic types are 1242 disabled, or the search failed, look for a formatter for the declared type of 1243 the object 1244- If this object's type is a typedef, go through typedef hierarchy (LLDB might 1245 not be able to do this if the compiler has not emitted enough information. If 1246 the required information to traverse typedef hierarchies is missing, type 1247 cascading will not work. The clang compiler, part of the LLVM project, emits 1248 the correct debugging information for LLDB to cascade). If at any level of 1249 the hierarchy there is a valid formatter that can cascade, use it. 1250- If everything has failed, repeat the above search, looking for regular 1251 expressions instead of exact matches 1252 1253If any of those attempts returned a valid formatter to be used, that one is 1254used, and the search is terminated (without going to look in other categories). 1255If nothing was found in the current category, the next enabled category is 1256scanned according to the same algorithm. If there are no more enabled 1257categories, the search has failed. 1258 1259**Warning**: previous versions of LLDB defined cascading to mean not only going 1260through typedef chains, but also through inheritance chains. This feature has 1261been removed since it significantly degrades performance. You need to set up 1262your formatters for every type in inheritance chains to which you want the 1263formatter to apply. 1264