[fpc-devel] DW_AT_external and other additions to FPC generated dwarf

Martin Frb via fpc-devel Thu, 23 Mar 2023 01:45:32 -0700

After the brief exchange onhttps://gitlab.com/freepascal.org/fpc/source/-/issues/40208

There are various considerations (ideas/requests) to hopefully helpimprove debugging experience.

I have recently added 3 issues, but there is more. And I wanted to add abit of background here, since it is not all black and white.


1) Scoped Enums https://gitlab.com/freepascal.org/fpc/source/-/issues/40208

2) Unit Search orderhttps://gitlab.com/freepascal.org/fpc/source/-/issues/402093) DW_AT_external for typeshttps://gitlab.com/freepascal.org/fpc/source/-/issues/40210

4) "official" marker for string vs pchar vs array
5) Duplicated (artificial) types under Windows
5a) Missing address for class methods
6) "var param" for function calls / managed param
...

-----------
1) is simple to reason (IMHO)
There is an example in sysutils:
  {$SCOPEDENUMS OFF}
  TUseBoolStrs = (False, True);

If the debugger reads this, before getting to the definition of the"True" (boolean), then expressions could fail if they contain the boolconstants true/false.


-----------
2) is required for looking up global vars.
A global var of the same name can exist in different units.
If paused on some code
  if GlobalFoo > 5 then

The debugger needs to work out which GlobalFoo that is.

2) May or may not have an impact on type lookups. See 3.

-----------
3) DW_AT_external (or visibility) for types

After reconsidering, that one is actually more debatable. But IMHO stilluseful.


unit foo;
interface
implementation
type PCHAR = ^widechar; // does not want to be seen outside this unit.

Granted this is not the most likely case to happen. But it may happen.

At first types seem to be save-ish. If a variable is declared in thecurrent unit (or otherwise found in the correct unit, according to "unitsearch order", then the debug info of that variable points to thecorrect type.

No ambiguity, not even with global types.

The issue occurs, when a user writes a watch, using type casts withglobal types (that aren't from the current unit).

     pchar(foo)

pinteger(foo) // this one can be ^smallint from unit system,though that is not a implementation vs interface

    TForm1(Sender)

In each case the debugger needs to find the correct type (if more thatone exists). And in each case, that is never 100% accurate, unless onlyone type exists.

But imho can still benefit from the difference between implementationand interface. Unless fully qualified, the user is unlikely to want theabove "pchar=^widechar" from some unit (maybe not even known to him).

As a side note, initially I thought that once unit-search-order isknown, the issue would be solved for good. But it wont. For"TForm1(Sender)": "Sender: TControl" can be in units that do not use"unit1". Yet the user would expect the debugger to find it.

And (on windows) a "uses unit1; var TempForm: TForm1" copies thedefinition of "TForm1" into that unit. In that case the debugger willalways think as "TForm1" to belong to that unit. Which will likely becorrect, while paused in that unit, but may not be correct, if paused inanother unit, and just searching for the global definition of "TForm1".

So in the end the debugger will need to deal with the possibility ofambiguity.=> if that includes "types from implementation" is therefore not so bigof an issue. (still might be useful).



-----------
4) "official" marker for string vs pchar vs array

Not sure if that is reported already. Depending on dwarf version"string" (ansistring) is a pointer (either TAG pointer/reference orlocation expression) to

- char  (dwarf 2)
- array of char (dwarf 3)

Currently for dwarf 2, the debugger can't tell the difference. If theuser says: foo[1]The debugger does not know, if the first or second char is meant (0 or 1based index)

With dwarf 3 the difference would be in the display format "('a', 'b','c') vs 'abc'.But currently the debugger (fpdebug) can tell the difference, becausefpc has a tiny difference in how it encodes the "stride".

That is obviously an implementation detail, and not very future proof.

Therefore an "official" marker would be nice.
- it appears there is none in dwarf
- it could be a custom addition to dwarf

- documenting an "implementation detail" (such as the stride), sofpdebug can safely rely on it.


-----------
5) Duplicated (artificial) types under Windows

As mentioned, declaring
   var foo: TStringList

copies the type definition of TStringList to that unit (Windows only),on Linux there is a cross compilation unit reference (well at least, ifthe source unit has debug info, otherwise IIRC it also is a copy).


Maybe those copies should be marked DW_AT_artificial ?
From DWARF

>> A compiler may wish to generate debugging information entries forobjects or types that were not actually declared in the source of theapplication>> Any debugging information entry representing the declaration of anobject or type artificially generated by a compiler and not explicitlydeclared by the source program may have a DW_AT_artificial attribute.Then again contrary to those statements in the list of attributes foreach DW_TAG, many tags that match the description do not have it listed.

Knowing it is only a "copy" means less entries to consider when lookingup a type across units.

Especially, if the debugger may end up, having to determine if two typesin two units are equal or not. Which for structure types can mean a lotof work. And they may even differ, because the copy omits addresses formethods.


And that is the 2nd issue, the copy omits addresses for methods.
If a user does
   MyStringList.count()
fpdebug can not call the function.

Knowing the type def is a copy, would be half of the solution. It wouldstill need to know, where the original resides, to find the address.For this DW_AT_decl_file could help => though only if that file has fulldebug info.

- if that file has no debug info, then there is an issue

- if that file declares the type, but does not contain a variable ofthat type, the type def may have been stripped too.


Not currently sure how best to solve this.

----
6) "var param" for function calls
I have actually to double check some of this....

They simply implemented by either
- DW_TAG_reference (which if only used for this, would be ok)
- deref in the location expression (which would also apply to pointer types)

The problem occurs when fpdebug does function eval in watches.
   procedure Foo(var Bar: integer);
   procedure Other(Bar: PInteger);

if the user does watch:  Foo(myInt);

the debugger needs to decide if to call that function. Or if to give anerror for non-matching params.

The same/similar happens for managed types. Currently the debugger mustrely on hardcoded, fpc-specific info to know if it must call ansi_decrefon a string returned from a function call (which must not be done if itis a pchar / and if it is an array, yet another helper function needs tobe called).Not only does the debugger need to know if a type needs calls do managerefcounts, it also needs to know what those calls are.


_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
https://lists.freepascal.org/cgi-bin/mailman/listinfo/fpc-devel

[fpc-devel] DW_AT_external and other additions to FPC generated dwarf

Reply via email to