Abacn commented on issue #23904:
URL: https://github.com/apache/beam/issues/23904#issuecomment-2027669667
THe memory profile (pipeline option `--profile_memory --profile_cpu`) wasn't
quite helpful. The generated memory profiles look like the following
```
Partition of a set of 22679 objects. Total size = 2069184 bytes.
Index Count % Size % Cumulative % Kind (class / dict of class)
0 9796 43 912749 44 912749 44 str
1 4565 20 324624 16 1237373 60 tuple
2 962 4 269480 13 1506853 73 dict (no owner)
3 194 1 180983 9 1687836 82 bytes
4 5185 23 124440 6 1812276 88 float
5 39 0 91312 4 1903588 92 list
6 1180 5 33192 2 1936780 94 int
7 50 0 18992 1 1955772 95 frozenset
8 92 0 16200 1 1971972 95 types.CodeType
9 110 0 14960 1 1986932 96 function
<125 more rows. Type e.g. '_.more' to view.>
Index Count % Size % Cumulative % Kind (class / dict of class)
10 9 0 9984 0 1996916 97 abc.ABCMeta
11 9 0 8976 0 2005892 97 dict of module
12 72 0 5760 0 2011652 97
org.apache.beam.model.pipeline.v1.metrics_pb2.Monitorin
gInfo
13 9 0 4896 0 2016548 97 dict of abc.ABCMeta
14 4 0 4256 0 2020804 98 type
15 6 0 3476 0 2024280 98 re.Pattern
16 5 0 3200 0 2027480 98 dict of
apache_beam.testing.synthetic_pipeline.SyntheticSource
17 1 0 2552 0 2030032 98 random.Random
18 2 0 2400 0 2032432 98 urllib.parse.Quoter
19 3 0 1920 0 2034352 98 dict of threading.Thread
<115 more rows. Type e.g. '_.more' to view.>
```
or
```
Partition of a set of 16762 objects. Total size = 3649165 bytes.
Index Count % Size % Cumulative % Kind (class / dict of class)
0 9 0 2189046 60 2189046 60 bytes
1 9966 59 901079 25 3090125 85 str
2 2322 14 162448 4 3252573 89 tuple
3 528 3 141944 4 3394517 93 dict (no owner)
4 51 0 97704 3 3492221 96 list
5 2671 16 64104 2 3556325 97 float
6 703 4 19800 1 3576125 98 int
7 19 0 11856 0 3587981 98 collections.deque
8 32 0 6912 0 3594893 99 frozenset
9 8 0 6048 0 3600941 99 re.Pattern
<73 more rows. Type e.g. '_.more' to view.>
Index Count % Size % Cumulative % Kind (class / dict of class)
10 68 0 5440 0 3606381 99
org.apache.beam.model.pipeline.v1.metrics_pb2.Monitorin
gInfo
11 19 0 4408 0 3610789 99 dict of threading.Condition
12 6 0 3840 0 3614629 99 dict of
apache_beam.testing.synthetic_pipeline.SyntheticSource
13 21 0 2856 0 3617485 99 function
14 39 0 2808 0 3620293 99 types.BuiltinMethodType
15 4 0 2560 0 3622853 99 dict of threading.Thread
16 2 0 2128 0 3624981 99 type
17 5 0 2000 0 3626981 99 dict of
urllib3.connection.HTTPSConnection
18 3 0 1368 0 3628349 99 types.FrameType
19 34 0 1360 0 3629709 99 types.CellType
<63 more rows. Type e.g. '_.more' to view.>
```
or
```
Partition of a set of 3959 objects. Total size = 271112 bytes.
Index Count % Size % Cumulative % Kind (class / dict of class)
0 1419 36 100336 37 100336 37 tuple
1 344 9 98784 36 199120 73 dict (no owner)
2 1696 43 40704 15 239824 88 float
3 130 3 11315 4 251139 93 str
4 289 7 8104 3 259243 96 int
5 4 0 3024 1 262267 97 re.Pattern
6 3 0 1368 1 263635 97 types.FrameType
7 17 0 1360 1 264995 98
org.apache.beam.model.pipeline.v1.metrics_pb2.Monitorin
gInfo
8 2 0 781 0 265776 98 bytes
9 5 0 680 0 266456 98 function
<24 more rows. Type e.g. '_.more' to view.>
Index Count % Size % Cumulative % Kind (class / dict of class)
10 1 0 640 0 267096 99 dict of threading.Thread
11 1 0 624 0 267720 99 collections.deque
12 7 0 608 0 268328 99 list
13 9 0 360 0 268688 99 types.CellType
14 8 0 320 0 269008 99 _thread.lock
15 2 0 256 0 269264 99
grpc._cython.cygrpc.SendMessageOperation
16 1 0 232 0 269496 99 dict of threading.Condition
17 2 0 160 0 269656 99
org.apache.beam.model.fn_execution.v1.beam_fn_api_pb2.D
ata
18 2 0 160 0 269816 100 urllib.parse.SplitResult
19 2 0 144 0 269960 100 builtins.weakref
<14 more rows. Type e.g. '_.more' to view.>
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]