-
Notifications
You must be signed in to change notification settings - Fork 381
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP: Independent child process monitoring #118 #134
Changes from 6 commits
0d956a1
a846fa6
e32679b
bdf9995
aab15ed
f1cee23
099c5c8
9c62f0a
d333fda
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -5,3 +5,6 @@ MANIFEST | |
*.egg-info | ||
*.pyc | ||
*~ | ||
|
||
# Ignore mprof generated files | ||
mprofile_*.dat |
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -107,10 +107,10 @@ decorator function. Use as follows:: | |
del b | ||
return a | ||
|
||
If a python script with decorator ``@profile`` is called using ``-m | ||
If a python script with decorator ``@profile`` is called using ``-m | ||
memory_profiler`` in the command line, the ``precision`` parameter is ignored. | ||
|
||
Time-based memory usage | ||
Time-based memory usage | ||
========================== | ||
Sometimes it is useful to have full memory usage reports as a function of | ||
time (not line-by-line) of external processes (be it Python scripts or not). | ||
|
@@ -131,14 +131,14 @@ e.g. `mprof run -h`. | |
In the case of a Python script, using the previous command does not | ||
give you any information on which function is executed at a given | ||
time. Depending on the case, it can be difficult to identify the part | ||
of the code that is causing the highest memory usage. | ||
of the code that is causing the highest memory usage. | ||
|
||
Adding the `profile` decorator to a function and running the Python | ||
script with | ||
script with | ||
|
||
mprof run <script> | ||
|
||
will record timestamps when entering/leaving the profiled function. Runnning | ||
will record timestamps when entering/leaving the profiled function. Running | ||
|
||
mprof plot | ||
|
||
|
@@ -152,16 +152,49 @@ A discussion of these capabilities can be found `here <http://fa.bianp.net/blog/ | |
|
||
.. warning:: If your Python file imports the memory profiler `from memory_profiler import profile` these timestamps will not be recorded. Comment out the import, leave your functions decorated, and re-run. | ||
|
||
The available commands for `mprof` are: | ||
The available commands for `mprof` are: | ||
|
||
- ``mprof run``: running an executable, recording memory usage | ||
- ``mprof run``: running an executable, recording memory usage | ||
- ``mprof plot``: plotting one the recorded memory usage (by default, | ||
the last one) | ||
- ``mprof list``: listing all recorded memory usage files in a | ||
user-friendly way. | ||
- ``mprof clean``: removing all recorded memory usage files. | ||
- ``mprof rm``: removing specific recorded memory usage files | ||
|
||
Tracking forked child processes | ||
=============================== | ||
In a multiprocessing context the main process will spawn child processes whose | ||
system resources are allocated separately from the parent process. This can | ||
lead to an inaccurate report of memory usage since by default only the parent | ||
process is being tracked. The ``mprof`` utility provides two mechanisms to | ||
track the usage of child processes: sum the memory of all children to the | ||
parent's usage and track each child individual. | ||
|
||
To create a report that combines memory usage of all the children and the | ||
parent, use the ``include_children`` flag in either the ``profile`` decorator or | ||
ass a command line argument to ``mprof``:: | ||
|
||
mprof run --include-children <script> | ||
|
||
The second method tracks each child independently of the main process, | ||
serializing child rows by index to the output stream. Use the ``multiprocess`` | ||
flag and plot as follows:: | ||
|
||
mprof run --multiprocess <script> | ||
mprof plot | ||
|
||
This will create a plot using matplotlib similar to this: | ||
|
||
.. image:: https://cloud.githubusercontent.com/assets/745966/24075879/2e85b43a-0bfa-11e7-8dfe-654320dbd2ce.png | ||
: target: https://github.com/fabianp/memory_profiler/pull/134 | ||
: height: 350px | ||
|
||
You can combine both the ``include_children`` and ``multiprocess`` flags to show | ||
the total memory of the program as well as each child individually. | ||
|
||
.. warning:: currently the child tracking only works if a ``stream`` is provided to the ``profile`` (e.g. from the command line or in the decorator). | ||
|
||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I am confused, as the example works fine even without decorated functions There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I meant that the stream argument has to be passed in, if Perhaps I could rephrase as follows: "Currently tracking individual children requires a reporting output stream; if you'd like direct access to the memory usage of individual children, see the Though of course, that then points the user to an internal function. I'm open to suggestions. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I've put placeholder comments in the code (line 363 and 403) where this issue occurs. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Ah, thanks for the explanation. I think the API would be simpler if we allow memory_usage to return a nested list instead embedding this into the stream. What do you think? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Ok, I've updated |
||
Setting debugger breakpoints | ||
============================= | ||
It is possible to set breakpoints depending on the amount of memory used. | ||
|
@@ -260,7 +293,7 @@ LogFile of memory profiler module. | |
>>> import sys | ||
>>> sys.stdout = LogFile('memory_profile_log') | ||
|
||
``Customised reporting:`` | ||
``Customized reporting:`` | ||
|
||
Sending everything to the log file while running the memory_profiler | ||
could be cumbersome and one can choose only entries with increments | ||
|
@@ -412,6 +445,8 @@ cleanup. | |
|
||
`Dmitriy Novozhilov <https://github.com/demiurg906>`_ and `Sergei Lebedev <https://github.com/superbobry>`_ added support for `tracemalloc <https://docs.python.org/3/library/tracemalloc.html>`_. | ||
|
||
`Benjamin Bengfort <https://github.com/bbengfort>`_ added support for tracking the usage of individual child processes and plotting them. | ||
|
||
========= | ||
License | ||
========= | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,55 @@ | ||
""" | ||
An undecorated example of a script that allocates memory in multiprocessing | ||
workers to demonstrate the use of memory_profiler with multiple processes. | ||
|
||
Run this script with mprof run -M python multiprocessing_example.py | ||
You can then visualize the usage with mprof plot. | ||
""" | ||
|
||
import time | ||
import multiprocessing as mp | ||
|
||
# Big numbers | ||
X6 = 10 ** 6 | ||
X7 = 10 ** 7 | ||
|
||
|
||
def worker(num, wait, amt=X6): | ||
""" | ||
A function that allocates memory over time. | ||
""" | ||
frame = [] | ||
|
||
for idx in range(num): | ||
frame.extend([1] * amt) | ||
time.sleep(wait) | ||
|
||
del frame | ||
|
||
|
||
def main_sequential(): | ||
""" | ||
A sequential version of the work, where one worker is called at a time. | ||
""" | ||
worker(5, 5, X6) | ||
worker(5, 2, X7) | ||
worker(5, 5, X6) | ||
worker(5, 2, X7) | ||
|
||
|
||
def main_multiproc(): | ||
""" | ||
A multiprocessing version of the work, where workers work in their own | ||
child processes and are collected by the master process. | ||
""" | ||
pool = mp.Pool(processes=4) | ||
tasks = [ | ||
pool.apply_async(worker, args) for args in | ||
[(5, 5, X6), (5, 2, X7), (5, 5, X6), (5, 2, X7)] | ||
] | ||
|
||
results = [p.get() for p in tasks] | ||
|
||
|
||
if __name__ == '__main__': | ||
main_multiproc() |
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -86,6 +86,38 @@ def _repr_pretty_(self, p, cycle): | |
p.text(u'<MemitResult : ' + msg + u'>') | ||
|
||
|
||
def _get_child_memory(process, meminfo_attr=None): | ||
""" | ||
Returns a generator that yields memory for all child processes. | ||
""" | ||
if not has_psutil: | ||
raise NotImplementedError(( | ||
"The psutil module is required to monitor the " | ||
"memory usage of child processes." | ||
)) | ||
|
||
# Convert a pid to a process | ||
if isinstance(process, int): | ||
if process == -1: process = os.getpid() | ||
process = psutil.Process(process) | ||
|
||
if not meminfo_attr: | ||
# Use the psutil 2.0 attr if the older version isn't passed in. | ||
meminfo_attr = 'memory_info' if hasattr(process, 'memory_info') else 'get_memory_info' | ||
|
||
# Select the psutil function get the children similar to how we selected | ||
# the memory_info attr (a change from excepting the AttributeError). | ||
children_attr = 'children' if hasattr(process, 'children') else 'get_children' | ||
|
||
# Loop over the child processes and yield their memory | ||
try: | ||
for child in getattr(process, children_attr)(recursive=True): | ||
yield getattr(child, meminfo_attr)()[0] / _TWO_20 | ||
except psutil.NoSuchProcess: | ||
# https://github.com/fabianp/memory_profiler/issues/71 | ||
yield 0.0 | ||
|
||
|
||
def _get_memory(pid, backend, timestamps=False, include_children=False, filename=None): | ||
# .. low function to get memory consumption .. | ||
if pid == -1: | ||
|
@@ -111,12 +143,7 @@ def ps_util_tool(): | |
else 'get_memory_info' | ||
mem = getattr(process, meminfo_attr)()[0] / _TWO_20 | ||
if include_children: | ||
try: | ||
for p in process.children(recursive=True): | ||
mem += getattr(p, meminfo_attr)()[0] / _TWO_20 | ||
except psutil.NoSuchProcess: | ||
# https://github.com/fabianp/memory_profiler/issues/71 | ||
pass | ||
mem += sum(_get_child_memory(process, meminfo_attr)) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Hi, quick question, as processes in modern linux share common memory after forking from their parent and only copy it on writes, wouldn't this There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Unfortunately I'm not sure of the behavior of psutil with regards to linux forks; I simply used the same meminfo_attr as the summation function. If that's true then this is indeed a problem, but I'd guess that we'd have to dig into psutil to figure it out. |
||
if timestamps: | ||
return mem, time.time() | ||
else: | ||
|
@@ -128,9 +155,11 @@ def ps_util_tool(): | |
def posix_tool(): | ||
# .. scary stuff .. | ||
if include_children: | ||
raise NotImplementedError('The psutil module is required when to' | ||
' monitor memory usage of children' | ||
' processes') | ||
raise NotImplementedError(( | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Sorry about this, I think this was just a carry over from the historical merge; I can reset back to the original if needed. |
||
"The psutil module is required to monitor the " | ||
"memory usage of child processes." | ||
)) | ||
|
||
warnings.warn("psutil module not found. memory_profiler will be slow") | ||
# .. | ||
# .. memory usage in MiB .. | ||
|
@@ -211,8 +240,8 @@ def run(self): | |
|
||
|
||
def memory_usage(proc=-1, interval=.1, timeout=None, timestamps=False, | ||
include_children=False, max_usage=False, retval=False, | ||
stream=None, backend=None): | ||
include_children=False, multiprocess=False, max_usage=False, | ||
retval=False, stream=None, backend=None): | ||
""" | ||
Return the memory usage of a process or piece of code | ||
|
||
|
@@ -243,6 +272,12 @@ def memory_usage(proc=-1, interval=.1, timeout=None, timestamps=False, | |
timestamps : bool, optional | ||
if True, timestamps of memory usage measurement are collected as well. | ||
|
||
include_children : bool, optional | ||
if True, sum the memory of all forked processes as well | ||
|
||
multiprocess : bool, optional | ||
if True, track the memory usage of all forked processes. | ||
|
||
stream : File | ||
if stream is a File opened with write access, then results are written | ||
to this file instead of stored in memory and returned at the end of | ||
|
@@ -314,10 +349,18 @@ def memory_usage(proc=-1, interval=.1, timeout=None, timestamps=False, | |
mem_usage = _get_memory( | ||
proc.pid, backend, timestamps=timestamps, | ||
include_children=include_children) | ||
|
||
if stream is not None: | ||
stream.write("MEM {0:.6f} {1:.4f}\n".format(*mem_usage)) | ||
|
||
# Only write children to the stream file, warn if appending to the return. | ||
if multiprocess: | ||
for idx, chldmem in enumerate(_get_child_memory(proc.pid)): | ||
stream.write("CHLD {0} {1:.6f} {2:.4f}\n".format(idx, chldmem, time.time())) | ||
else: | ||
ret.append(mem_usage) | ||
if multiprocess: | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Here is the first place where I don't return multiple values and instead warn |
||
warnings.warn("use include_children not multiprocess without a stream") | ||
else: | ||
ret = max(ret, | ||
_get_memory( | ||
|
@@ -348,8 +391,16 @@ def memory_usage(proc=-1, interval=.1, timeout=None, timestamps=False, | |
include_children=include_children) | ||
if stream is not None: | ||
stream.write("MEM {0:.6f} {1:.4f}\n".format(*mem_usage)) | ||
|
||
# Only write children to the stream file, warn if appending to the return. | ||
if multiprocess: | ||
for idx, chldmem in enumerate(_get_child_memory(proc.pid)): | ||
stream.write("CHLD {0} {1:.6f} {2:.4f}\n".format(idx, chldmem, time.time())) | ||
else: | ||
ret.append(mem_usage) | ||
|
||
if multiprocess: | ||
warnings.warn("use include_children not multiprocess without a stream") | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Here is the second place where I don't return multiple values and instead warn |
||
else: | ||
ret = max([ret, | ||
_get_memory(proc, backend, include_children=include_children) | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How hard do you think it would be to remove this limitation?, i.e., to track children even if the function is not @Profile'd