41

I want to run a function in parallel using joblib, and wait until all of the parallel jobs are done, as in this example:

from math import sqrt
from joblib import Parallel, delayed
Parallel(n_jobs=2)(delayed(sqrt)(i ** 2) for i in range(10))

But I want the execution to be shown in a single progress bar, like with tqdm, showing how many jobs have been completed.

How would you do that?

Dror Hilman

6 Answers

42

Just put range(10) inside tqdm(...)! It may seem too good to be true, but it really works (on my machine):

from math import sqrt
from joblib import Parallel, delayed  
from tqdm import tqdm  
result = Parallel(n_jobs=2)(delayed(sqrt)(i ** 2) for i in tqdm(range(100000)))
tyrex

  • This only shows progress when a task is dispatched, not when it is finished: `Parallel(n_jobs=10)(delayed(time.sleep)(i ** 2) for i in tqdm(range(10)))` – Davidmh Feb 25 '19 at 12:27
  • It works, but not with a list of strings, for example... Also tried wrapping the list in `iter`... – curious95 Mar 20 '19 at 13:40
  • @curious95 Try putting the list into a generator; the following seems to work for me:

        from math import sqrt
        from joblib import Parallel, delayed
        from tqdm import tqdm

        rng = ['a', 'b', 'c', 'd']
        for j in range(20):  # grow the list so the bar runs long enough to watch
            rng += rng

        def get_rng():
            for item in rng:
                yield item

        result = Parallel(n_jobs=2)(delayed(sqrt)(len(i) ** 2) for i in tqdm(get_rng()))

    – tyrex Mar 21 '19 at 17:58
  • In another question, there is a very elegant [solution](https://stackoverflow.com/a/58936697/5299750) to this problem. – Christian Steinmeyer Aug 20 '20 at 08:09
  • This won't work: `tqdm` will go to 100% immediately. – anilbey Mar 25 '21 at 09:50
14

I've created pqdm, a parallel tqdm wrapper built on concurrent.futures, to get this done comfortably. Give it a try!

To install

pip install pqdm

and use

from pqdm.processes import pqdm
# If you want threads instead:
# from pqdm.threads import pqdm

args = [1, 2, 3, 4, 5]
# args = range(1,6) would also work

def square(a):
    return a*a

result = pqdm(args, square, n_jobs=2)
niedakh
9

As noted above, solutions that simply wrap the iterable passed to joblib.Parallel() do not truly monitor the progress of execution. Instead, I suggest subclassing Parallel and overriding the print_progress() method, as follows:

import joblib
from tqdm.auto import tqdm

class ProgressParallel(joblib.Parallel):
    def __call__(self, *args, **kwargs):
        # Keep a progress bar open for the duration of the parallel run.
        with tqdm() as self._pbar:
            return joblib.Parallel.__call__(self, *args, **kwargs)

    def print_progress(self):
        # joblib calls this as tasks complete; update the bar here
        # instead of printing joblib's default progress messages.
        self._pbar.total = self.n_dispatched_tasks
        self._pbar.n = self.n_completed_tasks
        self._pbar.refresh()
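
For example, a minimal usage sketch (the function and inputs here are just illustrative):

from math import sqrt
from joblib import delayed

result = ProgressParallel(n_jobs=2)(delayed(sqrt)(i ** 2) for i in range(10000))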
nth
9

This modifies nth's great answer to add a flag that turns tqdm on or off dynamically, and to let you specify the total ahead of time so that the status bar fills in correctly.

from tqdm.auto import tqdm
from joblib import Parallel

class ProgressParallel(Parallel):
    def __init__(self, use_tqdm=True, total=None, *args, **kwargs):
        self._use_tqdm = use_tqdm
        self._total = total
        super().__init__(*args, **kwargs)

    def __call__(self, *args, **kwargs):
        with tqdm(disable=not self._use_tqdm, total=self._total) as self._pbar:
            return Parallel.__call__(self, *args, **kwargs)

    def print_progress(self):
        if self._total is None:
            # No total given up front: fall back to tracking dispatched tasks.
            self._pbar.total = self.n_dispatched_tasks
        self._pbar.n = self.n_completed_tasks
        self._pbar.refresh()
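
Usage is the same as before; the total can now be passed up front (the values here are illustrative):

from math import sqrt
from joblib import delayed

result = ProgressParallel(n_jobs=2, total=10000)(
    delayed(sqrt)(i ** 2) for i in range(10000)
)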
user394430
7

Here's a possible workaround:

import time
import random

from joblib import Parallel, delayed
from tqdm import tqdm

def func(x):
    time.sleep(random.randint(1, 10))
    return x

def text_progressbar(seq, total=None):
    step = 1
    tick = time.time()
    while True:
        time_diff = time.time() - tick
        avg_speed = time_diff / step
        total_str = 'of %d' % total if total else ''
        print('step', step, '%.2f' % time_diff,
              'avg: %.2f sec/iter' % avg_speed, total_str)
        step += 1
        try:
            yield next(seq)
        except StopIteration:
            return

all_bar_funcs = {
    'tqdm': lambda args: lambda x: tqdm(x, **args),
    'txt': lambda args: lambda x: text_progressbar(x, **args),
    'False': lambda args: iter,
    'None': lambda args: iter,
}

def ParallelExecutor(use_bar='tqdm', **joblib_args):
    def aprun(bar=use_bar, **tq_args):
        def tmp(op_iter):
            if str(bar) in all_bar_funcs.keys():
                bar_func = all_bar_funcs[str(bar)](tq_args)
            else:
                raise ValueError("Value %s not supported as bar type" % bar)
            return Parallel(**joblib_args)(bar_func(op_iter))
        return tmp
    return aprun

aprun = ParallelExecutor(n_jobs=5)

a1 = aprun(total=25)(delayed(func)(i ** 2 + j) for i in range(5) for j in range(5))
a2 = aprun(total=16)(delayed(func)(i ** 2 + j) for i in range(4) for j in range(4))
a2 = aprun(bar='txt')(delayed(func)(i ** 2 + j) for i in range(4) for j in range(4))
a2 = aprun(bar=None)(delayed(func)(i ** 2 + j) for i in range(4) for j in range(4))
Ben Usman
  • It is a workaround, but the progress bar updates only when a task is dispatched. The better time to update the progress bar is when a task is completed. – Wedoso Apr 12 '18 at 18:42
4

If your problem consists of many parts, you could split the parts into k subgroups, run each subgroup in parallel, and update the progress bar in between, resulting in k updates of the progress bar.

This is demonstrated in the following example from the documentation.

>>> with Parallel(n_jobs=2) as parallel:
...    accumulator = 0.
...    n_iter = 0
...    while accumulator < 1000:
...        results = parallel(delayed(sqrt)(accumulator + i ** 2)
...                           for i in range(5))
...        accumulator += sum(results)  # synchronization barrier
...        n_iter += 1

https://pythonhosted.org/joblib/parallel.html#reusing-a-pool-of-workers
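
For example, a minimal sketch of that idea with tqdm (the batch size and inputs are illustrative):

from math import sqrt
from joblib import Parallel, delayed
from tqdm import tqdm

items = list(range(10000))
batch_size = 1000  # the bar updates once per completed batch

with Parallel(n_jobs=2) as parallel, tqdm(total=len(items)) as pbar:
    results = []
    for start in range(0, len(items), batch_size):
        batch = items[start:start + batch_size]
        results.extend(parallel(delayed(sqrt)(i ** 2) for i in batch))
        pbar.update(len(batch))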

PureW