
Okay, so "async all the way down" is the mandate. But when is it problematic?

For example, if you have limited access to a resource, as in a DbConnection or a file, when do you stop using async methods in favor of synchronous?

Let's review the complexity of an asynchronous database call (omitting .ConfigureAwait(false) for readability):

// Step 1: Ok, no big deal, our connection is closed, let's open it and wait.
await connection.OpenAsync();
// Connection is open!  Let's do some work.

// Step 2: Acquire a reader.
using(var reader = await command.ExecuteReaderAsync())
{
    // Step 3: Start reading results.
    while(await reader.ReadAsync())
    {
        // get the data.
    }
}
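
For reference, here's the same sequence with .ConfigureAwait(false) restored, as it would typically appear in library code:

```csharp
// Same flow with ConfigureAwait(false), typical for library code
// that doesn't need to resume on the original synchronization context.
await connection.OpenAsync().ConfigureAwait(false);

using (var reader = await command.ExecuteReaderAsync().ConfigureAwait(false))
{
    while (await reader.ReadAsync().ConfigureAwait(false))
    {
        // get the data.
    }
}
```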

Steps:

  1. Should be reasonably innocuous and nothing to worry about.

  2. But now we've acquired an open connection from a potentially limited connection pool. What if, while waiting for step 2, other long-running tasks are at the head of the line in the task scheduler?

  3. Even worse now, we await with an open connection (and most likely added latency).

Aren't we holding open a connection longer than necessary? Isn't this an undesirable result? Wouldn't it be better to use synchronous methods to lessen the overall connection time, ultimately resulting in our data-driven application performing better?

Of course I understand that async doesn't mean faster, but async methods provide the opportunity for more total throughput. Still, as I've observed, there can definitely be weirdness when tasks scheduled in between awaits ultimately delay the operation, essentially behaving like blocking because of the limitations of the underlying resource.

[Note: this question is focused on ADO, but this also applies to file reads and writes.]

Hoping for some deeper insight. Thank you.

  • Definitely related: https://stackoverflow.com/questions/25086866/why-does-the-ef-6-tutorial-use-asynchronous-calls/25087273 – Oren Ferrari Apr 14 '18 at 02:44
  • More discussion here: https://www.reddit.com/r/dotnet/comments/8c4ova/ado_async_all_the_way_down_the_tubes/?st=jfzskl5l&sh=e0899b22 – Oren Ferrari Apr 14 '18 at 20:18

4 Answers


There are a few things to consider here:

  1. Database connection pool limits, specifically the "Max Pool Size", which defaults to 100. The database connection pool has an upper limit on the maximum number of connections. Be sure to set "Max Pool Size=X", where X is the maximum number of database connections you want to have. This applies to both sync and async.

  2. The thread pool settings. The thread pool will not add threads quickly if your load spikes; it will only add a new thread every 500 ms or so. See the MSDN Threading Guidelines from 2004 and The CLR Thread Pool 'Thread Injection' Algorithm. Here is a capture of the number of busy threads on one of my projects [chart: busy thread count over time]: the load spiked and requests were delayed due to a lack of available threads to service them, and the line increases as new threads were added. Remember that every thread requires 1 MB of memory for its stack; 1000 threads ~= 1 GB of RAM just for threads.

  3. The load characteristics of your project, which relate to the thread pool.
  4. The type of system you are providing; I will assume you are talking about an ASP.NET-style app/API.
  5. The throughput (requests/sec) vs. latency (sec/request) requirements. Async will add to latency but increase throughput.
  6. The database/query performance, which relates to the 50 ms recommendation below.
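
To illustrate point 1, the pool cap is set through the connection string. A minimal sketch using System.Data.SqlClient (the server and database names are placeholders):

```csharp
using System.Data.SqlClient;

// Hypothetical connection string; "Max Pool Size" caps the pool (default 100).
var builder = new SqlConnectionStringBuilder
{
    DataSource = "myServer",        // placeholder
    InitialCatalog = "myDatabase",  // placeholder
    IntegratedSecurity = true,
    MaxPoolSize = 200,              // raise the cap from the default of 100
    MinPoolSize = 10                // optionally keep some connections warm
};

using (var connection = new SqlConnection(builder.ConnectionString))
{
    connection.Open(); // the pool enforces the limit for sync and async alike
}
```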

From the article The overhead of async/await in .NET 4.5 (Edit 2018-04-16: the recommendation below applied to WinRT UI-based applications):

Avoid using async/await for very short methods or having await statements in tight loops (run the whole loop asynchronously instead). Microsoft recommends that any method that might take longer than 50ms to return should run asynchronously, so you may wish to use this figure to determine whether it’s worth using the async/await pattern.

Also watch Diagnosing issues in ASP.NET Core Applications - David Fowler & Damian Edwards, which talks about issues with the thread pool and using async, sync, etc.

Hopefully this helps

Phil Bolduc
  • Great answer. Thank you! – Oren Ferrari Apr 15 '18 at 04:14
  • The "50ms guideline" needs context: this guideline was given for UI threads in a client-side UI app. – Stephen Cleary Apr 15 '18 at 21:52
  • Thanks Stephen. I missed the paragraph in the middle that stated "This was a central focus for Microsoft during the development of the WinRT API, and they ensured that any APIs that may take longer than 50ms to execute would only be available in an asynchronous form." and only quoted the recommendation (out of context) from the conclusion. – Phil Bolduc Apr 16 '18 at 22:49

Due to the way database connection pooling works at lower levels of the protocol, the high-level open/close commands don't have much effect on performance. Generally, the internal thread scheduling of IO is not a bottleneck unless you have some really long-running tasks: something CPU-intensive or, worse, blocking inside. Those will quickly exhaust your thread pool and things will start queuing up.

I would also suggest you investigate http://steeltoe.io, particularly its Hystrix circuit-breaker implementation. It allows you to group your code into commands and have command execution managed by command groups, which are essentially dedicated, segregated thread pools. The advantage is that a noisy, long-running command can only exhaust its own command group's thread pool without affecting the rest of the app. There are many other advantages to this portion of the library, the primary one being the circuit-breaker implementation, and a personal favorite of mine: collapsers. Imagine multiple incoming calls for a query GetObjectById being grouped into a single select * where id in (1,2,3) query, with the results then mapped back onto the separate inbound requests. The db call is just an example; it can be anything really.

Andrew Stakhov
  • I'm thinking that this is confirming my suspicion that when it comes to Db access, you might as well stick with synchronous code if you want to be sure your query gets in and gets out as fast as possible. It seems counterproductive if you can support (for example) 100 queries concurrently, but each one takes a bit longer due to latency. Longer single queries can have an adverse effect on your application, especially when using a caching solution. – Oren Ferrari Apr 14 '18 at 05:37
  • Not really. There is an underlying db connection pool that works independently of your conn.Open/Close. If you're not using async, you are blocking actual threads instead of relying on callbacks to resume code execution when query results are available. This is far more costly to your app than an async call. – Andrew Stakhov Apr 14 '18 at 19:52
  • Right, but I'm not concerned about open/close. I'm more concerned about delays caused by the task scheduler when iterating using .ReadAsync(). Are you saying that if 1000 threads call .Open() and subsequently execute commands that there will be nothing told to wait while connections are freed for use? Is a better way to think about it the breadth a connection pool: it's simply the bandwidth by which you can make queries? – Oren Ferrari Apr 14 '18 at 20:21
  • The only time async would be a problem is if you have a thread-starvation scenario. In that case you are either queuing up many long-running CPU tasks or blocking operations. In many cases, such as network or db access, they go to the OS level by leveraging system interrupts, which are inherently event based. You are essentially saying: when I get a signal on this interrupt, run my task. You don't tie a computational thread to waiting on this; that's a huge waste of an expensive resource. This is why modern code is moving away from the sync model for such operations. – Andrew Stakhov Apr 15 '18 at 13:43

if you have limited access to a resource, as in a DbConnection or a file, when do you stop using async methods in favor of synchronous?

You shouldn't need to switch to synchronous at all. Generally speaking, async only works if it's used all the way. Async-over-sync is an antipattern.

Consider the asynchronous code:

using (connection)
{
  await connection.OpenAsync();
  using(var reader = await command.ExecuteReaderAsync())
  {
    while(await reader.ReadAsync())
    {
    }
  }
}

In this code, the connection is held open while the command is executed and the data is read. Anytime that the code is waiting on the database to respond, the calling thread is freed up to do other work.

Now consider the synchronous equivalent:

using (connection)
{
  connection.Open();
  using(var reader = command.ExecuteReader())
  {
    while(reader.Read())
    {
    }
  }
}

In this code, the connection is held open while the command is executed and the data is read. Anytime that the code is waiting on the database to respond, the calling thread is blocked.

With both of these code blocks, the connection is held open while the command is executed and the data is read. The only difference is that with the async code, the calling thread is freed up to do other work.

What if when waiting for step 2, other long running tasks are at the head of the line in the task scheduler?

The time to deal with thread pool exhaustion is when you run into it. In the vast majority of scenarios, it isn't a problem and the default heuristics work fine.

This is particularly true if you use async everywhere and don't mix in blocking code.

For example, this code would be more problematic:

using (connection)
{
  await connection.OpenAsync();
  using(var reader = command.ExecuteReader())
  {
    while(reader.Read())
    {
    }
  }
}

Now you have asynchronous code that, when it resumes, blocks a thread pool thread on I/O. Do that a lot, and you can end up in a thread pool exhaustion scenario.

Even worse now, we await with an open connection (and most likely added latency).

The added latency is minuscule. Like sub-millisecond (assuming no thread pool exhaustion). It's immeasurably small compared to random network fluctuations.

Aren't we holding open a connection longer than necessary? Isn't this an undesirable result? Wouldn't it be better to use synchronous methods to lessen the overall connection time, ultimately resulting in our data driven application performing better?

As noted above, synchronous code would hold the connection open just as long. (Well, OK, a sub-millisecond amount less, but that Doesn't Matter).

But as I've observed, there can definitely be weirdness when there are tasks scheduled in-between awaits that ultimately delay the operation, and essentially behave like blocking because of the limitations of the underlying resource.

It would be worrying if you observed this on the thread pool. That would mean you're already at thread pool exhaustion, and you should carefully review your code and remove blocking calls.

It's less worrying if you observed this on a single-thread scheduler (e.g., UI thread or ASP.NET Classic request context). In that case, you're not at thread pool exhaustion (though you still need to carefully review your code and remove blocking calls).


As a concluding note, it sounds as though you're trying to add async the hard way. It's harder to start at a higher level and work your way to a lower level. It's much easier to start at the lower level and work your way up. E.g., start with any I/O-bound APIs like DbConnection.Open / ExecuteReader / Read, and make those asynchronous first, and then let async grow up through your codebase.
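
As a sketch of that bottom-up approach (names hypothetical), convert the lowest-level data-access method first and let its callers follow later:

```csharp
// Before: synchronous lowest-level data access.
// public List<string> GetNames(SqlCommand command) { ... }

// After: the lowest level goes async first; callers are then
// converted one by one until async reaches the top of the stack.
public async Task<List<string>> GetNamesAsync(SqlCommand command)
{
    var results = new List<string>();
    using (var reader = await command.ExecuteReaderAsync().ConfigureAwait(false))
    {
        while (await reader.ReadAsync().ConfigureAwait(false))
        {
            results.Add(reader.GetString(0));
        }
    }
    return results;
}
```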

Stephen Cleary
  • Actually, I would highly disagree with the comment about switching to async at lower levels when higher levels are sync. You would need to call .Result or .Wait at the transition point, which is a great way to get a deadlock that is incredibly difficult to debug. https://olitee.com/2015/01/c-async-await-common-deadlock-scenario/ – Andrew Stakhov Apr 15 '18 at 13:50
  • I'm thinking that the optimal pattern is to be async all the way until you call for a read. Since you have an active connection, command, and reader, .Read() should be extremely fast with little or no blocking. No need to potentially add tasks to the scheduler. Or do I have a conceptual error here? – Oren Ferrari Apr 15 '18 at 21:09
  • So without any visible thread saturation (plenty of CPU overhead), my AsyncFileWriter example (https://github.com/electricessence/AsyncFileWriter/blob/master/AsyncFileWriter/AsyncFileWriter.cs#L65-L76) takes magnitudes (+20x) longer to complete when using .WriteAsync() (even if I properly flag the FileStream as async), where the .Write() version only takes a few seconds. This is a huge part of what is driving this question. The guidance from the other answer seems to be correct: quick iterations should remain synchronous. – Oren Ferrari Apr 15 '18 at 21:20
  • @macsux: I'm not saying to do sync-over-async. I'm saying that when you are transitioning to async, it's easier to start lower and work higher than it is to start higher and work lower. (Either way, you should end up async all the way). – Stephen Cleary Apr 15 '18 at 21:47
  • @OrenFerrari: Different drivers are asynchronous at different points. I recommend going async all the way. If you're seeing 20x speed difference using async, I recommend 1) duplicating it with minimal excess code so that it is *obviously* the fault of the async API, and then 2) reporting it to Microsoft. – Stephen Cleary Apr 15 '18 at 21:49
  • @StephenCleary, yeah I'm seeing some very strange results that point to the subtlety of when to use async for everything all the way. It makes perfect sense to me up to the point where iteration occurs; then performance degrades significantly. I think part of the problem is it's really hard to test appropriately in a way that is analogous to its synchronous counterparts. But we've seen this before, I think even on here: if you try to iterate results as fast as possible asynchronously, the latency will really screw things up. I'm going to do No. 1 right now as a benchmark. – Oren Ferrari Apr 16 '18 at 05:19
  • @StephenCleary I would love to be wrong about this, but here's the measurement: (1,000,000 byte-arrays written per test). https://github.com/electricessence/AsyncFileWriter/blob/master/AsyncFileWriterTester/SynchronousTester.cs#L89-L96 File stream standard benchmark: Total Elapsed Time: 0.1675184 seconds. https://github.com/electricessence/AsyncFileWriter/blob/master/AsyncFileWriterTester/SynchronousTester.cs#L121-L131 File stream async benchmark: Total Elapsed Time: 3.1367992 seconds. Guess No.2 is in order. – Oren Ferrari Apr 16 '18 at 05:46
  • @OrenFerrari: Thanks for the issue link! It does look like [it's a quirk with the `FileStream` API specifically, and not due to `async` itself](https://github.com/dotnet/corefx/issues/29129#issuecomment-381651962). So I'd assume the performance difference would not appear in ADO.NET code (assuming an actual db server, and not using something like SQLite). – Stephen Cleary Apr 16 '18 at 16:12
  • @StephenCleary I genuinely want to mark this post as the answer, and of course trust your wisdom in this area. But I'm honestly not convinced. I think if all data access was routed through a single TaskScheduler then I think this would be less of a concern. But when tasks are being inserted between ```.ReadAsync()``` calls by unrelated processes: that just screams trouble for me. I wish there was a demo/test that could prove this. It's much easier test using a FileStream, and I'm seeing CPU usage weirdness in that arena (defect aside) as well. – Oren Ferrari Apr 17 '18 at 23:05
  • Somewhat related: https://stackoverflow.com/questions/42415969/horrible-performance-when-using-sqlcommand-async-methods – Oren Ferrari Apr 17 '18 at 23:06
  • @StephenCleary Here's your previous answer to the FileStream issue: https://stackoverflow.com/questions/18331349/c-sharp-4-5-file-read-performance-sync-vs-async – Oren Ferrari Apr 17 '18 at 23:09

Significant amounts of iteration introduce significant added latency and extra CPU usage

See http://telegra.ph/SqlDataReader-ReadAsync-vs-Read-04-18 for details.

As suspected:

Using async does not come without cost and requires consideration. Certain types of operations lend themselves well to async, and others are problematic (for what should be obvious reasons).

High-volume synchronous/blocking code has its downsides, but for the most part it is well managed by modern threading:

Testing / Profiling

4 x 100 parallel queries, 1000 records per query.

Performance Profile for Synchronous Query [CPU profile chart]
Average Query: 00:00:00.6731697, Total Time: 00:00:25.1435656

Performance Profile for Async Setup with Synchronous Read [CPU profile chart]
Average Query: 00:00:01.4122918, Total Time: 00:00:30.2188467

Performance Profile for Fully Async Query [CPU profile chart]
Average Query: 00:00:02.6879162, Total Time: 00:00:32.6702872

Assessment

The above results were run on SQL Server 2008 R2 using a .NET Core 2 console application. I invite anyone who has access to a modern instance of SQL Server to replicate these tests to see if there is a reversal in trend. If you find my testing method flawed, please comment so I correct and retest.
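
For reference, this is roughly the shape of such a harness (a sketch, not the exact test code; the connection string, table name, and query are placeholders):

```csharp
using System;
using System.Collections.Concurrent;
using System.Data.SqlClient;
using System.Diagnostics;
using System.Linq;
using System.Threading.Tasks;

class QueryBenchmark
{
    // Runs 4 batches of 100 parallel queries and reports the
    // average per-query time and the total elapsed time.
    static async Task RunAsync(string connectionString)
    {
        var total = Stopwatch.StartNew();
        var queryTimes = new ConcurrentBag<TimeSpan>();

        for (int batch = 0; batch < 4; batch++)
        {
            var tasks = Enumerable.Range(0, 100).Select(async _ =>
            {
                var sw = Stopwatch.StartNew();
                using (var connection = new SqlConnection(connectionString))
                using (var command = new SqlCommand(
                    "SELECT TOP 1000 * FROM MyTable", connection)) // placeholder query
                {
                    await connection.OpenAsync();
                    using (var reader = await command.ExecuteReaderAsync())
                    {
                        while (await reader.ReadAsync())
                        {
                            // consume the row
                        }
                    }
                }
                queryTimes.Add(sw.Elapsed);
            });

            await Task.WhenAll(tasks);
        }

        var averageTicks = (long)queryTimes.Average(t => t.Ticks);
        Console.WriteLine($"Average Query: {TimeSpan.FromTicks(averageTicks)}");
        Console.WriteLine($"Total Time: {total.Elapsed}");
    }
}
```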

As you can easily see in the results, the more asynchronous operations we introduce, the longer the queries take, and the longer the total time to complete. Even worse, fully asynchronous execution uses more CPU overhead, which is counterproductive to the idea that using async tasks would provide more available thread time. This overhead could be due to how I'm running these tests, but it's important to treat each test in a similar way for comparison. Again, if anyone has a way to prove that async is better, please do.

I'm proposing here that "async all the way" has its limitations and should be seriously scrutinized at certain iterative levels (like file or data access).